Skip to content

Instantly share code, notes, and snippets.

View andyyehoo's full-sized avatar

Andy Huang andyyehoo

View GitHub Profile
@andyyehoo
andyyehoo / StreamingHLL.scala
Created May 15, 2016 12:44 — forked from MLnick/StreamingHLL.scala
Spark Streaming meets Algebird's HyperLogLog Monoid
import spark.streaming.StreamingContext._
import spark.streaming.{Seconds, StreamingContext}
import spark.SparkContext._
import spark.storage.StorageLevel
import spark.streaming.examples.twitter.TwitterInputDStream
import com.twitter.algebird.HyperLogLog._
import com.twitter.algebird._
/**
* Example of using HyperLogLog monoid from Twitter's Algebird together with Spark Streaming's