Welcome! I'm a machine learning scientist and a full-stack data engineer.

OG mslitao

def main(args: Array[String]) {
  // Prepare your environment
  val ssc = new StreamingContext(conf, Seconds(batchDurationInSec))
  // Do your processing
  sys.ShutdownHookThread {
    log.info("Gracefully stopping Spark Streaming Application")
import spark.streaming.StreamingContext._
import spark.streaming.{Seconds, StreamingContext}
import spark.SparkContext._
import spark.storage.StorageLevel
import spark.streaming.examples.twitter.TwitterInputDStream
import com.twitter.algebird.HyperLogLog._
import com.twitter.algebird._
/**
* Example of using HyperLogLog monoid from Twitter's Algebird together with Spark Streaming's