Netflix open sources its data traffic cop, Suro
Dec 10, 2013, 13:00 (0 Talkback[s])
Netflix's various applications generate tens of billions of events per day, and Suro collects them all before sending them on their way. Most head to Hadoop (via Amazon S3) for batch processing, while others head to Druid and ElasticSearch (via Apache Kafka) for real-time analysis. According to the Netflix blog post explaining Suro (which goes into much more depth), the company is also looking at how it might use real-time processing engines such as Storm or Samza to perform machine learning on event data.
Complete Story
Related Stories:
- Dear Netflix(Jan 28, 2010)
- The Netflix Linux Conjecture: How Netflix snubs the Linux comunity(Aug 16, 2010)
- Netflix Benchmarks Cassandra on AWS(Dec 01, 2011)
- $100 Netflix DVD Downloader Runs Linux?(May 21, 2008)
- Netflix Movie Downloads Come to Linux PCs(Dec 04, 2008)
- Netflix to Open Source Army of Cloud Monkeys(Apr 16, 2012)
- What Netflix Needs is Linux(Jun 19, 2007)
- Life without Netflix, streaming on Linux can be awesome!(Mar 26, 2012)
- Netflix open sources Eureka mid-tier load balancer(Sep 06, 2012)