Apache Spark sorts 100 TB of data in world-record 23 minutesJan 15, 2015, 14:00 (0 Talkback[s])
(Other stories by Reynold Xin)
Databricks participated in the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-byte records. The team used Apache Spark on 207 EC2 virtual machines and sorted 100 TB of data in 23 minutes.
0 Talkback[s] (click to add your comment)