Linux Today: Linux News On Internet Time.






Apache Spark sorts 100 TB of data in world-record 23 minutes

Jan 15, 2015, 14:00
By Reynold Xin

Databricks entered the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, equivalent to 1 trillion 100-byte records. Running Apache Spark on 207 EC2 virtual machines, the team completed the sort in 23 minutes.

