[ Thanks to James
Maguire for this link. ]
“The open source Hadoop project is all about providing
the ability to manage and understand large datasets. Yahoo which
uses Hadoop to manage 120 terabytes of data per day, this week
released a new version of their edition of Hadoop but they weren’t
the only ones with a new Hadoop release this week.“Commercial Hadoop vendor Cloudera this week announced
Cloudera’s Distribution for Hadoop (CDH) version 3, including some
technologies that were previous closed source. In addition to the
new version of CDH, Cloudera is announcing a new Enterprise version
of their Hadoop distribution, providing additional usability and
management features for enterprise users.“CDH is a version of the Apache Hadoop project that bundles
additional projects and technologies to make Hadoop more usable for
enterprises. CDH includes the Yahoo developed open source Oozie
workflow engine as well as including projects originated by
Cloudera. Among the Cloudera-originated projects is one called HUE
(Hadoop User Experience), which began its life as the closed source
Cloudera Desktop.”