Distributed data processing with HadoopMay 26, 2010, 09:02 (0 Talkback[s])
(Other stories by M. Tim Jones)
[ Thanks to An Anonymous Reader for this link. ]
"Although Hadoop is the core of data reduction for some of the largest search engines, it's better described as a framework for the distributed processing of data. And not just data, but massive amounts of data, as would be required for search engines and the crawled data they collect. As a distributed framework, Hadoop enables many applications that benefit from parallelization of data processing.
0 Talkback[s] (click to add your comment)