Webopedia Term of the Day: What is Hadoop?
Jun 06, 2011, 16:02
"Formally called Apache Hadoop, it is an Apache Software
Foundation project and is an open source software platform for
scalable, distributed computing. Hadoop can provide fast and
reliable analysis of both structured data and unstructured data.
Given its capabilities to handle large data sets, it is often
associated with the phrase Big Data.
"The Apache Hadoop software library is essentially a framework
that allows for the distributed processing of large datasets across
clusters of computers using a simple programming model. Hadoop can
scale up from single servers to thousands of machines, each
offering local computation and storage."
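
The "simple programming model" in the quote is MapReduce. As a rough illustration only, the sketch below imitates its three steps (map, shuffle, reduce) in plain Python on a single machine. It does not use any Hadoop API, and the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are hypothetical, chosen just for this example.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, standing in for what a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values seen for one key into a single total."""
    return key, sum(values)


def word_count(splits):
    """Run map, shuffle, and reduce over a list of input splits."""
    mapped = []
    # Each split stands in for a block of data held on a different machine.
    # Here they are processed one after another rather than in parallel.
    for split in splits:
        for line in split:
            mapped.extend(map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())


# Hypothetical sample input: two splits, one line each.
splits = [["big data big clusters"], ["data on many machines"]]
counts = word_count(splits)
```

In a real cluster the map calls would run on the machines that store each split, which is the "local computation and storage" the quote mentions; this sketch only mirrors the shape of the computation.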