bashreduce: A Bare-Bones MapReduce | Linux Today

bashreduce: A Bare-Bones MapReduce

Written By
Web Webster
Web Webster
Jul 8, 2009

[ Thanks to An Anonymous Reader for
this link. ]

“If you’ve managed to somehow miss most of the
MapReduce revolution, Wikipedia describes it pretty well: MapReduce
is a framework for computing certain kinds of distributable
problems using a large number of computers (nodes), collectively
referred to as a cluster.

Computational processing can occur on data stored either in a
filesystem (unstructured) or within a database (structured).

“Map” step: The master node takes the input, chops
it up into smaller sub-problems, and distributes those to worker
nodes. A worker node may do this again in turn, leading to a
multi-level tree structure. The worker node processes that smaller
problem, and passes the answer back to its master node.

“Reduce” step: The master node then takes the
answers to all the sub-problems and combines them in a way to get
the output – the answer to the problem it was originally trying to
solve.

“In fact, the MapReduce model has proven so useful that the
Apache Hadoop project (an Open Source implementation of the
infrastructure described in the Google paper) has become very
popular in the last few years. Yahoo, which employs numerous Hadoop
committers, recently hosted their annual Hadoop Summit which
attracted over 500 users and developers.”

Complete
Story

Web Webster

Web Webster

Web Webster has more than 20 years of writing and editorial experience in the tech sector. He’s written and edited news, demand generation, user-focused, and thought leadership content for business software solutions, consumer tech, and Linux Today, he edits and writes for a portfolio of tech industry news and analysis websites including webopedia.com, and DatabaseJournal.com.

Linux Today Logo

LinuxToday is a trusted, contributor-driven news resource supporting all types of Linux users. Our thriving international community engages with us through social media and frequent content contributions aimed at solving problems ranging from personal computing to enterprise-level IT operations. LinuxToday serves as a home for a community that struggles to find comparable information elsewhere on the web.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.