Wednesday, June 13, 2012

Getting started with Hadoop and MapReduce

Map Reduce is a programming paradigm developed for creating high scale data crunching programs by dividing the workload among several parallel machines. Hadoop MapReduce is the framework on which such programs are written.

Input data is fed as Key-Value pairs and the output is also in the form of Key-Value pairs, which enables Chaining of multiple MapReduce jobs one after the other.

This is what I ll be reading to get started -

http://developer.yahoo.com/hadoop/tutorial/

http://developer.yahoo.com/blogs/hadoop/

No comments:

Powered By Blogger
Custom Search