July 2010
Distributed data processing with Hadoop, Part 3: Application development
Learn how to develop applications within the Hadoop infrastructure. This article explores the Hadoop APIs and data flow and demonstrates their use with a simple mapper and reducer application.
Distributed data processing with Hadoop, Part 2: Going further
Learn a more advanced setup that uses multiple nodes for parallel processing. This article demonstrates the various node types required for multinode clusters and explores MapReduce functionality in a parallel environment. It also digs into the management aspects of Hadoop, both command line and Web based.
May 2010
Distributed data processing with Hadoop
This article explores the Hadoop framework, including its fundamental elements, such as the Hadoop file system (HDFS), and node types that are commonly used. Learn how to install and configure a single-node Hadoop cluster, and delve into the MapReduce application. Finally, discover ways to monitor and manage Hadoop using its core Web interfaces.
October 2009
Cloud computing with Linux and Apache Hadoop
This article shows you how to use Apache Hadoop to set up a Hadoop cluster and how to create a sample MapReduce application that runs on it. You will also learn how to set up a time- or disk-consuming task on the cloud.