PUBLIC MARKS from solrac with tags linux & hadoop

July 2010

Distributed data processing with Hadoop, Part 3: App Build

Learn how to develop applications within the Hadoop infrastructure. This article explores the Hadoop APIs and data flow and demonstrates their use with a simple mapper and reducer application.

Distributed data processing with Hadoop, Part 2: Going further

Learn an advanced setup that uses multiple nodes for parallel processing. This article demonstrates the various node types required for multinode clusters and explores MapReduce functionality in a parallel environment. It also digs into the management aspects of Hadoop, both command line and Web based.

May 2010

Distributed data processing with Hadoop

This article explores the Hadoop framework, including its fundamental elements, such as the Hadoop Distributed File System (HDFS), and the node types that are commonly used. Learn how to install and configure a single-node Hadoop cluster, and delve into the MapReduce application. Finally, discover ways to monitor and manage Hadoop using its core Web interfaces.

October 2009

Cloud computing with Linux and Apache Hadoop

This article shows you how to use Apache Hadoop and its MapReduce framework to build a Hadoop cluster, and how to create a sample MapReduce application that runs on it. You will also learn how to set up a time- and disk-intensive task on the cloud.