Installing Hadoop on Windows

Below is the steps you can follow to install Hadoop on windows:

Step 1.I downloaded the following file: http://www.poolsaboveground.com/apache//hadoop/core/hadoop-0.23.0/hadoop-0.23.0.tar.gz/

Step 2. Copy into C:/Cygwin/home folder.

Step 3. Extract: tar -xvf hadoop-0.23.0.tar.gz

Step 4. Open up /hadoop/conf/yarn-site.xml. Copy the following between </configuration></configuration>

<!-- Site specific YARN configuration properties -->
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9100</value>
</property>
<property>
<name>mapred.job.tracker</name>
<value>localhost:9101</value>
</property>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>

Step 5. Create log folder: hadoop> mkdir logs
Step 6. Format NameNode :

hadoop>bin/hadoop namenode -format 

Output:

Formatting using clusterid: CID-e8adf4f5-d339-40aa-8845-3dea10a28701
12/01/30 19:20:03 INFO util.HostsFileReader: Refreshing hosts (include/exclude) list
12/01/30 19:20:03 INFO blockmanagement.DatanodeManager: dfs.block.invalidate.limit=1000
12/01/30 19:20:03 INFO util.GSet: VM type       = 64-bit
12/01/30 19:20:03 INFO util.GSet: 2% max memory = 17.77875 MB
12/01/30 19:20:03 INFO util.GSet: capacity      = 2^21 = 2097152 entries
12/01/30 19:20:03 INFO util.GSet: recommended=2097152, actual=2097152
12/01/30 19:20:03 INFO blockmanagement.BlockManager: dfs.block.access.token.enable=false
12/01/30 19:20:03 INFO blockmanagement.BlockManager: defaultReplication = 3
12/01/30 19:20:03 INFO blockmanagement.BlockManager: maxReplication     = 512
12/01/30 19:20:03 INFO blockmanagement.BlockManager: minReplication     = 1
12/01/30 19:20:03 INFO blockmanagement.BlockManager: maxReplicationStreams      = 2
12/01/30 19:20:03 INFO blockmanagement.BlockManager: shouldCheckForEnoughRacks  = false
12/01/30 19:20:03 INFO blockmanagement.BlockManager: replicationRecheckInterval = 3000
12/01/30 19:20:04 INFO namenode.FSNamesystem: fsOwner=thys_michels
12/01/30 19:20:04 INFO namenode.FSNamesystem: supergroup=supergroup
12/01/30 19:20:04 INFO namenode.FSNamesystem: isPermissionEnabled=true
12/01/30 19:20:05 INFO namenode.NameNode: Caching file names occuring more than 10 times
12/01/30 19:20:06 INFO namenode.NNStorage: Storage directory \test\dfs\name has been successfully formatted.
12/01/30 19:20:06 INFO namenode.FSImage: Saving image file \test\dfs\name\current\fsimage.ckpt_0000000000000000000 using no compression
12/01/30 19:20:06 INFO namenode.FSImage: Image file of size 127 saved in 0 seconds.
12/01/30 19:20:06 INFO namenode.NNStorageRetentionManager: Going to retain 1 images with txid >= 0
12/01/30 19:20:06 INFO namenode.NameNode: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down NameNode at us-lap085/64.9.237.126
************************************************************/

Step 7. Start Cluster

bin/start-dfs.sh
Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: