Below are the steps you can follow to install Hadoop on Windows:
Step 1. Download the following file: http://www.poolsaboveground.com/apache//hadoop/core/hadoop-0.23.0/hadoop-0.23.0.tar.gz/
Step 2. Copy it into the C:/Cygwin/home folder.
Step 3. Extract the archive: tar -xvf hadoop-0.23.0.tar.gz
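Steps 2 and 3 can be combined into one small shell function, run from the Cygwin prompt. This is only a sketch: stage_and_extract is a made-up helper name, not something shipped with Hadoop or Cygwin, and it assumes the archive was downloaded somewhere other than the target folder.

```shell
# Sketch of steps 2 and 3: copy the archive into a target folder and unpack it there.
# stage_and_extract is a hypothetical helper, not part of Hadoop or Cygwin.
stage_and_extract() {
    tarball="$1"    # path to the downloaded archive
    dest="$2"       # folder to unpack into
    [ -f "$tarball" ] || { echo "missing archive: $tarball" >&2; return 1; }
    mkdir -p "$dest" || return 1
    cp "$tarball" "$dest/" || return 1
    # -C unpacks inside $dest without changing the current directory
    tar -xvf "$dest/$(basename "$tarball")" -C "$dest"
}
```

For example, assuming Cygwin is installed in C:/Cygwin so that C:/Cygwin/home shows up as /home inside the Cygwin shell: stage_and_extract hadoop-0.23.0.tar.gz /home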
Step 4. Open /hadoop/conf/yarn-site.xml and copy the following between the <configuration> and </configuration> tags:
<!-- Site specific YARN configuration properties -->
<property>
  <name>fs.default.name</name>
  <value>hdfs://localhost:9100</value>
</property>
<property>
  <name>mapred.job.tracker</name>
  <value>localhost:9101</value>
</property>
<property>
  <name>dfs.replication</name>
  <value>1</value>
</property>
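Before moving on, it can help to confirm that the three properties actually ended up in the file. The function below is a rough sketch of such a check. check_site_config is a hypothetical helper name, and it only greps for the property names from step 4; it does not validate the XML.

```shell
# Sketch: confirm the three property names from step 4 are present in a config file.
# check_site_config is a hypothetical helper; it does not check XML well-formedness.
check_site_config() {
    conf_file="$1"
    status=0
    for name in fs.default.name mapred.job.tracker dfs.replication; do
        # -F treats the pattern as a fixed string, so the dots are literal
        if ! grep -qF "<name>$name</name>" "$conf_file"; then
            echo "missing property: $name" >&2
            status=1
        fi
    done
    return "$status"
}
```

Usage, with the path to your edited file as the argument: check_site_config conf/yarn-site.xml && echo "config looks complete"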
Step 5. Create the log folder:
hadoop> mkdir logs
Step 6. Format the NameNode:
hadoop> bin/hadoop namenode -format
Output:
Formatting using clusterid: CID-e8adf4f5-d339-40aa-8845-3dea10a28701
12/01/30 19:20:03 INFO util.HostsFileReader: Refreshing hosts (include/exclude) list
12/01/30 19:20:03 INFO blockmanagement.DatanodeManager: dfs.block.invalidate.limit=1000
12/01/30 19:20:03 INFO util.GSet: VM type = 64-bit
12/01/30 19:20:03 INFO util.GSet: 2% max memory = 17.77875 MB
12/01/30 19:20:03 INFO util.GSet: capacity = 2^21 = 2097152 entries
12/01/30 19:20:03 INFO util.GSet: recommended=2097152, actual=2097152
12/01/30 19:20:03 INFO blockmanagement.BlockManager: dfs.block.access.token.enable=false
12/01/30 19:20:03 INFO blockmanagement.BlockManager: defaultReplication = 3
12/01/30 19:20:03 INFO blockmanagement.BlockManager: maxReplication = 512
12/01/30 19:20:03 INFO blockmanagement.BlockManager: minReplication = 1
12/01/30 19:20:03 INFO blockmanagement.BlockManager: maxReplicationStreams = 2
12/01/30 19:20:03 INFO blockmanagement.BlockManager: shouldCheckForEnoughRacks = false
12/01/30 19:20:03 INFO blockmanagement.BlockManager: replicationRecheckInterval = 3000
12/01/30 19:20:04 INFO namenode.FSNamesystem: fsOwner=thys_michels
12/01/30 19:20:04 INFO namenode.FSNamesystem: supergroup=supergroup
12/01/30 19:20:04 INFO namenode.FSNamesystem: isPermissionEnabled=true
12/01/30 19:20:05 INFO namenode.NameNode: Caching file names occuring more than 10 times
12/01/30 19:20:06 INFO namenode.NNStorage: Storage directory \test\dfs\name has been successfully formatted.
12/01/30 19:20:06 INFO namenode.FSImage: Saving image file \test\dfs\name\current\fsimage.ckpt_0000000000000000000 using no compression
12/01/30 19:20:06 INFO namenode.FSImage: Image file of size 127 saved in 0 seconds.
12/01/30 19:20:06 INFO namenode.NNStorageRetentionManager: Going to retain 1 images with txid >= 0
12/01/30 19:20:06 INFO namenode.NameNode: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down NameNode at us-lap085/64.9.237.126
************************************************************/
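The line to look for in that output is the one saying the storage directory "has been successfully formatted". If you save the output to a file, a one-line check can look for it. This is a sketch: format_succeeded is a hypothetical helper name, and it assumes the message wording matches the output shown above.

```shell
# Sketch: read NameNode format output on stdin and succeed only if the
# success message from the log above is present.
# format_succeeded is a hypothetical helper, not a Hadoop command.
format_succeeded() {
    grep -q "has been successfully formatted"
}
```

One possible way to use it, assuming the logs folder from step 5 exists: run bin/hadoop namenode -format 2>&1 | tee logs/format.out, then format_succeeded < logs/format.out && echo "NameNode formatted". The file name format.out is just an example.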
Step 7. Start the cluster:
bin/start-dfs.sh
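To see whether the daemons came up, you can list the running Java processes with jps (a tool that ships with the JDK) and look for NameNode and DataNode. The function below is a sketch of that check. daemons_running is a hypothetical helper name, and it assumes jps is on your PATH and prints one "pid name" pair per line.

```shell
# Sketch: read `jps` output on stdin and check that the HDFS daemons are listed.
# daemons_running is a hypothetical helper, not part of Hadoop or the JDK.
daemons_running() {
    listing=$(cat)
    for daemon in NameNode DataNode; do
        # -w matches whole words, so SecondaryNameNode does not count as NameNode
        if ! printf '%s\n' "$listing" | grep -qw "$daemon"; then
            echo "not running: $daemon" >&2
            return 1
        fi
    done
    return 0
}
```

Usage: jps | daemons_running && echo "HDFS daemons are up"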