Links for downloads for Cloudera vmware: https://ccp.cloudera.com/display/SUPPORT/Downloads 1. CDH3 Packages and Downloads- download virtual machine- download for vmware 2. ungzip the .gz file vmware player: https://www.vmware.com/tryvmware/?p=player&lp=1&form=DLCDF8&pc=MDDC&src=IE-SearchBox 1. one will need to register to download the vmware 2. VMware Player 4.0.1 from the downloads click on manually Download 3. install the VMware once downloaded 4. Open VMware player 5. File->Open virtual Machine -> "give link to the untared folder of cloudera" 6. select file name (note- dont select file name starting with . or / out of the two files displayed) 7. select user cloudera and login entering password as cloudera 8. open terminal, execute command: hadoop dfs -mkdir test to check if hadoop is working ------------------------------------------------ Steps for executing word count: 1. javac -classpath /usr/lib/hadoop/hadoop-core.jar -d . WordCount.java 2. jar -Mcvf wordcount.jar org/ 3. hadoop dfs -copyFromLocal small.txt /user/cloudera/word2/input/small.txt 4. hadoop jar wordcount.jar org.myorg.WordCount /user/cloudera/word2/input/ /user/cloudera/word2/output 5. hadoop dfs -copyToLocal /user/cloudera/word2/output/part* . 6. cat part-00000 ------------------------------------------------