Deploy Cloudera (CDH3) in Standalone mode:
| COMMAND | DESCRIPTION | 
|---|---|
| $ sudo add-apt-repository "deb http://archive.canonical.com/ lucid partner" | If you are using ubuntu 10.04 LTS run this command | 
| sudo apt-get install sun-java6-jdk | Install java | 
| lsb_release –c | Name of the your distribution (let DISTRO)(eg: hardy or jaunty or lucid etc.) | 
| vi /etc/apt/sources.list.d/cloudera.list Then type: deb http://archive.cloudera.com/debian DISTRO-cdh3 contrib deb-src http://archive.cloudera.com/debian DISTRO-cdh3 contrib | A repository enables your package manager to install cloudera replace DISTRO with the name of your distribution | 
| sudo apt-get -y install curl | install curl | 
| curl -s http://archive.cloudera.com/debian/archive.key | sudo apt-key add - | Add a repository key. Add the Cloudera Public GPG Key to your repository | 
| sudo apt-get update | Update APT package index | 
| apt-cache search hadoop | List Hadoop packages on Debian systems | 
| apt-get -y install hadoop-0.20 | Install hadoop | 
| dpkg -L hadoop-0.20 | List the installed files | 
| man hier | See that the Hadoop package has been configured | 
| Congratulations Cloudrea Setup is Completed. Now lets run some examples | |
| hadoop jar /usr/lib/hadoop-0.20/hadoop-*-examples.jar pi 10 100 | Run pi example | 
| cd /tmp mkdir input cp /etc/hadoop/conf/*.xml input hadoop jar /usr/lib/hadoop-0.20/hadoop-*-examples.jar grep input output 'dfs[a-z.]+' cat output/* | Run grep example | 
| cd /tmp mkdir inputwords cp /etc/hadoop/conf/*.xml inputwords hadoop jar /usr/lib/hadoop-0.20/hadoop-*-examples.jar wordcount inputwords outputwords | Run word count example | 
 
No comments:
Post a Comment