Deploy Cloudera (CDH3) in Standalone mode:
COMMAND | DESCRIPTION |
---|---|
$ sudo add-apt-repository "deb http://archive.canonical.com/ lucid partner" | If you are using ubuntu 10.04 LTS run this command |
sudo apt-get install sun-java6-jdk | Install java |
lsb_release –c | Name of the your distribution (let DISTRO)(eg: hardy or jaunty or lucid etc.) |
vi /etc/apt/sources.list.d/cloudera.list Then type: deb http://archive.cloudera.com/debian DISTRO-cdh3 contrib deb-src http://archive.cloudera.com/debian DISTRO-cdh3 contrib | A repository enables your package manager to install cloudera replace DISTRO with the name of your distribution |
sudo apt-get -y install curl | install curl |
curl -s http://archive.cloudera.com/debian/archive.key | sudo apt-key add - | Add a repository key. Add the Cloudera Public GPG Key to your repository |
sudo apt-get update | Update APT package index |
apt-cache search hadoop | List Hadoop packages on Debian systems |
apt-get -y install hadoop-0.20 | Install hadoop |
dpkg -L hadoop-0.20 | List the installed files |
man hier | See that the Hadoop package has been configured |
Congratulations Cloudrea Setup is Completed. Now lets run some examples | |
hadoop jar /usr/lib/hadoop-0.20/hadoop-*-examples.jar pi 10 100 | Run pi example |
cd /tmp mkdir input cp /etc/hadoop/conf/*.xml input hadoop jar /usr/lib/hadoop-0.20/hadoop-*-examples.jar grep input output 'dfs[a-z.]+' cat output/* | Run grep example |
cd /tmp mkdir inputwords cp /etc/hadoop/conf/*.xml inputwords hadoop jar /usr/lib/hadoop-0.20/hadoop-*-examples.jar wordcount inputwords outputwords | Run word count example |
No comments:
Post a Comment