I got Hadoop from some mirrored place like this.
Just get the tar.gz. I messed around way too long with the RPM and God knows what mischief it got up to.
yum install java and seem to have received
This could be a problem since the instructions ask for:
"JavaTM 1.6.x, preferably from Sun, must be installed."
So over at the evil Oracle site they’re promising free blow jobs with:
which unzips to:
Then I think you need to do something like this:
And installs to a mess that doesn’t quite work as far as I can tell.
Note that on modern CentOS 6 systems (and probably others for all I
know) there is a sketchy thing called
alternatives. Find your Java’s
real location with something like:
alternatives --config java
SSH needs to be present and working. If it’s not already, you probably have no business doing serious things with your computing hardware.
Installing and Testing
Unzip and untar the hadoop package. Should unpack ok and you can test
right from that directory. It may not be ideal, but I put it in
So in the Hadoop directory that you just unpacked, do this:
$ export JAVA_HOME=/usr/lib/jvm/jre-1.6.0-openjdk.x86_64/ $ cd ~/sometestdirectory $ mkdir input $ cp /usr/local/src/hadoop-1.0.4/conf/*xml input/ $ /usr/local/src/hadoop-1.0.4/bin/hadoop \ jar /usr/local/src/hadoop-1.0.4/hadoop-examples-1.0.4.jar \ grep input output 'dfs[a-z.]+'
If the Java is happy it should run and process an example. Hopefully
there are no errors or sad exit codes. A directory called
should appear and "results" can be checked with:
$ cat output/*
Look for it somewhere like this repo.