This is a short memo to install Hadoop and sqoop (Hadoop interface with db backend) in Ubuntu Lucid.

First it is necessary to add the following debian repository from Cloudera, the host of Hadoop and sqoop.
This can be added from System -> Update manager -> Settings (bottom-left) -> Other sources (tab) -> add.
deb http://archive.cloudera.com/debian -cdh3 contrib
On Lucid, has to be replaced by lucid, giving:
deb http://archive.cloudera.com/debian -cdh3 contrib

A Java environment is necessary, you should have at least default-jdk 1.6.
Then install the software itself:
sudo apt-get install hadoop
sudo apt-get install sqoop
sudo apt-get install hadoop-hbase

Once trying to launch sqoop on certain tables through PostgreSQL, you may find the following error:
sqoop import --table test --connect jdbc:postgresql://localhost/postgres --verbose
...
ERROR sqoop.Sqoop: Got exception running Sqoop: java.lang.RuntimeException: Could not load db driver class: org.postgresql.Driver

This means that JDBC driver of PostgreSQL is not installed correctly.
You have to download it from here.
Then copy it in /usr/lib/sqoop/.

More details about the installation can be found here.

After a couple of hours fighting, I finally went through a way to compile documentation of postgres under Ubuntu Lucid.

For man pages and html, you need the following packages.
docbook-xsl
docbook
docbook2x
docbook-dsssl
jade

Then, before running configure, the following setup is necessary:
export DOCBOOKSTYLE=/usr/share/sgml/docbook/stylesheet/dsssl/modular

Compilation of html pages need the following command in doc/src/sgml:
make html
Result is then found in doc/src/sgml/html.

Compilation of man pages need the following command in doc/src/sgml:
make man
Result is then found in doc/src/sgml/manX. X being 1, 3 or 7.

Then when compiling the code for pdf documentation, the following package is required.
pdfjadetex
Be aware that this had dependencies with tex, and several packages.

Compilation of pdf is done with the following command:
make postgres-A4.pdf

However, pdf compilation still shows issues due to incorrect parameters in /etc/texmf/texmf.cnf.
hash_extra = 50000
hash_size.mpost = 120000

©2010-2013 Michael Paquier All content is ©Copyright of Otacoo.com 2010-2013. Privacy Policy - Terms of Use