Import RDBMS table to HDFS with sqoop from postgreSQL


1. Download JDBC driver

 2. Copy: cp /home/cloudera/Desktop/postgresql-9.3-1102.jdbc4.jar /usr/lib/sqoop/lib/

3. Configure: /var/lib/pgsql/data/pg_hba.conf file. You need to allow the IP/host of machine running hadoop.

Restart postgreSQL using pg_ctl restart

4. Run sqoop: Open the terminal on machine running hadoop and type the below command.

cloudera@cloudera-vm:/usr/lib/sqoop$ bin/sqoop import –connect jdbc:postgresql:                                  // –table employee –username postgres -P –target-dir   /sqoopOut1 -m 1

Enter password:



  • Cloudera hadoop VM distribution or any other machine running hadoop.
  • postgreSQL installation.
  • database Testdb and employee table on a running instance of postgreSQL (e.g.; in point 4).


All set! Your pgsql table data is now available on HDFS of  VM hadoop cluster.


Enjoy hadoop learning!

Citrus Perl Raspberry Pi dev

Anyone interested in GUI Perl dev in Pi? Please go through the link here and download the distribution from sourceforge project site.

I am using Citrus Perl on Pi (Raspbian Wheezy OS) for quite some time major issue.


Enjoy GUI dev on Pi .




Cracking the Primes – a primality-proving algorithm

Know Your Indian Prime Man !!!!!  Man behind AKS Primality Test algorithm

Dr. Manindra Agarwal’s Journey to the Primality Testing Algorithm