Hello,

You can run the bulk load in two phases. On the MapR distribution we use this:

1. First phase: generate the HFiles for the HBase table. With importtsv.bulk.output set, importtsv writes HFiles to the output folder instead of inserting the rows into the table directly:

    hadoop jar /opt/mapr/hbase/hbase-0.94.5/hbase-0.94.5-mapr.jar importtsv \
        -Dimporttsv.separator=';' \
        -Dimporttsv.bulk.output=folder_bulk_local \
        -Dimporttsv.columns=HBASE_ROW_KEY,cf1:column,cfx:columnxx \
        TableNameHBase folder_bulk_hdfs_input

   (importtsv.columns takes one entry per input column, comma-separated with no spaces.)

2. Second phase: complete the bulk load, which moves the generated HFiles into the table:

    hadoop jar /opt/mapr/hbase/hbase-0.94.5/hbase-0.94.5-mapr.jar completebulkload \
        folder_bulk_local TableNameHBase

Regards ;)

2014-02-14 16:59 GMT+01:00 Kennedy, Sean C. <sean.kenn...@merck.com>:

> I am trying to load 1.4M records in a 7 column CSV file into hbase.
>
> Question 1: Is this feasible?
>
> Question 2: What type of tuning on hbase and or hdfs would be needed?
>
> I am using apache hbase 0.94.15 and apache hadoop 1.2.1
>
> Here is my command string:
>
> /hd/hadoop/bin/hadoop jar /hbase/hbase-0.94.15/hbase-0.94.15.jar importtsv
> '-Dimporttsv.separator=,'
> -Dimporttsv.columns=HBASE_ROW_KEY,BATCH_ID,B_ITEM_NO,B_ITEM_DESCRIPTION,CONS_BATCH_ID,C_ITEM_NO,C_ITEM_DESC,QTY_ISSUED
> MIIBIG /md/test_hdfs_input/large.csv
>
> Any help appreciated..
>
> Sincerely,
>
> Sean
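
The two phases in this reply could be chained in one small script, roughly as sketched below. This is only an illustration: the variable names (HBASE_JAR, TABLE, INPUT_DIR, HFILE_DIR, TSV_COLUMNS, SEPARATOR) and the DRY_RUN switch are hypothetical, and the default values are just the placeholders used in the commands of this message. By default the script only prints the two commands; it runs them only when DRY_RUN=0.

```shell
#!/bin/sh
# Sketch of the two-phase bulk load. Every value here is a placeholder
# (taken from the commands in this thread); adjust them to your cluster.
HBASE_JAR="${HBASE_JAR:-/opt/mapr/hbase/hbase-0.94.5/hbase-0.94.5-mapr.jar}"
TABLE="${TABLE:-TableNameHBase}"
INPUT_DIR="${INPUT_DIR:-folder_bulk_hdfs_input}"
HFILE_DIR="${HFILE_DIR:-folder_bulk_local}"
TSV_COLUMNS="${TSV_COLUMNS:-HBASE_ROW_KEY,cf1:column,cfx:columnxx}"
SEPARATOR="${SEPARATOR:-;}"

# Hypothetical helper: print the command when DRY_RUN=1 (the default),
# execute it otherwise. Printed output is not re-quoted for the shell.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Phase 1: importtsv writes HFiles into $HFILE_DIR.
# Phase 2: completebulkload runs only if phase 1 succeeded.
run hadoop jar "$HBASE_JAR" importtsv \
    "-Dimporttsv.separator=$SEPARATOR" \
    "-Dimporttsv.bulk.output=$HFILE_DIR" \
    "-Dimporttsv.columns=$TSV_COLUMNS" \
    "$TABLE" "$INPUT_DIR" &&
run hadoop jar "$HBASE_JAR" completebulkload "$HFILE_DIR" "$TABLE"
```

Chaining with && means the load step is skipped if the HFile generation fails, so a failed first phase does not leave the table half loaded.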

--
Matouk IFTISSEN | Consultant BI & Big Data
Ysance - 24 rue du sentier - 75002 Paris - www.ysance.com