Re: HBase - prioritizing writes over reads?

2013-05-07 Thread kzurek
Thanks for the help. -- View this message in context: http://apache-hbase.679495.n3.nabble.com/HBase-prioritizing-writes-over-reads-tp4042838p4043876.html Sent from the HBase User mailing list archive at Nabble.com.

Re: EC2 Elastic MapReduce HBase install recommendations

2013-05-07 Thread Marcos Luis Ortiz Valmaseda
I think that you when you are talking about RMap, you are referring to MapR´s distribution. I think that MapR´s team released a very good version of its Hadoop distribution focused on HBase called M7. You can see its overview here: http://www.mapr.com/products/mapr-editions/m7-edition But this rel

Re: EC2 Elastic MapReduce HBase install recommendations

2013-05-07 Thread ramkrishna vasudevan
Your EC2 instance is having EBS or instance type as the data store? If it is EBS then the latency is bit high and this is as per Andrew's experience. Regards Ram On Wed, May 8, 2013 at 8:01 AM, Marcos Luis Ortiz Valmaseda < marcosluis2...@gmail.com> wrote: > I think that Andrew talked about thi

Re: EC2 Elastic MapReduce HBase install recommendations

2013-05-07 Thread Marcos Luis Ortiz Valmaseda
I think that Andrew talked about this some years ago and he created some scripts for that. You can find them here: https://github.com/apurtell/hbase-ec2 Then, you can review some links about this topic: http://blog.cloudera.com/blog/2012/10/set-up-a-hadoophbase-cluster-on-ec2-in-about-an-hour/ htt

EC2 Elastic MapReduce HBase install recommendations

2013-05-07 Thread Pal Konyves
Hi, Has anyone got some recommendations about running HBase on EC2? I am testing it, and so far I am very disappointed with it. I did not change anything about the default 'Amazon distribution' installation. It has one MasterNode and two slave nodes, and write performance is around 2500 small rows

Re: [DISCUSS] Namespace Delimiter

2013-05-07 Thread Shahab Yunus
'.' would have been ideal but I agree with it causing serious migration issues. I think we can use # It is not a widely used character in names or naming of objects or variables (I don't have any scientific data, just observation) also in Pig the key.value in case of accessing the map representing

Re: Export / Import and table splits

2013-05-07 Thread Jeremy Carroll
https://github.com/phobos182/hadoop-hbase-tools/blob/master/hbase/copy_table.rb I wrote a quick script to do it with mechanize + ruby. I have a new tool which I'm polishing up that does the same thing in Python but using the HBase REST interface to get the data. On Tue, May 7, 2013 at 3:23 PM, J

Re: [DISCUSS] Namespace Delimiter

2013-05-07 Thread Ted Yu
Interesting discussion. On Tue, May 7, 2013 at 4:49 PM, Ian Varley wrote: > I would also submit that "." is a pretty universal standard (citation > needed) in relational databases for separating namespaces (schemas, etc.) > from tables. We use that now to represent the same idea, and using a > d

Re: availability of 0.94.4 and 0.94.5 in maven repo?

2013-05-07 Thread lars hofhansl
Done for 0.94.7. (Stack looked over my shoulder this time). -- Lars From: lars hofhansl To: "user@hbase.apache.org" Sent: Monday, May 6, 2013 2:04 PM Subject: Re: availability of 0.94.4 and 0.94.5 in maven repo? Will add it tomorrow afternoon (if nothing

Re: Export / Import and table splits

2013-05-07 Thread Ted Yu
I almost forgot: for 0.94.6.1 and newer releases, you can: 1. take a snapshot of the original table 2. export the snapshot to target cluster 3. clone the exported snapshot to a new table. Cheers On Tue, May 7, 2013 at 4:11 PM, Ted Yu wrote: > Currently the Import tool doesn't create the table

Re: Export / Import and table splits

2013-05-07 Thread Ted Yu
Currently the Import tool doesn't create the table on target cluster, if we choose approach #2, Import tool should be enhanced with table creation capability. Cheers On Tue, May 7, 2013 at 4:02 PM, Jean-Marc Spaggiari wrote: > @Mohammad: The end goal is really more regarding the splits more tha

Re: Export / Import and table splits

2013-05-07 Thread Michael Segel
I don't see much value in duplicating the table's structure, but IMHO, the jury is still out. On May 7, 2013, at 6:02 PM, Jean-Marc Spaggiari wrote: > @Mohammad: The end goal is really more regarding the splits more than > the model. So I don't think Lars' options are good for this usecase. >

Re: Export / Import and table splits

2013-05-07 Thread Jean-Marc Spaggiari
@Mohammad: The end goal is really more regarding the splits more than the model. So I don't think Lars' options are good for this usecase. @Mike: I agree that things were not configured correctly. User should have had split the table before doing the import. I like the idea of looking at the files

Re: Export / Import and table splits

2013-05-07 Thread Michael Segel
Silly question... If you're doing a simple export, then you end up with all of your prior regions as separate files in a directory, right? So in theory, you could find the first row and the last complete row of each file and then do your pre-splits based on the start key and end key that you

Re: Export / Import and table splits

2013-05-07 Thread Mohammad Tariq
Hello JM, You might find these post by Lars useful : http://www.larsgeorge.com/2009/01/changing-hbase-tables-in-code.html http://www.larsgeorge.com/2009/05/hbase-schema-manager.html I'm not sure whether this is exactly what you need, but I feel you could use something like this to achieve your

Re: Export / Import and table splits

2013-05-07 Thread Ted Yu
I am not aware of a tool which can pre-split table using another table's region boundaries as template. Such a tool would be nice to have. Cheers On Tue, May 7, 2013 at 3:23 PM, Jean-Marc Spaggiari wrote: > Hi, > > When we are doing an export, we are only exporting the data. Then when > we are

Export / Import and table splits

2013-05-07 Thread Jean-Marc Spaggiari
Hi, When we are doing an export, we are only exporting the data. Then when we are importing that back, we need to make sure the table is pre-splitted correctly else we might hotspot some servers. If you simply export then import without pre-splitting at all, you will most probably brought some se

problem building lzo

2013-05-07 Thread kaveh minooie
Hi everyone I am trying to follow this tutorial https://wiki.apache.org/hadoop/UsingLzoCompression and I am getting an error I don't know how to solve. i have CFLAGS='-m64' and run this: CLASSPATH=/hadoop/hadoop-core-1.1.1.jar ant compile-native and I get this: compile-java: [javac] /s

Re: problem building lzo

2013-05-07 Thread Ted Yu
This query is more related to hadoop than HBase. Putting user@hbase in Bcc. Cheers On Tue, May 7, 2013 at 1:29 AM, kaveh minooie wrote: > Hi everyone > > I am trying to follow this tutorial https://wiki.apache.org/** > hadoop/UsingLzoCompression

Re: Failed deleting my ephemeral node

2013-05-07 Thread Ted Yu
Can you tell us a bit more about your zookeeper setup ? Checking zookeeper log around 2013-04-16 14:31:24 would help, too. Cheers On Tue, May 7, 2013 at 6:05 AM, Fabien Chung wrote: > Hi all, > > i have a cluster with 8 machines (CDH4). I use an ETL (Talend) to insert > data into hbase. Mostof

Re: Eternal RIT problem when RS tries to access wrong region-folder on HDFS

2013-05-07 Thread Dimitri Goldin
Hi Again, On 05/03/2013 04:41 PM, Kevin O'dell wrote: This was the correct approach, if the directory /hbase/documents/ 5b9c16898a371de58f31f0bdf86b1f**8b did not exist, then it was a smart move to get rid of the pointers to it. I don't think we have a JIRA for this yet...BUT we really need one

Re: Failed deleting my ephemeral node

2013-05-07 Thread Varun Sharma
Do you have NTP on your cluster - I have seen this manifest due to clock skew.. Varun On Tue, May 7, 2013 at 6:05 AM, Fabien Chung wrote: > Hi all, > > i have a cluster with 8 machines (CDH4). I use an ETL (Talend) to insert > data into hbase. Mostof time that works perfectly, but sometimes ro

Failed deleting my ephemeral node

2013-05-07 Thread Fabien Chung
Hi all, i have a cluster with 8 machines (CDH4). I use an ETL (Talend) to insert data into hbase. Mostof time that works perfectly, but sometimes rows are not inserted, and i don't have any clue about the reason of the failure. I have 0 errors on Talend. That usually happens when i delete the tabl

problem building lzo

2013-05-07 Thread kaveh minooie
Hi everyone I am trying to follow this tutorial https://wiki.apache.org/hadoop/UsingLzoCompression and I am getting an error I don't know how to solve. i have CFLAGS='-m64' and run this: CLASSPATH=/hadoop/hadoop-core-1.1.1.jar ant compile-native and I get this: compile-java: [javac] /s