Re: RegionServer stuck in internalObtainRowLock forever - HBase 0.94.7

2014-04-29 Thread Asaf Mesika
https://issues.apache.org/jira/browse/HBASE-7711 ? This time we checked, and no failed disks on the machine, but same problem with failing writing to local data node and then it was excluded. On Wed, Feb 19, 2014 at 1:02 AM, Stack wrote: > On Mon, Feb 17, 2014 at 1:59 AM, Asaf Mesika

Coprocessor coprocessor execution result saved in buffer as whole - why?

2014-04-22 Thread Asaf Mesika
Hi, I've noticed that in 0.94.7, when you execute a coprocessor, the result object is converted into a byte buffer, using write() method which is on the result object. So, if my result object is 500mb in size, another 500mb is consumed from the heap, since it is converted to a byte buffer before s

Re: HBase Unable to find Region Server - No Exception being thrown

2014-04-10 Thread Asaf Mesika
There is a property you can tune to lower default num of retries from 10 to any number like 2. On Wednesday, April 9, 2014, kanwal wrote: > I'm currently running into an issue on my local setup where my application > is > unable to connect to the hbase table but I'm successfully able to query th

Re: endpoint coprocessor

2014-04-10 Thread Asaf Mesika
Bear in mind each region will return its top n, then you will have to run another top n in your client code. This introduce a numerical error : top on top. On Thursday, April 10, 2014, Bogala, Chandra Reddy wrote: > Hi, > I am planning to write endpoint coprocessor to calculate TOP N results for

Re: BlockCache for large scans.

2014-04-09 Thread Asaf Mesika
The Jira says it's enabled by auto. Is there an official explaining this feature? On Wednesday, April 9, 2014, Ted Yu wrote: > Please take a look at http://www.n10k.com/blog/blockcache-101/ > > For D, hbase.regionserver.global.memstore.size is specified in terms of > percentage of heap. Unless y

Re: HBase- Hive Integration

2014-03-24 Thread Asaf Mesika
Is it more like Phoenix or Apache Drill? On Wednesday, March 19, 2014, Tapper, Gunnar wrote: > Hi, > > Sorry it took me so long to respond to this; I got very busy with shipping > HPDSM. :) > > Trafodion has been tested with 0.94.6; 0.94.22 testing is in progress. > > You can't download the Traf

Re: RegionServer stuck in internalObtainRowLock forever - HBase 0.94.7

2014-02-17 Thread Asaf Mesika
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:320) at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1426) On Mon, Feb 10, 2014 a

RegionServer stuck in internalObtainRowLock forever - HBase 0.94.7

2014-02-10 Thread Asaf Mesika
Hi, We have HBase 0.94.7 deployed in production with 54 Region Servers (Hadoop 1). Couple of days ago, we had an incident which made our system unusable for several hours. HBase started emitting WARN exceptions indefinitely, thus failing any writes to it. Until stopped this RS, the issue wasn't re

Re: Questions about HBase load balancing and HFile

2014-01-21 Thread Asaf Mesika
If hot means many requests, then it's only in 0.96 right? 0.94 is only addressing capacity load on terms of numbers of regions per region server of the same table. On Monday, January 20, 2014, Ted Yu wrote: > bq. under heavy load by serving to hot regions > > Did you mean 'two hot regions' ? > I

Re: Fast scan with PrefixFilter?

2014-01-17 Thread Asaf Mesika
Your start rowkey is the prefix it self. Your end row key is the prefix +1. Increase the last byte by 1. I can send you the increase function if you need. No filter needed. On Wednesday, January 15, 2014, Ramon Wang wrote: > Hi Folks > > We have a table with fixed pattern row key design, the for

Re: Is get a private case of scan ?

2014-01-16 Thread Asaf Mesika
You also can't be sure the second rowkey will be in the same region which is requires you to go on the network to another region server. On Thursday, January 16, 2014, Amit Sela wrote: > I have a case where I want to "split" rows with a lot of qualifiers (a very > small amount of rows < 1%, with

Re: hbase-orm

2014-01-05 Thread Asaf Mesika
One strong storage efficient feature would be to serialize your Pojo to binary using Avro (as one example) and saving it as a column. This saves you on storage since you don't have to write your row key for every column. If you load the entire Pojo it will save on read time. Also it enables to supp

Re: a short introduction of simplehbase(hbase ORM)

2014-01-05 Thread Asaf Mesika
There are two projects doing orm for HBase, one named Kundera - check it out. On Sunday, January 5, 2014, zhang_xzhi wrote: > I just read some short introduction of Phoenix these days. > If I am wrong, pls let me know, thx. > It seems to me Phoenix will provide a JDBC-driver against hbase. > But

Re: use hbase as distributed crawl's scheduler

2014-01-03 Thread Asaf Mesika
Couple of notes: 1. When updating to status you essentially add a new rowkey into HBase, I would give it up all together. The essential requirement seems to point at retrieving a list of urls in a certain order. 2. Wouldn't salting ruin the sort order required? Priority, date added? On Friday, Jan

Re: secondary index feature

2014-01-03 Thread Asaf Mesika
Are the regions scanned in parallel? On Friday, January 3, 2014, rajeshbabu chintaguntla wrote: > > Here are some performance numbers with RLI. > > No Region servers : 4 > Data per region: 2 GB > > Regions/RS| Total regions| Blocksize(kb) |No#rows matching values| Time > taken(sec)| > 50 |

Re: Performance tuning

2013-12-28 Thread Asaf Mesika
Does Phoenix exposes metrics about its code execution? Network time, coprocessor time, client time, etc? On Sunday, December 22, 2013, lars hofhansl wrote: > You would have to measure the incoming/outgoing traffic on the affected > machine. > > The easiest is to periodically check the output of i

Filter thread safe?

2013-12-22 Thread Asaf Mesika
Hi, Are Filter implementations need to be thread-safe? Thanks, Asaf

ColumnQualifier List Of Ranges filter

2013-12-22 Thread Asaf Mesika
Hi, Does anybody have an open source implementation of ColumnRangeFilter but for multiple ranges?

Re: Strange Problem on using HRegion's getScanner Method in RegionServer

2013-12-04 Thread Asaf Mesika
How do you check in your code you only have one region? How do you retrieve the HRegion exactly? Since you are running in inside a coprocessor I presume you have such rpc server per region since Hbase creates an instance of your coprocessor for each region. On Tuesday, December 3, 2013, Wukang Lin

Re: Hbase Region Size

2013-12-02 Thread Asaf Mesika
In this method, you can get the region's Load per region: private Map getRegionsLoad() { try { Map regionsNameToLoad = new HashMap(); ClusterStatus clusterStatus = hAdmin.getClusterStatus(); for (ServerName serverName : clusterStatus.getServers()) {

Re: Replication metrics with more than one Peer/Slave

2013-11-28 Thread Asaf Mesika
I tacked the same problem, and was answerd it was fixed in 0.95 On Thursday, November 28, 2013, Pablo Medina wrote: > Hi all, > > Knowing that replication metrics are global at the region server level in > HBase 0.94.13, what is the meaning of a metric like sizeOfLogQueue when > replicating to mo

Re: HBase value design

2013-11-28 Thread Asaf Mesika
On our project we store nested record structures with 10-40 fields. We have decided to save on storage and write throughout by writing a serialized avro record as value. We place one byte before to allow versioning. We did it since each column is written with its rowkey, cq, cf and timestamp. Your

Re: How to improve the write performance?

2013-11-28 Thread Asaf Mesika
thread? > > And how to optimize it? > > Config BLOCKSIZE/WriteBuffer ...? > > I'm a newbie. > Thanks for help! > > ------ > > jingych > > *From:* Asaf Mesika > *Date:* 2013-11-26 19:20 > *To:* user@hbase.apache.org 'use

Re: HBase: Paralel Query

2013-11-26 Thread Asaf Mesika
The need is too broad. You need to do through the HBase jmx metrics, ands machine metrics to see what is your bottleneck. On Tuesday, November 26, 2013, Job Thomas wrote: > > > Hi All, > > How can we configure Hbase inorder to perform multythreading/parallel > query faster . > > These are some

Re: How to improve the write performance?

2013-11-26 Thread Asaf Mesika
How many client threads? On Tuesday, November 26, 2013, jingych wrote: > Hi, All! > > Recently, I just test the Client write performance with HBase Java API. > > I think the performance X-factor is : > > 1) client buffer size; > 2) server memstore size; > > But when i did the test, I have the re

Re: How to get Metadata information in Hbase

2013-11-25 Thread Asaf Mesika
bin/hbase shell In there: Type help and you'll get along On Monday, November 25, 2013, ashishkshukladb wrote: > I want to get the metadata information in Hbase. My basic purpose is to - > > 1.get the information about tables like how many tables and name of tables. > > 2.get the columnfamilies na

Re: Uneven write request to regions

2013-11-20 Thread Asaf Mesika
with it, so it may work > > for you, too. > > > > Otis > > -- > > Performance Monitoring * Log Analytics * Search Analytics > > Solr & Elasticsearch Support * http://sematext.com/ > > > > > > On Wed, Nov 20, 2013 at 1:00 AM, Asaf Mesika &

Re: Uneven write request to regions

2013-11-20 Thread Asaf Mesika
oring * Log Analytics * Search Analytics > Solr & Elasticsearch Support * http://sematext.com/ > > > On Wed, Nov 20, 2013 at 1:00 AM, Asaf Mesika > wrote: > > Thanks for clearing that out. > > I'm using your message to ping anyone who assist as to it appears the use

Re: Uneven write request to regions

2013-11-19 Thread Asaf Mesika
; But I agree that in general with this kind of workload, we should handle > WAL files more intelligently and free up those WAL files which don't have > any dependency (that is, all their entries are already flushed) when > archiving. We do that in trunk but not in any released version,

Re: HBase - number of regionserver used

2013-11-18 Thread Asaf Mesika
The crashing sounds odd. What's your exception? On Monday, November 11, 2013, Andrea wrote: > Hi, I have HBase above Hadoop with one master node, one zookeeper node and > four regionservers. When I turn up everything, on two nodes (or sometime > the > same node) are created the tables .META. and

Re: HBase as a transformation engine

2013-11-18 Thread Asaf Mesika
Are you reading using HBase client or do you have an inputFormat for reading HFiles? On Wednesday, November 13, 2013, Amit Sela wrote: > Hi, > > We do something like that programmatically. > Read blobbed HBase data (qualifiers represent cross-sections such as > country_product and blob data such

Re: Save the bandwidth usage

2013-11-17 Thread Asaf Mesika
My "dream" is to get to your point. I was always stopped before reaching network limit. My premise was the WAL synchronize was the key bottleneck in write. How much data are you inserting? How much client threads? Batch size in client? Share some more info on your cluster and test setup. On Thurs

Re: Get HBase Read and Write requests per second separately

2013-11-17 Thread Asaf Mesika
Using java, you can use HBaseAdmin to get cluster load, there, for each region you can get write and read request. Just sum up all regions to get total read and total write request. On Thursday, November 14, 2013, Sandeep L wrote: > Is it possible to get from api instead of hbase_metrics. > > Tha

Re: Uneven write request to regions

2013-11-16 Thread Asaf Mesika
e. > > Hope this helps, > Mike > > > On Sat, Nov 16, 2013 at 1:16 AM, Ted Yu wrote: > > > bq. all regions of that customer > > > > Since the rowkey starts with , any single customer would only > > span few regions (normally 1 region), right ? > > &

Re: Uneven write request to regions

2013-11-16 Thread Asaf Mesika
bucket. On Sat, Nov 16, 2013 at 8:16 AM, Ted Yu wrote: > bq. all regions of that customer > > Since the rowkey starts with , any single customer would only > span few regions (normally 1 region), right ? > > > On Fri, Nov 15, 2013 at 9:56 PM, Asaf Mesika > wrote: > >

Re: Uneven write request to regions

2013-11-15 Thread Asaf Mesika
ince you > query > > on it > > > > Have you looked at this API in Scan ? > > > > public Scan setTimeRange(long minStamp, long maxStamp) > > > > > > Cheers > > > > > > On Fri, Nov 15, 2013 at 1:28 PM, Asaf Mesika > > wrote: > &g

Re: Uneven write request to regions

2013-11-15 Thread Asaf Mesika
gt; Have you looked at this API in Scan ? > > public Scan setTimeRange(long minStamp, long maxStamp) > > > Cheers > > > On Fri, Nov 15, 2013 at 1:28 PM, Asaf Mesika > wrote: > > > The problem is that I do know my rowkey design, and it follows people's > >

Re: Uneven write request to regions

2013-11-15 Thread Asaf Mesika
region" > > Cheers > Ramon > > > On Thu, Nov 14, 2013 at 8:47 PM, Asaf Mesika > wrote: > > > It's from the same table. > > The thing is that some simply have less data saved in HBase, > > while others have x50 (max) data. > > I'm tryin

Re: HBase with multiple interfaces

2013-11-14 Thread Asaf Mesika
We are using both of the following properties: hbase.regionserver.dns.interface, base.master.dns.interface. Both set to the interface name we want. We have two interfaces as you described - one for inner communication and one for external. What exactly is not working for you? On Wed, Nov 13, 20

Re: Uneven write request to regions

2013-11-14 Thread Asaf Mesika
or different table, you may consider to combine some > cold regions for some tables. > > Thanks > Ramon > > > On Thu, Nov 14, 2013 at 4:59 PM, Asaf Mesika > wrote: > > > Hi, > > > > Have anyone ran into a case where a Region Server is hosting regions, in >

Uneven write request to regions

2013-11-14 Thread Asaf Mesika
Hi, Have anyone ran into a case where a Region Server is hosting regions, in which some regions are getting lots of write requests, and the rest gets maye 1/1000 of the rate of write requests? This leads to a situation where the HLog queue reaches its maxlogs limit since, those HLogs containing t

MSLAB Metrics

2013-11-12 Thread Asaf Mesika
Hi, I see that https://issues.apache.org/jira/browse/HBASE-3680 is talking about adding MSLAB metrics (size of mslab for region, etc). I see that it is fixed in 0.92.3, but when looking at the source code of 0.94.7 I can't find it in RegionServerMetrics.java, or any other java file. Am I missing

Re: So where is the store file flushed to?

2013-11-10 Thread Asaf Mesika
Also note that due to HDFS placement policy the store file is written locally - we'll first replica of each of its blocks so it's dispersed amongst the drives you have onto on the data node which is also the region server. On Thursday, November 7, 2013, Ted Yu wrote: > The store file is flushed t

Re: Column qualifiers with hierarchy and filters

2013-11-10 Thread Asaf Mesika
Both are created when you declare the table and not in runtime so in shouldn't matter to you anyway On Thursday, November 7, 2013, Nasron Cheong wrote: > Why is that? Afaik everything is just a byte sequence, what prevents > non-printable chars from being used in CF/table names? > > - Nasron > >

Re: setCaching and setBatch

2013-11-04 Thread Asaf Mesika
On this note: If I hammer a region server with scans that has let's say 100 rows caching - is there an OOME protections which lowers the caching in case ther server is stressed in memory? Or will it fail the region server with OOME? On Monday, November 4, 2013, Anoop John wrote: > Have u tested t

Re: OutOfMemoryError in MapReduce Job

2013-11-03 Thread Asaf Mesika
ly. The > question is, where the bytes are compressed? Directly in the map-function > (If no, is it possible to compress it there with lzo?!) or in the region > server? > > kind regards > > > 2013/11/2 Asaf Mesika > > > If mean, if you take all those bytes if the bit

Re: OutOfMemoryError in MapReduce Job

2013-11-02 Thread Asaf Mesika
you know an implementation? > > kind regards > > > 2013/11/2 Asaf Mesika > > > > I would try to compress this bit set. > > > > On Nov 2, 2013, at 2:43 PM, John > > wrote: > > > > > Hi, > > > > > > thanks for your answe

Re: OutOfMemoryError in MapReduce Job

2013-11-02 Thread Asaf Mesika
I would try to compress this bit set. On Nov 2, 2013, at 2:43 PM, John wrote: > Hi, > > thanks for your answer! I increase the "Map Task Maximum Heap Size" to 2gb > and it seems to work. The OutOfMemoryEroror is gone. But the HBase Region > server are now crashing all the time :-/ I try to stor

Re: You Are Dead Exception due to promotion failure

2013-11-01 Thread Asaf Mesika
Can you please explain why is this suspicious? On Monday, October 7, 2013, Jean-Daniel Cryans wrote: > This line: > > [CMS-concurrent-mark: 12.929/88.767 secs] [Times: user=14.30 sys=3.74, > real=88.77 > secs] > > Is suspicious. Are you swapping? > > J-D > > > On Mon, Oct 7, 2013 at 8:34 AM, prak

CDH 4.4 and java7

2013-11-01 Thread Asaf Mesika
Hi, I've been reading here that HBase 0.94.x has been working in production for a few folks here with Java7. I also read that CDH4.4 is not recommended to work with Java 7 in production. Anybody has any idea why?

Re: HBase Random Read latency > 100ms

2013-11-01 Thread Asaf Mesika
How many Parallel GC were you using? Regarding block cache - just to see I understood this right: if your are doing a massive read in HBase it's better to turn off block caching through the Scan attribute? On Thursday, October 10, 2013, Otis Gospodnetic wrote: > Hi Ramu, > > I think I saw mentio

Re: row filter - binary comparator at certain range

2013-11-01 Thread Asaf Mesika
Bucket seems like a rather good name for it. The method for generating could be Hash, running sequence modded, etc. So HashBucket, RoundRobinBucket, etc. On Tuesday, October 22, 2013, James Taylor wrote: > One thing I neglected to mention is that the table is pre-split at the > "prepending-row-ke

Re: Column qualifiers with hierarchy and filters

2013-10-31 Thread Asaf Mesika
Can you give an example of your query? On Friday, November 1, 2013, Nasron Cheong wrote: > Hi, > > I'm trying to determine the best way to serialize a sequence of > integers/strings that represent a hierarchy for a column qualifier, which > would be compatible with the ColumnPrefixFilters, and Bi

Re: High Full GC count for Region server

2013-10-28 Thread Asaf Mesika
Check through HDFS UI that your cluster haven't reached maximum disk capacity On Thursday, October 24, 2013, Vimal Jain wrote: > Hi Ted/Jean, > Can you please help here ? > > > On Tue, Oct 22, 2013 at 10:29 PM, Vimal Jain > > wrote: > > > Hi Ted, > > Yes i checked namenode and datanode logs and i

Re: [ANNOUNCE] Phoenix v 2.1 released

2013-10-28 Thread Asaf Mesika
I couldn't get the Row Value Constructor feature. Do you perhaps have a real world use case to demonstrate this? On Friday, October 25, 2013, James Taylor wrote: > The Phoenix team is pleased to announce the immediate availability of > Phoenix 2.1 [1]. > More than 20 individuals contributed to th

Re: Linear Scalability in HBase

2013-10-28 Thread Asaf Mesika
That seems like too much client threads. How much mb/sec did you on that 1 RS? On Friday, October 25, 2013, Vladimir Rodionov wrote: > You can not saturate region server with one client (unless you probably > use hbase-async) if all data is cached in RAM. > In our performance tests we have run 10

Re: HBase wont responding when one region server goes down

2013-10-28 Thread Asaf Mesika
If your query (scan) needs a region on the failed region server, the client will fail and silently retry about 10 times. The sleep time increase as each retry fails and can reach 10min. On the server side, the master takes 3min to realize the RS failed thus issue a region move, which might take a f

Re: HBASE help

2013-10-28 Thread Asaf Mesika
How much data do you that you need NoSQL db? On Monday, October 28, 2013, saiprabhur wrote: > Hi Folks, > > New to NOSQL designing data model for primary care system. i have > normalized > sample DB relationship model e.g. HBASE-0.94.0 > > Patient table: > > 1) Patient_id - PK > 2) A

Endpoint Coprocessor Protocol and implementation versioning

2013-10-17 Thread Asaf Mesika
Hi, I'm writing an Endpoint Coprocessor for HBase v0.94.6 (cdh4.3.1). I'm trying to understand how the CoprocessorProtocol and CoprocessorEndPoint implementation versioning works, both backwards and forward. For instance, I have my protocol as: public interface MyQueryProtocol extends Coprocess

Re: timeouts with lots of coprocessor puts on single row

2013-08-26 Thread Asaf Mesika
We did the same but on the client side, without any issue On Monday, August 26, 2013, Olle MÃ¥rtensson wrote: > Hi, > > I have developed a coprocessor that is extending BaseRegionObserver and > implements the > postPut method. The postPut method scans the columns of the row that the > put was issu

Re: CVE-2013-2193: Apache HBase Man in the Middle Vulnerability

2013-08-24 Thread Asaf Mesika
Any Cloudera release for that as well? On Saturday, August 24, 2013, Aaron T. Myers wrote: > Hello, > > Please see below for the official announcement of a serious security > vulnerability which has been discovered and subsequently fixed in Apache > HBase releases. > > Best, > Aaron > > -BEGI

Re: Client Get vs Coprocessor scan performance

2013-08-17 Thread Asaf Mesika
Ted, can you elaborate a little bit why this issue boosts performance? I couldn't figure out from the issue comments if they execCoprocessor scans the entire .META. table or and entire table, to understand the actual improvement. Thanks! On Fri, Aug 9, 2013 at 8:44 AM, Ted Yu wrote: > I thin

Re: Hbase update use case

2013-08-12 Thread Asaf Mesika
If you can mark a row by adding a column qualifier which will be used as your flag by its existence, and its name will be lexicographically first, then it won't be slow as you said about filters below. On Monday, August 12, 2013, ccalugaru wrote: > Hi all, > I have the following hbase use case: >

Re: Memory distribution for Hadoop/Hbase processes

2013-08-08 Thread Asaf Mesika
Please list down your jvm switches for hbase On Wednesday, August 7, 2013, Vimal Jain wrote: > Hi Ted, > I am using centOS. > I could not get output of "ps aux | grep pid" as currently the hbase/hadoop > is down in production due to some internal reasons. > > Can you please help me in figuring ou

Re: HDFS Restart with Replication

2013-08-06 Thread Asaf Mesika
Yep. That's a confusing one. When running /hbase stop master, it sets the shutdown flag in ZK. RS listen in on this flag, and once they see it set, they shut them selfs down. Once they are all down, the master goes down as well. On Saturday, August 3, 2013, Jean-Daniel Cryans wrote: > Ah then doi

Re: Elastic Search Plugin (River) for HBase

2013-08-06 Thread Asaf Mesika
Cool! Do you guys store your data twice then? Once on HBase and another on ES? And then add index data? On Tuesday, August 6, 2013, Alex Newman wrote: > Based on the previous work using async libraries to index HBase into > elastic search, I've created. > > https://github.com/posix4e/Elasticsearc

Re: Puppetized HBASE + Passwordless SSH examples

2013-08-06 Thread Asaf Mesika
We have also developed a fabric customization for HBase stack and all our components. Anyone had experience with Ambari relative to Fabric? On Tuesday, August 6, 2013, Jay Vyas wrote: > thanks .. ! > > well there certainly are a few customized ones on github but they are > varying degrees of mai

M/R to HBase - affinity to Region Server

2013-07-19 Thread Asaf Mesika
Hi, Is there a way in MRv2 to direct a reducer to run in the RS it will write its Puts, thus achieve write locality? Thanks! Asaf

Re: Making HBase easier to understand

2013-07-14 Thread Asaf Mesika
Very cute. On Monday, July 15, 2013, anil gupta wrote: > Hey Swarnim, > > Really nice work. Keep on going! > > ~Anil > > > > On Sun, Jul 14, 2013 at 5:59 PM, kulkarni.swar...@gmail.com < > kulkarni.swar...@gmail.com > wrote: > > > Hello all, > > > > In order to help understand the intricacies of

Re: Calculating size in HBase.

2013-07-14 Thread Asaf Mesika
If you get KeyValue object by the client, then you have its size for free (it's the size of its internal byte array) - so why is it time consuming? On Fri, Jul 5, 2013 at 2:23 AM, Bikash Agrawal wrote: > Hi , > > Need to calculate data size in HBase. I can do it by using KV length. But > it is t

Re: the scan will be executed parallel if not use coprocessor?

2013-07-14 Thread Asaf Mesika
To my knowledge, scan is not parallel, hence the speed of queries of Impala, Phoenix, and other similar projects. On Saturday, July 13, 2013, ch huang wrote: > hi ted ,for example i have a table with 10 regions, if i offer the > condition hit the data of 8 regions,is it different do it use oragin

Re: small hbase doubt

2013-07-13 Thread Asaf Mesika
gt; On Thu, Jul 11, 2013 at 9:11 PM, Asaf Mesika > > > wrote: > > > Do you think prefix compression can also be utilized here? In our use > case > > we sent a list of Put of counters in which the key is quite long and the > > keys are quite similar to one another.

Re: if i wirte myself endpoint ,how can i load it onto server?

2013-07-12 Thread Asaf Mesika
Dump the jar in the lib directory in hbase , for ever region server. On Friday, July 12, 2013, ch huang wrote: > ATT >

Re: problem in testing coprocessor endpoint

2013-07-12 Thread Asaf Mesika
do is just load it into my test table ,only for > the regions of the table > > On Fri, Jul 12, 2013 at 12:07 PM, Asaf Mesika > wrote: > > > The only way to register endpoint coprocessor jars is by placing them in > > lib dir if hbase and modifying hbase-site.xml to point

Re: HBase issues since upgrade from 0.92.4 to 0.94.6

2013-07-12 Thread Asaf Mesika
You need to see the jvm crash in .out log file and see if maybe its the .so native Hadoop code that making the problem. In our case we Downgraded from jvm 1.6.0-37 to 33 and it solved the issue. On Friday, July 12, 2013, David Koch wrote: > Hello, > > NOTE: I posted the same message in the the C

Re: small hbase doubt

2013-07-11 Thread Asaf Mesika
k at http://hbase.apache.org/book.html#d2617e13654 and section > J.4.3.2 > > On Thu, Jul 11, 2013 at 9:01 PM, Asaf Mesika > > > wrote: > > > I thought that in 0.95 ProtoBuf provides RPC compression, no? > > > > On Friday, July 12, 2013, Alok Singh Mahor wrote: > >

Re: problem in testing coprocessor endpoint

2013-07-11 Thread Asaf Mesika
The only way to register endpoint coprocessor jars is by placing them in lib dir if hbase and modifying hbase-site.xml to point to it under a property name I forgot at the moment. What you described is a way to register an Observer type coprocessor. On Friday, July 12, 2013, ch huang wrote: > i

Re: small hbase doubt

2013-07-11 Thread Asaf Mesika
I thought that in 0.95 ProtoBuf provides RPC compression, no? On Friday, July 12, 2013, Alok Singh Mahor wrote: > To Jean : > Thanks for replying. well could you please elaborate your answer..and by > that 'query' ..i meant can anyone clear my doubt :-) > > To Doug: > Thanks for replying. but the

Re: HBasecon 2013 slides

2013-07-11 Thread Asaf Mesika
Great! Waiting for the videos as it looks like a very interesting conference. On Wednesday, July 10, 2013, Azuryy Yu wrote: > Hi dear all, > > HBase con 2013 slides are available now. > > http://www.hbasecon.com/schedule/ > > Just share information here. >

Re: G1 before/after GC time graph

2013-07-11 Thread Asaf Mesika
This means you can safely run Hadoop and Hbase on jvm 7? We were just considering switching in production to java 7. On Thursday, July 11, 2013, Azuryy Yu wrote: > Otis, > > I will do this test, maybe on the end of this month. because I haven't big > memory server for test now util the end of thi

Re: question about clienttrace logs in hdfs and shortcircuit read

2013-07-04 Thread Asaf Mesika
What the hdfs data locality metric? And remote read and local read? On Thursday, July 4, 2013, Viral Bajaria wrote: > Currently datanode shows a lot of clienttrace logs for DFSClient. I did a > quick command line check to see how many clienttrace do I get per active > RegionServer and it seems th

Re: question about clienttrace logs in hdfs and shortcircuit read

2013-07-03 Thread Asaf Mesika
I think there is a metric in HBase and HDFS (JMX) reflecting that. If you find it and find it useful, do tell... On Thursday, July 4, 2013, Viral Bajaria wrote: > Hi, > > If I have enabled shortcircuit reads, should I ever be seeing clienttrace > logs in the datanode for the regionserver DFSClien

Re: Help in designing row key

2013-07-03 Thread Asaf Mesika
Seems right. You can make it more efficient by creating your result array in advance and then fill it. Regarding time filtering. Have you see that in Scan you can set start time and end time? On Wednesday, July 3, 2013, Flavio Pompermaier wrote: > All my enums produce positive integers so I don't

Re: Hbase with Servlet and JSP

2013-07-03 Thread Asaf Mesika
Did you somehow turned the Security flag on for HBase since your exception is Security related. On Wednesday, July 3, 2013, SamSalman wrote: > Hello Experts, > > I am quite new to Hbase and Hadoop and above all new to java too. Recently > started working on Hbase and Java. I have successfully ins

Re: When to expand vertically vs. horizontally in Hbase

2013-07-03 Thread Asaf Mesika
Do you have only 5 static author names? Keep in mind the column family name is defined when creating the table. Regarding tall vs wide debate: HBase is first and for most a Key Value database thus reads and writes in the column-value level. So it doesn't really care about rows. But it's not entire

Re: Schema design for filters

2013-06-28 Thread Asaf Mesika
Yep. Other DBs like Mongo may have the stuff you need out of the box. Another option is to encode the whole class using Avro, and writing a filter on top of that. You basically use one column and store it there. Yes, you pay the penalty of loading your entire class and extract the fields you need t

Re: Replication not suited for intensive write applications?

2013-06-22 Thread Asaf Mesika
rt of the code. > > > > I have no good explanation then, and also no good suggestion about how > to improve this. > > > > > > > > > > From: Asaf Mesika > > To: "user@hbase.apache.org" ; lars hofhansl < > la...@apach

Re: Replication not suited for intensive write applications?

2013-06-21 Thread Asaf Mesika
in the local cluster. > > Also my first reply - upon re-reading it - sounded a bit rough, that was > not intended. > > -- Lars > > > - Original Message - > From: Asaf Mesika > To: "user@hbase.apache.org" ; lars hofhansl < > la...@apache.org> &

Re: CopyTable

2013-06-20 Thread Asaf Mesika
Did you guys thought about adding coprocessors hooks to the replication, like preReplicateLogEntries, or something like that? I mean, in his case, perhaps utilizing it could have changed the table name before running through the replication process at the sink RS. On Fri, Jun 21, 2013 at 1:48 AM

Re: Writing unit tests against HBase

2013-06-20 Thread Asaf Mesika
You shouldn't tweak any Configuration at all. It simply works - setting up both mini HDFS and mini HBase. On Fri, Jun 21, 2013 at 2:36 AM, Varun Sharma wrote: > Hi Christopher, I am looking into getting maven 3.0.4 installed before i > start playing with it. > > Elliot, does the HBaseTestingUt

Re: Possibility of using timestamp as row key in HBase

2013-06-20 Thread Asaf Mesika
On Thu, Jun 20, 2013 at 9:42 PM, yun peng wrote: > Thanks Asaf, I made the response inline. > > On Thu, Jun 20, 2013 at 9:32 AM, Asaf Mesika > wrote: > > > On Thu, Jun 20, 2013 at 12:59 AM, yun peng > wrote: > > > > > Thanks for the reply. The id

Re: Replication not suited for intensive write applications?

2013-06-20 Thread Asaf Mesika
e time it takes to run the HTable.batch call on the slave RS, right? If I enable compression somehow (hack HBase code to test drive it), I will only speed up transfer time of the batch to the slave RS, but still wait on the insertion of this batch into the slave cluster. > -- Lars >

Re: Replication not suited for intensive write applications?

2013-06-20 Thread Asaf Mesika
lly, but this under utilizing my cluster hardware, no? > As for your other question, more threads can lead to better interleaving > of CPU and IO, thus leading to better throughput (this relationship is not > linear, though). > > > > -- Lars > > > > - O

Re: Issues adding column families

2013-06-20 Thread Asaf Mesika
if (!admin.tableExists(tableName)){ Shouldn't it be: if (admin.tableExists(tableName)){ ? On Thu, Jun 20, 2013 at 6:42 PM, Aji Janis wrote: > I have the following function for adding column family but its not working > for some reason. I am able to get proper configuration and establish the >

Re: Replication not suited for intensive write applications?

2013-06-20 Thread Asaf Mesika
bably includes HDFS replication. There's much I can do about that though. We should note that the network capacity is not the issue, since it was measured 30MB/sec Receive and 20MB/sec Transmit, thus it's far from the measured max bandwidth of 111MB/sec (measured by running nc - netcat).

Re: Possibility of using timestamp as row key in HBase

2013-06-20 Thread Asaf Mesika
gion mapping. > Why do you need the sequential key approach? Let's say you have a group data correlated in some way but is scattered in 2-3 RS. You can always write a coprocessor to run some logic close to the data, and then run it again on the merged data in the client side, right? > >

Replication not suited for intensive write applications?

2013-06-20 Thread Asaf Mesika
Hi, I've been conducting lots of benchmarks to test the maximum throughput of replication in HBase. I've come to the conclusion that HBase replication is not suited for write intensive application. I hope that people here can show me where I'm wrong. *My setup* *Cluster (*Master and slave are al

Re: Possibility of using timestamp as row key in HBase

2013-06-19 Thread Asaf Mesika
omize such policy on hmaster? Or there is a similiar > way as what CoProcessor does on region servers... > > > On Wed, Jun 19, 2013 at 4:58 PM, Asaf Mesika > > > wrote: > > > The new splitted region might be moved due to load balancing. Aren't you > > experienci

Re: Possibility of using timestamp as row key in HBase

2013-06-19 Thread Asaf Mesika
The new splitted region might be moved due to load balancing. Aren't you experiencing the classic hot spotting? Only 1 RS getting all write traffic? Just place a preceding byte before the time stamp and round robin each put on values 1-num of region servers. On Wednesday, June 19, 2013, yun peng w

Re: HBase Replication ageOfLastShippedOp

2013-06-19 Thread Asaf Mesika
:09 AM, Asaf Mesika > wrote: > > > Does HBase Region Server arbitrarily chooses which one to publish to its > > > > The metrics reports the last source to report in. So for recovered edits > since it will probably have higher throughput than the live log, you'll >

  1   2   3   >