Accumulo Storage Manager

2015-09-08 Thread peter.mar...@baesystems.com
Hi, I have been trying out the Hive Accumulo Manager as described here https://cwiki.apache.org/confluence/display/Hive/AccumuloIntegration and it seems to work as advertised. Thanks. However I don't seem to be able to get any sensible results when I have a Hive column of type ARRAY<> like ARRA

RE: Accumulo Storage Manager

2015-09-10 Thread peter.mar...@baesystems.com
Hi Josh, At this stage I don't know whether there's anything wrong with Hive or it's just user error. Perhaps if I go through what I have done you can see where the error lies. Unfortunately this is going to be wordy. Apologies in advance for the long email. So I created a "normal" table in HDF

RE: Accumulo Storage Manager

2015-09-21 Thread peter.mar...@baesystems.com
unit tests would also be great :). The type handling can be a little difficult but there are likely some places to start in the accumulo or hbase handler tests. At worst, we can start by writing a qtest that will reproduce your errors using a full environment (Accumulo minicluster, etc). peter.

Table statistics

2015-12-15 Thread peter.mar...@baesystems.com
Hi, I was wondering if there is any "recognized" way to obtain table statistics. Ideally, given a Key range I would like to know the number of distinct rowids, entries and amount of data (in bytes) in that key range. I assume that Accumulo holds at least some of this information internally, part

FW: Table statistics

2015-12-15 Thread peter.mar...@baesystems.com
Sorry, wrong list. Z From: peter.mar...@baesystems.com [mailto:peter.mar...@baesystems.com] Sent: 15 December 2015 09:39 To: user@hive.apache.org Subject: Table statistics Hi, I was wondering if there is any "recognized" way to obtain table statistics. Ideally, given a Key range I wou

Stored By

2016-01-21 Thread peter.mar...@baesystems.com
Hi, So I am using the AccumuloStorageHandler to allow me to access Accumulo tables from Hive. This works fine. So typically I would use something like this: CREATE EXTERNAL TABLE test_text (rowid STRING, testint INT, testbig BIGINT, testfloat FLOAT, testdouble DOUBLE, teststring STRING, testboo

RE: Stored By

2016-02-16 Thread peter.mar...@baesystems.com
Stored By Hi Why not write your own storage handler extending AccumuloStorageHandler and overriding getInputFormatClass() to return your HiveAccumuloTableInputFormat subclass? hth Gabriel Balan On 1/21/2016 10:46 AM, peter.mar...@baesystems.com wrote: H