Is Insert Overwrite table partition on s3 is an atomic operation ?

2021-01-11 Thread Mark Norkin
;ve missed it and it is documented somewhere, or maybe someone knows from their experience? If it's an atomic operation, is there any difference whether the table is external or a managed one? Thank you, Mark

Materialized views generation

2018-11-30 Thread mark pasterkamp
Could anyone point me a bit into the right direction as to where such code is supposed to be written? With kind regards, Mark

Fw: very informative article

2016-08-28 Thread Mark Markovic
Hey, I was looking for some info and found that nice article, just read it here <http://spardepengi.paidmarketing.tips/aepysl> Wishes, Mark Markovic

unsubscribe

2016-08-23 Thread Mark Memory

how can I access HDFS(namenode HA) by using hive-common-1.2.1.jar??

2016-05-19 Thread Mark Memory
Hi guys, who knows why class HiveConf has no HADOOPCONF enum type in hive common jar now? I write code using hive-common-1.2.1.jar HiveConf class to access HDFS(HA namenode), and I get an error below. I realized my code didn't config HADOOPCONF so it can't connect to HDFS, but there is no HADOOPCO

Re: Hello, I have an issue about hcatalog

2016-05-18 Thread Mark Memory
-19 8:12 GMT+08:00 Alan Gates : > This looks to me like a Hadoop issue rather than Hive. It appears that > you cannot connect to HDFS. Have you tried connecting to HDFS outside of > Hive/HCatalog? > > Alan. > > > On May 18, 2016, at 04:24, Mark Memory wrote: > > &g

Hello, I have an issue about hcatalog

2016-05-18 Thread Mark Memory
hello guys, sorry to bother you. I'm using hcatalog to write hive tables, but I don't know how to do with namenode HA my code was copied from https://github.com/apache/hive/blob/master/hcatalog/core/src/test/java/org/apache/hive/hcatalog/data/TestReaderWriter.java *below is my config:* hiveC

Re: Hive function to convert numeric IP address to "dot" format?

2015-11-05 Thread Mark Sunderlin
Actually, I found a UDF that does this very nicely! https://github.com/petrabarus/HiveUDFs/blob/master/src/main/java/net/petrabarus/hiveudfs/LongToIP.java Share and enjoy! -- Mark E. Sunderlin Data Architect // Data Solutions P: 703-265-6935 // C: 540-327-6222 // 22000 AOL Way, Dulles, VA

Hive function to convert numeric IP address to "dot" format?

2015-10-22 Thread Mark Sunderlin
it in the "dot" format. raw_formatdot_format166136010699.6.87.234162645867796.241.202.53 -- Mark E. Sunderlin Data Architect // Data Solutions P: 703-265-6935 // C: 540-327-6222 // 22000 AOL Way, Dulles, VA 20166 AIM: MESunderlin // Slack: @Mark.Sunderlin

Database view of the file system

2014-07-26 Thread Mark Wildig
very much appreciated Thanks in advance Mark Wildig Sent from my iPhone

Re: Efficient Equality Joins of Large Tables

2014-06-09 Thread Mark Desnoyer
join because you're using the partitioning step in order to enforce the join predicate instead of actually evaluating it. This could be much more efficient in some cases, so I was wondering if Hive can do it and if so, how can I trigger it? Cheers, Mark On Mon, Jun 9, 2014 at 10:34 AM,

Efficient Equality Joins of Large Tables

2014-06-09 Thread Mark Desnoyer
d the associated rows are rare, it's very inefficient. So, can hive perform the join the first way I described? If so, any idea how I trigger it? Or do I have to write my own MR job? Cheers, Mark

Question on converting an array of structs into a map

2014-04-08 Thread Sunderlin, Mark
A single result set that looks like: Sequence_numint User_id string User-attributes map /* A map of keyid=valueid for each struct from the original array */ Can this be done in Hive? Is inline() the key to doing this? -- Mark E. Sunderlin Data Architect // AOL Platforms P: 703-265-6

Size of a Hive Map column in characters!

2013-12-06 Thread Sunderlin, Mark
The size(map) function is defined as follows: size(Map)Returns the number of elements in the map type What if I want the total size of the map for that row? This doesn't work: select length(MAP); How can I get the total size of a map column in either bytes or characters? --- M

Hive metatool JVM Heap error

2013-11-11 Thread Mark Schramm
Hive Metatool experts, Trying to use the hive "metatool" to update the locations for all of our tables after a configuration change did not do this automatically. The tool gets a heap error even though we upped the JVM to 4 GB. Does anyone know of a workaround? Thanks, Mark

Re: [VOTE] Apache Hive 0.12.0 Release Candidate 1

2013-10-14 Thread Mark Grover
n Mon, Oct 14, 2013 at 12:18 PM, Mark Grover wrote: > I wasn't able to start webhcat server. It seems to be related to file > permissions. > > FWIW, it's not a regression (the same problem existed in Hive 0.11). > Having said that, it makes webhcat pretty unusable out of the

Re: [VOTE] Apache Hive 0.12.0 Release Candidate 1

2013-10-14 Thread Mark Grover
I wasn't able to start webhcat server. It seems to be related to file permissions. FWIW, it's not a regression (the same problem existed in Hive 0.11). Having said that, it makes webhcat pretty unusable out of the box. I created HIVE-5534 to track this. On Sun, Oct 13, 2013 at 4:37 PM, Carl Stei

RE: Hive Query via Hue, Only column headers in downloaded CSV or XSL results, sometimes

2013-09-30 Thread Sunderlin, Mark
Hmm.. No replies on this one? Is no one use Hue? :-) That would be interesting to know .. if not Hue, how are others exposing Hive to "end users?" without given them a direct login to a node on the cluster? --- Mark E. Sunderlin Data Architect | AOL NETWORKS BDM P: 703-265-6935

Hive Query via Hue, Only column headers in downloaded CSV or XSL results, sometimes

2013-09-18 Thread Sunderlin, Mark
lines of output, so it cannot be a number of rows issue. Is there a limit to the width of the data you can download? A limit on the number of columns? Anyone seen this before? Does anyone know a fix or a work around? --- Mark E. Sunderlin Data Architect | AOL NETWORKS BDM P: 703-265-6935

Re: from_unix time problem

2013-09-17 Thread Mark Grover
Stephen is right. Here is some context: Looking at the UDF source code ( https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFFromUnixTime.java#L121), it's using SimpleDateFormat where the format string follows the convention at http://docs.oracle.com/javase/1.4.2

Re: Hive index error

2013-08-01 Thread Mark Wagner
Hi Omkar, What Serde does the table use? Given that you're able to index on leg_id it doesn't sound quite the same as the issue I hit. Is there any partitioning of the table? -Mark On Wed, Jul 31, 2013 at 10:39 PM, Omkar Joshi wrote: > I'm facing issues while building an

Re: Write access for the wiki

2013-07-30 Thread Mark Wagner
; > On Tue, Jul 30, 2013 at 1:40 PM, Mark Wagner wrote: > >> My id is mwagner. Thanks! >> >> >> On Tue, Jul 30, 2013 at 1:36 PM, Ashutosh Chauhan >> wrote: >> >>> Mark, >>> >>> Do you have an account on hive cwiki. Whats your id

Re: Write access for the wiki

2013-07-30 Thread Mark Wagner
My id is mwagner. Thanks! On Tue, Jul 30, 2013 at 1:36 PM, Ashutosh Chauhan wrote: > Mark, > > Do you have an account on hive cwiki. Whats your id ? > > Thanks, > Ashutosh > > > On Tue, Jul 30, 2013 at 1:06 PM, Mark Wagner wrote: > >> Hi all, >> >>

Write access for the wiki

2013-07-30 Thread Mark Wagner
Hi all, Would someone with the right permissions grant me write access to the Hive wiki? I'd like to update some information on the Avro Serde. Thanks, Mark

RE: Issues with Hive 'Like" and multiple '%' wildcard in a pattern

2013-06-13 Thread Sunderlin, Mark
?&icid=mainmoiz%dl' LIKE ('%?icid=main%') THEN 'T' ELSE 'F' END AS case1, CASE WHEN '?&icid=mainmoiz%dl' LIKE ('%?icid=main%dl%') THEN 'T' ELSE 'F' END

RE: Issues with Hive 'Like" and multiple '%' wildcard in a pattern

2013-06-12 Thread Sunderlin, Mark
We are using Hive 0.80. --- Mark E. Sunderlin Solutions Architect |AOL Networks BDM P: 703-265-6935 |C: 540-327-6222 | AIM: MESunderlin 22000 AOL Way | Dulles, VA | 20166 From: Stephen Sprague [mailto:sprag...@gmail.com] Sent: Wednesday, June 12, 2013 1:00 PM To: user@hive.apache.org Subject

Issues with Hive 'Like" and multiple '%' wildcard in a pattern

2013-06-12 Thread Sunderlin, Mark
wo or more '%' wild cards in a Hive like expression a known issue? --- Mark E. Sunderlin Solutions Architect |AOL NETWORKS BDM P: 703-265-6935 |C: 540-327-6222 | AIM: MESunderlin 22000 AOL Way, Dulles, VA 20166

Re: using Avro unions with HIVE

2013-05-23 Thread Mark Wagner
Avro SerDe transparently. This JIRA is tracking imporved union support for Hive, but it's not being actively worked on: https://issues.apache.org/jira/browse/HIVE-2390. Thanks, Mark On Thu, May 23, 2013 at 11:45 AM, Scott Carey wrote: > The Hive mailing list would have more info on the Avr

Re: Syntax for filters on timstamp data type

2013-04-11 Thread Mark Grover
Glad to hear, Steffan. Do you mind creating a JIRA for this at https://issues.apache.org/jira? Thanks in advance! Mark On Mon, Apr 8, 2013 at 6:38 AM, LUTTER, Steffen wrote: > Hi Mark, > > ** ** > > Correct, I just did some tests and the cast is the way to go. While f

Re: Bucketing external tables

2013-04-06 Thread Mark Grover
Glad to hear! On Fri, Apr 5, 2013 at 3:02 PM, Sadananda Hegde wrote: > Thanks, Mark. > > I found the problem. For some reason, Hive is not able to write Avro > output file when the schema has a complex field with NULL option. It read > without any problem; but cannot write with

Re: Syntax for filters on timstamp data type

2013-04-05 Thread Mark Grover
Steffan, One thing that may be different is that equal can cast operands to make equals work but that may not be true for IN. FWIW, this is me just speculating, I haven't looked at the code just yet. Perhaps, you could explicit casting to get around this? On Fri, Apr 5, 2013 at 7:36 AM, LUTTER, S

Re: Bucketing external tables

2013-04-03 Thread Mark Grover
Can you please check your Jobtracker logs? The is a generic error related to grabbing the Task Attempt Log URL, the real error is in JT logs. On Wed, Apr 3, 2013 at 7:17 PM, Sadananda Hegde wrote: > Hi Dean, > > I tried inserting a bucketed hive table from a non-bucketed table using > insert ove

Metastore question

2013-04-03 Thread Mark
Trying to make sure I understand this correctly. All databases and tables are stored in hive.metastore.warehouse.dir but the actual metadata for the database and tables (columns, types, partitions, etc) are stored in the hive database (ie.. mysql)? Is that correct?

Re: 回复: Round function display

2013-04-02 Thread Mark Grover
itters can grant you permissions to edit the wiki. Thanks again! Mark On Mon, Mar 25, 2013 at 9:36 AM, chenchun wrote: > Hi mark, > I read the the wiki of PhabricatorCodeReview > https://cwiki.apache.org/confluence/display/Hive/PhabricatorCodeReview and > send a review https://reviews.

Re: Noob question on creating tables

2013-03-29 Thread Mark
rectory name: > /path/to/hive -f /path/to/addlogpartition.hql -d YEAR=$YEAR -d MON=$MONTH -d > DAY=$DAY > > > (Of course, all the /path/to will be different...) > > So, be careful of how how "03" vs. "3" is handled in both the ALTER TABLE > statement

Re: Noob question on creating tables

2013-03-29 Thread Mark
code.google.com/p/snappy/ > > > Note that for example - gzip format is not splittable..so huge gzip files > without splits are not recommended as input to maps > > Hope this helps > > sanjay > > > On 3/29/13 10:19 AM, "Mark" wrote: > >> We have e

Re: Noob question on creating tables

2013-03-29 Thread Mark
Thanks Does this mean I need to create a partition for each day manually? There is no way to have infer that from my directory structure? On Mar 29, 2013, at 10:32 AM, Dean Wampler wrote: > > > On Fri, Mar 29, 2013 at 12:19 PM, Mark wrote: > We have existing log data in direct

Noob question on creating tables

2013-03-29 Thread Mark
We have existing log data in directories in the format of YEAR/MONTH/DAY. - How can we create a table over this table without hive modifying and/or moving it? - How can we tell Hive to partition this data so it knows about each day of logs? - Does hive out of the box work with reading compresse

Re: Using TABLESAMPLE on inner queries

2013-03-20 Thread Mark Grover
Hey Dean, I am not a power user of the sampling feature but my understanding was that sampling in Hive only works on bucketed tables. I am happy to be corrected though. Mark On Wed, Mar 20, 2013 at 12:20 PM, Dean Wampler < dean.wamp...@thinkbiganalytics.com> wrote: > Mark, > >

Re: Using TABLESAMPLE on inner queries

2013-03-20 Thread Mark Grover
Then you will be able to be sample through it. If the predicate is on partition column, you may be able to get around the intermediate table requirement but in general, as far as I know, intermediate bucketed table might be the only choice. Mark On Wed, Mar 20, 2013 at 9:56 AM, Robert Li wrote

Re: hive-0.10 and nullpointerexception

2013-03-18 Thread Mark Grover
For the sake of readers, I just saw another manifestation of this problem. The issue was that mapreduce.framework.name in mapred-site.xml was set to "yarn" instead of "classic" when the intent was to use MR1. Hope that helps, Mark On Mon, Mar 4, 2013 at 8:26 AM, Mark Grover

Re: 回复: Round function display

2013-03-17 Thread Mark Grover
Hi Chenchun, Can you please: 1. Send a separate email to this same mailing list to add you as a contributor to the JIRA? I don't have credentials to do so and seems like other people are not watching this thread:-) 2. Upload this patch to the JIRA? Thanks! Mark On Sat, Mar 16, 2013 at 4:

Re: 回复: Round function display

2013-03-14 Thread Mark Grover
Someone will have to add you to the project contributors on JIRA. Could you please share your JIRA account ID and email address? Mark On Thu, Mar 14, 2013 at 9:58 AM, chenchun wrote: > Cool, I'd like to steal it from you. I'd like to get a few days to solve it > but how I can

Re: Round function display

2013-03-14 Thread Mark Grover
ke to contribute a patch (along with updated tests), please feel free to steal it from me. Mark On Thu, Mar 14, 2013 at 12:44 AM, chenchun wrote: > Hi, > I have a question about HIVE-3675 NaN does not work correctly for round(n). > The issue did solve the NaN problem, but the hql below result

Re: Find current db we r using in Hive

2013-03-09 Thread Mark Grover
Created https://issues.apache.org/jira/browse/HIVE-4144 On Fri, Mar 8, 2013 at 5:25 AM, Dean Wampler wrote: > It's odd that there is no such command. The trick Ramki mentioned is the > only one I know of. Two points about it, though: > > 1. It only works on Hive v0.8+. > 2. I've seen a few cases

Re: hive issue with sub-directories

2013-03-09 Thread Mark Grover
Suresh, By default, the partition column name has to be appear in HDFS directory structure. e.g. /user/hive/warehouse//=/data1.txt /user/hive/warehouse//=/data2.txt On Thu, Mar 7, 2013 at 7:20 AM, Suresh Krishnappa wrote: > Hi All, > I have the following directory structure in hdfs > > /test/a/

Re: Hive insert into RCFILE issue with timestamp columns

2013-03-05 Thread Mark Grover
or.java:471) >> at >> org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:762) >> at >> org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:529) >> ... 9 more >> Caused by: java.lang.IllegalArgumentException: Timestamp

Re: Hive sample test

2013-03-05 Thread Mark Grover
I typically change my query to query from a limited version of the whole table. Change select really_expensive_select_clause from really_big_table where something=something group by something=something to select really_expensive_select_clause from ( select * from really_big_table limit 100 )t w

Re: show tables in bin does not display the tables

2013-03-05 Thread Mark Grover
derby metastore and use MySQL or PostgreSQL for metastore instead. Googling should give you some nice articles on how to do that. Mark On Tue, Mar 5, 2013 at 3:48 AM, Sai Sai wrote: > Hello > > I have noticed when i execute the following command from hive shell in diff > folders it beh

Re: Hive insert into RCFILE issue with timestamp columns

2013-03-04 Thread Mark Grover
Hi Dilip, Are you able to run this query successfully? select d_date_sk, d_date_id, d_date, d_month_seq, d_week_seq, d_quarter_seq, d_dow, d_moy, d_dom, d_qoy, d_fy_year, d_fy_quarter_seq, d_fy_week_seq, d_day_name, d_quarter_name, d_holiday, d_weekend, d_following_holiday, d_first_dom, d_last_dom

Re: [ANNOUNCE] Kevin Wilfong elected to Hive PMC

2013-03-04 Thread Mark Grover
Congrats, Kevin. Keep up the good work! On Mon, Mar 4, 2013 at 11:54 AM, Carl Steinbach wrote: > On behalf of the Apache Hive PMC I am pleased to welcome > Kevin Wilfong as a member of the Apache Hive PMC. > > Please join me in congratulating Kevin on his new role! > > Thanks. > > Carl

Re: hive-0.10 and nullpointerexception

2013-03-04 Thread Mark Grover
Sorry for the delayed response, Marcin, I was away. This error in the Gist means that the jar containing the class in question is not on your classpath. Exception in thread "Thread-39" java.lang.NoClassDefFoundError: org/apache/hadoop/mapreduce/util/HostUtil at org.apache.hadoop.hive.shim

Re: regexp_replace with unicode chars

2013-03-01 Thread Mark Grover
s original problem because it doesn't take ranges. I created HIVE-4100 for improving regex_replace UDF. Mark On Fri, Mar 1, 2013 at 9:31 AM, Dean Wampler wrote: > Anyone know if translate takes ranges, like some implementations? e.g., > > translate ('[a-z]', '[A-Z]

Re: Need tab separated output file and put limit on number of lines in a output file

2013-02-20 Thread Mark Grover
Chunky, There may be another way to do this but to get tab separated output, I usually create an external table that's tab separated and insert overwrite into that table. For limiting the number of records in the output, you can use the limit clause in your query. Mark On Tue, Feb 19, 20

Re: hive-0.10 and nullpointerexception

2013-02-19 Thread Mark Grover
that fixes it? Also, if you could provide me any information (your config, setup, which queries fail, etc.) that could help me in reproducing the problem on hive trunk would be greatly appreciated! Mark On Tue, Feb 19, 2013 at 1:08 AM, Marcin Cylke wrote: > > On 14/02/13 08:27, Ma

Re: hive-0.10 and nullpointerexception

2013-02-13 Thread Mark Grover
://issues.apache.org/jira/browse/HIVE-4003 which is assigned to me, I will try to get this in sooner than later. In the meanwhile, a workaround is to set yarn.resourcemanager.address property to some value other than "local" so Hive queries can run on your MR2 cluster. Mark On Wed, Feb 13, 20

Re: Union in Multi Insert

2013-02-11 Thread Mark Grover
mple with sample data and expected output, that might be helpful. Mark On Mon, Feb 11, 2013 at 7:34 PM, John Omernik wrote: > I am trying to do a union, group by, and multi insert all at once. I know > this convoluted but I what I am trying to do is avoid having to scan > through the origina

Re: How to load hive metadata from conf dir

2013-02-11 Thread Mark Grover
and see if that fits your use case? Mark On Mon, Feb 11, 2013 at 2:59 PM, Parag Sarda wrote: > Hello Hive Users, > > I am writing a program in java which is bundled as JAR and executed using > hadoop jar command. I would like to access hive metadata (read partitions > informations) i

Re: Help to solve UDAF errors!

2013-02-10 Thread Mark Grover
Hi Abhishek, The code looks incomplete. See the comment at https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/UDAF.java#L22 Those are all the methods your UDAF class needs to implement but you seem to be missing them. Mark On Sat, Feb 9, 2013 at 11:08 PM

Re: Help to solve UDAF errors!

2013-02-08 Thread Mark Grover
on github somewhere so it's easier to analyze? Mark On Fri, Feb 8, 2013 at 2:05 PM, Abhishek Bhattacharya wrote: > Hi, > > I have implemented a simple UDAF for top-n-percent as follows: > import java.util.ArrayList; > import java.util.Collections; > > import org.apach

Re: msck repair table not adding partitions which contains data.

2013-02-07 Thread Mark Grover
Suresh, Take a look at this: https://issues.apache.org/jira/browse/HIVE-3231 On Thu, Feb 7, 2013 at 11:46 AM, Krishnappa, Suresh < suresh.krishna...@rsa.com> wrote: > Hi All, > > I have created a partitioned HIVE external table as follows > > ** ** > > create external table test_part (key

Re: DROP TABLE does not respect the IF EXISTS clause

2013-01-31 Thread Mark Grover
Thanks for the update, Marc. Could you please create a Hive JIRA for this? Thanks! Harsh, it's most likely the case with your JIRA as well that the error message is misleading. Thanks for reporting it! Mark On Thu, Jan 31, 2013 at 4:58 PM, Marc Limotte wrote: > Thanks, Harsh. M

Re: DROP TABLE does not respect the IF EXISTS clause

2013-01-31 Thread Mark Grover
Marc, It seemed to work on my installation of Hive 0.9. Is this a table or a view you are trying to drop? Mind sharing the create table definition? Mark On Thu, Jan 31, 2013 at 10:42 AM, Marc Limotte wrote: > Hi, > > I'm testing with Hive 0.9.0 (in Amazon EC2, but NOT emr). I&#x

Re: ALTER TABLE CHANGE COLUMN issue

2013-01-30 Thread Mark Grover
, you'll have to look into the documentation to see if there is an alter table partition (partition spec)... command that will let you alter metadata about the partition. If I am not mistaken, alter table doesn't touch your existing columns, just modifies the partitions going forward. Ma

Re: Automating the partition creation process

2013-01-29 Thread Mark Grover
ns: 1. Download the patch, rebuild hive and use it 2. Find a hacky way to recover your partitions when they are empty and populate them later. Sorry for the inconvenience. Mark On Tue, Jan 29, 2013 at 5:09 PM, Sadananda Hegde wrote: > Thanks Mark, > > Recover partition feature will sa

Re: [VOTE] Amend Hive Bylaws + Add HCatalog Submodule

2013-01-29 Thread Mark Grover
Measure 1: +1 Measure 2: +1 On Tue, Jan 29, 2013 at 4:21 PM, Zhenxiao Luo wrote: > > Measure 1: +1 > Measure 2: +1 > > > Thanks, > Zhenxiao > >

Re: Automating the partition creation process

2013-01-28 Thread Mark Grover
Sadananda, See if this helps: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-Recoverpartitions On Mon, Jan 28, 2013 at 8:05 PM, Sadananda Hegde wrote: > Hello, > > My hive table is partitioned by year, month and day. I have defined it as > external table. Th

Re: Lateral View in sub query issue

2013-01-26 Thread Mark Grover
Hi John, Thanks for reporting this. Can you please take a look at the Lateral View issues here: https://issues.apache.org/jira/issues/?jql=project%20%3D%20HIVE%20AND%20text%20~%20%22lateral%20view%22 and create a new JIRA with the details if one previously doesn't exist? Thanks again, Mar

Re: updating RegexSerde on existing partitions

2013-01-25 Thread Mark Grover
Thanks for the update, Viral. One way I can think of is to drop the metadata for the entire table and recreate the table with the updated regex and recover partitions from HDFS. This, of course, works if your table is external. Mark On Fri, Jan 25, 2013 at 12:21 PM, Viral Bajaria wrote: > W

Re: updating RegexSerde on existing partitions

2013-01-25 Thread Mark Grover
Viral, To me, I think the only option seems to be here to alter all the existing partitions. I would be interested in knowing what you ended up finding/doing. Thanks, Mark On Sun, Jan 20, 2013 at 3:58 PM, Viral Bajaria wrote: > Hi, > > I have a hive table which has pre-defined schema

Re: HIVE: java.lang.ArrayIndexOutOfBoundsException: 2 during JOIN

2013-01-15 Thread Mark Grover
uot;data_table" has around 5k fields, all doubles. > As for the "age_mean" table, here it is: > > hive> desc age_mean; > OK > id string > name string > age_mean double > Time taken: 0.127 seconds > > Does this help? > > Thanks! > Fernando > &

Re: HIVE: java.lang.ArrayIndexOutOfBoundsException: 2 during JOIN

2013-01-15 Thread Mark Grover
Fernando, Could you share your table definitions as well please? On Tue, Jan 15, 2013 at 10:31 AM, Fernando Andrés Doglio Turissini < fernando.dog...@globant.com> wrote: > Hello everyone, I'm struggling with an exception I'm getting on a > particular query that's driving me crazy! > > Here is the

Re: Timestamp, Epoch Time, Functions and other Frustrations

2013-01-04 Thread Mark Grover
Brad is correct, there is a JIRA about this already: https://issues.apache.org/jira/browse/HIVE-3822 Sorry for the inconvenience. Mark On Fri, Jan 4, 2013 at 8:25 AM, Brad Cavanagh wrote: > Try multiplying your values by 1000, then running the conversions. I bet > they expect millis

Re: HIVE left semi join multiple tables with columns from multiple table

2012-12-29 Thread Mark Grover
Souvik, I am not sure I understand, you may want to play around with distinct keyword if you want to join on unique values. In any case, it looks like you've got it figured out already, so it's all good:-) Happy trails! Mark On Sat, Dec 29, 2012 at 2:19 PM, Souvik Banerjee wrote: >

Re: HIVE left semi join multiple tables with columns from multiple table

2012-12-28 Thread Mark Grover
); Mark On Thu, Dec 27, 2012 at 9:26 PM, Souvik Banerjee wrote: > Thanks a lot Mark for your attention. > But I think I cannot go for INNER join, the reason behind the fact being > that I want all rows of Table3 irrespective of there is any row > corresponding to that ID in Table

Re: HIVE left semi join multiple tables with columns from multiple table

2012-12-27 Thread Mark Grover
rted merge join or sorted merge bucketed joins depending on your use case. Mark On Thu, Dec 27, 2012 at 3:16 PM, Souvik Banerjee wrote: > Hi, > > I am struggling with a problem described below. > Any help how to resolve this problem is highly appreciated. > > I have got few tables

Re: hive add jar question

2012-12-21 Thread Mark Grover
About #2, doesn't sound normal to me. Did you figure that out or still running into it? Mark On Thu, Dec 20, 2012 at 5:01 PM, java8964 java8964 wrote: > Hi, I have 2 questions related to the hive behavior when using 'add jar'. > > I am testing the implementing of my o

Re: hive jobs pending so long

2012-12-20 Thread Mark Grover
reboot? Mark On Wed, Dec 19, 2012 at 11:21 PM, Neil Guo wrote: > another log, > > cd $HADOOP_LOG_DIR && tail -n 0 -f *.log |tee log_merge.log > > > ==> hadoop-neil-datanode-Ct-bj108.log <== > > ==> hadoop-neil-jobtracker-Ct-bj108.log <== > > ==>

Re: Hive producing difference outputs

2012-12-20 Thread Mark Grover
Hi Cheng, It would really help if you could simplify your example so it's easy to reproduce by one of us. Also, can you share the value of hive.optimize.ppd property by doing hive> set hive.optimize.ppd; Does inverting the property (making it false, if it's true) change anything? On Thu, Nov 15,

Re: xpath UDF in hive support namespace?

2012-12-19 Thread Mark Grover
like make additions to those UDFs, you are welcome to do so by creating a JIRA and posting a patch. UDFs are an easy and excellent way to contribute back to the Hive community. Thanks! Mark On Wed, Dec 19, 2012 at 8:52 AM, java8964 java8964 wrote: > Hi, I have a question related to the X

Re: HiveHistory and HiveHistoryViewer

2012-12-17 Thread Mark Grover
And clients may be on mac, which takes '\r' as line delimiter > > So I think '\r' should also be replaced with space in HiveHistory.log, so > that HiveHistory.parseHiveHistory could be consistent with HiveHistory.log > and allow clients from mac > > Thanks!

Re: HiveHistory and HiveHistoryViewer

2012-12-17 Thread Mark Grover
will take another look. Thanks! Mark On Mon, Dec 17, 2012 at 5:48 AM, Qiang Wang wrote: > anybody has an idea about this ? > > https://issues.apache.org/jira/browse/HIVE-3810 > > > > 2012/12/16 Qiang Wang >> >> glad to receive your reply! >> >> her

Re: reg : reteriveing a single column from hive serde table

2012-12-17 Thread Mark Grover
;s code to see what's going on. Mark On Mon, Dec 17, 2012 at 2:19 AM, Mohit Chaudhary01 wrote: > > I am using serde in hive to store data into hive table from xml file. > > Whenever I retrieve data using command select * from table it give all > records from table. > &g

Re: Possible to set map/reduce log level in configuration file?

2012-12-17 Thread Mark Grover
Krishna, I usually put it in my home directory and that works. Did you try that? HIVE-2911, adds another location to where it can be picked up from. If your present version supports .hiverc (which is most likely the case), home directory should work as well. Mark On Mon, Dec 17, 2012 at 5:44 AM

Re: Database limits for Hive

2012-12-17 Thread Mark Grover
sure. I am not sure of any such restriction for #1. For #3 your best bet is to look at the ODBC driver code. Mark On Mon, Dec 17, 2012 at 11:48 AM, Omer, Farah wrote: > Hi, > > Does anyone know of the SQL limits for Hive? Particularly the following: > > > 1. The maximum n

Re: Unable to issue count(*) query versus movielens data

2012-12-16 Thread Mark Grover
Sekine, You may want to go to the jobtracker web interface and look at the failed task logs for more info. Mark On Sun, Dec 16, 2012 at 1:36 PM, Sékine Coulibaly wrote: > Hi there, > > I loaded data from the movielens database into hive, into a u_data table. I > wish I could cou

Re: Running commands at hive cli or hive thirft startup

2012-12-14 Thread Mark Grover
No, .hiverc only works for CLI. UDFs are tricky. The only way I can think of is to add them to the function registry (https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java) and recompile Hive. On Mon, Dec 10, 2012 at 8:01 AM, John Omernik wrot

Re: Creating external table poiting to s3 folder with files not loading data

2012-12-14 Thread Mark Grover
Fernando, It is more likely related to your SerDe and the underlying data not matching up to it and than being related to the table being external on S3. Mark On Tue, Dec 11, 2012 at 6:05 AM, Fernando Andrés Doglio Turissini wrote: > Long subject, I know.. let me explain a bit more about

Re: Partition by directory

2012-12-14 Thread Mark Grover
which the last I saw was 0.8*), it would be interesting how Shark's Hive 0.9 is going to play around with EMR's version of Hive. FWIW, commands like "ALTER TABLE RECOVER PARTITIONS" are only available in EMR Hive. Keep us posted! Mark On Mon, Dec 10, 2012 at 1:46 PM, Erik Th

Re: help on failed MR jobs (big hive files)

2012-12-12 Thread Mark Grover
date?), also make sure you don't suffer from the small file problem: http://www.cloudera.com/blog/2009/02/the-small-files-problem/ Good luck! Mark On Wed, Dec 12, 2012 at 11:36 PM, Nitin Pawar wrote: > 6GB size is nothing. We have done it with few TB of data in hive. > Error you are seeing is o

Re: map side join with group by

2012-12-12 Thread Mark Grover
This is just my understanding, the full proof answer would lie in checking out the explain plans and the Semantic Analyzer code. And for completeness, there is a conditional task (starting Hive 0.7) that will convert your joins automatically to map joins where applicable. This can be enabled by enab

Re: Hive double-precision question

2012-12-07 Thread Mark Grover
is to this discussion but I will have to look further to comment more. In the meanwhile take a look at the UDF code and see if your personal Java code on Linux is equivalent to the Hive UDF code. Keep us posted! Mark On Fri, Dec 7, 2012 at 1:27 PM, Periya.Data wrote: > Hi Hive Users, >

Re: Determining DateDiff By Minutes in Hive

2012-12-05 Thread Mark Grover
lost (or gained) minutes due to day light savings time. If that's important, it will be best to convert precise timestamps to UTC and then subtract them. Mark On Wed, Dec 5, 2012 at 12:33 PM, Babe Ruth wrote: > Hello All, > > I would like to find the total amount of minutes between t

Re: handling null argument in custom udf

2012-12-04 Thread Mark Grover
ive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFTranslate.java#L172 Good luck! Mark On Tue, Dec 4, 2012 at 6:58 AM, Søren wrote: > Thanks. Did you mean I should handle null in my udf or my serde? > > I did try to check for null inside the code in my udf, bu

Re: cross join

2012-12-03 Thread Mark Grover
arounds go, you could try having each table in a sub-query and have an extra virtual column ("1 as one") and join on that virtual column. Mark On Mon, Dec 3, 2012 at 9:26 AM, Periya.Data wrote: > Hi Hive users, > I have Hive CDH - 0.7.1. I want to know if I can do cross-j

Re: Hive 0.7 use the old mapred API

2012-12-03 Thread Mark Grover
Yong, In my opinion, #2 wouldn't be that hard to do. Of course, upgrading to the new API (albeit by updating Hive) would the right thing to do long term. Mark On Mon, Dec 3, 2012 at 7:42 AM, java8964 java8964 wrote: > Hi, > > Our company current is using CDH3 release, which c

Re: Skew join failure

2012-12-03 Thread Mark Grover
Hey David, Sure thing. Play around with that property's value, see if that makes any difference. Also, if you could search to see if a file with a name like *hive_skew_join_ **bigkeys* exists on HDFS. Perhaps, it's looking at a different path. If so, we can figure out how to fix that.

Re: Skew join failure

2012-11-30 Thread Mark Grover
Hi David, It seems like Hive is unable to find the skewed keys on HDFS. Did you set *hive.skewjoin.key property? If so, to what value?* Mark On Fri, Nov 30, 2012 at 2:10 AM, David Morel wrote: > Hi, > > I am trying to solve the "last reducer hangs because of GC because of > t

Re: Storing data in TSV with changing headers

2012-11-30 Thread Mark Grover
So, to summarize, a single table may work, if the following 3 conditions are met. 1. New columns get added to the very end of the existing columns 2. No existing columns are deleted 3. You are ok with NULL showing up as value for old records that don't have a given new column. I tested the above wi

Re: Hive Loading Zip CSV Files

2012-11-29 Thread Mark Grover
Adding user@hive.apache.org Ben, That's great to hear. It would be awesome if you'd like to contribute this back to Hive so others in the community could use it too. Let us know what you think! Mark On Wed, Nov 28, 2012 at 8:05 PM, ben wrote: > Mark, > > Just wanted to let

  1   2   3   4   >