Re: Latest hive source code

2011-10-28 Thread Adriaan Tijsseling
http://mahout.apache.org/ Also, this is the Hive mailinglist. On 2011/10/28, at 12:56, vikas srivastava wrote: > Hey , > > > > I m new to MAHOUT can you guys give me some idea about MAHOUT or any pdf on > that !!! > > > > With Regards > Vikas Srivastava > > DWH & Analytics Team > M:

Re: When trying to create table Iam getting exception

2011-10-19 Thread Adriaan Tijsseling
me > FAILED: Execution Error, return code 1 from > org.apache.hadoop.hive.ql.exec.DDLTask > > -Original Message- From: Adriaan Tijsseling > Sent: Wednesday, October 19, 2011 3:01 PM > To: user@hive.apache.org > Subject: Re: When trying to create table Iam getting exception

Re: When trying to create table Iam getting exception

2011-10-19 Thread Adriaan Tijsseling
You should provide more information in order to get proper support such as the exact command you used to create the table. On 2011/10/19, at 11:16, kiranprasad wrote: > Hi > > Iam new to HIVE, when trying to create a table getting below exception. > FAILED: Error in metadata: java.lang.Illega

Re: How to load lines into Hive while breaking them by words?

2011-09-27 Thread Adriaan Tijsseling
Use a regexserde to split the text in words. There's documentation on the hive wiki. But it might be better to use a script. See the post by Shouguo Li earlier on this mailing list. After all, when you use a Python script, for example, you could use the Natural Language Processing Toolkit to ge

Re: Skip first line of CSV loading

2011-09-27 Thread Adriaan Tijsseling
You could try to remove the first line of a file before passing it on to hive. Something like `sed -i '1d' filename`. Adriaan On 2011/09/27, at 01:58, Bradford Stephens wrote: > Any thoughts on this? > > On Wed, Apr 13, 2011 at 1:55 PM, Daniel Jue wrote: >> Is there a way to have hive skip t

\x01 considered harmful

2011-09-22 Thread Adriaan Tijsseling
I've been having problems getting data properly into something like CREATE TABLE raw(a string, b string, c string) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001' STORED AS TEXTFILE; using a script for the INSERT OVERWRITE call. Only column a data would be filled. The problems whent away when I

Re: Query using dynamic partitions

2011-09-18 Thread Adriaan Tijsseling
e STRING, bar MAP STRING>) > CLUSTER BY date > > > Sumanth > > > > On Sun, Sep 18, 2011 at 12:28 PM, Adriaan Tijsseling > wrote: > >> I looked at your solution, but the problem is still that the "data" column >> needs to be processed still. What I

Re: Query using dynamic partitions

2011-09-18 Thread Adriaan Tijsseling
om raw > insert overwrite table raw_2 partition (partition1, partition2) > select data, partition1, partition2; > > This will dynamically create the 2 partitions based on the values of > partition1 and partition2 and insert the values of 'data' in the appropriate > p

Re: Query using dynamic partitions

2011-09-18 Thread Adriaan Tijsseling
sert the values of 'data' in the appropriate > partition. > > Regards, > Sumanth > > > > On Sat, Sep 17, 2011 at 2:18 PM, Adriaan Tijsseling > wrote: > >> Hi, >> >> I have a table created with >> >> CREATE TABLE raw(part

Query using dynamic partitions

2011-09-17 Thread Adriaan Tijsseling
Hi, I have a table created with CREATE TABLE raw(partition1 string, partition2 string, data string) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001' STORED AS TEXTFILE; I want to further process "data" and put it in a partition (partition1, partition2) defined by the values in the relevant row

Re: Managing Hive logging detail (per Hive statement) in /tmp/${user.name}/

2011-09-12 Thread Adriaan Tijsseling
Should be here: ./conf/hive-exec-log4j.properties ./conf/hive-log4j.properties Cheers, Adriaan On 2011/09/12, at 21:32, Jasper Knulst wrote: > Hi, > > Does anyone know how I can reduce the level of logging of the Hive History > file which are created per Hive statement executed in the (default

Re: CREATE INDEX

2011-09-12 Thread Adriaan Tijsseling
Hive/LanguageManual+DDL#LanguageManualDDL-Create%2FDropIndexhas > DDL commands using which you can create and manipulate indices in > Hive. > > Hope it helps, > Ashutosh > > On Mon, Sep 12, 2011 at 05:46, Adriaan Tijsseling > wrote: > >> Does anyone know the proper

Re: Loading data into hive table from stdin

2011-09-12 Thread Adriaan Tijsseling
ath. > > On Mon, Sep 12, 2011 at 7:24 PM, Adriaan Tijsseling > wrote: > >> Sorry, I wasn't clear. What I meant was if the data in path $path can >> itself be a script that feeds the data in streaming fashion? Something like >> "load data using script 'l

Re: Loading data into hive table from stdin

2011-09-12 Thread Adriaan Tijsseling
ot;load data local in path '$path' into table $table > partition(date='$date')" > > On Mon, Sep 12, 2011 at 7:04 PM, Adriaan Tijsseling > wrote: > >> Do you have the syntax for the proper hive QL command? >> >> Thanks! >> >> On 2

Re: Loading data into hive table from stdin

2011-09-12 Thread Adriaan Tijsseling
Do you have the syntax for the proper hive QL command? Thanks! On 2011/09/12, at 15:23, Vikas Srivastava wrote: > ya you can simple load the data by a simple script without using any api or > anything!! > > On Mon, Sep 12, 2011 at 6:29 PM, Adriaan Tijsseling > wrote: > &g

Loading data into hive table from stdin

2011-09-12 Thread Adriaan Tijsseling
Is it possible to load data into a hive table that is fed in using a unix pipe or streamed via a script? I've only seen streaming options when using SELECT TRANSFORM. Thanks in advance, Adriaan

CREATE INDEX

2011-09-12 Thread Adriaan Tijsseling
Does anyone know the proper documentation for using the "CREATE INDEX" command in hive? The hive homepage doesn't have the information. Cheers, Adriaan

Re: Accessing individual columns from a Hive table which is row delimited by RegexSerde

2011-09-09 Thread Adriaan Tijsseling
I figured it out. A jar is missing. ./src/build/contrib/hive-contrib-0.7.1.jar If you don't see this jar, you need to go into src and run `ant` to build the extra jars. Then, you can do: DROP TABLE serde_regex; add jar ./src/build/contrib/hive-contrib-0.7.1.jar; create the table, load the data

Re: Accessing individual columns from a Hive table which is row delimited by RegexSerde

2011-09-09 Thread Adriaan Tijsseling
I can replicate it using this example. A SELECT * FROM serde_regex; works, but SELECT host FROM serde_regex; doesn't: hive> SELECT host FROM serde_regex; Total MapReduce jobs = 1 Launching Job 1 out of 1 Number of reduce tasks is set to 0 since there's no reduce operator Starting Job = job_201109

Loading XML data

2011-09-07 Thread Adriaan Tijsseling
Hi, I'm very new to Hadoop+Hive. I've worked through the Hadoop in Action book and understand the basic principles. However, I've had a hard time finding information on loading xml data into a hive table. CSV and TSV are straightforward, but for XML it seems you either need to write some custom