RE: How to install CDH3 on CentOS5.5 without internet

2012-03-28 Thread binhnt22
Thanks Bejoy, That's exactly what I need. Best regards Nguyen Thanh Binh (Mr) Cell phone: (+84)98.226.0622 From: Bejoy Ks [mailto:bejoy...@yahoo.com] Sent: Wednesday, March 28, 2012 5:41 PM To: user@hive.apache.org Subject: Re: How to install CDH3 on CentOS5.5 without internet Hi

RE: Hive server concurrency question

2012-03-28 Thread Steven Wong
There are jiras on Hive Server concurrency-related issues, some open and some closed, including: https://issues.apache.org/jira/browse/HIVE-80 https://issues.apache.org/jira/browse/HIVE-1019 https://issues.apache.org/jira/browse/HIVE-1884 https://issues.apache.org/jira/browse/HIVE-2022 https://is

RE: Help in aggregating comma separated values

2012-03-28 Thread Saurabh S
I downloaded the data to a file and checked. Like you said, there was a space after the '3'. Removing that space gives the exact, expected result. In fact, I do have these artifacts in the 'real' data and will have to treat them first. Thanks a lot, Matt! I've been trying to crack this problem

RE: Help in aggregating comma separated values

2012-03-28 Thread Tucker, Matt
Are you sure that the values in the ID column are the same? If it's a string datatype, do you have any spaces after the '3' but before the tab? Matt Tucker From: Saurabh S [mailto:saurab...@live.com] Sent: Wednesday, March 28, 2012 2:45 PM To: user@hive.apache.org Subject: RE: Help in aggregatin

RE: Help in aggregating comma separated values

2012-03-28 Thread Saurabh S
Thanks for the reply, Matt. This is exactly what I'm looking for. I'll look into the explode function more deeply. Another question: For that example table, your query generates a slightly different different result. The values for id 3 are not aggregated. So the result is as follows. Any idea

Re: Percentage of rows in a Hive Table

2012-03-28 Thread Abhishek Pratap Singh
I don't know how it much can help. Select * from TABLE_DATA order by ROW_NAME DESC limit COUNT. Here calculation of count as top 5% is bit tricky. I don't think so this calculation can even be done in single query. Regards, Abhishek On Wed, Mar 28, 2012 at 9:09 AM, James Newhaven wrote: > Thank

Re: Hive server concurrency question

2012-03-28 Thread sumadhur
Hi phil,   It works ok, but it is a little flakey. Could you please let me know the kind of problems you faced? A few examples would help.   I would want the ability to connect to the server from multiple clients and execute queries parallely. So, wanted to be aware of the pitfalls.   Thanks, S

RE: Help in aggregating comma separated values

2012-03-28 Thread Tucker, Matt
Hi, The values column needs to be split across multiple rows, so you can then aggregate them. Try this: SELECT id, value, COUNT(*) cnt FROM table1 LATERAL VIEW explode(split(values, ",")) values_tbl as value GROUP BY id, value Matt Tucker -Original Message- From: Saurabh S [mailto:s

Re: Hive server concurrency question

2012-03-28 Thread Philip Tromans
I've used Hive in a multiple connections per server instance setup. It works ok, but it is a little flakey. I have some snapshot of trunk > 0.8.0 deployed. When I have some time, I'd like to help increase the test coverage for multithreaded clients. Phil. On 28 March 2012 19:19, Abhishek Pratap S

Help in aggregating comma separated values

2012-03-28 Thread Saurabh S
Hello, How do I get count from a list of comma separated values? For the lack of better wording, here is an example: Suppose there is a table with two columns, id (integers) and values (string) in the following fashion. id   values 1    10,20,30 2    20,30 2    20 3    20,30,40 3    40,20,30

Re: Hive server concurrency question

2012-03-28 Thread Abhishek Pratap Singh
AFAIK hive does not support concurrency however Multiple connection can be made to HiveServer. Hive or i should say Hadoop is write once and read multiple architecture. Hive does not have a build in support for DELETE OR UPDATE. The inherent architecture of these kind of solution works best when yo

Re: Hive server concurrency question

2012-03-28 Thread sumadhur
Resending due to lack of response. From: sumadhur To: "user@hive.apache.org" Sent: Tuesday, March 27, 2012 2:14 AM Subject: Hive server concurrency question Hi, Does Hive server support multiple concurrent client connections? The following page says it

Re: Percentage of rows in a Hive Table

2012-03-28 Thread James Newhaven
Thanks for the suggestion. I don't think sampling helps here, as I need to get the top 5% of rows ordered by a particular column (not a random sampling) On Wed, Mar 28, 2012 at 5:03 PM, Gabi D wrote: > James, > See if sampling >

Re: Percentage of rows in a Hive Table

2012-03-28 Thread Gabi D
James, See if sampling is what you need On Wed, Mar 28, 2012 at 5:53 PM, James Newhaven wrote: > I am trying to write a query that will return the first 5% of rows in a > table. > > I've struggled with this for quite a wh

Percentage of rows in a Hive Table

2012-03-28 Thread James Newhaven
I am trying to write a query that will return the first 5% of rows in a table. I've struggled with this for quite a while and can't figure out a command that works in Hive. Has anyone done this? Thanks, James

Re: How to install CDH3 on CentOS5.5 without internet

2012-03-28 Thread Bejoy Ks
Hi Binh        If you don't have internet access on your cluster, you may need to create a local yum repository. Detailed instructions can be found here at https://ccp.cloudera.com/display/CDHDOC/Creating+a+Local+Yum+Repository  Regards Bejoy KS From: binhnt22

How to install CDH3 on CentOS5.5 without internet

2012-03-28 Thread binhnt22
Hi, My servers are in restricted environment without internet. But I can connect to internet from my PC and copy file to server. How can I install CDH3 on CentOS5.5 server in this situation. Best regards, -- Ng

Re: error in running simple Hive Query

2012-03-28 Thread Bejoy Ks
Hi Khoya       The error is relevant from the stack trace itself. Setting the hive class path correctly would resolved the issue. I'd recommend you to upgrade the CDH you are using, alteast upgrade to CDH3u3 which has hive 0.7. Regards Bejoy KS From: Kirk True