Re: Hive in EC2

2011-08-31 Thread MIS
But my concern is that I cannot run Elastic MapReduce on specific instances which we already own and which have Elastic IPs. If it is possible to do so, then using Hive on EMR should be fine. Thanks, MIS On Wed, Aug 31, 2011 at 12:21 AM, Aggarwal, Vaibhav wrote: > You could also choose

Re: Hive in EC2

2011-09-03 Thread MIS
Thanks for your reply. That cleared up a few concerns. Thanks, MIS. On Thu, Sep 1, 2011 at 2:05 AM, Steven Wong wrote: > When you launch an EMR cluster (or "job flow" in EMR terminology), it > launches new EC2 instances, optionally with an Elastic IP assigned to the > cluster

Re: Review Request: HIVE-2026. Parallelize UpdateInputAccessTimeHook

2011-03-03 Thread MIS
Hi Ning, Just to be clear on what I was suggesting, I have created a patch only for this file. Please have a look. Thanks, MIS. On Thu, Mar 3, 2011 at 5:50 PM, M IS wrote: >This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/460/ > >

Work around for using OR in Joins

2011-03-22 Thread MIS
n the join expression. Thanks, MIS.

Re: Work around for using OR in Joins

2011-03-22 Thread MIS
Found it at *org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.java* line no. 1122. There is a concern mentioned there that supporting OR would lead to data explosion. Is this discussed or documented in a little more detail somewhere? If so, some pointers to it would be helpful. Thanks, MIS

Re: Work around for using OR in Joins

2011-03-23 Thread MIS
ERE clause }, but the results produced differ from what is expected. Since I'm not using an OUTER join, I expected the desired results. Any idea why the OR expression in the filter is not working as desired? Any thoughts on this are welcome. Thanks, MIS. On Wed, Mar 23, 2

Re: Work around for using OR in Joins

2011-03-23 Thread MIS
e2.column8)='2011-01-06') or (table1.coulmn6 in ('val3') and to_date(table2.column9)='2011-01-06'))) where group by table1.column10, table1.column11;* Thanks, MIS. On Wed, Mar 23, 2011 at 12:54 PM, Ning Zhang wrote: > What is your OR conditions? Are they inv

An issue with Hive on hadoop cluster

2011-05-23 Thread MIS
I'm running into an issue when trying to run Hive over the Hadoop cluster. The Hadoop cluster works fine on its own. I'm using Hadoop 0.20.2 and Hive 0.7.0. The problem is that Hive is not picking up the fs.default.name property that I am setting in the core-site.

Hive in EC2

2011-08-29 Thread MIS
Hi, Can somebody point me to a production-level setup of Hive in EC2? The intent is to learn the setup best practices being employed. Thanks.

count of distinct FROM multiple columns

2012-06-22 Thread MIS
Hi All, I have a table in Hive as below: dummy { col1 STRING, col2 INT, col3 INT } And in it there is some sample data as follows (col1 col2 col3): ABC 4 5; XYZ 1 2; ABC 1 3; ABC 5 1; XYZ 3 1. What should my query be to get the result below: *ABC 4 XYZ

[jira] [Commented] (HIVE-1451) Creating a table stores the full address of namenode in the metadata. This leads to problems when the namenode address changes.

2011-08-31 Thread MIS (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13094706#comment-13094706 ] MIS commented on HIVE-1451: --- +1 for the issue. This is one of those features which many as

[jira] Commented: (HIVE-1883) Periodic cleanup of Hive History log files.

2011-02-17 Thread MIS (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12995934#comment-12995934 ] MIS commented on HIVE-1883: --- Carl is right on this. There is no need to have a 'schedul

[jira] Commented: (HIVE-1959) Potential memory leak when same connection used for long time. TaskInfo and QueryInfo objects are getting accumulated on executing more queries on the same connection.

2011-03-01 Thread MIS (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13000970#comment-13000970 ] MIS commented on HIVE-1959: --- How about using WeakHashMap in place of using HashMap instea
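
The comment above is cut off, so the maps it refers to are not known here. As a rough illustration of the general idea only, the sketch below keys a `WeakHashMap` by a handle object so that an entry becomes eligible for removal once nothing else holds a strong reference to that handle. `WeakRegistrySketch`, its `QueryInfo` stand-in, and the `handle` key are hypothetical names for this example and are not Hive's actual classes.

```java
import java.util.Map;
import java.util.WeakHashMap;

// Hypothetical sketch; not taken from the Hive code base.
class WeakRegistrySketch {
    // Stand-in for per-query bookkeeping. It must not hold a strong
    // reference to the key, or the entry could never be collected.
    static final class QueryInfo {
        final String sql;
        QueryInfo(String sql) { this.sql = sql; }
    }

    // Keys are held weakly: once a handle is unreachable elsewhere,
    // its entry may be dropped by the map after a garbage collection.
    private final Map<Object, QueryInfo> infoByHandle = new WeakHashMap<>();

    void register(Object handle, String sql) {
        infoByHandle.put(handle, new QueryInfo(sql));
    }

    String lookup(Object handle) {
        QueryInfo info = infoByHandle.get(handle);
        return info == null ? null : info.sql;
    }

    int size() {
        return infoByHandle.size();
    }

    public static void main(String[] args) {
        WeakRegistrySketch registry = new WeakRegistrySketch();
        Object handle = new Object();
        registry.register(handle, "SELECT 1");
        System.out.println(registry.lookup(handle));

        handle = null;  // drop the only strong reference to the key
        System.gc();    // only a hint; collection timing is not guaranteed
        System.out.println("entries after GC hint: " + registry.size());
    }
}
```

Whether this fits depends on the key: weak keys only help when the key's lifetime matches the lifetime of the data that should be released, and `WeakHashMap` is not thread-safe on its own.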

[jira] Commented: (HIVE-2051) getInputSummary() to call FileSystem.getContentSummary() in parallel

2011-03-17 Thread MIS (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008328#comment-13008328 ] MIS commented on HIVE-2051: --- Yes it is necessary for the executor to be terminated if the
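
The comment above is truncated, but the point about terminating the executor can be illustrated generically. The sketch below runs per-path lookups on a fixed thread pool and shuts the pool down in a `finally` block, so its worker threads do not outlive the call even if a lookup fails. `ParallelSummarySketch`, `sizeOf`, and `totalSize` are hypothetical names; `sizeOf` is a placeholder for a real file-system call and simply returns the length of the path string.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch; not the patch attached to the issue.
class ParallelSummarySketch {
    // Placeholder for a per-path size lookup (assumption for illustration).
    static long sizeOf(String path) {
        return path.length();
    }

    // Sums sizeOf(path) over all paths using a pool of the given size.
    static long totalSize(List<String> paths, int threads) {
        ExecutorService executor = Executors.newFixedThreadPool(threads);
        try {
            List<Callable<Long>> tasks = new ArrayList<>();
            for (String path : paths) {
                tasks.add(() -> sizeOf(path));
            }
            long total = 0;
            // invokeAll blocks until every task has completed.
            for (Future<Long> result : executor.invokeAll(tasks)) {
                total += result.get();
            }
            return total;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while summing sizes", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("size lookup failed", e.getCause());
        } finally {
            // Without this, the pool's non-daemon threads would keep running.
            executor.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println(totalSize(Arrays.asList("/a", "/bb", "/ccc"), 2));
    }
}
```

Creating and shutting down a pool per call keeps the example simple; a long-lived shared pool is another option, with shutdown then tied to the owner's lifecycle.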

[jira] Commented: (HIVE-2051) getInputSummary() to call FileSystem.getContentSummary() in parallel

2011-03-17 Thread MIS (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008331#comment-13008331 ] MIS commented on HIVE-2051: --- The solution to this issue resembles that of HIVE-2026, so we

[jira] [Commented] (HIVE-2181) Clean up the scratch.dir (tmp/hive-root) while restarting Hive server.

2011-08-11 Thread MIS (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13083306#comment-13083306 ] MIS commented on HIVE-2181: --- -1 for the issue. What if I'm running multiple hive s