svn move and INFRA-3036

2010-10-07 Thread Edward Capriolo
All, Part of the move to TLP will require us moving our SVN. https://issues.apache.org/jira/browse/INFRA-3036 Infra is going to tackle item #2 soon. After creates the new svn, we need to do the svn mv's into it. Users will have to run for their workspaces: 'svn switch https://svn.apache.org/rep

hive svn is up and hive.apache.org is up

2010-10-08 Thread Edward Capriolo
The subversion site is up: http://svn.apache.org/repos/asf/hive/ (All with hive commit should have write access) We can try to move the SVN this way. 'svn mv http://svn.apache.org/repos/asf/hadoop/hive/trunk http://svn.apache.org/repos/asf/hive/trunk' Also hive.apache.org is ready. just put fi

Re: hive svn is up and hive.apache.org is up

2010-10-14 Thread Edward Capriolo
On Wed, Oct 13, 2010 at 9:27 PM, Namit Jain wrote: > Let us punt it for now > > From: John Sichi [jsi...@facebook.com] > Sent: Wednesday, October 13, 2010 6:08 PM > To: > Cc: > Subject: Re: hive svn is up and hive.apache.org is up > > A related question i

Re: hive svn is up and hive.apache.org is up

2010-10-14 Thread Edward Capriolo
On Thu, Oct 14, 2010 at 4:41 PM, John Sichi wrote: > > On Oct 14, 2010, at 1:14 PM, Edward Capriolo wrote: > http://hive.apache.org is up with content. Building forest requires > java 1.5. There are two ways we can do this from now on. > > 1) Move the forest source (only) into

Re: [VOTE] hive 0.6.0 release candidate 0

2010-10-21 Thread Edward Capriolo
On Wed, Oct 20, 2010 at 6:38 PM, John Sichi wrote: > The tarballs are at > > http://people.apache.org/~jvs/hive-0.6.0-candidate-0 > > Carl did some sanity testing on it already, but any additional testing you > can do before voting helps to ensure a quality release. > > JVS > > I am checking it

Re: [VOTE] hive 0.6.0 release candidate 0

2010-10-25 Thread Edward Capriolo
figurations where JDO is told >> not to automatically update the schema.  This is recommended for production >> environments. >> >> For this particular release, taking a downtime while running the scripts is >> a good idea due to the nature of the changes (e.g. alteri

Re: svn move and INFRA-3036

2010-10-26 Thread Edward Capriolo
On Tue, Oct 26, 2010 at 3:10 PM, John Sichi wrote: > I'm starting on the svn move in a little bit.  Committers, please hold off on > further commits until you see an update on this. > > JVS > > On Oct 7, 2010, at 10:45 AM, Edward Capriolo wrote: > >> All, &g

Re: meeting minutes for 25-Oct-2010 contributor meeting

2010-10-29 Thread Edward Capriolo
On Fri, Oct 29, 2010 at 3:42 PM, John Sichi wrote: > http://wiki.apache.org/hadoop/Hive/Development/ContributorsMeetings/HiveContributorsMinutes101025 > > JVS > > Carl Steinbach proposed making 0.7.0 a time-based release (rather than a feature-based release), and that we should start on it soon s

Re: [ANNOUNCE] New Committer - Carl Steinbach

2010-11-02 Thread Edward Capriolo
On Tue, Nov 2, 2010 at 2:23 PM, Ashish Thusoo wrote: > Hi Folks, > > The Hive PMC has passed the vote to make Carl Steinbach a new committer on > the Apache Hive project. Carl has made a lot of contributions to Hive with > the latest being him serving as the release manager for 0.6.0 release. >

Re: [ANNOUNCE] New Hive Committer - Amareshwari Sriramadasu

2010-11-09 Thread Edward Capriolo
Welcome! On Tue, Nov 9, 2010 at 3:28 AM, Yongqiang He wrote: > Congrats amareshwari! > > Yongqiang > > On Nov 8, 2010, at 6:00 PM, Namit Jain wrote: > >> Hi Folks, >> >> The Hive PMC has passed the vote to make Amareshwari Sriramadasu a >> new committer on the Apache Hive project. >> >> Followin

Re: jmx metrics for metastore server

2010-11-09 Thread Edward Capriolo
On Tue, Nov 9, 2010 at 5:38 PM, Sushanth Sowmyan wrote: > Hi all, > > We were looking at monitoring requirements from within Howl, which > essentially translates to monitoring requirements for the Metastore > server and would like to add in volume and latency jmx counters to > metastore server cal

Review Request: Hive Variables

2010-11-29 Thread Edward Capriolo
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/124/ --- Review request for hive. Summary --- This patch adds hive variables Diffs

Re: Review Request: HIVE-1096 Hive Variables

2010-11-29 Thread Edward Capriolo
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/124/ --- (Updated 2010-11-29 10:27:46.947077) Review request for hive. Summary (updated)

Review Request: added 5 udfs

2010-12-02 Thread Edward Capriolo
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/143/ --- Review request for hive. Summary --- Some UDFS. Diffs - /trunk/comm

RoadMap vs Jira

2010-12-11 Thread Edward Capriolo
In jira we have ~ 600 scheduled issues. These issues range from unconfirmed bugs, general wish list items, very complex additions such as new syntax or new expanding the scope of hive. Almost everything is marked as a MAJOR - BUG, when many things are minor wishes. I believe we should encourage pe

Re: Review Request: HIVE-1262

2010-12-24 Thread Edward Capriolo
> On 2010-12-23 16:04:03, John Sichi wrote: > > http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFAesDecrypt.java, > > line 37 > > > > > > Shouldn't this be deterministic=

Re: Regarding Hive History File(s).

2011-01-04 Thread Edward Capriolo
On Tue, Jan 4, 2011 at 7:03 AM, Mohit wrote: > Hello All, > > > > What is the purpose of maintaining hive history files which contain session > information like session start, query start, query end, task start, task end > etc.? Are they being used later (say by a tool) for some purpose? > > > > I

Re: Regarding Hive History File(s).

2011-01-05 Thread Edward Capriolo
, >> which is intended only for the person or entity whose address is listed >> above. Any use of the information contained herein in any way (including, >> but not limited to, total or partial disclosure, reproduction, or >> dissemination) by persons other than the intended rec

Re: Regarding Hive History File(s).

2011-01-06 Thread Edward Capriolo
** > *** >> >> This e-mail and attachments contain confidential information from HUAWEI, >> which is intended only for the person or entity whose address is listed >> above. Any use of the information contained herein in any way (including,

Re: patch review process

2011-01-20 Thread Edward Capriolo
On Thu, Jan 20, 2011 at 5:56 PM, Namit Jain wrote: > I agree on the problem of comments. > For a new reviewer, it makes the process more painful. > > But, I think, the issues are not major, and we should > decide one approach or another. > > Even for option 2., we can make it mandatory for the con

Re: Scheduling the Hive 0.7.0 Release

2011-01-24 Thread Edward Capriolo
On Mon, Jan 24, 2011 at 4:11 PM, Carl Steinbach wrote: > Hi, > > There's been a lot of talk recently about Hive 0.7.0, so in the > interest of getting the process moving I'd like to propose that we > create the release branch this Friday (1/28), and aim to > have the release out the door by the fo

Re: Scheduling the Hive 0.7.0 Release

2011-01-24 Thread Edward Capriolo
On Mon, Jan 24, 2011 at 5:03 PM, Carl Steinbach wrote: > Hi Ed, > > This seems aggressive. We have 4 open blockers and 68 other issues >> slated for 0.7.0. >> > > Three of the four open blockers involve writing metastore upgrade scripts. > I'm willing to assume responsibility for this and can get

Re: Run CliDriver in Eclipse

2011-01-24 Thread Edward Capriolo
On Mon, Jan 24, 2011 at 4:19 PM, Pei HE wrote: > Hi, > I am trying to debug Hive in Eclipse. > > My problem is Hive doesn't work with Hadoop, when the command line is > invoked in Eclipse. > > The version of Hadoop is 0.19.2. > > There is no problem when I run Hive by ./build/dist/bin/hive. But, >

Re: hadoop core 0.20.2 not found

2011-02-07 Thread Edward Capriolo
You do not need to compile against a specific minor (it is a bug in the build process) Try: ant -Dhadoop.version=0.20.0 package The resulting binary will be compatible with 0.20.2 Edward On Mon, Feb 7, 2011 at 3:27 AM, abhinav narain wrote: > hi, >  I have mailed my problem on dev-list and hav

Re: hadoop core 0.20.2 not found

2011-02-07 Thread Edward Capriolo
On Mon, Feb 7, 2011 at 10:23 AM, abhinav narain wrote: > I get the following error about hbase, then ? > > this is the code recently checked out from svn > >    Host repository.apache.org not found. url= > https://repository.apache.org/content/repositories/snapshots/org/apache/hbase/hbase/0.89.0-S

Re: hadoop core 0.20.2 not found

2011-02-08 Thread Edward Capriolo
On Tue, Feb 8, 2011 at 5:40 AM, abhinav narain wrote: > On Mon, Feb 7, 2011 at 11:29 PM, Carl Steinbach wrote: > >> Hi Abhinav, >> >> I am using a proxy. >> > I am using cntlm for authentication. >> > I have added localhost:1234 in ANT_OPTS ... as above before compiling >> > >> > >> It looks like

Re: [VOTE] officially stop supporting hadoop 0.20.x in hive 0.14 ?

2014-10-07 Thread Edward Capriolo
+1 On Tue, Oct 7, 2014 at 7:51 PM, Vikram Dixit wrote: > +1 > > On Tue, Oct 7, 2014 at 4:44 PM, Bill Busch > wrote: > > +1 > > > > Follow me on @BigData73 > > > > -Original Message- > > From: Gunther Hagleitner [mailto:ghagleit...@hortonworks.com] > > Sent: Tuesday, October 07, 2014 7

Update pmc page

2023-10-18 Thread Edward Capriolo
Don't know if you guys are trolling me or not but I dont currently nor ever worked at hortonwoks lol -- Sorry this was sent from mobile. Will do less grammar and spell check than usual.

Re: Update pmc page

2023-10-18 Thread Edward Capriolo
https://hive.apache.org/community/people/ On Wednesday, October 18, 2023, Edward Capriolo wrote: > Don't know if you guys are trolling me or not but I dont currently nor > ever worked at hortonwoks lol > > -- > Sorry this was sent from mobile. Will do less grammar and spel

Declare myself emeritus

2019-03-19 Thread Edward Capriolo
Hello, I would like to declare myself emeritus. Thank you, Edward Capriolo -- Sorry this was sent from mobile. Will do less grammar and spell check than usual.

Re: Any plan for new hive 3 or 4 release?

2021-02-26 Thread Edward Capriolo
Hive was releasable trunk for the longest time. Facebook days. Then the big data vendors got more involved. Then it became a pissing match about features. This vendor likes tez this vendor dont, this vendor likes hive on spark this one dont. Then this vendor wants to tell everyone hive stinks use

Re: Time to Remove Hive-on-Spark

2021-02-26 Thread Edward Capriolo
I do not know how it works for most of the world. But in cloudera where the TEZ options were never popular hive-on-spark represents a solid way to get things done for small datasets lower latency. As for the spark adoption. You know a while ago I came up with some ways to make hive more spark lik

Re: Any plan for new hive 3 or 4 release?

2021-02-27 Thread Edward Capriolo
t; It will be amazing if the community could produce a release every > quarter/6months. :-) > > Le ven. 26 févr. 2021 à 14:30, Edward Capriolo a > écrit : > >> Hive was releasable trunk for the longest time. Facebook days. Then the >> big data vendors got more involved. Then it

Re: hive-exec vs. hive-exec:core

2021-09-21 Thread Edward Capriolo
recommendation from the Hive team is to use the hive-exec.jar artifact. You know about 10 years ago. I mentioned that oozie should just use hive-service or hive jdbc. After a big fight where folks kept bringing up concurrency bugs in hive-server-1 my prs were rejected (even though hive server2 wou

Dealing with monstrous hive startup overhead

2014-07-10 Thread Edward Capriolo
So Everyone is running around saying "hive is slow" "x is faster". I think hive's biggest issue is that the mr2 entire process to acquire containers and then launch a job in them is super overkill. I see it result in 40 seconds startup time for what amounts to a 2 second job. In the old hadoop 0.20

Re: Created branch 1.0

2015-01-22 Thread Edward Capriolo
If we do a 1.0.0 release there is no problem with us later releasing 14.1 or a 14.2. I think everyone would understand that the 14.2 being released after 1.0.0 would likely have back ported features. Releasing a 15.0 after 1.0 would not make as much sense as we probably do not want two active branc

Re: Creating a branch for hbase metastore work

2015-01-23 Thread Edward Capriolo
A while back the datastax people did this for brisk + cassandra. We should probably make this pluggable so it works with nosql beyond hbase. On Fri, Jan 23, 2015 at 1:48 AM, Nick Dimiduk wrote: > +1 > > On Thursday, January 22, 2015, Brock Noland wrote: > > > +1 > > > > On Thu, Jan 22, 2015 at

Re: [DISCUSS] deprecation policy

2018-02-13 Thread Edward Capriolo
I do not think we should remove: https://issues.apache.org/jira/browse/HIVE-18692 One of the challenges of HQL is not having a procedure language. This forces users into oozie/airflow/spark even to orchestrate simple flows. Lets give that one some time to marinate, but I agree technically it coul

Re: Created branch 1.0

2015-02-09 Thread Edward Capriolo
Question. https://issues.apache.org/jira/browse/HIVE-8614 Did we not just agree in this thread that hive will no long have dependency that are SNAPSHOT? On Fri, Jan 23, 2015 at 1:06 AM, Brock Noland wrote: > Hi Alan, > > I agree with Xuefu and what was suggested in your statement. I was > thin

Re: Created branch 1.0

2015-02-09 Thread Edward Capriolo
Because we can not really have a stable api if by definition we build around snapshot dependencies. On Mon, Feb 9, 2015 at 11:38 AM, Edward Capriolo wrote: > Question. > > https://issues.apache.org/jira/browse/HIVE-8614 > > Did we not just agree in this thread that hive wi

Re: Is it allowed to include GPLv2 software to Apache projects

2015-02-21 Thread Edward Capriolo
Recently I found out about the http://www.gnu.org/licenses/gcc-exception.html which is interesting, and governs some projects/components. Like intel's high performance collection library. On Fri, Feb 20, 2015 at 9:16 PM, Alan Gates wrote: > No. > > Alan. > > Alexander Pivovarov > February 20

Re: ORC separate project

2015-04-02 Thread Edward Capriolo
To reiterate, one thing I want to avoid is having hive rely on code that sits in several tiny silos across Apache projects, or Apache Licensed but not ASF projects. Hive is a mature TLP with a large number of committers and it would not be a good situation if often work gets bottle necked because c

Re: Branch for HIVE-10511: Replacing the implementation of Hive CLI using Beeline

2015-04-30 Thread Edward Capriolo
For what it is worth, even with all the enhancements made to hive-server 2 it is just not as reliable as the CLI. I am actually using cloudera 5.3 and we have multiple hive-server2 instances behind a load balancer. The load we are doing is not particularly heavy and the mean time to failure is < 3

Re: Branch for HIVE-10511: Replacing the implementation of Hive CLI using Beeline

2015-05-01 Thread Edward Capriolo
using this instability. Like i said, I get around it by running a load balancer etc, but I do not want to be 100% reliant on it. On Thu, Apr 30, 2015 at 10:39 AM, Edward Capriolo wrote: > For what it is worth, even with all the enhancements made to hive-server 2 > it is just not as rel

Re: [DISCUSS] Supporting Hadoop-1 and experimental features

2015-05-18 Thread Edward Capriolo
Up until recently Hive supported numerous versions of Hadoop code base with a simple shim layer. I would rather we stick to the shim layer. I think this was easily the best part about hive was that a single release worked well regardless of your hadoop version. It was also a key element to hive's s

Re: [DISCUSS] Supporting Hadoop-1 and experimental features

2015-05-18 Thread Edward Capriolo
API was (mapred, new mapreduce), no one know what to link off of cdh?, vanilla?, yahoo distribution?. IMHO. This is just going to increase fragmentation. On Mon, May 18, 2015 at 1:04 PM, Edward Capriolo wrote: > Up until recently Hive supported numerous versions of Hadoop code base > w

[DISCUSS] Spark's fork of hive

2017-03-02 Thread Edward Capriolo
All, I have compiled a short (non exhaustive) list of items related to Spark's forking of Apache Hive code and usage of Apache Hive trademarks. 1) The original spark proposal repeatedly claims that Spark "inter operates" with hive. https://wiki.apache.org/incubator/S

Re: [DISCUSS] Spark's fork of hive

2017-03-02 Thread Edward Capriolo
gt; > Reading about Hive at Facebook, I feel like we've already solved those > problems that were due to FB Corona + Hadoop-1 (or, 0.20 *shudder*) > limitations. > > Spark does not need be limited by Corona and the version of Hive being > compared might not have YARN or Tez

[DISCUSS] Looking to the future hivemall graduation

2017-03-02 Thread Edward Capriolo
Hivemall in the incubator has a fairly impressive set of features that do machine learning directly from hive. http://hivemall.incubator.apache.org/overview.html https://github.com/myui/hivemall/wiki/Logistic-regression-dataset-generation While we can not put the cart before the horse, i can imag

Re: [DISCUSS] Looking to the future hivemall graduation

2017-03-03 Thread Edward Capriolo
early to discuss but we welcome your suggestion. > > Thanks, > Makoto > > 2017-03-03 15:15 GMT+09:00 Edward Capriolo : > > Hivemall in the incubator has a fairly impressive set of features that do > > machine learning directly from hive. > > > > http://hivemal

Re: [DISCUSS] Spark's fork of hive

2017-03-03 Thread Edward Capriolo
nd ask that they not do this. > > As for them dissing on us in benchmarks, we all know you can set up Hive > to run like mule (use MR on text files) and people do it all the time to > make their stuff look good. I'm not sure what to do about that other than > publish our own benc

Re: Backward incompatible changes

2017-03-03 Thread Edward Capriolo
Im not a fan of this our feature branching was supposed to protect us from broken trumk syndrome. I am a believer in releaseable trunk. Make it work and keep it working. On Friday, March 3, 2017, Sergey Shelukhin wrote: > Can we at least release 2.2 first? There’s massive amount of unreleased >

Re: Backward incompatible changes

2017-03-04 Thread Edward Capriolo
On Fri, Mar 3, 2017 at 8:46 PM, Thejas Nair wrote: > +1 > There are some features that are incomplete and what I would not recommend > for any real production use.The 'legacy authorization mode' is a great > example of that - > https://cwiki.apache.org/confluence/display/Hive/Hive+ > Default+Auth

Re: Backward incompatible changes

2017-03-04 Thread Edward Capriolo
Also i dont follow how we remove On Saturday, March 4, 2017, Edward Capriolo wrote: > > > On Fri, Mar 3, 2017 at 8:46 PM, Thejas Nair > wrote: > >> +1 >> There are some features that are incomplete and what I would not recommend >> for any real production use.T

Re: ops on #hive IRC channel

2017-04-03 Thread Edward Capriolo
I think we should take this opportunity to move to apache-hive. I dont think anyone every used this channel for anything. There is many things on the internet named hive and naturally it will get spam-ish. We could also renew an effort to actually use IRC. On Fri, Mar 31, 2017 at 8:59 PM, Sergey S

Apache Hive metastore and Impala

2017-04-05 Thread Edward Capriolo
Hello impala devs! Let me say that I have used impala a lot and am very impressed with it. I know impala is moving into the Apache incubator (I have an incubator prodling gossip so I know this is challenging). There are few things I want to bring to your attention/discuss, so that they do not bec

Re: ops on #hive IRC channel

2017-04-06 Thread Edward Capriolo
On Mon, Apr 3, 2017 at 7:40 PM, Edward Capriolo wrote: > I think we should take this opportunity to move to apache-hive. I dont > think anyone every used this channel for anything. There is many things on > the internet named hive and naturally it will get spam-ish. We could also &

Re: [DISCUSS] Separating out the metastore as its own TLP

2017-07-02 Thread Edward Capriolo
I am not sure I am on the fence with this. On Sat, Jul 1, 2017 at 6:38 AM, Chaoyu Tang wrote: > +1. A nice move which might help HMS to be more easily adopted by more > other components. > > On Sat, Jul 1, 2017 at 12:41 AM, Sushanth Sowmyan > wrote: > > > +1 > > > > On Jun 30, 2017 17:05, "Owen

Re: [DISCUSS] Separating out the metastore as its own TLP

2017-07-02 Thread Edward Capriolo
On Fri, Jun 30, 2017 at 2:49 PM, Julian Hyde wrote: > +1 > > As a Calcite PMC member, I am very pleased to see this change. Calcite > reads metadata from a variety of sources (including JDBC databases, NoSQL > databases such as Cassandra and Druid, and streaming systems), and if more > of those s

Re: [DISCUSS] Separating out the metastore as its own TLP

2017-07-02 Thread Edward Capriolo
ean "you" or "anyone" as a statement to a particular person in this chain. I meant that: Forming a TLP should not directly increase the commiter/pmc list to anyone not currently in the Hive pmc/committer list. On Sun, Jul 2, 2017 at 6:50 PM, Edward Capriolo wrote: > >

Re: [DISCUSS] Separating out the metastore as its own TLP

2017-07-03 Thread Edward Capriolo
On Sun, Jul 2, 2017 at 10:15 PM, Alan Gates wrote: > Comments inlined. > > On Sun, Jul 2, 2017 at 3:22 PM, Edward Capriolo > wrote: > > > I am not sure I am on the fence with this. > > > > I am -1, and I offer this -1 with the hope of being convinced otherwise

Re: [DISCUSS] Separating out the metastore as its own TLP

2017-07-05 Thread Edward Capriolo
On Wed, Jul 5, 2017 at 1:51 PM, Alan Gates wrote: > On Mon, Jul 3, 2017 at 6:20 AM, Edward Capriolo > wrote: > > > > > We already have things in the meta-store not directly tied to language > > features. For example hive metastore has a "retention" propert

Re: [DISCUSS] Supporting Hadoop-1 and experimental features

2015-05-24 Thread Edward Capriolo
becomes a > burden, and it’s outdated with 2 alternatives, one of which has been > around for 2 releases. > The branches are a graceful way to get rid of the legacy burden. > > Alternatively, when sweeping changes are made, we can do what Hbase did > (which is not pretty imho), wher

Re: Analyzing Bitcoin blockchain data with Hive

2016-05-01 Thread Edward Capriolo
Good stuff! On Fri, Apr 29, 2016 at 1:30 PM, Jörn Franke wrote: > Dear all, > > I prepared a small Serde to analyze Bitcoin blockchain data with Hive: > > https://snippetessay.wordpress.com/2016/04/28/hive-bitcoin-analytics-on-blockchain-data-with-sql/ > > There are some example queries, but I w

Re: [VOTE] Apache Hive 0.8.0 Release Candidate 3

2011-12-10 Thread Edward Capriolo
I have hive-0.8.0-candidate-3with hadoop in local mode against hadoop 0.20.2. Looks like hive server is not launching. [edward@tablitha hive-0.8.0-bin]$ bin/hive --service hiveserver Starting Hive Thrift Server Exception in thread "main" java.lang.NoSuchMethodError: org.apache.thrift.server.TThrea

Re: [VOTE] Apache Hive 0.8.0 Release Candidate 3

2011-12-10 Thread Edward Capriolo
:107: java.lang.ClassNotFoundException: org.apache.tools.ant.taskdefs.optional.TraXLiaison Do I need a newer ant or more optional packages? [edward@tablitha branch-0.8-r2]$ ant -version Apache Ant version 1.7.1 compiled on April 16 2010 On Sat, Dec 10, 2011 at 12:45 PM, Edward Capriolo wrote

Re: [VOTE] Apache Hive 0.8.0 Release Candidate 3

2011-12-10 Thread Edward Capriolo
tables, group bys etc. That all looks good. On Sat, Dec 10, 2011 at 1:11 PM, Edward Capriolo wrote: > Also I know I have been out of the game for a minute but.. > extract-functions: > [mkdir] Created dir: /home/edward/branch-0.8-r2/build/builtins/metadata >

Re: [VOTE] Apache Hive 0.8.0 Release Candidate 3

2011-12-11 Thread Edward Capriolo
, John Sichi wrote: > You need ant 1.8.x > > JVS > > On Dec 10, 2011, at 10:11 AM, Edward Capriolo wrote: > > > Also I know I have been out of the game for a minute but.. > > extract-functions: > >[mkdir] Created dir: > /home/edward/branch-0.8-r

Re: [VOTE] Apache Hive 0.8.0 Release Candidate 3

2011-12-11 Thread Edward Capriolo
ds > > [edward@tablitha branch-0.8-r2]$ which ant > > ~/apache-ant-1.8.2/bin/ant > > > > > > On Sat, Dec 10, 2011 at 3:12 PM, John Sichi wrote: > > > >> You need ant 1.8.x > >> > >> JVS > >> > >> On Dec 10, 2011, at 10

Re: [VOTE] Apache Hive 0.8.0 Release Candidate 4

2011-12-13 Thread Edward Capriolo
+1 Good job all. On Tue, Dec 13, 2011 at 3:28 PM, Carl Steinbach wrote: > I'm +1 on rc4. > On Dec 13, 2011 11:58 AM, "John Sichi" wrote: > > > +1 from me on the release candidate. > > > > JVS > > > > >>> On Mon, Dec 12, 2011 at 4:55 PM, Carl Steinbach > > >> wrote: > > Apache Hive 0.8.0 R

Re: [VOTE] Apache Hive 0.8.1 Release Candidate 1

2012-01-27 Thread Edward Capriolo
+1. Got the binary, extracted ran jobs against 0.20.2 (vanilla) in local mode all was well. Edward On Fri, Jan 27, 2012 at 2:18 PM, Carl Steinbach wrote: > +1 (binding) > > On Thu, Jan 26, 2012 at 5:27 PM, Roman Shaposhnik wrote: > > > > Apache Hive 0.8.1 Release Candidate 1 is available here:

Re: HCatalog-Hive StorageHandler compatibility

2012-03-02 Thread Edward Capriolo
Of the handlers that exist Cassandra outside of tree Hypertable outside of tree Mongo github outside of tree >From the way I look at it is if you change the handler interface your patch should fix the only in hive tree hbase handler. As long as the tests keep passing I d not see it as an issue. Th

Re: Hive 0.8.1 build errors using IBM JAVA

2012-03-05 Thread Edward Capriolo
I am sure you are going to love hearing this since you are from IBM, but there have been a few outstanding issues with Hive and the IBM jvm. I just searched to find one as an example. https://issues.apache.org/jira/browse/HIVE-1686 I am not sure how many users in the wild are running Hive with th

Re: Hive projects for Google Summer of code 2012 ?

2012-03-07 Thread Edward Capriolo
For the record we are 0-2 in gsoc. On Wed, Mar 7, 2012 at 1:42 AM, Namit Jain wrote: > Done > > > On 3/6/12 10:32 PM, "bharath vissapragada" > wrote: > >>Hi Namit, >> >>Is it possible to add Hive-1362 [1] to the list? I am interested in >>working >>on that and I've contacted Ashutosh regarding t

Automatic/parallel patch testing

2012-03-07 Thread Edward Capriolo
I am trying to get myself more involved in the patch review and committing process, but running ant tests takes multiple hours. Two ideas: https://issues.apache.org/jira/browse/HIVE-1175 https://cwiki.apache.org/Hive/unit-test-parallel-execution.html Can we get a farm of test servers to get tes

Re: Hive

2012-03-14 Thread Edward Capriolo
That could happen (if hive lets you create a database with . in the name, which I am not sure it does) but you should avoid that by specifying the location of the database or table/database names that do not collide. I so not see it as much of an issue. Write and enforce a db/table naming policy an

Re: HWI Interface

2012-03-27 Thread Edward Capriolo
Awesome. Read the hive wiki about adding contributing. Open a jira issue and mark me as a watcher and I will take a look at your patch. On Tue, Mar 27, 2012 at 10:43 AM, Hugo wrote: > Heya all, > > Completely new to this list, but i've been working with hive for some time > now. > > In the compa

Re: Is HiveMetastoreClient a public interface?

2012-03-29 Thread Edward Capriolo
Ideally you want to do everything through the hiveQL language. Calling the read only methods are safe and would be easier/better then Calling 'show tables' and attempting to parse. But I would consider the rest of the interface for "experts" only. Also this interface can change slightly between ver

Looking at the columns table

2012-04-11 Thread Edward Capriolo
Hey all. Our metastore in mysql is fairly large over 12GB. All the storage here is the columns table. It seems that each column is stored for each partition/storage descriptor as a one-many relationship. In our case all the partitions have the same column definition. My thinking. Should the relati

Re: Problems with Arc/Phabricator

2012-04-11 Thread Edward Capriolo
If we are going to switch from fabricator we just might as well go back to not using anything. Review board was really clunky and confusing. Edward On Tue, Apr 10, 2012 at 9:27 PM, Kevin Wilfong wrote: > +John Sichi as he did a lot of the work on getting Hive to use Phabricator. > > Is there any

Re: Problems with Arc/Phabricator

2012-04-11 Thread Edward Capriolo
I have arc working on linux. Fedora core 12 I think. On Wed, Apr 11, 2012 at 8:45 PM, Ashutosh Chauhan wrote: > Is mac only supported OS ? Arc doesn't work for me on linux, which is > unfortunate since thats where I do all my testing. > > Thanks, > Ashutosh > On Wed, Apr 11, 2012 at 17:37, John S

Re: Problems with Arc/Phabricator

2012-04-11 Thread Edward Capriolo
most of the time it doesnt work. >> >> Ashutosh >> >> On Wed, Apr 11, 2012 at 11:57, Owen O'Malley wrote: >> >> > On Wed, Apr 11, 2012 at 11:48 AM, Edward Capriolo > > >> > wrote: >> > > If we are going to switch from fab

Re: Question about Transform and M/R scripts

2011-02-16 Thread Edward Capriolo
On Wed, Feb 16, 2011 at 5:07 PM, Vijay wrote: > Hi, > > I'm trying this use case: do a simple select from an existing table > and pass the results through a reduce script to do some analysis. The > table has web logs so the select uses a pseudo user ID as the key and > the rest of the data as valu

Trying to make sense of build/ql/tmp/hive.log

2011-02-18 Thread Edward Capriolo
It seems that each time the test argument is run hive makes tens hundreds of... 011-02-18 07:46:50,243 WARN zookeeper.ClientCnxn (ClientCnxn.java:run(1120)) - Session 0x12e39732268 for server null, unexpected error, closing socket con nection and attempting reconnect java.net.ConnectException:

Re: how to load hive table schema programatically?

2011-03-14 Thread Edward Capriolo
On Mon, Mar 14, 2011 at 2:44 PM, Jae Lee wrote: > Ah... thanks alot... that worked :) > > Is there any other recommended way to load hive table meta data? I suppose > accessing meta-store via HiveMetaStoreClient make it possible to change > underlying data storage implementation choice. > > J > >

Re: how to load hive table schema programatically?

2011-03-14 Thread Edward Capriolo
On Mon, Mar 14, 2011 at 3:01 PM, Carl Steinbach wrote: > Hi Ed, > I'm pretty sure HiveMetaStoreClient is intended to be a public API. > > On Mon, Mar 14, 2011 at 11:49 AM, Edward Capriolo > wrote: >> >> On Mon, Mar 14, 2011 at 2:44 PM, Jae Lee wrote: >&

Re: skew join optimization

2011-03-20 Thread Edward Capriolo
On Sun, Mar 20, 2011 at 10:30 AM, Ted Yu wrote: > Can someone re-attach the missing figures for that wiki ? > > Thanks > > On Sun, Mar 20, 2011 at 7:15 AM, bharath vissapragada > wrote: >> >> Hi Igor, >> >> See http://wiki.apache.org/hadoop/Hive/JoinOptimization and see the >> jira 1642 which aut

Re: skew join optimization

2011-03-20 Thread Edward Capriolo
On Sun, Mar 20, 2011 at 11:20 AM, Ted Yu wrote: > How about link to http://imageshack.us/ or TinyPic ? > > Thanks > > On Sun, Mar 20, 2011 at 7:56 AM, Edward Capriolo > wrote: >> >> On Sun, Mar 20, 2011 at 10:30 AM, Ted Yu wrote: >> > Can someone re-a

Re: [VOTE] Hive 0.7.0 Release Candidate 1

2011-03-24 Thread Edward Capriolo
Release looks great many great new features +1. Good work all. On Sun, Mar 20, 2011 at 5:32 AM, Carl Steinbach wrote: > Hive 0.7.0 Release Candidate 1 is available here: > > http://people.apache.org/~cws/hive-0.7.0-candidate-1 > > We need 3 +1 votes from Hive PMC members in order to release. Plea

Jira issues

2011-03-30 Thread Edward Capriolo
The number of open tickets keeps growing. Searching out keywords in open issues is at this point very hard to do and real productivity killer. I believe we should be much more pro-active about catching issues as they open and routing them properly. My comments on: HIVE-2085 Document GenericUD(A|T

Re: Jira issues

2011-03-30 Thread Edward Capriolo
On Wed, Mar 30, 2011 at 11:59 AM, Lars Francke wrote: >> HIVE-2085 Document GenericUD(A|T)F. >> >> This is the second ticket I have seen calling for "more >> documentation". All tickets like these should be closed instantly >> unless >> >> 1) User wants to write an xdoc or java doc and get it comm

Re: [ANNOUNCE] New Hive Committer - Siying Dong

2011-04-14 Thread Edward Capriolo
Congrats Siying. Great work so far. On Thu, Apr 14, 2011 at 2:30 AM, Namit Jain wrote: > Hi Folks, > > The Hive PMC has passed the vote to make Siying Dong a > new committer on the Apache Hive project. > > Following is a list of the contributions that Siying has made to the project: > > http://bi

Re: [REMINDER] Hive Contrib Meeting Today + Preliminary Agenda

2011-04-25 Thread Edward Capriolo
On Mon, Apr 25, 2011 at 2:38 PM, Carl Steinbach wrote: > Hi, > > The April Hive Contributors meeting is convening later today at Facebook. > See the Meetup page for more details: > http://www.meetup.com/Hive-Contributors-Group/ > > Here's the preliminary agenda for the meeting: > > 1) Should we do

Re: purging old releases

2011-06-08 Thread Edward Capriolo
I am ok we should keep one pre 20 version around for a while. On Tuesday, June 7, 2011, John Sichi wrote: > Apache Infra has asked us to delete from our dist area releases which are no > longer under active development: > > http://www.apache.org/dist/hive/ > > They suggested deleting 0.6; I'll g

Re: resend: Is a collocated join (a-la-netezza) theoretically possible in hive?

2011-08-07 Thread Edward Capriolo
On Sun, Aug 7, 2011 at 12:49 PM, Ido Hadanny wrote: > When you join tables which are distributed on the same key and used these > key columns in the join condition, then each SPU (machine) in netezza works > 100% independent of the other (see > nz-interview< > http://www.folkstalk.com/2011/06/net

Re: Problems with Arc/Phabricator

2012-04-19 Thread Edward Capriolo
tosh/work/hive/arcanist/src/workflow/diff/ArcanistDiffWorkflow.php(341): > ConduitClient->callMethodSynchronous('differential.cr...', Array) > #4 /Users/ashutosh/work/hive/arcanist/scripts/arcanist.php(266): > ArcanistDiffWo in > /Users/ashutosh/work/hive/libphutil/src/conduit/

Re: hive 0.9.0 RC1

2012-04-19 Thread Edward Capriolo
Am I missing something about? https://issues.apache.org/jira/browse/HIVE-2777 I see no way to access this feature from the hive QL language. Should we delay releases on features not usable? Edward On Thu, Apr 19, 2012 at 1:28 PM, Ashutosh Chauhan wrote: > RC0 failed to pass because of variety

Re: [VOTE] Apache Hive 0.9.0 Release Candidate 2

2012-04-25 Thread Edward Capriolo
+1 great work all. On Wed, Apr 25, 2012 at 7:30 PM, John Sichi wrote: > +1 > >> -- Forwarded message -- >> From: Carl Steinbach >> Date: Wed, Apr 25, 2012 at 1:00 PM >> Subject: Re: [VOTE] Apache Hive 0.9.0 Release Candidate 2 >> To: dev@hive.apache.org >> >> >> +1 (binding) >> >

Hive locking retries

2012-04-26 Thread Edward Capriolo
All, I use hive locking and I commonly get this error. HiveServerException(message:Query returned non-zero code: 10, cause: FAILED: Error in acquiring locks: locks on the underlying objects cannot be acquired. retry after some time, errorCode:10, SQLState:42000) The default settings for hive are

Re: Getting Started with Hive

2012-05-01 Thread Edward Capriolo
I think there is some bug with hive-builtins. It does not get generated until package, but you can not test without it. so you need to do ant package ant test On Mon, Apr 30, 2012 at 10:14 PM, Radhika Malik wrote: > A group of us is trying to write some code on Hive as a project for a class. >

  1   2   3   4   5   6   7   8   9   10   >