Re: "PermGen space" error

2016-02-08 Thread Stephen Bly
You can play around with these settings: http://stackoverflow.com/questions/8356416/xxmaxpermsize-with-or-without-xxpermsize We ran into the same problem at my last company. It turned out to be caused by the way Hive Server was loading jars; IIRC it was because Hive was loading the same class defini

Re: Create table from ORC or Parquet file?

2015-12-08 Thread Stephen Bly
I am working on a similar problem — creating a Hive table from Parquet data and using the embedded schema to determine the columns. You will have to create your own SerDe and InputFormat I believe (that’s what I’m doing).
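For comparison, a simpler route than the custom SerDe and InputFormat described above is to generate the CREATE TABLE statement once the Parquet schema is known. The following is only a minimal sketch under stated assumptions: the class `ParquetDdlSketch`, its methods, the table name, and the location are all hypothetical, the type mapping is deliberately simplified (BINARY is assumed to hold UTF-8 strings, INT96 is assumed to be a timestamp), and reading the schema out of the Parquet footer is not shown because it would need a Parquet library.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

public class ParquetDdlSketch {

    // Simplified, assumed mapping from Parquet primitive type names to Hive column types.
    static String hiveType(String parquetType) {
        switch (parquetType) {
            case "BOOLEAN": return "BOOLEAN";
            case "INT32":   return "INT";
            case "INT64":   return "BIGINT";
            case "FLOAT":   return "FLOAT";
            case "DOUBLE":  return "DOUBLE";
            case "INT96":   return "TIMESTAMP"; // assumption: INT96 is used for timestamps
            case "BINARY":  return "STRING";    // assumption: BINARY holds UTF-8 text
            default:
                throw new IllegalArgumentException("Unmapped Parquet type: " + parquetType);
        }
    }

    // Builds a CREATE EXTERNAL TABLE statement from column name -> Parquet type,
    // preserving the iteration order of the supplied map.
    static String createTableDdl(String table, String location,
                                 Map<String, String> parquetColumns) {
        StringJoiner cols = new StringJoiner(", ", "(", ")");
        for (Map.Entry<String, String> e : parquetColumns.entrySet()) {
            cols.add("`" + e.getKey() + "` " + hiveType(e.getValue()));
        }
        return "CREATE EXTERNAL TABLE `" + table + "` " + cols
                + " STORED AS PARQUET LOCATION '" + location + "'";
    }

    public static void main(String[] args) {
        // Hypothetical schema; in practice it would come from the Parquet file footer.
        Map<String, String> schema = new LinkedHashMap<>();
        schema.put("id", "INT64");
        schema.put("name", "BINARY");
        System.out.println(createTableDdl("events", "/data/events", schema));
    }
}
```

The generated statement could then be submitted through any Hive client. Nested or annotated Parquet types would need a richer mapping than this sketch provides.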

SerDe, InputFormat, and OutputFormat

2015-12-04 Thread Stephen Bly
Hi. I am confused on the difference between the Serializer/Deserializer and InputFormat/OutputFormat in the CREATE TABLE command. For example, let’s say I am using Sequence files, but my records are of some unknown type not known to Hive (i.e. not Thrift, Protocol Buffer, etc.). Could I st

Re: Persistent (and possibly asynchronous) Hive access from within Scala

2015-08-10 Thread Stephen Bly
We can use JDBC. But there’s also this new library http://hive.apache.org/javadocs/r1.1.1/api/org/apache/hive/jdbc/package-frame.html based on this Thrift interface (https://github.com/apache/hive/blob/0af6cb4

Persistent (and possibly asynchronous) Hive access from within Scala

2015-08-06 Thread Stephen Bly
What library should I use if I want to make persistent connections from within Scala/Java? I’m working on a web service that sends Hive queries to our HiveServer (we are about to upgrade to Hive 1.1 with Hive Server 2). Right now I’m using the Hive Driver for JDBC but that does not have the capa