[GitHub] hive pull request #518: HIVE-20419: Vectorization: Prevent mutation of Vecto...

2019-01-21 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/518 HIVE-20419: Vectorization: Prevent mutation of VectorPartitionDesc after being used in a hashmap key (Teddy Choi) Change-Id: Id2219d74cd09db8efc6c464acef27aa0bb95fe2b You can merge
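To illustrate the class of bug named in the HIVE-20419 title, here is a minimal sketch of what happens when an object is mutated after being used as a hash-map key. It is written in Python for brevity (Hive itself is Java), and `PartitionDesc` and its `input_format` field are hypothetical stand-ins, not Hive's actual `VectorPartitionDesc` class or the code changed in the pull request.

```python
class PartitionDesc:
    """Hypothetical stand-in for a descriptor object used as a map key."""

    def __init__(self, input_format):
        self.input_format = input_format

    def __eq__(self, other):
        return (isinstance(other, PartitionDesc)
                and self.input_format == other.input_format)

    def __hash__(self):
        # The hash depends on a mutable field, which is the root of the problem.
        return hash(self.input_format)


desc = PartitionDesc("orc")
cache = {desc: "settings for orc"}

# Before mutation, an equal key finds the entry.
found_before = PartitionDesc("orc") in cache

# Mutate the key object while it is still stored in the map.
desc.input_format = "text"

# The entry was filed under the old hash, but the stored key now compares
# equal only to the new value, so neither lookup reaches it.
found_by_old_value = PartitionDesc("orc") in cache
found_by_new_value = PartitionDesc("text") in cache

print(found_before, found_by_old_value, found_by_new_value, len(cache))
```

The entry is still in the map but can no longer be found by value. Common remedies are to make key objects immutable or to copy a key before storing it; which approach the pull request takes is not stated in this listing.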

[GitHub] hive pull request #517: HIVE-21091: Arrow serializer sets null at wrong inde...

2019-01-20 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/517 HIVE-21091: Arrow serializer sets null at wrong index You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-21091 Alternatively you

[GitHub] hive pull request #507: HIVE-21041: NPE, ParseException in getting schema fr...

2018-12-17 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/507 HIVE-21041: NPE, ParseException in getting schema from logical plan (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic

[GitHub] hive pull request #503: HIVE-21011: Upgrade MurmurHash 2.0 to 3.0 in vectori...

2018-12-05 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/503 HIVE-21011: Upgrade MurmurHash 2.0 to 3.0 in vectorized map and reduce operators (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https

[GitHub] hive pull request #501: HIVE-20994: Upgrade arrow version to 0.10.0 in branc...

2018-12-03 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/501 HIVE-20994: Upgrade arrow version to 0.10.0 in branch-3 (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20994

[GitHub] hive pull request #492: HIVE-20954: Vector RS operator is not using uniform ...

2018-11-21 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/492 HIVE-20954: Vector RS operator is not using uniform hash function for TPC-DS query 95 (Teddy Choi) Change-Id: Ia23b5ddefc2b35cda9ed7d817bdbd767ec7f7671 You can merge this pull request

[GitHub] hive pull request #485: HIVE-20873: Use Murmur hash for VectorHashKeyWrapper...

2018-11-06 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/485 HIVE-20873: Use Murmur hash for VectorHashKeyWrapperTwoLong to reduce hash collision (Teddy Choi) Change-Id: Ie3ae307acb331c48bc5e1cb9c417cd5d1d792f50 You can merge this pull request

[GitHub] hive pull request #480: HIVE-20827: Inconsistent results for empty arrays (T...

2018-10-29 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/480 HIVE-20827: Inconsistent results for empty arrays (Teddy Choi) Signed-off-by: Teddy Choi You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic

[GitHub] hive pull request #433: HIVE-20552: Get Schema from LogicalPlan for HWC (Ted...

2018-09-13 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/433 HIVE-20552: Get Schema from LogicalPlan for HWC (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20552

[GitHub] hive pull request #423: HIVE-20445: Add randomized tests to TestArrowColumna...

2018-08-27 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/423 HIVE-20445: Add randomized tests to TestArrowColumnarBatchSerDe You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20445

[GitHub] hive pull request #419: HIVE-20423: Set NULLS LAST as the default null order...

2018-08-21 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/419 HIVE-20423: Set NULLS LAST as the default null ordering You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20423 Alternatively

[GitHub] hive pull request #414: HIVE-20368: Remove VectorTopNKeyOperator lock (Teddy...

2018-08-11 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/414 HIVE-20368: Remove VectorTopNKeyOperator lock (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20368 Alternatively

[GitHub] hive pull request #411: HIVE-20150: TopNKey pushdown (Teddy Choi)

2018-08-09 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/411 HIVE-20150: TopNKey pushdown (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20150 Alternatively you can review

[GitHub] hive pull request #397: HIVE-20044: Arrow Serde should pad char values and h...

2018-07-10 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/397 HIVE-20044: Arrow Serde should pad char values and handle empty strings correctly (Teddy Choi) Change-Id: Ia87503aabf38c9599b887795d2b62566d8965f9b You can merge this pull request

[GitHub] hive pull request #396: HIVE-20044: Arrow Serde should pad char values and h...

2018-07-10 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/396 HIVE-20044: Arrow Serde should pad char values and handle empty strings correctly (Teddy Choi) Change-Id: Iaa4417056a6e2a88335ed0cd5436142e128009c6 You can merge this pull request

[GitHub] hive pull request #386: HIVE-20052: Arrow serde should fill ArrowColumnVecto...

2018-07-02 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/386 HIVE-20052: Arrow serde should fill ArrowColumnVector(Decimal) with the given schema precision/scale (Teddy Choi) You can merge this pull request into a Git repository by running: $ git

[GitHub] hive pull request #382: HIVE-20033: Backport HIVE-19432 to branch-2, branch-...

2018-06-29 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/382 HIVE-20033: Backport HIVE-19432 to branch-2, branch-3 (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20033-branch

[GitHub] hive pull request #381: HIVE-20033: Backport HIVE-19432 to branch-2, branch-...

2018-06-29 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/381 HIVE-20033: Backport HIVE-19432 to branch-2, branch-3 (Teddy Choi) You can merge this pull request into a Git repository by running: $ git pull https://github.com/pudidic/hive HIVE-20033-branch

[GitHub] hive pull request #371: HIVE-19853: Arrow serializer needs to create a TimeS...

2018-06-11 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/371 HIVE-19853: Arrow serializer needs to create a TimeStampMicroTZVector instead of TimeStampMicroVector You can merge this pull request into a Git repository by running: $ git pull https

[GitHub] hive pull request #369: HIVE-19723: Arrow serde: "Unsupported data type: Tim...

2018-06-10 Thread pudidic
GitHub user pudidic closed the pull request at: https://github.com/apache/hive/pull/369

[GitHub] hive pull request #369: HIVE-19723: Arrow serde: "Unsupported data type: Tim...

2018-06-07 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/369 HIVE-19723: Arrow serde: "Unsupported data type: Timestamp(NANOSECOND, null)" This pull request added a randomized unit test, supports microsecond for Spark integration, a

[GitHub] hive pull request #360: HIVE-19723: Arrow serde: "Unsupported data type: Tim...

2018-06-07 Thread pudidic
GitHub user pudidic closed the pull request at: https://github.com/apache/hive/pull/360

[GitHub] hive pull request #360: HIVE-19723: Arrow serde: "Unsupported data type: Tim...

2018-05-28 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/360 HIVE-19723: Arrow serde: "Unsupported data type: Timestamp(NANOSECOND, null)" Spark's Arrow support only provides Timestamp at MICROSECOND granularity. Spark 2.3.0 won't accept N

[GitHub] hive pull request #355: HIVE-19584: Dictionary encoding for string types

2018-05-24 Thread pudidic
GitHub user pudidic opened a pull request: https://github.com/apache/hive/pull/355 HIVE-19584: Dictionary encoding for string types Apache Arrow supports dictionary encoding for some data types. So implement dictionary encoding for string types in Arrow SerDe. You can merge this
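As a rough illustration of the dictionary encoding mentioned in HIVE-19584, the sketch below stores each distinct string once and replaces the column values with integer indices into that dictionary. It is plain Python written for this listing and does not use the Arrow API or reflect Hive's Arrow SerDe implementation; the function names `dictionary_encode` and `dictionary_decode` are hypothetical.

```python
def dictionary_encode(values):
    """Return (dictionary, indices) for a list of strings; None marks a null."""
    dictionary = []   # distinct values in order of first appearance
    index_of = {}     # value -> position in `dictionary`
    indices = []      # one entry per input row
    for value in values:
        if value is None:
            indices.append(None)
            continue
        if value not in index_of:
            index_of[value] = len(dictionary)
            dictionary.append(value)
        indices.append(index_of[value])
    return dictionary, indices


def dictionary_decode(dictionary, indices):
    """Rebuild the original column from the dictionary and the indices."""
    return [None if i is None else dictionary[i] for i in indices]


column = ["KR", "US", "KR", None, "US", "KR"]
dictionary, indices = dictionary_encode(column)
print(dictionary)  # ['KR', 'US']
print(indices)     # [0, 1, 0, None, 1, 0]
```

The encoding pays off when a string column has few distinct values relative to its row count, since each repeated string is replaced by a small integer.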