Update:

When I run the same test suite in IntelliJ, the tests that fail consistently
via the CLI (and still do) pass.
So I do think this is indeed a local JVM setup issue.

Investigating the differences now, but the class in question is
*org.apache.iceberg.mr.hive.TestHiveIcebergStorageHandlerWithEngine*
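
For comparing the two environments, a minimal sketch like the following could
be run once from IntelliJ and once from the CLI, and the two outputs diffed.
The class name `JvmSnapshot` and the choice of properties are assumptions for
illustration only; none of this is part of the Iceberg build.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class JvmSnapshot {
    // Standard JVM system properties picked for illustration (an assumption
    // about what is likely to differ); extend the list as needed.
    static final String[] KEYS = {
        "java.version", "java.vendor", "java.home",
        "os.name", "os.arch", "java.io.tmpdir", "java.library.path"
    };

    // Collects the selected properties in a stable order so that two runs
    // can be compared line by line.
    static Map<String, String> snapshot() {
        Map<String, String> out = new LinkedHashMap<>();
        for (String key : KEYS) {
            out.put(key, String.valueOf(System.getProperty(key)));
        }
        return out;
    }

    public static void main(String[] args) {
        snapshot().forEach((k, v) -> System.out.println(k + "=" + v));
    }
}
```

Redirecting the output of each run to a file and diffing the two files should
show whether the IDE and the CLI are actually using the same JDK and paths.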

The failures seem to be caused by a NoClassDefFoundError, specifically for
org.xerial.snappy.Snappy. It also happens for ORC, but not for Parquet.

Included is a sample output:
```
java.lang.NoClassDefFoundError: Could not initialize class org.xerial.snappy.Snappy
at org.xerial.snappy.SnappyOutputStream.<init>(SnappyOutputStream.java:99) ~[snappy-java-1.1.8.jar:1.1.8]
at org.xerial.snappy.SnappyOutputStream.<init>(SnappyOutputStream.java:91) ~[snappy-java-1.1.8.jar:1.1.8]
at org.xerial.snappy.SnappyOutputStream.<init>(SnappyOutputStream.java:81) ~[snappy-java-1.1.8.jar:1.1.8]
at org.apache.tez.common.TezUtils.createByteStringFromConf(TezUtils.java:81) ~[tez-api-0.10.1.jar:0.10.1]
```
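
As a side note on reading this trace: "Could not initialize class" generally
means the class's static initializer already failed earlier in the same JVM,
so the first failure (an ExceptionInInitializerError) is usually the more
informative one to look for in the logs. Below is a minimal, self-contained
sketch of that JVM behaviour. `InitFailureDemo` and `Fragile` are hypothetical
stand-ins; the sketch does not touch snappy-java itself.

```java
public class InitFailureDemo {
    // Hypothetical stand-in for a class whose static initializer fails,
    // for example because a native library cannot be loaded.
    static class Fragile {
        static final int VALUE = load();

        static int load() {
            throw new IllegalStateException("simulated native load failure");
        }
    }

    // Touches Fragile and returns the simple name of whatever is thrown.
    static String touch() {
        try {
            return "ok " + Fragile.VALUE;
        } catch (Throwable t) {
            return t.getClass().getSimpleName();
        }
    }

    public static void main(String[] args) {
        // The first access runs the static initializer and reports its failure;
        // later accesses only report that the class could not be initialized.
        System.out.println("first access:  " + touch());
        System.out.println("second access: " + touch());
    }
}
```

If the same pattern applies here, the log from the failing run should contain
an earlier error for org.xerial.snappy.Snappy that carries the root cause.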

Apologies for speaking too soon. *I'm now +0 (non-binding)*, provided we fix
the 0.13.x branch so that the associated commit ID is no longer detached.
The tag *apache-iceberg-0.13.2-rc1* works just fine, but the 0.13.x branch
doesn't contain the commit ID in question. Not sure if that's a major concern
or not.

Cheers,
Kyle

On Sun, Jun 5, 2022 at 11:51 AM Kyle Bendickson <k...@tabular.io> wrote:

> Thanks Eduard!
>
> I have:
> - verified the signature
> - verified the checksum in the file given as well as of the artifact
> - ran all unit tests on Java 11, all passed
> - ran all unit tests on Java 8, some hive-3 tests consistently fail (I do
> notice they passed on GitHub, but the same tests fail every time for me
> despite giving the JVM more memory and checking for OOM)
> - ran a simple smoke test suite of CRUD on namespaces and v1 and v2 tables
> with Spark (3.2, 3.1) and Flink (1.13 and 1.14).
> - ran some upsert-related tests on Flink 1.13 and 1.14 (1.12 has been given
> a deprecation notice)
>
> *Problems:*
> I did notice that the *given commit ID is considered unattached (and I
> wasn't able to check it out).* I am running my tests by using the
> provided JAR with engines and then running unit tests locally for the
> commit just prior (with commit ID
> *fae977a9f0a79266a04647b0df2ab540cf0dcff4*).
>
> Not sure if this is a huge issue. Outside of this unattached commit,
> my only concern is the failing `iceberg-hive3` tests; as they passed in
> CI, it's possibly an issue with my local setup.
>
> Running the hive-3 test suite alone, the same tests failed multiple times,
> but again that might be something to do with my computer / JVM configuration.
>
> *I am -1 (non-binding)*, primarily based on the detached commit (as I had
> quite a good bit of trouble trying to fetch it through my normal processes)
> as well as the failing hive3 tests (though that's not exactly within my
> area of expertise).
>
> If the hive3 test failures only occur for me, then if
> we fix the "Add version.txt commit" in branch 0.13.x such that it's present
> when I fetch branch 0.13.x, I'd be +1. Unfortunately, I can't help with
> cleaning up the release branch beyond advising somebody else (if
> desired), but I'm happy to help with that.
>
> The hive3 test failures for me seem to be OOM related, but I raised my
>
> Find attached a picture of the detached commit ID,
> *0784d64a659abd4fdaa82cdb599a250a7514facf*, per GitHub.
>
> [image: image.png]
>
> Example test failures
> org.apache.iceberg.mr.hive.TestHiveIcebergStorageHandlerWithEngine >
> testCBOWithSelectedColumnsOverlapJoin[fileFormat=AVRO, engine=tez,
> catalog=HIVE_CATALOG, isVectorized=false] FAILED
>     java.lang.IllegalArgumentException: Failed to execute Hive query
> 'SELECT c.first_name, o.order_id FROM default.orders o JOIN
> default.customers c ON o.customer_id = c.customer_id ORDER BY o.order_id
> DESC': Error while processing statement: FAILED: Execution Error, return
> code 1 from org.apache.hadoop.hive.ql.exec.tez.TezTask
>         at
> org.apache.iceberg.mr.hive.TestHiveShell.executeStatement(TestHiveShell.java:152)
>         at
> org.apache.iceberg.mr.hive.TestHiveIcebergStorageHandlerWithEngine.testCBOWithSelectedColumnsOverlapJoin(TestHiveIcebergStorageHandlerWithEngine.java:236)
>
>         Caused by:
>         org.apache.hive.service.cli.HiveSQLException: Error while
> processing statement: FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.tez.TezTask
>             at
> org.apache.hive.service.cli.operation.Operation.toSQLException(Operation.java:335)
>             at
> org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:226)
>             at
> org.apache.hive.service.cli.operation.SQLOperation.runInternal(SQLOperation.java:263)
>             at
> org.apache.hive.service.cli.operation.Operation.run(Operation.java:247)
>             at
> org.apache.hive.service.cli.session.HiveSessionImpl.executeStatementInternal(HiveSessionImpl.java:541)
>             at
> org.apache.hive.service.cli.session.HiveSessionImpl.executeStatement(HiveSessionImpl.java:510)
>             at
> org.apache.hive.service.cli.CLIService.executeStatement(CLIService.java:267)
>             at
> org.apache.iceberg.mr.hive.TestHiveShell.executeStatement(TestHiveShell.java:139)
>             ... 1 more
>
> Thank you for working on this,
> Kyle
>
>
>
>
> On Wed, Jun 1, 2022 at 11:12 PM Eduard Tudenhoefner <edu...@dremio.com>
> wrote:
>
>> Hi Everyone,
>>
>> I propose that we release the following RC as the official Apache Iceberg
>> 0.13.2 release.
>>
>> The commit ID is *0784d64a659abd4fdaa82cdb599a250a7514facf*
>>
>>
>>    - This corresponds to the tag: *apache-iceberg-0.13.2-rc1*
>>    - https://github.com/apache/iceberg/commits/apache-iceberg-0.13.2-rc1
>>    -
>>    
>> https://github.com/apache/iceberg/tree/0784d64a659abd4fdaa82cdb599a250a7514facf
>>
>>
>> The release tarball, signature, and checksums are here:
>>
>>    -
>>    https://dist.apache.org/repos/dist/dev/iceberg/apache-iceberg-0.13.2-rc1
>>
>>
>> You can find the KEYS file here:
>>
>>    - https://dist.apache.org/repos/dist/dev/iceberg/KEYS
>>
>>
>> Convenience binary artifacts are staged on Nexus. The Maven repository
>> URL is:
>>
>>    -
>>    https://repository.apache.org/content/repositories/orgapacheiceberg-1088/
>>
>>
>> Please download, verify, and test.
>>
>> Please vote in the next 72 hours.
>>
>> [ ] +1 Release this as Apache Iceberg <VERSION>
>> [ ] +0
>> [ ] -1 Do not release this because...
>>
>
