[ https://issues.apache.org/jira/browse/ARROW-17066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated ARROW-17066: ----------------------------------- Labels: pull-request-available (was: ) > [C++][Python][Substrait] "ignore_unknown_fields" should be specified when > converting JSON to binary > --------------------------------------------------------------------------------------------------- > > Key: ARROW-17066 > URL: https://issues.apache.org/jira/browse/ARROW-17066 > Project: Apache Arrow > Issue Type: Bug > Reporter: Richard Tia > Assignee: Vibhatha Lakmal Abeykoon > Priority: Blocker > Labels: pull-request-available > Time Spent: 10m > Remaining Estimate: 0h > > [https://developers.google.com/protocol-buffers/docs/reference/cpp/google.protobuf.util.json_util#JsonParseOptions] > > When converting a substrait JSON to binary, there are many unknown fields > that may exist since substrait is being built every week. > ignore_unknown_fields should be specified when doing this conversion. > > This is resulting in frequent errors similar to this: > {code:java} > E pyarrow.lib.ArrowInvalid: JsonToBinaryStream returned > INVALID_ARGUMENT:(relations[0].root.input.sort.input.aggregate.measures[0].measure) > arguments: Cannot find field. > pyarrow/error.pxi:100: ArrowInvalid {code} -- This message was sent by Atlassian Jira (v8.20.10#820010)