David Li created ARROW-17229:
--------------------------------

             Summary: [C++] ReadRel is translated to a source node that emits 
unexpected fields
                 Key: ARROW-17229
                 URL: https://issues.apache.org/jira/browse/ARROW-17229
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: David Li


Currently, a Substrait plan with a RelRoot containing a ReadRel will contain 
extra, unexpected fields, namely {{__fragment_index}} et. al. Right now they 
are always included by default. There are a few things to be done:

* ReadRel's {{base_schema}} could be converted into a 
{{ScanOptions.dataset_schema}} to limit the fields read. (Also see ARROW-15585, 
these fields should be used for pushdown projection)
* The scanner always adds these extra fields - maybe it should be opt-in instead
* There's no way to manually insert a Project to "fix" things because as 
implemented, it can only add new columns



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to