worker24h created ARROW-5282:
--------------------------------

             Summary: Can't read data from parquet file in C++ library
                 Key: ARROW-5282
                 URL: https://issues.apache.org/jira/browse/ARROW-5282
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: worker24h


Specified the second param *parquet::ReaderProperties* When I used 
parquet::ParquetFileReader::Open, it can't work.
 The following code:
{code:java}
parquet::ReaderProperties _properties;
_properties = parquet::ReaderProperties(); 
_properties.enable_buffered_stream();  // used  buffer stream.  Don't set 
buffer-size
parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties);
...
int32_t value;
parquet::Int32Reader* int32_reader =
static_cast<parquet::Int32Reader*>(column_reader.get());
int32_reader->Skip(_current_line_of_group);// skip lines of processed.
rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, &values_read); 
 

{code}
The interface *Skip* throw exception:

{color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid data 
Deserializing page header failed.}}{color}

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to