dragongu opened a new issue, #3032:
URL: https://github.com/apache/parquet-java/issues/3032
### Describe the usage question you have. Please include as many useful
details as possible.
The following code currently has getDataSize as an estimated value. The
Iceberg rolling file write operation relies on this method, which may result in
writing files that are much smaller than expected.
```JAVA
/**
* @return the total size of data written to the file and buffered in memory
*/
public long getDataSize() {
return lastRowGroupEndPos + columnStore.getBufferedSize();
}
```
Could we provide a potentially larger getDataSize? I can't think of any
downsides at the moment.
### Component(s)
_No response_
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]