Re: [Doc] ORC-related documentation

2021-11-25 Thread Ian Joiner
Hi Joris, Really thanks for pointing out where the doc sources are! I will start the PR and share it with you so that we can work on it together. You know, I can do the ORC reader & writer with options and you can do the dataset integration that you did. Best, Ian On Thursday, November 25, 2021,

Re: [Doc] ORC-related documentation

2021-11-25 Thread Joris Van den Bossche
Hi Ian, Yes, more documentation regarding ORC would be very welcome! I think your list of missing docs is correct: - It's briefly mentioned in the Python API docs (https://arrow.apache.org/docs/python/api/formats.html#orc-files), but incomplete - The C++ reference docs list the OrcFileFormat for

[Doc] ORC-related documentation

2021-11-23 Thread Ian Joiner
Hi, Today I found that pretty much none of our ORC-related work (e.g. ORC writer in C++ & Python, Arrow Dataset with ORC) has ever been documented. This is something we have to fix or users won’t even be aware that ORC support exists, let alone how to use it. >From my understanding it seems that