alamb commented on PR #79: URL: https://github.com/apache/datafusion-site/pull/79#issuecomment-3040245413
I just pushed a commit that reworked the intro a bit and started filling out the background.

<img width="917" alt="Screenshot 2025-07-05 at 5 06 45 PM" src="https://github.com/user-attachments/assets/efe36816-7fed-44d7-9158-1b2fc19ffb19" />

<img width="722" alt="Screenshot 2025-07-05 at 5 06 41 PM" src="https://github.com/user-attachments/assets/07e36677-46a6-4f2e-8b8a-c5eab9545167" />

@JigaoLuo the outlook section you describe sounds great. I envision it right after the

```
## 1. Parquet 101: File Anatomy & Standard Index Structures
```

section, perhaps like

```
## 2. Extending Parquet with Special Indexes
```

(this is where figure 2 goes and where we will explain how to embed a custom index).

So it makes a lot of sense to mention the potential use cases here, and to note that the index can be written after each row group or at the end of the file, and that it can hold information for each row group, individual row groups, columns, etc., whatever you want.

I would also be interested to hear what @zhuqi-lucas thinks.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

For additional commands, e-mail: github-h...@datafusion.apache.org