timsaucer opened a new issue, #842:
URL: https://github.com/apache/datafusion-python/issues/842

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   
   In addition to the information we already have in the online documentation, 
it would be helpful to write a tutorial guiding a user through the various 
portions of DataFusion and how to get started. This issue is to collect ideas 
for what people would like to see in such a tutorial
   
   **Describe the solution you'd like**
   
   Please comment with topics that should be covered.
   
   **Additional context**
   
   Things I would like to see (unsorted list)
   
   - Creating a small dataframe from a pyarrow array
   - Reading/Writing data from/to csv and parquet
   - Zero copy import of data from pyarrow
   - Transferring DataFrame to/from pandas/polars
   - Displaying data via show(), _repr_html_(), and [great 
tables](https://posit-dev.github.io/great-tables/articles/intro.html)
   - Basic column selection, including indexing into fields and element for 
structs and arrays
   - Performing joins
   - Performing window and aggregate functions, including how default and 
custom window frames work
   - Integrating with [deltalake](https://pypi.org/project/deltalake/)
   - Using object store from S3, Google Cloud, Azure
   - Unnesting columns
   - Making structs and arrays
   - Chaining DataFrame operations with `transform` (PR in review)
   - Doing a variety of conditional operations (both `case` and `when` without 
base statement)
   - Examples of string manipulation
   - Doing date time conversion
   - Writing a UDF (advanced topic: writing a rust UDF and using with 
datafusion-python)
   
   Please add to the list what you would like to see!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to