alamb commented on issue #11442:
URL: https://github.com/apache/datafusion/issues/11442#issuecomment-2353490505

   >  I think especially in the RND world (industrial and academic), Datafusion 
makes research easier and more interesting, since you're starting from a 
already-present foundation and extending it/modifying it as you need. I think 
that Datafusion could probably benefit greatly from more academic 
collaborations?  I'd imagine that a lot of the performance optimisations, but 
also other kinds of projects, would make a great Master thesis or research 
paper in the DBMS world. 
   
   I agree entirely @Abdullahsab3 -- thank you. In fact I believe it is exactly 
the plan of @XiangpengHao to do so. Perhaps he has some insights about how to 
make it more appealing to researchers
   
   I also think Andy Pavlo's Advanced Database Course was an early adopter and 
tried to make projects based on DataFusion Spring 2024:  
https://15721.courses.cs.cmu.edu/spring2024/project.html . I didn't hear much 
about how this actually went or what we could do to make it easier next time. 
   
   > It might also be interesting if projects that are built on top of 
Datafusion could also present and explain how they used Datafusion to build 
their project and what the advantages were of using Datafusion.
   
   100% agree. This was the topic of many of the DataFusion San Franciso meetup 
talks recently, and I spoked about it in this talk:
   *  [DataCouncil 2024: 
](https://www.datacouncil.ai/talks24/building-influxdb-30-with-apache-arrow-datafusion-flight-and-parquet?hsLang=en)Building
 InfluxDB 3.0 with Apache Arrow, DataFusion, Flight and Parquet. 
[slides](https://docs.google.com/presentation/d/12kdYHLyH79B5__9xs3de_hZyG9geW4jC3vUpiy39VA0),
 [recording](https://www.youtube.com/watch?v=I-Z7kFGsYRI),
   
   I am particularly excited about the CMU database series this spring promises 
to be full of such explanations (the majority of those systems use DataFusion 
in some  way) : https://db.cs.cmu.edu/seminar2024/


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to