Personally I would use a .csv format and store the file on a S3/GCS bucket. Xcom is meant to store small amount of data.
T. On Tue, Dec 24, 2019 at 10:33 AM Robin Edwards <r...@bidnamic.com> wrote: > Feather is probably a good option for data frames: > > > https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_feather.html > > R > > On Tue, 24 Dec 2019 at 07:52, Deng Xiaodong <xd.den...@gmail.com> wrote: > > > > Hi David. > > > > The only “out of box” way to share data/information between tasks is > XCom ( > > > https://airflow.apache.org/docs/stable/concepts.html?highlight=xcom#xcoms > ). > > > > For you case, the quick suggestion I can share is > > > > - either merging your tasks > > - or persisting your Pandas Dataframes somewhere then load it in your 2nd > > task (e.g. using pickle) > > > > > > XD > > > > On Tue, Dec 24, 2019 at 15:00 David Muñoz <david.munoz4...@gmail.com> > wrote: > > > > > Hi, > > > > > > Excuse me, I am new to this and maybe this topic has already been > treated. > > > > > > I would like to know if there is a way to "share/pass" pandas > dataframes > > > between tasks in airflow. > > > > > > Any help would be appreciated. > > > > > > Thank you!!! > > > > > > David. > > > > -- Tomasz Urbaszek Polidea <https://www.polidea.com/> | Software Engineer M: +48 505 628 493 <+48505628493> E: tomasz.urbas...@polidea.com <tomasz.urbasz...@polidea.com> Unique Tech Check out our projects! <https://www.polidea.com/our-work>