Personally I would use a .csv format and store the file on a S3/GCS bucket.
Xcom is meant to store small amount of data.

T.

On Tue, Dec 24, 2019 at 10:33 AM Robin Edwards <r...@bidnamic.com> wrote:

> Feather is probably a good option for data frames:
>
>
> https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_feather.html
>
> R
>
> On Tue, 24 Dec 2019 at 07:52, Deng Xiaodong <xd.den...@gmail.com> wrote:
> >
> > Hi David.
> >
> > The only “out of box” way to share data/information between tasks is
> XCom (
> >
> https://airflow.apache.org/docs/stable/concepts.html?highlight=xcom#xcoms
> ).
> >
> > For you case, the quick suggestion I can share is
> >
> > - either merging your tasks
> > - or persisting your Pandas Dataframes somewhere then load it in your 2nd
> > task (e.g. using pickle)
> >
> >
> > XD
> >
> > On Tue, Dec 24, 2019 at 15:00 David Muñoz <david.munoz4...@gmail.com>
> wrote:
> >
> > > Hi,
> > >
> > > Excuse me, I am new to this and maybe this topic has already been
> treated.
> > >
> > > I would like to know if there is a way to "share/pass" pandas
> dataframes
> > > between tasks in airflow.
> > >
> > > Any help would be appreciated.
> > >
> > > Thank you!!!
> > >
> > > David.
> > >
>


-- 

Tomasz Urbaszek
Polidea <https://www.polidea.com/> | Software Engineer

M: +48 505 628 493 <+48505628493>
E: tomasz.urbas...@polidea.com <tomasz.urbasz...@polidea.com>

Unique Tech
Check out our projects! <https://www.polidea.com/our-work>

Reply via email to