Hi,

What is the writing speed that we should expect from the Arrow Parquet
writer?

I'm writing a RecordBatch with two columns and 1,000,000 records and it
takes a lot of time to write the batch to the file (close to 2 secs).

This is what I'm doing

let schema = Schema::new(vec![
>     Field::new("col_1", DataType::Utf8, false),
>     Field::new("col_2", DataType::Utf8, false),
> ]);
> let batch = RecordBatch::try_new(
>     Arc::new(schema.clone()),
>     vec![Arc::new(array_1), Arc::new(array_2)],
> )
> .unwrap();
> let mut writer = ArrowWriter::try_new(file, Arc::new(schema.clone()),
> None).unwrap();
> writer.write(&batch).unwrap();
> writer.close().unwrap();


 I'm comparing a similar operation with Pandas and that is almost
immediate.

Is there something I'm missing?

Thanks,

Reply via email to