Hi,
What write speed should we expect from the Arrow Parquet
writer?
I'm writing a RecordBatch with two columns and 1,000,000 records, and it
takes a long time to write the batch to the file (close to 2 seconds).
This is what I'm doing:
> let schema = Schema::new(vec![
> Field::new("col_1", DataType::Utf8, false),
> Field::new("col_2", DataType::Utf8, false),
> ]);
> let batch = RecordBatch::try_new(
> Arc::new(schema.clone()),
> vec![Arc::new(array_1), Arc::new(array_2)],
> )
> .unwrap();
> let mut writer = ArrowWriter::try_new(file, Arc::new(schema.clone()),
> None).unwrap();
> writer.write(&batch).unwrap();
> writer.close().unwrap();
I'm comparing this with a similar operation in Pandas, which is almost
instantaneous.
Is there something I'm missing?
Thanks,