Hi All,

New to this. And still trying to figure out where exactly Arrow fits in the
ecosystem of various Big Data technologies.

In that respect first thing which came to my mind is how does Arrow compare
with parquet.

In my understanding Parquet also supports a very efficient columnar format
(with support for nested structure). It is already embraced (supported) by
various technologies like Impala (origin), Spark, Drill etc.

The only think I see missing in Parquet is support for SIMD based
vectorized operations.

Am I right or am I missing many other differences between Arrow and parquet
?

Regards,
Sourav

Reply via email to