Text data structures-optimized layout in Arrow

Edmon Begoli Sat, 02 Mar 2019 19:40:35 -0800

Colleagues:

A colleague and I are working on optimized memory and disk layouts for raw
and pre-processed text, built on specialized data structures. Our goal is
efficient I/O, inter-process transmission, and media/memory storage of
text-oriented data (e.g., clinical narratives, radiology and pathology
reports).


Has anyone on the Arrow dev team tackled this problem of efficient text
storage yet?
(Not just plain text, but also storing text data structures in Arrow format.)

If not, would you welcome a contribution?

Thank you,
Edmon
