Colleagues: A colleague and I are working on optimized structures for memory and disk layout for raw and pre-processed text using specialized data structures, and with a goal of efficient I/O, inter-process transmissions, and media/memory storage of text-oriented data (e.g. clinical narratives, radiology and pathology reports, etc.)
Has anyone on the Arrow dev team tackled this problem of efficient text storage yet? (not just plain text, but storing data structures in an arrow format) If not, would you welcome a contribution? Thank you, Edmon