Hello,

I am working on converting some internal data sources to Arrow data. One
particularly sets of data we have contains many string columns that can be
dictionary-encoded (basically string enums)

The current internal C++ API I am using gives me an iterator of "row"
objects, for each string column, the row object exposes a method
"getStringField(index)" that return me a "string_view" and I want to
construct a dictionary-encoded Arrow string column from it.

My question is:
(1) Is there a way to do this using the Arrow C++ API?
(2) Does the internal C++ API need to return something other than a
"string_view" to support this? Internally the string column is already
dictionary-encoded (although not in Arrow format) and it might already know
the dictionary and the encoded (int) value for each string field, but it
doesn't expose it now.

Thanks,
Li

Reply via email to