FWIW we'd made a similar assumption. In Schema.fbs [1] the type is called
Utf8, and the corresponding Java class is `ArrowType.Utf8` - is UTF-8 maybe
a required assumption for interoperating with other language Arrow libs?
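
To make the distinction concrete, here is a minimal stdlib-only sketch (the
class and method names are made up for illustration and are not part of the
Arrow API). It assumes that a reader in another language decodes the bytes of
a Utf8 column as UTF-8, and shows that UTF-16 bytes would not survive that:

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, JDK only; not part of the Arrow API.
class Utf8RoundTrip {
    // Encode a Java String as UTF-8 bytes (the encoding the Utf8 type names).
    static byte[] toUtf8(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    // Decode bytes on the assumption that they are UTF-8.
    static String fromUtf8(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String original = "h\u00e9llo"; // "héllo"

        byte[] utf8 = toUtf8(original);
        byte[] utf16 = original.getBytes(StandardCharsets.UTF_16LE);

        System.out.println(utf8.length);  // 6: the accented letter takes two bytes
        System.out.println(utf16.length); // 10: two bytes per character, no BOM

        // UTF-8 bytes round-trip; UTF-16 bytes read as UTF-8 do not.
        System.out.println(fromUtf8(utf8).equals(original));  // true
        System.out.println(fromUtf8(utf16).equals(original)); // false
    }
}
```

So storing UTF-16 bytes in a VarCharVector might work within one Java process
that decodes them with the same Charset, but I wouldn't expect other readers
to interpret them correctly.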

James

[1] https://github.com/apache/arrow/blob/master/format/Schema.fbs

On Thu, 29 Sept 2022 at 18:57, Larry White <ljw1...@gmail.com> wrote:

> Hi Kevin,
>
> I don't know of any particular restriction regarding string encoding.
> VarCharVector stores data as a byte array, and the encoding can be set
> using the Charset class when you convert Strings to and from bytes. Since
> Java strings use UTF-16 internally, I would expect this to 'just work'.
>
> larry
>
> On Thu, Sep 29, 2022 at 12:46 PM Kevin Bambrick <kevinbambri...@gmail.com>
> wrote:
>
> > Hi.
> >
> > I was just wondering whether support for UTF-16 strings was considered.
> > As far as I am aware, VarChar vectors only support UTF-8. Is UTF-16
> > something that may be supported in the future?
> >
> > Regards.
> > Kevin.
> >
>


-- 
*James Henderson*
XTDB Development Manager at *JUXT*

Email j...@juxt.pro
Website https://juxt.pro
