As another item for consideration -- in C++ at least, the dictionary
id is dealt with as an internal detail of the IPC message production
process. When serializing the Schema, ids are assigned to each
dictionary-encoded field in the DictionaryMemo object, see
https://github.com/apache/arrow/blob/
Neal Richardson created ARROW-8369:
--
Summary: [CI] Fix crossbow R group
Key: ARROW-8369
URL: https://issues.apache.org/jira/browse/ARROW-8369
Project: Apache Arrow
Issue Type: Bug
I opened a JIRA to track a potential change or at least clarification
about this use case. One major use case for the C interface will be in
database clients (e.g. this question arose out of using the C
interface for Kudu -- a database) and this may be a common question.
https://issues.apache.org/
Wes McKinney created ARROW-8368:
---
Summary: [Format] In C interface, clarify resource management for
consumers needing only a subset of child fields in ArrowArray
Key: ARROW-8368
URL: https://issues.apache.org/jira/b
Ben Kietzman created ARROW-8367:
---
Summary: [C++] Is FromString(..., pool) worthwhile
Key: ARROW-8367
URL: https://issues.apache.org/jira/browse/ARROW-8367
Project: Apache Arrow
Issue Type: Impr
hey Paul,
Take a look at how dictionaries work in the IPC protocol
https://github.com/apache/arrow/blob/master/docs/source/format/Columnar.rst#serialization-and-interprocess-communication-ipc
Dictionaries are sent as separate messages. When a field is tagged as
dictionary encoded in the schema,
On 07/04/2020 at 19:39, Wes McKinney wrote:
>
> Re-orienting the discussion on something more concrete, suppose that an
> ArrowArray is used to convey a result set from a database query, and
> suppose that the resources associated with each column in the result set
> are independent of the oth
On Tue, Apr 7, 2020, 12:04 PM Antoine Pitrou wrote:
>
> On 07/04/2020 at 18:49, Todd Lipcon wrote:
> >>
> >> Hmm, the spec may not be clear enough on this, but if you move a child
> >> and release the parent, then the other children are not usable anymore.
> >>
> >> In your case, you don't call
On 07/04/2020 at 18:49, Todd Lipcon wrote:
>>
>> Hmm, the spec may not be clear enough on this, but if you move a child
>> and release the parent, then the other children are not usable anymore.
>>
>> In your case, you don't call release() on every child. You just call
>> release() on the pare
On Tue, Apr 7, 2020 at 2:40 AM Antoine Pitrou wrote:
>
> On 06/04/2020 at 19:22, Todd Lipcon wrote:
> >
> > The spec should also probably cover thread-safety: if the consumer gets an
> > ArrowArray, is it safe to pass off the children to multiple threads and
> > have them call release() concu
I did a bit more research on JIRA, and this topic also seems to be open there:
https://issues.apache.org/jira/browse/ARROW-6959 covers the same topic as my
mail, and in
https://issues.apache.org/jira/browse/ARROW-7009 we wanted to remove some of
the interfaces with reference-t
Hello,
I'm trying to build a Rust-based Flight server and I'd like to use
Dictionary encoding for a number of string columns in my data. I've seen
that StringDictionary was recently added to Rust here:
https://github.com/apache/arrow/commit/c7a7d2dcc46ed06593b994cb54c5eaf9ccd1d21d#diff-72812e308734
Andy Grove created ARROW-8366:
-
Summary: [Rust] Need to revert recent arrow-flight build change
Key: ARROW-8366
URL: https://issues.apache.org/jira/browse/ARROW-8366
Project: Apache Arrow
Issue T
Juan Galvez created ARROW-8365:
--
Summary: arrow-cpp: Error when writing files to S3 larger than 5 GB
Key: ARROW-8365
URL: https://issues.apache.org/jira/browse/ARROW-8365
Project: Apache Arrow
I
Or created ARROW-8364:
-
Summary: Get Access to the type_to_type_id dictionary
Key: ARROW-8364
URL: https://issues.apache.org/jira/browse/ARROW-8364
Project: Apache Arrow
Issue Type: Improvement
Krisztian Szucs created ARROW-8363:
--
Summary: [Archery] Comment bot should report any errors happening
during crossbow submit
Key: ARROW-8363
URL: https://issues.apache.org/jira/browse/ARROW-8363
Pro
Krisztian Szucs created ARROW-8362:
--
Summary: [Crossbow] Ensure that the locally generated version is
used in the docker tasks
Key: ARROW-8362
URL: https://issues.apache.org/jira/browse/ARROW-8362
Pr
Hello all,
I'm in the process of changing the implementation of the Take kernel to work
on ChunkedArrays without concatenating them into a single Array first. While
working on the implementation, I realised that we often switch between Datum
and the specific-typed parameters. This works quite
I guess it would be helpful, when trying to achieve zero-modification
between R and another language, if the standard used for communication
would allow for that. Or when setting all nulls to zero for an algorithm
and then saving it to a database for later use. But at the same time, I
only know
On 06/04/2020 at 19:22, Todd Lipcon wrote:
>
> The spec should also probably cover thread-safety: if the consumer gets an
> ArrowArray, is it safe to pass off the children to multiple threads and
> have them call release() concurrently? In other words, do I need to use a
> thread-safe referenc
Antoine Pitrou created ARROW-8361:
-
Summary: [C++] Add Result APIs to Buffer methods and functions
Key: ARROW-8361
URL: https://issues.apache.org/jira/browse/ARROW-8361
Project: Apache Arrow