Here are some realistic tabular data sets...
https://github.com/lemire/RealisticTabularDataSets
They are small by modern standards, but they are also one GitHub clone away.
- Daniel
On Wed, Jan 24, 2018 at 2:26 PM, Wes McKinney wrote:
> Thanks Ted. I will echo these comments and recommend to r
You might be missing a "-l" flag or two in addition to the "-I" flag. You
might also need a "-L" flag.
On Thu, Dec 7, 2017 at 1:34 PM, Renato Marroquín Mogrovejo <
renatoj.marroq...@gmail.com> wrote:
> Hi devs,
>
> I have also sent this question to the parquet mailing list, but I guess
> this is
I don't know the answer per se but my understanding is that
Arrow enables computational kernels that can be highly optimized.
I plan to do some work in this direction myself.
- Daniel
Hi,
>
> I wonder if anyone can comment on how Apache Arrow accomplishes, or helps
> accomplish, the following,
[
https://issues.apache.org/jira/browse/ARROW-273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15438920#comment-15438920
]
Daniel Lemire edited comment on ARROW-273 at 8/26/16 1:2
[
https://issues.apache.org/jira/browse/ARROW-273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15438920#comment-15438920
]
Daniel Lemire commented on ARROW-273:
-
If the max value is going to be 2^31-1, th