Arrow R package development sync call - Thursday 9th Feb at 17:30 UTC

2023-02-08 Thread Nic Crane
The next Arrow R package dev community call is on Thursday 9th February at 17:30 UTC. You can add items to the agenda on the day, or by adding a comment to the meeting notes here: https://docs.google.com/document/d/1nSIfJw8mfqtvScqvSVqmktpWff80pFmkqiZT7nTtiDo/edit?usp=sharing Joining instructions

Re: Reason behind using Apache Nightlies directory instead of Apache Nexus Snapshot Repository

2023-02-08 Thread Matthew Benedict de Detrich
> But as I am not a Java developer I would be open to changes if there is a clear benefit for the devs using the nightlies.
So the benefits of using the Apache Nexus repository I can list quickly here:
* Since the repo is managed by Sonatype Nexus software, it handles a lot of things that are cons

Re: Reason behind using Apache Nightlies directory instead of Apache Nexus Snapshot Repository

2023-02-08 Thread Matthew Benedict de Detrich
> If we use the snapshot repository, can we simplify this? https://arrow.apache.org/docs/dev/developers/java/building.html#installing-nightly-packages
Yes, in fact you can just defer the documentation to https://infra.apache.org/repository-faq.html#basic
On Wed, Jan 25, 2023 at 9:28 PM Sutou Kouh

Re: Reason behind using Apache Nightlies directory instead of Apache Nexus Snapshot Repository

2023-02-08 Thread Jacob Wujciak
Yes, the Java nightlies alone are 450 MB. I have not checked, but this could be due to JNI bundling the C++ binary multiple times. Also, there are quite a large number of Java artifacts, but I have no deeper knowledge about that outside of the CI involved :D
Matthew Benedict de Detrich wrote on Wed, 8 F

Re: Reason behind using Apache Nightlies directory instead of Apache Nexus Snapshot Repository

2023-02-08 Thread Jacob Wujciak
I asked INFRA and there are no space restrictions in place, so we could move the builds there.
On Wed, Feb 8, 2023 at 7:21 PM Jacob Wujciak wrote:
> Yes the java nightlies alone are 450mb.
>
> Without checking this could be due to JNI bundling the c++ binary multiple
> times? Also there are quite

Re: Reason behind using Apache Nightlies directory instead of Apache Nexus Snapshot Repository

2023-02-08 Thread Sutou Kouhei
Hi, I noticed that the *-sources.jar files also contain *.{so,dylib,dll}, e.g.: https://nightlies.apache.org/arrow/java/org/apache/arrow/arrow-gandiva/12.0.0-SNAPSHOT/arrow-gandiva-12.0.0-SNAPSHOT-sources.jar We can remove them from *-sources.jar, which would save 150 MB. Thanks, -- kou In
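One way to check which native libraries a JAR carries is to list its entries, since a JAR is a ZIP archive. The following is a minimal sketch using only the Python standard library; the helper names and the in-memory demo archive are hypothetical and are not part of the Arrow build. It matches only names ending in .so, .dylib, or .dll, so versioned names such as libfoo.so.1 would be missed.

```python
import io
import zipfile

# File suffixes treated as native libraries (the pattern named in the message).
NATIVE_SUFFIXES = (".so", ".dylib", ".dll")


def native_entries(jar):
    """Return (name, uncompressed size in bytes) for native libraries in a JAR.

    `jar` may be a filesystem path or a binary file-like object.
    """
    with zipfile.ZipFile(jar) as zf:
        return [
            (info.filename, info.file_size)
            for info in zf.infolist()
            if info.filename.lower().endswith(NATIVE_SUFFIXES)
        ]


def build_demo_jar():
    """Build a small in-memory archive standing in for a *-sources.jar (hypothetical contents)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("org/example/Demo.java", "class Demo {}")
        zf.writestr("libdemo_jni.so", b"\0" * 1024)
        zf.writestr("demo_jni.dll", b"\0" * 2048)
    buf.seek(0)
    return buf


found = native_entries(build_demo_jar())
total_bytes = sum(size for _, size in found)
for name, size in found:
    print(f"{name}: {size} bytes")
print(f"total: {total_bytes} bytes")
```

To inspect a real artifact, pass the path of a downloaded JAR to native_entries instead of the demo archive.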

Re: R arrow package question

2023-02-08 Thread Angelo Casalan
Hi Everyone, Thanks for the responses. I hope you are all well.
Hi Dewey. As to the problematic column error message:
Invalid: Could not open CSV input source 'folder/name.CSV': Invalid: In CSV column #30: Row #5: CSV conversion error to int32: invalid value ''
I manually opened the CSV and saw
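The error above says an empty string ('') could not be converted to int32. A minimal sketch for locating such empty cells with the Python standard library follows. It does not use Arrow, the function name and sample data are hypothetical, and the 1-based row and column numbering used here is an assumption that may not match how Arrow counts rows and columns in its message.

```python
import csv
import io


def empty_cells(lines, column_number):
    """Yield 1-based data-row numbers whose cell in the given 1-based column is empty.

    The first row is treated as a header and is not counted.
    Rows that are too short to contain the column are reported as empty.
    """
    reader = csv.reader(lines)
    next(reader, None)  # skip the header row
    for row_number, row in enumerate(reader, start=1):
        cell = row[column_number - 1] if len(row) >= column_number else ""
        if cell.strip() == "":
            yield row_number


# Hypothetical sample: the second column is meant to hold integers.
sample = "id,count\n1,10\n2,\n3,7\n4,\n"
rows_with_gaps = list(empty_cells(io.StringIO(sample), column_number=2))
print(rows_with_gaps)
```

For a file on disk, open it with open(path, newline="") and pass the file object in place of the StringIO sample.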