Hi,

Thanks for pointing out this! I agree that the R-related CI
jobs covers more areas missed by the C++-related CI
jobs. Let's add more C++-related CI jobs to cover R
use-cases before we revisit splitting R out.

BTW, I remembered that there is a similar discussion when I
dropped support for Ubuntu 20.04:
https://github.com/apache/arrow/pull/45507#discussion_r1955436604


apache/arrow-java also faced a similar problem. The
C++-related CI jobs don't cover build options for JNI. I'll
add jobs for it:
https://github.com/apache/arrow/issues/45683


Thanks,
-- 
kou

In <CAOCv4himzNFYFwx=6-o1szgoeooihrghfthai+gksurjdi4...@mail.gmail.com>
  "Re: [DISCUSS] Split R release process" on Mon, 3 Mar 2025 09:32:19 -0500,
  Neal Richardson <neal.p.richard...@gmail.com> wrote:

> Thanks for raising this, Kou. I'm personally torn on this because I see
> some of the upsides of splitting R out, particularly at the project's state
> of maturity, but it's also not as simple as Rust or Java or others we've
> split out in the past because of the hard dependency on the C++ libraries.
> It's not just about integration testing the IPC format.
> 
> For better or worse, I can remember tons of instances where the R-related
> CI jobs have caught something in a C++ PR because they test with different
> compilers and toolchains. If we split the projects and all of the R CI jobs
> are only running on the arrow-r repo, does that mean that the R maintainers
> will be continually finding CI failures in the tests that build with the
> latest version of the C++ library and filing bug reports back for the C++
> project? Either way, it seems that the monorepo would want to keep some R
> testing jobs in crossbow to be able to validate changes, at least to be
> able to confirm that the PR for the issue that the R maintainers filed
> fixes the issue. Maybe this is the way it should be, but it's not clear
> that it reduces the collective maintenance burden.
> 
> Just my thoughts based on the historical perspective, I'm happy to defer to
> the judgment of those who are currently shouldering that maintenance burden.
> 
> Neal
> 
> On Sun, Mar 2, 2025 at 7:53 PM Sutou Kouhei <k...@clear-code.com> wrote:
> 
>> Hi,
>>
>> This is a similar discussion to the "[DISCUSS] Split Go
>> release process" thread[1] and the "[DISCUSS] Split Java
>> release process" thread[2]:
>>
>> [1] https://lists.apache.org/thread/fstyfvzczntt9mpnd4f0b39lzb8cxlyf
>> [2] https://lists.apache.org/thread/b99wp2f3rjhy09sx7jqvrfqjkqn9lnyy
>>
>> We've split them and they were released from separated
>> repositories.
>>
>> Let's discuss the next target.
>>
>> We raised JavaScript as the next candidate in the Java
>> discussion[3] but we may not find one or more active release
>> managers for JavaScript.
>>
>> [3] https://lists.apache.org/thread/bdko84zy72nlg3k82t772f7pq6zpd0sz
>>
>> I propose R as the next candidate because:
>>
>> * We have many active committers and PMC members who can
>>   focus on R
>> * The current R release process is semi-separated
>>   * In general, we release R packages to CRAN by non-trivial
>>     release process after our monorepo release.
>>     e.g.: https://github.com/apache/arrow/issues/45581
>> * The R bindings can also work with old C++ versions
>>   * The R bindings don't need to align with the monorepo
>>     versioning. The R bindings can avoid major version up
>>     per 3-4 months.
>> * We have many R related CI jobs. If we split the R
>>   bindings, we can remove many CI jobs from monorepo.
>>
>>
>> What do you think about this?
>>
>>
>> Thanks,
>> --
>> kou
>>

Reply via email to