[Rd] write.csv performance improvements?

2023-03-29 Thread Toby Hocking
Dear R-devel, I did a systematic comparison of write.csv with similar functions, and observed two asymptotic inefficiencies that could be improved. 1. write.csv is quadratic time (N^2) in the number of columns N. Can write.csv be improved to use a linear time algorithm, so it can handle CSV files

[Rd] read.csv quadratic time in number of columns

2023-03-29 Thread Toby Hocking
Dear R-devel, A number of people have observed anecdotally that read.csv is slow for large number of columns, for example: https://stackoverflow.com/questions/7327851/read-csv-is-extremely-slow-in-reading-csv-files-with-large-numbers-of-columns I did a systematic comparison of read.csv with similar

Re: [Rd] Incorrect behavior of ks.test and psmirnov functions with exact=TRUE

2023-03-29 Thread Kurt Hornik
> Alexey Sergushichev writes: Thanks. This is now fixed for the upcoming 4.3.0 release. Best -k > HI, > I've noticed what I think is an incorrect behavior of stats::psmirnov > function and consequently of ks.test when run in an exact mode. > For example: > psmirnov(1, sizes=c(50, 50), z=1: