Hi,

At the bottom please find the session_info.

When creating north of 200-300 batches, the writing to the arrow file crashes R 
– it doesn’t even show an error message. Rstudio just aborts.

I have the feeling that maybe each batch becomes a stream and R has issues with 
the connections, but that’s a total guess.

Any help would be appreciated.

##

Here is the function. When running it with 3000 it crashes immediately.

Before that I ran it with 100, and then increased it slowly, and then it 
randomly crashed again.

write_arrow_custom(data.frame(A=c(1:100000),B=c(1:100000)),'C:/Temp/test.arrow',3000)

write_arrow_custom <- function(df,targetarrow,nrbatches) {
  ct <- nrbatches
  idxs <- c(0:ct)/ct*nrow(df)
  idxs <- round(idxs,0) %>% as.integer()
  idxs[length(idxs)] <- nrow(df)
  df_nav <- idxs %>% as.data.frame() %>% rename(colfrom=1) %>% 
mutate(colto=lead(colfrom)) %>% mutate(colfrom=colfrom+1) %>% 
filter(!is.na(colto)) %>% mutate(R=row_number())
  stopifnot(df_nav %>% mutate(chk=colto-colfrom+1) %>% '$'('chk') %>% 
sum()==nrow(df))
  table_df <- Table$create(name=rownames(df[1,]),df[1,])
  writer <- RecordBatchFileWriter$create(targetarrow,table_df$schema)
  df_nav %>% dlply(c('R'),function(df_nav){
    catl(glue('{df_nav$colfrom[1]}:{df_nav$colto[1]} / {df_nav$R[1]}...'))
    tmp <- df[df_nav$colfrom[1]:df_nav$colto[1],]
    writer$write_batch(record_batch(name = rownames(tmp), tmp))
    NULL
  }) -> batch_lst
  writer$close()
  rm(batch_lst)
  gc()
}

[cid:image001.jpg@01D5C628.B003ACC0]

##



- Session info 
-----------------------------------------------------------------------------------------------------------------------------------------------------------
setting  value
 version  R version 3.6.1 (2019-07-05)
os       Windows 10 x64
 system   x86_64, mingw32
 ui       RStudio
 language (EN)
 collate  English_United States.1252
 ctype    English_United States.1252
 tz       America/New_York
 date     2020-01-08

- Packages 
---------------------------------------------------------------------------------------------------------------------------------------------------------------
! package      * version     date       lib source
   acepack        1.4.1       2016-10-29 [1] CRAN (R 3.6.1)
   arrow        * 0.15.1.1    2019-11-05 [1] CRAN (R 3.6.2)
   askpass        1.1         2019-01-13 [1] CRAN (R 3.6.1)
   assertthat     0.2.1       2019-03-21 [1] CRAN (R 3.6.1)
   backports      1.1.5       2019-10-02 [1] CRAN (R 3.6.1)
   base64enc      0.1-3       2015-07-28 [1] CRAN (R 3.6.0)
   bit            1.1-14      2018-05-29 [1] CRAN (R 3.6.0)
   bit64          0.9-7       2017-05-08 [1] CRAN (R 3.6.0)
   blob           1.2.0       2019-07-09 [1] CRAN (R 3.6.1)
   callr          3.3.1       2019-07-18 [1] CRAN (R 3.6.1)
   cellranger     1.1.0       2016-07-27 [1] CRAN (R 3.6.1)
   checkmate      1.9.4       2019-07-04 [1] CRAN (R 3.6.1)
   cli            1.1.0       2019-03-19 [1] CRAN (R 3.6.1)
   cluster        2.1.0       2019-06-19 [2] CRAN (R 3.6.1)
   codetools      0.2-16      2018-12-24 [2] CRAN (R 3.6.1)
   colorspace     1.4-1       2019-03-18 [1] CRAN (R 3.6.1)
   commonmark     1.7         2018-12-01 [1] CRAN (R 3.6.1)
   crayon         1.3.4       2017-09-16 [1] CRAN (R 3.6.1)
   credentials    1.1         2019-03-12 [1] CRAN (R 3.6.2)
   curl         * 4.2         2019-09-24 [1] CRAN (R 3.6.1)
   data.table     1.12.2      2019-04-07 [1] CRAN (R 3.6.1)
   DBI          * 1.0.0       2018-05-02 [1] CRAN (R 3.6.1)
   desc           1.2.0       2018-05-01 [1] CRAN (R 3.6.1)
   devtools     * 2.2.0       2019-09-07 [1] CRAN (R 3.6.1)
   digest         0.6.23      2019-11-23 [1] CRAN (R 3.6.1)
   dplyr        * 0.8.3       2019-07-04 [1] CRAN (R 3.6.1)
   DT             0.9         2019-09-17 [1] CRAN (R 3.6.1)
   ellipsis       0.3.0       2019-09-20 [1] CRAN (R 3.6.1)
   evaluate       0.14        2019-05-28 [1] CRAN (R 3.6.1)
   foreign        0.8-71      2018-07-20 [2] CRAN (R 3.6.1)
   Formula      * 1.2-3       2018-05-03 [1] CRAN (R 3.6.0)
   fs             1.3.1       2019-05-06 [1] CRAN (R 3.6.1)
   fst          * 0.9.0       2019-04-09 [1] CRAN (R 3.6.1)
   future       * 1.15.0-9000 2019-11-19 [1] Github 
(HenrikBengtsson/future@bc241c7)
   ggplot2      * 3.2.1       2019-08-10 [1] CRAN (R 3.6.1)
   globals        0.12.4      2018-10-11 [1] CRAN (R 3.6.0)
   glue         * 1.3.1       2019-03-12 [1] CRAN (R 3.6.1)
   gridExtra      2.3         2017-09-09 [1] CRAN (R 3.6.1)
   gt           * 0.1.0       2019-11-27 [1] Github (rstudio/gt@284bbe5)
   gtable         0.3.0       2019-03-25 [1] CRAN (R 3.6.1)
   Hmisc        * 4.3-0       2019-11-07 [1] CRAN (R 3.6.1)
   htmlTable      1.13.2      2019-09-22 [1] CRAN (R 3.6.1)
 D htmltools      0.3.6.9004  2019-09-20 [1] Github (rstudio/htmltools@c49b29c)
   htmlwidgets    1.3         2018-09-30 [1] CRAN (R 3.6.1)
   jsonlite     * 1.6         2018-12-07 [1] CRAN (R 3.6.1)
   knitr          1.25        2019-09-18 [1] CRAN (R 3.6.1)
   lattice      * 0.20-38     2018-11-04 [2] CRAN (R 3.6.1)
   latticeExtra   0.6-28      2016-02-09 [1] CRAN (R 3.6.1)
   lazyeval       0.2.2       2019-03-15 [1] CRAN (R 3.6.1)
   lifecycle      0.1.0       2019-08-01 [1] CRAN (R 3.6.1)
   listenv        0.7.0       2018-01-21 [1] CRAN (R 3.6.1)
   lubridate    * 1.7.4       2018-04-11 [1] CRAN (R 3.6.1)
   magrittr     * 1.5         2014-11-22 [1] CRAN (R 3.6.1)
   Matrix         1.2-17      2019-03-22 [2] CRAN (R 3.6.1)
   memoise        1.1.0       2017-04-21 [1] CRAN (R 3.6.1)
   munsell        0.5.0       2018-06-12 [1] CRAN (R 3.6.1)
   nnet           7.3-12      2016-02-02 [2] CRAN (R 3.6.1)
   openssl        1.4.1       2019-07-18 [1] CRAN (R 3.6.1)
   outliers     * 0.14        2011-01-24 [1] CRAN (R 3.6.0)
   pillar         1.4.2       2019-06-29 [1] CRAN (R 3.6.1)
   pkgbuild       1.0.5       2019-08-26 [1] CRAN (R 3.6.1)
   pkgconfig      2.0.2       2018-08-16 [1] CRAN (R 3.6.1)
   pkgload        1.0.2       2018-10-29 [1] CRAN (R 3.6.1)
   plyr         * 1.8.4       2016-06-08 [1] CRAN (R 3.6.1)
   prettyunits    1.0.2       2015-07-13 [1] CRAN (R 3.6.1)
   processx       3.4.1       2019-07-18 [1] CRAN (R 3.6.1)
   pryr         * 0.1.4       2018-02-18 [1] CRAN (R 3.6.1)
   ps             1.3.0       2018-12-21 [1] CRAN (R 3.6.1)
   purrr        * 0.3.2       2019-03-15 [1] CRAN (R 3.6.1)
   R6           * 2.4.1       2019-11-12 [1] CRAN (R 3.6.1)
   RColorBrewer   1.1-2       2014-12-07 [1] CRAN (R 3.6.0)
   Rcpp           1.0.3       2019-11-08 [1] CRAN (R 3.6.1)
   readxl       * 1.3.1       2019-03-13 [1] CRAN (R 3.6.1)
   remotes        2.1.0       2019-06-24 [1] CRAN (R 3.6.1)
   rlang        * 0.4.2       2019-11-23 [1] CRAN (R 3.6.1)
   rmarkdown    * 2.0.3       2019-12-19 [1] Github (rstudio/rmarkdown@26cc3b1)
   RODBC        * 1.3-16      2019-09-03 [1] CRAN (R 3.6.1)
   roxygen2     * 6.1.1       2018-11-07 [1] CRAN (R 3.6.1)
   rpart          4.1-15      2019-04-12 [2] CRAN (R 3.6.1)
   rprojroot      1.3-2       2018-01-03 [1] CRAN (R 3.6.1)
   RSQLite      * 2.1.2       2019-07-24 [1] CRAN (R 3.6.1)
   rstudioapi     0.10        2019-03-19 [1] CRAN (R 3.6.1)
   scales         1.0.0       2018-08-09 [1] CRAN (R 3.6.1)
   sessioninfo    1.1.1       2018-11-05 [1] CRAN (R 3.6.1)
   slide        * 0.0.0.9002  2019-11-27 [1] Github (DavisVaughan/slide@92e8e02)
   ssh            0.6         2019-04-09 [1] CRAN (R 3.6.2)
   stringi        1.4.3       2019-03-12 [1] CRAN (R 3.6.0)
   stringr      * 1.4.0       2019-02-10 [1] CRAN (R 3.6.1)
   survival     * 2.44-1.1    2019-04-01 [2] CRAN (R 3.6.1)
   testthat       2.2.1       2019-07-25 [1] CRAN (R 3.6.1)
   tibble         2.1.3       2019-06-06 [1] CRAN (R 3.6.1)
   tidyr        * 1.0.0       2019-09-11 [1] CRAN (R 3.6.1)
   tidyselect     0.2.5       2018-10-11 [1] CRAN (R 3.6.1)
   usethis      * 1.5.1       2019-07-04 [1] CRAN (R 3.6.1)
   varhandle    * 2.0.3       2018-07-04 [1] CRAN (R 3.6.0)
   vctrs          0.2.0.9007  2019-11-27 [1] Github (r-lib/vctrs@945809e)
   withr          2.1.2       2018-03-15 [1] CRAN (R 3.6.1)
   xfun           0.9         2019-08-21 [1] CRAN (R 3.6.1)
   xml2         * 1.2.2       2019-08-09 [1] CRAN (R 3.6.1)
   xts          * 0.11-2      2018-11-05 [1] CRAN (R 3.6.1)
   zoo          * 1.8-6       2019-05-28 [1] CRAN (R 3.6.1)

[1] C:/Users/cklar/Desktop/R packages
[2] C:/Program Files/R/R-3.6.1/library

P -- Loaded and on-disk path mismatch.
D -- DLL MD5 mismatch, broken installation.


Christian Klar
TFG Asset Management
Tetragon Financial Management
399 Park Avenue, 22nd Floor | New York, NY 10022 | United States
Direct: +1 212 359 7369 | Main: +1 212 359 7300 | Mobile: +1 607 216 5045
ck...@tetragoninv.com<mailto:ck...@tetragoninv.com>
www.tetragoninv.com<http://www.tetragoninv.com>


This communication and all or some of the information contained therein may be 
confidential. If you have received this communication in error, please destroy 
all electronic and paper copies and notify the sender immediately. Unless 
specifically indicated, this communication is not a confirmation, an offer to 
sell or solicitation of any offer to buy any financial product, or an official 
statement of Tetragon Financial Group or its affiliates. TFG Asset Management 
L.P. and Tetragon Financial Management LP are registered as investment advisers 
under the U.S. Investment Advisers Act of 1940.


Reply via email to