Re: R arrow package question

2023-02-09 Thread Nic Crane
Hi Angelo, I think what might be happening here is that you have space characters in your integer column, which are causing problems. I created what could be a reproducible example of your problem at: https://gist.github.com/thisisnic/af265166d5cd1ebce605cf3e478ee6d8 In short, can you try includ

Re: R arrow package question

2023-02-08 Thread Angelo Casalan
Hi Everyone, Thanks for the responses. I hope you are all well. Hi Dewey. As to the problematic column error message: Invalid: Could not open CSV input source 'folder/name.CSV': Invalid: In CSV column #30: Row #5: CSV conversion error to int32: invalid value '' I manually opened the csv and saw

Re: R arrow package question

2023-02-01 Thread Dewey Dunnington
In case it hasn't already been mentioned here, I wonder if manually setting `schema()` would help. You're correct that the invalid value isn't scientific notation (i.e., it's a blank string) so maybe that column should be a string column instead. You could get the guessed schema from the original o

Re: R arrow package question

2023-01-31 Thread Nic Crane
Hi Angelo, The original code with just `open_dataset()` works as it's created a dataset without actually pulling the data into your R session. The subsequent commands you tried (i.e. involving `collect()` read in the files, resulting in an error when the data is read in. It looks like there's an

Re: R arrow package question

2023-01-30 Thread Angelo Casalan
Hi Jacob, Thanks. To provide some specifics on my query: 1.which version of arrow are you running? - 10.0.1 2. The error message provides an exact col,row position, have you checked the value there? Yes. It is int64. This is after running open_dataset without specifying schema: ''' arrow<-open_d

Re: R arrow package question

2023-01-26 Thread Jacob Wujciak
Hello Angelo, just for completeness sake, which version of arrow are you running? The error message provides an exact col,row position, have you checked the value there? That could help pinpoint the cause of the error. "Invalid: In CSV column #12: Row #580. CSV conversion error to int64: invalid v

R arrow package question

2023-01-26 Thread Angelo Casalan
Hi, I hope you are well. I am migrating to arrow from disk.frame in RStudio. I am really impressed with how fast arrow compared to disk.frame but I need help to solve some errors. I wish to ask how I can resolve this error: "CSV conversion error to int64: invalid value" To give an idea

R arrow package question

2023-01-26 Thread Angelo Casalan
Hi , I hope you are well. I wish to ask how I can resolve this error: "CSV conversion error to int64: invalid value" To give an idea of my dataset. I have 4 csvs all placed in a local folder. The code below worked when importing: arrow<-open_dataset( sources="csv location", format="csv")