Hi WILL, Yes, because the spark-sql use hive metastore as the base, spark-sql just optimizes the query entry. For more details, you can check the relative config as the branch of kylin4_on_cloud.
-- Best regards. Tengting Xu At 2022-10-11 13:28:55, "Will Glass-Husain" <wgl...@forio.com> wrote: Thank you. I uploaded the file to s3, then created an external table using spark-sql based on this data. I was then able to access it for syncing in Kylin. Question -- does the spark-sql and the hive store the data in the same place? In other words, is he result the same of using spark sql or hive to upload the csv? Thanks, WILL On Mon, Oct 10, 2022 at 7:07 PM Mukvin <boyboys...@163.com> wrote: Hi WILL, I have checked the CSV uploading feature locally with your sample.csv file. And I got the same error Yes, the best way is to load into hive directly. Two Method: 1. you can follow the https://kylin.apache.org/blog/2022/04/20/kylin4-on-cloud-part1/ to check the command to do your custom tables samely to do so. 2. As your scene, I suggest you upload 3GB files to an AWS bucket or the `Kylin node` of the ec2 instance and use the hive command to set the data source to map the 3GB CSV files. Examples: https://stackoverflow.com/questions/19320611/hadoop-hive-loading-data-from-csv-on-a-local-machine. -- Best regards. Tengting Xu At 2022-10-11 07:05:34, "Will Glass-Husain" <wgl...@forio.com> wrote: >Hi, > >I have a 3GB CSV file with about 90 columns of data I want to load into >Kylin. I have set up Kylin cloud based on tutorial using kylin4_on_cloud >branch. > >Are there simple instructions for a new user as to the best method to load >the CSV file? I tried the online csv loader with a small 5 line file and >it doesn't work. (see KYLIN-5276). > >I assume the best way is to load into hive directly? Can someone point me >to simple instructions? > >Much appreciated. Trying to evaluate Kylin to see if it can speed up some >online data analysis we are trying to do. > >Best regards, WILL > >-- >William Glass-Husain /forio | +1 (415) 440 7500 x802 | forio.com ><http://www.forio.com/> -- William Glass-Husain /forio | +1 (415) 440 7500 x802 | forio.com