[ 
https://issues.apache.org/jira/browse/HIVE-20917?focusedWorklogId=836328&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-836328
 ]

ASF GitHub Bot logged work on HIVE-20917:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Jan/23 00:22
            Start Date: 01/Jan/23 00:22
    Worklog Time Spent: 10m 
      Work Description: github-actions[bot] commented on PR #3718:
URL: https://github.com/apache/hive/pull/3718#issuecomment-1368304484

   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.




Issue Time Tracking
-------------------

    Worklog Id:     (was: 836328)
    Time Spent: 50m  (was: 40m)

> OpenCSVSerde quotes all columns
> -------------------------------
>
>                 Key: HIVE-20917
>                 URL: https://issues.apache.org/jira/browse/HIVE-20917
>             Project: Hive
>          Issue Type: Improvement
>          Components: Serializers/Deserializers
>            Reporter: nicolas paris
>            Priority: Minor
>              Labels: pull-request-available
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> The OpenCSVSerde produces a CSV with all its columns quoted 
> no matter of they type or if the string columns contain a separator or not.
>  
> The problem is some readers (such postgresql) are not compatible with
> such CSV, in particular when bulk loading them thought COPY statement.
>  
> I propose a new CsvSerde, based on a Univocity Parser (wich is used by Apache 
> Spark)
> that has been described a 2 times faster thant OpenCSV. 
> [https://github.com/uniVocity/csv-parsers-comparison] . This new CsvSerde 
> whould only quote columns when needed.
>  
> Regards,



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to