[ 
https://issues.apache.org/jira/browse/FLINK-1872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492599#comment-14492599
 ] 

Stephan Ewen commented on FLINK-1872:
-------------------------------------

Reading a CSV always requires a schema.

It is possible to register CSVs with a schema at HCatalog and then use a 
HCatalog input format to read them, without specifying the schema at the read 
command.

https://github.com/apache/flink/blob/master/flink-staging/flink-hcatalog/src/main/java/org/apache/flink/hcatalog/java/HCatInputFormat.java

> How can generation dataset in flink automatic depend on number of filed and 
> data type 
> --------------------------------------------------------------------------------------
>
>                 Key: FLINK-1872
>                 URL: https://issues.apache.org/jira/browse/FLINK-1872
>             Project: Flink
>          Issue Type: Bug
>            Reporter: hagersaleh
>
> when read csv file want generate dataset function automatic
> Example write this Mnola
> but want generate this code automatic for any csv flie input
> final ExecutionEnvironment env = 
> ExecutionEnvironment.getExecutionEnvironment();
>  
> DataSet<Tuple5<Integer, String, String, Integer, Double>> customers = 
> getCustomerDataSet(env);
> private static DataSet<Tuple5<Integer, String, String, Integer, Double>> 
> getCustomerDataSet(ExecutionEnvironment env) {
>               return env.readCsvFile(customerPath)
>                                       .fieldDelimiter("|")
>                                       .includeFields("11110100")
>                                       .types(Integer.class, String.class, 
> String.class, Integer.class, Double.class);
>       }



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to