Hi Stephan, thanks for answering, right now I am using an extension of the DelimitedInputFormat, is there a way to merge it with the option 2?
Il giorno 26/giu/2015, alle ore 12:17, Stephan Ewen <se...@apache.org<mailto:se...@apache.org>> ha scritto: There are two ways you can realize that: 1) Create multiple sources and union them. This is easy, but probably a bit less efficient. 2) Override the FileInputFormat's createInputSplits method to take a union of the paths to create a list of all files and fils splits that will be read. Stephan On Fri, Jun 26, 2015 at 12:12 PM, Michele Bertoni <michele1.bert...@mail.polimi.it<mailto:michele1.bert...@mail.polimi.it>> wrote: Hi everybody, is there a way to specify a list of URI (“hdfs://file1”,”hdfs://file2”,…) and open them as different files? I know i may open the entire directory, but i want to be able to select a subset of files in the directory thanks