You can do that using FileInputFormat.addInputPath 2015-05-27 10:41 GMT+02:00 ayan guha <[email protected]>:
> What about /blah/*/blah/out*.avro? > On 27 May 2015 18:08, "ÐΞ€ρ@Ҝ (๏̯͡๏)" <[email protected]> wrote: > >> I am doing that now. >> Is there no other way ? >> >> On Wed, May 27, 2015 at 12:40 PM, Akhil Das <[email protected]> >> wrote: >> >>> How about creating two and union [ sc.union(first, second) ] them? >>> >>> Thanks >>> Best Regards >>> >>> On Wed, May 27, 2015 at 11:51 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) <[email protected]> >>> wrote: >>> >>>> I have this piece >>>> >>>> sc.newAPIHadoopFile[AvroKey[GenericRecord], NullWritable, >>>> AvroKeyInputFormat[GenericRecord]]( >>>> "/a/b/c/d/exptsession/2015/05/22/out-r-*.avro") >>>> >>>> that takes ("/a/b/c/d/exptsession/2015/05/22/out-r-*.avro") this as >>>> input. >>>> >>>> I want to give a second directory as input but this is a invalid syntax >>>> >>>> >>>> that takes ("/a/b/c/d/exptsession/2015/05/*22*/out-r-*.avro", >>>> "/a/b/c/d/exptsession/2015/05/*21*/out-r-*.avro") >>>> >>>> OR >>>> >>>> ("/a/b/c/d/exptsession/2015/05/*22*/out-r-*.avro, >>>> /a/b/c/d/exptsession/2015/05/*21*/out-r-*.avro") >>>> >>>> >>>> Please suggest. >>>> >>>> >>>> >>>> -- >>>> Deepak >>>> >>>> >>> >> >> >> -- >> Deepak >> >>
