Are these regexes static? If so, this is easily achieved by embedding your script in Java or any other language that Pig supports: http://pig.apache.org/docs/r0.13.0/cont.html
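As a rough illustration of the scripted approach, here is a plain-Python sketch that generates one FILTER statement per regex from a list. It is not Pig's embedding API: the helper name build_filter_statements and its parameters are hypothetical, and the patterns are the placeholders from the question.

```python
# Hypothetical helper; the name and defaults are illustrative, not part of Pig.
def build_filter_statements(patterns, source="BagName", prefix="Filtered_Data_"):
    """Return one Pig FILTER statement per pattern, numbered from 1."""
    statements = []
    for index, pattern in enumerate(patterns, start=1):
        # Patterns are inserted verbatim, so each one must already be
        # valid inside a single-quoted Pig string literal.
        statements.append(
            "%s%d = FILTER %s BY ($0 matches '%s');"
            % (prefix, index, source, pattern)
        )
    return statements


if __name__ == "__main__":
    # Placeholder patterns taken from the question.
    for line in build_filter_statements(["RegEx-1", "RegEx-2", "RegEx-3"]):
        print(line)
```

The generated lines could be written into the script before your shell wrapper invokes Pig, so the list of regexes lives in one place.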
You could also write a UDF that loops through all the regexes and returns the result.

On Mon, Oct 6, 2014 at 12:44 PM, Ankur Kasliwal <[email protected]> wrote:
> Hi,
>
> I have written a ‘Pig Script’ which is processing Sequence files given as
> input.
> It is working fine but there is one problem mentioned below.
>
> I have repetitive statements in my pig script, as shown below:
>
> - Filtered_Data_1 = FILTER BagName BY ($0 matches 'RegEx-1');
> - Filtered_Data_2 = FILTER BagName BY ($0 matches 'RegEx-2');
> - Filtered_Data_3 = FILTER BagName BY ($0 matches 'RegEx-3');
> - So on…
>
> Question:
> So is there any way by which I can have the above statement written once and
> then loop through all possible “RegEx” and substitute in the Pig script?
>
> For example:
> Filtered_Data_X = FILTER BagName BY ($0 matches 'RegEx'); ( have this
> statement once )
> ( loop through all possible RegEx and substitute value in the statement )
>
> Right now I am calling the Pig script from a shell script, so any way from
> the shell script will also be welcome.
>
> Thanks in advance.
> Happy Pigging!!!!
