Re: Pig Output

2011-12-05 Thread Aaron Griffith
I think getmerge will work.

Pig Output

2011-12-05 Thread Aaron Griffith
Using PigStorage(), my Pig script output gets put into partial files on the Hadoop file system. When I use the copyToLocal function from Hadoop, it creates a local directory with all the partial files. Is there a way to copy the partial files from Hadoop into a single local file? Thanks
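For reference, `hadoop fs -getmerge <src> <localdst>` concatenates the files under an HDFS directory into one local file. If the partial files have already been pulled down with copyToLocal, the same result can be approximated locally. What follows is only a minimal sketch under assumptions: the function name `merge_part_files`, its parameters, and the `part-*` naming pattern are illustrative and do not come from the original message.

```python
import glob
import os
import shutil


def merge_part_files(local_dir, merged_path, pattern="part-*"):
    """Concatenate the partial files in local_dir into merged_path.

    Assumes the partial files match `pattern` (hypothetical default) and
    that sorting them by name gives the desired order. merged_path should
    live outside local_dir, or at least not match the pattern.
    Returns the number of partial files that were merged.
    """
    part_paths = sorted(glob.glob(os.path.join(local_dir, pattern)))
    with open(merged_path, "wb") as merged:
        for part_path in part_paths:
            # Stream each part in binary mode so large files are not
            # loaded into memory all at once.
            with open(part_path, "rb") as part:
                shutil.copyfileobj(part, merged)
    return len(part_paths)
```

Because the glob pattern does not match names beginning with a dot or an underscore, bookkeeping files that may sit alongside the parts are skipped. Call it as `merge_part_files("pig_output", "pig_output.txt")`, where both paths are placeholders.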

Re: Help with Pig Script

2011-11-17 Thread Aaron Griffith
the ID or something. If you *do* have an ordered partitioner in your cluster, it's just a matter of specifying the key range.
> On Nov 17, 2011, at 11:16 AM, Aaron Griffith wrote:
> > I am trying to do the following with a PIG script and am having trouble finding

Help with Pig Script

2011-11-17 Thread Aaron Griffith
I am trying to do the following with a PIG script and am having trouble finding the correct syntax.
- I want to use the LOAD function to load a single key/value "row" into a pig object.
- The contents of that row are then flattened into a list of keys.
- I then want to use that list of keys for a

Cassandra Pig with network topology and data centers.

2011-07-29 Thread Aaron Griffith
I currently have a 9 node Cassandra cluster set up as follows:
DC1: Six nodes
DC2: Three nodes
The tokens alternate between the two datacenters. I have Hadoop installed as tasktracker/datanodes on the three Cassandra nodes in DC2. There is another non-Cassandra node that is used as the hadoop n