i think getmerge will work.
Using PigStorage() my pig script output gets put into partial files on the
hadoop
file system.
When I use the copyToLocal fuction from Hadoop it creates a local directory with
all the partial files.
Is there a way to copy the partial files from hadoop into a single local file?
Thanks
the ID or something. If you *do*
have an ordered partitioner in
> your cluster, it's just a matter of specifying the key range.
>
> On Nov 17, 2011, at 11:16 AM, Aaron Griffith wrote:
>
> > I am trying to do the following with a PIG script and am having trouble
finding
&g
I am trying to do the following with a PIG script and am having trouble finding
the correct syntax.
- I want to use the LOAD function to load a single key/value "row" into a pig
object.
- The contents of that row is then flattened into a list of keys.
- I then want to use that list of keys for a
I currently have a 9 node cassandra cluster setup as follows:
DC1: Six nodes
DC2: Three nodes
The tokens alternate between the two datacenters.
I have hadoop installed as tasktracker/datanodes on the
three cassandra nodes in DC2.
There is another non cassandra node that is used as the hadoop n