The Pig readers are just the same as any other data source, so you should be able to mix and match them as you please.

The sample Pig script in contrib/pig/example-script.pig specifies the CassandraStorage source when loading data:

rows = LOAD 'cassandra://Keyspace1/Standard1' USING CassandraStorage();

The LOAD command in Pig Latin supports a USING keyword to identify the data source type:
http://pig.apache.org/docs/r0.8.0/piglatin_ref2.html#Load%2FStore+Functions

I'm less familiar with Hadoop, but it should be possible. AFAIK, though, it's going to be easier to do a join between data sources with Pig.
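
As a rough, untested sketch of that kind of join: the file path 'data/rows.txt', the '|' delimiter, and the field names key, v1, and v2 are placeholders, and it assumes CassandraStorage returns the row key as the first field of each tuple and that keys from both sources compare equal as byte arrays.

```pig
-- Rows from Cassandra, loaded the same way as in the example script.
cass_rows = LOAD 'cassandra://Keyspace1/Standard1' USING CassandraStorage();

-- Rows from a plain file (hypothetical path, delimiter, and field names).
file_rows = LOAD 'data/rows.txt' USING PigStorage('|') AS (key, v1, v2);

-- Group both sources by row key. Each output tuple is
-- (group, bag of file_rows tuples, bag of cass_rows tuples).
grouped = COGROUP file_rows BY key, cass_rows BY $0;

-- Keep the groups where one side has no matching row.
only_in_file = FILTER grouped BY IsEmpty(cass_rows);
only_in_cass = FILTER grouped BY IsEmpty(file_rows);

-- Project just the keys that are missing from the other source.
file_only_keys = FOREACH only_in_file GENERATE group;
cass_only_keys = FOREACH only_in_cass GENERATE group;

DUMP file_only_keys;
DUMP cass_only_keys;
```

This only covers keys that are missing on one side. Finding changed values would, I believe, also mean unpacking the column bag in the second field of each Cassandra row and comparing it against v1 and v2.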

Hope that helps. 
Aaron
 


On 15 Jan, 2011, at 06:00 PM, 김준영 <juneng...@gmail.com> wrote:

Hi,

Cassandra supports Hadoop, so map & reduce jobs can read from Cassandra.

Now I am trying to find a way to map over a file and Cassandra together.

I mean, if both of them were files on my disk, it would be possible by using splits.

But in this kind of situation, which way is possible?

For example:

In Cassandra:
key1| value1 | value2
key2| value3 | value4
key3| value5 | value6

In a file:
key1| value1 | value2
key2| value7 | value4
key3| value7 | value6


Both of them are very large.
I want to get a diff of the two.

Which keys were deleted?
Which values were changed?

Thanks.
