Hi aron. I think that if the pig is able to support to map it, the same job could be represented in java code itself.
I believe that we can call a map function by loading a file and cassandra at the same time. Ps) I dont need to join from them. I just wanna compare each keys which are read from them. Thanks. 2011. 1. 17. 오전 5:56에 "Aaron Morton" <[email protected]>님이 작성: > The Pig readers are just the same as any other data source so you should be able to mix and match them as you please > > Tthe sample pig script in contrib/pig/example-script.pig specifies the to use the CassandraStorage source when loading data > > rows = LOAD 'cassandra://Keyspace1/Standard1' USING CassandraStorage(); > > The LOAD command in Pig Latin supports a USING keyword to identify the data source type > http://pig.apache.org/docs/r0.8.0/piglatin_ref2.html#Load%2FStore+Functions > > I'm less familiar with Hadoop, but it should be possible. AFAIK though it's going to be easier to do a join between data sources with Pig. > > Hope that helps. > Aaron > > > > On 15 Jan, 2011,at 06:00 PM, 김준영 <[email protected]> wrote: > > hi, > > cassandra supports hadoop to map & reduce from cassandra. > > now I am digging to find out a way to map from a file and cassandra together. > > I mean if both of them are files in my disk, it is possible by using splits. > > but, in this kind of a situtation, which way is posssible? > > for example. > > in a cassandra) > key1| value1 | value2 > key2| value3 | value4 > key3| value5 | value6 > > in a file) > key1| value1 | value2 > key2| value7 | value4 > key3| value7 | value6 > > > the size of both are very hugh. > I want to get a result from diff from both of them. > > which keys are deleted? > which values are changed? > > thanks.
