Re: java.io.IOException: Could not get input splits

2011-09-01 Thread Jian Fang
Thanks. How soon 0.8.5 will be out? Is there any 0.8.5 snapshot version available? On Thu, Sep 1, 2011 at 11:57 AM, Jonathan Ellis wrote: > Sounds like https://issues.apache.org/jira/browse/CASSANDRA-3044, > fixed for 0.8.5 > > On Thu, Sep 1, 2011 at 10:54 AM, Jian Fang >

Re: ColumnFamilyOutputFormat problem

2011-08-17 Thread Jian Fang
not. If so then were is the data going. > > Cheers > > - > Aaron Morton > Freelance Cassandra Developer > @aaronmorton > http://www.thelastpickle.com > > On 17/08/2011, at 1:36 AM, Jian Fang wrote: > > If you look at the source code and you will fi

Re: ColumnFamilyOutputFormat problem

2011-08-16 Thread Jian Fang
our MR job and make sure the > batch_mutation is sent. Sounds like it's not. > > Cheers > > - > Aaron Morton > Freelance Cassandra Developer > @aaronmorton > http://www.thelastpickle.com > > On 12 Aug 2011, at 07:22, Jian Fang wrote: > >

Re: ColumnFamilyOutputFormat problem

2011-08-11 Thread Jian Fang
Morton > Freelance Cassandra Developer > @aaronmorton > http://www.thelastpickle.com > > On 11 Aug 2011, at 14:50, Jian Fang wrote: > > There are data and each Cassandra cluster node holds about 100G. From the > application point of view, if I run the job twice with the same input

Re: ColumnFamilyOutputFormat problem

2011-08-10 Thread Jian Fang
ndra Developer > @aaronmorton > http://www.thelastpickle.com > > On 11 Aug 2011, at 08:20, Jian Fang wrote: > > Hi, > > I am using Cassandra 0.8.2 with Hadoop 0.20.2. My application read a file > and then write about 2.5 million records > to Cassandra. I used ColumnFamilyOutputFo

ColumnFamilyOutputFormat problem

2011-08-10 Thread Jian Fang
Hi, I am using Cassandra 0.8.2 with Hadoop 0.20.2. My application read a file and then write about 2.5 million records to Cassandra. I used ColumnFamilyOutputFormat to write to Cassandra. My Cassandra cluster has three nodes with one Hadoop task tracker on each node. The wired problem is that I on

Re: Cassandra 0.6.8 snapshot problem?

2011-08-02 Thread Jian Fang
We simply copy the snapshot data to another cluster to replace its data directory. This cluster does not have any incoming traffic. I haven't compared the md5 sum yet. Will try that. Thanks, John On Thu, Jul 28, 2011 at 10:27 PM, Zhu Han wrote: > > > On Thu, Jul 28, 2011 at

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
node. Thanks, John On Thu, Jul 28, 2011 at 3:51 PM, Jeremy Hanna wrote: > Just wondering - what consistency level are you using for hadoop reads? > Also, do you have task trackers running on the cassandra nodes so that > reads will be local? > > On Jul 28, 2011, at 2:46 PM, Jian

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
integration is implemented. I rarely saw any timeout exceptions when I use hector to get back data. Thanks, John On Thu, Jul 28, 2011 at 12:45 PM, Jian Fang wrote: > > My current setting is 1. I will try 3. > > Thanks, > > John > > On Thu, Jul 28, 2011 at 12:39 P

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
On Jul 28, 2011, at 11:09 AM, Jian Fang wrote: > > > Hi, > > > > I run Cassandra 0.8.2 and hadoop 0.20.2 on three nodes, each node > includes a Cassandra instance and a hadoop data node. > > I created a simple hadoop job to scan a Cassandra column value in a > column

Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
Hi, I run Cassandra 0.8.2 and hadoop 0.20.2 on three nodes, each node includes a Cassandra instance and a hadoop data node. I created a simple hadoop job to scan a Cassandra column value in a column family and write it to a file system if it meets some conditions. I keep getting the following time

Cassandra 0.6.8 snapshot problem?

2011-07-28 Thread Jian Fang
Hi, We have an old production Cassandra 0.6.8 instance without replica, i.e., the replication factor is 1. Recently, we noticed that the snapshot data we took from this instance are inconsistent with the running instance data. For example, we took snapshot in early July 2011. From the running inst

Re: Cassandra 0.8 questions

2011-05-24 Thread Jian Fang
est way to figure out what to do for your data > organization. > > Victor Kabdebon > http://www.voxnucleus.fr > http://www.victorkabdebon.net > > > 2011/5/24 Jian Fang > >> Does anyone have a good suggestion on my second question? I believe that >> question is

Re: Cassandra 0.8 questions

2011-05-24 Thread Jian Fang
ite performance point of view, what are the general rules to make mutliple column families and when to use a single column family? Thanks again, John On Mon, May 23, 2011 at 5:47 PM, Jian Fang wrote: > Hi, > > I am pretty new to Cassandra and am going to use Cassandra 0.8.0. I have >

Re: Cassandra 0.8 questions

2011-05-23 Thread Jian Fang
u must then call > Mutator#execute to send the batched operations. > > For Hector specific questions, feel free to sign up for > hector-us...@googlegroups.com as well. > > On Mon, May 23, 2011 at 4:47 PM, Jian Fang > wrote: > > Hi, > > > > I am pretty new to Cass

Cassandra 0.8 questions

2011-05-23 Thread Jian Fang
Hi, I am pretty new to Cassandra and am going to use Cassandra 0.8.0. I have two questions (sorry if they are very basic ones): 1) I have a column family to hold many super columns, say 30. When I first insert the data to the column family, do I need to insert each column one at a time or can I i