Hi all,

I'm trying to read an HBase table using an example from GitHub (
https://github.com/apache/spark/blob/master/examples/src/main/python/hbase_inputformat.py),
but I have two qualifiers in a column family.

Ex.:

 ROW     COLUMN+CELL
 row1    column=f1:1, timestamp=1401883411986, value=value1
 row1    column=f1:2, timestamp=1401883415212, value=value2
 row2    column=f1:1, timestamp=1401883417858, value=value3
 row3    column=f1:1, timestamp=1401883420805, value=value4

When I run hbase_inputformat.py, the following loop prints row1
just once:

    output = hbase_rdd.collect()
    for (k, v) in output:
        print (k, v)

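To make the expectation concrete, here is a minimal standalone sketch in plain Python (no Spark or HBase involved). The `cells` list is only a hand-typed stand-in for the scan output above, and the names `expected` and `observed_keys` are made up for illustration; it says nothing about how the example script works internally.

```python
# Hypothetical stand-in for the scan output above, typed by hand:
# one (row key, qualifier, value) tuple per HBase cell.
cells = [
    ("row1", "f1:1", "value1"),
    ("row1", "f1:2", "value2"),
    ("row2", "f1:1", "value3"),
    ("row3", "f1:1", "value4"),
]

# What I expected the loop to print: one (k, v) pair per cell,
# so row1 shows up twice (once for f1:1 and once for f1:2).
expected = [(row, value) for (row, _qualifier, value) in cells]
for (k, v) in expected:
    print((k, v))

# What I seem to get instead: one record per row key,
# so row1 shows up only once.
observed_keys = sorted({row for (row, _qualifier, _value) in cells})
print(observed_keys)
```

In other words, I expected four records (two of them for row1), but I only get one record for each of the three row keys.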
Am I doing anything wrong?

Thanks in advance.
