spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alan Prando <a...@scanboo.com.br>
Subject Reading from Hbase using python
Date Wed, 12 Nov 2014 19:32:49 GMT
Hi all,

I'm trying to read an hbase table using this an example from github (
https://github.com/apache/spark/blob/master/examples/src/main/python/hbase_inputformat.py),
however I have two qualifiers in a column family.

Ex.:

 ROW COLUMN+CELL  row1 column=f1:1, timestamp=1401883411986, value=value1  row1
column=f1:2, timestamp=1401883415212, value=value2  row2 column=f1:1,
timestamp=1401883417858, value=value3  row3 column=f1:1,
timestamp=1401883420805, value=value4
When I run the code hbase_inputformat.py, the following loop print row1
just once:

output = hbase_rdd.collect()  for (k, v) in output:  print (k, v)
Am I doing anything wrong?

Thanks in advance.

Mime
View raw message