hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gaurav Agarwal <gauravagarw...@gmail.com>
Subject Custom Retention of data based on Rowkey
Date Thu, 09 Mar 2017 08:38:13 GMT
Hi All,


We have an application that stores information on multiple users/customers/tenants in a common
table. Each tenant has a unique id which we encode in the row key of the records that are
stored in the table.


We want to apply custom (and dynamically updatable) data retention policies for each tenant. 
What would be a reasonable way to achieve that?


Searching through forums, I came across this link that suggests to either write an external
process to retrieve and delete cells based on the retention policy or write a custom compaction



We felt that writing an external scanner for managing retention would be simpler but very
inefficient as it would require getting the entire data set out of the hbase server and then
issuing delete calls back to it.


Does any one know if there has been any recent progress on this aspect of data retention in


Additionally, if I go the route of writing my own custom compaction policy, what would be
the best place to start? Maybe I could copy/extend the “default” hbase compaction policy
and enhance it to look at rowkey inside every Cell to make a call if the cell needs to be







  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message