gora-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kevin Ratnasekera (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (GORA-485) Apache Kudu datastore for Gora
Date Wed, 13 Mar 2019 12:50:00 GMT

    [ https://issues.apache.org/jira/browse/GORA-485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16791658#comment-16791658

Kevin Ratnasekera commented on GORA-485:

Hi John, 

Thanks for your interest in project. Apache Gora basically provides key value based datastore
abstraction to persist, query over real databases/backends like Eg:- HBase, such persisted
data can be analysed using Hadoop, Spark. Official website [2] and wiki [1] has very good
material to get things started. 

Try Go through Apache Gora source code and understand how datastore abstraction is implemented
other datastores like Eg:- HBase. Then try to understand  Kudu data model [3] and see how
you can apply key value based abstraction here. 

Kudu is written from C++, however you don't to deal native libraries. There java client [4]
which you can use to talking to Kudu server to perform all the datastore functions. However
when you write test cases you will need to start embedded Kudu server, at there you need to
come with a solution which you can propose. 

I suggest you to go through source code, build it and start using it. Once you done fix some
issues before you make a proposal as warm up. Subscribe to our dev list which you can start
a discussion over there, you can discuss these things in more wider audience.

[1] https://cwiki.apache.org/confluence/display/GORA/
[2] http://gora.apache.org/index.html
[3] https://kudu.apache.org/overview.html
[4] https://github.com/apache/kudu/tree/master/examples/java


> Apache Kudu datastore for Gora
> ------------------------------
>                 Key: GORA-485
>                 URL: https://issues.apache.org/jira/browse/GORA-485
>             Project: Apache Gora
>          Issue Type: New Feature
>    Affects Versions: 0.8
>            Reporter: Kevin Ratnasekera
>            Assignee: Cihad Guzel
>            Priority: Major
>              Labels: gora-kudu, gsoc2017, gsoc2019
>             Fix For: 0.9
> Apache Kudu is getting lot of attention. Kudu is a key/value store optimized for analytic
workloads. [1] 
> [1] https://blog.cloudera.com/blog/2015/09/kudu-new-apache-hadoop-storage-for-fast-analytics-on-fast-data/

This message was sent by Atlassian JIRA

View raw message