gora-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alfonso Nishikawa (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (GORA-109) Pig Adapter for Gora
Date Thu, 03 May 2018 18:41:00 GMT

    [ https://issues.apache.org/jira/browse/GORA-109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16462909#comment-16462909
] 

Alfonso Nishikawa edited comment on GORA-109 at 5/3/18 6:40 PM:
----------------------------------------------------------------

Hi, Pals.

It has been a lot of time since this issue. I finally got a _quite_-successful implementation
of a GoraStorage for Apache Pig. At this moment it is in my own branch since I impemented
it for my degree proyect. [https://github.com/alfonsonishikawa/gora/tree/goraexplorer/gora-pig]

It allows to load data from -any- HBase (I can only claim this) through Gora in Pig scripts
like:
{code:java}
set job.name 'GoraPig LOAD test';
register gora/*.jar;
webpage = LOAD '.' USING org.apache.gora.pig.GoraStorage('{
        "persistentClass": "admin.WebPage",
        "fields": "baseUrl,contentType"
}') ;
DUMP webpage;
{code}
I will update my page at [http://people.apache.org/~alfonsonishikawa/gora-109.html] with details
in the incoming days (the speed depends on my spare time), but please, ask me in advance the
first questions you have so I can write a FAQ too. If I have to write by my own ideas, I tend
to write EVERYTHING and I end NEVER :)

If anyone is a Pig user and wants to test it ask too. Questions priotize what I explain :)

I am thinking about some Nutch jobs in Pig :P

 

Sorry for the inmense delay :(. Just know that this _"complete"_ implementation (actually
nothing ever is complete) took me more than 300 hours :(. It has much details everywhere.


was (Author: alfonso.nishikawa):
Hi, Pals.

It has been a lot of time since this issue. I finally got a _quite_-successful implementation
of a GoraStorage for Apache Pig. At this moment it is in my own branch since I impemented
it for my degree proyect. [https://github.com/alfonsonishikawa/gora/tree/goraexplorer/gora-pig]

It allows to load data from -any- HBase (I can only claim this) through Gora in Pig scripts
like:
{code:java}
set job.name 'GoraPig LOAD test';
register gora/*.jar;
webpage = LOAD '.' USING org.apache.gora.pig.GoraStorage('{
        "persistentClass": "admin.WebPage",
        "fields": "baseUrl,contentType"
}') ;
DUMP webpage;
{code}
I will update my page at [http://people.apache.org/~alfonsonishikawa/gora-109.html] with details
in the incoming days (the speed depends on my spare time), but please, ask me in advance the
first questions you have so I can write a FAQ too. If I have to write by my own ideas, I tend
to write EVERYTHING and I end NEVER :)

I anyone is a Pig user and wants to test it ask too. Questions priotize what I explain :)

I am thinking about some Nutch jobs in Pig :P

 

Sorry for the inmense delay :(. Just know that this _"complete"_ implementation (actually
nothing ever is complete) took me more than 300 hours :(. It has much details everywhere.

> Pig Adapter for Gora
> --------------------
>
>                 Key: GORA-109
>                 URL: https://issues.apache.org/jira/browse/GORA-109
>             Project: Apache Gora
>          Issue Type: New Feature
>          Components: gora-pig
>    Affects Versions: 0.2
>            Reporter: Lewis John McGibbney
>            Assignee: Alfonso Nishikawa
>            Priority: Minor
>             Fix For: 0.9
>
>
> This has been on the agenda from the start and I suppose will be purely use case driven.
Persoanlly I haven't touched Pig as I don't have a use-case driven requirement to do so, however
there is no harm in this being logged if in the future it attracts interest.  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message