chukwa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <>
Subject [jira] [Commented] (CHUKWA-734) Gora Storage System for Chuckwa Logs
Date Mon, 23 Feb 2015 05:03:12 GMT


Lewis John McGibbney commented on CHUKWA-734:

Hi [~eyang]
bq. I got an error for running TestHBaseWriter unit test:
My tests hang on {code}Running org.apache.hadoop.chukwa.datacollection.sender.TestAcksOnFailure{code},
however when I run just the HBaseWriter test, I also get an error
  5 testWriters(org.apache.hadoop.chukwa.datacollection.writer.TestHBaseWriter)  Time elapsed:
0.026 sec  <<< ERROR!
  6 java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/Stoppable
  7         at java.lang.ClassLoader.defineClass1(Native Method)
  8         at java.lang.ClassLoader.defineClass(
  9         at
 10         at
 11         at$100(
 12         at$
 13         at$
 14         at Method)
 15         at
 16         at java.lang.ClassLoader.loadClass(
 17         at sun.misc.Launcher$AppClassLoader.loadClass(
 18         at java.lang.ClassLoader.loadClass(
 19         at java.lang.ClassLoader.defineClass1(Native Method)
 20         at java.lang.ClassLoader.defineClass(
 21         at
 22         at
 23         at$100(
 24         at$
 25         at$
 26         at Method)
 27         at
 28         at java.lang.ClassLoader.loadClass(
 29         at sun.misc.Launcher$AppClassLoader.loadClass(
 30         at java.lang.ClassLoader.loadClass(
 31         at java.lang.ClassLoader.defineClass1(Native Method)
 32         at java.lang.ClassLoader.defineClass(
 33         at
 34         at
 35         at$100(
 36         at$
 37         at$
 38         at Method)
 39         at
 40         at java.lang.ClassLoader.loadClass(
 41         at sun.misc.Launcher$AppClassLoader.loadClass(
 42         at java.lang.ClassLoader.loadClass(
 43         at org.apache.hadoop.chukwa.datacollection.writer.TestHBaseWriter.setUp(
 44         at junit.framework.TestCase.runBare(
 45         at junit.framework.TestResult$1.protect(
 46         at junit.framework.TestResult.runProtected(
 47         at
 48         at
 49         at junit.framework.TestSuite.runTest(
 50         at
 51         at
 52         at org.apache.maven.surefire.junit4.JUnit4TestSet.execute(
 53         at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(
 54         at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(
 55         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

bq. Does Gora support Hadoop1?
Yes we have shim layer support for Hadoop 1.2.1 and 2.5.2. We just need to configure it properly.

bq. We probably need to setup another profile for enabling Hadoop 1 vs Hadoop 2.
Most likely Eric. I will be looking in to this. I see that there are existing profiles for
0.96.2-hadoop1 and 0.94.9
I am therefore thinking that we could potentially set up profiles for Gora-supported backends
for HBase 0.98.8-hadoop2, Cassandra 2.0.2, Solr 4.10.3, Mongodb 2.6 and Accumulo 1.5.1. Before
I do this right now however I need clarification on 
 * what version of HBase is currently activated by default in Gora?
 * how and where is this defined as default within pom.xml?

bq. Can Gora map sequence ID value to column name in HBase?
Mmmmmm.... I am not sure about this. Reasoning is as follows: currently we define AHEAD OF
 * a table name
 * columns within this table (columns can also have [optional params like compression, bloom
filters, etc|])
 * we then map out data beans to these definitions

It would appear to me that for us to be able to map sequenceID to a column name, we would
1) want to dynamically create many many columns over time directly dependent on the number
of data chunks we get, is this correct? 2) once we get a new incoming data chunk we would
wish to dynamically generate a new column within the existing table with the sequenceID as
the column name, is this correct?


> Gora Storage System for Chuckwa Logs
> ------------------------------------
>                 Key: CHUKWA-734
>                 URL:
>             Project: Chukwa
>          Issue Type: New Feature
>          Components: Data Collection
>    Affects Versions: 0.6.0
>            Reporter: Lewis John McGibbney
>             Fix For: 0.6.0
>         Attachments: CHUKWA-734.patch
>   Original Estimate: 5h
>  Remaining Estimate: 5h
> I would like to build a Gora-backed log-to-datastore module for Chuckwa. I am going to
work on this today.
> Gora is an in-memory data modeling and storage abstraction 
> Gora powers the Apache Nutch 2.X software which generates a bunch of log data. Having
a Chuckwa monitoring tool for Nutch would be grand.

This message was sent by Atlassian JIRA

View raw message