samoa-dev mailing list archives

From "Jay (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SAMOA-47) Integrate Avro Streams with SAMOA
Date Sun, 25 Oct 2015 11:37:27 GMT
Jay created SAMOA-47:
------------------------

             Summary: Integrate Avro Streams with SAMOA
                 Key: SAMOA-47
                 URL: https://issues.apache.org/jira/browse/SAMOA-47
             Project: SAMOA
          Issue Type: New Feature
          Components: SAMOA-API, SAMOA-Instances
            Reporter: Jay
            Priority: Minor


The current SAMOA readers support data streams only in ARFF format. As a distributed streaming
machine learning framework, SAMOA is therefore limited in scope, since end users may have to
transform their data to ARFF. Apache Avro is a data serialization system that handles data streams
in a compact binary format and is typically used in conjunction with Big Data ecosystem
tools. Avro allows two encodings for the data: binary and JSON. Avro support may therefore
also allow users with JSON data to use SAMOA seamlessly.

The goal is to build support for Avro streams into SAMOA by adding an Avro file stream handler,
an Avro loader that reads records and transforms them into instances, and a user option to switch
between the JSON and binary encodings. The input format, including the representation of metadata
for both JSON and binary data, is to be finalized along with the build.
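
As a rough illustration of the "read records and transform to instances" step, here is a minimal
sketch in Python using only the standard library. It does not use the Avro library or any SAMOA
API. The schema, the field names, and the helpers record_to_instance and read_json_records are
all hypothetical, and the one-record-per-line input layout is an assumption, since the metadata
format is still open. It only handles numeric and enum fields in Avro's JSON encoding, mapping
enum symbols to their index the way nominal attributes are commonly stored.

```python
import json

# Hypothetical Avro-style record schema; the real metadata format is not yet decided.
SCHEMA = json.loads("""
{
  "type": "record",
  "name": "Iris",
  "fields": [
    {"name": "sepal_length", "type": "double"},
    {"name": "sepal_width", "type": "double"},
    {"name": "species", "type": {"type": "enum", "name": "Species",
                                 "symbols": ["setosa", "versicolor", "virginica"]}}
  ]
}
""")

NUMERIC_TYPES = ("int", "long", "float", "double")


def record_to_instance(record, schema):
    """Map one decoded record to a list of floats, in schema field order.

    Numeric fields are copied as floats; enum fields become the index of
    their symbol, similar to a nominal attribute in an ARFF instance.
    """
    values = []
    for field in schema["fields"]:
        ftype = field["type"]
        raw = record[field["name"]]
        if isinstance(ftype, dict) and ftype.get("type") == "enum":
            values.append(float(ftype["symbols"].index(raw)))
        elif ftype in NUMERIC_TYPES:
            values.append(float(raw))
        else:
            # Strings, unions, nested records etc. are out of scope for this sketch.
            raise ValueError(f"unsupported field type: {ftype!r}")
    return values


def read_json_records(lines, schema):
    """Yield one instance per non-empty line of JSON-encoded records."""
    for line in lines:
        line = line.strip()
        if line:
            yield record_to_instance(json.loads(line), schema)


# Two sample records, one JSON object per line (an assumed layout).
SAMPLE = [
    '{"sepal_length": 5.1, "sepal_width": 3.5, "species": "setosa"}',
    '{"sepal_length": 6.3, "sepal_width": 2.9, "species": "virginica"}',
]
instances = list(read_json_records(SAMPLE, SCHEMA))
```

A binary-encoded stream would need a real Avro decoder in place of json.loads; the mapping from a
decoded record to an instance could stay the same, which is one reason to keep the two steps
separate behind the proposed encoding switch.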



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
