samoa-dev mailing list archives

From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (SAMOA-47) Integrate Avro Streams with SAMOA
Date Sun, 31 Jan 2016 12:32:39 GMT


ASF GitHub Bot commented on SAMOA-47:

Github user gdfm commented on the pull request:
    +1, thanks @jayadeepj, sorry for the extremely long turnaround.

> Integrate Avro Streams with SAMOA
> ---------------------------------
>                 Key: SAMOA-47
>                 URL:
>             Project: SAMOA
>          Issue Type: New Feature
>          Components: SAMOA-API, SAMOA-Instances
>            Reporter: jayadeepj
>            Priority: Minor
>              Labels: patch
> The current SAMOA readers support data streams only in ARFF format. As a distributed
streaming machine learning framework, SAMOA is therefore limited in scope, since end users
may have to transform their data to ARFF. Apache Avro is a data serialization system that
handles data streams in a compact binary format and is typically used in conjunction with
Big Data ecosystem tools. Avro allows two encodings for the data: binary and JSON. Avro
support may therefore also let users with JSON data use SAMOA seamlessly.
> The goal is to build support for Avro streams into SAMOA by adding an Avro file stream
handler, an Avro loader that reads records and transforms them into instances, and a user
option to switch between JSON and binary encodings. The input format, including the
representation of metadata for both JSON and binary data, is to be finalized along with
the build.
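
For illustration only, the loader step described above (record in, instance out, with a
user option selecting the encoding) might look roughly like the sketch below. Every name
in it (AvroInstanceSketch, Encoding, Attribute, toInstance) is hypothetical and is not
taken from SAMOA, Avro, or the pull request. A plain Map stands in for a decoded Avro
record so that the sketch needs no Avro dependency, and the attribute metadata layout is
an assumption, since the issue leaves the input format to be finalized.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: none of these types come from SAMOA or Avro.
final class AvroInstanceSketch {

    /** The two Avro encodings named in the issue; a user option selects one. */
    enum Encoding {
        BINARY, JSON;

        /** Parses a user-supplied option value, ignoring case and surrounding spaces. */
        static Encoding fromOption(String value) {
            return valueOf(value.trim().toUpperCase(Locale.ROOT));
        }
    }

    /** Assumed attribute metadata: a name, plus labels when the attribute is nominal. */
    static final class Attribute {
        final String name;
        final List<String> nominalLabels; // empty for numeric attributes

        Attribute(String name, List<String> nominalLabels) {
            this.name = name;
            this.nominalLabels = nominalLabels;
        }
    }

    /**
     * Turns one decoded record (a Map standing in for an Avro record) into a
     * dense instance: numeric fields are copied, nominal fields become the
     * index of their label, and absent fields become NaN.
     */
    static double[] toInstance(Map<String, Object> record, List<Attribute> attributes) {
        double[] values = new double[attributes.size()];
        for (int i = 0; i < values.length; i++) {
            Attribute attribute = attributes.get(i);
            Object field = record.get(attribute.name);
            if (field == null) {
                values[i] = Double.NaN; // missing value
            } else if (attribute.nominalLabels.isEmpty()) {
                values[i] = ((Number) field).doubleValue();
            } else {
                int index = attribute.nominalLabels.indexOf(field.toString());
                if (index < 0) {
                    throw new IllegalArgumentException(
                        "Unknown label '" + field + "' for attribute " + attribute.name);
                }
                values[i] = index;
            }
        }
        return values;
    }
}
```

In this sketch the decoding itself is left out on purpose: a real loader would pick a
binary or JSON decoder based on the Encoding value and hand each decoded record to
something like toInstance.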

This message was sent by Atlassian JIRA
