Hello,
I am trying to use Samoa/Samza combination to apply ML for a dataset I have
in JSON format.
This is the document I am following:
https://samoa.incubator.apache.org/documentation/Executing-SAMOA-with-Apache-Samza.html
Couple of questions:
1. How do I point the input event to a Stream/Topic in Kafka? The data is
in JSON.
2. If I want to use historical data that is stored in a file, how do I
point the job to read from a file and serialise as json?
bin/samoa samza target/SAMOA-Samza-0.3.0-SNAPSHOT.jar
"PrequentialEvaluation -l classifiers.ensemble.Bagging -s (??)"
- Shekar
|