samza-dev mailing list archives

From Rob Martin
Subject Samza for text processing
Date Sun, 28 Apr 2019 14:08:58 GMT
I'm looking at creating a distributed streaming pipeline for processing text
documents (e.g. cleaning, NER and machine learning). Documents will generally
be under 1 MB and processing will be stateless. I was aiming to feed documents
from various sources, plus additional data, into Kafka to be streamed to the
processing pipeline in Samza. Would this be an appropriate use case for Samza?
