beam-commits mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component
Date Wed, 24 May 2017 20:29:04 GMT

    [ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16023633#comment-16023633 ]

ASF GitHub Bot commented on BEAM-2328:
--------------------------------------

GitHub user sberyozkin opened a pull request:

    https://github.com/apache/beam-site/pull/250

    [BEAM-2328] Add TikaIO to the list of in-progress transforms

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sberyozkin/beam-site patch-2

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/beam-site/pull/250.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #250
    
----


> Introduce Apache Tika Input component
> -------------------------------------
>
>                 Key: BEAM-2328
>                 URL: https://issues.apache.org/jira/browse/BEAM-2328
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-ideas, sdk-java-extensions
>            Reporter: Sergey Beryozkin
>            Assignee: Sergey Beryozkin
>             Fix For: 2.1.0
>
>
> Apache Tika is a popular project that offers extensive support for parsing a wide variety of file formats. It is used in many projects, including Lucene and Elasticsearch.
> Supporting a Tika Input (Read) at the Beam level would be of major interest to many users.
> A PR will follow.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
