incubator-droids-dev mailing list archives

From "Richard Frovarp (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DROIDS-142) Add additional filtering untill the file is saved on disk
Date Fri, 15 Jun 2012 02:46:43 GMT

     [ https://issues.apache.org/jira/browse/DROIDS-142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Richard Frovarp updated DROIDS-142:
-----------------------------------

    Fix Version/s:     (was: 0.2.0)
                   0.3.0
    
> Add additional filtering untill the file is saved on disk
> ---------------------------------------------------------
>
>                 Key: DROIDS-142
>                 URL: https://issues.apache.org/jira/browse/DROIDS-142
>             Project: Droids
>          Issue Type: New Feature
>          Components: core
>    Affects Versions: 0.2.0
>            Reporter: Eugen Paraschiv
>             Fix For: 0.3.0
>
>
> The existing filtering process accepts or rejects URLs based on the URL itself, which is
> very useful. In some cases, though, you need to decide whether a file is relevant and
> should be saved based on its content.
> There should be a step in SaveHandler, before the file is actually saved, where the handler
> can decide whether the file is persisted or ignored based on the URL and also on the
> file contents. Specific checks can be introduced at this point to further filter out files.
> - note: as an example, consider the very common site that doesn't have hierarchical,
> well-defined URLs, but instead simple /domain/object1, /domain/object2 type URLs; these
> links say nothing about the content, so filtering them with a regex would do no good;
> the page itself, however, is likely to contain all the information required for more
> granular filtering
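
The pre-save step requested above could look roughly like the sketch below. This is an illustration only, not the Droids API: `ContentFilter`, `FilteringSaveHandler`, `containsText`, and the in-memory `saved` list (standing in for the write to disk) are all hypothetical names introduced here.

```java
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical filter (not part of Droids): sees both the URL and the fetched content. */
interface ContentFilter {
    boolean accept(URI uri, byte[] content);

    /** Example filter: accept only content that contains the given text (UTF-8 assumed). */
    static ContentFilter containsText(String needle) {
        return (uri, content) ->
                new String(content, StandardCharsets.UTF_8).contains(needle);
    }
}

/** Hypothetical handler sketch: runs every filter before persisting anything. */
class FilteringSaveHandler {
    private final List<ContentFilter> filters;
    // Stand-in for the real persistence step, so the sketch stays self-contained.
    private final List<URI> saved = new ArrayList<>();

    FilteringSaveHandler(List<ContentFilter> filters) {
        this.filters = new ArrayList<>(filters);
    }

    /** Returns true if the content passed all filters and was "saved". */
    boolean handle(URI uri, byte[] content) {
        for (ContentFilter filter : filters) {
            if (!filter.accept(uri, content)) {
                return false; // ignored: at least one content check rejected it
            }
        }
        saved.add(uri); // a real handler would write the file to disk here
        return true;
    }

    List<URI> saved() {
        return saved;
    }
}
```

With URLs such as /domain/object1 and /domain/object2, a URL regex cannot tell the pages apart, but a filter like `ContentFilter.containsText("...")` can reject one and keep the other because it inspects the page body.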

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
