nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Julien Nioche (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-998) index-basic should use filename if title is empty
Date Wed, 18 May 2011 11:41:47 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13035316#comment-13035316
] 

Julien Nioche commented on NUTCH-998:
-------------------------------------

-1 : I'd rather not do that and leave to the search front end to decide on what do display
when a proper title is missing. In terms of relevancy and scoring having a real title is not
the same as populating one with the filename + better to know what is what.  

> index-basic should use filename if title is empty
> -------------------------------------------------
>
>                 Key: NUTCH-998
>                 URL: https://issues.apache.org/jira/browse/NUTCH-998
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 1.3, 2.0
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>
> In some cases documents are indexed with empty title fields, this is not very user friendly.
Although this can be remedied in Solr using a conditional copyField in a custom update request
processor i'd rather see it fixed in Nutch itself.
> Any thoughts? 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message