nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dennis Kubes (JIRA)" <j...@apache.org>
Subject [jira] Updated: (NUTCH-44) too many search results
Date Sat, 16 Feb 2008 00:28:09 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-44?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Dennis Kubes updated NUTCH-44:
------------------------------

    Attachment: NUTCH-44-2-20080215.patch

Updated patch, changed the name to searcher.max.hits.per.page (yes still long but best I could
come up with given the givens), also updates patch to the current SVN.  This has been tested
and run through fetch and search cycles on linux.

> too many search results
> -----------------------
>
>                 Key: NUTCH-44
>                 URL: https://issues.apache.org/jira/browse/NUTCH-44
>             Project: Nutch
>          Issue Type: Bug
>          Components: web gui
>         Environment: web environment
>            Reporter: Emilijan Mirceski
>            Assignee: Dennis Kubes
>         Attachments: NUTCH-44-2-20080215.patch, NUTCH-44.patch
>
>
> There should be a limitation (user defined) on the number of results the search engine
can return. 
> For example, if one modifies the seach url as:
> http://<my>/search.jsp?query=<some quiery>&hitsPerPage=20000&hitsPerSite=0
> The search will try to return 20,000 pages which isn't good for the server side performance.

> Is it possible to have a setting in the config xml files to control this?
> Thanks,
> Emilijan

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message