nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dennis Kubes (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-44) too many search results
Date Sat, 16 Feb 2008 00:26:09 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-44?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12569458#action_12569458
] 

Dennis Kubes commented on NUTCH-44:
-----------------------------------

Do you mean when you do a query on say the second page and the max is 1000 that the query
actually searches for 2000 results, because I noticed this as well.  Although don't know what
would be the way to prevent this, except maybe not allowing that deep of a search.  

> too many search results
> -----------------------
>
>                 Key: NUTCH-44
>                 URL: https://issues.apache.org/jira/browse/NUTCH-44
>             Project: Nutch
>          Issue Type: Bug
>          Components: web gui
>         Environment: web environment
>            Reporter: Emilijan Mirceski
>            Assignee: Dennis Kubes
>         Attachments: NUTCH-44.patch
>
>
> There should be a limitation (user defined) on the number of results the search engine
can return. 
> For example, if one modifies the seach url as:
> http://<my>/search.jsp?query=<some quiery>&hitsPerPage=20000&hitsPerSite=0
> The search will try to return 20,000 pages which isn't good for the server side performance.

> Is it possible to have a setting in the config xml files to control this?
> Thanks,
> Emilijan

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message