nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stefan Groschupf (JIRA)" <j...@apache.org>
Subject [jira] Created: (NUTCH-135) http header meta data are case insensitive in the real world (e.g. Content-Type or content-type)
Date Fri, 09 Dec 2005 20:51:08 GMT
http header meta data are case insensitive in the real world (e.g. Content-Type or content-type)
------------------------------------------------------------------------------------------------

         Key: NUTCH-135
         URL: http://issues.apache.org/jira/browse/NUTCH-135
     Project: Nutch
        Type: Bug
  Components: fetcher  
    Versions: 0.7.1, 0.7    
    Reporter: Stefan Groschupf
    Priority: Critical
     Fix For: 0.8-dev, 0.7.2-dev


As described in issue nutch-133, some webservers return http header meta data not standard
conform case insensitive.
This provides many negative side effects, for example query thet content type from the meta
data return null also in case the webserver returns a content type, but the key is not standard
conform e.g. lower case. Also this has effects to the pdf parser that queries the content
length etc.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


Mime
View raw message