manifoldcf-user mailing list archives

From Issei Nishigata <>
Subject Web crawler doesn't extract links
Date Sun, 06 Dec 2015 14:11:43 GMT
I'm using MCF 2.2.
When I crawl a page whose links have unquoted href attribute values, like
the one below, MCF does not extract the links correctly.

<a href=/sample/Mainservlet?sample=000 >sample</a>
# The attribute value is not enclosed in double quotes.
# MCF extracted only "/sample" as the link.

HTML4 does not always require quotes around attribute values,
whereas XHTML does.
Is MCF compliant with HTML4?
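
For comparison, here is a minimal sketch of the behavior I expected, written in Python with the standard library's html.parser rather than with any MCF code. The names LinkCollector and extract_links are made up for this illustration. It only shows that a lenient HTML parser can return the whole unquoted href value; it says nothing about how MCF's own parser is implemented.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags (hypothetical helper, not MCF code)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value is None when the
        # attribute has no value at all.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html):
    """Return every href found in the given HTML fragment, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


if __name__ == "__main__":
    # The unquoted sample from this message, followed by a quoted equivalent.
    print(extract_links('<a href=/sample/Mainservlet?sample=000 >sample</a>'))
    print(extract_links('<a href="/sample/Mainservlet?sample=000">sample</a>'))
```

With this parser, I would expect the quoted and unquoted forms to give the same link, which is the result I was hoping to get from the crawler.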

