maven-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joakim Erdfelt (JIRA)" <j...@codehaus.org>
Subject [jira] Closed: (WAGON-218) Link Parsing in http is flawed
Date Sun, 01 Jun 2008 04:18:53 GMT

     [ http://jira.codehaus.org/browse/WAGON-218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Joakim Erdfelt closed WAGON-218.
--------------------------------

    Resolution: Fixed

Work completed in revision 662070

After a sample LinkParser replacement is P.o.C. in a wagon-http-with-webdav branch, and a
discussion in the dev@wagon mailing list.  The following changes have been made.

1) Replaced jtidy with nekohtml
    This resulted in a smaller dependency list and improved memory utilization.
2) Replaces reliance of String URL manipulation with use of java.net.URI
    This change makes the detection of content that belongs to the page more accurate, as
well as enables some complex relative uri resolution almost trivial.
3) Added more unit tests for real world scenarios encountered since the original implementation
was loose on the world.


> Link Parsing in http is flawed
> ------------------------------
>
>                 Key: WAGON-218
>                 URL: http://jira.codehaus.org/browse/WAGON-218
>             Project: Maven Wagon
>          Issue Type: Improvement
>          Components: wagon-http, wagon-http-lightweight
>    Affects Versions: 1.0-beta-2
>            Reporter: Joakim Erdfelt
>            Assignee: Joakim Erdfelt
>
> The link parsing in wagon http has a few issues.
> a) not all links detected.
> b) the various ways that page content is identified via url string manipulation isn't
working in many example cases.
> c) the use of jtidy introduces a large dependency and high memory usage.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message