tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jeremias Maerki <...@jeremias-maerki.ch>
Subject Re: svn commit: r594376 - in /incubator/tika/trunk: CHANGES.txt src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java src/main/java/org/apache/tika/parser/pdf/PDFParser.java
Date Sun, 18 Nov 2007 17:54:54 GMT
The constructor IOException(String, Exception) only exists since Java 6.
I don't think that was intended, was it?

Jeremias Maerki



On 13.11.2007 02:04:31 jukka wrote:
> Author: jukka
> Date: Mon Nov 12 17:04:30 2007
> New Revision: 594376
> 
> URL: http://svn.apache.org/viewvc?rev=594376&view=rev
> Log:
> TIKA-100 - Structured PDF parsing
>     - Customized the PdfTextStripper class to produce XHTML SAX events
>       (there's a somewhat similar PdfText2HTML class in PDFBox, but
>       that class produces a character stream instead of SAX events)
> 
> Added:
>     incubator/tika/trunk/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java   (with
props)
> Modified:
>     incubator/tika/trunk/CHANGES.txt
>     incubator/tika/trunk/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
> 
<snip/>
> Added: incubator/tika/trunk/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
> URL: http://svn.apache.org/viewvc/incubator/tika/trunk/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java?rev=594376&view=auto
> ==============================================================================
> --- incubator/tika/trunk/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java (added)
> +++ incubator/tika/trunk/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java Mon
Nov 12 17:04:30 2007
> +    protected void endDocument(PDDocument pdf) throws IOException {
> +        try {
> +            handler.endDocument();
> +        } catch (SAXException e) {
> +            throw new IOException("Unable to end a document", e);
> +        }
> +    }


Mime
View raw message