Guest (JIRA) |
[jira] Updated: (TIKA-558) Problems/inconsistency with jar edu.ucar:netcdf:4.2 used by Tika 0.8 |
Thu, 25 Nov, 12:25 |
Hasan Diwan (JIRA) |
[jira] Created: (TIKA-541) Use commons-cli in lieu of writing our own option parser |
Tue, 02 Nov, 06:58 |
Hasan Diwan (JIRA) |
[jira] Updated: (TIKA-541) Use commons-cli in lieu of writing our own option parser |
Tue, 02 Nov, 07:00 |
Igor Spasic (JIRA) |
[jira] Created: (TIKA-547) Can't extract PDF text |
Tue, 09 Nov, 14:06 |
Igor Spasic (JIRA) |
[jira] Updated: (TIKA-547) Can't extract PDF text |
Tue, 09 Nov, 14:08 |
Igor Spasic (JIRA) |
[jira] Updated: (TIKA-547) Can't extract PDF text |
Tue, 09 Nov, 14:10 |
Igor Spasic (JIRA) |
[jira] Commented: (TIKA-547) Can't extract PDF text |
Tue, 09 Nov, 14:26 |
Igor Spasic (JIRA) |
[jira] Commented: (TIKA-547) Can't extract PDF text |
Tue, 09 Nov, 14:34 |
Jukka Zitting |
Re: Hudson build is still unstable: Tika-trunk #395 |
Mon, 01 Nov, 01:16 |
Jukka Zitting |
Java 6 (Was: Hudson build is still unstable: Tika-trunk #395) |
Mon, 01 Nov, 13:39 |
Jukka Zitting |
Re: ReviewBoard instance |
Wed, 10 Nov, 19:23 |
Jukka Zitting |
RE: tika and plain text -- bug or feature? |
Wed, 10 Nov, 22:42 |
Jukka Zitting |
RE: tika and plain text -- bug or feature? |
Thu, 11 Nov, 00:21 |
Jukka Zitting |
Re: svn commit: r1033937 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/main/java/org/apache/tika/parser/pkg/ |
Thu, 11 Nov, 14:05 |
Jukka Zitting |
RE: buildbot failure in ASF Buildbot on tika-trunk |
Sat, 13 Nov, 15:06 |
Jukka Zitting |
RE: RecursiveMetadata and MetadataDiscussion - some long-term input - if you need RDF call xesam or aperture |
Mon, 15 Nov, 19:39 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-531) xmpTPg:NPages creates invalid XML |
Mon, 01 Nov, 00:17 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-373) Upgrade to POI 3.7 |
Mon, 01 Nov, 22:52 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-542) Publish Javadoc on tika.apache.org |
Thu, 04 Nov, 14:41 |
Jukka Zitting (JIRA) |
[jira] Updated: (TIKA-543) Remove rome 1.0 dependency on java.net repository |
Sat, 06 Nov, 23:48 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-543) Remove rome 1.0 dependency on java.net repository |
Sat, 06 Nov, 23:48 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-543) Remove rome 1.0 dependency on java.net repository |
Sat, 06 Nov, 23:48 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-482) Refactor image and jpeg parsers for access to MetadataExtractor API |
Wed, 10 Nov, 18:43 |
Jukka Zitting (JIRA) |
[jira] Created: (TIKA-553) Automatic license header checks |
Fri, 12 Nov, 20:03 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-553) Automatic license header checks |
Fri, 12 Nov, 21:28 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 |
Sat, 13 Nov, 09:48 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-548) PDF content extracted as single line |
Thu, 18 Nov, 18:12 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-554) ParseUtils.getStringContent needs an option to set the write limit that can be passed into the BodyContentHandler |
Thu, 18 Nov, 20:41 |
Jukka Zitting (JIRA) |
[jira] Created: (TIKA-556) Problems with the NetCDF jar |
Tue, 23 Nov, 15:44 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-556) Problems with the NetCDF jar |
Tue, 23 Nov, 15:52 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-556) Problems with the NetCDF jar |
Tue, 23 Nov, 17:26 |
Julien Nioche |
Re: Furthering Along TIKA-461 |
Thu, 25 Nov, 18:11 |
Julien Nioche (JIRA) |
[jira] Commented: (TIKA-461) RFC822 messages not parsed |
Tue, 09 Nov, 16:13 |
Julien Nioche (JIRA) |
[jira] Updated: (TIKA-461) RFC822 messages not parsed |
Tue, 30 Nov, 16:08 |
Julien Nioche (JIRA) |
[jira] Commented: (TIKA-461) RFC822 messages not parsed |
Tue, 30 Nov, 16:14 |
Julien Nioche (JIRA) |
[jira] Commented: (TIKA-461) RFC822 messages not parsed |
Tue, 30 Nov, 16:50 |
Ken Krugler |
Re: Build problem with trunk? |
Thu, 04 Nov, 14:01 |
Ken Krugler |
Re: Charset SPI |
Sat, 06 Nov, 19:19 |
Ken Krugler |
XML parsing hang |
Tue, 09 Nov, 18:35 |
Ken Krugler |
Re: [VOTE] Apache Tika 0.8 Release Candidate #1 |
Thu, 11 Nov, 14:24 |
Ken Krugler |
Re: buildbot failure in ASF Buildbot on tika-trunk |
Sat, 13 Nov, 17:37 |
Ken Krugler (JIRA) |
[jira] Closed: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document |
Tue, 02 Nov, 13:12 |
Ken Krugler (JIRA) |
[jira] Created: (TIKA-543) Remove rome 1.0 dependency on java.net repository |
Thu, 04 Nov, 13:47 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-543) Remove rome 1.0 dependency on java.net repository |
Thu, 04 Nov, 13:59 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-466) Feed Parser |
Thu, 04 Nov, 13:59 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom |
Thu, 04 Nov, 16:09 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom |
Thu, 04 Nov, 16:09 |
Ken Krugler (JIRA) |
[jira] Updated: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom |
Fri, 05 Nov, 13:34 |
Ken Krugler (JIRA) |
[jira] Resolved: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom |
Fri, 05 Nov, 13:54 |
Ken Krugler (JIRA) |
[jira] Created: (TIKA-544) AutoDetectParser ignores charset in Content-Type metadata |
Fri, 05 Nov, 20:14 |
Ken Krugler (JIRA) |
[jira] Closed: (TIKA-544) AutoDetectParser ignores charset in Content-Type metadata |
Fri, 05 Nov, 21:30 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag |
Fri, 05 Nov, 21:36 |
Ken Krugler (JIRA) |
[jira] Issue Comment Edited: (TIKA-539) Encoding detection is too biased by encoding in meta tag |
Fri, 05 Nov, 21:36 |
Ken Krugler (JIRA) |
[jira] Issue Comment Edited: (TIKA-539) Encoding detection is too biased by encoding in meta tag |
Fri, 05 Nov, 21:40 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag |
Sat, 06 Nov, 18:58 |
Ken Krugler (JIRA) |
[jira] Updated: (TIKA-369) Improve accuracy of language detection |
Sat, 20 Nov, 21:48 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-557) Extract text file PDF error |
Thu, 25 Nov, 14:51 |
Leo Sauermann |
RecursiveMetadata and MetadataDiscussion - some long-term input |
Sun, 14 Nov, 09:13 |
Leo Sauermann |
Re: RecursiveMetadata and MetadataDiscussion - some long-term input - if you need RDF call xesam or aperture |
Mon, 15 Nov, 15:15 |
Mattmann, Chris A (388J) |
Re: Hudson build is still unstable: Tika-trunk #395 |
Mon, 01 Nov, 02:38 |
Mattmann, Chris A (388J) |
Re: Hudson build is still unstable: Tika-trunk #395 |
Mon, 01 Nov, 03:38 |
Mattmann, Chris A (388J) |
Re: Hudson build is still unstable: Tika-trunk #395 |
Mon, 01 Nov, 04:08 |
Mattmann, Chris A (388J) |
0.8 release: latest status |
Mon, 01 Nov, 06:22 |
Mattmann, Chris A (388J) |
Re: 0.8 release: latest status |
Wed, 03 Nov, 01:50 |
Mattmann, Chris A (388J) |
My ApacheConNA 2010 slides |
Sat, 06 Nov, 19:52 |
Mattmann, Chris A (388J) |
Re: 0.8 release: latest status |
Sun, 07 Nov, 00:00 |
Mattmann, Chris A (388J) |
Re: 0.8 release: latest status |
Sun, 07 Nov, 00:01 |
Mattmann, Chris A (388J) |
[ANNOUNCE] Welcome Maxim Valyanskiy as Tika PMC/Committer |
Mon, 08 Nov, 07:20 |
Mattmann, Chris A (388J) |
Re: 0.8 release: latest status |
Tue, 09 Nov, 01:07 |
Mattmann, Chris A (388J) |
Re: 0.8 release: latest status |
Tue, 09 Nov, 21:20 |
Mattmann, Chris A (388J) |
[VOTE] Apache Tika 0.8 Release Candidate #1 |
Tue, 09 Nov, 21:29 |
Mattmann, Chris A (388J) |
Re: svn commit: r1033937 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/main/java/org/apache/tika/parser/pkg/ |
Thu, 11 Nov, 15:00 |
Mattmann, Chris A (388J) |
Re: svn commit: r1033937 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/main/java/org/apache/tika/parser/pkg/ |
Thu, 11 Nov, 15:01 |
Mattmann, Chris A (388J) |
[RESULT] [VOTE] Apache Tika 0.8 Release Candidate #1 |
Sat, 13 Nov, 03:45 |
Mattmann, Chris A (388J) |
[ANNOUNCE] Apache Tika 0.8 released |
Sat, 13 Nov, 07:09 |
Mattmann, Chris A (388J) |
Re: RecursiveMetadata and MetadataDiscussion - some long-term input |
Sun, 14 Nov, 16:48 |
Mattmann, Chris A (388J) |
Re: RecursiveMetadata and MetadataDiscussion - some long-term input - if you need RDF call xesam or aperture |
Mon, 15 Nov, 16:58 |
Maxim Valyanskiy |
Re: [ANNOUNCE] Welcome Maxim Valyanskiy as Tika PMC/Committer |
Tue, 09 Nov, 12:43 |
Maxim Valyanskiy |
Re: svn commit: r1033937 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/main/java/org/apache/tika/parser/pkg/ |
Thu, 11 Nov, 14:38 |
Maxim Valyanskiy |
Re: buildbot failure in ASF Buildbot on tika-trunk |
Sat, 13 Nov, 10:59 |
Maxim Valyanskiy (JIRA) |
[jira] Commented: (TIKA-540) extract text from .docx footnotes |
Fri, 05 Nov, 13:03 |
Maxim Valyanskiy (JIRA) |
[jira] Resolved: (TIKA-540) extract text from .docx footnotes |
Fri, 05 Nov, 13:05 |
Maxim Valyanskiy (JIRA) |
[jira] Resolved: (TIKA-510) Use POI API for text extraction from XSLF shape |
Tue, 09 Nov, 11:23 |
Maxim Valyanskiy (JIRA) |
[jira] Resolved: (TIKA-511) NPE when POI is configured to prefer event extractors |
Tue, 09 Nov, 11:23 |
Maxim Valyanskiy (JIRA) |
[jira] Created: (TIKA-549) There is no support for extracting OLE-shapes from PPT |
Fri, 12 Nov, 11:51 |
Maxim Valyanskiy (JIRA) |
[jira] Resolved: (TIKA-549) There is no support for extracting OLE-shapes from PPT |
Fri, 12 Nov, 12:07 |
Maxim Valyanskiy (JIRA) |
[jira] Created: (TIKA-550) Add stable filenames for extracted embedded files from Office binaries |
Fri, 12 Nov, 12:17 |
Maxim Valyanskiy (JIRA) |
[jira] Updated: (TIKA-550) Add stable filenames for extracted embedded files from Office binaries |
Fri, 12 Nov, 12:17 |
Maxim Valyanskiy (JIRA) |
[jira] Resolved: (TIKA-550) Add stable filenames for extracted embedded files from Office binaries |
Fri, 12 Nov, 12:33 |
Maxim Valyanskiy (JIRA) |
[jira] Created: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 |
Fri, 12 Nov, 12:57 |
Maxim Valyanskiy (JIRA) |
[jira] Updated: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 |
Fri, 12 Nov, 12:59 |
Maxim Valyanskiy (JIRA) |
[jira] Commented: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 |
Fri, 12 Nov, 13:01 |
Maxim Valyanskiy (JIRA) |
[jira] Commented: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 |
Sat, 13 Nov, 10:18 |
Michel Tremblay (JIRA) |
[jira] Commented: (TIKA-389) Garbled metadata when dealing with encrypted PDF files. |
Tue, 30 Nov, 22:56 |
Nick Burch |
Re: svn commit: r1033937 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/main/java/org/apache/tika/parser/pkg/ |
Thu, 11 Nov, 14:51 |
Nick Burch |
Re: MS Lectures on office file formats |
Fri, 12 Nov, 14:52 |
Nick Burch (JIRA) |
[jira] Commented: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. |
Tue, 09 Nov, 12:09 |
Nick Burch (JIRA) |
[jira] Commented: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. |
Tue, 09 Nov, 12:29 |
Nick Burch (JIRA) |
[jira] Commented: (TIKA-461) RFC822 messages not parsed |
Tue, 09 Nov, 16:03 |
Nick Burch (JIRA) |
[jira] Commented: (TIKA-461) RFC822 messages not parsed |
Tue, 09 Nov, 16:29 |