Jan Høydahl (JIRA) |
[jira] Created: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef |
Mon, 04 Oct, 14:56 |
Jan Høydahl (JIRA) |
[jira] Commented: (TIKA-490) Support for adding language profiles dynamically |
Mon, 04 Oct, 19:13 |
Jan Høydahl (JIRA) |
[jira] Created: (TIKA-527) Allow override mapping mime<-->parsers through config |
Fri, 08 Oct, 13:43 |
Jan Høydahl (JIRA) |
[jira] Updated: (TIKA-527) Allow override mapping mime<-->parsers through config |
Mon, 11 Oct, 06:22 |
Jan Høydahl (JIRA) |
[jira] Commented: (TIKA-527) Allow override mapping mime<-->parsers through config |
Mon, 11 Oct, 07:48 |
Jan Høydahl (JIRA) |
[jira] Created: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers |
Sat, 23 Oct, 02:31 |
Jan Høydahl (JIRA) |
[jira] Updated: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers |
Sat, 23 Oct, 13:37 |
Alex Skochin (JIRA) |
[jira] Updated: (TIKA-422) Wrong charset conversion in some RTF documents. |
Thu, 21 Oct, 14:37 |
Alex Skochin (JIRA) |
[jira] Issue Comment Edited: (TIKA-422) Wrong charset conversion in some RTF documents. |
Thu, 21 Oct, 14:41 |
Alex Skochin (JIRA) |
[jira] Issue Comment Edited: (TIKA-422) Wrong charset conversion in some RTF documents. |
Thu, 21 Oct, 15:15 |
Andrey Sidorenko (JIRA) |
[jira] Updated: (TIKA-516) Excel 5 files are inconsistently detected as either "application/msword" or "application/vnd.ms-excel" |
Wed, 06 Oct, 12:14 |
Apache Hudson Server |
Build failed in Hudson: Tika-trunk » Apache Tika core #391 |
Wed, 27 Oct, 17:14 |
Apache Hudson Server |
Build failed in Hudson: Tika-trunk #391 |
Wed, 27 Oct, 17:14 |
Apache Hudson Server |
Hudson build is back to normal : Tika-trunk » Apache Tika core #392 |
Thu, 28 Oct, 14:17 |
Apache Hudson Server |
Hudson build is back to normal : Tika-trunk #392 |
Thu, 28 Oct, 14:17 |
Apache Hudson Server |
Hudson build became unstable: Tika-trunk » Apache Tika parsers #393 |
Sun, 31 Oct, 03:04 |
Apache Hudson Server |
Hudson build is unstable: Tika-trunk #394 |
Sun, 31 Oct, 22:14 |
Apache Hudson Server |
Hudson build is still unstable: Tika-trunk » Apache Tika parsers #394 |
Sun, 31 Oct, 22:14 |
Apache Hudson Server |
Hudson build is back to normal : Tika-trunk » Apache Tika application #394 |
Sun, 31 Oct, 22:14 |
Bruno Dumon (JIRA) |
[jira] Created: (TIKA-528) Reuse tagsoup HtmlSchema instance across HtmlParsers (performance improvement) |
Sat, 09 Oct, 11:07 |
Bruno Dumon (JIRA) |
[jira] Updated: (TIKA-528) Reuse tagsoup HtmlSchema instance across HtmlParsers (performance improvement) |
Sat, 09 Oct, 11:09 |
Chris A. Mattmann (JIRA) |
[jira] Commented: (TIKA-433) Tika + Hadoop |
Sun, 03 Oct, 20:02 |
Chris A. Mattmann (JIRA) |
[jira] Commented: (TIKA-433) Tika + Hadoop |
Sun, 03 Oct, 20:14 |
Chris A. Mattmann (JIRA) |
[jira] Commented: (TIKA-490) Support for adding language profiles dynamically |
Mon, 04 Oct, 20:00 |
Chris A. Mattmann (JIRA) |
[jira] Commented: (TIKA-536) Updated site layout |
Fri, 22 Oct, 18:12 |
Chris A. Mattmann (JIRA) |
[jira] Commented: (TIKA-407) Push NetCDF4 lib dependency to Maven Central and Update Tika POM |
Sat, 30 Oct, 01:02 |
Chris A. Mattmann (JIRA) |
[jira] Resolved: (TIKA-407) Push NetCDF4 lib dependency to Maven Central and Update Tika POM |
Sun, 31 Oct, 01:43 |
Chris A. Mattmann (JIRA) |
[jira] Resolved: (TIKA-515) MimeType.getDescription() often returns nothing when "tika-mimetypes.xml" has a useful description already available. |
Sun, 31 Oct, 01:55 |
Chris A. Mattmann (JIRA) |
[jira] Commented: (TIKA-515) MimeType.getDescription() often returns nothing when "tika-mimetypes.xml" has a useful description already available. |
Sun, 31 Oct, 01:55 |
Chris A. Mattmann (JIRA) |
[jira] Updated: (TIKA-456) Support timeouts for parsers |
Sun, 31 Oct, 02:31 |
Chris A. Mattmann (JIRA) |
[jira] Resolved: (TIKA-399) HDF4/5 Tika Parser |
Sun, 31 Oct, 22:01 |
Cristian Vat (JIRA) |
[jira] Updated: (TIKA-422) Wrong charset conversion in some RTF documents. |
Tue, 12 Oct, 21:42 |
Cristian Vat (JIRA) |
[jira] Commented: (TIKA-422) Wrong charset conversion in some RTF documents. |
Tue, 12 Oct, 21:48 |
Cristian Vat (JIRA) |
[jira] Updated: (TIKA-422) Wrong charset conversion in some RTF documents. |
Wed, 13 Oct, 00:54 |
Cristian Vat (JIRA) |
[jira] Updated: (TIKA-422) Wrong charset conversion in some RTF documents. |
Thu, 14 Oct, 18:54 |
Cristian Vat (JIRA) |
[jira] Commented: (TIKA-422) Wrong charset conversion in some RTF documents. |
Thu, 14 Oct, 20:34 |
Dennis Adler (JIRA) |
[jira] Created: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Fri, 01 Oct, 17:50 |
Dennis Adler (JIRA) |
[jira] Updated: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Fri, 01 Oct, 19:30 |
Dennis Adler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Mon, 04 Oct, 17:54 |
Dennis Adler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Mon, 04 Oct, 21:42 |
Dennis Adler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Mon, 04 Oct, 22:53 |
Dennis Adler (JIRA) |
[jira] Issue Comment Edited: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Tue, 05 Oct, 19:56 |
Dennis Adler (JIRA) |
[jira] Updated: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Tue, 05 Oct, 23:35 |
Dennis Adler (JIRA) |
[jira] Issue Comment Edited: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Tue, 05 Oct, 23:41 |
Dennis Adler (JIRA) |
[jira] Issue Comment Edited: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Tue, 05 Oct, 23:41 |
Dennis Adler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Tue, 05 Oct, 23:58 |
Dennis Adler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Thu, 07 Oct, 18:58 |
Dennis Adler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Sun, 10 Oct, 22:55 |
Geoff Jarrad (JIRA) |
[jira] Commented: (TIKA-506) Improve doc and docx parsing to include more things |
Fri, 01 Oct, 00:05 |
Geoff Jarrad (JIRA) |
[jira] Created: (TIKA-524) Unification of HTML output from Office, OOXML and Open Document parsers |
Mon, 04 Oct, 23:18 |
Geoff Jarrad (JIRA) |
[jira] Created: (TIKA-525) Mismatched start and end elements in HtmlParser |
Tue, 05 Oct, 00:35 |
Geoff Jarrad (JIRA) |
[jira] Created: (TIKA-526) OOXMLParser fails to extract text from within smart tags |
Tue, 05 Oct, 03:59 |
Geoff Jarrad (JIRA) |
[jira] Updated: (TIKA-526) OOXMLParser fails to extract text from within smart tags |
Tue, 05 Oct, 04:01 |
Geoff Jarrad (JIRA) |
[jira] Created: (TIKA-533) Mis-detection of zip-within-zip as application/vnd.apple.iwork, with no output by CLI app |
Sun, 17 Oct, 22:23 |
Geoff Jarrad (JIRA) |
[jira] Updated: (TIKA-533) Mis-detection of zip-within-zip as application/vnd.apple.iwork, with no output by CLI app |
Sun, 17 Oct, 22:25 |
Geoff Jarrad (JIRA) |
[jira] Commented: (TIKA-533) Mis-detection of zip-within-zip as application/vnd.apple.iwork, with no output by CLI app |
Mon, 18 Oct, 00:21 |
Geoff Jarrad (JIRA) |
[jira] Created: (TIKA-534) MetadataException: Unsupported component id error parsing jpg |
Mon, 18 Oct, 22:53 |
Geoff Jarrad (JIRA) |
[jira] Updated: (TIKA-534) MetadataException: Unsupported component id error parsing jpg |
Mon, 18 Oct, 22:55 |
Geoff Jarrad (JIRA) |
[jira] Updated: (TIKA-534) MetadataException: Unsupported component id error parsing jpg |
Mon, 18 Oct, 23:46 |
Grant Ingersoll (JIRA) |
[jira] Commented: (TIKA-433) Tika + Hadoop |
Sun, 03 Oct, 20:10 |
Jukka Zitting |
Gearing up for Tika 0.8 |
Thu, 21 Oct, 19:28 |
Jukka Zitting |
Re: Build failed in Hudson: Tika-trunk » Apache Tika core #391 |
Wed, 27 Oct, 17:54 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-241) Rar archive support |
Sun, 03 Oct, 19:48 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-433) Tika + Hadoop |
Sun, 03 Oct, 19:58 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-169) Tika Web Service Servlet |
Sun, 03 Oct, 20:40 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-426) Parsing javascript as XML |
Sun, 03 Oct, 21:10 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-429) Error parsing DTD |
Sun, 03 Oct, 21:19 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-427) Parsing CSS as XML |
Sun, 03 Oct, 21:21 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Tue, 05 Oct, 20:04 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-527) Allow override mapping mime<-->parsers through config |
Sun, 10 Oct, 19:17 |
Jukka Zitting (JIRA) |
[jira] Reopened: (TIKA-446) Upgrade to PDFBox 1.2.1 |
Thu, 14 Oct, 09:11 |
Jukka Zitting (JIRA) |
[jira] Updated: (TIKA-446) Upgrade to PDFBox 1.3.0 |
Thu, 14 Oct, 09:13 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-533) Mis-detection of zip-within-zip as application/vnd.apple.iwork, with no output by CLI app |
Mon, 18 Oct, 10:06 |
Jukka Zitting (JIRA) |
[jira] Created: (TIKA-535) Implement Apache project branding requirements |
Tue, 19 Oct, 09:53 |
Jukka Zitting (JIRA) |
[jira] Updated: (TIKA-533) Mis-detection of zip files as application/vnd.apple.iwork |
Tue, 19 Oct, 16:30 |
Jukka Zitting (JIRA) |
[jira] Created: (TIKA-536) Updated site layout |
Fri, 22 Oct, 13:25 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-536) Updated site layout |
Fri, 22 Oct, 13:29 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-536) Updated site layout |
Mon, 25 Oct, 13:29 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-407) Push NetCDF4 lib dependency to Maven Central and Update Tika POM |
Sun, 31 Oct, 18:50 |
Jukka Zitting (JIRA) |
[jira] Updated: (TIKA-446) Upgrade to PDFBox 1.3.1 |
Sun, 31 Oct, 23:05 |
Jukka Zitting (JIRA) |
[jira] Resolved: (TIKA-446) Upgrade to PDFBox 1.3.1 |
Sun, 31 Oct, 23:11 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document |
Sun, 31 Oct, 23:53 |
Jukka Zitting (JIRA) |
[jira] Commented: (TIKA-536) Updated site layout |
Sun, 31 Oct, 23:59 |
Ken Krugler |
Re: Gearing up for Tika 0.8 |
Tue, 26 Oct, 17:03 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Fri, 01 Oct, 19:52 |
Ken Krugler (JIRA) |
[jira] Assigned: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio |
Fri, 01 Oct, 19:52 |
Ken Krugler (JIRA) |
[jira] Assigned: (TIKA-528) Reuse tagsoup HtmlSchema instance across HtmlParsers (performance improvement) |
Sat, 09 Oct, 20:19 |
Ken Krugler (JIRA) |
[jira] Assigned: (TIKA-525) Mismatched start and end elements in HtmlParser |
Sat, 09 Oct, 20:23 |
Ken Krugler (JIRA) |
[jira] Assigned: (TIKA-529) IBM420 charset detection's isLamAlef is allocation-happy |
Tue, 12 Oct, 02:50 |
Ken Krugler (JIRA) |
[jira] Resolved: (TIKA-528) Reuse tagsoup HtmlSchema instance across HtmlParsers (performance improvement) |
Tue, 12 Oct, 20:27 |
Ken Krugler (JIRA) |
[jira] Issue Comment Edited: (TIKA-528) Reuse tagsoup HtmlSchema instance across HtmlParsers (performance improvement) |
Tue, 12 Oct, 20:31 |
Ken Krugler (JIRA) |
[jira] Closed: (TIKA-532) missing spaces in text extraction of BodyContentHandler |
Tue, 26 Oct, 16:22 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-394) Missing spaces on html parsing |
Tue, 26 Oct, 17:00 |
Ken Krugler (JIRA) |
[jira] Updated: (TIKA-394) Missing spaces on html parsing |
Tue, 26 Oct, 17:00 |
Ken Krugler (JIRA) |
[jira] Resolved: (TIKA-394) Missing spaces on html parsing |
Tue, 26 Oct, 17:02 |
Ken Krugler (JIRA) |
[jira] Assigned: (TIKA-539) Encoding detection is too biased by encoding in meta tag |
Tue, 26 Oct, 20:00 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag |
Tue, 26 Oct, 22:39 |
Ken Krugler (JIRA) |
[jira] Commented: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom |
Sun, 31 Oct, 16:14 |
Mattmann, Chris A (388J) |
Re: Gearing up for Tika 0.8 |
Thu, 21 Oct, 22:29 |
Mattmann, Chris A (388J) |
ReviewBoard instance |
Tue, 26 Oct, 13:52 |