tika-dev mailing list archives: May 2013

Andrew Jackson (JIRA) [jira] [Created] (TIKA-1117) IWorkPackageParser should not close the InputStream Wed, 01 May, 13:26
Ray Gauss II (JIRA) [jira] [Assigned] (TIKA-1115) ExifHandler throws NullPointerException Wed, 01 May, 16:50
Ray Gauss II (JIRA) [jira] [Commented] (TIKA-1115) ExifHandler throws NullPointerException Wed, 01 May, 16:58
Lee Graber (JIRA) [jira] [Commented] (TIKA-1115) ExifHandler throws NullPointerException Wed, 01 May, 17:04
Ray Gauss II (JIRA) [jira] [Resolved] (TIKA-1115) ExifHandler throws NullPointerException Wed, 01 May, 17:46
Apache Jenkins Server Build failed in Jenkins: Tika-trunk #994 Wed, 01 May, 21:01
Ray Gauss II Re: Build failed in Jenkins: Tika-trunk #994 Wed, 01 May, 21:12
Michael McCandless Re: Build failed in Jenkins: Tika-trunk #994 Wed, 01 May, 21:24
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk » Apache Tika parsers #995 Thu, 02 May, 01:47
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk #995 Thu, 02 May, 01:47
Ray Gauss II Re: Build failed in Jenkins: Tika-trunk #994 Thu, 02 May, 01:56
Nick Burch (JIRA) [jira] [Commented] (TIKA-788) DWG parser infinite loop on possibly corrupt file Thu, 09 May, 11:55
kiran (JIRA) [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values Sun, 12 May, 19:49
Andreas Hubold (JIRA) [jira] [Updated] (TIKA-967) Tika comes with transitive Maven dependency to a test artifact of vorbis-java-core Mon, 13 May, 09:39
Markus Jelsma (JIRA) [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values Mon, 13 May, 15:19
Markus Jelsma (JIRA) [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values Mon, 13 May, 15:27
Dave Meikle (JIRA) [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values Mon, 13 May, 18:01
Pankaj Kumar Re: [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values Mon, 13 May, 20:04
Mattmann, Chris A (398J) Wanting to contribute to Tika (was Re: [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values) Mon, 13 May, 20:33
Markus Jelsma (JIRA) [jira] [Resolved] (TIKA-992) OpenGraph meta tags to allow multiple values Tue, 14 May, 09:03
Jukka Zitting (JIRA) [jira] [Resolved] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding Tue, 14 May, 15:41
Lee Graber (JIRA) [jira] [Created] (TIKA-1118) OOXML parser throws when relationship points to 0 byte embedded part Tue, 14 May, 20:55
Nick Burch (JIRA) [jira] [Commented] (TIKA-1118) OOXML parser throws when relationship points to 0 byte embedded part Tue, 14 May, 21:07
Lee Graber (JIRA) [jira] [Commented] (TIKA-1118) OOXML parser throws when relationship points to 0 byte embedded part Tue, 14 May, 23:40
Lee Graber (JIRA) [jira] [Created] (TIKA-1119) HSLFExtractor throws if PictureData is not readable Wed, 15 May, 00:12
Nick Burch (JIRA) [jira] [Commented] (TIKA-1118) OOXML parser throws when relationship points to 0 byte embedded part Wed, 15 May, 00:32
Nick Burch (JIRA) [jira] [Comment Edited] (TIKA-1118) OOXML parser throws when relationship points to 0 byte embedded part Wed, 15 May, 00:34
Nick Burch (JIRA) [jira] [Commented] (TIKA-1119) HSLFExtractor throws if PictureData is not readable Wed, 15 May, 00:42
Lee Graber (JIRA) [jira] [Commented] (TIKA-1119) HSLFExtractor throws if PictureData is not readable Wed, 15 May, 15:59
Oliver Kopp (JIRA) [jira] [Created] (TIKA-1120) Enable direct use of org.apache.tika.mime.MediaType.detect(...) Sat, 18 May, 16:13
Dave Meikle (JIRA) [jira] [Created] (TIKA-1121) Socket server text parsing error on large text files Sun, 19 May, 22:33
Tejas Patil (JIRA) [jira] [Created] (TIKA-1122) Tika fails to parse chm files Tue, 21 May, 02:00
Nick Burch (JIRA) [jira] [Commented] (TIKA-1122) Tika fails to parse chm files Tue, 21 May, 16:33
Bernhard Berger (JIRA) [jira] [Created] (TIKA-1123) Add more mimetypes for famous programming languages Wed, 22 May, 07:51
Bernhard Berger (JIRA) [jira] [Updated] (TIKA-1123) Add more mimetypes for famous programming languages Wed, 22 May, 07:53
Tim Allison (JIRA) [jira] [Created] (TIKA-1124) Nested documents not extracted if a PDF file is in the chain Thu, 23 May, 18:43
Tim Allison (JIRA) [jira] [Updated] (TIKA-1124) Nested documents not extracted if a PDF file is in the chain Thu, 23 May, 18:45
Stenger (JIRA) [jira] [Created] (TIKA-1125) Why does tika-app-0.9.jar contain slf4j? Fri, 24 May, 10:53
Ali Mosavian (JIRA) [jira] [Created] (TIKA-1126) text/html procuder for tika-server Fri, 24 May, 13:24
Ali Mosavian (JIRA) [jira] [Updated] (TIKA-1126) text/html procuder for tika-server Fri, 24 May, 13:26
Dave Meikle (JIRA) [jira] [Commented] (TIKA-1123) Add more mimetypes for famous programming languages Sat, 25 May, 08:52
Dave Meikle (JIRA) [jira] [Resolved] (TIKA-1123) Add more mimetypes for famous programming languages Sat, 25 May, 08:52
Nick Burch (JIRA) [jira] [Resolved] (TIKA-1125) Why does tika-app-0.9.jar contain slf4j? Sat, 25 May, 22:35
Nick Burch (JIRA) [jira] [Commented] (TIKA-1125) Why does tika-app-0.9.jar contain slf4j? Sat, 25 May, 22:35
Dave Meikle (JIRA) [jira] [Resolved] (TIKA-1126) text/html procuder for tika-server Sun, 26 May, 11:38
Dave Meikle (JIRA) [jira] [Commented] (TIKA-1126) text/html procuder for tika-server Sun, 26 May, 11:38
Ali Mosavian (JIRA) [jira] [Commented] (TIKA-1126) text/html procuder for tika-server Mon, 27 May, 11:46
stdexcept tika pull request: Similar to TIKA-1126, this commit adds the ability to pr... Mon, 27 May, 14:26
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1086) Tika-bundle 1.3 does not import org.w3c.dom package Mon, 27 May, 16:50
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1067) Tika extracts non-existent asterisks (*) from .ppt files Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1079) Word document hits AIOOBE in SummaryExtractor.parseSummaries Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1109) Metadata not extracted before the context in OOXML (pptx) Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1078) TikaCLI: invalid characters in embedded document name causes FNFE when trying to save Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1072) AIOOBE when handling embedded document in .doc file Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1046) Get "java.util.zip.ZipException: unknown compression method" when indexing ppf97-file containing wmf-image Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1054) Problem with parsing excel date formats Mon, 27 May, 16:52
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1045) Unsupported AutoCAD drawing version: AC1014 Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1037) No text extracted from Excel file (rus chars) Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1017) DefaultHtmlMapper misses some safe elements Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1108) Represent individual slides in pptx Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1111) Class loading issues when running in OSGi environment Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1102) Can we add <div> to the list of heuristics for bad html fragments? Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1107) Can't parse velocity file Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-988) We don't extract a placeholder for a Word document embedded in an Excel document Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1110) Incorrectly declared SUPPORTED_TYPES in ChmParser. Mon, 27 May, 16:54
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-993) Language Detection Fault Mon, 27 May, 16:56
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1004) Support "ansi" as an alias for windows-1252 charset Mon, 27 May, 16:56
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1057) document content property "Status" is not extracted for *.doc files Mon, 27 May, 16:56
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1120) Enable direct use of org.apache.tika.mime.MediaType.detect(...) Mon, 27 May, 16:56
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-978) OSGi bundle build fails if space exists in build path Mon, 27 May, 16:56
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1079) Word document hits AIOOBE in SummaryExtractor.parseSummaries Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-995) XHTMLContentHandler doesn't pass attributes of body element Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1106) CLAVIN Integration Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1076) Upgrade to Apache POI 3.9 Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-817) (PPT/PPTX) Missing date/time in text content. Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1059) Better Handling of InterruptedException in ExternalParser and ExternalEmbedder Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true) Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-605) Tika GDAL parser Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-774) ExifTool Parser Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-980) MicrodataContentHandler for Apache Tika Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-987) Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1086) Tika-bundle 1.3 does not import org.w3c.dom package Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1122) Tika fails to parse chm files Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-985) Support for HTML5 elements Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-820) Locator is unset for HTML parser Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1078) TikaCLI: invalid characters in embedded document name causes FNFE when trying to save Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1109) Metadata not extracted before the context in OOXML (pptx) Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1108) Represent individual slides in pptx Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1110) Incorrectly declared SUPPORTED_TYPES in ChmParser. Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-776) ExifTool Embedder Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-988) We don't extract a placeholder for a Word document embedded in an Excel document Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Updated] (TIKA-1072) AIOOBE when handling embedded document in .doc file Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Created] (TIKA-1127) text/xml for tika-server Mon, 27 May, 16:58
Chris A. Mattmann (JIRA) [jira] [Resolved] (TIKA-1127) text/xml for tika-server Mon, 27 May, 17:05
Mattmann, Chris A (398J) [DISCUSS] Apache Tika 1.4 RC? Mon, 27 May, 17:06
Michael McCandless Re: [DISCUSS] Apache Tika 1.4 RC? Mon, 27 May, 17:57