tika-dev mailing list archives: January 2012

Site index · List index
Message list1 · 2 · Next »Thread · Author · Date
Jan H√łydahl (Commented) (JIRA) [jira] [Commented] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3 Wed, 18 Jan, 14:04
Alexander Chow (Commented) (JIRA) [jira] [Commented] (TIKA-851) M4V and M4A detection invalid Fri, 27 Jan, 15:10
Alexander Chow (Commented) (JIRA) [jira] [Commented] (TIKA-851) M4V and M4A detection invalid Fri, 27 Jan, 18:42
Alexander Chow (Commented) (JIRA) [jira] [Commented] (TIKA-851) M4V and M4A detection invalid Fri, 27 Jan, 19:12
Alexander Chow (Commented) (JIRA) [jira] [Commented] (TIKA-851) M4V and M4A detection invalid Sat, 28 Jan, 20:20
Alexander Chow (Created) (JIRA) [jira] [Created] (TIKA-851) M4V magic detection invalid Fri, 27 Jan, 13:11
Alexander Chow (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-851) M4V and M4A detection invalid Fri, 27 Jan, 19:12
Alexander Chow (Updated) (JIRA) [jira] [Updated] (TIKA-851) M4V and M4A detection invalid Fri, 27 Jan, 14:52
Alexander Chow (Updated) (JIRA) [jira] [Updated] (TIKA-851) M4V and M4A detection invalid Fri, 27 Jan, 15:06
Andrew Jackson (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Mon, 16 Jan, 14:26
Andrew Jackson (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Mon, 16 Jan, 16:52
Andrew Jackson (Commented) (JIRA) [jira] [Commented] (TIKA-847) Add regular expression support to the MagicDetector Tue, 17 Jan, 11:10
Andrew Jackson (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Wed, 18 Jan, 10:47
Andrew Jackson (Commented) (JIRA) [jira] [Commented] (TIKA-849) Identify and parse the Apple iBooks format Tue, 24 Jan, 11:30
Andrew Jackson (Commented) (JIRA) [jira] [Commented] (TIKA-849) Identify and parse the Apple iBooks format Tue, 24 Jan, 11:54
Andrew Jackson (Created) (JIRA) [jira] [Created] (TIKA-847) Add regular expression support to the MagicDetector Tue, 17 Jan, 11:08
Andrew Jackson (Created) (JIRA) [jira] [Created] (TIKA-849) Identify and parse the Apple iBooks format Mon, 23 Jan, 13:30
Andrew Jackson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-847) Add regular expression support to the MagicDetector Tue, 17 Jan, 11:10
Andrew Jackson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-849) Identify and parse the Apple iBooks format Mon, 23 Jan, 13:36
Andrew Jackson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-849) Identify and parse the Apple iBooks format Tue, 24 Jan, 11:54
Andrew Jackson (Updated) (JIRA) [jira] [Updated] (TIKA-849) Identify and parse the Apple iBooks format Mon, 23 Jan, 13:36
Antoni Mylka (Commented) (JIRA) [jira] [Commented] (TIKA-854) No text extraction for Word macroenabled template Tue, 31 Jan, 14:36
Apache Jenkins Server Build failed in Jenkins: Tika-trunk #768 Tue, 03 Jan, 06:06
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk #769 Tue, 03 Jan, 19:12
Apache Jenkins Server Build failed in Jenkins: Tika-trunk #785 Fri, 27 Jan, 17:06
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk #786 Fri, 27 Jan, 18:13
Chris A. Mattmann (Commented) (JIRA) [jira] [Commented] (TIKA-737) Use (Incubating) ODFToolkit to improve ODF file format processing Fri, 06 Jan, 04:37
Chris A. Mattmann (Commented) (JIRA) [jira] [Commented] (TIKA-846) Ability to Parse RDF Bag Elements in XML Mon, 16 Jan, 19:43
Chris A. Mattmann (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-846) Ability to Parse RDF Bag Elements in XML Tue, 17 Jan, 00:55
Chris A. Mattmann (Resolved) (JIRA) [jira] [Resolved] (TIKA-824) Extract rel attr with LinkContentHandler Tue, 03 Jan, 04:20
Devin Han [ANNOUNCEMENT][THANKS] Apache ODF Toolkit(Incubating) 0.5-incubating Release Mon, 16 Jan, 12:59
Devin Han Re: [ANNOUNCEMENT][THANKS] Apache ODF Toolkit(Incubating) 0.5-incubating Release Tue, 17 Jan, 04:35
Etienne Jouvin (Commented) (JIRA) [jira] [Commented] (TIKA-695) Custom properties on xlsx, docx, pptx Thu, 05 Jan, 10:25
Etienne Jouvin (Updated) (JIRA) [jira] [Updated] (TIKA-694) On extraction, get properties AND / OR content extraction Wed, 04 Jan, 19:23
Etienne Jouvin (Updated) (JIRA) [jira] [Updated] (TIKA-695) Custom properties on xlsx, docx, pptx Wed, 04 Jan, 19:25
Etienne Jouvin (Updated) (JIRA) [jira] [Updated] (TIKA-694) On extraction, get properties AND / OR content extraction Wed, 04 Jan, 19:25
Fabian Lange (Commented) (JIRA) [jira] [Commented] (TIKA-526) OOXMLParser fails to extract text from within smart tags Sun, 01 Jan, 14:39
Fabian Lange (Commented) (JIRA) [jira] [Commented] (TIKA-838) EmptyParser Singleton should be final Tue, 03 Jan, 18:04
Fabian Lange (Created) (JIRA) [jira] [Created] (TIKA-837) Make inner classes static for performance reasons Sun, 01 Jan, 15:51
Fabian Lange (Created) (JIRA) [jira] [Created] (TIKA-838) EmptyParser Singleton should be final Sun, 01 Jan, 16:05
Fabian Lange (Updated) (JIRA) [jira] [Updated] (TIKA-837) Make inner classes static for performance reasons Sun, 01 Jan, 15:51
Fabian Lange (Updated) (JIRA) [jira] [Updated] (TIKA-838) EmptyParser Singleton should be final Sun, 01 Jan, 16:07
Franz Canaval (Resolved) (JIRA) [jira] [Resolved] (TIKA-796) Tika breaks words of rotated text in PDF documents Fri, 20 Jan, 08:54
John Mastarone (Commented) (JIRA) [jira] [Commented] (TIKA-853) java.io.IOException with TikaGUI and testMP4.m4a Mon, 30 Jan, 23:41
John Mastarone (Created) (JIRA) [jira] [Created] (TIKA-839) TikaException with testPPT.potm in Tika GUI / CLI Wed, 11 Jan, 03:33
John Mastarone (Created) (JIRA) [jira] [Created] (TIKA-853) java.io.IOException with TikaGUI and testMP4.m4a Mon, 30 Jan, 04:01
John Mastarone (Updated) (JIRA) [jira] [Updated] (TIKA-839) TikaException with testPPT.potm in Tika GUI / CLI Wed, 11 Jan, 03:35
John Mastarone (Updated) (JIRA) [jira] [Updated] (TIKA-839) TikaException with testPPT.potm in Tika GUI / CLI Wed, 11 Jan, 03:37
Jukka Zitting Re: Sharing metadata logic between parsers Mon, 30 Jan, 14:50
Jukka Zitting Re: Sharing metadata logic between parsers Mon, 30 Jan, 15:16
Jukka Zitting Re: Sharing metadata logic between parsers Mon, 30 Jan, 15:52
Jukka Zitting (Commented) (JIRA) [jira] [Commented] (TIKA-843) Support for Date without a Time Component Fri, 20 Jan, 16:53
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-838) EmptyParser Singleton should be final Tue, 03 Jan, 18:12
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-86) Support magic(5) files Mon, 16 Jan, 17:11
Julien Nioche Re: % of different content types out there on the web Sun, 29 Jan, 16:29
Ken Krugler (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Mon, 16 Jan, 18:01
Ken Krugler (Commented) (JIRA) [jira] [Commented] (TIKA-844) Ability to Define an Internal Text Bag Property Mon, 16 Jan, 19:13
Ken Krugler (Resolved) (JIRA) [jira] [Resolved] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3 Wed, 18 Jan, 14:30
Markus Jelsma Re: % of different content types out there on the web Tue, 31 Jan, 12:39
Markus Jelsma Re: % of different content types out there on the web Tue, 31 Jan, 14:54
Mattmann, Chris A (388J) Re: [ANNOUNCEMENT][THANKS] Apache ODF Toolkit(Incubating) 0.5-incubating Release Mon, 16 Jan, 16:22
Mattmann, Chris A (388J) % of different content types out there on the web Sat, 28 Jan, 02:01
Mattmann, Chris A (388J) Re: % of different content types out there on the web Tue, 31 Jan, 14:55
Maxim Valyanskiy (Created) (JIRA) [jira] [Created] (TIKA-854) No text extraction Word macroenabled template Tue, 31 Jan, 14:22
Maxim Valyanskiy (Resolved) (JIRA) [jira] [Resolved] (TIKA-854) No text extraction for Word macroenabled template Tue, 31 Jan, 14:40
Maxim Valyanskiy (Updated) (JIRA) [jira] [Updated] (TIKA-854) No text extraction Word macroenabled template Tue, 31 Jan, 14:26
Maxim Valyanskiy (Updated) (JIRA) [jira] [Updated] (TIKA-854) No text extraction for Word macroenabled template Tue, 31 Jan, 14:34
Nick Burch Re: ExifTool Parser Conventions Mon, 23 Jan, 12:59
Nick Burch Re: How to Convert Doc or Docx File to HTML? Sun, 29 Jan, 13:42
Nick Burch Sharing metadata logic between parsers Mon, 30 Jan, 14:40
Nick Burch Re: Sharing metadata logic between parsers Mon, 30 Jan, 14:59
Nick Burch Re: Sharing metadata logic between parsers Mon, 30 Jan, 15:20
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-826) TikaException / OfficeXmlFileException with .xlsb files Tue, 03 Jan, 05:14
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-838) EmptyParser Singleton should be final Tue, 03 Jan, 05:20
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-837) Make inner classes static for performance reasons Tue, 03 Jan, 05:24
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-695) Custom properties on xlsx, docx, pptx Thu, 05 Jan, 02:40
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-695) Custom properties on xlsx, docx, pptx Thu, 12 Jan, 15:01
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-840) OOXML parser content type setting Fri, 13 Jan, 15:03
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser Fri, 13 Jan, 16:05
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-805) improvements in XSLFPowerPointExtractorDecorator Mon, 16 Jan, 10:42
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-841) User supplied parsers should be preferred Mon, 16 Jan, 12:22
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-87) MimeTypes should allow modification of MIME types Mon, 16 Jan, 12:28
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Mon, 16 Jan, 12:32
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Mon, 16 Jan, 14:44
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-86) Support magic(5) files Mon, 16 Jan, 17:01
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library Mon, 16 Jan, 20:13
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library Mon, 16 Jan, 21:03
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library Mon, 16 Jan, 21:13
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-841) User supplied parsers should be preferred Wed, 18 Jan, 14:28
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document Fri, 20 Jan, 15:01
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-507) Parser for font files Fri, 20 Jan, 15:59
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-843) Support for Date without a Time Component Fri, 20 Jan, 16:47
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-843) Support for Date without a Time Component Fri, 20 Jan, 17:09
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-848) NullPointerException in SecurityHandler.addDictionaryAndSubDictionary(SecurityHandler.java:185) Mon, 23 Jan, 00:04
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-848) NullPointerException in SecurityHandler.addDictionaryAndSubDictionary(SecurityHandler.java:185) Mon, 23 Jan, 00:20
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-818) Allow PDFBox to be used with RandomAccessFile vs RandomAccessBuffer to allow for a memory vs performance tradeoff Mon, 23 Jan, 10:00
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-846) Ability to Parse RDF Bag Elements in XML Mon, 23 Jan, 15:41
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-844) Ability to Define an Internal Text Bag Property Mon, 23 Jan, 15:47
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-845) Check for Existing Value in Multi-Value Fields in XML Metadata Handler Mon, 23 Jan, 16:09
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-849) Identify and parse the Apple iBooks format Mon, 23 Jan, 16:31
Message list1 · 2 · Next »Thread · Author · Date
Box list
Jul 201986
Jun 2019172
May 2019328
Apr 2019194
Mar 201956
Feb 201985
Jan 2019222
Dec 2018158
Nov 2018339
Oct 2018298
Sep 2018267
Aug 2018171
Jul 2018235
Jun 2018200
May 2018228
Apr 2018138
Mar 2018368
Feb 2018249
Jan 2018128
Dec 2017176
Nov 2017263
Oct 2017142
Sep 2017236
Aug 2017214
Jul 2017364
Jun 2017310
May 2017493
Apr 2017426
Mar 2017405
Feb 2017235
Jan 2017375
Dec 2016359
Nov 2016351
Oct 2016385
Sep 2016476
Aug 2016242
Jul 2016197
Jun 2016328
May 2016344
Apr 2016620
Mar 2016423
Feb 2016463
Jan 2016296
Dec 2015185
Nov 2015170
Oct 2015320
Sep 2015388
Aug 2015397
Jul 2015323
Jun 2015307
May 2015317
Apr 2015475
Mar 2015891
Feb 2015445
Jan 2015601
Dec 2014253
Nov 2014389
Oct 2014481
Sep 2014364
Aug 2014393
Jul 2014328
Jun 2014671
May 2014298
Apr 2014161
Mar 2014226
Feb 2014293
Jan 2014150
Dec 2013155
Nov 201384
Oct 2013100
Sep 201386
Aug 2013103
Jul 2013146
Jun 2013138
May 2013126
Apr 201374
Mar 201370
Feb 2013174
Jan 2013205
Dec 2012109
Nov 2012124
Oct 2012118
Sep 201261
Aug 2012173
Jul 2012274
Jun 2012102
May 2012174
Apr 2012180
Mar 2012200
Feb 2012125
Jan 2012189
Dec 2011287
Nov 2011259
Oct 2011336
Sep 2011356
Aug 2011197
Jul 2011120
Jun 2011122
May 2011184
Apr 2011137
Mar 2011161
Feb 2011111
Jan 201185
Dec 201099
Nov 2010252
Oct 2010144
Sep 2010168
Aug 2010253
Jul 2010192
Jun 2010154
May 2010132
Apr 2010115
Mar 201090
Feb 201062
Jan 2010134
Dec 2009125
Nov 2009179
Oct 200989
Sep 2009115
Aug 200946
Jul 200977
Jun 200994
May 200981
Apr 200936
Mar 200996
Feb 200974
Jan 200993
Dec 2008112
Nov 2008147
Oct 200854
Sep 2008108
Aug 200826
Jul 200817
Jun 200820
May 200816
Apr 200844
Mar 200873
Feb 200836
Jan 200888
Dec 200785
Nov 2007100
Oct 2007424
Sep 2007265
Aug 200719
Jul 200730
Jun 200751
May 200721
Apr 200712
Mar 200712