tika-dev mailing list archives: June 2020

阿里木 (Jira) [jira] [Updated] (TIKA-3123) request to parse Chinese, but return Russian Tue, 23 Jun, 07:36
阿里木 (Jira) [jira] [Created] (TIKA-3123) request to parse Chinese, but return Russian Tue, 23 Jun, 07:36
阿里木 (Jira) [jira] [Commented] (TIKA-3123) request to parse Chinese, but return Russian Wed, 24 Jun, 02:23
Andreas Lehmkühler (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Fri, 12 Jun, 16:15
Andreas Lehmkühler (Jira) [jira] [Comment Edited] (TIKA-3111) Upgrade to PDFBox 2.0.20 Fri, 12 Jun, 17:43
Andreas Lehmkühler (Jira) [jira] [Updated] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sat, 13 Jun, 10:13
Andreas Lehmkühler (Jira) [jira] [Comment Edited] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sat, 13 Jun, 10:23
Andreas Lehmkühler (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sat, 13 Jun, 10:23
Andreas Lehmkühler (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sat, 13 Jun, 12:21
Andreas Lehmkühler (Jira) [jira] [Comment Edited] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sun, 14 Jun, 10:45
Andreas Lehmkühler (Jira) [jira] [Updated] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sun, 14 Jun, 12:54
Andreas Lehmkühler (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Sun, 14 Jun, 12:59
Andreas Lehmkühler (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Mon, 15 Jun, 04:58
Christoph Läubrich (Jira) [jira] [Commented] (TIKA-3110) cannot extract metadata from 7z .tar archive Fri, 12 Jun, 17:02
Christoph Läubrich (Jira) [jira] [Commented] (TIKA-3110) cannot extract metadata from 7z .tar archive Fri, 12 Jun, 17:10
Christoph Läubrich (Jira) [jira] [Commented] (TIKA-3110) cannot extract metadata from 7z .tar archive Fri, 12 Jun, 17:47
Milan Vereščák (Jira) [jira] [Created] (TIKA-3127) When using html parser any empty attribute sets value to attribute name e.g. <a href>link</a> gives href="href" Tue, 30 Jun, 17:52
Milan Vereščák (Jira) [jira] [Updated] (TIKA-3127) When using html parser any empty attribute sets value to attribute name e.g. <a href>link</a> gives href="href" Tue, 30 Jun, 17:54
Milan Vereščák (Jira) [jira] [Updated] (TIKA-3127) When using html parser any empty attribute sets value to attribute name e.g. <a href>link</a> gives href="href" Tue, 30 Jun, 17:55
Ondřej Duchoň (Jira) [jira] [Created] (TIKA-3105) OFT format detection based on file content Wed, 03 Jun, 12:19
Ondřej Duchoň (Jira) [jira] [Updated] (TIKA-3105) OFT format detection based on file name (extension) instead of file content Wed, 03 Jun, 13:04
GitBox [GitHub] [tika] KranthiGV commented on a change in pull request #317: fix for TIKA-3089 contributed by pvanderweerd Wed, 03 Jun, 16:10
GitBox [GitHub] [tika] pszemus opened a new pull request #320: tika-mimetypes: Add mimetypes for .mpd, .m3u8 and .m4s Wed, 10 Jun, 09:09
GitBox [GitHub] [tika] deathy opened a new pull request #321: fix for TIKA-3008 contributed by deathy Sun, 14 Jun, 11:05
GitBox [GitHub] [tika] matthewford opened a new pull request #322: Update PDFParser.properties Tue, 16 Jun, 16:20
GitBox [GitHub] [tika] tballison merged pull request #322: Update PDFParser.properties Tue, 16 Jun, 16:45
GitBox [GitHub] [tika] tballison merged pull request #278: TIKA-2830 add heif mimetype support Tue, 16 Jun, 16:50
GitBox [GitHub] [tika] tballison commented on pull request #278: TIKA-2830 add heif mimetype support Tue, 16 Jun, 16:53
GitBox [GitHub] [tika] tballison merged pull request #320: tika-mimetypes: Add MIME types for .mpd, .m3u8 and .m4s Tue, 16 Jun, 16:53
GitBox [GitHub] [tika] tballison merged pull request #272: TIKA-2888 Add wmv2 codec detection for WMV files Tue, 16 Jun, 16:56
GitBox [GitHub] [tika] tballison merged pull request #276: Disable external DTD + Stylesheets with the TransformerFactory Tue, 16 Jun, 16:57
ASF GitHub Bot (Jira) [jira] [Commented] (TIKA-3089) Text should be wrapped in pre-tags instead of in p-tags Wed, 03 Jun, 16:11
ASF GitHub Bot (Jira) [jira] [Commented] (TIKA-3008) Word Doc/Docx Formatting Extraction - Superscript/Subscript Sun, 14 Jun, 11:06
ASF GitHub Bot (Jira) [jira] [Commented] (TIKA-2830) Detect Media type of HEIF file correctly Tue, 16 Jun, 16:51
ASF GitHub Bot (Jira) [jira] [Commented] (TIKA-2830) Detect Media type of HEIF file correctly Tue, 16 Jun, 16:54
ASF GitHub Bot (Jira) [jira] [Commented] (TIKA-2888) Add wmv2 codec detection to ASF container Tue, 16 Jun, 16:57
Adam Gibson (Jira) [jira] [Commented] (TIKA-3119) General upgrades for 1.25 Wed, 24 Jun, 05:03
Alex (Jira) [jira] [Created] (TIKA-3110) cannot extract metadata from 7z .tar archive Wed, 10 Jun, 20:57
Alex (Jira) [jira] [Updated] (TIKA-3110) cannot extract metadata from 7z .tar archive Wed, 10 Jun, 20:58
Carina Antunes (Jira) [jira] [Created] (TIKA-3126) Consider new endpoint (metadata + content non recursive) Tue, 30 Jun, 09:57
Chris Mattmann Re: [EXTERNAL] renaming master? Tue, 16 Jun, 19:22
Chris Mattmann (Jira) [jira] [Commented] (TIKA-3119) General upgrades for 1.25 Sat, 20 Jun, 04:43
Cristian Vat (Jira) [jira] [Commented] (TIKA-3008) Word Doc/Docx Formatting Extraction - Superscript/Subscript Sun, 14 Jun, 11:17
Danny McKinney (Jira) [jira] [Created] (TIKA-3113) Currently Tika is detecting a .aux file as text/html Thu, 11 Jun, 23:29
Dupinder Singh Problem in resolving tika parser in Gradle projects Thu, 04 Jun, 01:47
Dushyanth Balasubramanian (Jira) [jira] [Created] (TIKA-3114) Error reading transcript from document Fri, 12 Jun, 00:00
Dushyanth Balasubramanian (Jira) [jira] [Commented] (TIKA-3114) Error reading transcript from document Fri, 12 Jun, 00:12
Dushyanth Balasubramanian (Jira) [jira] [Commented] (TIKA-3114) Error reading transcript from document Fri, 12 Jun, 23:29
Dushyanth Balasubramanian (Jira) [jira] [Commented] (TIKA-3114) Error reading transcript from document Sat, 13 Jun, 18:23
Hudson (Jira) [jira] [Commented] (TIKA-3101) Include XMPSchemaBasic metadata in xmp metadata extraction Mon, 01 Jun, 15:06
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Mon, 01 Jun, 22:08
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Tue, 02 Jun, 14:06
Hudson (Jira) [jira] [Commented] (TIKA-2961) When detecting a txt document that starts with "caff", Tika wrongly identifies it as the audio/x-caf audio type Tue, 02 Jun, 15:13
Hudson (Jira) [jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension. Tue, 02 Jun, 15:13
Hudson (Jira) [jira] [Commented] (TIKA-3101) Include XMPSchemaBasic metadata in xmp metadata extraction Tue, 02 Jun, 15:13
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Tue, 02 Jun, 15:13
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Wed, 03 Jun, 19:09
Hudson (Jira) [jira] [Commented] (TIKA-3106) Tika Fails to detect some EML files if extension is not .eml Thu, 04 Jun, 06:04
Hudson (Jira) [jira] [Commented] (TIKA-3112) New bugs introduced in Tika-app-1.24.1.jar Thu, 11 Jun, 22:19
Hudson (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Thu, 11 Jun, 22:19
Hudson (Jira) [jira] [Commented] (TIKA-3110) cannot extract metadata from 7z .tar archive Fri, 12 Jun, 20:51
Hudson (Jira) [jira] [Commented] (TIKA-3115) Detect parquet files Fri, 12 Jun, 22:57
Hudson (Jira) [jira] [Commented] (TIKA-2888) Add wmv2 codec detection to ASF container Tue, 16 Jun, 18:16
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3115) Detect parquet files Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3106) Tika Fails to detect some EML files if extension is not .eml Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20 Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3112) NullPointerException at AbstractPDF2XHTML.extractXMPXFA() when using tika-app GUI Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3117) Upgrade to metadata-extractor 2.14.0 Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3110) cannot extract metadata from 7z .tar archive Tue, 16 Jun, 19:16
Hudson (Jira) [jira] [Commented] (TIKA-3117) Upgrade to metadata-extractor 2.14.0 Tue, 16 Jun, 19:18
Hudson (Jira) [jira] [Commented] (TIKA-2888) Add wmv2 codec detection to ASF container Tue, 16 Jun, 19:18
Hudson (Jira) [jira] [Commented] (TIKA-3119) General upgrades for 1.25 Fri, 19 Jun, 20:10
Hudson (Jira) [jira] [Commented] (TIKA-3120) Remove whitelist/blacklist terminology Fri, 19 Jun, 21:38
Hudson (Jira) [jira] [Commented] (TIKA-3119) General upgrades for 1.25 Fri, 19 Jun, 21:38
Hudson (Jira) [jira] [Commented] (TIKA-3120) Remove whitelist/blacklist terminology Fri, 19 Jun, 22:18
Hudson (Jira) [jira] [Commented] (TIKA-3122) Extract inline image metadata without rendering for PDFs Mon, 22 Jun, 17:19
Hudson (Jira) [jira] [Commented] (TIKA-3122) Extract inline image metadata without rendering for PDFs Mon, 22 Jun, 17:38
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Thu, 25 Jun, 22:16
Hudson (Jira) [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode Thu, 25 Jun, 23:13
Ip Smile (Jira) [jira] [Created] (TIKA-3112) New bugs introduced in Tika-app-1.24.1.jar Thu, 11 Jun, 18:35
Ip Smile (Jira) [jira] [Updated] (TIKA-3112) New bugs introduced in Tika-app-1.24.1.jar Thu, 11 Jun, 18:50
Ip Smile (Jira) [jira] [Updated] (TIKA-3112) New bugs introduced in Tika-app-1.24.1.jar Thu, 11 Jun, 19:19
Ip Smile (Jira) [jira] [Commented] (TIKA-3112) New bugs introduced in Tika-app-1.24.1.jar Thu, 11 Jun, 21:53
Ip Smile (Jira) [jira] [Comment Edited] (TIKA-3112) New bugs introduced in Tika-app-1.24.1.jar Thu, 11 Jun, 21:53
Jeroen Steggink (Jira) [jira] [Created] (TIKA-3118) PDFParser: totalCharsPerPage vs. actual chars per page after parsing Fri, 19 Jun, 07:11
Jeroen Steggink (Jira) [jira] [Commented] (TIKA-3118) PDFParser: totalCharsPerPage vs. actual chars per page after parsing Fri, 19 Jun, 20:41
Jeroen Steggink (Jira) [jira] [Commented] (TIKA-3118) PDFParser: totalCharsPerPage vs. actual chars per page after parsing Fri, 19 Jun, 22:41
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3109) Ingest attachment: failed to extract text from iframe Wed, 10 Jun, 21:07
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3109) Ingest attachment: failed to extract text from iframe Wed, 10 Jun, 21:09
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3109) Ingest attachment: failed to extract text from iframe Wed, 10 Jun, 21:18
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3109) Ingest attachment: failed to extract text from iframe Wed, 10 Jun, 21:30
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3109) Ingest attachment: failed to extract text from iframe Wed, 10 Jun, 21:39
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3114) Error reading transcript from document Fri, 12 Jun, 00:06
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3114) Error reading transcript from document Fri, 12 Jun, 00:24
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3115) Detect parquet files Fri, 12 Jun, 20:14
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3115) Detect parquet files Fri, 12 Jun, 21:07
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3114) Error reading transcript from document Fri, 12 Jun, 23:51
Kenneth William Krugler (Jira) [jira] [Commented] (TIKA-3123) request to parse Chinese, but return Russian Tue, 23 Jun, 13:23
Konstantin Gribov (Jira) [jira] [Commented] (TIKA-3121) Rename master branch Fri, 26 Jun, 13:24