tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Mattmann <mattm...@apache.org>
Subject Re: [EXTERNAL] Regarding unicodeencode Error
Date Thu, 09 Jan 2020 05:37:27 GMT
OK can you please post an issue http://issues.apache.org/jira/browse/TIKA and attach your
document and specific error? Thanks!

 

 

 

From: "Gowda,Sumanth" <Sumanth.Gowda@Cerner.com>
Date: Wednesday, January 8, 2020 at 9:36 PM
To: Chris Mattmann <mattmann@apache.org>
Subject: RE: [EXTERNAL] Regarding unicodeencode Error

 

Tika python

 

 

From: Chris Mattmann <mattmann@apache.org> 
Sent: Thursday, January 9, 2020 8:47 AM
To: Gowda,Sumanth <Sumanth.Gowda@Cerner.com>
Cc: dev@tika.apache.org
Subject: Re: [EXTERNAL] Regarding unicodeencode Error

 

Hi Sumanth,

 

Are you using Tika Python? Or plain Tika in Java?

 

Can you file a ticket and share the PDF?

 

Cheers,

Chris

 

 

 

 

From: "Gowda,Sumanth" <Sumanth.Gowda@Cerner.com>
Date: Wednesday, January 8, 2020 at 12:58 AM
To: "Mattmann, Chris A (US 1760)" <chris.a.mattmann@jpl.nasa.gov>
Subject: [EXTERNAL] Regarding unicodeencode Error

 

Hi Chris,

 

I was trying to read a pdf using the tika parser and I am getting a unicodeencodeError  for
u2013.Any idea how I can resolve this?

 

Thanks,

Sumanth Gowda

 

Sent from Mail for Windows 10

 

  

CONFIDENTIALITY NOTICE This message and any included attachments are from Cerner Corporation
and are intended only for the addressee. The information contained in this message is confidential
and may constitute inside or non-public information under international, federal, or state
securities laws. Unauthorized forwarding, printing, copying, distribution, or use of such
information is strictly prohibited and may be unlawful. If you are not the addressee, please
promptly delete this message and notify the sender of the delivery error by e-mail or you
may call Cerner's corporate offices in Kansas City, Missouri, U.S.A at (+1) (816)221-1024.


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message