tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Benson Margulies <bimargul...@gmail.com>
Subject Re: Charset SPI
Date Sat, 06 Nov 2010 19:30:33 GMT
It provides a tiny convenience. It allows people to use
Charset.forName("tikaDetector") and then use the results to apply the
detector to any of the APIs that accept a Charset object. I think it's
nearly cost-free; it requires a class and an SPI text file.

On Sat, Nov 6, 2010 at 3:19 PM, Ken Krugler <kkrugler_lists@transpac.com> wrote:
>
> On Nov 4, 2010, at 7:08am, Benson Margulies wrote:
>
>> Have you all ever considered wiring the CharsetDetector to the
>> java.nio.Charset SPI mechanism as an autodetecting charset?
>
> No, I don't remember this coming up.
>
> Can you provide any additional information about costs and benefits?
>
> Thanks,
>
> -- Ken
>
>>
>> I could knock one off. Would you want it to be a separate JAR or just
>> in the parsers with the detector?
>
> --------------------------
> Ken Krugler
> +1 530-210-6378
> http://bixolabs.com
> e l a s t i c   w e b   m i n i n g
>
>
>
>
>
>

Mime
View raw message