lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Benson Margulies <ben...@basistech.com>
Subject Re: Where is the source for the .dat files in Kuromoji?
Date Mon, 02 Dec 2013 23:38:02 GMT
On Mon, Dec 2, 2013 at 6:27 PM, Christian Moen <cm@atilika.com> wrote:

> Hello Benson,
>
> The sources for the .dat files are available from
>
>
> https://mecab.googlecode.com/files/mecab-ipadic-2.7.0-20070801.tar.gz
>
> http://atilika.com/releases/mecab-ipadic/mecab-ipadic-2.7.0-20070801.tar.gz




>
> and a range of other places.
>
> I’m not sure I follow what you’re saying regarding unk.def -- it’s to my
> knowledge used as-is from the above sources when the binary .dat files are
> made.  (See lucene/analysis/kuromoji/src/tools in the Lucene code tree.)
>
> Perhaps I’m missing something.  Could you clarify how you think things
> should be done?
>

I'm not clear that there's anything that anyone would complain of. The
question is, are the .dat files part of the source bundle that is the
'official release'? I just fetched from git, not from the official release,
so I don't know.







>
> Many thanks,
>
> Christian Moen
> アティリカ株式会社
> http://www.atilika.com
>
> On Dec 3, 2013, at 2:11 AM, Benson Margulies <benson@basistech.com> wrote:
>
> > There are a handful of binary files in
> ./src/resources/org/apache/lucene/analysis/ja/dict/ with filenames ending
> in .dat.
> >
> > Trailing around in the source, it seems as if at least one of these
> derives from a source file named "unk.def".  In turn, this file comes from
> a dependency. should the build generate the file rather than having it in
> the tree and shipped as part of the source release?
> >
> >
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message