asterixdb-dev mailing list archives

From Chen Li <che...@gmail.com>
Subject Re: [jira] [Created] (ASTERIXDB-1208) ngram tokenizer failure with negative length
Date Wed, 02 Dec 2015 00:25:17 GMT
@Taewoo: can you help?

On Tue, Dec 1, 2015 at 2:26 PM, Wenhai (JIRA) <jira@apache.org> wrote:
> Wenhai created ASTERIXDB-1208:
> ---------------------------------
>
>              Summary: ngram tokenizer failure with negative length
>                  Key: ASTERIXDB-1208
>                  URL: https://issues.apache.org/jira/browse/ASTERIXDB-1208
>              Project: Apache AsterixDB
>           Issue Type: Bug
>           Components: Hyracks Core
>             Reporter: Wenhai
>
>
> drop dataverse test if exists;
> create dataverse test;
> use dataverse test;
> create type DBLPOpenType as open {
>   id: int64,
>   dblpid: string,
>   authors: string,
>   misc: string
> }
> create dataset DBLPOpen(DBLPOpenType) primary key id;
> insert into dataset DBLPOpen { "id": 93, "dblpid": "journals/iandc/IbarraJCR91", "authors": "Some Classes of Languages in NC¹", "misc": "2006-04-25 86-106 Inf. Comput. January 1991 90 1 db/journals/iandc/iandc90.html#IbarraJCR91" }
>
> use dataverse test;
> set import-private-functions 'true'
> for $d in dataset DBLPOpen
> where similarity-jaccard(gram-tokens("",3,false),gram-tokens($d.title,3,false)) >= 0.5
> return {"rec": $d}
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)
