lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shai Erera (JIRA)" <j...@apache.org>
Subject [jira] Closed: (LUCENE-606) Change behavior of ParallelReader.document(int)
Date Thu, 27 Jan 2011 10:17:44 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Shai Erera closed LUCENE-606.
-----------------------------

    Resolution: Won't Fix

Closing due to long inactivity, and also because I don't think it's a bug. IndexReaders today
expose a getSequentialSubReaders API which allows one to fetch a Document's fields from whatever
sub-reader they want. Also, the general concept behind ParallelReader is that fields do not
exist in more than 1 sub-reader.

> Change behavior of ParallelReader.document(int)
> -----------------------------------------------
>
>                 Key: LUCENE-606
>                 URL: https://issues.apache.org/jira/browse/LUCENE-606
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: 2.0.0
>            Reporter: Christian Kohlschuetter
>            Priority: Minor
>         Attachments: ParallelReaderTest2.java, patch-allfields.diff
>
>
> Currently, the returned documents contain, for each field, the stored data from all enclosed
IndexReaders which contain the corresponding field.
> That is, a call to ParallelReader.document(doc).getFields(fieldName) returns an array
of possibly several Field objects. Since null entries are disallowed, there is no way to determine
to which IndexReader the field data exactly belongs.
> On the other side, a search for a term on that field only yields results if that term
was contained in the *first* matching IndexReader which contained the field.
> Thus, when merging the ParallelReader contents to another IndexWriter, the indexed data
does not correspond to the stored information.
> I am not sure whether this can be considered a bug (in some cases, this may exactly be
required). However I would like to see an option to change this behaviour.
> I suggest a parameter for ParallelReader which specifies whether stored data from all
IndexReaders or only from the one which is repsonsible for the field's indexed data will be
returned by ParallelReader.document(int).
> Please find my proposed implementation attached, as well as a JUnit testcase.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message