lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From tushar kapoor <tushar_kapoor...@rediffmail.com>
Subject Encoded search string & qt=Dismax
Date Tue, 02 Dec 2008 18:06:44 GMT

Hi,

I am facing problems while searching for some encoded text as part of the
search query string. The results don't come up when I use some url encoding
with qt=dismaxrequest.

I am searching a Russian word by posting a URL encoded UTF8 transformation
of the word. The query works fine for normal request. However, no docs are
fetched when qt=dismaxrequest is appended as part of the query string.

The word being searched is -
Russian Word - Предварительное 

UTF8 Java Encoding -
\u041f\u0440\u0435\u0434\u0432\u0430\u0440\u0438\u0442\u0435\u043b\u044c\u043d\u043e\u0435

Posted query string (URL Encoded) - 
%5Cu041f%5Cu0440%5Cu0435%5Cu0434%5Cu0432%5Cu0430%5Cu0440%5Cu0438%5Cu0442%5Cu0435%5Cu043b%5Cu044c%5Cu043d%5Cu043e%5Cu0435

Following are the two queries and the difference in results

Query 1 - this one works fine

?q=%5Cu041f%5Cu0440%5Cu0435%5Cu0434%5Cu0432%5Cu0430%5Cu0440%5Cu0438%5Cu0442%5Cu0435%5Cu043b%5Cu044c%5Cu043d%5Cu043e%5Cu0435

Result -

<?xml version="1.0" encoding="UTF8" ?> 
 <response>
 <lst name="responseHeader">
  <int name="status">0</int> 
  <int name="QTime">0</int> 
 <lst name="params">
  <str
name="q">\u041f\u0440\u0435\u0434\u0432\u0430\u0440\u0438\u0442\u0435\u043b\u044c\u043d\u043e\u0435</str>

  </lst>
  </lst>
 <result name="response" numFound="1" start="0">
 <doc>
  <str name="Index_Type_s">productIndex</str> 
  <str name="Index_Type_str_s">productIndex</str> 
  <str name="URL_s">4100018</str> 
  <str name="URL_str_s">4100018</str> 
 <arr name="all">
  <str>productIndex</str> 
  <str>product</str> 
  <str>Предварительное K математики учебная книга</str>

  <str>4100018</str> 
  <str>4100018</str> 
  <str>210000125</str> 
  <str>910000048</str> 
  <str>910000047</str> 
  </arr>
  <str name="editionTypeId_s">210000125</str> 
  <str name="editionTypeId_str_s">210000125</str> 
 <arr name="listOf_taxonomyPath">
  <str>910000048</str> 
  <str>910000047</str> 
  </arr>
  <str name="prdMainTitle_s">Предварительное K математики учебная
книга</str> 
  <str name="prdMainTitle_str_s">Предварительное K математики
учебная
книга</str> 
  <str name="productType_s">product</str> 
  <str name="productType_str_s">product</str> 
 <arr name="strlistOf_taxonomyPath">
  <str>910000048</str> 
  <str>910000047</str> 
  </arr>
  <date name="timestamp">20081202T08:14:05.63Z</date> 
  </doc>
  </result>
  </response>

Query 2 - qt=dismaxrequest - This doesnt work

?q=%5Cu041f%5Cu0440%5Cu0435%5Cu0434%5Cu0432%5Cu0430%5Cu0440%5Cu0438%5Cu0442%5Cu0435%5Cu043b%5Cu044c%5Cu043d%5Cu043e%5Cu0435&qt=dismaxrequest

Result -
  <?xml version="1.0" encoding="UTF8" ?> 
 <response>
 <lst name="responseHeader">
  <int name="status">0</int> 
  <int name="QTime">109</int> 
 <lst name="params">
  <str
name="q">\u041f\u0440\u0435\u0434\u0432\u0430\u0440\u0438\u0442\u0435\u043b\u044c\u043d\u043e\u0435</str>

  <str name="qt">dismaxrequest</str> 
  </lst>
  </lst>
  <result name="response" numFound="0" start="0" maxScore="0.0" /> 
  </response>

Dont know why there is a difference on appending qt=dismaxrequest. Any help
would be appreciated.


Regards,
Tushar.
-- 
View this message in context: http://www.nabble.com/Encoded--search-string---qt%3DDismax-tp20797703p20797703.html
Sent from the Solr - User mailing list archive at Nabble.com.


Mime
View raw message