Hi,

I have indexed text using the standard analyzer. Say there are two words,
*10000* and *10,000*.
When I search, I parse the query text as follows:

QueryParser queryParser = new QueryParser(
    Lucene.Net.Util.Version.LUCENE_29,
    "Contents",
    new Lucene.Net.Analysis.Standard.StandardAnalyzer(new string[] { "" }));
queryParser.SetMultiTermRewriteMethod(MultiTermQuery.SCORING_BOOLEAN_QUERY_REWRITE);
Query questionQuery = queryParser.Parse("10000");
// and, in a second search:
questionQuery = queryParser.Parse("10,000");

Now, what I want is for the search to ignore the '*,*' in the search term.
Apparently, my code doesn't support that. Can anyone tell me what I am
missing?
One more thing: it doesn't matter whether I search for *Nov 23, 2010* or
*Nov 23 2010*. The results are the same, but not when the '*,*' appears
between digits.


Regards,
Umer.

  • Andrew Busby at Nov 23, 2010 at 9:44 am
    Hi Umer,

    I would think this has more to do with white space than with commas.

    Looking at the example data you provided, the StandardAnalyzer will break
    it into tokens on the white space, so the tokenized strings are
    Nov | 23 | , | 2010
    Nov | 23, | 2010

    (I actually think the lone comma is removed.) So it would be the Nov and
    the 2010 that are matched, whether the middle token is 23, or 23.

    As your 10000 and 10,000 are string values with no white space, they are
    treated as different search terms.

    This is a guess, but it might help fire some thoughts in your mind.

    Andrew

    -----Original Message-----
    From: Umer Khalid Qureshi
    Sent: 23 November 2010 09:35
    To: [email protected]
    Subject: Is comma a special character in Lucene query parsing.
  • Digy digy at Nov 23, 2010 at 10:04 am
    Again, the same answer:

    http://mail-archives.apache.org/mod_mbox/lucene-lucene-net-user/201011.mbox/%3[email protected]%3E

    Use the code and see what StandardAnalyzer does with your input string.

    DIGY

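One possible way to get the behaviour asked for in the original question, sketched under assumptions rather than taken from the thread: if 10000 and 10,000 end up as different terms, the comma can be stripped from between digits before the text reaches the analyzer. The same normalization would have to be applied to the text at index time and to the search string before queryParser.Parse, otherwise the indexed and searched terms still differ. NumberNormalizer and StripDigitCommas are hypothetical names, not part of Lucene.Net, and the helper uses only the .NET regular expression classes.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper (not part of Lucene.Net): removes commas that sit
// directly between two digits, so "10,000" becomes "10000".
public static class NumberNormalizer
{
    // Lookbehind/lookahead keep the digits and match only the comma itself.
    private static readonly Regex DigitGroupComma =
        new Regex(@"(?<=\d),(?=\d)", RegexOptions.Compiled);

    public static string StripDigitCommas(string text)
    {
        if (text == null) throw new ArgumentNullException("text");
        return DigitGroupComma.Replace(text, string.Empty);
    }
}

// Usage (assumed): normalize both the text being indexed and the search string.
//   NumberNormalizer.StripDigitCommas("10,000")        -> "10000"
//   NumberNormalizer.StripDigitCommas("Nov 23, 2010")  -> "Nov 23, 2010" (unchanged)
```

A comma followed by white space, as in Nov 23, 2010, is left alone, so date searches behave as before. The trade-off is that a comma-separated list of digits written without spaces, such as 1,2,3, would also be merged into one number.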
Discussion Overview
group: lucene-net-user
categories: lucene
posted: Nov 23, '10 at 9:35a
active: Nov 23, '10 at 10:04a
posts: 3
users: 3
website: lucene.apache.org