FAQ
May it be, that microsoft is found, because the search is not case sensitive (text) and ct is not found because there the search is case sensitive (Keyword)

Did you try
+state:CT +company:"microsoft"~10
^^
?

Marcus
-----Ursprüngliche Nachricht-----
Von: Alain Lauzon
Gesendet: Freitag, 28. Februar 2003 05:10
An: Lucene Users List
Betreff: How is that possible ?


I am running nuts here. I am doing a very simple search that
is supposed
to find me one hit but none is found. This thing works with others
queries, but with some like this one I can't understand. I
even tried to
clean my whole index and add only those two documents.

So if I am looking for companies with the name microsoft in it I find
2. And if I look for companies with microsoft in the name in
the state of
CT it is supposed to find 1, right ? It finds none !!!!!!!

Any clue on that one ?

I am using my own analyzer that is a standard analyzer less a
couple stop
words.

Searching for: +company:"microsoft"~10
2 total matching documents

2 total matching documents in 190 ms.

score(0)=0.4220032
doc(0)=Document<Text<companySicDescription:COMPUTER SOFTWARE>
Keyword<companySicCode:573401> Text<company:MICROSOFT CORP> Keyword<co
mpanyId:50561229> Keyword<tel:860-678-3100> Text<city:FARMINGTON>
Keyword<state:CT> Keyword<country:USA> Unindexed<address:74 BATTER
SON PARK RD> Unindexed<email:>
Unindexed<www:http://WWW.MICROSOFT.COM/NEWENGLAND/>
Unindexed<zip:06032-2565> Keyword<typePackage:3>>

score(1)=0.4220032
doc(1)=Document<Keyword<typePackage:3> Unindexed<zip:85012-2727>
Unindexed<www:http://WWW.MICROSOFT.COM> Unindexed<email:> Unindexed
<address:2929 N CENTRAL AVE> Keyword<country:USA> Keyword<state:AZ>
Text<city:PHOENIX> Keyword<tel:602-266-0302> Keyword<companyId:5
0565446> Text<company:MICROSOFT CORP> Keyword<companySicCode:573401>
Text<companySicDescription:COMPUTER SOFTWARE>>



Searching for: +state:ct +company:"microsoft"~10
0 total matching documents

0 total matching documents in 30 ms.

Thanks for any advice, Alain Lauzon
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Search Discussions

  • Alain Lauzon at Feb 28, 2003 at 12:17 pm

    At 07:16 2003-02-28 +0100, you wrote:
    May it be, that microsoft is found, because the search is not case
    sensitive (text) and ct is not found because there the search is case
    sensitive (Keyword)

    Did you try
    +state:CT +company:"microsoft"~10
    ^^
    ?
    I don't thnik so because the StandardAnalyzer will put everything in
    lowercase. I will try without the StandardAnalyzer.

    Marcus
    -----Ursprüngliche Nachricht-----
    Von: Alain Lauzon
    Gesendet: Freitag, 28. Februar 2003 05:10
    An: Lucene Users List
    Betreff: How is that possible ?


    I am running nuts here. I am doing a very simple search that
    is supposed
    to find me one hit but none is found. This thing works with others
    queries, but with some like this one I can't understand. I
    even tried to
    clean my whole index and add only those two documents.

    So if I am looking for companies with the name microsoft in it I find
    2. And if I look for companies with microsoft in the name in
    the state of
    CT it is supposed to find 1, right ? It finds none !!!!!!!

    Any clue on that one ?

    I am using my own analyzer that is a standard analyzer less a
    couple stop
    words.

    Searching for: +company:"microsoft"~10
    2 total matching documents

    2 total matching documents in 190 ms.

    score(0)=0.4220032
    doc(0)=Document<Text<companySicDescription:COMPUTER SOFTWARE>
    Keyword<companySicCode:573401> Text<company:MICROSOFT CORP> Keyword<co
    mpanyId:50561229> Keyword<tel:860-678-3100> Text<city:FARMINGTON>
    Keyword<state:CT> Keyword<country:USA> Unindexed<address:74 BATTER
    SON PARK RD> Unindexed<email:>
    Unindexed<www:http://WWW.MICROSOFT.COM/NEWENGLAND/>
    Unindexed<zip:06032-2565> Keyword<typePackage:3>>

    score(1)=0.4220032
    doc(1)=Document<Keyword<typePackage:3> Unindexed<zip:85012-2727>
    Unindexed<www:http://WWW.MICROSOFT.COM> Unindexed<email:> Unindexed
    <address:2929 N CENTRAL AVE> Keyword<country:USA> Keyword<state:AZ>
    Text<city:PHOENIX> Keyword<tel:602-266-0302> Keyword<companyId:5
    0565446> Text<company:MICROSOFT CORP> Keyword<companySicCode:573401>
    Text<companySicDescription:COMPUTER SOFTWARE>>



    Searching for: +state:ct +company:"microsoft"~10
    0 total matching documents

    0 total matching documents in 30 ms.

    Thanks for any advice, Alain Lauzon
    ---------------------------------------------------------------------
    To unsubscribe, e-mail: [email protected]
    For additional commands, e-mail: [email protected]


    ---------------------------------------------------------------------
    To unsubscribe, e-mail: [email protected]
    For additional commands, e-mail: [email protected]
  • Tatu Saloranta at Feb 28, 2003 at 2:48 pm

    On Friday 28 February 2003 05:15, Alain Lauzon wrote:
    At 07:16 2003-02-28 +0100, you wrote:
    May it be, that microsoft is found, because the search is not case
    sensitive (text) and ct is not found because there the search is case
    sensitive (Keyword)

    Did you try
    +state:CT +company:"microsoft"~10
    ^^
    ?
    I don't thnik so because the StandardAnalyzer will put everything in
    lowercase. I will try without the StandardAnalyzer.
    Yes, but only fields that are tokenizable. Keywords are not touched, they are
    indexed as is. So if 'state' field is a keyword field, it would be stored in
    upper case (this is explained in FAQ).

    -+ Tatu +-



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: [email protected]
    For additional commands, e-mail: [email protected]

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupjava-user @
categorieslucene
postedFeb 28, '03 at 6:17a
activeFeb 28, '03 at 2:48p
posts3
users3
websitelucene.apache.org

People

Translate

site design / logo © 2023 Grokbase