FAQ
Hello, I am indexing documents with a field that contains the first and
last name of people. It is working wonderfully with a slight issue: if
Thomas is indexed for a document, I would like searches for Tom to match
that document. I am sure this is a common problem that many of you must have
addressed, but I was unable to find anything in the archives or on Google.
Thanks in advance for the guidance.


Regards,
Khawaja Shams

Search Discussions

  • Ian Lea at Dec 3, 2008 at 9:18 am
    Hi


    To get from Thomas to Tom you'll need to use synonyms. For Thom you
    would have been able to use prefixes or wild cards.

    If you google for lucene synonyms you'll find loads of stuff. Also, I
    believe that Solr has built in support for synonyms.


    --
    Ian.

    On Wed, Dec 3, 2008 at 6:16 AM, Khawaja Shams wrote:
    Hello, I am indexing documents with a field that contains the first and
    last name of people. It is working wonderfully with a slight issue: if
    Thomas is indexed for a document, I would like searches for Tom to match
    that document. I am sure this is a common problem that many of you must have
    addressed, but I was unable to find anything in the archives or on Google.
    Thanks in advance for the guidance.


    Regards,
    Khawaja Shams
    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Ganesh at Dec 3, 2008 at 9:42 am
    If you want to query for Tom, then you need to index the value Tom. Create
    one more field as Alias or add alias name as part of name field.

    Regards
    Ganesh

    ----- Original Message -----
    From: "Khawaja Shams" <ksshams@gmail.com>
    To: <java-user@lucene.apache.org>
    Sent: Wednesday, December 03, 2008 11:46 AM
    Subject: Indexing Names in Lucene -- Thomas = Tom, etc

    Hello, I am indexing documents with a field that contains the first and
    last name of people. It is working wonderfully with a slight issue: if
    Thomas is indexed for a document, I would like searches for Tom to match
    that document. I am sure this is a common problem that many of you must
    have
    addressed, but I was unable to find anything in the archives or on Google.
    Thanks in advance for the guidance.


    Regards,
    Khawaja Shams
    Send instant messages to your online friends http://in.messenger.yahoo.com

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Khawaja Shams at Dec 3, 2008 at 10:03 pm
    Hi, Yes that is pretty obvious that I would have to index Tom, but I think
    you missed the point. I don't have a list of names with their nick names,
    and this is pretty common: Mike being Michael, Richard being Rich or Dick,
    William could be Bill or Will, etc. I thought I would check if there was a
    canonical way of doing this before building my own list from common sir
    names. Thanks for your time.


    Sincerely,
    Khawaja Shams
    On Wed, Dec 3, 2008 at 1:42 AM, Ganesh wrote:

    If you want to query for Tom, then you need to index the value Tom. Create
    one more field as Alias or add alias name as part of name field.

    Regards
    Ganesh

    ----- Original Message ----- From: "Khawaja Shams" <ksshams@gmail.com>
    To: <java-user@lucene.apache.org>
    Sent: Wednesday, December 03, 2008 11:46 AM
    Subject: Indexing Names in Lucene -- Thomas = Tom, etc



    Hello, I am indexing documents with a field that contains the first and
    last name of people. It is working wonderfully with a slight issue: if
    Thomas is indexed for a document, I would like searches for Tom to match
    that document. I am sure this is a common problem that many of you must
    have
    addressed, but I was unable to find anything in the archives or on Google.
    Thanks in advance for the guidance.


    Regards,
    Khawaja Shams
    Send instant messages to your online friends http://in.messenger.yahoo.com
    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Grant Ingersoll at Dec 4, 2008 at 12:13 pm
    I believe these lists exists out on the Internet, just google for
    something like "most common first names" or "common
    nicknames" (yields: http://www.cc.kyoto-su.ac.jp/~trobb/nicklist.html
    for instance)

    If you want to dig deeper, you might look into named entity
    recognition research, and also co-reference resolution research, as
    those problems often have to deal with this problem.

    HTH,
    Grant
    On Dec 3, 2008, at 5:02 PM, Khawaja Shams wrote:

    Hi, Yes that is pretty obvious that I would have to index Tom, but
    I think
    you missed the point. I don't have a list of names with their nick
    names,
    and this is pretty common: Mike being Michael, Richard being Rich or
    Dick,
    William could be Bill or Will, etc. I thought I would check if there
    was a
    canonical way of doing this before building my own list from common
    sir
    names. Thanks for your time.


    Sincerely,
    Khawaja Shams
    On Wed, Dec 3, 2008 at 1:42 AM, Ganesh wrote:

    If you want to query for Tom, then you need to index the value Tom.
    Create
    one more field as Alias or add alias name as part of name field.

    Regards
    Ganesh

    ----- Original Message ----- From: "Khawaja Shams"
    <ksshams@gmail.com>
    To: <java-user@lucene.apache.org>
    Sent: Wednesday, December 03, 2008 11:46 AM
    Subject: Indexing Names in Lucene -- Thomas = Tom, etc



    Hello, I am indexing documents with a field that contains the
    first and
    last name of people. It is working wonderfully with a slight
    issue: if
    Thomas is indexed for a document, I would like searches for Tom to
    match
    that document. I am sure this is a common problem that many of you
    must
    have
    addressed, but I was unable to find anything in the archives or on
    Google.
    Thanks in advance for the guidance.


    Regards,
    Khawaja Shams
    Send instant messages to your online friends http://in.messenger.yahoo.com
    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
    --------------------------
    Grant Ingersoll

    Lucene Helpful Hints:
    http://wiki.apache.org/lucene-java/BasicsOfPerformance
    http://wiki.apache.org/lucene-java/LuceneFAQ











    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupjava-user @
categorieslucene
postedDec 3, '08 at 6:17a
activeDec 4, '08 at 12:13p
posts5
users4
websitelucene.apache.org

People

Translate

site design / logo © 2022 Grokbase