Thanks for the congratulations, Doug!

The credits for the Lucene side of the work really go to Michael, and
to the entire Lucene group - this community sometimes came up with
patches faster than we could ask for them.

To answer your question: How is Lucene used in this product?
- Needless to mention that we use Lucene to index and search documents.
- The documents are gathered by web and file system crawlers that we
took from OmniFind Enterprise Edition, improved and adapted to the
small-footprint of Yahoo! Edition.
- For analysis, we use IBM's LanguageWare text analytics packaged into
the UIMA framework - no "vanilla" Lucene analyzers used. This part
was a little tricky because UIMA's document processing model (analyze
the entire document at once) differs from Lucene's, which analyzes
each field separately.
- For search, we extended QueryParser for LanguageWare-specific handling
of base forms, stopword. and synonyms. Oh, and we tuned the scoring a
- A lot of the work actually went into the infrastructure that puts it
all together - configuration, administration, APIs etc.

All together, it was a thrill to work with Lucene, it made a lot of things
a whole lot easier.

- Andreas.

To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

Search Discussions

Discussion Posts


Follow ups

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 3 of 4 | next ›
Discussion Overview
groupdev @
postedDec 13, '06 at 6:54p
activeDec 14, '06 at 10:03a



site design / logo © 2019 Grokbase