Grokbase Groups Lucene dev March 2009
FAQ
Brazilian Analyzer doesn't remove stopwords when uppercase is given
-------------------------------------------------------------------

Key: LUCENE-1576
URL: https://issues.apache.org/jira/browse/LUCENE-1576
Project: Lucene - Java
Issue Type: Bug
Components: contrib/analyzers
Affects Versions: 2.3.3, 2.4.2, 2.9, 3.0
Environment: not applicable
Reporter: Douglas Campos


The order of filters matter here, just need to apply lowercase token filter before removing stopwords

result = new StopFilter( result, stoptable );
result = new BrazilianStemFilter( result, excltable );
// Convert to lowercase after stemming!
result = new LowerCaseFilter( result );

Lowercase must come before BrazilianStemFilter

At the end of day I'll attach a patch, it's straightforward

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

Search Discussions

Discussion Posts

Follow ups

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 1 of 7 | next ›
Discussion Overview
groupdev @
categorieslucene
postedMar 27, '09 at 4:55p
activeMar 27, '09 at 7:05p
posts7
users1
websitelucene.apache.org

1 user in discussion

Michael McCandless (JIRA): 7 posts

People

Translate

site design / logo © 2022 Grokbase