Adriano Crestani commented on LUCENE-1576:
------------------------------------------
FYI, this topic was already discussed on this thread: http://markmail.org/thread/5wjjl6jx4yoxake5
Brazilian Analyzer doesn't remove stopwords when uppercase is given
-------------------------------------------------------------------
Key: LUCENE-1576
URL: https://issues.apache.org/jira/browse/LUCENE-1576
Project: Lucene - Java
Issue Type: Bug
Components: contrib/analyzers
Affects Versions: 2.3.3, 2.4.2, 2.9, 3.0
Environment: not applicable
Reporter: Douglas Campos
Original Estimate: 0.25h
Remaining Estimate: 0.25h
The order of filters matter here, just need to apply lowercase token filter before removing stopwords
result = new StopFilter( result, stoptable );
result = new BrazilianStemFilter( result, excltable );
// Convert to lowercase after stemming!
result = new LowerCaseFilter( result );
Lowercase must come before BrazilianStemFilter
At the end of day I'll attach a patch, it's straightforward
---------------------------------------------------------------------
Key: LUCENE-1576
URL: https://issues.apache.org/jira/browse/LUCENE-1576
Project: Lucene - Java
Issue Type: Bug
Components: contrib/analyzers
Affects Versions: 2.3.3, 2.4.2, 2.9, 3.0
Environment: not applicable
Reporter: Douglas Campos
Original Estimate: 0.25h
Remaining Estimate: 0.25h
The order of filters matter here, just need to apply lowercase token filter before removing stopwords
result = new StopFilter( result, stoptable );
result = new BrazilianStemFilter( result, excltable );
// Convert to lowercase after stemming!
result = new LowerCaseFilter( result );
Lowercase must come before BrazilianStemFilter
At the end of day I'll attach a patch, it's straightforward
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org