Brazilian Analyzer doesn't remove stopwords when uppercase is given

Key: LUCENE-1576
Project: Lucene - Java
Issue Type: Bug
Components: contrib/analyzers
Affects Versions: 2.3.3, 2.4.2, 2.9, 3.0
Environment: not applicable
Reporter: Douglas Campos

The order of the filters matters here: the lowercase token filter just needs to be applied before the stopwords are removed.

result = new StopFilter( result, stoptable );
result = new BrazilianStemFilter( result, excltable );
// Convert to lowercase after stemming!
result = new LowerCaseFilter( result );

LowerCaseFilter must come first, before both StopFilter and BrazilianStemFilter.
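To illustrate why the order matters, here is a minimal plain-Java sketch that models the chain without depending on Lucene. The class and method names (FilterOrderSketch, removeStopwords, lowerCase) are hypothetical stand-ins, not Lucene APIs. The sketch assumes the stop set holds lowercase entries and is matched case-sensitively, and it leaves stemming out because stemming does not affect the stopword problem.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical model of the analyzer chain; not Lucene code.
class FilterOrderSketch {

    // Stand-in for a stop filter: drops tokens that match the stop set exactly
    // (assumption: case-sensitive matching against lowercase entries).
    static List<String> removeStopwords(List<String> tokens, Set<String> stopwords) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stopwords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    // Stand-in for a lowercase filter.
    static List<String> lowerCase(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(token.toLowerCase(Locale.ROOT));
        }
        return out;
    }

    // Order from the report: stopwords first, lowercase last.
    static List<String> stopThenLower(List<String> tokens, Set<String> stopwords) {
        return lowerCase(removeStopwords(tokens, stopwords));
    }

    // Proposed order: lowercase first, then stopwords.
    static List<String> lowerThenStop(List<String> tokens, Set<String> stopwords) {
        return removeStopwords(lowerCase(tokens), stopwords);
    }

    public static void main(String[] args) {
        Set<String> stop = new HashSet<>(Arrays.asList("de", "para"));
        List<String> tokens = Arrays.asList("Casa", "DE", "Praia");
        System.out.println(stopThenLower(tokens, stop)); // [casa, de, praia]
        System.out.println(lowerThenStop(tokens, stop)); // [casa, praia]
    }
}
```

With the reported order, the uppercase "DE" passes the stopword check and is only lowercased afterwards, so it survives into the output. Lowercasing first lets the same stop set catch it.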

I'll attach a patch at the end of the day; it's straightforward.


posted Mar 27, '09 at 4:55p
active Mar 27, '09 at 7:05p
