fr.gouv.culture.sdx.search.lucene.analysis
Class Analyzer_br

java.lang.Object
  extended by org.apache.lucene.analysis.Analyzer
      extended by fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
          extended by fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
              extended by fr.gouv.culture.sdx.search.lucene.analysis.Analyzer_br
All Implemented Interfaces:
Analyzer, java.io.Serializable, org.apache.avalon.framework.configuration.Configurable, org.apache.avalon.framework.logger.LogEnabled, org.apache.excalibur.xml.sax.XMLizable

Deprecated. Use org.apache.lucene.analysis.br.BrazilianAnalyzer instead.

public class Analyzer_br
extends DefaultAnalyzer

Analyzer for the Brazilian Portuguese language. Supports an external list of stopwords (words that are not indexed at all) and an external list of exclusions (words that are indexed but not stemmed).
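
The difference between the two lists can be illustrated with a minimal, self-contained sketch. It does not use the SDX or Lucene classes: StopAndExclusionSketch, indexedTerms and toyStem are hypothetical names, and the "stemmer" is a toy that only drops a trailing "s". It is assumed here that both lists are matched against the term exactly as written.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

/** Toy illustration only; this is not the SDX or Lucene implementation. */
final class StopAndExclusionSketch {

    /** Hypothetical stand-in for a real stemmer: drops one trailing "s". */
    static String toyStem(String term) {
        return term.length() > 3 && term.endsWith("s")
                ? term.substring(0, term.length() - 1)
                : term;
    }

    /** Returns the terms that would be indexed for the given text. */
    static List<String> indexedTerms(String text, Set<String> stopWords, Set<String> exclusions) {
        List<String> indexed = new ArrayList<>();
        for (String term : text.trim().split("\\s+")) {
            if (term.isEmpty() || stopWords.contains(term)) {
                continue; // stopword: never reaches the index
            }
            // Excluded words are indexed as-is; everything else is stemmed.
            indexed.add(exclusions.contains(term) ? term : toyStem(term));
        }
        return indexed;
    }

    public static void main(String[] args) {
        System.out.println(
                indexedTerms("os livros de minas", Set.of("os", "de"), Set.of("minas")));
    }
}
```

In this sketch "os" and "de" are dropped, "livros" is reduced to "livro", and "minas" is kept unchanged because it is on the exclusion list.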

Version:
$Id: BrazilianAnalyzer.java,v 1.0 2001/02/13 21:29:04
Author:
João Kramer
See Also:
Serialized Form

Field Summary
protected static java.lang.String ANALYZER_TYPE
          Deprecated.  
static java.lang.String[] BRAZILIAN_STOP_WORDS
          Deprecated. List of typical Brazilian stopwords.
 
Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
ATTRIBUTE_EXCLUDE_STEMS, ATTRIBUTE_USE_STOP_WORDS, DEFAULT_STOP_WORDS, EXCLUDE_STEM_ELEMENT, EXCLUDE_STEMS_ELEMENT, excludeTable, stopTable
 
Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
logger
 
Constructor Summary
Analyzer_br(java.io.File stopwords)
          Deprecated. Builds an analyzer with the stop words read from the given file.
Analyzer_br(java.util.Set stopwords)
          Deprecated. Builds an analyzer with the given stop words.
Analyzer_br(java.lang.String[] stopwords)
          Deprecated. Builds an analyzer with the given stop words.
 
Method Summary
protected  java.lang.String getAnalyzerType()
          Deprecated.  
 void setStemExclusionTable(java.io.File exclusionlist)
          Deprecated. Builds an exclusion list from the words contained in the given file.
 void setStemExclusionTable(java.util.Set exclusionlist)
          Deprecated. Builds an exclusion list from a Set.
 void setStemExclusionTable(java.lang.String[] exclusionlist)
          Deprecated. Builds an exclusion list from an array of Strings.
 org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
          Deprecated. Creates a TokenStream which tokenizes all the text in the provided Reader.
 
Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
buildExcludeTable, buildStopTable, configure, getDefaultStopWords, tokenStream
 
Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
enableLogging, toSAX
 
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

ANALYZER_TYPE

protected static final java.lang.String ANALYZER_TYPE
Deprecated. 
See Also:
Constant Field Values

BRAZILIAN_STOP_WORDS

public static final java.lang.String[] BRAZILIAN_STOP_WORDS
Deprecated. 
List of typical Brazilian stopwords.

Constructor Detail

Analyzer_br

public Analyzer_br(java.lang.String[] stopwords)
Deprecated. 
Builds an analyzer with the given stop words.

Parameters:
stopwords - the words that will not be indexed

Analyzer_br

public Analyzer_br(java.util.Set stopwords)
Deprecated. 
Builds an analyzer with the given stop words.

Parameters:
stopwords - the words that will not be indexed

Analyzer_br

public Analyzer_br(java.io.File stopwords)
            throws java.io.IOException
Deprecated. 
Builds an analyzer with the stop words read from the given file.

Parameters:
stopwords - the file containing the words that will not be indexed
Throws:
java.io.IOException
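
This page does not describe the layout of the stop word file. The following sketch assumes one word per line in UTF-8, with blank lines ignored; that format, and the names WordListFileSketch and readWords, are assumptions made for illustration and are not part of the SDX API.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Toy illustration only; the real file format is not documented on this page. */
final class WordListFileSketch {

    /** Reads an assumed one-word-per-line UTF-8 file into a duplicate-free set. */
    static Set<String> readWords(Path file) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
            String word = line.trim();
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("stopwords", ".txt");
        try {
            // Blank lines, surrounding spaces and duplicates are tolerated.
            Files.write(tmp, List.of("de", "", " os ", "de"), StandardCharsets.UTF_8);
            System.out.println(readWords(tmp));
        } finally {
            Files.delete(tmp);
        }
    }
}
```

A set built this way has the type accepted by the Analyzer_br(java.util.Set) constructor, which may be convenient when the word list comes from a source other than a plain file.
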
Method Detail

setStemExclusionTable

public void setStemExclusionTable(java.lang.String[] exclusionlist)
Deprecated. 
Builds an exclusion list from an array of Strings.

Parameters:
exclusionlist - the words that will be indexed but not stemmed

setStemExclusionTable

public void setStemExclusionTable(java.util.Set exclusionlist)
Deprecated. 
Builds an exclusion list from a Set.

Parameters:
exclusionlist - the words that will be indexed but not stemmed

setStemExclusionTable

public void setStemExclusionTable(java.io.File exclusionlist)
                           throws java.io.IOException
Deprecated. 
Builds an exclusion list from the words contained in the given file.

Parameters:
exclusionlist - the file containing the words that will be indexed but not stemmed
Throws:
java.io.IOException

tokenStream

public final org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName,
                                                                java.io.Reader reader)
Deprecated. 
Creates a TokenStream which tokenizes all the text in the provided Reader.

Specified by:
tokenStream in interface Analyzer
Overrides:
tokenStream in class DefaultAnalyzer
Returns:
A TokenStream built from a StandardTokenizer filtered with StandardFilter, StopFilter, BrazilianStemFilter and LowerCaseFilter.
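
The filter chain named in the Returns note can be pictured as a sequence of stages applied to each token. The sketch below uses plain Java streams rather than Lucene classes. It assumes the stages run in the order they are listed (stop filtering, then stemming, then lower-casing), which this page does not confirm. FilterChainSketch, run and the toy stemmer are hypothetical names.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.function.UnaryOperator;
import java.util.stream.Collectors;

/** Toy illustration only; stage order is an assumption taken from the listing. */
final class FilterChainSketch {

    static List<String> run(String text, Set<String> stopWords, UnaryOperator<String> stemmer) {
        return Arrays.stream(text.trim().split("\\s+"))   // stand-in for the tokenizer
                .filter(t -> !t.isEmpty())
                .filter(t -> !stopWords.contains(t))      // stop stage
                .map(stemmer)                             // stem stage
                .map(t -> t.toLowerCase(Locale.ROOT))     // lower-case stage, applied last
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Hypothetical stemmer: drops one trailing "s".
        UnaryOperator<String> toyStem =
                t -> t.endsWith("s") ? t.substring(0, t.length() - 1) : t;
        System.out.println(run("a Casa de pedras", Set.of("a", "de"), toyStem));
    }
}
```

Under the assumed order, the stop stage sees each token in its original case, so in this sketch a capitalized stopword would only be removed if the stop list contained that exact form.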

getAnalyzerType

protected java.lang.String getAnalyzerType()
Deprecated. 
Overrides:
getAnalyzerType in class DefaultAnalyzer
See Also:
fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer#getAnalyserType()


Copyright © 2000-2010 Ministere de la culture et de la communication / AJLSM. All Rights Reserved.