java.lang.Object
org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.StopwordAnalyzerBase
org.apache.lucene.analysis.core.StopAnalyzer
- All Implemented Interfaces:
Closeable,AutoCloseable
- Since:
- 3.1
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer
Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents -
Field Summary
Fields inherited from class org.apache.lucene.analysis.StopwordAnalyzerBase
stopwordsFields inherited from class org.apache.lucene.analysis.Analyzer
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY -
Constructor Summary
ConstructorsConstructorDescriptionStopAnalyzer(Reader stopwords) Builds an analyzer with the stop words from the given reader.StopAnalyzer(Path stopwordsFile) Builds an analyzer with the stop words from the given path.StopAnalyzer(CharArraySet stopWords) Builds an analyzer with the stop words from the given set. -
Method Summary
Modifier and TypeMethodDescriptionprotected Analyzer.TokenStreamComponentscreateComponents(String fieldName) CreatesAnalyzer.TokenStreamComponentsused to tokenize all the text in the providedReader.protected TokenStreamnormalize(String fieldName, TokenStream in) Wrap the givenTokenStreamin order to apply normalization filters.Methods inherited from class org.apache.lucene.analysis.StopwordAnalyzerBase
getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSetMethods inherited from class org.apache.lucene.analysis.Analyzer
attributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, initReaderForNormalization, normalize, tokenStream, tokenStream
-
Constructor Details
-
StopAnalyzer
Builds an analyzer with the stop words from the given set.- Parameters:
stopWords- Set of stop words
-
StopAnalyzer
Builds an analyzer with the stop words from the given path.- Parameters:
stopwordsFile- File to load stop words from- Throws:
IOException- See Also:
-
StopAnalyzer
Builds an analyzer with the stop words from the given reader.- Parameters:
stopwords- Reader to load stop words from- Throws:
IOException- See Also:
-
-
Method Details
-
createComponents
CreatesAnalyzer.TokenStreamComponentsused to tokenize all the text in the providedReader.- Specified by:
createComponentsin classAnalyzer- Parameters:
fieldName- the name of the fields content passed to theAnalyzer.TokenStreamComponentssink as a reader- Returns:
Analyzer.TokenStreamComponentsbuilt from aLetterTokenizerfiltered withStopFilter
-
normalize
Description copied from class:AnalyzerWrap the givenTokenStreamin order to apply normalization filters. The default implementation returns theTokenStreamas-is. This is used byAnalyzer.normalize(String, String).
-