Uses of Interface
org.apache.lucene.util.Unwrappable
Packages that use Unwrappable
Package
Description
Text analysis.
Analyzer for Arabic.
Analyzer for Bulgarian.
Analyzer for Bengali Language.
Provides various convenience classes for creating boosts on Tokens.
Analyzer for Brazilian Portuguese.
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
Analyzer for Sorani Kurdish.
Fast, general-purpose grammar-based tokenizers.
Construct n-grams for frequently occurring terms and phrases.
A filter that decomposes compound words you find in many Germanic languages into the word parts.
Basic, general-purpose analysis components.
Analyzer for Czech.
Analyzer for German.
Analyzer for Greek.
Analyzer for English.
Analyzer for Spanish.
Analyzer for Persian.
Analyzer for Finnish.
Analyzer for French.
Analyzer for Irish.
Analyzer for Galician.
Analyzer for Hindi.
Analyzer for Hungarian.
A Java implementation of Hunspell stemming and spell-checking algorithms (Hunspell), and a stemming TokenFilter (HunspellStemFilter) based on it.
Analysis components based on ICU
Analyzer for Indonesian.
Analyzer for Indian languages.
Analyzer for Italian.
Analyzer for Japanese.
Analyzer for Korean.
Analyzer for Latvian.
MinHash filtering (for LSH).
Miscellaneous Tokenstreams.
Character n-gram tokenizers and filters.
Analyzer for Norwegian.
Set of components for pattern-based (regex) analysis.
Provides various convenience classes for creating payloads on Tokens.
Analysis components for phonetic search.
Analyzer for Portuguese.
Filter to reverse token text.
Analyzer for Russian.
Word n-gram filters.
Analyzer for Serbian.
Stempel: Algorithmic Stemmer
Analyzer for Swedish.
Analysis components for Synonyms.
Analysis components for Synonyms using Word2Vec model.
Analyzer for Telugu Language.
Analyzer for Turkish.
Utility functions for text analysis.
Code to maintain and access indices.
Misc index tools and index support.
Monitoring framework
Experimental index-related classes
Code to search indices.
Highlighting search terms.
Analyzer based autosuggest.
Support for document suggestion
The UnifiedHighlighter -- a flexible highlighter that can get offsets from postings, term vectors, or analysis.
Uses of Unwrappable in org.apache.lucene.analysis
Classes in org.apache.lucene.analysis that implement Unwrappable
Modifier and Type, Class, Description
final class: This class can be used if the token attributes of a TokenStream are intended to be consumed more than once.
class: Abstract base class for TokenFilters that may remove tokens.
class: An abstract TokenFilter that exposes its input stream as a graph
class: Normalizes token text to lower case.
class: Removes stop words from a token stream.
class: A TokenFilter is a TokenStream whose input is another TokenStream.
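The classes listed above are wrappers: each holds another object of the same kind and delegates to it. A minimal sketch of how such a chain can be peeled back to its innermost delegate follows. It does not use Lucene itself: `Wrapper`, `Stream`, `Source`, `Filter`, `unwrapAll` and `layers` are hypothetical stand-ins defined here, and the assumption that an unwrappable type exposes a single `unwrap()` method returning its delegate is not shown on this page.

```java
import java.util.ArrayList;
import java.util.List;

public class UnwrapSketch {

  /** Hypothetical stand-in for an unwrappable wrapper (not the Lucene interface). */
  interface Wrapper<T> {
    T unwrap();
  }

  /** Minimal stand-in for a token stream. */
  interface Stream {
    String name();
  }

  /** A source stream that wraps nothing. */
  static final class Source implements Stream {
    public String name() {
      return "source";
    }
  }

  /** A filter that wraps another stream and exposes it through unwrap(). */
  static final class Filter implements Stream, Wrapper<Stream> {
    private final String name;
    private final Stream input;

    Filter(String name, Stream input) {
      this.name = name;
      this.input = input;
    }

    public String name() {
      return name;
    }

    public Stream unwrap() {
      return input;
    }
  }

  /** Peels wrappers one layer at a time until a non-wrapper is reached. */
  @SuppressWarnings("unchecked")
  static Stream unwrapAll(Stream s) {
    while (s instanceof Wrapper) {
      s = ((Wrapper<Stream>) s).unwrap();
    }
    return s;
  }

  /** Collects the name of every layer, outermost first. */
  @SuppressWarnings("unchecked")
  static List<String> layers(Stream s) {
    List<String> names = new ArrayList<>();
    names.add(s.name());
    while (s instanceof Wrapper) {
      s = ((Wrapper<Stream>) s).unwrap();
      names.add(s.name());
    }
    return names;
  }

  public static void main(String[] args) {
    // Two filters stacked on one source, as in a typical analysis chain.
    Stream chain = new Filter("stop", new Filter("lowercase", new Source()));
    System.out.println(layers(chain));
    System.out.println(unwrapAll(chain).name());
  }
}
```

The loop form is chosen over recursion so that arbitrarily deep chains of filters do not grow the call stack.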
Uses of Unwrappable in org.apache.lucene.analysis.ar
Classes in org.apache.lucene.analysis.ar that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies ArabicNormalizer to normalize the orthography.
final class: A TokenFilter that applies ArabicStemmer to stem Arabic words.
Uses of Unwrappable in org.apache.lucene.analysis.bg
Classes in org.apache.lucene.analysis.bg that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies BulgarianStemmer to stem Bulgarian words.
Uses of Unwrappable in org.apache.lucene.analysis.bn
Classes in org.apache.lucene.analysis.bn that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies BengaliNormalizer to normalize the orthography.
final class: A TokenFilter that applies BengaliStemmer to stem Bengali words.
Uses of Unwrappable in org.apache.lucene.analysis.boost
Classes in org.apache.lucene.analysis.boost that implement Unwrappable
Modifier and Type, Class, Description
final class: Characters before the delimiter are the "token", those after are the boost.
Uses of Unwrappable in org.apache.lucene.analysis.br
Classes in org.apache.lucene.analysis.br that implement Unwrappable
Uses of Unwrappable in org.apache.lucene.analysis.cjk
Classes in org.apache.lucene.analysis.cjk that implement Unwrappable
Modifier and Type, Class, Description
final class: Forms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer.
final class: A TokenFilter that normalizes CJK width differences: folds fullwidth ASCII variants into the equivalent basic latin; folds halfwidth Katakana variants into the equivalent kana
Uses of Unwrappable in org.apache.lucene.analysis.ckb
Classes in org.apache.lucene.analysis.ckb that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies SoraniNormalizer to normalize the orthography.
final class: A TokenFilter that applies SoraniStemmer to stem Sorani words.
Uses of Unwrappable in org.apache.lucene.analysis.classic
Classes in org.apache.lucene.analysis.classic that implement Unwrappable
Uses of Unwrappable in org.apache.lucene.analysis.commongrams
Classes in org.apache.lucene.analysis.commongrams that implement Unwrappable
Modifier and Type, Class, Description
final class: Construct bigrams for frequently occurring terms while indexing.
final class: Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.
Uses of Unwrappable in org.apache.lucene.analysis.compound
Classes in org.apache.lucene.analysis.compound that implement Unwrappable
Modifier and Type, Class, Description
class: Base class for decomposition token filters.
class: A TokenFilter that decomposes compound words found in many Germanic languages.
class: A TokenFilter that decomposes compound words found in many Germanic languages.
Uses of Unwrappable in org.apache.lucene.analysis.core
Classes in org.apache.lucene.analysis.core that implement Unwrappable
Modifier and Type, Class, Description
final class: Folds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9).
final class: Converts an incoming graph token stream, such as one from SynonymGraphFilter, into a flat form so that all nodes form a single linear chain with no side paths.
final class: Normalizes token text to lower case.
final class: Removes stop words from a token stream.
final class: Removes tokens whose types appear in a set of blocked types from a token stream.
final class: Normalizes token text to UPPER CASE.
Uses of Unwrappable in org.apache.lucene.analysis.cz
Classes in org.apache.lucene.analysis.cz that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies CzechStemmer to stem Czech words.
Uses of Unwrappable in org.apache.lucene.analysis.de
Classes in org.apache.lucene.analysis.de that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies GermanLightStemmer to stem German words.
final class: A TokenFilter that applies GermanMinimalStemmer to stem German words.
final class: Normalizes German characters according to the heuristics of the German2 snowball algorithm.
final class: A TokenFilter that stems German words.
Uses of Unwrappable in org.apache.lucene.analysis.el
Classes in org.apache.lucene.analysis.el that implement Unwrappable
Modifier and Type, Class, Description
final class: Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.
final class: A TokenFilter that applies GreekStemmer to stem Greek words.
Uses of Unwrappable in org.apache.lucene.analysis.en
Classes in org.apache.lucene.analysis.en that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies EnglishMinimalStemmer to stem English words.
final class: TokenFilter that removes possessives (trailing 's) from words.
final class: A high-performance kstem filter for english.
final class: Transforms the token stream as per the Porter stemming algorithm.
Uses of Unwrappable in org.apache.lucene.analysis.es
Classes in org.apache.lucene.analysis.es that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies SpanishLightStemmer to stem Spanish words.
final class: Deprecated. Use SpanishPluralStemFilter instead.
final class: A TokenFilter that applies SpanishPluralStemmer to stem Spanish words.
Uses of Unwrappable in org.apache.lucene.analysis.fa
Classes in org.apache.lucene.analysis.fa that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies PersianNormalizer to normalize the orthography.
final class: A TokenFilter that applies PersianStemmer to stem Persian words.
Uses of Unwrappable in org.apache.lucene.analysis.fi
Classes in org.apache.lucene.analysis.fi that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies FinnishLightStemmer to stem Finnish words.
Uses of Unwrappable in org.apache.lucene.analysis.fr
Classes in org.apache.lucene.analysis.fr that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies FrenchLightStemmer to stem French words.
final class: A TokenFilter that applies FrenchMinimalStemmer to stem French words.
Uses of Unwrappable in org.apache.lucene.analysis.ga
Classes in org.apache.lucene.analysis.ga that implement Unwrappable
Modifier and Type, Class, Description
final class: Normalises token text to lower case, handling t-prothesis and n-eclipsis (i.e., that 'nAthair' should become 'n-athair')
Uses of Unwrappable in org.apache.lucene.analysis.gl
Classes in org.apache.lucene.analysis.gl that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies GalicianMinimalStemmer to stem Galician words.
final class: A TokenFilter that applies GalicianStemmer to stem Galician words.
Uses of Unwrappable in org.apache.lucene.analysis.hi
Classes in org.apache.lucene.analysis.hi that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies HindiNormalizer to normalize the orthography.
final class: A TokenFilter that applies HindiStemmer to stem Hindi words.
Uses of Unwrappable in org.apache.lucene.analysis.hu
Classes in org.apache.lucene.analysis.hu that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies HungarianLightStemmer to stem Hungarian words.
Uses of Unwrappable in org.apache.lucene.analysis.hunspell
Classes in org.apache.lucene.analysis.hunspell that implement Unwrappable
Modifier and Type, Class, Description
final class: TokenFilter that uses hunspell affix rules and words to stem tokens.
Uses of Unwrappable in org.apache.lucene.analysis.icu
Classes in org.apache.lucene.analysis.icu that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.
class: Normalize token text with ICU's Normalizer2
final class: A TokenFilter that transforms text with ICU.
Uses of Unwrappable in org.apache.lucene.analysis.id
Classes in org.apache.lucene.analysis.id that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies IndonesianStemmer to stem Indonesian words.
Uses of Unwrappable in org.apache.lucene.analysis.in
Classes in org.apache.lucene.analysis.in that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies IndicNormalizer to normalize text in Indian Languages.
Uses of Unwrappable in org.apache.lucene.analysis.it
Classes in org.apache.lucene.analysis.it that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies ItalianLightStemmer to stem Italian words.
Uses of Unwrappable in org.apache.lucene.analysis.ja
Classes in org.apache.lucene.analysis.ja that implement Unwrappable
Modifier and Type, Class, Description
final class: Replaces term text with the BaseFormAttribute.
final class: A TokenFilter that adds Japanese romanized tokens to the term attribute.
final class: A TokenFilter that normalizes small letters (捨て仮名) in hiragana into normal letters.
final class: A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
final class: A TokenFilter that normalizes small letters (捨て仮名) in katakana into normal letters.
class: A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.
final class: Removes tokens that match a set of part-of-speech tags.
final class: A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
Uses of Unwrappable in org.apache.lucene.analysis.ko
Classes in org.apache.lucene.analysis.ko that implement Unwrappable
Modifier and Type, Class, Description
class: A TokenFilter that normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.
final class: Removes tokens that match a set of part-of-speech tags.
final class: Replaces term text with the ReadingAttribute which is the Hangul transcription of Hanja characters.
Uses of Unwrappable in org.apache.lucene.analysis.lv
Classes in org.apache.lucene.analysis.lv that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies LatvianStemmer to stem Latvian words.
Uses of Unwrappable in org.apache.lucene.analysis.minhash
Classes in org.apache.lucene.analysis.minhash that implement Unwrappable
Modifier and Type, Class, Description
class: Generate min hash tokens from an incoming stream of tokens.
Uses of Unwrappable in org.apache.lucene.analysis.miscellaneous
Classes in org.apache.lucene.analysis.miscellaneous that implement Unwrappable
Modifier and Type, Class, Description
final class: This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.
final class: A filter to apply normal capitalization rules to Tokens.
final class: Removes words that are too long or too short from the stream.
class: Allows skipping TokenFilters based on the current set of attributes.
class: Filters all tokens that cannot be parsed to a date, using the provided DateFormat.
final class: Characters before the delimiter are the "token", the textual integer after is the term frequency.
final class: Allows Tokens with a given combination of flags to be dropped.
class: Filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.
final class: Deprecated. Fix the token filters that create broken offsets in the first place.
final class: When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.
final class: A TokenFilter that only keeps tokens with text contained in the required words.
class: Marks terms as keywords via the KeywordAttribute.
final class: This TokenFilter emits each incoming token twice once as keyword and once non-keyword, in other words once with KeywordAttribute.setKeyword(boolean) set to true and once set to false.
final class: Removes words that are too long or too short from the stream.
final class: This TokenFilter limits the number of tokens while indexing.
final class: Lets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream.
final class: This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.
final class: Marks terms as keywords via the KeywordAttribute.
class: A ConditionalTokenFilter that only applies its wrapped filters to tokens that are not contained in a protected set.
final class: A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
final class: This filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o.
final class: This filter normalize use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.
final class: Marks terms as keywords via the KeywordAttribute.
final class: Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming.
final class: Trims leading and trailing whitespace from Tokens in the stream.
final class: A token filter for truncating the terms into a specific length.
final class: Adds the TypeAttribute.type() as a synonym, i.e.
final class: Deprecated. Use WordDelimiterGraphFilter instead: it produces a correct token graph so that e.g.
final class: Splits words into subwords and performs optional transformations on subword groups, producing a correct token graph so that e.g.
Uses of Unwrappable in org.apache.lucene.analysis.ngram
Classes in org.apache.lucene.analysis.ngram that implement Unwrappable
Modifier and Type, Class, Description
final class: Tokenizes the given token into n-grams of given size(s).
final class: Tokenizes the input into n-grams of the given size(s).
Uses of Unwrappable in org.apache.lucene.analysis.no
Classes in org.apache.lucene.analysis.no that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies NorwegianLightStemmer to stem Norwegian words.
final class: A TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian words.
final class: This filter normalize use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (ae, oe, aa) by transforming them to åÅæÆøØ.
Uses of Unwrappable in org.apache.lucene.analysis.pattern
Classes in org.apache.lucene.analysis.pattern that implement Unwrappable
Modifier and Type, Class, Description
final class: CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.
final class: A TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.
class: Set a type attribute to a parameterized value when tokens are matched by any of a several regex patterns.
Uses of Unwrappable in org.apache.lucene.analysis.payloads
Classes in org.apache.lucene.analysis.payloads that implement Unwrappable
Modifier and Type, Class, Description
final class: Characters before the delimiter are the "token", those after are the payload.
class: Assigns a payload to a token based on the TypeAttribute
class: Adds the OffsetAttribute.startOffset() and OffsetAttribute.endOffset() First 4 bytes are the start
class: Makes the TypeAttribute a payload.
Uses of Unwrappable in org.apache.lucene.analysis.phonetic
Classes in org.apache.lucene.analysis.phonetic that implement Unwrappable
Modifier and Type, Class, Description
final class: TokenFilter for Beider-Morse phonetic encoding.
final class: Create tokens for phonetic matches based on Daitch–Mokotoff Soundex.
final class: Filter for DoubleMetaphone (supporting secondary codes)
final class: Create tokens for phonetic matches.
Uses of Unwrappable in org.apache.lucene.analysis.pt
Classes in org.apache.lucene.analysis.pt that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies PortugueseLightStemmer to stem Portuguese words.
final class: A TokenFilter that applies PortugueseMinimalStemmer to stem Portuguese words.
final class: A TokenFilter that applies PortugueseStemmer to stem Portuguese words.
Uses of Unwrappable in org.apache.lucene.analysis.reverse
Classes in org.apache.lucene.analysis.reverse that implement Unwrappable
Modifier and Type, Class, Description
final class: Reverse token string, for example "country" => "yrtnuoc".
Uses of Unwrappable in org.apache.lucene.analysis.ru
Classes in org.apache.lucene.analysis.ru that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies RussianLightStemmer to stem Russian words.
Uses of Unwrappable in org.apache.lucene.analysis.shingle
Classes in org.apache.lucene.analysis.shingle that implement Unwrappable
Modifier and Type, Class, Description
final class: A FixedShingleFilter constructs shingles (token n-grams) from a token stream.
final class: A ShingleFilter constructs shingles (token n-grams) from a token stream.
Uses of Unwrappable in org.apache.lucene.analysis.sinks
Classes in org.apache.lucene.analysis.sinks that implement Unwrappable
Modifier and Type, Class, Description
final class: This TokenFilter provides the ability to set aside attribute states that have already been analyzed.
Uses of Unwrappable in org.apache.lucene.analysis.snowball
Classes in org.apache.lucene.analysis.snowball that implement Unwrappable
Modifier and Type, Class, Description
final class: A filter that stems words using a Snowball-generated stemmer.
Uses of Unwrappable in org.apache.lucene.analysis.sr
Classes in org.apache.lucene.analysis.sr that implement Unwrappable
Modifier and Type, Class, Description
final class: Normalizes Serbian Cyrillic and Latin characters to "bald" Latin.
final class: Normalizes Serbian Cyrillic to Latin.
Uses of Unwrappable in org.apache.lucene.analysis.stempel
Classes in org.apache.lucene.analysis.stempel that implement Unwrappable
Modifier and Type, Class, Description
final class: Transforms the token stream as per the stemming algorithm.
Uses of Unwrappable in org.apache.lucene.analysis.sv
Classes in org.apache.lucene.analysis.sv that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies SwedishLightStemmer to stem Swedish words.
final class: A TokenFilter that applies SwedishMinimalStemmer to stem Swedish words.
Uses of Unwrappable in org.apache.lucene.analysis.synonym
Classes in org.apache.lucene.analysis.synonym that implement Unwrappable
Modifier and Type, Class, Description
final class: Deprecated. Use SynonymGraphFilter instead, but be sure to also use FlattenGraphFilter at index time (not at search time) as well.
final class: Applies single- or multi-token synonyms from a SynonymMap to an incoming TokenStream, producing a fully correct graph output.
Uses of Unwrappable in org.apache.lucene.analysis.synonym.word2vec
Classes in org.apache.lucene.analysis.synonym.word2vec that implement Unwrappable
Modifier and Type, Class, Description
final class: Applies single-token synonyms from a Word2Vec trained network to an incoming TokenStream.
Uses of Unwrappable in org.apache.lucene.analysis.te
Classes in org.apache.lucene.analysis.te that implement Unwrappable
Modifier and Type, Class, Description
final class: A TokenFilter that applies TeluguNormalizer to normalize the orthography.
final class: A TokenFilter that applies TeluguStemmer to stem Telugu words.
Uses of Unwrappable in org.apache.lucene.analysis.tr
Classes in org.apache.lucene.analysis.tr that implement Unwrappable
Modifier and Type, Class, Description
final class: Strips all characters after an apostrophe (including the apostrophe itself).
final class: Normalizes Turkish token text to lower case.
Uses of Unwrappable in org.apache.lucene.analysis.util
Classes in org.apache.lucene.analysis.util that implement Unwrappable
Uses of Unwrappable in org.apache.lucene.index
Classes in org.apache.lucene.index that implement Unwrappable
Modifier and Type, Class, Description
static class: Base class for filtering PostingsEnum implementations.
class: A wrapper for MergePolicy instances.
class: A wrapping merge policy that wraps the MergePolicy.OneMerge objects returned by the wrapped merge policy.
final class: This MergePolicy allows to carry over soft deleted documents across merges.
class: This MergePolicy is used for upgrading all existing segments of an index when calling IndexWriter.forceMerge(int).
Uses of Unwrappable in org.apache.lucene.misc.index
Classes in org.apache.lucene.misc.index that implement Unwrappable
Modifier and Type, Class, Description
final class: A merge policy that reorders merged segments according to a BPIndexReorderer.
Uses of Unwrappable in org.apache.lucene.monitor
Classes in org.apache.lucene.monitor that implement Unwrappable
Uses of Unwrappable in org.apache.lucene.sandbox.index
Classes in org.apache.lucene.sandbox.index that implement Unwrappable
Modifier and Type, Class, Description
class: A simple extension to wrap MergePolicy to merge all tiny segments (or at least segments smaller than specified in MergeOnFlushMergePolicy.setSmallSegmentThresholdMB(double)) into one segment on commit.
Uses of Unwrappable in org.apache.lucene.search
Classes in org.apache.lucene.search that implement Unwrappable
Modifier and Type, Class, Description
class: A FilterScorer contains another Scorer, which it uses as its basic source of data, possibly transforming the data along the way or providing additional functionality.
private static class
Uses of Unwrappable in org.apache.lucene.search.highlight
Classes in org.apache.lucene.search.highlight that implement Unwrappable
Modifier and Type, Class, Description
(package private) final class: This is a simplified version of org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter to prevent a dependency on analysis-common.jar.
final class: This TokenFilter limits the number of tokens while indexing by adding up the current offset.
Uses of Unwrappable in org.apache.lucene.search.suggest.analyzing
Classes in org.apache.lucene.search.suggest.analyzing that implement Unwrappable
Modifier and Type, Class, Description
final class: Like StopFilter except it will not remove the last token if that token was not followed by some token separator.
Uses of Unwrappable in org.apache.lucene.search.suggest.document
Classes in org.apache.lucene.search.suggest.document that implement Unwrappable
Modifier and Type, Class, Description
final class: A ConcatenateGraphFilter but we can set the payload and provide access to config options.
private static final class: The ContextSuggestField.PrefixTokenFilter wraps a TokenStream and adds a set prefixes ahead.
Uses of Unwrappable in org.apache.lucene.search.uhighlight
Classes in org.apache.lucene.search.uhighlight that implement Unwrappable
Modifier and Type, Class, Description
private static final class: Wraps an Analyzer and string text that represents multiple values delimited by a specified character.