Class WordSegmenter
java.lang.Object
org.apache.lucene.analysis.cn.smart.WordSegmenter
Segment a sentence of Chinese text into words.
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionconvertSegToken(SegToken st, String sentence, int sentenceStartOffset) Process aSegTokenso that it is ready for indexing.segmentSentence(String sentence, int startOffset) Segment a sentence into words withHHMMSegmenter
-
Field Details
-
hhmmSegmenter
-
tokenFilter
-
-
Constructor Details
-
WordSegmenter
WordSegmenter()
-
-
Method Details
-
segmentSentence
Segment a sentence into words withHHMMSegmenter -
convertSegToken
Process aSegTokenso that it is ready for indexing.This method calculates offsets and normalizes the token with
SegTokenFilter.
-