Class TokenInfoDictionaryBuilder
java.lang.Object
org.apache.lucene.analysis.ko.util.TokenInfoDictionaryBuilder
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate Stringprivate Normalizer.Formprivate intInternal word id - incrementally assigned as entries are read and added. -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate TokenInfoDictionaryWriterbuildDictionary(List<Path> csvFiles)
-
Field Details
-
offset
private int offsetInternal word id - incrementally assigned as entries are read and added. This will be byte offset of dictionary file -
encoding
-
normalForm
-
-
Constructor Details
-
TokenInfoDictionaryBuilder
TokenInfoDictionaryBuilder(String encoding, boolean normalizeEntries)
-
-
Method Details
-
build
- Throws:
IOException
-
buildDictionary
- Throws:
IOException
-