Class UserDictionary
java.lang.Object
org.apache.lucene.analysis.ko.dict.UserDictionary
- All Implemented Interfaces:
Dictionary
Class for building a User Dictionary. This class allows for adding custom nouns (세종) or compounds
(세종시 세종 시).
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.lucene.analysis.ko.dict.Dictionary
Dictionary.Morpheme -
Field Summary
FieldsModifier and TypeFieldDescriptionprivate final TokenInfoFSTprivate static final shortprivate static final shortprivate static final shortprivate static final shortprivate final short[]private final int[][]private static final int -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptiongetFST()intgetLeftId(int wordId) Get left id of specified wordgetLeftPOS(int wordId) Get the leftPOS.Tagof specfied word.getMorphemes(int wordId, char[] surfaceForm, int off, int len) Get the morphemes of specified word (e.g.getPOSType(int wordId) Get thePOS.Typeof specified word (morpheme, compound, inflect or pre-analysis)getReading(int wordId) Get the reading of specified word (mainly used for Hanja to Hangul conversion).intgetRightId(int wordId) Get right id of specified wordgetRightPOS(int wordId) Get the rightPOS.Tagof specfied word.intgetWordCost(int wordId) Get word cost of specified wordlookup(char[] chars, int off, int len) Lookup words in textstatic UserDictionary
-
Field Details
-
fst
-
WORD_COST
private static final int WORD_COST- See Also:
-
LEFT_ID
private static final short LEFT_ID- See Also:
-
RIGHT_ID
private static final short RIGHT_ID- See Also:
-
RIGHT_ID_T
private static final short RIGHT_ID_T- See Also:
-
RIGHT_ID_F
private static final short RIGHT_ID_F- See Also:
-
segmentations
private final int[][] segmentations -
rightIds
private final short[] rightIds
-
-
Constructor Details
-
UserDictionary
- Throws:
IOException
-
-
Method Details
-
open
- Throws:
IOException
-
getFST
-
getLeftId
public int getLeftId(int wordId) Description copied from interface:DictionaryGet left id of specified word- Specified by:
getLeftIdin interfaceDictionary
-
getRightId
public int getRightId(int wordId) Description copied from interface:DictionaryGet right id of specified word- Specified by:
getRightIdin interfaceDictionary
-
getWordCost
public int getWordCost(int wordId) Description copied from interface:DictionaryGet word cost of specified word- Specified by:
getWordCostin interfaceDictionary
-
getPOSType
Description copied from interface:DictionaryGet thePOS.Typeof specified word (morpheme, compound, inflect or pre-analysis)- Specified by:
getPOSTypein interfaceDictionary
-
getLeftPOS
Description copied from interface:DictionaryGet the leftPOS.Tagof specfied word.For
POS.Type.MORPHEMEandPOS.Type.COMPOUNDthe left and right POS are the same.- Specified by:
getLeftPOSin interfaceDictionary
-
getRightPOS
Description copied from interface:DictionaryGet the rightPOS.Tagof specfied word.For
POS.Type.MORPHEMEandPOS.Type.COMPOUNDthe left and right POS are the same.- Specified by:
getRightPOSin interfaceDictionary
-
getReading
Description copied from interface:DictionaryGet the reading of specified word (mainly used for Hanja to Hangul conversion).- Specified by:
getReadingin interfaceDictionary
-
getMorphemes
Description copied from interface:DictionaryGet the morphemes of specified word (e.g. 가깝으나: 가깝 + 으나).- Specified by:
getMorphemesin interfaceDictionary
-
lookup
Lookup words in text- Parameters:
chars- textoff- offset into textlen- length of text- Returns:
- array of wordId
- Throws:
IOException
-