Package org.apache.lucene.analysis.ko
Analyzer for Korean.
Class                                Description
DecompoundToken                      A token that was generated from a compound.
DictionaryToken                      A token stored in a KoMorphData.
KoreanAnalyzer                       Analyzer for Korean that uses morphological analysis.
KoreanNumberFilter                   A TokenFilter that normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.
KoreanNumberFilter.NumberBuffer      Buffer that holds a Korean number string and a position index used as a parsed-to marker.
KoreanNumberFilterFactory            Factory for KoreanNumberFilter.
KoreanPartOfSpeechStopFilter         Removes tokens that match a set of part-of-speech tags.
KoreanPartOfSpeechStopFilterFactory  Factory for KoreanPartOfSpeechStopFilter.
KoreanReadingFormFilter              Replaces term text with the ReadingAttribute, which is the Hangul transcription of Hanja characters.
KoreanReadingFormFilterFactory       Factory for KoreanReadingFormFilter.
KoreanTokenizer                      Tokenizer for Korean that uses morphological analysis.
KoreanTokenizer.DecompoundMode       Decompound mode: this determines how the tokenizer handles POS.Type.COMPOUND, POS.Type.INFLECT and POS.Type.PREANALYSIS tokens.
KoreanTokenizerFactory               Factory for KoreanTokenizer.
POS                                  Part of speech classification for Korean based on Sejong corpus classification.
POS.Tag                              Part of speech tag for Korean based on Sejong corpus classification.
POS.Type                             The type of the token.
Token                                Analyzed token with morphological data.
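
The number normalization that KoreanNumberFilter is described as performing can be illustrated with a small stand-alone sketch. This is not Lucene's implementation and does not use the Lucene API: the class name KoreanNumberSketch and its normalize method are hypothetical, and the sketch assumes the input is a well-formed number written only in Hangul numerals (digits 영 through 구, the multipliers 십, 백, 천, and the large units 만, 억, 조). The real filter works on a token stream and may accept other input forms.

```java
import java.util.Map;

// Hypothetical stand-alone illustration; NOT part of org.apache.lucene.analysis.ko.
final class KoreanNumberSketch {
    // Index in this string is the digit value: 영=0, 일=1, ..., 구=9.
    private static final String DIGITS = "영일이삼사오육칠팔구";

    // Multipliers that apply within one four-digit section.
    private static final Map<Character, Long> SMALL_UNITS =
            Map.of('십', 10L, '백', 100L, '천', 1_000L);

    // Units that close a section and scale it as a whole.
    private static final Map<Character, Long> LARGE_UNITS =
            Map.of('만', 10_000L, '억', 100_000_000L, '조', 1_000_000_000_000L);

    // Converts a Hangul numeral such as "삼천이백" to its numeric value.
    static long normalize(String text) {
        long total = 0;    // sum of all sections already closed by a large unit
        long section = 0;  // value of the section currently being read
        long pending = -1; // digit waiting for a unit, or -1 if none

        for (char c : text.toCharArray()) {
            int digit = DIGITS.indexOf(c);
            if (digit >= 0) {
                pending = digit;
            } else if (SMALL_UNITS.containsKey(c)) {
                // A unit with no preceding digit counts once, e.g. 십 = 10.
                section += (pending < 0 ? 1 : pending) * SMALL_UNITS.get(c);
                pending = -1;
            } else if (LARGE_UNITS.containsKey(c)) {
                if (pending >= 0) {
                    section += pending;
                }
                // A bare large unit counts once, e.g. 만 = 10000.
                total += (section == 0 ? 1 : section) * LARGE_UNITS.get(c);
                section = 0;
                pending = -1;
            } else {
                throw new IllegalArgumentException("Not a Hangul numeral: " + c);
            }
        }
        return total + section + Math.max(pending, 0);
    }

    public static void main(String[] args) {
        System.out.println(normalize("삼천이백")); // prints 3200
    }
}
```

The sketch keeps a running section value so that a large unit such as 만 scales everything read since the previous large unit. That is why "일억이천만" evaluates to 120000000 rather than being read digit by digit.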