org.apache.lucene.analysis.cn.smart.hhmm | SmartChineseAnalyzer Hidden Markov Model package. |
AnalyzerProfile | Manages analysis data configuration for SmartChineseAnalyzer. SmartChineseAnalyzer has a built-in dictionary and stopword list out of the box. | code | html |
CharType | Internal SmartChineseAnalyzer character type constants. | code | html |
SentenceTokenizer | Tokenizes input text into sentences. | code | html |
SmartChineseAnalyzer | SmartChineseAnalyzer is an analyzer for Chinese or mixed Chinese-English text. | code | html |
SmartChineseAnalyzer.DefaultSetHolder | Atomically loads the DEFAULT_STOP_SET in a lazy fashion once the outer class accesses the static final set the first time. | code | html |
SmartChineseAnalyzer.SavedStreams | | code | html |
Utility | SmartChineseAnalyzer utility constants and methods. WARNING: The status of the analyzers/smartcn analysis.cn.smart package is experimental. | code | html |
WordSegmenter | Segment a sentence of Chinese text into words. | code | html |
WordTokenFilter | A TokenFilter that breaks sentences into words. | code | html |
WordType | Internal SmartChineseAnalyzer token type constants. WARNING: The status of the analyzers/smartcn analysis.cn.smart package is experimental. | code | html |