org.mtzky.lucene.tokenizer
Class UnicodeBlockTokenizer
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.Tokenizer
org.mtzky.lucene.tokenizer.UnicodeBlockTokenizer
- All Implemented Interfaces:
- Closeable
public class UnicodeBlockTokenizer
- extends Tokenizer
Tokenizes the input into unigrams of Unicode code points.
- Author:
- mtzky
- See Also:
StopTermFilter
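As an illustration of what "unigrams of Unicode code points" means, the following is a minimal JDK-only sketch. It does not use Lucene or this class; the class name CodePointUnigrams and the method tokenize are hypothetical names introduced here. The point it shows is that a supplementary character, stored as two Java chars, should still come out as a single token.

```java
import java.util.ArrayList;
import java.util.List;

class CodePointUnigrams {
    /** Splits text into one token per Unicode code point (hypothetical helper). */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        // codePoints() pairs surrogates, so a supplementary character stays in one token.
        text.codePoints().forEach(cp -> tokens.add(new String(Character.toChars(cp))));
        return tokens;
    }

    public static void main(String[] args) {
        // U+20BB7 is written as a surrogate pair; it still yields a single token.
        List<String> tokens = tokenize("a\uD842\uDFB7b");
        System.out.println(tokens.size() + " tokens"); // prints "3 tokens"
    }
}
```

Splitting by char instead of by code point would produce four tokens for the same input, two of them unpaired surrogates.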
|
Method Summary |
void |
end()
|
boolean |
incrementToken()
|
protected int |
read()
Reads a single Unicode code point. |
protected char[] |
readToken()
Returns a character array of the token, or null if the end of the
stream has been reached.
Override to change the behavior. |
void |
reset()
|
void |
reset(Reader input)
|
protected void |
unread(int codePoint)
Pushes back a single Unicode code point to the front of the
push back buffer. |
| Methods inherited from class org.apache.lucene.util.AttributeSource |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString |
UnicodeBlockTokenizer
public UnicodeBlockTokenizer(Reader in)
- Parameters:
in -
UnicodeBlockTokenizer
public UnicodeBlockTokenizer(PushbackReader in)
- Parameters:
in -
incrementToken
public final boolean incrementToken()
throws IOException
- Specified by:
- incrementToken in class TokenStream
- Throws:
IOException
readToken
protected char[] readToken()
throws IOException
Returns a character array of the token, or null if the end of the
stream has been reached.
Override to change the behavior.
- Returns:
- a character array of the token, or
null if the end of the
stream has been reached
- Throws:
IOException
- See Also:
read(),
unread(int),
TypeAttribute
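The note "Override to change the behavior" suggests that readToken() is meant to be built from read() and unread(int). The following JDK-only sketch shows one way such a variant could look: it groups consecutive code points that share a Character.UnicodeBlock, and pushes back the first code point that belongs to a different block. Everything here is an assumption for illustration. BlockRunSketch is a hypothetical stand-alone class, not a subclass of UnicodeBlockTokenizer, and its read(), unread(int), and readToken() only mimic the documented signatures.

```java
import java.io.IOException;
import java.io.PushbackReader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

class BlockRunSketch {
    private final PushbackReader in;

    BlockRunSketch(String text) {
        // Two chars of pushback so a supplementary code point can be unread.
        this.in = new PushbackReader(new StringReader(text), 2);
    }

    /** Reads one code point, or -1 at end of stream (stand-in for read()). */
    int read() throws IOException {
        int first = in.read();
        if (first == -1 || !Character.isHighSurrogate((char) first)) return first;
        int second = in.read();
        if (second != -1 && Character.isLowSurrogate((char) second)) {
            return Character.toCodePoint((char) first, (char) second);
        }
        if (second != -1) in.unread(second); // not a pair; keep it for the next call
        return first;
    }

    /** Pushes one code point back (stand-in for unread(int)). */
    void unread(int codePoint) throws IOException {
        in.unread(Character.toChars(codePoint));
    }

    /** Returns the next run of same-block code points, or null at end of stream. */
    char[] readToken() throws IOException {
        int cp = read();
        if (cp == -1) return null;
        Character.UnicodeBlock block = Character.UnicodeBlock.of(cp);
        StringBuilder sb = new StringBuilder().appendCodePoint(cp);
        while ((cp = read()) != -1) {
            if (Character.UnicodeBlock.of(cp) != block) {
                unread(cp); // belongs to the next token
                break;
            }
            sb.appendCodePoint(cp);
        }
        return sb.toString().toCharArray();
    }

    /** Convenience wrapper that collects all tokens of a string. */
    static List<String> tokens(String text) {
        BlockRunSketch t = new BlockRunSketch(text);
        List<String> out = new ArrayList<>();
        try {
            for (char[] tok = t.readToken(); tok != null; tok = t.readToken()) {
                out.add(new String(tok));
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out;
    }

    public static void main(String[] args) {
        // Latin "ab", two Hiragana letters, then Latin "c".
        System.out.println(tokens("ab\u3042\u3044c").size()); // prints "3"
    }
}
```

The null return at end of stream mirrors the documented contract of readToken(); the grouping rule itself is only one possible behavior.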
read
protected final int read()
throws IOException
Reads a single Unicode code point.
- Returns:
- The code point read, or -1 if the end of the stream has been
reached
- Throws:
IOException
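Reading a code point from a character stream means combining a surrogate pair into one value. The sketch below shows one way to do this with the JDK's PushbackReader. It is not the source of read(); the names CodePointReading, readCodePoint, and codePointsOf are hypothetical, and the handling of unpaired surrogates (returned as-is) is an assumption.

```java
import java.io.IOException;
import java.io.PushbackReader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

class CodePointReading {
    /** Returns the next code point, or -1 if the end of the stream has been reached. */
    static int readCodePoint(PushbackReader in) throws IOException {
        int first = in.read();
        if (first == -1 || !Character.isHighSurrogate((char) first)) {
            return first; // end of stream, a BMP character, or a lone low surrogate
        }
        int second = in.read();
        if (second != -1 && Character.isLowSurrogate((char) second)) {
            return Character.toCodePoint((char) first, (char) second);
        }
        if (second != -1) {
            in.unread(second); // not part of a pair; leave it for the next call
        }
        return first; // unpaired high surrogate, returned unchanged
    }

    /** Collects every code point of a string by reading it as a stream. */
    static List<Integer> codePointsOf(String text) {
        List<Integer> out = new ArrayList<>();
        try (PushbackReader in = new PushbackReader(new StringReader(text))) {
            for (int cp = readCodePoint(in); cp != -1; cp = readCodePoint(in)) {
                out.add(cp);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out;
    }

    public static void main(String[] args) {
        // 'a' followed by U+20BB7 encoded as a surrogate pair.
        System.out.println(codePointsOf("a\uD842\uDFB7")); // prints "[97, 134071]"
    }
}
```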
unread
protected final void unread(int codePoint)
throws IOException
Pushes back a single Unicode code point to the front of the
push back buffer.
- Parameters:
codePoint - Unicode code point to push back
- Throws:
IOException
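Pushing back a code point differs from pushing back a char, because a supplementary code point occupies two chars and both must return to the front of the buffer in order. A minimal JDK-only sketch follows, assuming a PushbackReader with a buffer of at least two chars. The names CodePointPushback, unreadCodePoint, and pushBackThenDrain are hypothetical and not part of this class.

```java
import java.io.IOException;
import java.io.PushbackReader;
import java.io.StringReader;
import java.io.UncheckedIOException;

class CodePointPushback {
    /** Pushes a code point back so that it is the next thing read from the stream. */
    static void unreadCodePoint(PushbackReader in, int codePoint) throws IOException {
        // toChars yields one char for BMP code points and a surrogate pair otherwise;
        // unread(char[]) keeps the array order, so element 0 is read first.
        in.unread(Character.toChars(codePoint));
    }

    /** Pushes codePoint in front of rest, then reads everything back as a string. */
    static String pushBackThenDrain(String rest, int codePoint) {
        // A pushback buffer of 2 chars is enough for any single code point.
        try (PushbackReader in = new PushbackReader(new StringReader(rest), 2)) {
            unreadCodePoint(in, codePoint);
            StringBuilder sb = new StringBuilder();
            int c;
            while ((c = in.read()) != -1) {
                sb.append((char) c);
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(pushBackThenDrain("bc", 'a')); // prints "abc"
    }
}
```

With the default PushbackReader buffer size of one char, unreading a supplementary code point would fail with an IOException, which is why the sketch sets the size explicitly.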
end
public void end()
- Overrides:
- end in class TokenStream
reset
public void reset(Reader input)
throws IOException
- Overrides:
- reset in class Tokenizer
- Throws:
IOException
reset
public void reset()
throws IOException
- Overrides:
- reset in class TokenStream
- Throws:
IOException
Copyright (C) 2010-2011 Mtzky. All Rights Reserved.