|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.analysis.TokenStream org.apache.lucene.analysis.TokenFilter gpl.pierrick.brihaye.aramorph.lucene.ArabicStemmer
A stemmer that will return the possible stems for arabic tokens.
Field Summary | |
protected boolean |
debug
Whether or not the analyzer should output debug messages |
protected boolean |
outputBuckwalter
Whether or not the analyzer should output tokens in the Buckwalter transliteration system |
Fields inherited from class org.apache.lucene.analysis.TokenFilter |
input |
Constructor Summary | |
ArabicStemmer(org.apache.lucene.analysis.TokenStream input)
Constructs a stemmer that will return the possible stems for arabic tokens in the Buckwalter transliteration system. |
|
ArabicStemmer(org.apache.lucene.analysis.TokenStream input,
boolean debug)
Constructs a stemmer that will return the possible stems for arabic tokens in the Buckwalter transliteration system. |
|
ArabicStemmer(org.apache.lucene.analysis.TokenStream input,
boolean debug,
boolean outputBuckwalter)
Constructs a stemmer that will return the possible stems for arabic tokens. |
Method Summary | |
AraMorph |
getAramorph()
Returns the arabic stemmer in use. |
org.apache.lucene.analysis.Token |
next()
Returns the next token in the stream, or null at EOS. |
Methods inherited from class org.apache.lucene.analysis.TokenFilter |
close |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
protected boolean debug
protected boolean outputBuckwalter
Constructor Detail |
public ArabicStemmer(org.apache.lucene.analysis.TokenStream input)
input
- The token stream from a tokenizerpublic ArabicStemmer(org.apache.lucene.analysis.TokenStream input, boolean debug)
input
- The readerdebug
- Whether or not the stemmer should display convenience messages on System.out
public ArabicStemmer(org.apache.lucene.analysis.TokenStream input, boolean debug, boolean outputBuckwalter)
input
- The readerdebug
- Whether or not the stemmer should display convenience messages on System.out
outputBuckwalter
- Whether or not the analyzer should output tokens in the Buckwalter transliteration systemMethod Detail |
public AraMorph getAramorph()
AraMorph
public final org.apache.lucene.analysis.Token next() throws java.io.IOException
null
at EOS.
type
set to the morphological identification of the
stem. Tokens with no grammatical identification have their type
set to
NO_RESULT
. Token's termText is the romanized form of the
stem
java.io.IOException
- If a problem occursToken.type()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |