|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
ObjectStdTermFilter
public class StdTermFilter
Performs standard tokenization activities for terms, such as mapping to lowercase, removing apostrophes, etc.
Nested Class Summary | |
---|---|
private class |
StdTermFilter.DribbleStream
|
Field Summary | |
---|---|
private StdTermFilter.DribbleStream |
dribble
|
private TokenStream |
filter
|
private static String |
SAVE_WILD_QMARK
During tokenization, the '?' |
private static String |
SAVE_WILD_STAR
During tokenization, the '*' wildcard has to be changed to a word to keep it from being removed. |
Constructor Summary | |
---|---|
StdTermFilter()
Construct the rewriter |
Method Summary | |
---|---|
String |
filter(String term)
Apply the standard mapping to the given term. |
protected static String |
restoreWildcards(String s)
Restores wildcards saved by saveWildcards(String) . |
protected static String |
saveWildcards(String s)
Converts wildcard characters into word-looking bits that would never occur in real text, so the standard tokenizer will keep them part of words. |
Methods inherited from class Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
private StdTermFilter.DribbleStream dribble
private TokenStream filter
private static final String SAVE_WILD_STAR
private static final String SAVE_WILD_QMARK
Constructor Detail |
---|
public StdTermFilter()
Method Detail |
---|
public String filter(String term)
protected static String saveWildcards(String s)
restoreWildcards(String)
.
protected static String restoreWildcards(String s)
saveWildcards(String)
.
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |