public class SuggestStopFilterFactory extends TokenFilterFactory implements ResourceLoaderAware
SuggestStopFilter.
<fieldType name="autosuggest" class="solr.TextField"
positionIncrementGap="100" autoGeneratePhraseQueries="true">
<analyzer>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SuggestStopFilterFactory" ignoreCase="true"
words="stopwords.txt" format="wordset"/>
</analyzer>
</fieldType>
All attributes are optional:
ignoreCase defaults to falsewords should be the name of a stopwords file to parse, if not
specified the factory will use EnglishAnalyzer.ENGLISH_STOP_WORDS_SET
format defines how the words file will be parsed,
and defaults to wordset. If words is not specified,
then format must not be specified.
The valid values for the format option are:
wordset - This is the default format, which supports one word per
line (including any intra-word whitespace) and allows whole line comments
beginning with the "#" character. Blank lines are ignored. See
WordlistLoader.getLines for details.
snowball - This format allows for multiple words specified on each
line, and trailing comments may be specified using the vertical line ("|").
Blank lines are ignored. See
WordlistLoader.getSnowballWordSet
for details.
| Modifier and Type | Field and Description |
|---|---|
static String |
FORMAT_SNOWBALL
multiple words may be specified on each line, trailing comments start with "|"
|
static String |
FORMAT_WORDSET
the default format, one word per line, whole line comments start with "#"
|
static String |
NAME
SPI name
|
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion| Constructor and Description |
|---|
SuggestStopFilterFactory(Map<String,String> args)
Creates a new StopFilterFactory
|
| Modifier and Type | Method and Description |
|---|---|
TokenStream |
create(TokenStream input) |
CharArraySet |
getStopWords()
Returns the configured stopword set
|
void |
inform(ResourceLoader loader) |
boolean |
isIgnoreCase()
Whether or not to ignore case
|
availableTokenFilters, findSPIName, forName, lookupClass, normalize, reloadTokenFiltersget, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNamespublic static final String NAME
public static final String FORMAT_WORDSET
public static final String FORMAT_SNOWBALL
public void inform(ResourceLoader loader) throws IOException
inform in interface ResourceLoaderAwareIOExceptionpublic boolean isIgnoreCase()
public CharArraySet getStopWords()
public TokenStream create(TokenStream input)
create in class TokenFilterFactoryCopyright © 2000-2024 Apache Software Foundation. All Rights Reserved.