public final class ICUFoldingFilter extends ICUNormalizer2Filter
This filter applies foldings from the report UTR#30 (Character Foldings) to Unicode text.
Additionally, Default Ignorables are removed, and text is normalized to NFKC. All foldings, case folding, and normalization mappings are applied recursively to ensure a fully folded and normalized result.
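The "applied recursively" behavior can be read as running a folding pass until the text stops changing. The sketch below illustrates that idea with the JDK only. It is not the filter's implementation and does not use ICU or the UTR#30 data: `FoldingSketch`, `foldOnce`, and `fold` are hypothetical names, lowercasing stands in for real case folding, and only a handful of Default Ignorable code points are stripped.

```java
import java.text.Normalizer;
import java.util.Locale;

// Hypothetical JDK-only illustration; the real filter relies on ICU's Normalizer2.
final class FoldingSketch {
    private FoldingSketch() {}

    /** One pass: strip a few ignorables, normalize to NFKC, then lowercase. */
    static String foldOnce(String s) {
        // Assumption: only U+00AD and U+200B..U+200D are removed here,
        // a tiny subset of the Default Ignorable code points.
        String stripped = s.replaceAll("[\\u00AD\\u200B-\\u200D]", "");
        String nfkc = Normalizer.normalize(stripped, Normalizer.Form.NFKC);
        // Lowercasing is a rough stand-in for Unicode case folding.
        return nfkc.toLowerCase(Locale.ROOT);
    }

    /** Repeats foldOnce until the result no longer changes (a fixed point). */
    static String fold(String s) {
        String current = s;
        while (true) {
            String next = foldOnce(current);
            if (next.equals(current)) {
                return current;
            }
            current = next;
        }
    }
}
```

With this sketch, fullwidth letters and compatibility ligatures fold to plain lowercase ASCII, and folding an already folded string returns it unchanged.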
A normalizer with additional settings, such as a filter that lists characters not to be normalized, can be passed to the constructor.
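To make the idea of a filter that exempts certain characters concrete, here is a minimal JDK-only sketch. In ICU4J this role is played by wrapping a `Normalizer2` in a `FilteredNormalizer2` with a `UnicodeSet`; the code below does not call ICU or Lucene. `FilteredFoldingSketch` and its `keep` parameter are hypothetical, and NFKC plus lowercasing is only a stand-in for the real folding.

```java
import java.text.Normalizer;
import java.util.Locale;
import java.util.Set;

// Hypothetical JDK-only illustration of "characters not to be normalized".
final class FilteredFoldingSketch {
    private FilteredFoldingSketch() {}

    /** Folds the input, copying any code point listed in keep through unchanged. */
    static String fold(String input, Set<Integer> keep) {
        StringBuilder out = new StringBuilder();
        StringBuilder run = new StringBuilder();
        input.codePoints().forEach(cp -> {
            if (keep.contains(cp)) {
                // Flush the pending run of foldable text, then copy the
                // protected code point as-is.
                out.append(foldRun(run));
                run.setLength(0);
                out.appendCodePoint(cp);
            } else {
                run.appendCodePoint(cp);
            }
        });
        out.append(foldRun(run));
        return out.toString();
    }

    // Stand-in folding: NFKC normalization followed by lowercasing.
    private static String foldRun(CharSequence run) {
        return Normalizer.normalize(run, Normalizer.Form.NFKC).toLowerCase(Locale.ROOT);
    }
}
```

For example, calling `FilteredFoldingSketch.fold(text, Set.of(0x00C5))` leaves every `Å` untouched while the rest of the text is still normalized and lowercased.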
Nested classes inherited from AttributeSource: AttributeSource.State

| Modifier and Type | Field and Description |
|---|---|
| static com.ibm.icu.text.Normalizer2 | NORMALIZER: A normalizer for search term folding to Unicode text, applying foldings from UTR#30 Character Foldings. |
Fields inherited from TokenFilter: input
Fields inherited from TokenStream: DEFAULT_TOKEN_ATTRIBUTE_FACTORY

| Constructor and Description |
|---|
| ICUFoldingFilter(TokenStream input): Create a new ICUFoldingFilter on the specified input. |
| ICUFoldingFilter(TokenStream input, com.ibm.icu.text.Normalizer2 normalizer): Create a new ICUFoldingFilter on the specified input with the specified normalizer. |
Methods inherited from ICUNormalizer2Filter: incrementToken
Methods inherited from TokenFilter: close, end, reset
Methods inherited from AttributeSource: addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString

public static final com.ibm.icu.text.Normalizer2 NORMALIZER
public ICUFoldingFilter(TokenStream input)
public ICUFoldingFilter(TokenStream input, com.ibm.icu.text.Normalizer2 normalizer)
Copyright © 2000-2024 Apache Software Foundation. All Rights Reserved.