Package smile.nlp.normalizer
Interface Normalizer
All Known Implementing Classes:
SimpleNormalizer
public interface Normalizer
Normalization transforms text into a canonical form by removing unwanted
variations. Normalization may range from light textual cleanup such as
compressing whitespace to more aggressive and knowledge-intensive forms
like standardizing date formats or expanding abbreviations. The nature and
extent of normalization, as well as whether it is best applied at the
document, sentence, or token level, must be determined in the context
of a specific application.
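As a rough illustration of the light end of that range, the sketch below compresses whitespace and applies the same normalizer at both the document and the token level. It does not use the library: `TextNormalizer`, `WhitespaceNormalizer`, and `normalizeTokens` are hypothetical names introduced only for this example, and the single-method shape (a string in, a normalized string out) is an assumption based on the method description on this page. It is not the behavior of SimpleNormalizer.

```java
import java.util.ArrayList;
import java.util.List;

public class NormalizerSketch {
    /** Hypothetical stand-in for a normalizer: takes text, returns normalized text. */
    interface TextNormalizer {
        String normalize(String text);
    }

    /** Light cleanup: trims the text and collapses each whitespace run into one space. */
    static final class WhitespaceNormalizer implements TextNormalizer {
        @Override
        public String normalize(String text) {
            return text.trim().replaceAll("\\s+", " ");
        }
    }

    /** Token-level application: each token is normalized independently. */
    static List<String> normalizeTokens(TextNormalizer normalizer, List<String> tokens) {
        List<String> result = new ArrayList<>(tokens.size());
        for (String token : tokens) {
            result.add(normalizer.normalize(token));
        }
        return result;
    }

    public static void main(String[] args) {
        TextNormalizer normalizer = new WhitespaceNormalizer();

        // Document-level application: the whole string is normalized at once.
        System.out.println(normalizer.normalize("  Hello \t  world \n"));

        // Token-level application: the same normalizer, applied per token.
        System.out.println(normalizeTokens(normalizer, List.of(" Hello ", "world\n")));
    }
}
```

Keeping the normalizer behind a one-method interface means the choice of level (document, sentence, or token) stays with the caller, which matches the point that the right granularity depends on the application.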
Method Details
normalize
Normalize the given string.
Parameters:
text - the text.
Returns:
the normalized text.