Data Preparation
Delete supplementary and common words
“a”, “an”, “the”, “is”
Identify stems of words
Separate / remove prefixes, suffixes, endings
Identify senses
Which meaning of word is used?
Normalize
Replace similar words with common token
Previous slide
Next slide
Back to first slide
View graphic version