Number of Features
Pick k most discriminating terms
Small Fk preferred
- Will not overfit
- Resulting taxonomy is smaller
Split documents into training set (T) and validation set (V)
- Compute Fisher index of each term based on T
- Classify V using various prefixes Fk
- Nk is number of misclassified documents using Fk
- Minimize Nk