Why Text is Rich to Mine
Organizations hold large volumes of online documents that contain important information
Examples:
- Electronic mail from customers, containing feedback about products and services
- Intranet documents such as memos and presentations embodying corporate expertise
- Technical reports describing new technology
- News wires with information about business environments and the activities of competitors
Forrester Research predicted that unstructured data (e.g., text) would come to dominate online data