STATISTICA Text Miner

STATISTICA Text Miner is an optional extension of STATISTICA Data Miner, ideal for translating unstructured text data into meaningful, valuable clusters of decision-making "gold". As most users familiar with data mining already know, real-world data comes in a variety of forms, not always organized or easily ready to analyze. STATISTICA Text Miner digs for the underlying information not readily apparent in traditional structured data.

STATISTICA Text Miner was specifically designed as a general and open-architecture tool for mining unstructured information. The feature extraction/selection and other analytic tools available in STATISTICA Text Miner are not only applicable to text documents or Web pages, but can also be used to index, classify, cluster, or otherwise include in your analyses unstructured information such as (pre-processed) bitmaps, sound files, etc.


How can I use STATISTICA Text Miner?
  • Analyze the contents of Web pages. For example, users can automatically process and summarize all Web pages of particular companies, message boards, etc.
  • Include unstructured notes in predictive data mining projects. For example, users may include responses to open-ended interview questions, patients' own descriptions of medical symptoms, etc. in data mining projects involving the clustering of patients and symptoms.
  • Analyze large document repositories. For example, users may analyze repositories of documents such as narratives of insurance claims, etc., to include such information in fraud detection projects.