|
CONTENT ANALYST'S PATENTED LATENT SEMANTIC INDEXING (LSI) TECHNOLOGY
Latent Semantic Indexing (LSI) is the patented machine-learning technique that enables CAAT technology to identify, represent, and compare concepts existing within a collection of documents or data.
LSI is a mathematics-based, text analytics technology. It is designed to extract every contextual relation among every term in every text object within a collection. It then generates a vector space representation of all terms based on those relations. Within that space, proximity is a good indicator of conceptual similarity. The result: similarities can be identified based on concepts within the material. Because LSI is mathematics-based, there are no word lists, taxonomies, or thesauruses required for CAAT to accurately identify conceptually similar text.
The Content Analyst team has evolved the original concept of LSI to create major extensions and refinements to the technology and the company now holds a number of patents related to text analytics.
Refer to our White Papers Section to read more about LSI.
|