Limitations of Fuzzy Matching of Lexical Variants

Some vendors of text analytics software claim that their products can identify occurrences of text that reflect specific taxonomy terms by using “fuzzy matching” or “fuzzy term matching,” with the strong, and false, implication that they identify all such occurrences. Explanations of the technology from Techopedia and Wikipedia show that it is a fairly crude mathematical approach, similar to the co-occurrence statistical approaches that such software also tends to use, and no match for rule-based indexing approaches that derive their effectiveness from human intelligence.
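
To make the "crude mathematical approach" concrete, here is a minimal sketch of one common form of fuzzy matching: comparing strings by edit distance (Levenshtein distance). It is an illustration only, not a description of any particular vendor's product. The function names, the threshold of 2, and the sample word pairs are all assumptions chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def fuzzy_match(term: str, candidate: str, max_distance: int = 2) -> bool:
    """Hypothetical matcher: accept a candidate whose edit distance
    from the taxonomy term is within max_distance (an assumed threshold)."""
    return levenshtein(term.lower(), candidate.lower()) <= max_distance


# Sample pairs (illustrative assumptions, not drawn from any real taxonomy).
for term, candidate in [
    ("organization", "organisation"),  # spelling variant of the same word
    ("conservation", "conversation"),  # unrelated word with similar letters
    ("mouse", "mice"),                 # irregular plural of the same word
]:
    print(term, candidate, levenshtein(term, candidate), fuzzy_match(term, candidate))
```

With this threshold, the sketch accepts "organisation" for "organization" (distance 1), but it also accepts the unrelated "conversation" for "conservation" (distance 2) and rejects "mice" for "mouse" (distance 3). The score measures only character overlap, so it says nothing about whether two strings are variants of the same term.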

