A system, method, and computer-readable medium for performing ambiguous utterances identification operations by extrapolating statements of the utterance using machine learning based linguistic analysis. More specifically, in certain embodiments, the ambiguous utterances identification operations are performed by generating an ambiguous utterance repository that is indexed by and contains individuals, regions, tweets, blogs, and latest trends. This ambiguous utterance repository is then linked to a lexicon engine that stores linguistic semantics for particular demographics. The ambiguous utterances identification operations also can capture the latest trends in ambiguous utterances occurring happening in certain demographics.
展开▼