A system and process for automated text analysis which can be used to identify phrases in reports such as medical reports includes identifying a phrase contained within a text, extracting the phrase from the text, determining a value of the phrase and, in response to the phrase having at least a threshold value, reducing the phrase to a root meaning. In one embodiment, the value of the phrase is assigned via lexicon-based hierarchical decision trees.
展开▼