Methods for text analysis of medical study data to extract predictive data. Natural language processing is performed on a document in a collection of documents to determine whether the document contains medical model data. In response to determining that the document contains medical model data, content relating to the medical model data in the document is annotated. A first medical model is generated based on the annotations for the identified medical model data and a certainty threshold In response to the certainty threshold meeting a user setting, the first medical model is added to a predictive model for determining a risk score, based on the analyzed data.
展开▼