Genome-Wide Association Studies (GWAS) have identified genetic variants for thousands of diseases and traits. In this study, we evaluated the relationships between specific risk factors (for example, blood cholesterol level) and diseases on the basis of their shared genetic architecture in a comprehensive human disease-SNP association database (VARIMED), analyzing the findings from 8,962 published association studies. Similarity between traits and diseases was statistically evaluated based on their association with shared gene variants. We identified 120 disease-trait pairs that were statistically similar, and of these we tested and validated five previously unknown disease-trait associations by searching electronic medical records (EMR) from 3 independent medical centers for evidence of the trait appearing in patients within one year of first diagnosis of the disease. We validated that mean corpuscular volume is elevated before diagnosis of acute lymphoblastic leukemia; both have associated variants in the gene IKZF1. Platelet count is decreased before diagnosis of alcohol dependence; both are associated with variants in the gene C12orf51. Alkaline phosphatase level is elevated in patients with venous thromboembolism; both share variants in ABO. Similarly, we found prostate specific antigen and serum magnesium levels were altered before the diagnosis of lung cancer and gastric cancer, respectively. Disease-trait associations identifies traits that can potentially serve a prognostic function clinically; validating disease-trait associations through EMR can whether these candidates are risk factors for complex diseases.
展开▼