CoDP: predicting the impact of unclassified genetic variants in MSH6 by the combination of different properties of the protein

Hiroko Terui; Kiwamu Akagi; Hiroshi Kawame; Kei Yura
June 2013
Journal of Biomedical Science;2013, Vol. 20 Issue 1, p1
Academic Journal
Background: Lynch syndrome is a hereditary cancer predisposition syndrome caused by a mutation in one of the DNA mismatch repair (MMR) genes. About 24% of the mutations identified in Lynch syndrome are missense substitutions, and the frequency of missense variants in MSH6 is the highest amongst these MMR genes. Because of this high frequency, genetic testing has so far not been used effectively for MSH6. We therefore developed CoDP (Combination of the Different Properties), a bioinformatics tool to predict the impact of missense variants in MSH6.

Methods: We integrated the prediction results of three methods, namely MAPP, PolyPhen-2 and SIFT. Two other structural properties, namely solvent accessibility and the change in the number of heavy atoms of amino acids in the MSH6 protein, were further combined explicitly. MSH6 germline missense variants classified by their associated clinical and molecular data were used to fit the parameters for the logistic regression model and to assess the prediction. The performance of CoDP was compared with that of other conventional tools, namely MAPP, SIFT, PolyPhen-2 and PON-MMR.

Results: A total of 294 germline missense variants were collected from the variant databases and literature. Of them, 34 variants were available for the parameter training and the prediction performance test. The values of the positive predictive value (PPV), the negative predictive value (NPV), sensitivity, specificity and accuracy of the tools were compared on the whole data set. PPV of CoDP was 93.3% (14/15), NPV was 94.7% (18/19), specificity was 94.7% (18/19), sensitivity was 93.3% (14/15) and accuracy was 94.1% (32/34). The area under the curve of CoDP was 0.954, that of MAPP for MSH6 was 0.919, that of SIFT was 0.864 and that of PolyPhen-2 HumVar was 0.819. The power of these methods to distinguish between pathogenic and non-pathogenic variants was tested by the Wilcoxon rank sum test (p < 8.9 × 10⁻⁶ for CoDP, p < 3.3 × 10⁻⁵ for MAPP, p < 3.1 × 10⁻⁴ for SIFT and p < 1.2 × 10⁻³ for PolyPhen-2 HumVar), and CoDP was shown to outperform the other conventional methods.

Conclusion: In this paper, we provide a human-curated data set for MSH6 missense variants, and CoDP, the prediction tool, which achieved better accuracy for predicting the impact of missense variants in MSH6 than any other known tool. CoDP is available at http://cib.cf.ocha.ac.jp/CoDP/.
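
The five summary statistics reported for CoDP can be re-derived from a single confusion matrix. The sketch below is only an arithmetic check, not part of CoDP. The abstract does not state the four cell counts directly: 14 true positives, 1 false positive, 18 true negatives and 1 false negative are inferred here from the reported fractions, under the assumption that "positive" means a variant predicted to be pathogenic. The function name `confusion_metrics` is hypothetical.

```python
def confusion_metrics(tp, fp, tn, fn):
    """Return each statistic as a (numerator, denominator) pair."""
    return {
        "ppv": (tp, tp + fp),                        # correct among predicted positives
        "npv": (tn, tn + fn),                        # correct among predicted negatives
        "sensitivity": (tp, tp + fn),                # detected among actual positives
        "specificity": (tn, tn + fp),                # detected among actual negatives
        "accuracy": (tp + tn, tp + fp + tn + fn),    # correct among all variants
    }


# Cell counts inferred from the fractions in the abstract (an assumption).
metrics = confusion_metrics(tp=14, fp=1, tn=18, fn=1)

for name, (num, den) in metrics.items():
    print(f"{name}: {num}/{den} = {num / den:.1%}")
```

With these inferred counts the five fractions match those in the abstract (14/15, 18/19, 14/15, 18/19 and 32/34), which suggests that the reported values are mutually consistent with one false positive and one false negative among the 34 variants.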


Related Articles

  • Genetic variants in the 8q24 locus and risk of testicular germ cell tumors. Cook, Michael; Graubard, Barry; Quraishi, Sabah; Yeager, Meredith; Chanock, Stephen; Crenshaw, Andrew; Erickson, Ralph; Rubertone, Mark; Thomas, Gilles; McGlynn, Katherine // Human Genetics;May2008, Vol. 123 Issue 4, p409 

    Much evidence supports the premise that population genetic variation contributes significantly to the risk of testicular germ-cell tumor (TGCT). However, investigations of the association between genomic markers and TGCT susceptibility are scarce. Single nucleotide polymorphisms (SNPs) at the...

  • Germline mutation and protein expression analysis of mismatch repair genes MSH6 and PMS2 in Malaysian Lynch syndrome patients. Zahary, Mohd; Kaur, Gurjeet; Hassan, Muhammad; Sidek, Ahmad; Singh, Harjinder; Yeh, Lee; Ankathil, Ravindran // International Journal of Colorectal Disease;Feb2014, Vol. 29 Issue 2, p261 

    The article discusses a study which examines the spectrum of germline mutation and protein expression of DNA mismatch repair (MMR) genes MSH6 and PMS2 in Malaysian patients with Lynch syndrome. Results revealed that five Lynch syndrome patients were found to harbor germline mutations with a...

  • MLH1 promoter hypermethylation in the analytical algorithm of Lynch syndrome: a cost-effectiveness study. Gausachs, Mireia; Mur, Pilar; Corral, Julieta; Pineda, Marta; González, Sara; Benito, Llúcia; Menéndez, Mireia; Espinàs, Josep Alfons; Brunet, Joan; Iniesta, María Dolores; Gruber, Stephen B; Lázaro, Conxi; Blanco, Ignacio; Capellá, Gabriel // European Journal of Human Genetics;Jul2012, Vol. 20 Issue 7, p762 

    The analytical algorithm of Lynch syndrome (LS) is increasingly complex. BRAF V600E mutation and MLH1 promoter hypermethylation have been proposed as a screening tool for the identification of LS. The aim of this study was to assess the clinical usefulness and cost-effectiveness of both somatic...

  • Loss of DNA Mismatch Repair Imparts a Selective Advantage in Planarian Adult Stem Cells. Hollenbach, Jessica P.; Resch, Alissa M.; Palakodeti, Dasaradhi; Graveley, Brenton R.; Heinen, Christopher D. // PLoS ONE;2011, Vol. 6 Issue 7, p1 

    Lynch syndrome (LS) leads to an increased risk of early-onset colorectal and other types of cancer and is caused by germline mutations in DNA mismatch repair (MMR) genes. Loss of MMR function results in a mutator phenotype that likely underlies its role in tumorigenesis. However, loss of MMR...

  • Breast carcinoma and Lynch syndrome: molecular analysis of tumors arising in mutation carriers, non-carriers, and sporadic cases. Lotsari, Johanna E.; Gylling, Annette; Abdel-Rahman, Wael M.; Nieminen, Taina T.; Aittomäki, Kristiina; Friman, Marjukka; Pitkänen, Reino; Aarnio, Markku; Järvinen, Heikki J.; Mecklin, Jukka-Pekka; Kuopio, Teijo; Peltomäki, Päivi // Breast Cancer Research;2012, Vol. 14 Issue 3, p1 

    Introduction: Breast carcinoma is the most common cancer in women, but its incidence is not increased in Lynch syndrome (LS) and studies on DNA mismatch repair deficiency (MMR) in LS-associated breast cancers have arrived at conflicting results. This study aimed to settle the question as to...

  • Lynch syndrome in Tunisia: first description of clinical features and germline mutations. Moussa, Sana; Moussa, Amel; Kourda, Nadia; Mezlini, Amel; Abdelli, Nabil; Zerimech, Farid; Najjar, Taoufik; Jilani, Sarah; Porchet, Nicole; Ayed, Farhat; Manai, Mohamed; Buisine, Marie-Pierre // International Journal of Colorectal Disease;Apr2011, Vol. 26 Issue 4, p455 

    Purpose: High rates of early colorectal cancers (CRC) are observed in Tunisia suggesting genetic susceptibility. Nevertheless, up to now, no molecular study has been performed in the Tunisian population. In our research, we evaluated the clinical characteristics of Tunisian families suspected of...

  • Clinical utility gene card for: Lynch syndrome (MLH1, MSH2, MSH6, PMS2, EPCAM) - update 2012. Rahner, Nils; Steinke, Verena; Schlegelberger, Brigitte; Eisinger, Francois; Hutter, Pierre; Olschwang, Sylviane // European Journal of Human Genetics;Jan2013, Vol. 21 Issue 1, p00 

    The article presents a clinical utility gene card for Lynch syndrome II, in which a pathogenic germ cell mutation will need prophylactic hysterectomy and bilateral ovariectomy. It notes that the gene defect can be traced from the pattern of staining in immunohistochemistry. It mentions that age...

  • Screening of the DNA mismatch repair genes MLH1, MSH2 and MSH6 in a Greek cohort of Lynch syndrome suspected families. Thodi, Georgia; Fostira, Florentia; Sandaltzopoulos, Raphael; Nasioulas, George; Grivas, Anastasios; Boukovinas, Ioannis; Mylonaki, Maria; Panopoulos, Christos; Magic, Mirjana Brankovic; Fountzilas, George; Yannoukakos, Drakoulis // BMC Cancer;2010, Vol. 10, p544 

    Background: Germline mutations in the DNA mismatch repair genes predispose to Lynch syndrome, thus conferring a high relative risk of colorectal and endometrial cancer. The MLH1, MSH2 and MSH6 mutational spectrum reported so far involves minor alterations scattered throughout their coding...

  • Performance of random forests and logic regression methods using mini-exome sequence data. Yoonhee Kim; Qing Li; Cropp, Cheryl D.; Heejong Sung; Juanliang Cai; Simpson, Claire L.; Perry, Brian; Dasgupta, Abhijit; Malley, James D.; Wilson, Alexander F.; Bailey-Wilson, Joan E. // BMC Proceedings;2011 Supplement 9, Vol. 5 Issue Suppl 9, p1 

    Machine learning approaches are an attractive option for analyzing large-scale data to detect genetic variants that contribute to variation of a quantitative trait, without requiring specific distributional assumptions. We evaluate two machine learning methods, random forests and logic...

