Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data

  • Young Sup Lee
  • , Kyeong Hye Won
  • , Donghyun Shin*
  • , Jae Don Oh*
  • *Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Despite the various existing studies about nonsynonymous single nucleotide polymorphisms (nsSNPs), genome-wide studies based on nsSNPs are rare. NsSNPs alter amino acid sequences, affect protein structure and function, and have deleterious effects. By predicting the deleterious effect of nsSNPs, we determined the total risk score per individual. Additionally, the machine learning technique was utilized to find an optimal nsSNP subset that best explains the complete nsSNP effect. A total of 16,100 nsSNPs were selected as the best representatives among 89,519 regressed nsSNPs. In the gene ontology analysis encompassing the 16,100 nsSNPs, DNA metabolic process, chemokine- and immune-related, and reproduction were the most enriched terms. We expect that our risk score prediction and nsSNP marker selection will contribute to future development of extant genome-wide association studies and breeding science more broadly.

Original languageEnglish
Pages (from-to)321-328
Number of pages8
JournalAnimal Cells and Systems
Volume24
Issue number6
DOIs
StatePublished - 2020

Keywords

  • Breeding
  • deleterious effect
  • marker selection
  • nsSNP
  • risk prediction

Quacquarelli Symonds(QS) Subject Topics

  • Agriculture & Forestry
  • Biological Sciences

Fingerprint

Dive into the research topics of 'Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data'. Together they form a unique fingerprint.

Cite this