Skip to main navigation Skip to search Skip to main content

kDeepBind: Prediction of RNA-Proteins binding sites using convolution neural network and k-gram features

  • Muhammad Tahir
  • , Hilal Tayara
  • , Maqsood Hayat*
  • , Kil To Chong*
  • *Corresponding author for this work
  • Abdul Wali Khan University Mardan
  • Jeonbuk National University

Research output: Contribution to journalJournal articlepeer-review

Abstract

RNA-binding proteins (RBPs) have a central role in different biological processes like gene regulation, containing transcription and alternative splicing, and provide necessary useful information for patient care. Therefore, identifying binding sites of the RBPs on RNA is the main research direction to understand the procedure of several biological processes. Through biochemical experiments, the RBPs binding sites identification has produced good results but these biochemical processes and exploratory methods consume time and money. Thus, it is important to propose an efficient computational model for the identification of RBPs binding sites. In this study, we propose an intelligent prediction model for RBPs binding sites using a deep learning approach. The proposed prediction model is called kDeepBind. We first apply the one-hot representation for the input sequence to permit successive convolution layers. To declare the binding hidden information from the recognized sequences, the convolution neural network (CNN) model is applied to automatically learn the abstract features. Then, we apply the k-Gram feature extraction technique and concatenate with CNN features. We evaluate the proposed predictor kDeepBind on verified RBP binding sites that are obtained from CLIP-seq datasets. Evaluation results show that the proposed prediction model obtains better performance in the term of ROC-AUC than comparative methods. It is thus highly expected that kDeepBind model will become a useful tool for academic research on RBPs binding sites prediction of RNA as well as in drug discovery. All training and testing data, learned motifs, and trained models can be downloaded at http://nsclbio.jbnu.ac.kr/tools/kDeepBind/

Original languageEnglish
Article number104217
JournalChemometrics and Intelligent Laboratory Systems
Volume208
DOIs
StatePublished - 2021.01.15

Keywords

  • Convolution neural network
  • k-Gram
  • RNA-Binding proteins
  • Sequence motifs

Quacquarelli Symonds(QS) Subject Topics

  • Computer Science & Information Systems
  • Engineering - Petroleum
  • Data Science
  • Engineering - Chemical
  • Chemistry

Fingerprint

Dive into the research topics of 'kDeepBind: Prediction of RNA-Proteins binding sites using convolution neural network and k-gram features'. Together they form a unique fingerprint.

Cite this