Identifying Enhancers and Their Strength by the Integration of Word Embedding and Convolution Neural Network

  • Jhabindra Khanal
  • , Hilal Tayara*
  • , Kil To Chong
  • *Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

The enhancer is a short regulatory element that plays a major role in up-regulating eukaryotic gene expression. To identify enhancers, an experimental process takes a long time and high cost; therefore, an accurate computational tool is a much-needed work in this area. Existing techniques were developed by the use of handcrafted features followed by machine learning techniques, while the proposed model extracts the features of enhancers from raw DNA sequences by the integration of natural language processing (NLP) technique using word2vec and convolutional neural network (CNN). Therefore, an accurate computational tool, iEnhancer-CNN, is developed. The developed tool can predict enhancers and their strength. The evaluation results show that iEnhancer-CNN is remarkably superior to the existing state-of-the-art models. In more detail, iEnhancer-CNN improved the accuracy of enhancer and enhancer strength identification by 2.6% and 11.4%, respectively. A web server for the iEnhancer-CNN is freely available at https://home.jbnu.ac.kr/NSCL/iEnhancer-CNN.htm.

Original languageEnglish
Article number9044822
Pages (from-to)58369-58376
Number of pages8
JournalIEEE Access
Volume8
DOIs
StatePublished - 2020

Keywords

  • Convolutional neural network
  • deep learning
  • DNA sequence
  • enhancers
  • K-mers
  • word2vec

Quacquarelli Symonds(QS) Subject Topics

  • Materials Science
  • Computer Science & Information Systems

Fingerprint

Dive into the research topics of 'Identifying Enhancers and Their Strength by the Integration of Word Embedding and Convolution Neural Network'. Together they form a unique fingerprint.

Cite this