iProm-phage: A two-layer model to identify phage promoters and their types using a convolutional neural network

  • Muhammad Shujaat
  • , Joe Sung Jin
  • , Hilal Tayara*
  • , Kil To Chong*
  • *Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

The increased interest in phages as antibacterial agents has resulted in a rise in the number of sequenced phage genomes, necessitating the development of user-friendly bioinformatics tools for genome annotation. A promoter is a DNA sequence that is used in the annotation of phage genomes. In this study we proposed a two layer model called “iProm-phage” for the prediction and classification of phage promoters. Model first layer identify query sequence as promoter or non-promoter and if the query sequence is predicted as promoter then model second layer classify it as phage or host promoter. Furthermore, rather than using non-coding regions of the genome as a negative set, we created a more challenging negative dataset using promoter sequences. The presented approach improves discrimination while decreasing the frequency of erroneous positive predictions. For feature selection, we investigated 10 distinct feature encoding approaches and utilized them with several machine-learning algorithms and a 1-D convolutional neural network model. We discovered that the one-hot encoding approach and the CNN model outperformed based on performance metrics. Based on the results of the 5-fold cross validation, the proposed predictor has a high potential. Furthermore, to make it easier for other experimental scientists to obtain the results they require, we set up a freely accessible and user-friendly web server at http://nsclbio.jbnu.ac.kr/tools/iProm-phage/.

Original languageEnglish
Article number1061122
JournalFrontiers in Microbiology
Volume13
DOIs
StatePublished - 2022.11.4

Keywords

  • DNA promoters
  • bioinformatics
  • computational biology
  • convolutional neural networks
  • phages

Quacquarelli Symonds(QS) Subject Topics

  • Medicine
  • Biological Sciences

Fingerprint

Dive into the research topics of 'iProm-phage: A two-layer model to identify phage promoters and their types using a convolutional neural network'. Together they form a unique fingerprint.

Cite this