Abstract
Motivation Promoter is a short region of DNA which is responsible for initiating transcription of specific genes. Development of computational tools for automatic identification of promoters is in high demand. According to the difference of functions, promoters can be of different types. Promoters may have both intra and inter class variation and similarity in terms of consensus sequences. Accurate classification of various types of sigma promoters still remains a challenge.
Results We present iPromoter-BnCNN for identification and accurate classification of six types of promoters - σ24, σ28, σ32, σ38, σ54, σ70. It is a CNN based classifier which combines local features related to monomer nucleotide sequence, trimer nucleotide sequence, dimer structural properties and trimer structural properties through the use of parallel branching. We conducted experiments on a benchmark dataset and compared with six state-of-the-art tools to show our supremacy on 5-fold cross-validation. Moreover, we tested our classifier on an independent test dataset.
Availability Our proposed tool iPromoter-BnCNN web server is freely available at http://103.109.52.8/iPromoter-BnCNN. The runnable source code can be found here.
Contact rafeed{at}cse.uiu.ac.bd
Supplementary information Supplementary data (benchmark dataset, independent test dataset, model files, structural property information, attention mechanism details and web server usage) are available at Bioinformatics. online.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
↵* Swakkhar Shatabda