Promoters are DNA sequences containing the regulatory elements required to guide and modulate transcription initiation of a gene. Predicting promoters in genomic sequences is an important task in genome annotation and in understanding transcriptional regulation. Over the past decade, many methods using a variety of feature extraction schemes have been proposed for predicting eukaryotic and prokaryotic promoters, yet there remains a clear need for faster and more accurate methods. In this paper we employ the extreme learning machine (ELM) algorithm for promoter prediction in DNA sequences of H. sapiens, D. melanogaster, A. thaliana, C. elegans and E. coli. We extracted dinucleotide and CpG island features and achieved accuracy above 90% for all five species. Performance is compared with the feed-forward back-propagation (BP) algorithm and support vector machines (SVM), and the results establish the viability of the presented approach.
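As an illustration of the kind of features mentioned above, the following minimal sketch computes dinucleotide frequencies and two commonly used CpG island descriptors (GC content and the observed/expected CpG ratio) for a DNA sequence. The abstract does not specify the exact feature definitions, so the function names, the 18-value feature layout and the choice of CpG descriptors are assumptions made for illustration, not the authors' implementation.

```python
from itertools import product

BASES = "ACGT"
# All 16 dinucleotides in a fixed order: AA, AC, AG, AT, CA, ..., TT
DINUCLEOTIDES = ["".join(p) for p in product(BASES, repeat=2)]


def dinucleotide_frequencies(seq):
    """Return the relative frequency of each of the 16 dinucleotides.

    Overlapping pairs are counted; pairs containing symbols other than
    A, C, G, T (for example N) are skipped.
    """
    seq = seq.upper()
    counts = dict.fromkeys(DINUCLEOTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DINUCLEOTIDES)
    return [counts[d] / total for d in DINUCLEOTIDES]


def cpg_features(seq):
    """Return [GC content, observed/expected CpG ratio] (assumed descriptors).

    The observed/expected ratio is (#CG * length) / (#C * #G).
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return [0.0, 0.0]
    c = seq.count("C")
    g = seq.count("G")
    cpg = seq.count("CG")
    gc_content = (c + g) / n
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return [gc_content, obs_exp]


def feature_vector(seq):
    """Concatenate the 16 dinucleotide frequencies and the 2 CpG descriptors."""
    return dinucleotide_frequencies(seq) + cpg_features(seq)
```

A vector produced by the hypothetical `feature_vector` could then be passed to any classifier, such as an ELM, a BP network or an SVM; the classifier itself is outside the scope of this sketch.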