The computational detection of functional nucleotide sequence motifs in the coding regions of organisms.

Publication Type:

Journal Article


Experimental biology and medicine (Maywood, N.J.), Volume 233, Issue 6, p.665-73 (2008)


2008, 258, Algorithms, Amino Acid Motifs, Base Sequence, Codon, Computational Biology, Entropy, Escherichia coli, Evolution, Molecular, Genome, Bacterial, GENOMICS, Oligonucleotides, Open Reading Frames, Phylogeny, Proteome, PROTEOMICS


A new algorithm has been constructed for finding under- and overrepresented oligonucleotide motifs in the protein coding regions of genomes that have been normalized for G/C content, codon usage, and amino acid order. This Robins-Krasnitz algorithm has been employed to compare the oligonucleotide frequencies between many different prokaryotic genomes. Evidence is presented demonstrating that at least some of these sequence motifs are functionally important and selected for or against during the evolution of these prokaryotes. The applications of this method include the optimization of protein expression for synthetic genes in foreign organisms, identification of novel oligonucleotide signals used by the organism and the examination of evolutionary relationships not dependent upon different gene sequence trees.