|
|
|
BETAWRAP: Successful prediction of parallel Beta-helices from primary sequence reveals an association with many microbial pathogens
|
|
Philip Bradley, Lenore Cowen, Matthew Menke, Jonathan King, and Bonnie Berger
|
|
|
|
The amino acid sequence rules that specify -sheet structure in
proteins remain obscure. A subclass of -sheet proteins,
parallel -helices, represent a processive folding of the chain into
an elongated topologically simpler fold than globular -sheets. In this paper, we present a computational approach that predicts the
right-handed parallel -helix supersecondary structural motif in
primary amino acid sequences by using -strand interactions learned
from non- -helix structures. A program called
BETAWRAP (http://theory.lcs.mit.edu/betawrap) implements this
method and recognizes each of the seven known parallel -helix
families, when trained on the known parallel -helices from outside
that family. BETAWRAP identifies 2,448 sequences
among 595,890 screened from the National Center for Biotechnology
Information (NCBI; http://www.ncbi.nlm.nih.gov/) nonredundant
protein database as likely parallel -helices. It identifies
surprisingly many bacterial and fungal protein sequences that play a
role in human infectious disease; these include toxins, virulence
factors, adhesins, and surface proteins of Chlamydia,
Helicobacteria, Bordetella,
Leishmania, Borrelia,
Rickettsia, Neisseria, and
Bacillus anthracis. Also unexpected was the rarity of
the parallel -helix fold and its predicted sequences among higher
eukaryotes. The computational method introduced here can be called a
three-dimensional dynamic profile method because it generates
interstrand pairwise correlations from a processive sequence wrap. Such
methods may be applicable to recognizing other beta structures for
which strand topology and profiles of residue accessibility are well conserved.
|
|
http://www.pnas.org/cgi/content/abstract/98/26/14819
|
|
|
|