SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR01295

Identifier
CLOACIN  [View Relations]  [View Alignment]  
Accession
PR01295
No. of Motifs
8
Creation Date
17-FEB-2000
Title
Cloacin signature
Database References
Literature References
1. VAN DEN ELZEN, P.J., WALTERS, H.H., VELTKAMP, E. AND NIJKAMP, H.J.
Molecular structure and function of the bacteriocin gene and bacteriocin
protein of plasmid Clo DF13.
NUCLEIC ACIDS RES. 11 2465-2477 (1983). 

Documentation
Colicins are polypeptide toxins produced by, and active against, Escherichia
coli and closely related bacteria. The bacteriocin cloacin DF13 inactivates
ribosomes by hydrolysing 16S RNA in 30S ribosomes at a specific site. The
protein consists of 561 amino acids and has a molecular weight of ~59kD. 
Sequence analysis reveals the N-terminal third of the cloacin molecule,
which is involved in translocation of the protein across the cell membrane,
to be relatively hydrophobic and rich in glycine [1]. The C-terminal portion
is rich in positively charged amino acids, possibly reflecting the RNase 
activity located within this domain [1]. Sequence comparisons reveal 
similarities with colicin E3 and E6, but not with Col E1, despite striking
similarities in codon usage [1]. 
 
CLOACIN is an 8-element fingerprint that provides a signature for cloacins.
The fingerprint was derived from an initial alignment of 3 sequences: the
motifs were drawn from conserved regions spanning the central portion of the
alignment - motifs 1-4 were drawn from the domain thought to be involved in
translocation of the protein across the cell membrane; and motifs 5-8 reside
within the domain responsible for the receptor binding activity. Two
iterations on SPTR37_10f were required to reach convergence, at which point
a true set comprising 6 sequences was identified.
Summary Information
6 codes involving  8 elements
0 codes involving 7 elements
0 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
866666666
700000000
600000000
500000000
400000000
300000000
200000000
12345678
True Positives
CEA2_ECOLI    CEA3_ECOLI    CEA6_ECOLI    CEA9_ECOLI    
CEAC_ECOLI Q51604
Sequence Titles
CEA2_ECOLI  COLICIN E2 (EC 3.1.21.1) - ESCHERICHIA COLI.  
CEA3_ECOLI COLICIN E3 (EC 3.1.-.-) (COLICIN E3 A CHAIN) (RIBONUCLEASE) - ESCHERICHIA COLI.
CEA6_ECOLI COLICIN E6 (EC 3.1.-.-) (RIBONUCLEASE) - ESCHERICHIA COLI.
CEA9_ECOLI COLICIN E9 (EC 3.1.21.1) - ESCHERICHIA COLI.
CEAC_ECOLI CLOACIN (EC 3.1.-.-) (RIBONUCLEASE) - ESCHERICHIA COLI.
Q51604 COLICIN E7 - ESCHERICHIA COLI.
Scan History
SPTR37_10f 1  30   NSINGLE    
Initial Motifs
Motif 1  width=22
Element Seqn Id St Int Rpt
AAPVAFGFPALSTPGAGGLAVS CEA3_ECOLI 85 85 -
AAPVAFGFPALSTPGAGGLAVS CEA6_ECOLI 85 85 -
ATAMAFGLPALATPGAEGPALS CEAC_ECOLI 94 94 -

Motif 2 width=17
Element Seqn Id St Int Rpt
SAGALSAAIADIMAALK CEA3_ECOLI 108 1 -
SAGALSAAIADIMAALK CEA6_ECOLI 108 1 -
SGDALSSAVADVLAALK CEAC_ECOLI 117 1 -

Motif 3 width=20
Element Seqn Id St Int Rpt
FKFGLWGVALYGVLPSQIAK CEA3_ECOLI 127 2 -
FKFGLWGVALYGVLPSQIAK CEA6_ECOLI 127 2 -
FKFGLWGIAIYGVLPSEIAK CEAC_ECOLI 136 2 -

Motif 4 width=18
Element Seqn Id St Int Rpt
SLPADDITESPVSSLPLD CEA3_ECOLI 158 11 -
SLPADDITESPVSSLPLD CEA6_ECOLI 158 11 -
SLPADTVTETPASTLPLD CEAC_ECOLI 167 11 -

Motif 5 width=20
Element Seqn Id St Int Rpt
RQNISVVSGVPMSVPVVDAK CEA3_ECOLI 193 17 -
RQNISVVSGVPMSVPVVDAK CEA6_ECOLI 193 17 -
RQHIAVVAGRPMSVPVVDAK CEAC_ECOLI 202 17 -

Motif 6 width=23
Element Seqn Id St Int Rpt
GVFTASIPGAPVLNISVNNSTPA CEA3_ECOLI 218 5 -
GVFTASIPGAPVLNISVNNSTPA CEA6_ECOLI 218 5 -
GVFSVSIPGLPALQVSVPKGVPA CEAC_ECOLI 227 5 -

Motif 7 width=23
Element Seqn Id St Int Rpt
RDAVIRFPKDSGHNAVYVSVSDV CEA3_ECOLI 268 27 -
RDAVIRFPKDSGHNAVYVSVSDV CEA6_ECOLI 268 27 -
REAVIRFPKETGQKPVYVSVTDV CEAC_ECOLI 276 26 -

Motif 8 width=20
Element Seqn Id St Int Rpt
WDATHPVEAAERNYERARAE CEA3_ECOLI 310 19 -
WDATHPVEAAERNYERARAE CEA6_ECOLI 310 19 -
WDAAHPEEGLKREYDKAKAE CEAC_ECOLI 318 19 -
Final Motifs
Motif 1  width=22
Element Seqn Id St Int Rpt
AAPVAFGFPALSTPGAGGLAVS CEA3_ECOLI 85 85 -
AAPVAFGFPALSTPGAGGLAVS CEA6_ECOLI 85 85 -
AAPVAFGFPALSTPGAGGLAVS CEA2_ECOLI 85 85 -
ATAMAFGLPALATPGAEGPALS CEAC_ECOLI 94 94 -
AAPVAFGFPALSTPGAGGLAVS CEA9_ECOLI 85 85 -
AAPMAFGFPALAAPGAGTLGIS Q51604 80 80 -

Motif 2 width=17
Element Seqn Id St Int Rpt
SAGALSAAIADIMAALK CEA3_ECOLI 108 1 -
SAGALSAAIADIMAALK CEA6_ECOLI 108 1 -
SAGALSAAIADIMAALK CEA2_ECOLI 108 1 -
SGDALSSAVADVLAALK CEAC_ECOLI 117 1 -
SASELSAAIAGIIAKLK CEA9_ECOLI 108 1 -
SGEALSAAIADIFAALK Q51604 103 1 -

Motif 3 width=20
Element Seqn Id St Int Rpt
FKFGLWGVALYGVLPSQIAK CEA3_ECOLI 127 2 -
FKFGLWGVALYGVLPSQIAK CEA6_ECOLI 127 2 -
FKFGLWGVALYGVLPSQIAK CEA2_ECOLI 127 2 -
FKFGLWGIAIYGVLPSEIAK CEAC_ECOLI 136 2 -
LKFTPFGVVLSSLIPSEIAK CEA9_ECOLI 128 3 -
FKFSAWGIALYGILPSEIAK Q51604 122 2 -

Motif 4 width=18
Element Seqn Id St Int Rpt
SLPADDITESPVSSLPLD CEA3_ECOLI 158 11 -
SLPADDITESPVSSLPLD CEA6_ECOLI 158 11 -
SLPADDITESPVSSLPLD CEA2_ECOLI 158 11 -
SLPADTVTETPASTLPLD CEAC_ECOLI 167 11 -
SLPADDITESPVSSLPLD CEA9_ECOLI 159 11 -
SLPAETVTNVQVSTLPLD Q51604 153 11 -

Motif 5 width=20
Element Seqn Id St Int Rpt
RQNISVVSGVPMSVPVVDAK CEA3_ECOLI 193 17 -
RQNISVVSGVPMSVPVVDAK CEA6_ECOLI 193 17 -
RQNISVVSGVPMSVPVVDAK CEA2_ECOLI 193 17 -
RQHIAVVAGRPMSVPVVDAK CEAC_ECOLI 202 17 -
RQNISVVSGVPMSVPVVDAK CEA9_ECOLI 194 17 -
RQHIAVVAGVPMSVPVVNAK Q51604 188 17 -

Motif 6 width=23
Element Seqn Id St Int Rpt
GVFTASIPGAPVLNISVNNSTPA CEA3_ECOLI 218 5 -
GVFTASIPGAPVLNISVNNSTPA CEA6_ECOLI 218 5 -
GVFTASIPGAPVLNISVNNSTPE CEA2_ECOLI 218 5 -
GVFSVSIPGLPALQVSVPKGVPA CEAC_ECOLI 227 5 -
GVFTASIPGAPVLNISVNDSTPA CEA9_ECOLI 219 5 -
GVFHASFPGVPSLTVSTVKGLPV Q51604 213 5 -

Motif 7 width=23
Element Seqn Id St Int Rpt
RDAVIRFPKDSGHNAVYVSVSDV CEA3_ECOLI 268 27 -
RDAVIRFPKDSGHNAVYVSVSDV CEA6_ECOLI 268 27 -
RDAVIRFPKDSGHNAVYVSVSDV CEA2_ECOLI 268 27 -
REAVIRFPKETGQKPVYVSVTDV CEAC_ECOLI 276 26 -
RDAVIRFPKDSGHNAVYVSVSDV CEA9_ECOLI 269 27 -
HEAVIRFPKESGQKPVYVSVTDV Q51604 263 27 -

Motif 8 width=20
Element Seqn Id St Int Rpt
WDATHPVEAAERNYERARAE CEA3_ECOLI 310 19 -
WDATHPVEAAERNYERARAE CEA6_ECOLI 310 19 -
WDATHPVEAAERNYERARAE CEA2_ECOLI 310 19 -
WDAAHPEEGLKREYDKAKAE CEAC_ECOLI 318 19 -
WDATHPVEAAERNYERARAE CEA9_ECOLI 311 19 -
WNDAHPVEVAERNYEQARAE Q51604 305 19 -