==SPRINT==> PRINTS View



PR00731

Identifier
CAULIMOPTASE
Accession
PR00731
No. of Motifs
4
Creation Date
01-MAY-1997  (UPDATE 14-JUN-1999)
Title
Cauliflower mosaic virus peptidase (A3) signature
Database References

INTERPRO; IPR000588
Literature References
1. RAWLINGS, N.D. AND BARRETT, A.J.
Families of aspartic peptidases, and those of unknown catalytic mechanism.
METHODS ENZYMOL. 248 105-120 (1995).

Documentation
Cauliflower mosaic viruses belong to a group of plant viruses known as
pararetroviruses, which have a double-stranded DNA genome [1]. The genome
includes an open reading frame (ORF V) that shows similarities to the pol
gene of retroviruses [1]. This ORF encodes a polyprotein that includes a
reverse transcriptase; a DTG triplet near the N-terminus of the
polyprotein suggested that it also includes an aspartic protease [1].
 
The presence of an aspartic protease has been confirmed by mutational
studies, which implicate Asp-45 in catalysis [1]. The protease releases
itself from the polyprotein and takes part in processing the ORF IV
polyprotein, which includes the viral coat protein [1].
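
As a consistency check, the catalytic residue can be located from the
motif data listed in this entry. The sketch below takes the motif 2
element and start position given for POL_CAMVC and reports where the
aspartate of the DTG triplet falls. It assumes that the "St" column is
the 1-based position of the first residue of the element, which this
entry does not state explicitly; the function name is illustrative only.

```python
# Motif 2 element and start position for POL_CAMVC, copied from this entry.
# Assumption: "St" is the 1-based position of the element's first residue.
MOTIF2_ELEMENT = "FVDTGASLCIASKFV"
MOTIF2_START = 43


def residue_position(element, start, triplet="DTG"):
    """Return the 1-based sequence position of the first residue of `triplet`."""
    offset = element.find(triplet)
    if offset < 0:
        raise ValueError(f"{triplet!r} not found in {element!r}")
    return start + offset


asp_position = residue_position(MOTIF2_ELEMENT, MOTIF2_START)
print(f"Asp-{asp_position}")  # prints "Asp-45"
```

Under that assumption the aspartate of the DTG triplet in POL_CAMVC sits
at position 45, in agreement with the Asp-45 named above.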
 
CAULIMOPTASE is a 4-element fingerprint that provides a signature for the
cauliflower mosaic virus aspartate protease. The fingerprint was derived
from an initial alignment of 7 sequences: the motifs were drawn from
conserved regions within the gag/pol protease domain, motif 2 containing 
the catalytic aspartate residue. Two iterations on OWL29.2 were required
to reach convergence, at which point a true set comprising 12 sequences
was identified. A single partial match was also found, PCU139881, a
fragment lacking the region of sequence bearing the first motif.
 
An update on SPTR37_9f identified a true set of 12 sequences.
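
The statement that motif 2 carries the catalytic aspartate can be
illustrated with the final motif 2 elements listed in this entry. The
sketch below reports which alignment columns are identical in every
element. It is only an illustration of the listed data, not the method
used to build the fingerprint; the helper name is hypothetical.

```python
# Distinct motif 2 elements from the final motif table of this entry.
# The seven cauliflower mosaic virus rows all share the first element.
MOTIF2_ELEMENTS = [
    "FVDTGASLCIASKFV",  # POL_CAMVC and the other six CaMV sequences
    "YVDTGASLCIASRYI",  # POL_FMVD
    "YVDTGSSLCMASKYV",  # POL_CERV
    "YVDTGASMCTANKHV",  # Q88442
    "YIDTGATLCFGKRKI",  # POL_SOCMV
    "YIDTGATICLAQAKI",  # Q84682
]


def invariant_columns(elements):
    """Return (1-based column, residue) pairs identical in every element."""
    if len({len(e) for e in elements}) != 1:
        raise ValueError("all elements must have the same width")
    return [
        (col, column[0])
        for col, column in enumerate(zip(*elements), start=1)
        if len(set(column)) == 1
    ]


print(invariant_columns(MOTIF2_ELEMENTS))
# [(3, 'D'), (4, 'T'), (5, 'G'), (9, 'C')]
```

The DTG triplet occupies columns 3 to 5 of every element, and a cysteine
at column 9 is the only other invariant position.
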
Summary Information
12 codes involving  4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
   4 | 12 12 12 12
   3 |  0  0  0  0
   2 |  0  0  0  0
   --+------------
     |  1  2  3  4
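
The composite feature index can be read as a tally: each row is the
number of elements matched by a sequence, each column is a motif, and
each cell counts the sequences in that row that match that motif. The
sketch below rebuilds the figures of this entry from the true-positive
list, assuming that reading of the table and assuming that every true
positive matches all four motifs (as the summary states). The function
name and data layout are illustrative, not part of PRINTS.

```python
TRUE_POSITIVES = [
    "POL_CAMVC", "POL_CAMVD", "POL_CAMVE", "POL_CAMVN",
    "POL_CAMVS", "POL_CERV", "POL_FMVD", "POL_SOCMV",
    "Q66162", "Q83169", "Q84682", "Q88442",
]


def composite_feature_index(matches, n_motifs=4):
    """Tally motif matches per number of elements matched.

    `matches` maps a sequence id to the set of motif numbers it matches.
    Returns {elements matched: [count for motif 1, motif 2, ...]} for
    rows n_motifs down to 2.
    """
    table = {n: [0] * n_motifs for n in range(n_motifs, 1, -1)}
    for motifs in matches.values():
        row = table.get(len(motifs))
        if row is None:  # fewer than 2 elements: not tabulated
            continue
        for m in motifs:
            row[m - 1] += 1
    return table


matches = {seq_id: {1, 2, 3, 4} for seq_id in TRUE_POSITIVES}
for n_elements, counts in composite_feature_index(matches).items():
    print(n_elements, counts)
```

A hypothetical partial match lacking motif 1 would add one to columns 2,
3 and 4 of the 3-element row and leave the 4-element row unchanged.
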
True Positives
POL_CAMVC  POL_CAMVD  POL_CAMVE  POL_CAMVN
POL_CAMVS  POL_CERV   POL_FMVD   POL_SOCMV
Q66162     Q83169     Q84682     Q88442
Sequence Titles
POL_CAMVC ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - CAULIFLOWER MOSAIC VIRUS (STRAIN CM-1841) (CAMV).
POL_CAMVD ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - CAULIFLOWER MOSAIC VIRUS (STRAIN D/H) (CAMV).
POL_CAMVE ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - CAULIFLOWER MOSAIC VIRUS (STRAIN BBC) (CAMV).
POL_CAMVN ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - CAULIFLOWER MOSAIC VIRUS (STRAIN NY8153) (CAMV).
POL_CAMVS ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - CAULIFLOWER MOSAIC VIRUS (STRAIN STRASBOURG) (CAMV).
POL_CERV ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - CARNATION ETCHED RING VIRUS (CERV).
POL_FMVD ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - FIGWORT MOSAIC VIRUS (STRAIN DXS) (FMV).
POL_SOCMV ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.23.-); ENDONUCLEASE; REVERSE TRANSCRIPTASE (EC 2.7.7.49)] - SOYBEAN CHLOROTIC MOTTLE VIRUS.
Q66162 ORF V - CAULIFLOWER MOSAIC VIRUS.
Q83169 REVERSE TRANSCRIPTASE - CAULIFLOWER MOSAIC VIRUS.
Q84682 REVERSE TRANSCRIPTASE - PEANUT CHLOROTIC STREAK VIRUS.
Q88442 COMPLETE GENOME - STRAWBERRY VEIN BANDING VIRUS.
Scan History
OWL29_2    2  100  NSINGLE    
SPTR37_9f 2 200 NSINGLE
Initial Motifs
Motif 1  width=14
Element Seqn Id St Int Rpt
NVTNPNSIYIKGRL POL_CAMVC 17 17 -
NVTNPNSIYIKGRL POL_CAMVN 18 18 -
NVTNPNSIYIKGRL POL_CAMVS 17 17 -
NITNPNSIYIKGRL POL_CAMVD 19 19 -
NVTNPNSIYIEGKL POL_FMVD 26 26 -
NRTNPNSIYVKGIL POL_CERV 5 5 -
TKGNPNVTFIKVSI POL_SOCMV 13 13 -

Motif 2 width=15
Element Seqn Id St Int Rpt
FVDTGASLCIASKFV POL_CAMVC 43 12 -
FVDTGASLCIASKFV POL_CAMVN 44 12 -
FVDTGASLCIASKFV POL_CAMVS 43 12 -
FVDTGASLCIASKFV POL_CAMVD 45 12 -
YVDTGASLCIASRYI POL_FMVD 52 12 -
YVDTGSSLCMASKYV POL_CERV 32 13 -
YIDTGATLCFGKRKI POL_SOCMV 34 7 -

Motif 3 width=16
Element Seqn Id St Int Rpt
FKIPTVYQQESGIDFI POL_CAMVC 98 40 -
FKIPTVYQQESGIDFI POL_CAMVN 99 40 -
FRIPTVYQQESGIDFI POL_CAMVS 98 40 -
FHIPTVYQQESGIDFI POL_CAMVD 100 40 -
FEIPTVYQQETGIDFL POL_FMVD 107 40 -
FLIPTLFQQESGIDLL POL_CERV 87 40 -
FLIPIIYLHDSGLDLI POL_SOCMV 87 38 -

Motif 4 width=15
Element Seqn Id St Int Rpt
IGNNFCQLYEPFIQF POL_CAMVC 114 0 -
IGNNFCQLYEPFIQF POL_CAMVN 115 0 -
IGNNFCQLYEPFIQF POL_CAMVS 114 0 -
IGNNFCQLYEPFIQF POL_CAMVD 116 0 -
IGNNFCRLYNPFIQW POL_FMVD 123 0 -
LGNNFCQLYSPFIQY POL_CERV 103 0 -
IGNNFLKLYQPFIQR POL_SOCMV 103 0 -
Final Motifs
Motif 1  width=14
Element Seqn Id St Int Rpt
NVTNPNSIYIKGRL POL_CAMVC 17 17 -
NVTNPNSIYIKGRL Q83169 18 18 -
NVTNPNSIYIKGRL Q66162 18 18 -
NVTNPNSIYIKGRL POL_CAMVN 18 18 -
NVTNPNSIYIKGRL POL_CAMVE 17 17 -
NVTNPNSIYIKGRL POL_CAMVS 17 17 -
NITNPNSIYIKGRL POL_CAMVD 19 19 -
NVTNPNSIYIEGKL POL_FMVD 26 26 -
NRTNPNSIYVKGIL POL_CERV 5 5 -
TKTNPNSIYIRGNF Q88442 51 51 -
TKGNPNVTFIKVSI POL_SOCMV 13 13 -
SSKNSSFIKVKLFN Q84682 2 2 -

Motif 2 width=15
Element Seqn Id St Int Rpt
FVDTGASLCIASKFV POL_CAMVC 43 12 -
FVDTGASLCIASKFV Q83169 44 12 -
FVDTGASLCIASKFV Q66162 44 12 -
FVDTGASLCIASKFV POL_CAMVN 44 12 -
FVDTGASLCIASKFV POL_CAMVE 43 12 -
FVDTGASLCIASKFV POL_CAMVS 43 12 -
FVDTGASLCIASKFV POL_CAMVD 45 12 -
YVDTGASLCIASRYI POL_FMVD 52 12 -
YVDTGSSLCMASKYV POL_CERV 32 13 -
YVDTGASMCTANKHV Q88442 77 12 -
YIDTGATLCFGKRKI POL_SOCMV 34 7 -
YIDTGATICLAQAKI Q84682 21 5 -

Motif 3 width=16
Element Seqn Id St Int Rpt
FKIPTVYQQESGIDFI POL_CAMVC 98 40 -
FKIPTVYQQESGIDFI Q83169 99 40 -
FKIPTVYQQESGIDFI Q66162 99 40 -
FKIPTVYQQESGIDFI POL_CAMVN 99 40 -
FKIPTVYQQESGIDFI POL_CAMVE 98 40 -
FRIPTVYQQESGIDFI POL_CAMVS 98 40 -
FHIPTVYQQESGIDFI POL_CAMVD 100 40 -
FEIPTVYQQETGIDFL POL_FMVD 107 40 -
FLIPTLFQQESGIDLL POL_CERV 87 40 -
FIIPTLYQATTKGDIT Q88442 132 40 -
FLIPIIYLHDSGLDLI POL_SOCMV 87 38 -
FPLPSVYQQDAGLPLI Q84682 76 40 -

Motif 4 width=15
Element Seqn Id St Int Rpt
IGNNFCQLYEPFIQF POL_CAMVC 114 0 -
IGNNFCQLYEPFIQF Q83169 115 0 -
IGNNFCQLYEPFIQF Q66162 115 0 -
IGNNFCQLYEPFIQF POL_CAMVN 115 0 -
IGNNFCQLYEPFIQF POL_CAMVE 114 0 -
IGNNFCQLYEPFIQF POL_CAMVS 114 0 -
IGNNFCQLYEPFIQF POL_CAMVD 116 0 -
IGNNFCRLYNPFIQW POL_FMVD 123 0 -
LGNNFCQLYSPFIQY POL_CERV 103 0 -
LGNNFCRLYEPFVQY Q88442 148 0 -
IGNNFLKLYQPFIQR POL_SOCMV 103 0 -
LGNNFLKLYNPFIQT Q84682 92 0 -
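
The "Int" column in the motif tables can be reproduced from the "St"
column and the motif widths. The sketch below assumes that the interval
for a motif is its start position minus the position immediately after
the previous motif, with the first motif measured from position 0. This
definition is inferred from the values in this entry rather than taken
from a specification, and the function name is illustrative.

```python
# Motif widths as given in the motif headers of this entry.
WIDTHS = [14, 15, 16, 15]


def intervals(starts, widths=WIDTHS):
    """Derive the interval preceding each motif from its start position.

    Assumption: interval = start - (previous start + previous width),
    and the first motif is measured from position 0.
    """
    result = []
    after_previous = 0  # position immediately after the previous motif
    for start, width in zip(starts, widths):
        result.append(start - after_previous)
        after_previous = start + width
    return result


# Start positions of motifs 1-4, copied from the final motif tables.
print(intervals([17, 43, 98, 114]))  # POL_CAMVC -> [17, 12, 40, 0]
print(intervals([2, 21, 76, 92]))    # Q84682    -> [2, 5, 40, 0]
```

An interval of 0 for motif 4 means that it begins directly after motif
3, so the two motifs are contiguous in every sequence listed.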