
==SPRINT==> PRINTS View




Identifier
PROLIGOPTASE
Accession
PR00862
No. of Motifs
6
Creation Date
26-APR-1998  (UPDATE 07-JUN-1999)
Title
Prolyl oligopeptidase serine protease (S9A) signature
Database References

PROSITE; PS00708 PRO_ENDOPEP_SER
BLOCKS; BL00708
PFAM; PF00326 Prolyl_oligopep
INTERPRO; IPR002470
Literature References
1. RAWLINGS, N.D. AND BARRETT, A.J.
Families of serine peptidases.
METHODS ENZYMOL. 244 19-61 (1994).
 
2. RAWLINGS, N.D. AND BARRETT, A.J.
Evolutionary families of peptidases.
BIOCHEM.J. 290 205-218 (1993).
 
3. BAIROCH, A. AND RAWLINGS, N.
Classification of peptidase families and index of peptidase entries in
SWISS-PROT.
http://expasy.hcuge.ch/cgi-bin/lists?peptidas.txt

Documentation
Proteolytic enzymes that use serine in their catalytic machinery are 
widespread and numerous, being found in viruses, bacteria and eukaryotes
[1]. They encompass a range of peptidase activity, including exopeptidase,
endopeptidase, oligopeptidase and omega-peptidase. More than 20 serine
protease families (denoted S1 - S27) have been identified, which have been
grouped into 6 clans (SA, SB, SC, SE, SF and SG) on the basis of structural
and functional similarities [1]. Structures from four clans have been
examined (SA, SB, SC and SE): these appear to be unrelated, suggesting at 
least four evolutionary origins of serine peptidase, and possibly many more
[1]. Since that examination, structures of members of the other two clans
(SF and SG) have been determined [3].
 
Notwithstanding their different evolutionary origins, there are similarities
in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin
and carboxypeptidase C clans have a catalytic triad of serine, aspartate and
histidine in common: serine acts as a nucleophile, aspartate as an
electrophile, and histidine as a base [1]. The geometric orientations of
the catalytic residues are similar between families, despite different 
protein folds [1]. The linear arrangements of the catalytic residues
commonly reflect clan relationships. For example, the catalytic triad in
the chymotrypsin clan (SA) is ordered HDS, but is ordered DHS in the
subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [1,2].
 
Prolyl oligopeptidases belong to the S9A family of the carboxypeptidase (SC)
clan [1,3]. The active site of members of this clan consists of a linear
arrangement of serine, aspartate and histidine catalytic residues [1]. Prolyl
oligopeptidases are either cytosolic or membrane-bound, and cleave peptide
bonds with prolyl P1 specificity (although cleavage of alanyl bonds has also
been detected). The proline must adopt a trans configuration within the
chain. Peptides of up to 30 residues are cleaved [1].
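
The P1 specificity described above can be illustrated with a minimal sketch. The function name `post_proline_sites` and the example peptide (bradykinin, RPPGFSPFR) are assumptions introduced here for illustration and are not part of this entry. The sketch only lists candidate bonds from sequence: it ignores the trans-configuration requirement and the occasional alanyl cleavage, and treats the 30-residue limit as a hard cut-off.

```python
def post_proline_sites(peptide: str, max_len: int = 30) -> list[int]:
    """Return 1-based positions of Pro residues followed by a peptide bond.

    Hypothetical helper: a residue in the P1 position is the one on the
    N-terminal side of the scissile bond, so a C-terminal Pro is skipped
    because no bond follows it.
    """
    if len(peptide) > max_len:
        # The documentation states that peptides of up to 30 residues are cleaved.
        return []
    return [i + 1 for i, aa in enumerate(peptide[:-1]) if aa == "P"]


# Bradykinin is used here purely as an example input (an assumption).
sites = post_proline_sites("RPPGFSPFR")
```

The result is a list of candidate positions only; whether a given bond is actually hydrolysed depends on factors the sketch does not model.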
 
PROLIGOPTASE is a 6-element fingerprint that provides a signature for the
prolyl oligopeptidase (S9A) family of serine proteases. The fingerprint was
derived from an initial alignment of 13 sequences: the motifs were drawn
from conserved regions spanning the C-terminal half of the alignment -
motifs 3 and 4 overlap the region encoded by PROSITE pattern PRO_ENDOPEP_SER
(PS00708), which contains the catalytic serine; and motif 6 encodes the
catalytic aspartate. Two iterations on OWL30.1 were required to reach
convergence, at which point a true set comprising 19 sequences was found.
 
An update on SPTR37_9f identified a true set of 19 sequences.
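
As a rough cross-check of the statement that motif 4 lies in the catalytic serine region and motif 6 encodes the catalytic aspartate, the sketch below maps a residue number onto the PPCE_HUMAN elements listed under Final Motifs. The residue numbers 554 and 641 are an assumption taken from published prolyl oligopeptidase structures, not from this entry, and the helper name `locate` is hypothetical.

```python
# PPCE_HUMAN elements copied from the Final Motifs tables: motif -> (start, element).
HUMAN_MOTIFS = {
    1: (467, "PAFLYGYGGFNISITPNYS"),
    2: (494, "HMGGILAVANIRGGGEYGETWHKGG"),
    3: (522, "NKQNCFDDFQCAAEYLIKEG"),
    4: (552, "GGSNGGLLVAACANQRPDLFG"),
    5: (607, "HFEWLVKYSPLHNVKL"),
    6: (630, "YPSMLLLTADHDDRVVPLHSLKF"),
}


def locate(position: int):
    """Return (motif number, residue) for a 1-based sequence position, or None."""
    for motif, (start, element) in HUMAN_MOTIFS.items():
        if start <= position < start + len(element):
            return motif, element[position - start]
    return None


# Assumed catalytic residue numbers (from the literature, not from this entry).
serine_hit = locate(554)
aspartate_hit = locate(641)
```

Under these assumptions, position 554 falls on a serine in motif 4 and position 641 on an aspartate in motif 6, which agrees with the description above.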
Summary Information
19 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
   6 |  19   19   19   19   19   19
   5 |   0    0    0    0    0    0
   4 |   0    0    0    0    0    0
   3 |   0    0    0    0    0    0
   2 |   0    0    0    0    0    0
     |   1    2    3    4    5    6
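
The summary and index above can be reproduced with a short sketch. It assumes the usual reading of the table: each row is the number of elements a sequence matches, each column is a motif number, and each cell counts the sequences in that row that match that motif. The function name `composite_feature_index` is hypothetical; the identifiers are the 19 true positives listed in this entry, each of which matches all six motifs.

```python
from collections import Counter

TRUE_POSITIVES = (
    "O05748 O07178 O07834 O58991 O70196 O76728 P71835 PPCE_AERHY "
    "PPCE_FLAME PPCE_HUMAN PPCE_PIG PPCF_FLAME PTRB_ECOLI PTRB_MORLA "
    "Q51714 Q94795 Y4NA_RHISN Y4QF_RHISN Y4SO_RHISN"
).split()


def composite_feature_index(matches: dict, n_motifs: int):
    """Return (summary, cfi) for a mapping of sequence id -> set of matched motifs.

    summary[k] is the number of sequences matching exactly k motifs;
    cfi[k][m - 1] is how many of those sequences match motif m.
    """
    summary = Counter(len(motifs) for motifs in matches.values())
    cfi = {k: [0] * n_motifs for k in range(2, n_motifs + 1)}
    for motifs in matches.values():
        k = len(motifs)
        if k >= 2:
            for m in motifs:
                cfi[k][m - 1] += 1
    return summary, cfi


# Every true positive in this entry matches all six elements.
matches = {seq_id: {1, 2, 3, 4, 5, 6} for seq_id in TRUE_POSITIVES}
summary, cfi = composite_feature_index(matches, n_motifs=6)
```

With this input the top row of the index is 19 in every column and all lower rows are zero, as in the table.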
True Positives
O05748 O07178 O07834 O58991
O70196 O76728 P71835 PPCE_AERHY
PPCE_FLAME PPCE_HUMAN PPCE_PIG PPCF_FLAME
PTRB_ECOLI PTRB_MORLA Q51714 Q94795
Y4NA_RHISN Y4QF_RHISN Y4SO_RHISN
Sequence Titles
O05748 PTRB - MYCOBACTERIUM LEPRAE.
O07178 HYPOTHETICAL 74.5 KD PROTEIN - MYCOBACTERIUM TUBERCULOSIS.
O07834 DIPEPTIDYL AMINOPEPTIDASE - PSEUDOMONAS SP.
O58991 617AA LONG HYPOTHETICAL PROLYL ENDOPEPTIDASE - PYROCOCCUS HORIKOSHII.
O70196 RPOP - RATTUS NORVEGICUS (RAT).
O76728 OLIGOPEPTIDASE B - TRYPANOSOMA BRUCEI BRUCEI.
P71835 HYPOTHETICAL 60.9 KD PROTEIN CY369.26 - MYCOBACTERIUM TUBERCULOSIS.
PPCE_AERHY PROLYL ENDOPEPTIDASE (EC 3.4.21.26) (POST-PROLINE CLEAVING ENZYME) (PE) - AEROMONAS HYDROPHILA.
PPCE_FLAME PROLYL ENDOPEPTIDASE PRECURSOR (EC 3.4.21.26) (POST-PROLINE CLEAVING ENZYME) (PE) (VERSION 1) - FLAVOBACTERIUM MENINGOSEPTICUM.
PPCE_HUMAN PROLYL ENDOPEPTIDASE (EC 3.4.21.26) (POST-PROLINE CLEAVING ENZYME) (PE) - HOMO SAPIENS (HUMAN).
PPCE_PIG PROLYL ENDOPEPTIDASE (EC 3.4.21.26) (POST-PROLINE CLEAVING ENZYME) (PE) - SUS SCROFA (PIG).
PPCF_FLAME PROLYL ENDOPEPTIDASE PRECURSOR (EC 3.4.21.26) (POST-PROLINE CLEAVING ENZYME) (PE) (VERSION 2) - FLAVOBACTERIUM MENINGOSEPTICUM.
PTRB_ECOLI PROTEASE II (EC 3.4.21.83) (OLIGOPEPTIDASE B) - ESCHERICHIA COLI.
PTRB_MORLA PROTEASE II (EC 3.4.21.83) (OLIGOPEPTIDASE B) - MORAXELLA LACUNATA.
Q51714 PROLYL ENDOPEPTIDASE (EC 3.4.21.26) (POST-PROLINE CLEAVING ENZYME) (PE) - PYROCOCCUS FURIOSUS.
Q94795 PEPTIDASE A - TRYPANOSOMA CRUZI.
Y4NA_RHISN PROBABLE PEPTIDASE Y4NA (EC 3.4.21.-) - RHIZOBIUM SP. (STRAIN NGR234).
Y4QF_RHISN PROBABLE PEPTIDASE Y4QF (EC 3.4.21.-) - RHIZOBIUM SP. (STRAIN NGR234).
Y4SO_RHISN PROBABLE PEPTIDASE Y4SO (EC 3.4.21.-) - RHIZOBIUM SP. (STRAIN NGR234).
Scan History
OWL30_0 2 50 NSINGLE
SPTR37_9f 2 20 NSINGLE
Initial Motifs
Motif 1 width=19
Element Seqn Id St Int Rpt
PTILYGYGGFDVSLTPSFS PPCE_AERHY 451 451 -
PAFLYGYGGFNISITPNYS PPCE_HUMAN 467 467 -
PAFLYGYGGFNISITPNYS PPCE_PIG 467 467 -
PLLVYGYGSYGASIDADFS PTRB_ECOLI 446 446 -
PTLLNGYGGFETSRTPTYD MTCI429A 248 248 -
PTVLYAYGSYGACVEPEFS TCU69897 475 475 -
RAWVFGYGGFNIALTPMFF JC4084 395 395 -
PLLVHVYGAYGMDLKMNFR AB007896 435 435 -
PVLLSVYGCYGIPRLPSFL Y4QF_RHISN 493 493 -
PVLLNVYGCYGAQSLPAFF Y4SO_RHISN 462 462 -
PTILYAYGGFQIPMQPSYS Y4NA_RHISN 496 496 -
PTILYSYGGFNISLQPAFS PPCE_FLAME 470 470 -
PTILYSYGGFNISLQPAFS PPCF_FLAME 470 470 -

Motif 2 width=25
Element Seqn Id St Int Rpt
DRGVAFGIVHVRGGGELGRAWHEAA Y4SO_RHISN 494 13 -
EKGGAYALANIRGGGEFGPKWHDAG Y4NA_RHISN 523 8 -
DRGFVYAIVHVRGGGELGQQWYEDG PTRB_ECOLI 472 7 -
HMGGVLAVANIRGGGEYGETWHKGG PPCE_PIG 494 8 -
DRGVIYVIAHVRGGGEMGRAWYEVG TCU69897 501 7 -
ARGGTYALANIRGGGEYGPGWHTQA MTCI429A 275 8 -
KRGGTFIMANLRGGSEYGEEWHRAG JC4084 421 7 -
DDGWILAYCHVRGGGELGLQWHADG AB007896 461 7 -
HMGGILAVANIRGGGEYGETWHKGG PPCE_HUMAN 494 8 -
DLGGVYAVANLRGGGEYGQAWHLAG PPCE_AERHY 477 7 -
ENGGIYAVPNIRGGGEYGKKWHDAG PPCF_FLAME 496 7 -
ENGGIYAVPNIRGGGEYGKKWHDAG PPCE_FLAME 496 7 -
DREVAFGIVHVRGGGELGRPWHDAA Y4QF_RHISN 525 13 -

Motif 3 width=20
Element Seqn Id St Int Rpt
GRDKVAQDFAAVATDLVTRG MTCI429A 303 3 -
TKRNTFSDFIACAEYLIEIG TCU69897 530 4 -
NKQNVFDDFIAVLEKRKKEG JC4084 449 3 -
KKLNGLADLEACIKTLHGQG AB007896 489 3 -
QKKNVFNDFIAAGEYLQKNG PPCE_FLAME 524 3 -
QKKNVFNDFIAAGEYLQKNG PPCF_FLAME 524 3 -
NKQNVFDDFIAAAEYLKAEG PPCE_AERHY 505 3 -
NKQNCFDDFQCAAEYLIKEG PPCE_HUMAN 522 3 -
NKQNCFDDFQCAAEYLIKEG PPCE_PIG 522 3 -
KKKNTFNDYLDACDALLKLG PTRB_ECOLI 500 3 -
NRQRVYDDFQAVAQDLIAKK Y4NA_RHISN 551 3 -
QKRLTHTDLIAAAECLVEHR Y4SO_RHISN 522 3 -
QKRITHTDLISATEGLIERG Y4QF_RHISN 553 3 -

Motif 4 width=21
Element Seqn Id St Int Rpt
AFSAGGVLAGALCNSNPELVR AB007896 519 10 -
GGSNGGLLMGIMLTGYPEKFG MTCI429A 333 10 -
GRSAGGLLIGAVLNMRPDLFR TCU69897 560 10 -
GRSNGGLLVGATMTMRPDLAK PPCE_FLAME 554 10 -
GRSNGGLLVGATMTMRPDLAK PPCF_FLAME 554 10 -
GGSNGGLLVGAVMTQRPDLMR PPCE_AERHY 535 10 -
GGSNGGLLVAACANQRPDLFG PPCE_HUMAN 552 10 -
GGSNGGLLVATCANQRPDLFG PPCE_PIG 552 10 -
GGSAGGMLMGVAINQRPELFH PTRB_ECOLI 530 10 -
GGSNGGLLMGVQMIQRPDLWN Y4NA_RHISN 581 10 -
GRSAGGGTVLAAAVLRPDLFR Y4SO_RHISN 552 10 -
GKSGGGGTVLATAVFRPNLFR Y4QF_RHISN 583 10 -
GRSNGGLLVSATLTQRPDVMD JC4084 475 6 -

Motif 5 width=16
Element Seqn Id St Int Rpt
EYRYLRSYDPYYNLSP Y4QF_RHISN 641 37 -
HKNYIKRYCPYQNIKP AB007896 578 38 -
DREFLLKYSPYHNVDP JC4084 530 34 -
DWKFISEYSPYQNISA MTCI429A 388 34 -
FFDYMNSYSPVDNVRA TCU69897 618 37 -
MFEYLKSYSPVHNVKA PPCE_FLAME 610 35 -
MFEYLKSYSPVHNVKA PPCF_FLAME 610 35 -
MFDYLKGYSPLHSVRA PPCE_AERHY 591 35 -
HFEWLVKYSPLHNVKL PPCE_HUMAN 607 34 -
HFEWLIKYSPLHNVKL PPCE_PIG 607 34 -
YYEYMKSYSPYDNVTA PTRB_ECOLI 588 37 -
EGAFLRSISPYHNVKA Y4NA_RHISN 636 34 -
DYQYLRSYDPYYNLTP Y4SO_RHISN 610 37 -

Motif 6 width=23
Element Seqn Id St Int Rpt
YPPTYIDAALHDSQVLYYQPARY Y4SO_RHISN 629 3 -
YPPTLIYTGLHDDRVHPAHALKF JC4084 549 3 -
YPPVLMTTSTRDDRVHPGHARKM MTCI429A 407 3 -
YPHLMIQAGLHDPRVAYWEPAKW TCU69897 636 2 -
LPPTYVDAALDDGQVIYYQPARY Y4QF_RHISN 660 3 -
YPSIHITAYENDERVPLKGIVSY AB007896 596 2 -
YPSTMVITSDHDDRVVPAHSFKF PPCE_FLAME 629 3 -
YPSTMVITSDHDDRVVPAHSFKF PPCF_FLAME 629 3 -
YPSTLVTTADHDDRVVPAHSFKF PPCE_AERHY 610 3 -
YPSMLLLTADHDDRVVPLHSLKF PPCE_HUMAN 630 7 -
YPSMLLLTADHDDRVVPLHSLKF PPCE_PIG 630 7 -
YPHLLVTTGLHDSQVQYWEPAKW PTRB_ECOLI 606 2 -
YPEPFFETSTKDDRVGPVHARKM Y4NA_RHISN 655 3 -
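
To see which positions drive a motif, one simple approach is to list the columns that are identical in every element. The sketch below does this for the 13 initial Motif 2 elements copied from the table above; the helper name `invariant_columns` is hypothetical, and strict identity is only one of several possible conservation measures.

```python
# The 13 elements of initial Motif 2, in table order.
MOTIF2_INITIAL = [
    "DRGVAFGIVHVRGGGELGRAWHEAA",
    "EKGGAYALANIRGGGEFGPKWHDAG",
    "DRGFVYAIVHVRGGGELGQQWYEDG",
    "HMGGVLAVANIRGGGEYGETWHKGG",
    "DRGVIYVIAHVRGGGEMGRAWYEVG",
    "ARGGTYALANIRGGGEYGPGWHTQA",
    "KRGGTFIMANLRGGSEYGEEWHRAG",
    "DDGWILAYCHVRGGGELGLQWHADG",
    "HMGGILAVANIRGGGEYGETWHKGG",
    "DLGGVYAVANLRGGGEYGQAWHLAG",
    "ENGGIYAVPNIRGGGEYGKKWHDAG",
    "ENGGIYAVPNIRGGGEYGKKWHDAG",
    "DREVAFGIVHVRGGGELGRPWHDAA",
]


def invariant_columns(elements: list) -> dict:
    """Map 1-based column number -> residue for columns identical in all elements."""
    if len({len(e) for e in elements}) != 1:
        raise ValueError("all elements must have the same width")
    return {
        i + 1: column[0]
        for i, column in enumerate(zip(*elements))
        if len(set(column)) == 1
    }


conserved = invariant_columns(MOTIF2_INITIAL)
```

For these elements the invariant columns are R12, G13, G14, E16, G18 and W21, that is, an RGGxExGxxW core within the 25-residue motif.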
Final Motifs
Motif 1 width=19
Element Seqn Id St Int Rpt
PTILYSYGGFNISLQPAFS PPCE_FLAME 470 470 -
PTILYSYGGFNISLQPAFS PPCF_FLAME 470 470 -
PTILYGYGGFDVSLTPSFS PPCE_AERHY 451 451 -
PAFLYGYGGFNISITPNYS O70196 467 467 -
PAFLYGYGGFNISITPNYS PPCE_HUMAN 467 467 -
PAFLYGYGGFNISITPNYS PPCE_PIG 467 467 -
PTMLYGYGSYGICIEPEFN O76728 476 476 -
PTVLYAYGSYGACVEPEFS Q94795 475 475 -
PALIYGYGAYEICEDPRFS P71835 302 302 -
PAVLYGYGAYEICVDPSFS O05748 473 473 -
KAWVFGYGGFNIALTPRFF O58991 396 396 -
RAWVFGYGGFNIALTPMFF Q51714 395 395 -
PLILYGYGSYGSNSDPRFD PTRB_MORLA 448 448 -
PLLVYGYGSYGASIDADFS PTRB_ECOLI 446 446 -
PMLQYAYGSYGASMDPNFS O07834 488 488 -
PTILYAYGGFQIPMQPSYS Y4NA_RHISN 496 496 -
PVLLNVYGCYGAQSLPAFF Y4SO_RHISN 462 462 -
PTLLNGYGGFETSRTPTYD O07178 445 445 -
PVLLSVYGCYGIPRLPSFL Y4QF_RHISN 493 493 -

Motif 2 width=25
Element Seqn Id St Int Rpt
ENGGIYAVPNIRGGGEYGKKWHDAG PPCE_FLAME 496 7 -
ENGGIYAVPNIRGGGEYGKKWHDAG PPCF_FLAME 496 7 -
DLGGVYAVANLRGGGEYGQAWHLAG PPCE_AERHY 477 7 -
HMGGVLAVANIRGGGEYGETWHKGG O70196 494 8 -
HMGGILAVANIRGGGEYGETWHKGG PPCE_HUMAN 494 8 -
HMGGVLAVANIRGGGEYGETWHKGG PPCE_PIG 494 8 -
DRGMIYAIAHVRGGGEMGRTWYEVG O76728 502 7 -
DRGVIYVIAHVRGGGEMGRAWYEVG Q94795 501 7 -
DRGMVFVVAHVRGGGEMGRLWYENG P71835 328 7 -
DRGMMFVIAHVRGGGEMGRLWYEHG O05748 499 7 -
KRGGTFVMANLRGGSEYGEEWHRAG O58991 422 7 -
KRGGTFIMANLRGGSEYGEEWHRAG Q51714 421 7 -
EKGIVFVTAQVRGGSEMGRGWYEDG PTRB_MORLA 474 7 -
DRGFVYAIVHVRGGGELGQQWYEDG PTRB_ECOLI 472 7 -
DRGVVYALAHIRGGQEMGRAWYDDG O07834 514 7 -
EKGGAYALANIRGGGEFGPKWHDAG Y4NA_RHISN 523 8 -
DRGVAFGIVHVRGGGELGRAWHEAA Y4SO_RHISN 494 13 -
ARGGTYALANIRGGGEYGPGWHTQA O07178 472 8 -
DREVAFGIVHVRGGGELGRPWHDAA Y4QF_RHISN 525 13 -

Motif 3 width=20
Element Seqn Id St Int Rpt
QKKNVFNDFIAAGEYLQKNG PPCE_FLAME 524 3 -
QKKNVFNDFIAAGEYLQKNG PPCF_FLAME 524 3 -
NKQNVFDDFIAAAEYLKAEG PPCE_AERHY 505 3 -
NKQNCFDDFQCAAEYLIKEG O70196 522 3 -
NKQNCFDDFQCAAEYLIKEG PPCE_HUMAN 522 3 -
NKQNCFDDFQCAAEYLIKEG PPCE_PIG 522 3 -
TKRNTFMDFIACAEHLISSG O76728 531 4 -
TKRNTFSDFIACAEYLIEIG Q94795 530 4 -
DKKNTFTDFIAVARHLVDTG P71835 356 3 -
EKKNTFTDFISVAKHLVDSG O05748 527 3 -
NKQNVFDDFIAVLEKLKKEG O58991 450 3 -
NKQNVFDDFIAVLEKRKKEG Q51714 449 3 -
NKRNTFTDFIAAAKHLIDQN PTRB_MORLA 502 3 -
KKKNTFNDYLDACDALLKLG PTRB_ECOLI 500 3 -
NKINTFTDFIDVTDYLVKEG O07834 542 3 -
NRQRVYDDFQAVAQDLIAKK Y4NA_RHISN 551 3 -
QKRLTHTDLIAAAECLVEHR Y4SO_RHISN 522 3 -
GRDKVAQDFAAVATDLVTRG O07178 500 3 -
QKRITHTDLISATEGLIERG Y4QF_RHISN 553 3 -

Motif 4 width=21
Element Seqn Id St Int Rpt
GRSNGGLLVGATMTMRPDLAK PPCE_FLAME 554 10 -
GRSNGGLLVGATMTMRPDLAK PPCF_FLAME 554 10 -
GGSNGGLLVGAVMTQRPDLMR PPCE_AERHY 535 10 -
GGSNGGLLVAACANQRPDLFG O70196 552 10 -
GGSNGGLLVAACANQRPDLFG PPCE_HUMAN 552 10 -
GGSNGGLLVATCANQRPDLFG PPCE_PIG 552 10 -
GRSAGGLLVGAVLNMRPDLFH O76728 561 10 -
GRSAGGLLIGAVLNMRPDLFR Q94795 560 10 -
GGSAGGLLMGAVANMAPDLFA P71835 386 10 -
GGSAGGLLMGVVANIAPDLFT O05748 557 10 -
GRSNGGLLVSATLTQRPDIMD O58991 476 6 -
GRSNGGLLVSATLTQRPDVMD Q51714 475 6 -
GGSAGGLLVGAVANMAGELFK PTRB_MORLA 532 10 -
GGSAGGMLMGVAINQRPELFH PTRB_ECOLI 530 10 -
GGSAGGLLMGAVSNMAPEKYK O07834 572 10 -
GGSNGGLLMGVQMIQRPDLWN Y4NA_RHISN 581 10 -
GRSAGGGTVLAAAVLRPDLFR Y4SO_RHISN 552 10 -
GGSNGGLLMGIMLTGYPEKFG O07178 530 10 -
GKSGGGGTVLATAVFRPNLFR Y4QF_RHISN 583 10 -

Motif 5 width=16
Element Seqn Id St Int Rpt
MFEYLKSYSPVHNVKA PPCE_FLAME 610 35 -
MFEYLKSYSPVHNVKA PPCF_FLAME 610 35 -
MFDYLKGYSPLHSVRA PPCE_AERHY 591 35 -
HFEWLLKYSPLHNVKL O70196 607 34 -
HFEWLVKYSPLHNVKL PPCE_HUMAN 607 34 -
HFEWLIKYSPLHNVKL PPCE_PIG 607 34 -
FFDYMNSYSPIDNVRA O76728 619 37 -
FFDYMNSYSPVDNVRA Q94795 618 37 -
VYAYVKSYSPYENVTA P71835 445 38 -
FYSYIKSYSPYENVEA O05748 616 38 -
DREFLLKYSPYHNVDP O58991 531 34 -
DREFLLKYSPYHNVDP Q51714 530 34 -
DYFYMKSYSPYDNVEA PTRB_MORLA 590 37 -
YYEYMKSYSPYDNVTA PTRB_ECOLI 588 37 -
YYDYILTYSPYDNLQA O07834 630 37 -
EGAFLRSISPYHNVKA Y4NA_RHISN 636 34 -
DYQYLRSYDPYYNLTP Y4SO_RHISN 610 37 -
DWKFISEYSPYQNISA O07178 585 34 -
EYRYLRSYDPYYNLSP Y4QF_RHISN 641 37 -

Motif 6 width=23
Element Seqn Id St Int Rpt
YPSTMVITSDHDDRVVPAHSFKF PPCE_FLAME 629 3 -
YPSTMVITSDHDDRVVPAHSFKF PPCF_FLAME 629 3 -
YPSTLVTTADHDDRVVPAHSFKF PPCE_AERHY 610 3 -
YPSMLLLTADHDDRVVPLHSLKF O70196 630 7 -
YPSMLLLTADHDDRVVPLHSLKF PPCE_HUMAN 630 7 -
YPSMLLLTADHDDRVVPLHSLKF PPCE_PIG 630 7 -
YPHLMIQAGLHDPRVAYWEPAKW O76728 637 2 -
YPHLMIQAGLHDPRVAYWEPAKW Q94795 636 2 -
YPAILAMTSLNDTRVYYVEPAKW P71835 463 2 -
YPAILAMTSLHDTRVHYVEPAKW O05748 634 2 -
YPLTLIYTGLHDDRVHPAHALKF O58991 550 3 -
YPPTLIYTGLHDDRVHPAHALKF Q51714 549 3 -
YPHMYITTGINDPRVGYFEPAKW PTRB_MORLA 608 2 -
YPHLLVTTGLHDSQVQYWEPAKW PTRB_ECOLI 606 2 -
YPAMFVGTGLWDSQVQYWEPAKY O07834 648 2 -
YPEPFFETSTKDDRVGPVHARKM Y4NA_RHISN 655 3 -
YPPTYIDAALHDSQVLYYQPARY Y4SO_RHISN 629 3 -
YPPVLMTTSTRDDRVHPGHARKM O07178 604 3 -
LPPTYVDAALDDGQVIYYQPARY Y4QF_RHISN 660 3 -
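
The St (start) and Int (interval) columns in the tables above appear to be related by a simple rule: the start of each element equals the start of the previous element, plus the previous motif width, plus the interval. The sketch below checks that reading for two rows copied from the Final Motifs tables. The rule itself is inferred from the numbers rather than stated in this entry, and the function name `intervals_consistent` is hypothetical.

```python
# Motif widths 1..6 as given in the table headers.
WIDTHS = [19, 25, 20, 21, 16, 23]

# (St, Int) pairs for motifs 1..6, copied from the Final Motifs tables.
ROWS = {
    "PPCE_HUMAN": [(467, 467), (494, 8), (522, 3), (552, 10), (607, 34), (630, 7)],
    "PTRB_ECOLI": [(446, 446), (472, 7), (500, 3), (530, 10), (588, 37), (606, 2)],
}


def intervals_consistent(rows: list, widths: list) -> bool:
    """Check start[i] == start[i-1] + width[i-1] + interval[i] for motifs 2..n.

    The first motif is skipped: its Int column simply repeats its St value.
    """
    for i in range(1, len(rows)):
        prev_start, _ = rows[i - 1]
        start, interval = rows[i]
        if prev_start + widths[i - 1] + interval != start:
            return False
    return True


results = {seq_id: intervals_consistent(rows, WIDTHS) for seq_id, rows in ROWS.items()}
```

Both rows satisfy the rule, so the same check could be used to catch transcription errors when copying motif tables by hand.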