GLHYDRLASE15     Glycosyl hydrolase family 15 signature
 Type of fingerprint: COMPOUND with 7  elements
Links:
   PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
   PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
   PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
   PRINTS; PR00737 GLHYDRLASE16; PR00738 GLHYDRLASE20; PR00739 GLHYDRLASE26
   PRINTS; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29; PR00843 GLHYDRLASE30
   PRINTS; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36; PR00744 GLHYDRLASE37
   PRINTS; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41; PR00747 GLHYDRLASE47
   PRINTS; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52; PR00846 GLHYDRLASE56
   PRINTS; PR00849 GLHYDRLASE58; PR00850 GLHYDRLASE59; PR00748 MELIBIASE
   PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
   PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
   INTERPRO; IPR000165
   PROSITE; PS00820 GLUCOAMYLASE
   PFAM; PF00723 glycosyl_hydr10
   PDB; 1AGM
   SCOP; 1AGM
   CATH; 1AGM

 Creation date 05-JUN-1997; UPDATE 10-JUN-1999

   1. HENRISSAT, B. AND BAIROCH, A.
   New families in the classification of glycosyl hydrolases based on amino
   acid sequence similarities.
   BIOCHEM.J. 293 781-788 (1993).

   2. HENRISSAT, B.
   A classification of glycosyl hydrolases based on amino acid sequence
   similarities.
   BIOCHEM.J. 280 309-316 (1991).

   3. DAVIES, G. AND HENRISSAT, B.
   Structures and mechanisms of glycosyl hydrolases.
   STRUCTURE 3 853-859 (1995).

   4. HENRISSAT, B. AND BAIROCH, A.
   Updating the sequence-based classification of glycosyl hydrolases.
   BIOCHEM.J. 316 695-696 (1996).

   5. SIERKS, M.R., FORD, C., REILLY, P.J. AND SVENSSON, B.
   Catalytic mechanism of fungal glucoamylase as defined by mutagenesis 
   of Asp176, Glu179 and Glu180 in the enzyme from Aspergillus awamori.
   PROTEIN ENG. 3 193-198 (1990).

   6. OHNISHI, H., KITAMURA, H., MINOWA, T., SAKAI, H. AND OHTA, T.
   Molecular-cloning of a glucoamylase gene from a thermophilic Clostridium
   and kinetics of the cloned enzyme.
   EUR.J.BIOCHEM. 207 413-418 (1992).

   7. ALESHIN, A.E., FIRSOV, L.M. AND HONZATKO, R.B.
   Refined structure for the complex of acarbose with glucoamylase from
   Aspergillus awamori var. x100 to 2.4A resolution.
   J.BIOL.CHEM. 269(22) 15631-15639 (1994).

   O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
   hydrolyse the glycosidic bond between two or more carbohydrates, or between
   a carbohydrate and a non-carbohydrate moiety. A classification system for
   glycosyl hydrolases, based on sequence similarity, has led to the definition
   of up to 60 different families [1-4]
   (http://expasy.hcuge.ch/cgi-bin/lists?glycosid.txt).
  
   Family 15 encompasses the glucoamylases (GA). GA catalyses the release of
   D-glucose from the non-reducing ends of starch and other oligo- or
   polysaccharides. Studies of fungal GA have identified three closely
   clustered acidic residues that play a role in the catalytic mechanism [5].
   This region is also conserved in a recently sequenced bacterial GA [6].
  
   The 3D structure of the pseudo-tetrasaccharide acarbose complexed with
   glucoamylase II(471) from Aspergillus awamori var. X100 has been determined
   to 2.4A resolution [7]. The protein belongs to the mainly-alpha class, and
   contains 19 helices and 9 strands.
  
   GLHYDRLASE15 is a 7-element fingerprint that provides a signature for
   family 15 glycosyl hydrolases. The fingerprint was derived from an initial
   alignment of 6 sequences: the motifs were drawn from short conserved regions 
   spanning the N-terminal half of the alignment - motif 2 includes part of 
   helix 5 and the second beta-strand; motif 3 encompasses most of helix 6;
   motif 4 spans strands 3 and 4, and includes the region encoded by PROSITE
   pattern GLUCOAMYLASE (PS00820), which contains the catalytic cluster of
   acidic residues; motif 5 includes part of helix 13; motif 6 spans the C-
   terminus of helix 17 and strand 8; and motif 7 spans helix 18. Two
   iterations on OWL29.3 were required to reach convergence, at which point
   a true set comprising 18 sequences was identified. Several partial matches
   were also found, all of which are related sequences that fail to make
   significant matches with one or more motifs.
  
   An update on SPTR37_9f identified a true set of 18 sequences, and 3
   partial matches.

  SUMMARY INFORMATION
     18 codes involving  7 elements
      0 codes involving  6 elements
      3 codes involving  5 elements
      0 codes involving  4 elements
      0 codes involving  3 elements
      0 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    7|  18   18   18   18   18   18   18  
    6|   0    0    0    0    0    0    0  
    5|   3    3    3    3    3    0    0  
    4|   0    0    0    0    0    0    0  
    3|   0    0    0    0    0    0    0  
    2|   0    0    0    0    0    0    0  
   --+------------------------------------
     |   1    2    3    4    5    6    7  
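
   The summary counts and the composite index can be cross-checked against
   each other. The sketch below assumes that each index cell counts the
   sequences matching n elements that include the motif in that column, so
   that every row should total n times the number of sequences. That reading,
   and the helper names inconsistent_rows and motifs_missed, are assumptions
   made for illustration and are not part of the PRINTS record.

```python
# Counts from SUMMARY INFORMATION: elements matched -> number of sequences.
summary = {7: 18, 6: 0, 5: 3, 4: 0, 3: 0, 2: 0}

# Rows of the COMPOSITE FINGERPRINT INDEX: elements matched -> counts for motifs 1..7.
index = {
    7: [18, 18, 18, 18, 18, 18, 18],
    6: [0, 0, 0, 0, 0, 0, 0],
    5: [3, 3, 3, 3, 3, 0, 0],
    4: [0, 0, 0, 0, 0, 0, 0],
    3: [0, 0, 0, 0, 0, 0, 0],
    2: [0, 0, 0, 0, 0, 0, 0],
}


def inconsistent_rows(summary, index):
    """Return the rows whose cell total differs from n_elements * n_sequences."""
    return [n for n, row in index.items() if sum(row) != n * summary[n]]


def motifs_missed(row, n_sequences):
    """Return 1-based motif numbers matched by fewer than all sequences of a row."""
    return [i + 1 for i, count in enumerate(row) if count < n_sequences]


print(inconsistent_rows(summary, index))      # [] (all rows agree with the summary)
print(motifs_missed(index[5], summary[5]))    # [6, 7]
```

   Under this reading, the three 5-element partial matches match motifs 1 to
   5 and fail to match motifs 6 and 7.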

True positives..
 Q12537         AMYG_ASPAK     Q92201         AMYG_ASPNG     
 AMYG_ASPSH     Q02296         AMYG_ASPOR     AMYG_NEUCR     
 Q12596         Q12623         AMYG_HORRE     O59846         
 AMYH_SACFI     AMYG_SACFI     O60087         AMYG_YEAST     
 AMYG_RHIOR     AMYG_ARXAD     
Subfamily:  Codes involving 5 elements
 Subfamily True positives..
 AMYH_SACDI     Q92314         AMYI_SACDI     


  PROTEIN TITLES
   Q12537           GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
   AMYG_ASPAK       GLUCOAMYLASE I PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUC
   Q92201           GLUCOAMYLASE G1 AND G2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-AL
   AMYG_ASPNG       GLUCOAMYLASE G1 AND G2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-AL
   AMYG_ASPSH       GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
   Q02296           GLUCOAMYLASE - ASPERGILLUS NIGER.
   AMYG_ASPOR       GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
   AMYG_NEUCR       GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
   Q12596           GLUCOAMYLASE G2 (EC 3.2.1.3) - CORTICIUM ROLFSII.
   Q12623           GLUCOAMYLASE (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOSIDASE) (1,
   AMYG_HORRE       GLUCOAMYLASE P PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUC
   O59846           GLUCOAMYLASE - ASPERGILLUS ORYZAE.
   AMYH_SACFI       GLUCOAMYLASE GLA1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA- 
   AMYG_SACFI       GLUCOAMYLASE GLU1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA- 
   O60087           GLUCOAMYLASE - SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
   AMYG_YEAST       GLUCOAMYLASE, INTRACELLULAR SPORULATION-SPECIFIC (EC 3.2.1.3
   AMYG_RHIOR       GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
   AMYG_ARXAD       GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
 
   AMYH_SACDI       GLUCOAMYLASE S1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLU
   Q92314           GLUCOAMYLASE S1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLU
   AMYI_SACDI       GLUCOAMYLASE S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLU

  SCAN HISTORY
   OWL29_3     2  100  NSINGLE
   SPTR37_9f   2  21   NSINGLE

  INITIAL MOTIF SETS

GLHYDRLASE151  Length of motif = 18  Motif number = 1
Glycosyl hydrolase family 15 motif I - 1
                       PCODE         ST  INT
   GADSGIVVASPSTDNPDY  AMYG_ASPAK    55   55
   GADSGIVVASPSTDNPDY  AMYG_ASPNG    55   55
   GASPGVVIASPSKSDPDY  AMYG_ASPOR    58   58
   GAGAGFVVASPSKANPDY  AMYG_HORRE    59   59
   GAASGVVVASPSKSSPDW  AMYG_NEUCR    65   65
   SISPGVVIASPSQTHPDY  AMYG_YEAST   108  108

GLHYDRLASE152  Length of motif = 18  Motif number = 2
Glycosyl hydrolase family 15 motif II - 1
                       PCODE         ST  INT
   LGEPKFNVDETAYTGSWG  AMYG_ASPAK   127   54
   LGEPKFNVDETAYTGSWG  AMYG_ASPNG   128   55
   LGEPKFNVDETAFTGAWG  AMYG_ASPOR   130   54
   LGEPKFMVDGTRFNGPWG  AMYG_HORRE   133   56
   LGEPKFMVDLQQFTGAWG  AMYG_NEUCR   138   55
   LGDPKWNVDNTAFTEDWG  AMYG_YEAST   182   56

GLHYDRLASE153  Length of motif = 20  Motif number = 3
Glycosyl hydrolase family 15 motif III - 1
                         PCODE         ST  INT
   RPQRDGPALRATAMIGFGQV  AMYG_ASPAK   145    0
   RPQRDGPALRATAMIGFGQW  AMYG_ASPNG   146    0
   RPQRDGPALRATAMISFGEW  AMYG_ASPOR   148    0
   RPQRDGPALRAIALMTYSNW  AMYG_HORRE   151    0
   RPQRDGPPLRAIALIGYGKW  AMYG_NEUCR   156    0
   RPQNDGPALRSIAILKIIDY  AMYG_YEAST   200    0

GLHYDRLASE154  Length of motif = 19  Motif number = 4
Glycosyl hydrolase family 15 motif IV - 1
                        PCODE         ST  INT
   WNQTGYDLWEEVNGSSFFT  AMYG_ASPAK   193   28
   WNQTGYDLWEEVNGSSFFT  AMYG_ASPNG   194   28
   WSQSGFDLWEEVQGTSFFT  AMYG_ASPOR   196   28
   WNQSGFDLWEETYASSFFT  AMYG_HORRE   199   28
   WNNTGFDLWEEVNSSSFFT  AMYG_NEUCR   204   28
   WNSSGFDLWEEVNGMHFFT  AMYG_YEAST   255   35

GLHYDRLASE155  Length of motif = 20  Motif number = 5
Glycosyl hydrolase family 15 motif V - 1
                         PCODE         ST  INT
   NDGLSDSEAVAVGRYPEDSY  AMYG_ASPAK   315  103
   NDGLSDSEAVAVGRYPEDTY  AMYG_ASPNG   316  103
   NSGRAENQAVAVGRYPEDSY  AMYG_ASPOR   318  103
   NAGIPEGQGVAVGRYAEDVY  AMYG_HORRE   322  104
   NSGRTAGKAAAVGRYAEDVY  AMYG_NEUCR   326  103
   NDSSKNATGIALGRYPEDVY  AMYG_YEAST   388  114

GLHYDRLASE156  Length of motif = 22  Motif number = 6
Glycosyl hydrolase family 15 motif VI - 1
                           PCODE         ST  INT
   ADGFVSIVETHAASNGSLSEQF  AMYG_ASPAK   404   69
   ADGFVSIVETHAASNGSMSEQY  AMYG_ASPNG   405   69
   ADGYVQIVQTYAASTGSMAEQY  AMYG_ASPOR   407   69
   ADSYVAIAEKYIPSNGSLSEQF  AMYG_HORRE   413   71
   ADGFVDIVAQYTPSDGSLAEQF  AMYG_NEUCR   415   69
   ADSFLVKLKAHVGTDGELSEQF  AMYG_YEAST   494   86

GLHYDRLASE157  Length of motif = 16  Motif number = 7
Glycosyl hydrolase family 15 motif VII - 1
                     PCODE         ST  INT
   DLTWSYAALLTANNRR  AMYG_ASPAK   437   11
   DLTWSYAALLTANNRR  AMYG_ASPNG   438   11
   DLTWSYAALLTANNRR  AMYG_ASPOR   440   11
   DLTWSYAAFITMSQRR  AMYG_HORRE   446   11
   HLTWSYASFLSAAARR  AMYG_NEUCR   448   11
   HLTWSYTSFWDAYQIR  AMYG_YEAST   527   11

  FINAL MOTIF SETS

GLHYDRLASE151  Length of motif = 18  Motif number = 1
Glycosyl hydrolase family 15 motif I - 2
                       PCODE         ST  INT
   GADSGIVVASPSTDNPDY  Q12537        55   55
   GADSGIVVASPSTDNPDY  AMYG_ASPAK    55   55
   GADSGIVVASPSTDNPDY  AMYG_ASPNG    55   55
   GADSGIVVASPSTDNPDY  Q92201        55   55
   GADSGIVVASPSTDNPDY  AMYG_ASPSH    55   55
   GADSGIVVASPSTDNPDY  Q02296        55   55
   GASPGVVIASPSKSDPDY  AMYG_ASPOR    58   58
   GAASGVVVASPSKSSPDW  AMYG_NEUCR    65   65
   GAYSGIVIASPSKTSPDY  Q12596        51   51
   GAAAGVVIASPSRTDPPY  Q12623        61   61
   GAGAGFVVASPSKANPDY  AMYG_HORRE    59   59
   GAAAGIVVASPSKSNPDY  O59846        58   58
   DGVPGTVIASPSTSNPDY  AMYH_SACFI    73   73
   NGVPGTVIASPSTSNPDY  AMYG_SACFI    73   73
   DINPGCIIASPSTDSPDY  O60087        59   59
   SISPGVVIASPSQTHPDY  AMYG_YEAST   108  108
   GSATGFIAASLSTAGPDY  AMYG_RHIOR   193  193
   GAAPGTVIAAQSYSEPDY  AMYG_ARXAD   187  187

GLHYDRLASE152  Length of motif = 18  Motif number = 2
Glycosyl hydrolase family 15 motif II - 2
                       PCODE         ST  INT
   LGEPKFNVDETAYTGSWG  Q12537       127   54
   LGEPKFNVDETAYTGSWG  AMYG_ASPAK   127   54
   LGEPKFNVDETAYTGSWG  AMYG_ASPNG   128   55
   LGEPKFNVDETAYTGSWG  Q92201       128   55
   LGEPKFNVDETAYAGSWG  AMYG_ASPSH   127   54
   LGEPKFNVDETAYTGSWG  Q02296       128   55
   LGEPKFNVDETAFTGAWG  AMYG_ASPOR   130   54
   LGEPKFMVDLQQFTGAWG  AMYG_NEUCR   138   55
   LGEPKFNIDETAFTGAWG  Q12596       124   55
   LGEAKFNVDLTAFTGEWG  Q12623       121   42
   LGEPKFMVDGTRFNGPWG  AMYG_HORRE   133   56
   LAEPKFYVNISQFTDSWG  O59846       131   55
   LGEPKFNTDGSAYTGAWG  AMYH_SACFI   150   59
   LGEPKFNTDGSAYTGAWG  AMYG_SACFI   150   59
   LGEPKFNVDGTSYDGDWG  O60087       131   54
   LGDPKWNVDNTAFTEDWG  AMYG_YEAST   182   56
   LGEPKFNPDASGYTGAWG  AMYG_RHIOR   263   52
   MGEPKFYLNNTAFTGSWG  AMYG_ARXAD   259   54

GLHYDRLASE153  Length of motif = 20  Motif number = 3
Glycosyl hydrolase family 15 motif III - 2
                         PCODE         ST  INT
   RPQRDGPALRATAMIGFGQW  Q12537       145    0
   RPQRDGPALRATAMIGFGQV  AMYG_ASPAK   145    0
   RPQRDGPALRATAMIGFGQW  AMYG_ASPNG   146    0
   RPQRDGPALRATAMIGFGQW  Q92201       146    0
   RPQRDGPALRATAMIGFGQW  AMYG_ASPSH   145    0
   RPQRDGPALRATAMIGFGQW  Q02296       146    0
   RPQRDGPALRATAMISFGEW  AMYG_ASPOR   148    0
   RPQRDGPPLRAIALIGYGKW  AMYG_NEUCR   156    0
   RPQRDGPALRATAIMTYATY  Q12596       142    0
   RPQRDGPPLRAIALIQYAKW  Q12623       139    0
   RPQRDGPALRAIALMTYSNW  AMYG_HORRE   151    0
   RPQRDGPALRASALIAYGNS  O59846       149    0
   RPQNDGPALRAYAISRYLND  AMYH_SACFI   168    0
   RPQNDGPALRAYAISRYLND  AMYG_SACFI   168    0
   RPQNDSPALRAIAFIKYMNY  O60087       149    0
   RPQNDGPALRSIAILKIIDY  AMYG_YEAST   200    0
   RPQNDGPAERATTFILFADS  AMYG_RHIOR   281    0
   RPQNDGPATRAITLIEFANA  AMYG_ARXAD   277    0

GLHYDRLASE154  Length of motif = 19  Motif number = 4
Glycosyl hydrolase family 15 motif IV - 2
                        PCODE         ST  INT
   WNQTGYDLWEEVNGSSFFT  Q12537       193   28
   WNQTGYDLWEEVNGSSFFT  AMYG_ASPAK   193   28
   WNQTGYDLWEEVNGSSFFT  AMYG_ASPNG   194   28
   WNQTGYDLWEEVNGSSFFT  Q92201       194   28
   WNQTGYDLWEEVNGSSFFT  AMYG_ASPSH   193   28
   WNQTGYDLWEVNGSSFFTI  Q02296       194   28
   WSQSGFDLWEEVQGTSFFT  AMYG_ASPOR   196   28
   WNNTGFDLWEEVNSSSFFT  AMYG_NEUCR   204   28
   WNQTTFDLWEEVDSSSFFT  Q12596       190   28
   WNETGFDLWEEVPGSSFFT  Q12623       187   28
   WNQSGFDLWEETYASSFFT  AMYG_HORRE   199   28
   WNQTGFDLWEEVQGSSFFT  O59846       197   28
   WDSTGFDLWEENQGRHFFT  AMYH_SACFI   228   40
   WDSTGFDLWEENQGRHFFT  AMYG_SACFI   228   40
   WTEASFDLWEEIKDVHYFT  O60087       197   28
   WNSSGFDLWEEVNGMHFFT  AMYG_YEAST   255   35
   WSNGCFDLWEEVNGVHFYT  AMYG_RHIOR   330   29
   WSSPSFDLWEEEESAHFYT  AMYG_ARXAD   334   37

GLHYDRLASE155  Length of motif = 20  Motif number = 5
Glycosyl hydrolase family 15 motif V - 2
                         PCODE         ST  INT
   NDGLSDSEAVAVGRYPEDSY  Q12537       315  103
   NDGLSDSEAVAVGRYPEDSY  AMYG_ASPAK   315  103
   NDGLSDSEAVAVGRYPEDTY  AMYG_ASPNG   316  103
   NDGLSDSEAVAVGRYPEDTY  Q92201       316  103
   NDGLSDSEAVAVGRYPEDSY  AMYG_ASPSH   315  103
   NDGLSDSEAVAVGRYPEDTY  Q02296       315  102
   NSGRAENQAVAVGRYPEDSY  AMYG_ASPOR   318  103
   NSGRTAGKAAAVGRYAEDVY  AMYG_NEUCR   326  103
   NSGISSTSGVATGRYPEDSY  Q12596       315  106
   NKGIAQGKAVAVGRYSEDVY  Q12623       313  107
   NAGIPEGQGVAVGRYAEDVY  AMYG_HORRE   322  104
   NNGRGAGKAAAVGPYAEDTY  O59846       319  103
   SVNSAYSAGAAIGRYPEDVY  AMYH_SACFI   359  112
   SVNSAYSAGAAIGRYPEDVY  AMYG_SACFI   359  112
   DYPVNQGWKQAMGRYPEDVY  O60087       308   92
   NDSSKNATGIALGRYPEDVY  AMYG_YEAST   388  114
   NKNLPSYLGNSIGRYPEDTY  AMYG_RHIOR   455  106
   SDESGKPLGIPVGRYPEDVY  AMYG_ARXAD   463  110

GLHYDRLASE156  Length of motif = 22  Motif number = 6
Glycosyl hydrolase family 15 motif VI - 2
                           PCODE         ST  INT
   ADGFVSIVETHAASNGSLSEQF  Q12537       404   69
   ADGFVSIVETHAASNGSLSEQF  AMYG_ASPAK   404   69
   ADGFVSIVETHAASNGSMSEQY  AMYG_ASPNG   405   69
   ADGFVSIVETHAASNGSMSEQY  Q92201       405   69
   ADGFVSIVETHAASNGSLSEQF  AMYG_ASPSH   404   69
   ADGFVSIVETHAASNGSMSEQY  Q02296       403   68
   ADGYVQIVQTYAASTGSMAEQY  AMYG_ASPOR   407   69
   ADGFVDIVAQYTPSDGSLAEQF  AMYG_NEUCR   415   69
   ADEFVDIVAKYTPSSGFLSEQY  Q12596       404   69
   ADGFIEVAAKYTPSNGALAEQY  Q12623       402   69
   ADSYVAIAEKYIPSNGSLSEQF  AMYG_HORRE   413   71
   ADGFISVVQEYTPDGGALAEQY  O59846       408   69
   GDSFLQVILDHINDDGSLNEQL  AMYH_SACFI   464   85
   GDSFLQVILDHINDDGSLNEQL  AMYG_SACFI   464   85
   ADNFLKAVAEFQHPNGSMSEQF  O60087       395   67
   ADSFLVKLKAHVGTDGELSEQF  AMYG_YEAST   494   86
   ADRFLSTVQLHAHNNGSLAEEF  AMYG_RHIOR   550   75
   GDAFMRRAKYHTPSSGHMSEEF  AMYG_ARXAD   560   77

GLHYDRLASE157  Length of motif = 16  Motif number = 7
Glycosyl hydrolase family 15 motif VII - 2
                     PCODE         ST  INT
   DLTWSYAALLTANNRR  Q12537       437   11
   DLTWSYAALLTANNRR  AMYG_ASPAK   437   11
   DLTWSYAALLTANNRR  AMYG_ASPNG   438   11
   DLTWSYAALLTANNRR  Q92201       438   11
   DLTWSYAALLTANNRR  AMYG_ASPSH   437   11
   DLTWSYAALLTANNRR  Q02296       436   11
   DLTWSYAALLTANNRR  AMYG_ASPOR   440   11
   HLTWSYASFLSAAARR  AMYG_NEUCR   448   11
   NLTWSYAAAITAYQAR  Q12596       437   11
   DLTWSYSAFLSAIDRR  Q12623       435   11
   DLTWSYAAFITMSQRR  AMYG_HORRE   446   11
   DLTWSYAAFLSAVGRR  O59846       441   11
   SLTWSSGALLEAIRLR  AMYH_SACFI   497   11
   SLTWSSGALLEAIRLR  AMYG_SACFI   497   11
   DLTWSYSSLLNAIYRR  O60087       428   11
   HLTWSYTSFWDAYQIR  AMYG_YEAST   527   11
   DLTWSHASLITASYAK  AMYG_RHIOR   583   11
   DLTWSYASLLSAAFAR  AMYG_ARXAD   593   11
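
   The ST and INT values in the motif sets appear to be related: for the
   first motif INT equals ST, and for each later motif INT equals the gap
   between the end of the previous motif and the start of the current one.
   This relationship is inferred from the numbers in this entry rather than
   stated in it, so the sketch below is only a consistency check under that
   assumption. The function name expected_intervals is hypothetical.

```python
# Motif lengths 1..7, taken from the "Length of motif" headers in this entry.
MOTIF_LENGTHS = [18, 18, 20, 19, 20, 22, 16]

# ST and INT values copied from the motif sets for two of the sequences.
ASPAK_ST = [55, 127, 145, 193, 315, 404, 437]
ASPAK_INT = [55, 54, 0, 28, 103, 69, 11]
YEAST_ST = [108, 182, 200, 255, 388, 494, 527]
YEAST_INT = [108, 56, 0, 35, 114, 86, 11]


def expected_intervals(starts, lengths):
    """Derive INT values from ST values under the assumed relationship.

    INT[0] = ST[0]; INT[k] = ST[k] - (ST[k-1] + length[k-1]) for k > 0.
    """
    intervals = [starts[0]]
    for k in range(1, len(starts)):
        intervals.append(starts[k] - (starts[k - 1] + lengths[k - 1]))
    return intervals


print(expected_intervals(ASPAK_ST, MOTIF_LENGTHS) == ASPAK_INT)  # True
print(expected_intervals(YEAST_ST, MOTIF_LENGTHS) == YEAST_INT)  # True
```

   An INT of 0, as for motif 3, then means that the motif starts immediately
   after the previous one ends.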
