WORKLIST ENTRIES (1):

GLHYDRLASE16 View alignment     Glycosyl hydrolase family 16 signature
 Type of fingerprint: COMPOUND with 8  elements
Links:
   PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
   PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
   PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
   PRINTS; PR00736 GLHYDRLASE15; PR00738 GLHYDRLASE20; PR00739 GLHYDRLASE26
   PRINTS; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29; PR00843 GLHYDRLASE30
   PRINTS; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36; PR00744 GLHYDRLASE37
   PRINTS; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41; PR00747 GLHYDRLASE47
   PRINTS; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52; PR00846 GLHYDRLASE56
   PRINTS; PR00849 GLHYDRLASE58; PR00850 GLHYDRLASE59; PR00748 MELIBIASE
   PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
   PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
   INTERPRO; IPR000757
   PROSITE; PS01034 GLYCOSYL_HYDROL_F16
   PFAM; PF00722 glycosyl_hydro9
   PDB; 1AYH 3Dinfo
   SCOP; 1CPM
   CATH; 1AYH

 Creation date 05-JUN-1997; UPDATE 21-JUN-1999

   1. HENRISSAT, B. AND BAIROCH, A.
   New families in the classification of glycosyl hydrolases based on amino
   acid sequence similarities.
   BIOCHEM.J. 293 781-788 (1993).

   2. HENRISSAT, B.
   A classification of glycosyl hydrolases based on amino acid sequence
   similarities.
   BIOCHEM.J. 280 309-316 (1991).

   3. DAVIES, G. AND HENRISSAT, B.
   Structures and mechanisms of glycosyl hydrolases.
   STRUCTURE 3 853-859 (1995).

   4. HENRISSAT, B. AND BAIROCH, A.
   Updating the sequence-based classification of glycosyl hydrolases.
   BIOCHEM.J. 316 695-696 (1996).

   5. HAHN, M., KEITEL, T. AND HEINEMANN, U.
   Crystal and molecular structure at 0.16nm resolution of the hybrid Bacillus
   endo-1,3-1,4-beta-D-glucan 4-glucanohydrolase H(A16-M).
   EUR.J.BIOCHEM. 232(3) 849-858 (1995).

   6. JUNCOSA, M., PONS, J., DOT, T., QUEROL, E., PLANAS, A.
   Identification of active-site carboxylic residues in Bacillus licheniformis
   1,3-1,4-beta-d-glucan 4-glucanohydrolase by site-directed mutagenesis.
   J.BIOL.CHEM. 269 14530-14535 (1994).

   O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
   hydrolyse the glycosidic bond between two or more carbohydrates, or between
   a carbohydrate and a non-carbohydrate moiety. A classification system for
   glycosyl hydrolases, based on sequence similarity, has led to the definition
   of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
   glycosid.txt).
  
   Family 16 mainly encompasses bacterial beta-glucanases. These enzymes
   hydrolyse 1,4-beta-D-glycosidic linkages in beta-D-glucans containing
   1,3- and 1,4-bonds.
  
   The 3D structure of a hybrid endo-1,3-1,4-beta-D-glucan 4-glucanohydrolase
   from Bacillus has been refined to 1.6A [5]. The protein belongs to the
   mainly-beta class, with 16 strands and 3 helices; the fold reveals a
   compact jellyroll-type beta-sheet structure that comprises two major
   anti-parallel beta-sheets and a 3-stranded minor mixed sheet. Amino acid
   residues involved in catalysis (two closely-clustered conserved glutamates
   [6]) and substrate binding are located inside a deep channel spanning the
   surface of the protein [5]. 
  
   GLHYDRLASE16 is an 8-element fingerprint that provides a signature for
   family 16 glycosyl hydrolases. The fingerprint was derived from an initial
   alignment of 7 sequences: the motifs were drawn from short conserved regions 
   spanning the N-terminal half of the alignment - motif 1 spans beta-strands
   6 and 7; motif 2 includes strand 8; motif 3 spans strand 9 and includes the
   region encoded by PROSITE pattern GLYCOSYL_HYDROL_F16 (PS01034), which
   contains the catalytic cluster of acidic residues; motif 4 includes strand
   10; motif 5 spans helix 2 and strand 12; motif 6 spans strands 13 and 14;
   motif 7 spans strand 15 and helix 3; and motif 7 includes strand 16. Two
   iterations on OWL29.3 were required to reach convergence, at which point
   a true set comprising 25 sequences was identified. Several partial matches
   were also found, all of which are related sequences that fail to make
   significant matches with one or more motifs.
  
   An update on SPTR37_9f identified a true set of 12 sequences, and 11
   partial matches.

  SUMMARY INFORMATION
     12 codes involving  8 elements
      0 codes involving  7 elements
      0 codes involving  6 elements
      1 codes involving  5 elements
      0 codes involving  4 elements
      0 codes involving  3 elements
     10 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    8|  12   12   12   12   12   12   12   12  
    7|   0    0    0    0    0    0    0    0  
    6|   0    0    0    0    0    0    0    0  
    5|   1    1    1    1    0    1    0    0  
    4|   0    0    0    0    0    0    0    0  
    3|   0    0    0    0    0    0    0    0  
    2|   4    1    5    1    5    4    0    0  
   --+-----------------------------------------
     |   1    2    3    4    5    6    7    8  

True positives..
 GUB_BACSU      GUB_PAEPO      GUB_PAEMA      Q45691         
 GUB_BACAM      GUB_BACLI      Q45648         GUB_BACBR      
 O14412         GUB_CLOTM      O07856         XYND_RUMFL     
Subfamily:  Codes involving 5 elements
 Subfamily True positives..
 EXOK_RHIME     
Subfamily:  Codes involving 2 elements
 Subfamily True positives..
 GUB_FIBSU      Q39099         Q51333         O80803         
 O68641         YG46_YEAST     P93670         CGKA_ALTCA     
 Q59328         P93669         


  PROTEIN TITLES
   GUB_BACSU        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   GUB_PAEPO        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   GUB_PAEMA        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   Q45691           ENDO-BETA-1,3-1,4-GLUCANASE - BACILLUS SUBTILIS.
   GUB_BACAM        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   GUB_BACLI        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   Q45648           LICHENASE PRECURSOR - BACILLUS SP.
   GUB_BACBR        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   O14412           BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   GUB_CLOTM        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   O07856           BETA-(1,3-1,4)-GLUCANASE PRECURSOR - STREPTOCOCCUS BOVIS.
   XYND_RUMFL       XYLANASE/BETA-GLUCANASE PRECURSOR [INCLUDES: ENDO-1,4-BETA-X
 
   EXOK_RHIME       SUCCINOGLYCAN BIOSYNTHESIS PROTEIN EXOK - RHIZOBIUM MELILOTI
 
   GUB_FIBSU        BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GL
   Q39099           ENDO-XYLOGLUCAN TRANSFERASE - ARABIDOPSIS THALIANA (MOUSE-EA
   Q51333           BETA-1,3-GLUCANASE IIA - OERSKOVIA XANTHINEOLYTICA.
   O80803           T8F5.9 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O68641           BETA-1,3-GLUCANASE II - OERSKOVIA XANTHINEOLYTICA.
   YG46_YEAST       HYPOTHETICAL 52.8 KD PROTEIN IN BUB1-HIP1 INTERGENIC REGION 
   P93670           XYLOGLUCAN ENDOTRANSGLYCOSYLASE (XET) - HORDEUM VULGARE (BAR
   CGKA_ALTCA       KAPPA-CARRAGEENASE PRECURSOR (EC 3.2.1.83) - ALTEROMONAS CAR
   Q59328           ENDO-1,3(4)-BETA-GLUCANASE (EC 3.2.1.6) (ENDO-1,4-BETA-GLUCA
   P93669           XYLOGLUCAN ENDOTRANSGLYCOSYLASE-LIKE PROTEIN - HORDEUM VULGA

SCAN HISTORY OWL29_3 2 100 NSINGLE SPTR37_9f 2 100 NSINGLE INITIAL MOTIF SETS GLHYDRLASE161 Length of motif = 19 Motif number = 1 Glycosyl hydrolase family 16 motif I - 1 PCODE ST INT AEYRSTNIYGYGLYEVSMK GUB_BACMA 85 85 GENRSVQTYGYGLYEVRMK GUB_BACAM 87 87 GEYRSTNNYGYGLYEVSMK GUB_BACPO 86 86 GENRSVQTYGYGLYEVRMK GUB_BACSU 90 90 GENRSVQTYGYGLYEVNMK GUB_BACLI 91 91 GELRTNDFYHYGLFEVSMK GUB_BACBR 97 97 GEYRTKSFFGYGYYEVRMK GUB_CLOTM 93 93 GLHYDRLASE162 Length of motif = 17 Motif number = 2 Glycosyl hydrolase family 16 motif II - 1 PCODE ST INT PAKNTGIVSSFFTYTGP GUB_BACMA 104 0 PAKNTGIVSSFFTYTGP GUB_BACAM 106 0 PAKNTGIVSSFFTYTGP GUB_BACPO 105 0 PAKNTGIVSSFFTYTGP GUB_BACSU 109 0 PAKNVGIVSSFFTYTGP GUB_BACLI 110 0 PAKVEGTVSSFFTYTGE GUB_BACBR 116 0 AAKNVGIVSSFFTYTGP GUB_CLOTM 112 0 GLHYDRLASE163 Length of motif = 16 Motif number = 3 Glycosyl hydrolase family 16 motif III - 1 PCODE ST INT QWDEIDIEFLGKDTTK GUB_BACMA 125 4 PWDEIDIEFLGKDTTK GUB_BACAM 127 4 QWDEIDIEFLGKDTTK GUB_BACPO 126 4 PWDEIDIEFLGKDTTK GUB_BACSU 130 4 PWDEIDIEFLGKDTTK GUB_BACLI 131 4 PWDEIDIEFLGKDTTR GUB_BACBR 139 6 PWDEIDIEFLGKDTTK GUB_CLOTM 133 4 GLHYDRLASE164 Length of motif = 14 Motif number = 4 Glycosyl hydrolase family 16 motif IV - 1 PCODE ST INT VQFNYYTNGVGGHE GUB_BACMA 141 0 VQFNYYTNGAGNHE GUB_BACAM 143 0 VQFNYYTNGVGGHE GUB_BACPO 142 0 VQFNYYTNGAGNHE GUB_BACSU 146 0 VQFNYYTNGVGNHE GUB_BACLI 147 0 IQFNYFTNGVGGNE GUB_BACBR 155 0 VQFNWYKNGVGGNE GUB_CLOTM 149 0 GLHYDRLASE165 Length of motif = 18 Motif number = 5 Glycosyl hydrolase family 16 motif V - 1 PCODE ST INT LGFDASKGFHTYAFDWQP GUB_BACMA 159 4 LGFDAANAYHTYAFDWQP GUB_BACAM 161 4 LGFDASTSFHTYAFDWQP GUB_BACPO 160 4 LGFDAANAYHTYAFDWQP GUB_BACSU 164 4 LGFDAANSYHTYAFDWQP GUB_BACLI 165 4 LGFDASESFNTYAFEWRE GUB_BACBR 173 4 LGFDASQDFHTYGFEWRP GUB_CLOTM 167 4 GLHYDRLASE166 Length of motif = 18 Motif number = 6 Glycosyl hydrolase family 16 motif VI - 1 PCODE ST INT IKWYVDGVLKHTATANIP GUB_BACMA 179 2 IKWYVDGQLKHTATTQIP GUB_BACAM 181 2 IKWYVDGVLKHTATTNIP GUB_BACPO 180 2 IKWYVDGQLKHTATNQIP GUB_BACSU 184 2 IKWYVDGQLKHTATTQIP GUB_BACLI 185 2 ITWYVNGEAVHTATENIP GUB_BACBR 193 2 IDFYVDGKKVYRGTRNIP GUB_CLOTM 187 2 GLHYDRLASE167 Length of motif = 23 Motif number = 7 Glycosyl hydrolase family 16 motif VII - 1 PCODE ST INT PGKIMMNLWNGTGVDDWLGSYNG GUB_BACMA 199 2 PGKIMMNLWNGTGVDDWLGSYNG GUB_BACAM 201 2 PGKIMMNLWNGTGVDSWLGSYNG GUB_BACPO 200 2 PGKIMMNLWNGTGVDEWLGSYNG GUB_BACSU 204 2 PGKIMMNLWNGAGVDEWLGSYNG GUB_BACLI 205 2 PQKIMMNLWPGVGVDGWTGVFDG GUB_BACBR 213 2 PGKIMMNLWPGIGVDEWLGRYDG GUB_CLOTM 207 2 GLHYDRLASE168 Length of motif = 13 Motif number = 8 Glycosyl hydrolase family 16 motif VIII - 1 PCODE ST INT NPLYAEYDWVKYT GUB_BACMA 223 1 NPIYAHYDWMRYR GUB_BACAM 225 1 NPLYAEYDWVKYT GUB_BACPO 224 1 NPLYAHYDWVRYT GUB_BACSU 228 1 TPLSRSLHWVRYT GUB_BACLI 229 1 TPVYSYYDWVRYT GUB_BACBR 238 2 TPLQAEYEYVKYY GUB_CLOTM 231 1 FINAL MOTIF SETS GLHYDRLASE161 Length of motif = 19 Motif number = 1 Glycosyl hydrolase family 16 motif I - 2 PCODE ST INT GENRSVQTYGYGLYEVRMK GUB_BACSU 90 90 GEYRSTNNYGYGLYEVSMK GUB_PAEPO 86 86 AEYRSTNIYGYGLYEVSMK GUB_PAEMA 85 85 GENRSVQTYGYGLYEVRMK Q45691 87 87 GENRSVQTYGYGLYEVRMK GUB_BACAM 87 87 GENRSVQTYGYGLYEVNMK GUB_BACLI 91 91 GELRTNDFYHYGLFEVSMK Q45648 99 99 GELRTNDFYHYGLFEVSMK GUB_BACBR 97 97 GEYRTKNYYGYGMFQVNMK O14412 91 91 GEYRTKSFFGYGYYEVRMK GUB_CLOTM 93 93 GEWRSKERFGYGLFQVNMK O07856 84 84 GEFRTNNFYHYGYYECSMQ XYND_RUMFL 637 637 GLHYDRLASE162 Length of motif = 17 Motif number = 2 Glycosyl hydrolase family 16 motif II - 2 PCODE ST INT PAKNTGIVSSFFTYTGP GUB_BACSU 109 0 PAKNTGIVSSFFTYTGP GUB_PAEPO 105 0 PAKNTGIVSSFFTYTGP GUB_PAEMA 104 0 PAKNTGIVSSFFTYTGP Q45691 106 0 PAKNTGIVSSFFTYTGP GUB_BACAM 106 0 PAKNVGIVSSFFTYTGP GUB_BACLI 110 0 PAKSTGTVSSFFTYTGP Q45648 118 0 PAKVEGTVSSFFTYTGE GUB_BACBR 116 0 PIKNPGVVSSFFTYTGP O14412 110 0 AAKNVGIVSSFFTYTGP GUB_CLOTM 112 0 PIKNPGVVSSFFTYTGP O07856 103 0 AMKNDGVVSSFFTYTGP XYND_RUMFL 656 0 GLHYDRLASE163 Length of motif = 16 Motif number = 3 Glycosyl hydrolase family 16 motif III - 2 PCODE ST INT PWDEIDIEFLGKDTTK GUB_BACSU 130 4 QWDEIDIEFLGKDTTK GUB_PAEPO 126 4 QWDEIDIEFLGKDTTK GUB_PAEMA 125 4 PWDEIDIEFLGKDTTK Q45691 127 4 PWDEIDIEFLGKDTTK GUB_BACAM 127 4 PWDEIDIEFLGKDTTK GUB_BACLI 131 4 PWDEIDIEFLGKDTTK Q45648 141 6 PWDEIDIEFLGKDTTR GUB_BACBR 139 6 KWDEIDIEFLGYDTTK O14412 131 4 PWDEIDIEFLGKDTTK GUB_CLOTM 133 4 KWDEIDIEFLGKDTTK O07856 124 4 PWDEIDIEILGKNTTQ XYND_RUMFL 677 4 GLHYDRLASE164 Length of motif = 14 Motif number = 4 Glycosyl hydrolase family 16 motif IV - 2 PCODE ST INT VQFNYYTNGAGNHE GUB_BACSU 146 0 VQFNYYTNGVGGHE GUB_PAEPO 142 0 VQFNYYTNGVGGHE GUB_PAEMA 141 0 VQFNYYTNGAENHE Q45691 143 0 VQFNYYTNGAGNHE GUB_BACAM 143 0 VQFNYYTNGVGNHE GUB_BACLI 147 0 IQFNYFTNGVGGNE Q45648 157 0 IQFNYFTNGVGGNE GUB_BACBR 155 0 VQFNYYTNGQGHHE O14412 147 0 VQFNWYKNGVGGNE GUB_CLOTM 149 0 VQFNYYTSGQGNHE O07856 140 0 VQFNYYTNGQGKHE XYND_RUMFL 693 0 GLHYDRLASE165 Length of motif = 18 Motif number = 5 Glycosyl hydrolase family 16 motif V - 2 PCODE ST INT LGFDAANAYHTYAFDWQP GUB_BACSU 164 4 LGFDASTSFHTYAFDWQP GUB_PAEPO 160 4 LGFDASKGFHTYAFDWQP GUB_PAEMA 159 4 LGFDAANAYHTYAFDWQP Q45691 161 4 LGFDAANAYHTYAFDWQP GUB_BACAM 161 4 LGFDAANSYHTYAFDWQP GUB_BACLI 165 4 LGFDAADDFNTYAFEWRP Q45648 175 4 LGFDASESFNTYAFEWRE GUB_BACBR 173 4 LGFDASQGFHTYGFFWAR O14412 165 4 LGFDASQDFHTYGFEWRP GUB_CLOTM 167 4 LGFDASQGFHTYGFDWQA O07856 158 4 LGFDSSEAYHTYGFDWQP XYND_RUMFL 711 4 GLHYDRLASE166 Length of motif = 18 Motif number = 6 Glycosyl hydrolase family 16 motif VI - 2 PCODE ST INT IKWYVDGQLKHTATNQIP GUB_BACSU 184 2 IKWYVDGVLKHTATTNIP GUB_PAEPO 180 2 IKWYVDGVLKHTATANIP GUB_PAEMA 179 2 IKWYVDWQLKHTATTQIP Q45691 181 2 IKWYVDGQLKHTATTQIP GUB_BACAM 181 2 IKWYVDGQLKHTATTQIP GUB_BACLI 185 2 IRWFVNGELVHTATENIP Q45648 195 2 ITWYVNGEAVHTATENIP GUB_BACBR 193 2 ITWYVDGTAVYTAYDNIP O14412 185 2 IDFYVDGKKVYRGTRNIP GUB_CLOTM 187 2 ITWYVDGRAVYTAYNNIP O07856 178 2 IAWYVDGREVYRATQDIP XYND_RUMFL 731 2 GLHYDRLASE167 Length of motif = 23 Motif number = 7 Glycosyl hydrolase family 16 motif VII - 2 PCODE ST INT PGKIMMNLWNGTGVDEWLGSYNG GUB_BACSU 204 2 PGKIMMNLWNGTGVDSWLGSYNG GUB_PAEPO 200 2 PGKIMMNLWNGTGVDDWLGSYNG GUB_PAEMA 199 2 PGKIMMNLWNGTGVSDWLGSYNG Q45691 201 2 PGKIMMNLWNGTGVDDWLGSYNG GUB_BACAM 201 2 PGKIMMNLWNGAGVDEWLGSYNG GUB_BACLI 205 2 PQKIMMNLWPGIGVDGWTGRFNG Q45648 215 2 PQKIMMNLWPGVGVDGWTGVFDG GUB_BACBR 213 2 PGKIMMNAWNGIGVDDWLRPFNG O14412 205 2 PGKIMMNLWPGIGVDEWLGRYDG GUB_CLOTM 207 2 PGKIMMNAWPGTHEVDSWLGAYN O07856 198 2 PGKIMMNAWPGLTVDDWLKAFNG XYND_RUMFL 751 2 GLHYDRLASE168 Length of motif = 13 Motif number = 8 Glycosyl hydrolase family 16 motif VIII - 2 PCODE ST INT NPLYAHYDWVRYT GUB_BACSU 228 1 NPLYAEYDWVKYT GUB_PAEPO 224 1 NPLYAEYDWVKYT GUB_PAEMA 223 1 NPLYAHYDWVRYT Q45691 225 1 NPIYAHYDWMRYR GUB_BACAM 225 1 TPLSRSLHWVRYT GUB_BACLI 229 1 TPVVTQYDWVKYT Q45648 240 2 TPVYSYYDWVRYT GUB_BACBR 238 2 TNISAYYDWVSYD O14412 229 1 TPLQAEYEYVKYY GUB_CLOTM 231 1 TPLYAYYDWISYD O07856 223 2 TPLTAHYQWVTYN XYND_RUMFL 775 1

User query: Display/Full Code "GLHYDRLASE16"