SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR00735

Identifier
GLHYDRLASE8  [View Relations]  [View Alignment]  
Accession
PR00735
No. of Motifs
6
Creation Date
03-JUN-1997  (UPDATE 07-JUN-1999)
Title
Glycosyl hydrolase family 8 signature
Database References

PROSITE; PS00812 GLYCOSYL_HYDROL_F8
PFAM; PF01270 Glycosyl_hydr20
INTERPRO; IPR002037
Literature References
1. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
 
2. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
 
3. DAVIES, G. AND HENRISSAT, B.
Structures and mechanisms of glycosyl hydrolases.
STRUCTURE 3 853-859 (1995).
 
4. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
 
5. ALZARI, P.M., SOUCHON, H. AND DOMINGUEZ, R.
The crystal structure of endoglucanase celA, a family8 glycosyl hydrolase
from Clostridium thermocellum.
STRUCTURE 4(3) 265-275 (1996).

Documentation
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
glycosid.txt). Family 8 includes mostly endoglucanases, which are essential
enzymes for microbial degradation of cellulose and xylans. 
 
GLHYDRLASE8 is a 6-element fingerprint that provides a signature for
family 8 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 8 sequences: the motifs were drawn from conserved regions
spanning virtually the full alignment length - motif 2 includes the region
encoded by PROSITE pattern GLYCOSYL_HYDROL_F8 (PS00812), the first 
aspartate of which is thought to act as the nucleophile in the catalytic
mechanism [5]. A single iteration on OWL29.3 was required to reach
convergence, no further sequences being identified beyond the starting set.
Several partial matches were found, all of which are family members that
fail to make significant matches with one or more motifs.
 
An update on SPTR37_9f identified a true set of 9 sequences, and 2
partial matches.
Summary Information
   9 codes involving  6 elements
0 codes involving 5 elements
1 codes involving 4 elements
1 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
6999999
5000000
4111010
3110010
2000000
123456
True Positives
GUB_BACCI     GUN2_CLOJO    GUNA_ACEXY    GUNA_CLOTM    
GUNC_CLOCE GUNY_ERWCH GUN_BACSP O82857
YHJM_ECOLI
True Positive Partials
Codes involving 4 elements
GUN_CELUD
Codes involving 3 elements
Q44416
Sequence Titles
GUB_BACCI   BETA-GLUCANASE PRECURSOR (EC 3.2.1.73) (ENDO-BETA-1,3-1,4 GLUCANASE) - BACILLUS CIRCULANS. 
GUN2_CLOJO ENDOGLUCANASE 2 PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE 2) (CELLULASE 2) - CLOSTRIDIUM JOSUI.
GUNA_ACEXY PROBABLE ENDOGLUCANASE PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA- GLUCANASE) (CELLULASE) - ACETOBACTER XYLINUM (ACETOBACTER PASTEURIANUS).
GUNA_CLOTM ENDOGLUCANASE A PRECURSOR (EC 3.2.1.4) (EGA) (ENDO-1,4-BETA-GLUCANASE) (CELLULASE A) - CLOSTRIDIUM THERMOCELLUM.
GUNC_CLOCE ENDOGLUCANASE C PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE C) (CELLULASE C) (EGCCC) - CLOSTRIDIUM CELLULOLYTICUM.
GUNY_ERWCH MINOR ENDOGLUCANASE Y PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE Y) (CELLULASE Y) (EGY) - ERWINIA CHRYSANTHEMI.
GUN_BACSP ENDOGLUCANASE PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE) (CELLULASE) (ENDO-K) - BACILLUS SP. (STRAIN KSM-330).
O82857 ENDOGLUCANASE - ACETOBACTER XYLINUM (ACETOBACTER PASTEURIANUS).
YHJM_ECOLI HYPOTHETICAL 41.7 KD PROTEIN IN DCTA-DPPF INTERGENIC REGION PRECURSOR (F368) - ESCHERICHIA COLI.

GUN_CELUD ENDOGLUCANASE PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE) (CELLULASE) - CELLULOMONAS UDA.

Q44416 ENDOGLUCANASE - AGROBACTERIUM TUMEFACIENS.
Scan History
OWL29_3    1  50   NSINGLE    
SPTR37_9f 2 22 NSINGLE
Initial Motifs
Motif 1  width=14
Element Seqn Id St Int Rpt
TVSEGLGYGMLLAV GUN2_CLOJO 96 96 -
TVSEGLGYGLLLSV GUNC_CLOCE 96 96 -
TVSEGMGYGLLLAV GUNA_CLOTM 92 92 -
SHSEGQGYGMLFAA GUNA_ACEXY 54 54 -
SHTEGQGFAMLMAV GUNY_ERWCH 50 50 -
GTSEGQGYGMIITV GUN_BACSP 127 127 -
TVSEAHGYGMLATV GUB_BACCI 92 92 -
TTSEGQSYGMFSAL YHJM_ECOLI 52 52 -

Motif 2 width=19
Element Seqn Id St Int Rpt
ATDADEDIAVSLVFAHKKW GUN2_CLOJO 153 43 -
ATDADEDIAVSLVFAHKKW GUNC_CLOCE 153 43 -
ATDADEDIALALIFADKLW GUNA_CLOTM 150 44 -
ATDGDLLIALALGRAGKRF GUNA_ACEXY 112 44 -
ASDGDVLIAWALLKAGNKW GUNY_ERWCH 108 44 -
ATDGDLDIAYSLLLAHKQW GUN_BACSP 189 48 -
ATDGDMDIAYSLLLADKQW GUB_BACCI 154 48 -
ASDGDVWMAWSLLEAGRLW YHJM_ECOLI 114 48 -

Motif 3 width=18
Element Seqn Id St Int Rpt
NPSYFAPAWYRIFADFTG GUN2_CLOJO 215 43 -
NPSYFAPAWYRIFADFTG GUNC_CLOCE 215 43 -
NPSYFAPAWYKVYAQYTG GUNA_CLOTM 212 43 -
NLSYYVMPSLLQAFDLTA GUNA_ACEXY 176 45 -
NPSYFLFPAWRDFANRSH GUNY_ERWCH 172 45 -
RPSDWMMSHLRAFYEFTG GUN_BACSP 252 44 -
RPSDFMLNHLKAFQAATG GUB_BACCI 219 46 -
NPSYLPPTLAQYFTRFGA YHJM_ECOLI 179 46 -

Motif 4 width=12
Element Seqn Id St Int Rpt
TGLVPDWCTANG GUN2_CLOJO 255 22 -
TGLVPDWCTANG GUNC_CLOCE 255 22 -
TGLVPDWCTASG GUNA_CLOTM 254 24 -
WRLPPDWLAVNR GUNA_ACEXY 216 22 -
VGLPTDWAALNA GUNY_ERWCH 212 22 -
TGLISDFVVKNP GUN_BACSP 295 25 -
TGLLPDFVVLSG GUB_BACCI 262 25 -
KGFSPDWVRYEK YHJM_ECOLI 215 18 -

Motif 5 width=10
Element Seqn Id St Int Rpt
FYYDAIRYQW GUN2_CLOJO 276 9 -
FYYDAIRYQW GUNC_CLOCE 276 9 -
YKYDATRYGW GUNA_CLOTM 275 9 -
FSYDAIRVPL GUNA_ACEXY 242 14 -
FSYDAIRIPL GUNY_ERWCH 237 13 -
YYYNASRVPL GUN_BACSP 324 17 -
YDYNSCRTPW GUB_BACCI 292 18 -
SSYDAIRVYM YHJM_ECOLI 240 13 -

Motif 6 width=15
Element Seqn Id St Int Rpt
YFGNTLRMMILLYTT GUN2_CLOJO 372 86 -
YFGNTLRMMVLLYTT GUNC_CLOCE 372 86 -
YYGNSLRLLTLLYIT GUNA_CLOTM 371 86 -
YYSAALTLLVYIARA GUNA_ACEXY 323 71 -
YYSSSLRLLVMLARG GUNY_ERWCH 317 70 -
YFSDSYNLLTMLFLT GUN_BACSP 420 86 -
YYEDSIKLFSMIVMS GUB_BACCI 389 87 -
YYNYVLTLFGQGWDQ YHJM_ECOLI 331 81 -
Final Motifs
Motif 1  width=14
Element Seqn Id St Int Rpt
TVSEGLGYGMLLAV GUN2_CLOJO 96 96 -
TVSEGLGYGLLLSV GUNC_CLOCE 96 96 -
TVSEGMGYGLLLAV GUNA_CLOTM 92 92 -
SHSEGQGYGMLFAA GUNA_ACEXY 54 54 -
SHSEGQGYGMLFSA O82857 53 53 -
SHTEGQGFAMLMAV GUNY_ERWCH 50 50 -
GTSEGQGYGMIITV GUN_BACSP 127 127 -
TVSEAHGYGMLATV GUB_BACCI 92 92 -
TTSEGQSYGMFSAL YHJM_ECOLI 52 52 -

Motif 2 width=19
Element Seqn Id St Int Rpt
ATDADEDIAVSLVFAHKKW GUN2_CLOJO 153 43 -
ATDADEDIAVSLVFAHKKW GUNC_CLOCE 153 43 -
ATDADEDIALALIFADKLW GUNA_CLOTM 150 44 -
ATDGDLLIALALGRAGKRF GUNA_ACEXY 112 44 -
ATDGDLLIALALAWAGKRW O82857 111 44 -
ASDGDVLIAWALLKAGNKW GUNY_ERWCH 108 44 -
ATDGDLDIAYSLLLAHKQW GUN_BACSP 189 48 -
ATDGDMDIAYSLLLADKQW GUB_BACCI 154 48 -
ASDGDVWMAWSLLEAGRLW YHJM_ECOLI 114 48 -

Motif 3 width=18
Element Seqn Id St Int Rpt
NPSYFAPAWYRIFADFTG GUN2_CLOJO 215 43 -
NPSYFAPAWYRIFADFTG GUNC_CLOCE 215 43 -
NPSYFAPAWYKVYAQYTG GUNA_CLOTM 212 43 -
NLSYYVMPSLLQAFDLTA GUNA_ACEXY 176 45 -
NLSYYVMPSLMQAFALTG O82857 175 45 -
NPSYFLFPAWRDFANRSH GUNY_ERWCH 172 45 -
RPSDWMMSHLRAFYEFTG GUN_BACSP 252 44 -
RPSDFMLNHLKAFQAATG GUB_BACCI 219 46 -
NPSYLPPTLAQYFTRFGA YHJM_ECOLI 179 46 -

Motif 4 width=12
Element Seqn Id St Int Rpt
TGLVPDWCTANG GUN2_CLOJO 255 22 -
TGLVPDWCTANG GUNC_CLOCE 255 22 -
TGLVPDWCTASG GUNA_CLOTM 254 24 -
WRLPPDWLAVNR GUNA_ACEXY 216 22 -
WKLPPDWLSINL O82857 215 22 -
VGLPTDWAALNA GUNY_ERWCH 212 22 -
TGLISDFVVKNP GUN_BACSP 295 25 -
TGLLPDFVVLSG GUB_BACCI 262 25 -
KGFSPDWVRYEK YHJM_ECOLI 215 18 -

Motif 5 width=10
Element Seqn Id St Int Rpt
FYYDAIRYQW GUN2_CLOJO 276 9 -
FYYDAIRYQW GUNC_CLOCE 276 9 -
YKYDATRYGW GUNA_CLOTM 275 9 -
FSYDAIRVPL GUNA_ACEXY 242 14 -
FSYDAIRVPL O82857 241 14 -
FSYDAIRIPL GUNY_ERWCH 237 13 -
YYYNASRVPL GUN_BACSP 324 17 -
YDYNSCRTPW GUB_BACCI 292 18 -
SSYDAIRVYM YHJM_ECOLI 240 13 -

Motif 6 width=15
Element Seqn Id St Int Rpt
YFGNTLRMMILLYTT GUN2_CLOJO 372 86 -
YFGNTLRMMVLLYTT GUNC_CLOCE 372 86 -
YYGNSLRLLTLLYIT GUNA_CLOTM 371 86 -
YYSAALTLLVYIARA GUNA_ACEXY 323 71 -
YYSAALTMLAYIARN O82857 322 71 -
YYSSSLRLLVMLARG GUNY_ERWCH 317 70 -
YFSDSYNLLTMLFLT GUN_BACSP 420 86 -
YYEDSIKLFSMIVMS GUB_BACCI 389 87 -
YYNYVLTLFGQGWDQ YHJM_ECOLI 331 81 -