PR00844

Identifier
GLHYDRLASE48
Accession
PR00844
No. of Motifs
9
Creation Date
20-FEB-1998  (UPDATE 07-JUN-1999)
Title
Glycosyl hydrolase family 48 signature
Database References

INTERPRO; IPR000556
Literature References
1. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence 
similarities.
BIOCHEM.J. 280 309-316 (1991).
 
2. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
 
3. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
 
4. EL HASSOUNI, M., HENRISSAT, B., CHIPPAUX, M. AND BARRAS, F.
Nucleotide sequences of the Arb genes, which control beta-glucoside
utilisation in Erwinia chrysanthemi - Comparison with the Escherichia
coli Bgl operon and evidence for a new beta-glycohydrolase family
including enzymes from eubacteria, archaebacteria and humans.
J.BACTERIOL. 174 765-777 (1992).
 
5. TE'O, V.S., SAUL, D.J. AND BERGQUIST, P.L.
CelA, another gene coding for a multidomain cellulase from the extreme
thermophile Caldocellum saccharolyticum.
APPL.MICROBIOL.BIOTECHNOL. 43 291-296 (1995). 

Documentation
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4]
(http://expasy.hcuge.ch/cgi-bin/lists?glycosid.txt). Family 48 encompasses
endoglucanases (EC 3.2.1.4) and cellobiohydrolases (EC 3.2.1.91).
 
The largest cellulase gene sequenced to date is celA, from the genome of the
thermophilic anaerobic bacterium Caldocellum saccharolyticum [5]. The celA
gene product is a polypeptide of 1751 amino
acids; this has a multidomain structure comprising two catalytic domains 
and two cellulose-binding domains, linked by Pro-Thr-rich regions (so-called
PT linkers) [5]. The N-terminal domain has endoglucanase activity on
carboxymethylcellulose, consistent with its similarity to several
endo-1,4-beta-D-glucanase sequences. The C-terminal domain shows similarity to a
cellulase from Clostridium thermocellum (CelS), which acts synergistically
with a second component to hydrolyse crystalline cellulose [5]. 
 
GLHYDRLASE48 is a 9-element fingerprint that provides a signature for
family 48 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 5 sequences: the motifs were drawn from conserved regions 
spanning the C-terminal portion of the alignment, which corresponds to the
second catalytic domain, downstream from the Pro-Thr-rich region in celA 
from C.saccharolyticum. Two iterations on OWL30.0 were required to reach
convergence, at which point a true set comprising 6 sequences was identified.
 
An update on SPTR37_9f identified a true set of 7 sequences, and 1
partial match.
Summary Information
7 codes involving 9 elements
1 codes involving 8 elements
0 codes involving 7 elements
0 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
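   9|  7  7  7  7  7  7  7  7  7
   8|  1  0  1  1  1  1  1  1  1
   7|  0  0  0  0  0  0  0  0  0
   6|  0  0  0  0  0  0  0  0  0
   5|  0  0  0  0  0  0  0  0  0
   4|  0  0  0  0  0  0  0  0  0
   3|  0  0  0  0  0  0  0  0  0
   2|  0  0  0  0  0  0  0  0  0
  --+----------------------------
    |  1  2  3  4  5  6  7  8  9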
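
The composite feature index tabulates, for each number of matched elements
(rows), how many sequences matched each motif (columns). The Python sketch
below reproduces that tabulation under stated assumptions: the function name
and the input dictionary are hypothetical, and motif 2 is taken to be the
element missing from the partial match O65986 because that is what the row
for 8 elements implies; the entry does not state it elsewhere.

```python
def composite_feature_index(matches, n_motifs):
    """Tabulate motif usage per number of matched elements.

    matches: dict mapping a sequence code to the set of motif numbers
             (1-based) that the sequence matches.
    Returns a dict: number of matched elements -> list of per-motif counts.
    """
    # One row for each possible number of matched elements, from n_motifs to 2.
    table = {k: [0] * n_motifs for k in range(n_motifs, 1, -1)}
    for motifs in matches.values():
        k = len(motifs)
        if k in table:
            for m in motifs:
                table[k][m - 1] += 1
    return table


ALL_MOTIFS = set(range(1, 10))

# The 7 true positives match all 9 elements.
matches = {
    code: set(ALL_MOTIFS)
    for code in ["GUNA_CALSA", "GUNF_CLOCE", "GUNS_CLOTM", "GUX2_CLOSR",
                 "GUXB_CELFI", "O82831", "O86728"]
}
# Assumption: the partial match lacks motif 2 (inferred from the index row).
matches["O65986"] = ALL_MOTIFS - {2}

cfi = composite_feature_index(matches, 9)
for k in sorted(cfi, reverse=True):
    print(k, " ".join(str(c) for c in cfi[k]))
```

The rows for 9 and 8 elements printed by this sketch correspond to the 7 true
positives and the single partial match listed in the Summary Information.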
True Positives
GUNA_CALSA    GUNF_CLOCE    GUNS_CLOTM    GUX2_CLOSR
GUXB_CELFI    O82831        O86728
True Positive Partials
Codes involving 8 elements
O65986
Sequence Titles
GUNA_CALSA  ENDOGLUCANASE A PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE A) (CELLULASE A) - CALDOCELLUM SACCHAROLYTICUM (CALDICELLULOSIRUPTOR SACCHAROLYTICUS). 
GUNF_CLOCE ENDOGLUCANASE F PRECURSOR (EC 3.2.1.4) (ENDO-1,4-BETA-GLUCANASE F) (CELLULASE F) (EGCCF) - CLOSTRIDIUM CELLULOLYTICUM.
GUNS_CLOTM ENDOGLUCANASE SS PRECURSOR (EC 3.2.1.4) (EGSS) (ENDO-1,4-BETA- GLUCANASE) (CELLULASE SS) - CLOSTRIDIUM THERMOCELLUM.
GUX2_CLOSR EXOGLUCANASE II PRECURSOR (EC 3.2.1.91) (EXOCELLOBIOHYDROLASE II) (1,4-BETA-CELLOBIOHYDROLASE II) (AVICELASE II) - CLOSTRIDIUM STERCORARIUM.
GUXB_CELFI EXOGLUCANASE B PRECURSOR (EC 3.2.1.91) (EXOCELLOBIOHYDROLASE B) (1,4-BETA-CELLOBIOHYDROLASE B) (CBP120) - CELLULOMONAS FIMI.
O82831 EXOGLUCANASE - CLOSTRIDIUM JOSUI.
O86728 PUTATIVE SECRETED CELLULASE - STREPTOMYCES COELICOLOR.

O65986 EXOGLUCANASE S (EC 3.2.1.91) - CLOSTRIDIUM CELLULOVORANS.
Scan History
OWL30_0    2  75  NSINGLE
SPTR37_9f  2  9   NSINGLE
Initial Motifs
Motif 1  width=26
Element Seqn Id St Int Rpt
GIPYHSIETLMVEAPDYGHVTTSEAM GUNF_CLOCE 61 61 -
GIPYHSVETLICEAPDYGHLTTSEAF GUNA_CALSA 1140 1140 -
GIPYHSIETLIVEAPDYGHVTTSEAF GUNS_CLOTM 64 64 -
GIPYHAVETLMVEAPDYGHETTSEAY GUXB_CELFI 82 82 -
GIPYHSVETLICERPDYGHLTTSEAF ATZ86105 1109 1109 -
GIPYHAVETLIVEAPDYGHLTTSEAM GUX2_CLOSR 63 63 -

Motif 2 width=18
Element Seqn Id St Int Rpt
SYWLWLEALYGQVTQDWA GUXB_CELFI 108 0 -
SYYVWLEAVYGKLTGDWS ATZ86105 1135 0 -
SYYVWLEAVYGKLTGDWS GUNA_CALSA 1166 0 -
SYYMWLEAMHGRFSGDFT GUNF_CLOCE 87 0 -
SYYLWLEALYGKFTGDFS GUX2_CLOSR 89 0 -
SYYVWLEAMYGNLTGNWS GUNS_CLOTM 90 0 -

Motif 3 width=25
Element Seqn Id St Int Rpt
SFINTFQRGPQESVWETVPQPSCEE GUXB_CELFI 230 104 -
SFINTFQRGPEESVWETVPHPSWEE ATZ86105 1249 96 -
SFINTFQRGPEESVWETVPHPSWEE GUNA_CALSA 1280 96 -
SYINTFQRGEQESTWETIPQPCWDE GUNF_CLOCE 204 99 -
AYINTFQRGSQESVWETIPQPCWDD GUX2_CLOSR 206 99 -
TFINTFQRGEQESTWETIPHPSIEE GUNS_CLOTM 201 93 -

Motif 4 width=28
Element Seqn Id St Int Rpt
KQWRYTNAPDAEGRAIQAVYWANKWAKE GUNS_CLOTM 246 20 -
KQWRYTDAPDADARAIQATYWAKVWAKE ATZ86105 1294 20 -
KQWRYTNAPDADARAIQATYWAKVWAKE GUNA_CALSA 1325 20 -
KQFKYTNAPDADARAVQATYWADQWAKE GUNF_CLOCE 250 21 -
AQFKYTNAPDADARAIQATYWANQWAKE GUX2_CLOSR 251 20 -
KQWKYTSASDADARAVEAVYWANQWATE GUXB_CELFI 275 20 -

Motif 5 width=20
Element Seqn Id St Int Rpt
TGYDAAHYLLSWYYAWGGGI GUNF_CLOCE 316 38 -
TGYDAAHYLLSWYYAWGGGI GUX2_CLOSR 317 38 -
TGYDSAHYLMAWYTAWGGGI GUNS_CLOTM 315 41 -
QGREAAHYLLSWYMAWGGAT GUXB_CELFI 346 43 -
TGYDSAHYLLSWYYAWGGAL ATZ86105 1365 43 -
TGYDSAHYLLSWYYAWGGAL GUNA_CALSA 1396 43 -

Motif 6 width=24
Element Seqn Id St Int Rpt
RQLEFYQWLQSAEGAIAGGATNSW GUNF_CLOCE 384 48 -
RQIEFYRWLQSAEGAIAGGATNSW ATZ86105 1433 48 -
RQLEFYTWLQASNGGIAGGATNSW GUXB_CELFI 416 50 -
RQLEFYQWLQSAEGGIAGGATNSW GUNS_CLOTM 383 48 -
RQIEFYRWLQSAEGAIAGGATNSW GUNA_CALSA 1464 48 -
RQLEFYQWLQSAEGAIAGGATNSY GUX2_CLOSR 385 48 -

Motif 7 width=27
Element Seqn Id St Int Rpt
PVYRDPGSNTWFGFQAWSMQRVAEYYY GUNA_CALSA 1510 22 -
PVYADPGSNTWFGMQVWSMQRVAELYY GUNF_CLOCE 430 22 -
PVYLDPGSNTWFGFQAWTMQRVAEYYY GUX2_CLOSR 431 22 -
PVYADPGSNQWFGFQAWSMQRVMEYYL GUNS_CLOTM 429 22 -
PVYVDPPSNRWFGMQAWGVQRVAELYY GUXB_CELFI 462 22 -
PVYHDPGSNTWFGFQAWSMQRVVEYYY ATZ86105 1479 22 -

Motif 8 width=21
Element Seqn Id St Int Rpt
TFAIPSTLDWKRQPDTWNGAY ATZ86105 1534 28 -
TFAIPSTLDWSGQPDTWNGTY GUNA_CALSA 1565 28 -
TFQIPSTIDWEGQPDTWNPTQ GUNF_CLOCE 485 28 -
TFEIPGNLEWSGQPDTWTGTY GUX2_CLOSR 486 28 -
TFAIPSDLEWSGQPDTWTGTY GUNS_CLOTM 484 28 -
SWKVPSELKWTGKPDTWNAAA GUXB_CELFI 516 27 -

Motif 9 width=29
Element Seqn Id St Int Rpt
GWIGKMPNGDVIKSGVKFIDIRSKYKQDP ATZ86105 1641 86 -
GWSGTMPNGDRIEPGVTFLDIRSKYLNDP GUX2_CLOSR 588 81 -
GWTGTMPNGDVIKPGVSFLDIRSFYKKDP GUXB_CELFI 625 88 -
GWSGTMPNGDKIQPGIKFIDIRTKYRQDP GUNS_CLOTM 594 89 -
GWTGKMPNGDVIKSGVKFIDIRSKYKQDP GUNF_CLOCE 589 83 -
GWTGKMPNGDVIKSGVKFIDIRSKYKQDP GUNA_CALSA 1672 86 -
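
The motifs are described as being drawn from conserved regions of the
alignment. As a rough illustration of what that conservation looks like, the
Python sketch below marks the columns of initial Motif 2 that are identical
in all six elements. The function name is hypothetical, and strict column
identity is only one simple way to look at conservation; it is not claimed to
be the criterion used to build the fingerprint.

```python
def conserved_pattern(elements):
    """Return a string with the residue at fully conserved columns, '.' elsewhere."""
    width = len(elements[0])
    if any(len(e) != width for e in elements):
        raise ValueError("all motif elements must have the same width")
    pattern = []
    for column in zip(*elements):
        # A column is conserved when every element carries the same residue.
        pattern.append(column[0] if len(set(column)) == 1 else ".")
    return "".join(pattern)


# Initial Motif 2 elements (width 18), copied from the table above.
MOTIF2_INITIAL = [
    "SYWLWLEALYGQVTQDWA",  # GUXB_CELFI
    "SYYVWLEAVYGKLTGDWS",  # ATZ86105
    "SYYVWLEAVYGKLTGDWS",  # GUNA_CALSA
    "SYYMWLEAMHGRFSGDFT",  # GUNF_CLOCE
    "SYYLWLEALYGKFTGDFS",  # GUX2_CLOSR
    "SYYVWLEAMYGNLTGNWS",  # GUNS_CLOTM
]

pattern = conserved_pattern(MOTIF2_INITIAL)
print(pattern)
```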
Final Motifs
Motif 1  width=26
Element Seqn Id St Int Rpt
GIPYHSVETLMVEAPDYGHVTTSEAM O82831 61 61 -
GIPYHSVETLICEAPDYGHLTTSEAF GUNA_CALSA 1140 1140 -
GIPYHSIETLMVEAPDYGHVTTSEAM GUNF_CLOCE 61 61 -
GIPYHAVETLIVEAPDYGHLTTSEAM GUX2_CLOSR 63 63 -
GIPYHSIETLIVEAPDYGHVTTSEAF GUNS_CLOTM 64 64 -
GIPYHSVETLIVEAPDHGHETTSEAY O86728 367 367 -
GIPYHAVETLMVEAPDYGHETTSEAY GUXB_CELFI 82 82 -

Motif 2 width=18
Element Seqn Id St Int Rpt
SYWLWLEALYGQVTQDWA GUXB_CELFI 108 0 -
SYYMWLEAMYGRFTGDFS O82831 87 0 -
SYYMWLEAMHGRFSGDFT GUNF_CLOCE 87 0 -
SYYVWLEAVYGKLTGDWS GUNA_CALSA 1166 0 -
SYYLWLEALYGKFTGDFS GUX2_CLOSR 89 0 -
SYYVWLEAMYGNLTGNWS GUNS_CLOTM 90 0 -
SYLLWLQAMYGKVTGDWS O86728 393 0 -

Motif 3 width=25
Element Seqn Id St Int Rpt
SYINTFQRGEQESTWETIPQPCWDE O82831 204 99 -
SFINTFQRGPEESVWETVPHPSWEE GUNA_CALSA 1280 96 -
SYINTFQRGEQESTWETIPQPCWDE GUNF_CLOCE 204 99 -
AYINTFQRGSQESVWETIPQPCWDD GUX2_CLOSR 206 99 -
TFINTFQRGEQESTWETIPHPSIEE GUNS_CLOTM 201 93 -
SYINTFQRGAQESVWETVPQPTCDA O86728 514 103 -
SFINTFQRGPQESVWETVPQPSCEE GUXB_CELFI 230 104 -

Motif 4 width=28
Element Seqn Id St Int Rpt
KQFKYTNAPDADARAVQATYWANEWAKE O82831 250 21 -
KQWRYTNAPDADARAIQATYWAKVWAKE GUNA_CALSA 1325 20 -
KQFKYTNAPDADARAVQATYWADQWAKE GUNF_CLOCE 250 21 -
AQFKYTNAPDADARAIQATYWANQWAKE GUX2_CLOSR 251 20 -
KQWRYTNAPDAEGRAIQAVYWANKWAKE GUNS_CLOTM 246 20 -
KQWKFTNAPDADARAVQAAYWADIWAGE O86728 559 20 -
KQWKYTSASDADARAVEAVYWANQWATE GUXB_CELFI 275 20 -

Motif 5 width=20
Element Seqn Id St Int Rpt
TGYDAAHYLLSWYYAWGGGI GUX2_CLOSR 317 38 -
TGYDSAHYLLSWYYAWGGGV O82831 316 38 -
TGYDSAHYLLSWYYAWGGAL GUNA_CALSA 1396 43 -
TGYDAAHYLLSWYYAWGGGI GUNF_CLOCE 316 38 -
TGYDSAHYLMAWYTAWGGGI GUNS_CLOTM 315 41 -
TGKDSSHYLLSWYYAWGGAV O86728 632 45 -
QGREAAHYLLSWYMAWGGAT GUXB_CELFI 346 43 -

Motif 6 width=24
Element Seqn Id St Int Rpt
RQLEFYQWLQSSEGAIAGGATNSW O82831 384 48 -
RQIEFYRWLQSAEGAIAGGATNSW GUNA_CALSA 1464 48 -
RQLEFYQWLQSAEGAIAGGATNSW GUNF_CLOCE 384 48 -
RQLEFYQWLQSAEGAIAGGATNSY GUX2_CLOSR 385 48 -
RQLEFYQWLQSAEGGIAGGATNSW GUNS_CLOTM 383 48 -
RQVEFYRWLQSDEGAIAGGATNSW O86728 702 50 -
RQLEFYTWLQASNGGIAGGATNSW GUXB_CELFI 416 50 -

Motif 7 width=27
Element Seqn Id St Int Rpt
PVYADPGSNTWFGMQVWSMQRVAELYY O82831 430 22 -
PVYRDPGSNTWFGFQAWSMQRVAEYYY GUNA_CALSA 1510 22 -
PVYADPGSNTWFGMQVWSMQRVAELYY GUNF_CLOCE 430 22 -
PVYLDPGSNTWFGFQAWTMQRVAEYYY GUX2_CLOSR 431 22 -
PVYADPGSNQWFGFQAWSMQRVMEYYL GUNS_CLOTM 429 22 -
PVYHDPPSNQWFGFQAWSMERVAEYYQ O86728 748 22 -
PVYVDPPSNRWFGMQAWGVQRVAELYY GUXB_CELFI 462 22 -

Motif 8 width=21
Element Seqn Id St Int Rpt
TFQIPGTLDWEGQPDTWDPTQ O82831 485 28 -
TFAIPSTLDWSGQPDTWNGTY GUNA_CALSA 1565 28 -
TFQIPSTIDWEGQPDTWNPTQ GUNF_CLOCE 485 28 -
TFEIPGNLEWSGQPDTWTGTY GUX2_CLOSR 486 28 -
TFAIPSDLEWSGQPDTWTGTY GUNS_CLOTM 484 28 -
TFRIPSTLQWSGQPDTWNASS O86728 803 28 -
SWKVPSELKWTGKPDTWNAAA GUXB_CELFI 516 27 -

Motif 9 width=29
Element Seqn Id St Int Rpt
GWTGKMPNGDVIKSGVKFIDIRSKYKQDP O82831 589 83 -
GWTGKMPNGDVIKSGVKFIDIRSKYKQDP GUNA_CALSA 1672 86 -
GWTGKMPNGDVIKSGVKFIDIRSKYKQDP GUNF_CLOCE 589 83 -
GWSGTMPNGDRIEPGVTFLDIRSKYLNDP GUX2_CLOSR 588 81 -
GWSGTMPNGDKIQPGIKFIDIRTKYRQDP GUNS_CLOTM 594 89 -
GWSGTMPNGDTVDASSTFASIRSFYQDDP O86728 905 81 -
GWTGTMPNGDVIKPGVSFLDIRSFYKKDP GUXB_CELFI 625 88 -
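
In the motif tables, the St column gives the start position of each element
and the Int column the interval between adjacent motifs. For the rows shown
here, the interval is consistent with the start of an element minus the end
of the preceding one (previous start plus previous width), with the interval
for Motif 1 reported as equal to its start. The Python sketch below checks
that reading against two sequences from the Final Motifs; the function name
is hypothetical and the formula is an interpretation of the listed values,
not a definition taken from the entry.

```python
# Motif widths 1..9, as given in the "Motif n width=..." headers.
WIDTHS = [26, 18, 25, 28, 20, 24, 27, 21, 29]

# Start positions (St) for motifs 1..9, copied from the Final Motifs tables.
STARTS = {
    "GUNF_CLOCE": [61, 87, 204, 250, 316, 384, 430, 485, 589],
    "GUNA_CALSA": [1140, 1166, 1280, 1325, 1396, 1464, 1510, 1565, 1672],
}

# Interval values (Int) listed in the same tables, used for comparison.
LISTED_INTERVALS = {
    "GUNF_CLOCE": [61, 0, 99, 21, 38, 48, 22, 28, 83],
    "GUNA_CALSA": [1140, 0, 96, 20, 43, 48, 22, 28, 86],
}


def intervals(starts, widths):
    """Recompute intervals from start positions and motif widths."""
    result = [starts[0]]  # first interval is listed as equal to the start
    for i in range(1, len(starts)):
        previous_end = starts[i - 1] + widths[i - 1]
        result.append(starts[i] - previous_end)
    return result


for code, starts in STARTS.items():
    computed = intervals(starts, WIDTHS)
    print(code, computed, computed == LISTED_INTERVALS[code])
```

An interval of 0 for Motif 2 therefore indicates that it begins immediately
after the last residue of Motif 1.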