SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR00850

Identifier
GLHYDRLASE59  [View Relations]  [View Alignment]  
Accession
PR00850
No. of Motifs
9
Creation Date
09-MAR-1998  (UPDATE 07-JUN-1999)
Title
Glycosyl hydrolase family 59 signature
Database References

INTERPRO; IPR001286
Literature References
1. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
 
2. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
 
3. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
 
4. EL HASSOUNI, M., HENRISSAT, B., CHIPPAUX, M. AND BARRAS, F.
Nucleotide sequences of the Arb genes, which control beta-glucosidase
utilisation in Erwinia chrysanthemi - Comparison with the Escherichia
coli Bgl operon and evidence for a new beta-glycohydrolase family
including enzymes from eubacteria, archaebacteria and humans.
J.BACTERIOL. 174 765-777 (1992).
 
5. VICTORIA, T., RAFI, M.A. AND WENGER, D.A.
Cloning of the canine GALC cDNA and identification of the mutation causing
globoid cell leukodystrophy in West Highland White and Cairn terriers. 
GENOMICS 33 457-462 (1996). 
 
6. LUZI, P., RAFI, M.A. AND WENGER, D.A.
Structure and Organization of the Human Galactocerebrosidase (GALC) Gene
GENOMICS 26 407-409 (1995).
 
7. SAKI, N., FUKUSHIMA, H., INUI, K., FU, L., NISHIGAKI, T., YANAGIHARA, I.
TATSUMI, N., OZONO, K. AND OKADA, S.
Human galactocerebrosidase gene: promoter analysis of the 5'-flanking
region and structural organisation.
BIOCHIM.BIOPHYS.ACTA 1395 62-67 (1998).

Documentation
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
glycosid.txt). Family 59 contains galactocerebrosidases (EC 3.2.1.46). 
 
Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal
recessive disorder that results from deficiency of galactocerebrosidase
(GALC) activity [5-7]. GALC is responsible for the lysosomal catabolism of
certain galactolipids, including galactosylceramide and psychosine [5]. 
 
GLHYDRLASE59 is a 9-element fingerprint that provides a signature for
family 59 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 3 sequences: the motifs were drawn from conserved regions
spanning virtually the full alignment length - motifs 6 and 9 include
potential glycosylation sites. Two iterations on OWL30.0 were required
to reach convergence, at which point a true set comprising 4 sequences
was identified. A single partial match was also found, HUMGALACB, a
human mRNA galactocerebrosidase splice product that lacks the portion of
sequence bearing the last 4 motifs.
 
An update on SPTR37_9f identified a true set of 5 sequences.
Summary Information
5 codes involving  9 elements
0 codes involving 8 elements
0 codes involving 7 elements
0 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
9555555555
8000000000
7000000000
6000000000
5000000000
4000000000
3000000000
2000000000
123456789
True Positives
GALC_CANFA    GALC_HUMAN    GALC_MOUSE    O02791        
O35151
Sequence Titles
GALC_CANFA  GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GALACTOSYLCERAMIDASE) (GALACTOSYLCERAMIDE BETA-GALACTOSIDASE) (GALACTOCEREBROSIDE BETA-GALACTOSIDASE) - CANIS FAMILIARIS (DOG). 
GALC_HUMAN GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GALACTOSYLCERAMIDASE) (GALACTOSYLCERAMIDE BETA-GALACTOSIDASE) (GALACTOCEREBROSIDE BETA-GALACTOSIDASE) - HOMO SAPIENS (HUMAN).
GALC_MOUSE GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GALACTOSYLCERAMIDASE) (GALACTOSYLCERAMIDE BETA-GALACTOSIDASE) (GALACTOCEREBROSIDE BETA-GALACTOSIDASE) - MUS MUSCULUS (MOUSE).
O02791 GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GALACTOSYLCERAMIDASE) (GALACTOSYLCERAMIDE BETA-GALACTOSIDASE) (GALACTOCEREBROSIDE BETA-GALACTOSIDASE) - MACACA MULATTA (RHESUS MACAQUE).
O35151 GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GALACTOSYLCERAMIDASE) (GALACTOSYLCERAMIDE BETA-GALACTOSIDASE) (GALACTOCEREBROSIDE BETA-GALACTOSIDASE) - MUS MUSCULUS (MOUSE).
Scan History
OWL30_0    2  75   NSINGLE    
SPTR37_9f 2 6 NSINGLE
Initial Motifs
Motif 1  width=29
Element Seqn Id St Int Rpt
AAAGSAGRAAVPLLLCALLAPGGAYVLDD GALC_HUMAN 3 3 -
AAAGSASRVAVPLLLCALLVPGGAYVLDD GALC_MOUSE 3 3 -
AAAGSAGHAAVPLLLCALLVPGGAYVLDD GALC_CANFA 3 3 -

Motif 2 width=25
Element Seqn Id St Int Rpt
HMHYALDENYFRGYEWWLMKEAKKR GALC_HUMAN 100 68 -
HMHYELDENYFRGYEWWLMKEAKKR GALC_MOUSE 100 68 -
HMHYALDENFFRGYEWWLMKEAKKR GALC_CANFA 100 68 -

Motif 3 width=27
Element Seqn Id St Int Rpt
YYVVTWIVGAKRYHDLDIDYIGIWNER GALC_HUMAN 157 32 -
YYVVRWILGAKHYHDLDIDYIGIWNER GALC_MOUSE 157 32 -
YYIMTWIVGAKHYHDLDIDYIGIWNER GALC_CANFA 157 32 -

Motif 4 width=25
Element Seqn Id St Int Rpt
KKLWSSEDFSTLNSDMGAGCWGRIL GALC_HUMAN 252 68 -
KKLWSSEDFSTINSNVGAGCWSRIL GALC_MOUSE 252 68 -
KKLWSSEDFSTLNSDVGAGCLGRIL GALC_CANFA 252 68 -

Motif 5 width=25
Element Seqn Id St Int Rpt
HTTQFTQPGWYYLKTVGHLEKGGSY GALC_HUMAN 330 53 -
HTTQFTQPGWYYLKTVGHLEKGGSY GALC_MOUSE 330 53 -
HTTQFTQPGWYYLKTVGHLEKGGSY GALC_CANFA 330 53 -

Motif 6 width=27
Element Seqn Id St Int Rpt
CIRPFLPYFNVSQQFATFVLKGSFSEI GALC_HUMAN 378 23 -
CIRPYLPYYNVSHQLATFTLKGSLREI GALC_MOUSE 378 23 -
CIRPFLPYFNVSRQFATFVLKGSFSEI GALC_CANFA 378 23 -

Motif 7 width=24
Element Seqn Id St Int Rpt
EDELFTLTTLTTGRKGSYPLPPKS GALC_HUMAN 444 39 -
EDEIFTLTTLTTGRKGSYPPPPSS GALC_MOUSE 444 39 -
EDEIFTLTTLTVGSKGSYPLPPKS GALC_CANFA 444 39 -

Motif 8 width=17
Element Seqn Id St Int Rpt
FSEAPNFADQTGVFEYF GALC_HUMAN 485 17 -
FSEAPNFADQTGVFEYY GALC_MOUSE 485 17 -
FSEAPNFADQTGVFEYF GALC_CANFA 485 17 -

Motif 9 width=30
Element Seqn Id St Int Rpt
WTNLTIKCDVYIETPDTGGVFIAGRVNKGG GALC_HUMAN 541 39 -
WTNMTVQCDVYIETPRSGGVFIAGRVNKGG GALC_MOUSE 540 38 -
WSNLTVRCDVYIETPEKGGVFIAGRVNKGG GALC_CANFA 541 39 -
Final Motifs
Motif 1  width=29
Element Seqn Id St Int Rpt
AAAGSAGRAAVPLLLCALLAPGGAYVLDD GALC_HUMAN 3 3 -
AAAGSAGRAAVPFLLCALLAPGGAYVLDD O02791 3 3 -
AAAGSASRVAVPLLLCALLVPGGAYVLDD GALC_MOUSE 3 3 -
AAAGSASRVAVPLLLCALLVPGGAYVLDD O35151 3 3 -
AAAGSAGHAAVPLLLCALLVPGGAYVLDD GALC_CANFA 3 3 -

Motif 2 width=25
Element Seqn Id St Int Rpt
HMHYALDENYFRGYEWWLMKEAKKR GALC_HUMAN 100 68 -
HMHYALDENYFRGYEWWLMKEAKKR O02791 100 68 -
HMHYELDENYFRGYEWWLMKEAKKR GALC_MOUSE 100 68 -
HMHYELDENYFRGYEWWLMKEAKKR O35151 100 68 -
HMHYALDENFFRGYEWWLMKEAKKR GALC_CANFA 100 68 -

Motif 3 width=27
Element Seqn Id St Int Rpt
YYVVTWIVGAKRYHDLDIDYIGIWNER GALC_HUMAN 157 32 -
YYVVTWIVGAKRYHDLDIDYIGIWNER O02791 157 32 -
YYVVRWILGAKHYHDLDIDYIGIWNER GALC_MOUSE 157 32 -
YYVVRWILGAKHYHDLDIDYIGIWNER O35151 157 32 -
YYIMTWIVGAKHYHDLDIDYIGIWNER GALC_CANFA 157 32 -

Motif 4 width=25
Element Seqn Id St Int Rpt
KKLWSSEDFSTLNSDMGAGCWGRIL GALC_HUMAN 252 68 -
KKLWSSEDFSTLNSDTGAGCWGRIL O02791 252 68 -
KKLWSSEDFSTINSNVGAGCWSRIL GALC_MOUSE 252 68 -
KKLWSSEDFSTINSNVGAGCWSRIL O35151 252 68 -
KKLWSSEDFSTLNSDVGAGCLGRIL GALC_CANFA 252 68 -

Motif 5 width=25
Element Seqn Id St Int Rpt
HTTQFTQPGWYYLKTVGHLEKGGSY GALC_HUMAN 330 53 -
HTTQFTQPGWYYLKTVGHLEKGGSY O02791 330 53 -
HTTQFTQPGWYYLKTVGHLEKGGSY GALC_MOUSE 330 53 -
HTTQFTQPGWYYLKTVGHLEKGGSY O35151 330 53 -
HTTQFTQPGWYYLKTVGHLEKGGSY GALC_CANFA 330 53 -

Motif 6 width=27
Element Seqn Id St Int Rpt
CIRPFLPYFNVSQQFATFVLKGSFSEI GALC_HUMAN 378 23 -
CIRPFLPYFNVSQQFATFVLKGSFSEI O02791 378 23 -
CIRPYLPYYNVSHQLATFTLKGSLREI GALC_MOUSE 378 23 -
CIRPYLPYYNVSHQLATFTLKGSLREI O35151 378 23 -
CIRPFLPYFNVSRQFATFVLKGSFSEI GALC_CANFA 378 23 -

Motif 7 width=24
Element Seqn Id St Int Rpt
EDELFTLTTLTTGRKGSYPLPPKS GALC_HUMAN 444 39 -
EDELFTLTTLTTGRKGSYLPPPKS O02791 444 39 -
EDEIFTLTTLTTGRKGSYPPPPSS GALC_MOUSE 444 39 -
EDEIFTLTTLTTGRKGSYPPPPSS O35151 444 39 -
EDEIFTLTTLTVGSKGSYPLPPKS GALC_CANFA 444 39 -

Motif 8 width=17
Element Seqn Id St Int Rpt
FSEAPNFADQTGVFEYF GALC_HUMAN 485 17 -
FSEAPNFADQTGVFEYF O02791 485 17 -
FSEAPNFADQTGVFEYY GALC_MOUSE 485 17 -
FSEAPNFADQTGVFEYY O35151 485 17 -
FSEAPNFADQTGVFEYF GALC_CANFA 485 17 -

Motif 9 width=30
Element Seqn Id St Int Rpt
WTNLTIKCDVYIETPDTGGVFIAGRVNKGG GALC_HUMAN 541 39 -
WTNLTIKCDVYIETPDTGGVFIAGRVNKGG O02791 541 39 -
WTNMTVQCDVYIETPRSGGVFIAGRVNKGG GALC_MOUSE 540 38 -
WTNMTVQCDVYIETPRSGGVFIAGRVNKGG O35151 540 38 -
WSNLTVRCDVYIETPEKGGVFIAGRVNKGG GALC_CANFA 541 39 -