WORKLIST ENTRIES (1):

GLHYDRLASE59 View alignment     Glycosyl hydrolase family 59 signature
 Type of fingerprint: COMPOUND with 9  elements
Links:
   PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
   PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
   PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
   PRINTS; PR00736 GLHYDRLASE15; PR00737 GLHYDRLASE16; PR00738 GLHYDRLASE20
   PRINTS; PR00739 GLHYDRLASE26; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29
   PRINTS; PR00843 GLHYDRLASE30; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36
   PRINTS; PR00744 GLHYDRLASE37; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41
   PRINTS; PR00747 GLHYDRLASE47; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52
   PRINTS; PR00846 GLHYDRLASE56; PR00849 GLHYDRLASE58; PR00748 MELIBIASE
   PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
   PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
   INTERPRO; IPR001286

 Creation date 09-MAR-1998; UPDATE 07-JUN-1999

   1. HENRISSAT, B.
   A classification of glycosyl hydrolases based on amino acid sequence
   similarities.
   BIOCHEM.J. 280 309-316 (1991).

   2. HENRISSAT, B. AND BAIROCH, A.
   New families in the classification of glycosyl hydrolases based on amino
   acid sequence similarities.
   BIOCHEM.J. 293 781-788 (1993).

   3. HENRISSAT, B. AND BAIROCH, A.
   Updating the sequence-based classification of glycosyl hydrolases.
   BIOCHEM.J. 316 695-696 (1996).

   4. EL HASSOUNI, M., HENRISSAT, B., CHIPPAUX, M. AND BARRAS, F.
   Nucleotide sequences of the Arb genes, which control beta-glucosidase
   utilisation in Erwinia chrysanthemi - Comparison with the Escherichia
   coli Bgl operon and evidence for a new beta-glycohydrolase family
   including enzymes from eubacteria, archaebacteria and humans.
   J.BACTERIOL. 174 765-777 (1992).
  
   5. VICTORIA, T., RAFI, M.A. AND WENGER, D.A.
   Cloning of the canine GALC cDNA and identification of the mutation causing
   globoid cell leukodystrophy in West Highland White and Cairn terriers. 
   GENOMICS 33 457-462 (1996). 

   6. LUZI, P., RAFI, M.A. AND WENGER, D.A.
   Structure and Organization of the Human Galactocerebrosidase (GALC) Gene
   GENOMICS 26 407-409 (1995).

   7. SAKI, N., FUKUSHIMA, H., INUI, K., FU, L., NISHIGAKI, T., YANAGIHARA, I.
   TATSUMI, N., OZONO, K. AND OKADA, S.
   Human galactocerebrosidase gene: promoter analysis of the 5'-flanking
   region and structural organisation.
   BIOCHIM.BIOPHYS.ACTA 1395 62-67 (1998).

   O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
   hydrolyse the glycosidic bond between two or more carbohydrates, or between
   a carbohydrate and a non-carbohydrate moiety. A classification system for
   glycosyl hydrolases, based on sequence similarity, has led to the definition
   of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
   glycosid.txt). Family 59 contains galactocerebrosidases (EC 3.2.1.46). 
  
   Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal
   recessive disorder that results from deficiency of galactocerebrosidase
   (GALC) activity [5-7]. GALC is responsible for the lysosomal catabolism of
   certain galactolipids, including galactosylceramide and psychosine [5]. 
  
   GLHYDRLASE59 is a 9-element fingerprint that provides a signature for
   family 59 glycosyl hydrolases. The fingerprint was derived from an initial
   alignment of 3 sequences: the motifs were drawn from conserved regions
   spanning virtually the full alignment length - motifs 6 and 9 include
   potential glycosylation sites. Two iterations on OWL30.0 were required
   to reach convergence, at which point a true set comprising 4 sequences
   was identified. A single partial match was also found, HUMGALACB, a
   human mRNA galactocerebrosidase splice product that lacks the portion of
   sequence bearing the last 4 motifs.
  
   An update on SPTR37_9f identified a true set of 5 sequences.

  SUMMARY INFORMATION
      5 codes involving  9 elements
      0 codes involving  8 elements
      0 codes involving  7 elements
      0 codes involving  6 elements
      0 codes involving  5 elements
      0 codes involving  4 elements
      0 codes involving  3 elements
      0 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    9|   5    5    5    5    5    5    5    5    5  
    8|   0    0    0    0    0    0    0    0    0  
    7|   0    0    0    0    0    0    0    0    0  
    6|   0    0    0    0    0    0    0    0    0  
    5|   0    0    0    0    0    0    0    0    0  
    4|   0    0    0    0    0    0    0    0    0  
    3|   0    0    0    0    0    0    0    0    0  
    2|   0    0    0    0    0    0    0    0    0  
   --+----------------------------------------------
     |   1    2    3    4    5    6    7    8    9  

True positives..
 GALC_HUMAN     O02791         O35151         GALC_MOUSE     
 GALC_CANFA     


  PROTEIN TITLES
   GALC_HUMAN       GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GA
   O02791           GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GA
   O35151           GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GA
   GALC_MOUSE       GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GA
   GALC_CANFA       GALACTOCEREBROSIDASE PRECURSOR (EC 3.2.1.46) (GALCERASE) (GA

SCAN HISTORY OWL30_0 2 75 NSINGLE SPTR37_9f 2 6 NSINGLE INITIAL MOTIF SETS GLHYDRLASE591 Length of motif = 29 Motif number = 1 Glycosyl hydrolase family 59 motif I - 1 PCODE ST INT AAAGSAGHAAVPLLLCALLVPGGAYVLDD GALC_CANFA 3 3 AAAGSAGRAAVPLLLCALLAPGGAYVLDD GALC_HUMAN 3 3 AAAGSASRVAVPLLLCALLVPGGAYVLDD GALC_MOUSE 3 3 GLHYDRLASE592 Length of motif = 25 Motif number = 2 Glycosyl hydrolase family 59 motif II - 1 PCODE ST INT HMHYALDENFFRGYEWWLMKEAKKR GALC_CANFA 100 68 HMHYALDENYFRGYEWWLMKEAKKR GALC_HUMAN 100 68 HMHYELDENYFRGYEWWLMKEAKKR GALC_MOUSE 100 68 GLHYDRLASE593 Length of motif = 27 Motif number = 3 Glycosyl hydrolase family 59 motif III - 1 PCODE ST INT YYIMTWIVGAKHYHDLDIDYIGIWNER GALC_CANFA 157 32 YYVVTWIVGAKRYHDLDIDYIGIWNER GALC_HUMAN 157 32 YYVVRWILGAKHYHDLDIDYIGIWNER GALC_MOUSE 157 32 GLHYDRLASE594 Length of motif = 25 Motif number = 4 Glycosyl hydrolase family 59 motif IV - 1 PCODE ST INT KKLWSSEDFSTLNSDVGAGCLGRIL GALC_CANFA 252 68 KKLWSSEDFSTLNSDMGAGCWGRIL GALC_HUMAN 252 68 KKLWSSEDFSTINSNVGAGCWSRIL GALC_MOUSE 252 68 GLHYDRLASE595 Length of motif = 25 Motif number = 5 Glycosyl hydrolase family 59 motif V - 1 PCODE ST INT HTTQFTQPGWYYLKTVGHLEKGGSY GALC_CANFA 330 53 HTTQFTQPGWYYLKTVGHLEKGGSY GALC_HUMAN 330 53 HTTQFTQPGWYYLKTVGHLEKGGSY GALC_MOUSE 330 53 GLHYDRLASE596 Length of motif = 27 Motif number = 6 Glycosyl hydrolase family 59 motif VI - 1 PCODE ST INT CIRPFLPYFNVSRQFATFVLKGSFSEI GALC_CANFA 378 23 CIRPFLPYFNVSQQFATFVLKGSFSEI GALC_HUMAN 378 23 CIRPYLPYYNVSHQLATFTLKGSLREI GALC_MOUSE 378 23 GLHYDRLASE597 Length of motif = 24 Motif number = 7 Glycosyl hydrolase family 59 motif VII - 1 PCODE ST INT EDEIFTLTTLTVGSKGSYPLPPKS GALC_CANFA 444 39 EDELFTLTTLTTGRKGSYPLPPKS GALC_HUMAN 444 39 EDEIFTLTTLTTGRKGSYPPPPSS GALC_MOUSE 444 39 GLHYDRLASE598 Length of motif = 17 Motif number = 8 Glycosyl hydrolase family 59 motif VIII - 1 PCODE ST INT FSEAPNFADQTGVFEYF GALC_CANFA 485 17 FSEAPNFADQTGVFEYF GALC_HUMAN 485 17 FSEAPNFADQTGVFEYY GALC_MOUSE 485 17 GLHYDRLASE599 Length of motif = 30 Motif number = 9 Glycosyl hydrolase family 59 motif IX - 1 PCODE ST INT WSNLTVRCDVYIETPEKGGVFIAGRVNKGG GALC_CANFA 541 39 WTNLTIKCDVYIETPDTGGVFIAGRVNKGG GALC_HUMAN 541 39 WTNMTVQCDVYIETPRSGGVFIAGRVNKGG GALC_MOUSE 540 38 FINAL MOTIF SETS GLHYDRLASE591 Length of motif = 29 Motif number = 1 Glycosyl hydrolase family 59 motif I - 2 PCODE ST INT AAAGSAGRAAVPLLLCALLAPGGAYVLDD GALC_HUMAN 3 3 AAAGSAGRAAVPFLLCALLAPGGAYVLDD O02791 3 3 AAAGSASRVAVPLLLCALLVPGGAYVLDD GALC_MOUSE 3 3 AAAGSASRVAVPLLLCALLVPGGAYVLDD O35151 3 3 AAAGSAGHAAVPLLLCALLVPGGAYVLDD GALC_CANFA 3 3 GLHYDRLASE592 Length of motif = 25 Motif number = 2 Glycosyl hydrolase family 59 motif II - 2 PCODE ST INT HMHYALDENYFRGYEWWLMKEAKKR GALC_HUMAN 100 68 HMHYALDENYFRGYEWWLMKEAKKR O02791 100 68 HMHYELDENYFRGYEWWLMKEAKKR GALC_MOUSE 100 68 HMHYELDENYFRGYEWWLMKEAKKR O35151 100 68 HMHYALDENFFRGYEWWLMKEAKKR GALC_CANFA 100 68 GLHYDRLASE593 Length of motif = 27 Motif number = 3 Glycosyl hydrolase family 59 motif III - 2 PCODE ST INT YYVVTWIVGAKRYHDLDIDYIGIWNER GALC_HUMAN 157 32 YYVVTWIVGAKRYHDLDIDYIGIWNER O02791 157 32 YYVVRWILGAKHYHDLDIDYIGIWNER GALC_MOUSE 157 32 YYVVRWILGAKHYHDLDIDYIGIWNER O35151 157 32 YYIMTWIVGAKHYHDLDIDYIGIWNER GALC_CANFA 157 32 GLHYDRLASE594 Length of motif = 25 Motif number = 4 Glycosyl hydrolase family 59 motif IV - 2 PCODE ST INT KKLWSSEDFSTLNSDMGAGCWGRIL GALC_HUMAN 252 68 KKLWSSEDFSTLNSDTGAGCWGRIL O02791 252 68 KKLWSSEDFSTINSNVGAGCWSRIL GALC_MOUSE 252 68 KKLWSSEDFSTINSNVGAGCWSRIL O35151 252 68 KKLWSSEDFSTLNSDVGAGCLGRIL GALC_CANFA 252 68 GLHYDRLASE595 Length of motif = 25 Motif number = 5 Glycosyl hydrolase family 59 motif V - 2 PCODE ST INT HTTQFTQPGWYYLKTVGHLEKGGSY GALC_HUMAN 330 53 HTTQFTQPGWYYLKTVGHLEKGGSY O02791 330 53 HTTQFTQPGWYYLKTVGHLEKGGSY GALC_MOUSE 330 53 HTTQFTQPGWYYLKTVGHLEKGGSY O35151 330 53 HTTQFTQPGWYYLKTVGHLEKGGSY GALC_CANFA 330 53 GLHYDRLASE596 Length of motif = 27 Motif number = 6 Glycosyl hydrolase family 59 motif VI - 2 PCODE ST INT CIRPFLPYFNVSQQFATFVLKGSFSEI GALC_HUMAN 378 23 CIRPFLPYFNVSQQFATFVLKGSFSEI O02791 378 23 CIRPYLPYYNVSHQLATFTLKGSLREI GALC_MOUSE 378 23 CIRPYLPYYNVSHQLATFTLKGSLREI O35151 378 23 CIRPFLPYFNVSRQFATFVLKGSFSEI GALC_CANFA 378 23 GLHYDRLASE597 Length of motif = 24 Motif number = 7 Glycosyl hydrolase family 59 motif VII - 2 PCODE ST INT EDELFTLTTLTTGRKGSYPLPPKS GALC_HUMAN 444 39 EDELFTLTTLTTGRKGSYLPPPKS O02791 444 39 EDEIFTLTTLTTGRKGSYPPPPSS GALC_MOUSE 444 39 EDEIFTLTTLTTGRKGSYPPPPSS O35151 444 39 EDEIFTLTTLTVGSKGSYPLPPKS GALC_CANFA 444 39 GLHYDRLASE598 Length of motif = 17 Motif number = 8 Glycosyl hydrolase family 59 motif VIII - 2 PCODE ST INT FSEAPNFADQTGVFEYF GALC_HUMAN 485 17 FSEAPNFADQTGVFEYF O02791 485 17 FSEAPNFADQTGVFEYY GALC_MOUSE 485 17 FSEAPNFADQTGVFEYY O35151 485 17 FSEAPNFADQTGVFEYF GALC_CANFA 485 17 GLHYDRLASE599 Length of motif = 30 Motif number = 9 Glycosyl hydrolase family 59 motif IX - 2 PCODE ST INT WTNLTIKCDVYIETPDTGGVFIAGRVNKGG GALC_HUMAN 541 39 WTNLTIKCDVYIETPDTGGVFIAGRVNKGG O02791 541 39 WTNMTVQCDVYIETPRSGGVFIAGRVNKGG GALC_MOUSE 540 38 WTNMTVQCDVYIETPRSGGVFIAGRVNKGG O35151 540 38 WSNLTVRCDVYIETPEKGGVFIAGRVNKGG GALC_CANFA 541 39

User query: Display/Full Code "GLHYDRLASE59"