PR00658

Identifier
CD44
Accession
PR00658
No. of Motifs
8
Creation Date
23-FEB-1997  (UPDATE 14-JUN-1999)
Title
CD44 antigen precursor signature
Database References

INTERPRO; IPR001231
Literature References
1. TSUKITA, S., YONEMURA, S. AND TSUKITA, S.
ERM proteins: head-to-tail regulation of actin-plasma membrane interaction.
TRENDS BIOCHEM.SCI. 22 53-58 (1997).

Documentation
CD44 is a polymorphic cell-surface glycoprotein synthesised in a variety
of cells. The protein interacts with actin-based cytoskeletons, and co-
localises with ERM proteins (ezrin, radixin and moesin) at actin filament-
plasma membrane interaction sites [1]. CD44 may be involved in cell 
migration, adhesion and differentiation in normal cells, as well as in
metastasis in cancer cells. It is a receptor for extracellular materials,
such as soluble or cell-bound hyaluronic acid, collagen, fibronectin and
serglycin. The protein has a single membrane-spanning domain and a
heavily glycosylated extracellular domain; its cytoplasmic domain is
reportedly associated with an ankyrin-like protein [1].
 
CD44 is an 8-element fingerprint that provides a signature for the CD44
antigen precursor. The fingerprint was derived from an initial alignment
of 6 sequences: motifs 1-4 were drawn from conserved regions spanning
the N-terminal portion of the alignment; motifs 5 and 6 encode the membrane-
spanning domain; and motifs 7 and 8 lie in the C-terminal domain, which 
associates with ERM proteins. Two iterations on OWL29.1 were required to 
reach convergence, at which point a true set comprising 24 sequences was
identified. Several partial matches were also found, all of which are 
fragments or CD44 homologues.
 
An update on SPTR37_9f identified a true set of 10 sequences, and 1
partial match.
Summary Information
10 codes involving 8 elements
0 codes involving 7 elements
1 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
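   8|  10  10  10  10  10  10  10  10
   7|   0   0   0   0   0   0   0   0
   6|   1   1   1   1   1   1   0   0
   5|   0   0   0   0   0   0   0   0
   4|   0   0   0   0   0   0   0   0
   3|   0   0   0   0   0   0   0   0
   2|   0   0   0   0   0   0   0   0
  --+--------------------------------
    |   1   2   3   4   5   6   7   8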
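
The summary counts and the composite feature index can both be derived from a record of which motifs each sequence matched. The following is a minimal sketch, assuming that each row of the index counts the sequences matching exactly that many elements and each column counts how many of them include that motif. The `matches` dictionary and the function names are illustrative, not part of PRINTS; the motif membership of the partial match Q92493 (motifs 1-6) is read off the index itself.

```python
from collections import Counter

N_MOTIFS = 8

# Full matches listed under "True Positives" in this entry.
true_positives = [
    "CD44_BOVIN", "CD44_CRIGR", "CD44_HORSE", "CD44_HUMAN", "CD44_MESAU",
    "CD44_MOUSE", "CD44_PAPHA", "CD44_RAT", "O08779", "O70509",
]

# Hypothetical representation: sequence id -> set of matched motif numbers.
matches = {seq_id: set(range(1, N_MOTIFS + 1)) for seq_id in true_positives}
matches["Q92493"] = set(range(1, 7))  # partial match: motifs 1-6 only


def summary(matches):
    """Number of sequences ("codes") matching exactly k elements, k = 8..2."""
    counts = Counter(len(motifs) for motifs in matches.values())
    return {k: counts.get(k, 0) for k in range(N_MOTIFS, 1, -1)}


def composite_feature_index(matches):
    """Row k, column j: sequences matching k elements that include motif j."""
    index = {k: [0] * N_MOTIFS for k in range(N_MOTIFS, 1, -1)}
    for motifs in matches.values():
        k = len(motifs)
        if k in index:
            for j in motifs:
                index[k][j - 1] += 1
    return index


for k, row in composite_feature_index(matches).items():
    print(f"{k}|", " ".join(f"{n:3d}" for n in row))
```

Under these assumptions the 8-element row holds 10 in every column and the 6-element row holds 1 for motifs 1-6 and 0 for motifs 7 and 8, in agreement with the index shown in this entry.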
True Positives
CD44_BOVIN CD44_CRIGR CD44_HORSE CD44_HUMAN
CD44_MESAU CD44_MOUSE CD44_PAPHA CD44_RAT
O08779 O70509
True Positive Partials
Codes involving 6 elements
Q92493
Sequence Titles
CD44_BOVIN CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_CRIGR CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_HORSE CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_HUMAN CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_MESAU CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_MOUSE CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_PAPHA CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
CD44_RAT CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU
O08779 CD44 PROTEIN - RATTUS NORVEGICUS (RAT).
O70509 GLYCOPROTEIN CD44S - RATTUS NORVEGICUS (RAT).

Q92493 CELL SURFACE GLYCOPROTEIN CD44 - HOMO SAPIENS (HUMAN).
Scan History
OWL29_1 2 100 NSINGLE
SPTR37_9f 2 100 NSINGLE
Initial Motifs
Motif 1 width=20
Element Seqn Id St Int Rpt
DLNVTCRYAGVFHVEKNGRY CD44_MOUSE 25 25 -
DLNITCRYAGVFHVEKNGRY CD44_RAT 26 26 -
DLNITCRFEGIYHVEKNGRY CD44_PAPHA 23 23 -
DLNITCRYAGVFHVEKNGRY CD44_CRIGR 25 25 -
DLNITCRFAGVFHVEKNGRY CD4X_HUMAN 23 23 -
DLNITCRYAGVFHVEKNGRY CD44_HORSE 23 23 -

Motif 2 width=21
Element Seqn Id St Int Rpt
GFETCRYGFIEGHVVIPRIQP CD44_CRIGR 75 30 -
GFETCRYGFIEGHVVIPRIHP CD4X_HUMAN 73 30 -
GFETCRYGFIEGHVVIPRIHP CD44_RAT 76 30 -
GFETCRYGFIEGNVVIPRIHP CD44_MOUSE 75 30 -
GFETCRIGFIEGHVVIPPIHP CD44_HORSE 73 30 -
GFETCRYGFIEGHVVIPRIHP CD44_PAPHA 73 30 -

Motif 3 width=21
Element Seqn Id St Int Rpt
YDTYCFNASAPPEEDCTSVTD CD44_MOUSE 117 21 -
YDTYCFNASAPLEEDCTSVTD CD44_RAT 118 21 -
YDTYCFNASAPPEEDCTSVTD CD44_HORSE 114 20 -
YDTYCFNASAPPGEDCTSVTD CD44_PAPHA 114 20 -
YDTYCFNASAPLEEDCTSVTD CD44_CRIGR 116 20 -
YDTYCFNASAPPEEDCTSVTD CD4X_HUMAN 114 20 -

Motif 4 width=20
Element Seqn Id St Int Rpt
NRDGTRYSKKGEYRTHQEDI CD44_MOUSE 152 14 -
NRDGTRYVQKGEYRTNPEDI CD4X_HUMAN 149 14 -
NRDGTRYSKKGEYRTHQEDI CD44_RAT 153 14 -
NRDGTRYTKKGEYRTNPEDI CD44_HORSE 149 14 -
NRDGTRYVKKGEYRTNPEDI CD44_PAPHA 149 14 -
NRDGTRYSKKGEYRTHQEDI CD44_CRIGR 151 14 -

Motif 5 width=20
Element Seqn Id St Int Rpt
EGGASTTSGPIRRPQIPEWL CD44_HORSE 249 80 -
EGGANTTSGPIRTPQIPEWL CD4X_HUMAN 383 214 -
DSGLNSTSRPGGKPRVPEWL CD44_CRIGR 252 81 -
EGGANTTSGPLRTPQIPEWL CD44_PAPHA 252 83 -
DSGVTTTSGPARRPQIPEWL CD44_RAT 254 81 -
DSGVTTTSGPMRRPQIPEWL CD44_MOUSE 253 81 -

Motif 6 width=23
Element Seqn Id St Int Rpt
IILASLLALALILAVCIAVNSRR CD44_PAPHA 272 0 -
IILASLLALALILAVCIAVNSRR CD44_RAT 274 0 -
IILASLLALALILAVCIAVNSRR CD44_MOUSE 273 0 -
IILASLLALALILAVCIAVNSRR CD44_HORSE 269 0 -
IVLASLLALALILAVCIAVNSRR CD44_CRIGR 272 0 -
IILASLLALALILAVCIAVNSRR CD4X_HUMAN 403 0 -

Motif 7 width=20
Element Seqn Id St Int Rpt
CGQKKKLVINSGNGAVEDRK CD4X_HUMAN 427 1 -
CGQKKKLVINSGNGTVEDRK CD44_RAT 298 1 -
CGQKKKLVINGGNGTVEDRK CD44_MOUSE 297 1 -
CGQKKKLVINNGNGAVDDRK CD44_HORSE 293 1 -
CGQKKKLVINNGNGAVEDRK CD44_PAPHA 296 1 -
CGQKKKLVINSGNGKVEDRK CD44_CRIGR 296 1 -

Motif 8 width=20
Element Seqn Id St Int Rpt
DQFMTADETRNLQNVDMKIG CD44_HORSE 339 26 -
DQCMTADETRNLQSVDMKIG CD44_MOUSE 343 26 -
DQFMTADETRNLQNVDMKIG CD44_CRIGR 342 26 -
DQFMTADETRNLQNVDMKIG CD44_PAPHA 342 26 -
DQFMTADETRNLQSVDMKIG CD44_RAT 344 26 -
DQFMTADETRNLQNVDMKIG CD4X_HUMAN 473 26 -
Final Motifs
Motif 1 width=20
Element Seqn Id St Int Rpt
DLNITCRYAGVFHVEKNGRY CD44_RAT 26 26 -
DLNITCRYAGVFHVEKNGRY O70509 26 26 -
DLNITCRYAGVFHVEKNGRY O08779 26 26 -
DLNITCRYAGVFHVEKNGRY CD44_MESAU 25 25 -
DLNVTCRYAGVFHVEKNGRY CD44_MOUSE 25 25 -
DLNITCRFAGVFHVEKNGRY CD44_HUMAN 23 23 -
DLNITCRYAGVFHVEKNGRY CD44_HORSE 23 23 -
DLNITCRYAGVFHVEKNGRY CD44_BOVIN 23 23 -
DLNITCRFEGIYHVEKNGRY CD44_PAPHA 23 23 -
DLNITCRYAGVFHVEKNGRY CD44_CRIGR 25 25 -
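
Because every element of a motif has the same width, the rows can be read column by column to see where the sequences agree. The sketch below tallies the residues at each position of final motif 1, using the ten elements listed above. The helper names are illustrative only, and this simple majority consensus is not claimed to be how PRINTS itself scores a motif.

```python
from collections import Counter

# Final motif 1 elements, copied from the table above.
motif1 = [
    "DLNITCRYAGVFHVEKNGRY",  # CD44_RAT
    "DLNITCRYAGVFHVEKNGRY",  # O70509
    "DLNITCRYAGVFHVEKNGRY",  # O08779
    "DLNITCRYAGVFHVEKNGRY",  # CD44_MESAU
    "DLNVTCRYAGVFHVEKNGRY",  # CD44_MOUSE
    "DLNITCRFAGVFHVEKNGRY",  # CD44_HUMAN
    "DLNITCRYAGVFHVEKNGRY",  # CD44_HORSE
    "DLNITCRYAGVFHVEKNGRY",  # CD44_BOVIN
    "DLNITCRFEGIYHVEKNGRY",  # CD44_PAPHA
    "DLNITCRYAGVFHVEKNGRY",  # CD44_CRIGR
]


def column_counts(elements):
    """Residue counts for each alignment column."""
    return [Counter(column) for column in zip(*elements)]


def consensus(elements):
    """Most frequent residue at each position."""
    return "".join(c.most_common(1)[0][0] for c in column_counts(elements))


def variable_positions(elements):
    """1-based positions where more than one residue occurs."""
    return [i + 1 for i, c in enumerate(column_counts(elements)) if len(c) > 1]


print(consensus(motif1))
print(variable_positions(motif1))
```

For this motif only five of the twenty positions vary (4, 8, 9, 11 and 12); three of those substitutions occur together in CD44_PAPHA.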

Motif 2 width=21
Element Seqn Id St Int Rpt
GFETCRYGFIEGHVVIPRIHP CD44_RAT 76 30 -
GFETCRYGFIEGHVVIPRIHP O70509 76 30 -
GFETCRYGFIEGHVVIPRIHP O08779 76 30 -
GFETCRYGFIEGHVVIPRIQP CD44_MESAU 75 30 -
GFETCRYGFIEGNVVIPRIHP CD44_MOUSE 75 30 -
GFETCRYGFIEGHVVIPRIHP CD44_HUMAN 73 30 -
GFETCRIGFIEGHVVIPPIHP CD44_HORSE 73 30 -
GFETCRYGFIEGHVVIPRIHP CD44_BOVIN 73 30 -
GFETCRYGFIEGHVVIPRIHP CD44_PAPHA 73 30 -
GFETCRYGFIEGHVVIPRIQP CD44_CRIGR 75 30 -

Motif 3 width=21
Element Seqn Id St Int Rpt
YDTYCFNASAPLEEDCTSVTD CD44_RAT 118 21 -
YDTYCFNASAPLEEDCTSVTD O70509 118 21 -
YDTYCFNASAPLEEDCTSVTD O08779 118 21 -
YDTYCFNASAPLEEDCTSVTD CD44_MESAU 116 20 -
YDTYCFNASAPPEEDCTSVTD CD44_MOUSE 117 21 -
YDTYCFNASAPPEEDCTSVTD CD44_HUMAN 114 20 -
YDTYCFNASAPPEEDCTSVTD CD44_HORSE 114 20 -
YDTICFNASAPPGEDCTSVTD CD44_BOVIN 114 20 -
YDTYCFNASAPPGEDCTSVTD CD44_PAPHA 114 20 -
YDTYCFNASAPLEEDCTSVTD CD44_CRIGR 116 20 -

Motif 4 width=20
Element Seqn Id St Int Rpt
NRDGTRYSKKGEYRTHQEDI CD44_RAT 153 14 -
NRDGTRYSKKGEYRTHQEDI O70509 153 14 -
NRDGTRYSKKGEYRTHQEDI O08779 153 14 -
NRDGTRYSKKGEYRTHQEDI CD44_MESAU 151 14 -
NRDGTRYSKKGEYRTHQEDI CD44_MOUSE 152 14 -
NRDGTRYVQKGEYRTNPEDI CD44_HUMAN 149 14 -
NRDGTRYTKKGEYRTNPEDI CD44_HORSE 149 14 -
NRDGTRYTKKGEYRTNPEDI CD44_BOVIN 149 14 -
NRDGTRYVKKGEYRTNPEDI CD44_PAPHA 149 14 -
NRDGTRYSKKGEYRTHQEDI CD44_CRIGR 151 14 -

Motif 5 width=20
Element Seqn Id St Int Rpt
DSGVTTTSGPARRPQIPEWL CD44_RAT 254 81 -
DSGVTTTSGPARRPQIPEWL O70509 254 81 -
DSGVTTTSGPARRPQIPEWL O08779 670 497 -
DSGANTTSRPGRKPQIPEWL CD44_MESAU 321 150 -
DSGVTTTSGPMRRPQIPEWL CD44_MOUSE 253 81 -
EGGANTTSGPIRTPQIPEWL CD44_HUMAN 632 463 -
EGGASTTSGPIRRPQIPEWL CD44_HORSE 249 80 -
EHGANTTSGPMRKPQIPEWL CD44_BOVIN 256 87 -
EGGANTTSGPLRTPQIPEWL CD44_PAPHA 252 83 -
DSGLNSTSRPGGKPRVPEWL CD44_CRIGR 252 81 -

Motif 6 width=23
Element Seqn Id St Int Rpt
IILASLLALALILAVCIAVNSRR CD44_RAT 274 0 -
IILASLLALALILAVCIAVNSRR O70509 274 0 -
IILASLLALALILAVCIAVNSRR O08779 690 0 -
IVLASLLALALILAVCIAVNSRR CD44_MESAU 341 0 -
IILASLLALALILAVCIAVNSRR CD44_MOUSE 273 0 -
IILASLLALALILAVCIAVNSRR CD44_HUMAN 652 0 -
IILASLLALALILAVCIAVNSRR CD44_HORSE 269 0 -
IILASLLALALILAVCIAVNSRR CD44_BOVIN 276 0 -
IILASLLALALILAVCIAVNSRR CD44_PAPHA 272 0 -
IVLASLLALALILAVCIAVNSRR CD44_CRIGR 272 0 -

Motif 7 width=20
Element Seqn Id St Int Rpt
CGQKKKLVINSGNGTVEDRK CD44_RAT 298 1 -
CGQKKKLVINSGNGTVEDRK O70509 298 1 -
CGQKKKLVINSGNGTVEDRK O08779 714 1 -
CGQKKKLVINSGNGKVEDRK CD44_MESAU 365 1 -
CGQKKKLVINGGNGTVEDRK CD44_MOUSE 297 1 -
CGQKKKLVINSGNGAVEDRK CD44_HUMAN 676 1 -
CGQKKKLVINNGNGAVDDRK CD44_HORSE 293 1 -
CGQKKKLVINNGNGTMEERK CD44_BOVIN 300 1 -
CGQKKKLVINNGNGAVEDRK CD44_PAPHA 296 1 -
CGQKKKLVINSGNGKVEDRK CD44_CRIGR 296 1 -

Motif 8 width=20
Element Seqn Id St Int Rpt
DQFMTADETRNLQSVDMKIG CD44_RAT 344 26 -
DQFMTADETRNLQSVDMKIG O70509 344 26 -
DQFMTADETRNLQSVDMKIG O08779 760 26 -
DQFMTADETRNLQNVDMKIG CD44_MESAU 411 26 -
DQCMTADETRNLQSVDMKIG CD44_MOUSE 343 26 -
DQFMTADETRNLQNVDMKIG CD44_HUMAN 722 26 -
DQFMTADETRNLQNVDMKIG CD44_HORSE 339 26 -
DQFMTADETRNLQNVDMKIG CD44_BOVIN 346 26 -
DQFMTADETRNLQNVDMKIG CD44_PAPHA 342 26 -
DQFMTADETRNLQNVDMKIG CD44_CRIGR 342 26 -
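
The St and Int columns in the motif tables are related by simple arithmetic. Judging from the values in this entry, Int is the number of residues between the end of the preceding motif and the start of the current one, and for motif 1 it equals St. This reading is inferred from the numbers above rather than taken from a format specification, and the sketch below simply checks it against two rows of the final motifs. The names `WIDTHS`, `ROWS` and `intervals` are illustrative.

```python
# Motif widths 1-8, from the "width=" headers in this entry.
WIDTHS = [20, 21, 21, 20, 20, 23, 20, 20]

# (St values, Int values) for motifs 1-8, copied from the final motif tables.
ROWS = {
    "CD44_MOUSE": ([25, 75, 117, 152, 253, 273, 297, 343],
                   [25, 30, 21, 14, 81, 0, 1, 26]),
    "CD44_HUMAN": ([23, 73, 114, 149, 632, 652, 676, 722],
                   [23, 30, 20, 14, 463, 0, 1, 26]),
}


def intervals(starts, widths):
    """Gap before each motif: start minus the end of the previous motif.

    Assumption: for the first motif the interval is taken to be its start.
    """
    gaps = [starts[0]]
    for i in range(1, len(starts)):
        gaps.append(starts[i] - (starts[i - 1] + widths[i - 1]))
    return gaps


for seq_id, (starts, listed) in ROWS.items():
    computed = intervals(starts, WIDTHS)
    print(seq_id, computed, "matches table" if computed == listed else "differs")
```

On this reading, the interval of 0 before motif 6 means motifs 5 and 6 are contiguous, and the much larger interval before motif 5 in CD44_HUMAN (463 against 81 in CD44_MOUSE) reflects a longer stretch of sequence between motifs 4 and 5 in that entry.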