Identifier | CD44  [View Relations]  [View Alignment]  
|
Accession | PR00658 |
No. of Motifs | 8 |
Creation Date | 23-FEB-1997  (UPDATE 14-JUN-1999) |
Title | CD44 antigen precursor signature |
Database References | INTERPRO; IPR001231
|
Literature References | 1. TSUKITA, S., YONEMURA, S. AND TSUKITA, S.
ERM proteins: head-to-tail regulation of actin-plasma membrane interaction.
TRENDS BIOCHEM.SCI. 22 53-58 (1997).
|
Documentation | CD44 is a polymorphic cell-surface glycoprotein synthesised in a variety
of cells. The protein interacts with actin-based cytoskeletons, and co-
localises with ERM proteins (ezrin, radixin and moesin) at actin filament-
plasma membrane interaction sites [1]. CD44 may be involved in cell
migration, adhesion and differentiation in normal cells, as well as in
metastasis in cancer cells. It is a receptor for extracellular materials,
such as soluble or cell-bound hyaluronic acid, collagen, fibronectin and
serglycin. The protein has a single membrane-spanning domain and has a
heavily glycosylated extracellular domain; its cytoplasmic domain is
reportedly associated with an ankyrin-like protein [1].
CD44 is an 8-element fingerprint that provides a signature for the CD44
antigen precursor. The fingerprint was derived from an initial alignment
of 6 sequences: motifs 1-4 were drawn from conserved regions spanning
the N-terminal portion of the alignment; motifs 5 and 6 encode the membrane-
spanning domain; and motifs 7 and 8 lie in the C-terminal domain, which
associates with ERM proteins. Two iterations on OWL29.1 were required to
reach convergence, at which point a true set comprising 24 sequences was
identified. Several partial matches were also found, all of which are
fragments or CD44 homologues.
An update on SPTR37_9f identified a true set of 10 sequences, and 1
partial match.
|
Summary Information | 10 codes involving 8 elements 0 codes involving 7 elements 1 codes involving 6 elements 0 codes involving 5 elements 0 codes involving 4 elements 0 codes involving 3 elements 0 codes involving 2 elements
|
Composite Feature Index | 8 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|
True Positives | CD44_BOVIN CD44_CRIGR CD44_HORSE CD44_HUMAN CD44_MESAU CD44_MOUSE CD44_PAPHA CD44_RAT O08779 O70509 |
True Positive Partials | Codes involving 6 elements Q92493 |
|
Sequence Titles | CD44_BOVIN CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_CRIGR CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_HORSE CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_HUMAN CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_MESAU CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_MOUSE CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_PAPHA CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU CD44_RAT CD44 ANTIGEN PRECURSOR (PHAGOCYTIC GLYCOPROTEIN I) (PGP-1) (HUTCH-I) (EXTRACELLU O08779 CD44 PROTEIN - RATTUS NORVEGICUS (RAT). O70509 GLYCOPROTEIN CD44S - RATTUS NORVEGICUS (RAT). Q92493 CELL SURFACE GLYCOPROTEIN CD44 - HOMO SAPIENS (HUMAN).
|
Scan History | OWL29_1 2 100 NSINGLE SPTR37_9f 2 100 NSINGLE
|
Initial Motifs | Motif 1 width=20 Element Seqn Id St Int Rpt DLNVTCRYAGVFHVEKNGRY CD44_MOUSE 25 25 - DLNITCRYAGVFHVEKNGRY CD44_RAT 26 26 - DLNITCRFEGIYHVEKNGRY CD44_PAPHA 23 23 - DLNITCRYAGVFHVEKNGRY CD44_CRIGR 25 25 - DLNITCRFAGVFHVEKNGRY CD4X_HUMAN 23 23 - DLNITCRYAGVFHVEKNGRY CD44_HORSE 23 23 - Motif 2 width=21 Element Seqn Id St Int Rpt GFETCRYGFIEGHVVIPRIQP CD44_CRIGR 75 30 - GFETCRYGFIEGHVVIPRIHP CD4X_HUMAN 73 30 - GFETCRYGFIEGHVVIPRIHP CD44_RAT 76 30 - GFETCRYGFIEGNVVIPRIHP CD44_MOUSE 75 30 - GFETCRIGFIEGHVVIPPIHP CD44_HORSE 73 30 - GFETCRYGFIEGHVVIPRIHP CD44_PAPHA 73 30 - Motif 3 width=21 Element Seqn Id St Int Rpt YDTYCFNASAPPEEDCTSVTD CD44_MOUSE 117 21 - YDTYCFNASAPLEEDCTSVTD CD44_RAT 118 21 - YDTYCFNASAPPEEDCTSVTD CD44_HORSE 114 20 - YDTYCFNASAPPGEDCTSVTD CD44_PAPHA 114 20 - YDTYCFNASAPLEEDCTSVTD CD44_CRIGR 116 20 - YDTYCFNASAPPEEDCTSVTD CD4X_HUMAN 114 20 - Motif 4 width=20 Element Seqn Id St Int Rpt NRDGTRYSKKGEYRTHQEDI CD44_MOUSE 152 14 - NRDGTRYVQKGEYRTNPEDI CD4X_HUMAN 149 14 - NRDGTRYSKKGEYRTHQEDI CD44_RAT 153 14 - NRDGTRYTKKGEYRTNPEDI CD44_HORSE 149 14 - NRDGTRYVKKGEYRTNPEDI CD44_PAPHA 149 14 - NRDGTRYSKKGEYRTHQEDI CD44_CRIGR 151 14 - Motif 5 width=20 Element Seqn Id St Int Rpt EGGASTTSGPIRRPQIPEWL CD44_HORSE 249 80 - EGGANTTSGPIRTPQIPEWL CD4X_HUMAN 383 214 - DSGLNSTSRPGGKPRVPEWL CD44_CRIGR 252 81 - EGGANTTSGPLRTPQIPEWL CD44_PAPHA 252 83 - DSGVTTTSGPARRPQIPEWL CD44_RAT 254 81 - DSGVTTTSGPMRRPQIPEWL CD44_MOUSE 253 81 - Motif 6 width=23 Element Seqn Id St Int Rpt IILASLLALALILAVCIAVNSRR CD44_PAPHA 272 0 - IILASLLALALILAVCIAVNSRR CD44_RAT 274 0 - IILASLLALALILAVCIAVNSRR CD44_MOUSE 273 0 - IILASLLALALILAVCIAVNSRR CD44_HORSE 269 0 - IVLASLLALALILAVCIAVNSRR CD44_CRIGR 272 0 - IILASLLALALILAVCIAVNSRR CD4X_HUMAN 403 0 - Motif 7 width=20 Element Seqn Id St Int Rpt CGQKKKLVINSGNGAVEDRK CD4X_HUMAN 427 1 - CGQKKKLVINSGNGTVEDRK CD44_RAT 298 1 - CGQKKKLVINGGNGTVEDRK CD44_MOUSE 297 1 - CGQKKKLVINNGNGAVDDRK CD44_HORSE 293 1 - CGQKKKLVINNGNGAVEDRK CD44_PAPHA 296 1 - CGQKKKLVINSGNGKVEDRK CD44_CRIGR 296 1 - Motif 8 width=20 Element Seqn Id St Int Rpt DQFMTADETRNLQNVDMKIG CD44_HORSE 339 26 - DQCMTADETRNLQSVDMKIG CD44_MOUSE 343 26 - DQFMTADETRNLQNVDMKIG CD44_CRIGR 342 26 - DQFMTADETRNLQNVDMKIG CD44_PAPHA 342 26 - DQFMTADETRNLQSVDMKIG CD44_RAT 344 26 - DQFMTADETRNLQNVDMKIG CD4X_HUMAN 473 26 -
|
Final Motifs | Motif 1 width=20 Element Seqn Id St Int Rpt DLNITCRYAGVFHVEKNGRY CD44_RAT 26 26 - DLNITCRYAGVFHVEKNGRY O70509 26 26 - DLNITCRYAGVFHVEKNGRY O08779 26 26 - DLNITCRYAGVFHVEKNGRY CD44_MESAU 25 25 - DLNVTCRYAGVFHVEKNGRY CD44_MOUSE 25 25 - DLNITCRFAGVFHVEKNGRY CD44_HUMAN 23 23 - DLNITCRYAGVFHVEKNGRY CD44_HORSE 23 23 - DLNITCRYAGVFHVEKNGRY CD44_BOVIN 23 23 - DLNITCRFEGIYHVEKNGRY CD44_PAPHA 23 23 - DLNITCRYAGVFHVEKNGRY CD44_CRIGR 25 25 - Motif 2 width=21 Element Seqn Id St Int Rpt GFETCRYGFIEGHVVIPRIHP CD44_RAT 76 30 - GFETCRYGFIEGHVVIPRIHP O70509 76 30 - GFETCRYGFIEGHVVIPRIHP O08779 76 30 - GFETCRYGFIEGHVVIPRIQP CD44_MESAU 75 30 - GFETCRYGFIEGNVVIPRIHP CD44_MOUSE 75 30 - GFETCRYGFIEGHVVIPRIHP CD44_HUMAN 73 30 - GFETCRIGFIEGHVVIPPIHP CD44_HORSE 73 30 - GFETCRYGFIEGHVVIPRIHP CD44_BOVIN 73 30 - GFETCRYGFIEGHVVIPRIHP CD44_PAPHA 73 30 - GFETCRYGFIEGHVVIPRIQP CD44_CRIGR 75 30 - Motif 3 width=21 Element Seqn Id St Int Rpt YDTYCFNASAPLEEDCTSVTD CD44_RAT 118 21 - YDTYCFNASAPLEEDCTSVTD O70509 118 21 - YDTYCFNASAPLEEDCTSVTD O08779 118 21 - YDTYCFNASAPLEEDCTSVTD CD44_MESAU 116 20 - YDTYCFNASAPPEEDCTSVTD CD44_MOUSE 117 21 - YDTYCFNASAPPEEDCTSVTD CD44_HUMAN 114 20 - YDTYCFNASAPPEEDCTSVTD CD44_HORSE 114 20 - YDTICFNASAPPGEDCTSVTD CD44_BOVIN 114 20 - YDTYCFNASAPPGEDCTSVTD CD44_PAPHA 114 20 - YDTYCFNASAPLEEDCTSVTD CD44_CRIGR 116 20 - Motif 4 width=20 Element Seqn Id St Int Rpt NRDGTRYSKKGEYRTHQEDI CD44_RAT 153 14 - NRDGTRYSKKGEYRTHQEDI O70509 153 14 - NRDGTRYSKKGEYRTHQEDI O08779 153 14 - NRDGTRYSKKGEYRTHQEDI CD44_MESAU 151 14 - NRDGTRYSKKGEYRTHQEDI CD44_MOUSE 152 14 - NRDGTRYVQKGEYRTNPEDI CD44_HUMAN 149 14 - NRDGTRYTKKGEYRTNPEDI CD44_HORSE 149 14 - NRDGTRYTKKGEYRTNPEDI CD44_BOVIN 149 14 - NRDGTRYVKKGEYRTNPEDI CD44_PAPHA 149 14 - NRDGTRYSKKGEYRTHQEDI CD44_CRIGR 151 14 - Motif 5 width=20 Element Seqn Id St Int Rpt DSGVTTTSGPARRPQIPEWL CD44_RAT 254 81 - DSGVTTTSGPARRPQIPEWL O70509 254 81 - DSGVTTTSGPARRPQIPEWL O08779 670 497 - DSGANTTSRPGRKPQIPEWL CD44_MESAU 321 150 - DSGVTTTSGPMRRPQIPEWL CD44_MOUSE 253 81 - EGGANTTSGPIRTPQIPEWL CD44_HUMAN 632 463 - EGGASTTSGPIRRPQIPEWL CD44_HORSE 249 80 - EHGANTTSGPMRKPQIPEWL CD44_BOVIN 256 87 - EGGANTTSGPLRTPQIPEWL CD44_PAPHA 252 83 - DSGLNSTSRPGGKPRVPEWL CD44_CRIGR 252 81 - Motif 6 width=23 Element Seqn Id St Int Rpt IILASLLALALILAVCIAVNSRR CD44_RAT 274 0 - IILASLLALALILAVCIAVNSRR O70509 274 0 - IILASLLALALILAVCIAVNSRR O08779 690 0 - IVLASLLALALILAVCIAVNSRR CD44_MESAU 341 0 - IILASLLALALILAVCIAVNSRR CD44_MOUSE 273 0 - IILASLLALALILAVCIAVNSRR CD44_HUMAN 652 0 - IILASLLALALILAVCIAVNSRR CD44_HORSE 269 0 - IILASLLALALILAVCIAVNSRR CD44_BOVIN 276 0 - IILASLLALALILAVCIAVNSRR CD44_PAPHA 272 0 - IVLASLLALALILAVCIAVNSRR CD44_CRIGR 272 0 - Motif 7 width=20 Element Seqn Id St Int Rpt CGQKKKLVINSGNGTVEDRK CD44_RAT 298 1 - CGQKKKLVINSGNGTVEDRK O70509 298 1 - CGQKKKLVINSGNGTVEDRK O08779 714 1 - CGQKKKLVINSGNGKVEDRK CD44_MESAU 365 1 - CGQKKKLVINGGNGTVEDRK CD44_MOUSE 297 1 - CGQKKKLVINSGNGAVEDRK CD44_HUMAN 676 1 - CGQKKKLVINNGNGAVDDRK CD44_HORSE 293 1 - CGQKKKLVINNGNGTMEERK CD44_BOVIN 300 1 - CGQKKKLVINNGNGAVEDRK CD44_PAPHA 296 1 - CGQKKKLVINSGNGKVEDRK CD44_CRIGR 296 1 - Motif 8 width=20 Element Seqn Id St Int Rpt DQFMTADETRNLQSVDMKIG CD44_RAT 344 26 - DQFMTADETRNLQSVDMKIG O70509 344 26 - DQFMTADETRNLQSVDMKIG O08779 760 26 - DQFMTADETRNLQNVDMKIG CD44_MESAU 411 26 - DQCMTADETRNLQSVDMKIG CD44_MOUSE 343 26 - DQFMTADETRNLQNVDMKIG CD44_HUMAN 722 26 - DQFMTADETRNLQNVDMKIG CD44_HORSE 339 26 - DQFMTADETRNLQNVDMKIG CD44_BOVIN 346 26 - DQFMTADETRNLQNVDMKIG CD44_PAPHA 342 26 - DQFMTADETRNLQNVDMKIG CD44_CRIGR 342 26 -
|