SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR01278

Identifier
CD97PROTEIN  [View Relations]  [View Alignment]  
Accession
PR01278
No. of Motifs
13
Creation Date
26-MAR-2000
Title
CD97 protein signature
Database References
PRINTS; PR90007 7TM; PR90006 GPCRCLAN; PR00249 GPCRSECRETIN

GCRDB; GCR_1063; GCR_2113; GCR_2382
Literature References
1. ATTWOOD, T.K. AND FINDLAY, J.B.C.
Fingerprinting G protein-coupled receptors.
PROTEIN ENG. 7(2) 195-203 (1994). 
 
2. ISHIHARA T., NAKAMURA S., KAZIRO, Y., TAKAHASHI, T., TAKAHASHI, K.
AND NAGATA, S.
Molecular cloning and expression of a cDNA encoding the secretin receptor
EMBO J. 10 1635-1641 (1991).
 
3. LIN, H.Y., HARRIS, T.L., FLANNERY, M.S., ARUFFO, A., KAJI, E.H., 
GORN, A., KOLAKOWSKI, L.F., LODISH, H.F. AND GOLDRING, S.R.
Expression cloning of adenylate cyclase-coupled calcitonin receptor
SCIENCE 254 1022-1024 (1991).
 
4. JUEPPNER, H., ABOU-SAMRA, A.-B., FREEMAN, M., KONG, X.F., 
SCHIPANI, E., RICHARDS, J., KOLALOWSKI, L.F., HOCK, J., POTTS, J.T.,
KRONENBERG, H.M. AND SEGRE, G.E.
A G protein linked receptor for parathyroid hormone and parathyroid
hormone-related peptide.
SCIENCE 254 1024-1026 (1991).
 
5. ISHIHARA, T., SHIGEMOTO, R., MORI, K., TAKAHASHI, K. AND NAGATA, S.
Functional expression and tissue distribution of a novel receptor for
vasoactive intestinal polypeptide.
NEURON 8(4) 811-819 (1992).
 
6. HAMANN, J., EICHLER, W., HAMANN, D., KERSTENS, H.M., PODDIGHE, P.J.,
HOOVERS, J.M., HARTMANN, E., STRAUSS, M. AND VAN LIER, R.A.
Expression cloning and chromosomal mapping of the leukocyte activation 
antigen CD97, a new seven-span transmembrane molecule of the secretion 
receptor superfamily with an unusual extracellular domain.
J.IMMUNOL. 155 1942-1950 (1995).

Documentation
G protein-coupled receptors (GPCRs) constitute a vast protein family that 
encompasses a wide range of functions (including various autocrine, para-
crine and endocrine processes). They show considerable diversity at the 
sequence level, on the basis of which they can be separated into distinct 
groups. We use the term clan to describe the GPCRs, as they embrace a group
of families for which there are indications of evolutionary relationship, 
but between which there is no statistically significant similarity in 
sequence [1]. The currently known clan members include the rhodopsin-like 
GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating
pheromone receptors, and the metabotropic glutamate receptor family.
 
The secretin-like GPCRs include secretin [2], calcitonin [3], parathyroid
hormone/parathyroid hormone-related peptides [4] and vasoactive intestinal
peptide [5], all of which activate adenylyl cyclase and the phosphatidyl-
inositol-calcium pathway. The amino acid sequences of the receptors contain
high proportions of hydrophobic residues grouped into 7 domains, in a
manner reminiscent of the rhodopsins and other receptors believed to inter-
act with G proteins. However, while a similar 3D framework has been
proposed to account for this, there is no significant sequence similarity
between these families: the secretin-like receptors thus bear their own
unique `7TM' signature.
 
CD97 is a 75-85kDa monomeric glycoprotein that is induced rapidly on the
surface of most leukocytes upon activation [6]. The mature protein contains
~722 amino acids, the C-terminal of which constitute the secretin-like 7TM 
signature. By contrast with most secretin-like receptors, CD97 has an 
extended N-terminal extracellular region (~433 amino acids) possessing
several epidermal growth factor (EGF)-like domains, some of which contain
calcium-binding sites, and a single Arg-Gly-Asp (RGD) motif [6]. The 
existence of structural elements characteristic of extracellular matrix 
proteins in a 7TM molecule suggests the CD97 receptor is potentially 
involved in both adhesion and signalling processes early after leukocyte
activation [6]. 
 
CD97PROTEIN is a 13-element fingerprint that provides a signature for the
leucocyte antigen CD97. The fingerprint was derived from an initial 
alignment if 3 sequences: the motifs were drawn from conserved regions
spanning the N-terminal two-thirds of the alignment, focusing on those
sections that characterise the CD97 proteins but distinguish them from
the rest of the secretin-like GPCR superfamily - motif 1 lies in the first
EGF-like domain; motif 2 spans the C-terminal half of the second EGF-like
domain; motif 3 encodes the C-terminal end of the fifth EGF-like domain;
motif 10 lies immediately N-terminal to TM domain 1; motif 11 spans the
first extracellular loop; motif 12 spans the second cytoplasmic loop; and
motif 13 resides in the third cytoplasmic loop. A single iteration on 
SPTR37_10f was required to reach convergence, no further sequences being
identified beyond the starting set.
Summary Information
3 codes involving 13 elements
0 codes involving 12 elements
0 codes involving 11 elements
0 codes involving 10 elements
0 codes involving 9 elements
0 codes involving 8 elements
0 codes involving 7 elements
0 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
133333333333333
120000000000000
110000000000000
100000000000000
90000000000000
80000000000000
70000000000000
60000000000000
50000000000000
40000000000000
30000000000000
20000000000000
12345678910111213
True Positives
CD97_HUMAN    O00718        Q9Z0M6        
Sequence Titles
CD97_HUMAN  LEUCOCYTE ANTIGEN CD97 PRECURSOR - HOMO SAPIENS (HUMAN). 
O00718 CD97 - HOMO SAPIENS (HUMAN).
Q9Z0M6 CD97 PROTEIN PRECURSOR - MUS MUSCULUS (MOUSE).
Scan History
SPTR37_10f 1  100  NSINGLE    
Initial Motifs
Motif 1  width=17
Element Seqn Id St Int Rpt
CARWCPQNSSCVNATAC CD97_HUMAN 26 26 -
CARWCPQNSSCVNATAC O00718 26 26 -
CAKWCPINSKCVSNRSC Q9Z0M6 31 31 -

Motif 2 width=20
Element Seqn Id St Int Rpt
GYEPVSGAKTFKNESENTCQ CD97_HUMAN 96 53 -
GYEPVSGTKTFKNESENTCQ O00718 96 53 -
GYKLLSGAESFVNESENTCQ Q9Z0M6 100 52 -

Motif 3 width=19
Element Seqn Id St Int Rpt
DTVCEDMTFSTWTPPPGVH CD97_HUMAN 160 44 -
DTVCEDMTFSTWTPPPGVH O00718 253 137 -
DTICQEPPFPTWTLLPTAH Q9Z0M6 257 137 -

Motif 4 width=18
Element Seqn Id St Int Rpt
HSQTLSRFFDKVQDLGRD CD97_HUMAN 178 -1 -
HSQTLSRFFDKVQDLGRD O00718 271 -1 -
HSQTLLRFSVEVQNLLRD Q9Z0M6 275 -1 -

Motif 5 width=18
Element Seqn Id St Int Rpt
QLLSNLEDIMRILAKSLP CD97_HUMAN 236 40 -
QLLSNLEDIMRILAKSLP O00718 329 40 -
QLLSNLEQSLRTLAQFLP Q9Z0M6 327 34 -

Motif 6 width=20
Element Seqn Id St Int Rpt
KGPFTYISPSNTELTLMIQE CD97_HUMAN 254 0 -
KGPFTYISPSNTELTLMIQE O00718 347 0 -
KGPFTYTSPSNTELSLMVKE Q9Z0M6 345 0 -

Motif 7 width=13
Element Seqn Id St Int Rpt
RMKLNWAVAAGAE CD97_HUMAN 287 13 -
RMKLNWAVAAGAE O00718 380 13 -
WMELDWAVTAGAK Q9Z0M6 379 14 -

Motif 8 width=16
Element Seqn Id St Int Rpt
MTTLLANASLNLHSKK CD97_HUMAN 314 14 -
MTTLLANASLNLHSKK O00718 407 14 -
MEKLLGNTPLNLEQRR Q9Z0M6 408 16 -

Motif 9 width=19
Element Seqn Id St Int Rpt
VQLRRLSAVNSIFLSHNNT CD97_HUMAN 344 14 -
VQLRRLSAVNSIFLSHNNT O00718 437 14 -
VSLKLLSNINSVFLTNTNT Q9Z0M6 437 13 -

Motif 10 width=16
Element Seqn Id St Int Rpt
HLSSFTILMAHYDVED CD97_HUMAN 436 73 -
HLSSFAILMAHYDVED O00718 529 73 -
HLTSFAILMAQYHVQD Q9Z0M6 511 55 -

Motif 11 width=19
Element Seqn Id St Int Rpt
ENEGGQVGLRCRLVAGLLH CD97_HUMAN 509 57 -
ENEGGQVGLRCRLVAGLLH O00718 602 57 -
ENEGGEVGLRCRLVAMMLH Q9Z0M6 584 57 -

Motif 12 width=16
Element Seqn Id St Int Rpt
RVFQGQGLSTRWLCLI CD97_HUMAN 550 22 -
RVFQGQGLSTRWLCLI O00718 643 22 -
RVFQGQGLSTWQRCLI Q9Z0M6 625 22 -

Motif 13 width=16
Element Seqn Id St Int Rpt
KLTQKFSEINPDMKKL CD97_HUMAN 624 58 -
KLTQKFSEINPDMKKL O00718 717 58 -
KLTKKFSEINPNMKKL Q9Z0M6 701 60 -
Final Motifs
Motif 1  width=17
Element Seqn Id St Int Rpt
CARWCPQNSSCVNATAC CD97_HUMAN 26 26 -
CARWCPQNSSCVNATAC O00718 26 26 -
CAKWCPINSKCVSNRSC Q9Z0M6 31 31 -

Motif 2 width=20
Element Seqn Id St Int Rpt
GYEPVSGAKTFKNESENTCQ CD97_HUMAN 96 53 -
GYEPVSGTKTFKNESENTCQ O00718 96 53 -
GYKLLSGAESFVNESENTCQ Q9Z0M6 100 52 -

Motif 3 width=19
Element Seqn Id St Int Rpt
DTVCEDMTFSTWTPPPGVH CD97_HUMAN 160 44 -
DTVCEDMTFSTWTPPPGVH O00718 253 137 -
DTICQEPPFPTWTLLPTAH Q9Z0M6 257 137 -

Motif 4 width=18
Element Seqn Id St Int Rpt
HSQTLSRFFDKVQDLGRD CD97_HUMAN 178 -1 -
HSQTLSRFFDKVQDLGRD O00718 271 -1 -
HSQTLLRFSVEVQNLLRD Q9Z0M6 275 -1 -

Motif 5 width=18
Element Seqn Id St Int Rpt
QLLSNLEDIMRILAKSLP CD97_HUMAN 236 40 -
QLLSNLEDIMRILAKSLP O00718 329 40 -
QLLSNLEQSLRTLAQFLP Q9Z0M6 327 34 -

Motif 6 width=20
Element Seqn Id St Int Rpt
KGPFTYISPSNTELTLMIQE CD97_HUMAN 254 0 -
KGPFTYISPSNTELTLMIQE O00718 347 0 -
KGPFTYTSPSNTELSLMVKE Q9Z0M6 345 0 -

Motif 7 width=13
Element Seqn Id St Int Rpt
RMKLNWAVAAGAE CD97_HUMAN 287 13 -
RMKLNWAVAAGAE O00718 380 13 -
WMELDWAVTAGAK Q9Z0M6 379 14 -

Motif 8 width=16
Element Seqn Id St Int Rpt
MTTLLANASLNLHSKK CD97_HUMAN 314 14 -
MTTLLANASLNLHSKK O00718 407 14 -
MEKLLGNTPLNLEQRR Q9Z0M6 408 16 -

Motif 9 width=19
Element Seqn Id St Int Rpt
VQLRRLSAVNSIFLSHNNT CD97_HUMAN 344 14 -
VQLRRLSAVNSIFLSHNNT O00718 437 14 -
VSLKLLSNINSVFLTNTNT Q9Z0M6 437 13 -

Motif 10 width=16
Element Seqn Id St Int Rpt
HLSSFTILMAHYDVED CD97_HUMAN 436 73 -
HLSSFAILMAHYDVED O00718 529 73 -
HLTSFAILMAQYHVQD Q9Z0M6 511 55 -

Motif 11 width=19
Element Seqn Id St Int Rpt
ENEGGQVGLRCRLVAGLLH CD97_HUMAN 509 57 -
ENEGGQVGLRCRLVAGLLH O00718 602 57 -
ENEGGEVGLRCRLVAMMLH Q9Z0M6 584 57 -

Motif 12 width=16
Element Seqn Id St Int Rpt
RVFQGQGLSTRWLCLI CD97_HUMAN 550 22 -
RVFQGQGLSTRWLCLI O00718 643 22 -
RVFQGQGLSTWQRCLI Q9Z0M6 625 22 -

Motif 13 width=16
Element Seqn Id St Int Rpt
KLTQKFSEINPDMKKL CD97_HUMAN 624 58 -
KLTQKFSEINPDMKKL O00718 717 58 -
KLTKKFSEINPNMKKL Q9Z0M6 701 60 -