Identifier | G10DORPHANR  [View Relations]  [View Alignment]  
|
Accession | PR00643 |
No. of Motifs | 9 |
Creation Date | 09-DEC-1996  (UPDATE 17-JUN-1999) |
Title | G10D orphan receptor signature |
Database References | PRINTS; PR90007 7TM; PR90006 GPCRCLAN; PR00237 GPCRRHODOPSN
INTERPRO; IPR001350 GCRDB; GCR_0181; GCR_0476 |
Literature References | 1. ATTWOOD, T.K. AND FINDLAY, J.B.C.
Fingerprinting G protein-coupled receptors.
PROTEIN ENG. 7(2) 195-203 (1994).
2. ATTWOOD, T.K. AND FINDLAY, J.B.C.
G protein-coupled receptor fingerprints.
7TM, VOLUME 2, EDS. G.VRIEND AND B.BYWATER (1993).
3. BIRNBAUMER, L.
G proteins in signal transduction.
ANNU.REV.PHARMACOL.TOXICOL. 30 675-705 (1990).
4. CASEY, P.J. AND GILMAN, A.G.
G protein involvement in receptor-effector coupling.
J.BIOL.CHEM. 263(6) 2577-2580 (1988).
5. ATTWOOD, T.K. AND FINDLAY, J.B.C.
Design of a discriminating fingerprint for G protein-coupled receptors.
PROTEIN ENG. 6(2) 167-176 (1993).
6. WATSON, S. AND ARKINSTALL,
Orphan receptors.
IN THE G PROTEIN-LINKED RECEPTOR FACTSBOOK, ACADEMIC PRESS, pp223-230.
7. HARRISON, J.K., BARBER, C.M. AND LYNCH, K.R.
Molecular-cloning of a novel rat G protein-coupled receptor gene
expressed prominently in lung, adrenal, and liver.
FEBS LETT. 318(1) 17-22 (1993).
|
Documentation | G protein-coupled receptors (GPCRs) constitute a vast protein family that
encompasses a wide range of functions (including various autocrine, para-
crine and endocrine processes). They show considerable diversity at the
sequence level, on the basis of which they can be separated into distinct
groups. We use the term clan to describe the GPCRs, as they embrace a group
of families for which there are indications of evolutionary relationship,
but between which there is no statistically significant similarity in
sequence [1]. The currently known clan members include the rhodopsin-like
GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating
pheromone receptors, and the metabotropic glutamate receptor family.
The rhodopsin-like GPCRs themselves represent a widespread protein family
that includes hormone, neurotransmitter and light receptors, all of
which transduce extracellular signals through interaction with guanine
nucleotide-binding (G) proteins. Although their activating ligands vary
widely in structure and character, the amino acid sequences of the
receptors are very similar and are believed to adopt a common structural
framework comprising 7 transmembrane (TM) helices [3-5].
Several 7TM receptors have been cloned but their endogenous ligands are
unknown; these have been termed orphan receptors. G10d was isolated from a
rat genomic library and a liver cDNA library [6,7]. It is widely distributed,
being found in high levels in the lung, liver and adrenal gland, and also
in the kidney, aorta, heart, spinal cord, gut and testis [6,7].
G10DORPHANR is a 9-element fingerprint that provides a signature for the
G10D orphan receptors. The fingerprint was derived from an initial
alignment of 4 sequences: the motifs were drawn from conserved sections
within either loop or N- and C-terminal regions, focusing on those areas
of the alignment that characterise the G10D receptors but distinguish
them from the rest of the rhodopsin-like superfamily - motifs 1-3 span
the N-terminus, motif 3 leading into the N-terminal portion of TM domain 1;
motif 4 spans the first cytoplasmic loop; motif 5 lies in the second
cytoplasmic loop; motif 6 lies in the second external loop; motif 7 lies in
the third cytoplasmic loop; motif 8 spans the third external loop; and
motif 9 resides at the C-terminus. A single iteration on OWL29.0 was
required to reach convergence, no further sequences being identified beyond
the starting set.
An update on SPTR37_9f identified a true set of 4 sequences.
|
Summary Information | 4 codes involving 9 elements 0 codes involving 8 elements 0 codes involving 7 elements 0 codes involving 6 elements 0 codes involving 5 elements 0 codes involving 4 elements 0 codes involving 3 elements 0 codes involving 2 elements
|
Composite Feature Index | 9 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|
True Positives | G10D_MOUSE G10D_RAT O15218 Q64166 |
Sequence Titles | G10D_MOUSE PROBABLE G PROTEIN-COUPLED RECEPTOR G10D (NOW) - MUS MUSCULUS (MOUSE). G10D_RAT PROBABLE G PROTEIN-COUPLED RECEPTOR G10D (NOW) - RATTUS NORVEGICUS (RAT). O15218 G-PROTEIN COUPLED RECEPTOR - HOMO SAPIENS (HUMAN). Q64166 ADRENOMEDULLIN RECEPTOR - RATTUS NORVEGICUS (RAT).
|
Scan History | OWL29_0 1 50 NSINGLE SPTR37_9f 2 50 NSINGLE
|
Initial Motifs | Motif 1 width=20 Element Seqn Id St Int Rpt VIPSSRPVSTLAPDNDFREI S79811 3 3 - VIPSSEAVSTLAPDNDFREI S40685 3 3 - VIPSSRPVSTLAPDNDFREI G10D_RAT 3 3 - VIPSPRPVSTLEPDNDFRDI G10D_MOUSE 3 3 - Motif 2 width=17 Element Seqn Id St Int Rpt IHNWTELLHLFNQTFSD S40685 22 -1 - IHNWTELLHLFNQTFSD G10D_RAT 22 -1 - IHNWTELLHLFNQTFTD G10D_MOUSE 22 -1 - IHNWTELLHLFNQTFSD S79811 22 -1 - Motif 3 width=16 Element Seqn Id St Int Rpt DCHMELNENTKQVVLF S79811 38 -1 - DCHMELNENTKQVVLF S40685 38 -1 - DCHIEFNENTKHVVLF G10D_MOUSE 38 -1 - DCRMELNENTKQVVLF G10D_RAT 38 -1 - Motif 4 width=12 Element Seqn Id St Int Rpt CVNCRRSGRVGM S40685 72 18 - CVNCRRSGRVGM G10D_RAT 72 18 - CVNCRRSGRVGM G10D_MOUSE 72 18 - CVNCRRSGRVGM S79811 72 18 - Motif 5 width=18 Element Seqn Id St Int Rpt TLTNTSPSWQRHQHRIRR S79811 150 66 - TLTNTSPSWQRHQHRIRR S40685 150 66 - TLTNTSPSWQRHQHRIRR G10D_MOUSE 150 66 - TLTNTSPSWQRHQHRIRR G10D_RAT 150 66 - Motif 6 width=19 Element Seqn Id St Int Rpt DGSEPMCLFLAPFETYSAW S40685 192 24 - DGSEPMCLFLAPFETYSAW G10D_MOUSE 192 24 - DGSEPMCLFLAPFETYSAW G10D_RAT 192 24 - DGSEPMCLFLAPFETYSAW S79811 192 24 - Motif 7 width=15 Element Seqn Id St Int Rpt CRLRRQGQTESRRHC S79811 239 28 - CRLRRQGQTESRRHC S40685 239 28 - CRLRRQGQTESRRHC G10D_RAT 239 28 - CRLRRQRQTESRRHC G10D_MOUSE 239 28 - Motif 8 width=18 Element Seqn Id St Int Rpt HCNLVNFLYFFYEIIDCF S40685 287 33 - HCHLVNLLYFFYEIIDCF G10D_MOUSE 287 33 - HCNLVNFLYFFYEITDCF G10D_RAT 287 33 - HCNLVNFLYFFYEIIDCF S79811 287 33 - Motif 9 width=20 Element Seqn Id St Int Rpt LLSLVVRYLPKEQARAAGGR S79811 327 22 - LLSLVVRYLPKEQARAAGGR S40685 327 22 - LLSLVVRYLPKEQARAAGGR G10D_MOUSE 327 22 - LLSLVVRYLPKEQARAAGGR G10D_RAT 327 22 -
|
Final Motifs | Motif 1 width=20 Element Seqn Id St Int Rpt VIPSSRPVSTLAPDNDFREI Q64166 3 3 - VIPSSRPVSTLAPDNDFREI G10D_RAT 3 3 - VIPSPRPVSTLEPDNDFRDI G10D_MOUSE 3 3 - WGPGPSEGVTAVPTSDLGEI O15218 7 7 - Motif 2 width=17 Element Seqn Id St Int Rpt IHNWTELLHLFNQTFSD Q64166 22 -1 - IHNWTELLHLFNQTFSD G10D_RAT 22 -1 - IHNWTELLHLFNQTFTD G10D_MOUSE 22 -1 - IHNWTELLDLFNHTLSE O15218 26 -1 - Motif 3 width=16 Element Seqn Id St Int Rpt DCHMELNENTKQVVLF Q64166 38 -1 - DCRMELNENTKQVVLF G10D_RAT 38 -1 - DCHIEFNENTKHVVLF G10D_MOUSE 38 -1 - ECHVELSQSTKRVVLF O15218 42 -1 - Motif 4 width=12 Element Seqn Id St Int Rpt CVNCRRSGRVGM Q64166 72 18 - CVNCRRSGRVGM G10D_RAT 72 18 - CVNCRRSGRVGM G10D_MOUSE 72 18 - CVNWRGSGRAGL O15218 76 18 - Motif 5 width=18 Element Seqn Id St Int Rpt TLTNTSPSWQRHQHRIRR Q64166 150 66 - TLTNTSPSWQRHQHRIRR G10D_RAT 150 66 - TLTNTSPSWQRHQHRIRR G10D_MOUSE 150 66 - TLTSASPSWQRYQHRVRR O15218 154 66 - Motif 6 width=19 Element Seqn Id St Int Rpt DGSEPMCLFLAPFETYSAW Q64166 192 24 - DGSEPMCLFLAPFETYSAW G10D_RAT 192 24 - DGSEPMCLFLAPFETYSAW G10D_MOUSE 192 24 - EGPEPMCLFMAPFETYSTW O15218 196 24 - Motif 7 width=15 Element Seqn Id St Int Rpt CRLRRQGQTESRRHC Q64166 239 28 - CRLRRQGQTESRRHC G10D_RAT 239 28 - CRLRRQRQTESRRHC G10D_MOUSE 239 28 - CRLRQPGQPKSRRHC O15218 243 28 - Motif 8 width=18 Element Seqn Id St Int Rpt HCNLVNFLYFFYEIIDCF Q64166 287 33 - HCNLVNFLYFFYEITDCF G10D_RAT 287 33 - HCHLVNLLYFFYEIIDCF G10D_MOUSE 287 33 - HCHLVHLLYFFYDVIDCF O15218 291 33 - Motif 9 width=20 Element Seqn Id St Int Rpt LLSLVVRYLPKEQARAAGGR Q64166 327 22 - LLSLVVRYLPKEQARAAGGR G10D_RAT 327 22 - LLSLVVRYLPKEQARAAGGR G10D_MOUSE 327 22 - LLNAVVHYLPKDQTKAGTCA O15218 331 22 -
|