SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR00533

Identifier
MASONCOGENE  [View Relations]  [View Alignment]  
Accession
PR00533
No. of Motifs
7
Creation Date
01-JUN-1996  (UPDATE 07-JUN-1999)
Title
Mas oncogene signature
Database References
PRINTS; PR90007 7TM; PR90006 GPCRCLAN; PR00237 GPCRRHODOPSN

INTERPRO; IPR000820
GCRDB; GCR_0660; GCR_0142; GCR_0080
Literature References
1. ATTWOOD, T.K. AND FINDLAY, J.B.C. 
Fingerprinting G protein-coupled receptors.
PROTEIN ENG. 7(2) 195-203 (1994).
 
2. ATTWOOD, T.K. AND FINDLAY, J.B.C. 
G protein-coupled receptor fingerprints.
7TM, VOLUME 2, EDS. G.VRIEND AND B.BYWATER (1993).
 
3. BIRNBAUMER, L.
G proteins in signal transduction.
ANNU.REV.PHARMACOL.TOXICOL. 30 675-705 (1990).
 
4. CASEY, P.J. AND GILMAN, A.G.
G protein involvement in receptor-effector coupling.
J.BIOL.CHEM. 263(6) 2577-2580 (1988).
 
5. ATTWOOD, T.K. AND FINDLAY, J.B.C. 
Design of a discriminating fingerprint for G protein-coupled receptors.
PROTEIN ENG. 6(2) 167-176 (1993).
 
6. WATSON, S. AND ARKINSTALL, S.
Orphan receptors.
IN THE G PROTEIN-LINKED RECEPTOR FACTSBOOK, ACADEMIC PRESS, 1994, PP.223-230.

Documentation
G protein-coupled receptors (GPCRs) constitute a vast protein family that 
encompasses a wide range of functions (including various autocrine, para-
crine and endocrine processes). They show considerable diversity at the 
sequence level, on the basis of which they can be separated into distinct 
groups. We use the term clan to describe the GPCRs, as they embrace a group
of families for which there are indications of evolutionary relationship, 
but between which there is no statistically significant similarity in 
sequence [1]. The currently known clan members include the rhodopsin-like 
GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating
pheromone receptors, and the metabotropic glutamate receptor family.
 
The rhodopsin-like GPCRs themselves represent a widespread protein family 
that includes hormone, neurotransmitter and light receptors, all of
which transduce extracellular signals through interaction with guanine
nucleotide-binding (G) proteins. Although their activating ligands vary 
widely in structure and character, the amino acid sequences of the 
receptors are very similar and are believed to adopt a common structural 
framework comprising 7 transmembrane (TM) helices [3-5]. 
 
The mas oncogene was discovered following co-transfection with DNA isolated
from a human epidermal carcinoma [6]. It efficiently induces tumorigenicity
and has weak focus-inducing activity in NIH 3T3 cells. To date, it is the
only oncogene to have been sequenced that encodes a 7TM protein [6]. It has
been claimed that mas is a receptor for angiotensin, but this view has not
found wide acceptance. In the CNS, high levels of the mas oncogene
transcript are present in the cerebral cortex, with lower amounts in the
hippocampus and cerebellum. In the periphery, it is expressed in low levels
in the kidney, adrenals and liver [6]. The rat RTA protein has some
similarity to the mas oncogene sequence. 
 
MASONCOGENE is a 7-element fingerprint that provides a signature for the
mas oncogene transcript. The fingerprint was derived from an initial
alignment of 3 sequences: the motifs were drawn from conserved sections
within either loop or N- and C-terminal regions, focusing on those areas
of the alignment that characterise the mas oncogene sequences but
distinguish them from the rest of the rhodopsin-like superfamily - motif 1
lies at the N-terminus; motif 2 encodes part of TM domain 2 and the first
external loop; motif 3 spans the second external loop; motif 4 spans the
third cytoplasmic loop; motif 5 spans the third external loop; and motifs
6 and 7 lie at the C-terminus. A single iteration on OWL28.0 was required
to reach convergence, no further sequences being identified beyond the
starting set.
 
An update on SPTR37_9f identified a true set of 3 sequences.
Summary Information
3 codes involving  7 elements
0 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
73333333
60000000
50000000
40000000
30000000
20000000
1234567
True Positives
MAS_HUMAN     MAS_MOUSE     MAS_RAT       
Sequence Titles
MAS_HUMAN   MAS PROTO-ONCOGENE - HOMO SAPIENS (HUMAN).    
MAS_MOUSE MAS PROTO-ONCOGENE - MUS MUSCULUS (MOUSE).
MAS_RAT MAS PROTO-ONCOGENE - RATTUS NORVEGICUS (RAT).
Scan History
OWL28_0    1  75   NSINGLE    
SPTR37_9f 2 4 NSINGLE
Initial Motifs
Motif 1  width=17
Element Seqn Id St Int Rpt
RNASLGSSHPPIPIVHW MAS_MOUSE 20 20 -
RNASLGTSHPPIPIVHW MAS_RAT 20 20 -
RNASVGNAHRQIPIVHW MAS_HUMAN 21 21 -

Motif 2 width=17
Element Seqn Id St Int Rpt
FILSIDYALDYELSSGH MAS_MOUSE 84 47 -
FILSIDYALDYELSSGH MAS_RAT 84 47 -
FILSIDYALDYELSSGH MAS_HUMAN 85 47 -

Motif 3 width=17
Element Seqn Id St Int Rpt
EYVMCIDSGEESHSRSD MAS_MOUSE 166 65 -
EYVMCIDSGEESHSQSD MAS_RAT 166 65 -
EYVMCIDREEESHSRND MAS_HUMAN 167 65 -

Motif 4 width=17
Element Seqn Id St Int Rpt
VKIRKNTWASHSSKLYI MAS_MOUSE 211 28 -
VKIRKNTWASHSSKLYI MAS_RAT 211 28 -
VKIRKNTWASHSSKLYI MAS_HUMAN 212 28 -

Motif 5 width=18
Element Seqn Id St Int Rpt
RVLYLLYYEYWSAFGNLH MAS_MOUSE 244 16 -
RVLYLLYYEYWSTFGNLH MAS_RAT 244 16 -
RLLYLLYYEYWSTFGNLH MAS_HUMAN 245 16 -

Motif 6 width=16
Element Seqn Id St Int Rpt
FRESLKVVLTRAFKDE MAS_MOUSE 290 28 -
FRESLKVVLTRAFKDE MAS_RAT 290 28 -
FKESLKVVLTRAFKDE MAS_HUMAN 291 28 -

Motif 7 width=18
Element Seqn Id St Int Rpt
EMQPRRQEGNGNTVSIET MAS_MOUSE 305 -1 -
EMQPRRQEGNGNTVSIET MAS_RAT 305 -1 -
EMQPRRQKDNCNTVTVET MAS_HUMAN 306 -1 -
Final Motifs
Motif 1  width=17
Element Seqn Id St Int Rpt
RNASLGSSHPPIPIVHW MAS_MOUSE 20 20 -
RNASLGTSHPPIPIVHW MAS_RAT 20 20 -
RNASVGNAHRQIPIVHW MAS_HUMAN 21 21 -

Motif 2 width=17
Element Seqn Id St Int Rpt
FILSIDYALDYELSSGH MAS_MOUSE 84 47 -
FILSIDYALDYELSSGH MAS_RAT 84 47 -
FILSIDYALDYELSSGH MAS_HUMAN 85 47 -

Motif 3 width=17
Element Seqn Id St Int Rpt
EYVMCIDSGEESHSRSD MAS_MOUSE 166 65 -
EYVMCIDSGEESHSQSD MAS_RAT 166 65 -
EYVMCIDREEESHSRND MAS_HUMAN 167 65 -

Motif 4 width=17
Element Seqn Id St Int Rpt
VKIRKNTWASHSSKLYI MAS_MOUSE 211 28 -
VKIRKNTWASHSSKLYI MAS_RAT 211 28 -
VKIRKNTWASHSSKLYI MAS_HUMAN 212 28 -

Motif 5 width=18
Element Seqn Id St Int Rpt
RVLYLLYYEYWSAFGNLH MAS_MOUSE 244 16 -
RVLYLLYYEYWSTFGNLH MAS_RAT 244 16 -
RLLYLLYYEYWSTFGNLH MAS_HUMAN 245 16 -

Motif 6 width=16
Element Seqn Id St Int Rpt
FRESLKVVLTRAFKDE MAS_MOUSE 290 28 -
FRESLKVVLTRAFKDE MAS_RAT 290 28 -
FKESLKVVLTRAFKDE MAS_HUMAN 291 28 -

Motif 7 width=18
Element Seqn Id St Int Rpt
EMQPRRQEGNGNTVSIET MAS_MOUSE 305 -1 -
EMQPRRQEGNGNTVSIET MAS_RAT 305 -1 -
EMQPRRQKDNCNTVTVET MAS_HUMAN 306 -1 -