==SPRINT==> PRINTS View




Identifier
LEUZIPPRFOS
Accession
PR00042
No. of Motifs
5
Creation Date
17-MAY-1993  (UPDATE 10-JUN-1999)
Title
Fos transforming protein signature
Database References

PROSITE; PS00036 FOS_JUN_BASIC; PS00029 LEUCINE_ZIPPER
BLOCKS; BL00036
INTERPRO; IPR000837
Literature References
1. BOHMANN, D., BOS, T.J., ADMON, A., NISHIMURA, T., VOGT, P.K. AND 
TJIAN, R.
Human proto-oncogene c-jun encodes a DNA-binding protein with structural
and functional properties of transcription factor AP-1.
SCIENCE 238 1386-1392 (1987).
 
2. COHEN, D.R. AND CURRAN, T.
Fra-1 - A serum-inducible, cellular immediate early gene that encodes a
fos-related antigen.
MOL.CELL.BIOL. 8(5) 2063-2069 (1988).
 
3. VAN STRAATEN, F., MULLER, R., CURRAN, T., VAN BEVEREN, C. AND VERMA, I.
Complete nucleotide sequence of a human c-onc gene - deduced amino acid 
sequence of the human c-fos protein.
PROC.NATL.ACAD.SCI.U.S.A. 80(11) 3183-3187 (1983).

Documentation
Implicit in the growth regulatory functions of all proto-oncogenes is the 
potential to induce abnormal cell growth [1] and cancer as a result of
alterations in gene expression. The alteration may be qualitative or
quantitative: viral oncogenes activate this potential either by transducing
a truncated or mutated form of the protein product, or by increasing
transcription of the proto-oncogene through integration of a viral promoter
and enhancer sequence in its vicinity.
 
Both the cellular and viral forms of the fos gene encode a phosphoprotein
that is located in the cell nucleus and forms a noncovalent complex with
several other proteins; a leucine zipper holds the dimer together.
The dimer is associated with chromatin and demonstrates specific and non-
specific DNA-binding properties [2], the DNA being bound by a highly basic 
area in the protein sequence immediately preceding the zipper domain.
Expression of the fos gene is stimulated by mitogens, suggesting that the
gene product is involved in cell growth [3], and may act as a nuclear
signal in a more general sense.
 
The `leucine zipper' is a structure that is believed to mediate the
function of several eukaryotic gene regulatory proteins. The zipper
consists of a periodic repetition of leucine residues at every seventh
position, and regions containing them appear to span 8 turns of alpha-
helix. The leucine side chains that extend from one helix interact with
those from a similar helix, hence facilitating dimerisation in the form
of a coiled-coil. Leucine zippers are present in many gene regulatory
proteins, including the CREB proteins, Jun/AP1 transcription factors,
fos oncogene and fos-related proteins, C-myc, L-myc and N-myc oncogenes,
and so on.
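
The heptad spacing described above can be checked directly against the
motif data in this entry. The following is a minimal Python sketch and is
not part of PRINTS: the regular expression transcribes the PROSITE-style
pattern L-x(6)-L-x(6)-L-x(6)-L, and the helper name heptad_leucines is a
hypothetical name introduced here for illustration.

```python
import re

# Assumed transcription of the PROSITE-style leucine zipper pattern
# L-x(6)-L-x(6)-L-x(6)-L: four leucines, each seven residues apart.
LEUCINE_ZIPPER = re.compile(r"L.{6}L.{6}L.{6}L")


def heptad_leucines(segment):
    """Return 0-based offsets of the four leucines in the first match, or []."""
    match = LEUCINE_ZIPPER.search(segment)
    if match is None:
        return []
    return [match.start() + 7 * i for i in range(4)]


# Motif 4 of FOS_HUMAN, copied from the motif tables of this entry.
fos_motif4 = "LQAETDQLEDEKSALQTEIANL"
print(heptad_leucines(fos_motif4))  # [0, 7, 14, 21]
```

For motif 4 of FOS_HUMAN the leucines fall at offsets 0, 7, 14 and 21, so
the 22-residue motif spans exactly four heptad positions.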
 
LEUZIPPRFOS is a 5-element fingerprint that provides a signature for the 
leucine zipper and DNA-binding domains characteristic of the fos oncogenes 
and fos-related proteins. The fingerprint was derived from an initial 
alignment of 6 sequences: motifs 2 and 3 span the highly basic DNA-
binding domain, while motifs 4 and 5 encode the zipper region (cf.
PROSITE patterns FOS_JUN_BASIC (PS00036) and LEUCINE_ZIPPER (PS00029)).
Two iterations on OWL19.1 were required to reach convergence, at which
point a true set comprising 14 sequences was identified. Several partial
matches were also found: the two sequences matching just 4 motifs are CREB
protein fragments that are highly similar to the DNA-binding and zipper
domains of the fos gene products; those matching just 2 or 3 motifs are
myosin heavy chains, which form coiled coils using a system similar to
leucine zippers.
 
An update on SPTR37_9f identified a true set of 24 sequences, and 6
partial matches.
Summary Information
24 codes involving 5 elements
1 code involving 4 elements
3 codes involving 3 elements
2 codes involving 2 elements
Composite Feature Index
   5|  24   24   24   24   24
   4|   0    1    1    1    1
   3|   0    3    3    3    0
   2|   0    1    2    1    0
  --+-------------------------
    |   1    2    3    4    5
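
The Composite Feature Index can be cross-checked against the Summary
Information. The sketch below assumes that each row corresponds to a number
of matched elements (5, 4, 3, 2) and each column to a motif (1 to 5), with
each cell counting the sequences in that row that contain that motif. Under
that reading, every row must sum to (elements matched) x (number of codes).
The function name row_is_consistent is hypothetical.

```python
# Summary Information: elements matched -> number of sequences (codes).
codes = {5: 24, 4: 1, 3: 3, 2: 2}

# Composite Feature Index: elements matched -> counts for motifs 1..5.
cfi = {
    5: [24, 24, 24, 24, 24],
    4: [0, 1, 1, 1, 1],
    3: [0, 3, 3, 3, 0],
    2: [0, 1, 2, 1, 0],
}


def row_is_consistent(n_elements, row, n_codes):
    """Check one index row against the summary count for that row."""
    # Each sequence contributes exactly n_elements motif hits to its row,
    # and no motif can be hit by more sequences than the row contains.
    return sum(row) == n_elements * n_codes and max(row) <= n_codes


for n in sorted(cfi, reverse=True):
    print(n, row_is_consistent(n, cfi[n], codes[n]))
```

All four rows pass for this entry, for example 0 + 3 + 3 + 3 + 0 = 9 = 3 x 3
for the sequences matching three elements.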
True Positives
FOSB_HUMAN FOSB_MOUSE FOSX_MSVFR FOS_AVINK
FOS_CHICK FOS_CYPCA FOS_FUGRU FOS_HUMAN
FOS_MOUSE FOS_MSVFB FOS_RAT FOS_TETFL
FRA1_HUMAN FRA1_MOUSE FRA1_RAT FRA2_CHICK
FRA2_HUMAN FRA2_MOUSE FRA2_RAT O35285
O56223 O88479 Q62592 Q91639
True Positive Partials
Codes involving 4 elements
Q62738
Codes involving 3 elements
ATF3_MOUSE ATF3_RAT Q62281
Codes involving 2 elements
ATF3_HUMAN FRA_DROME
Sequence Titles
FOSB_HUMAN FOSB PROTEIN (G0/G1 SWITCH REGULATORY PROTEIN 3) - HOMO SAPIENS (HUMAN).
FOSB_MOUSE FOSB PROTEIN - MUS MUSCULUS (MOUSE).
FOSX_MSVFR V-FOS/FOX TRANSFORMING PROTEIN - FBR MURINE OSTEOSARCOMA VIRUS.
FOS_AVINK P55-V-FOS TRANSFORMING PROTEIN - AVIAN RETROVIRUS NK24.
FOS_CHICK P55-C-FOS PROTO-ONCOGENE PROTEIN - GALLUS GALLUS (CHICKEN).
FOS_CYPCA P55-C-FOS PROTO-ONCOGENE PROTEIN - CYPRINUS CARPIO (COMMON CARP).
FOS_FUGRU P55-C-FOS PROTO-ONCOGENE PROTEIN - FUGU RUBRIPES (JAPANESE PUFFERFISH) (TAKIFUGU RUBRIPES).
FOS_HUMAN P55-C-FOS PROTO-ONCOGENE PROTEIN (G0S7 PROTEIN) - HOMO SAPIENS (HUMAN).
FOS_MOUSE P55-C-FOS PROTO-ONCOGENE PROTEIN - MUS MUSCULUS (MOUSE).
FOS_MSVFB P55-V-FOS TRANSFORMING PROTEIN - FBJ MURINE OSTEOSARCOMA VIRUS.
FOS_RAT P55-C-FOS PROTO-ONCOGENE PROTEIN - RATTUS NORVEGICUS (RAT).
FOS_TETFL P55-C-FOS PROTO-ONCOGENE PROTEIN - TETRAODON FLUVIATILIS (PUFFER FISH).
FRA1_HUMAN FOS-RELATED ANTIGEN 1 - HOMO SAPIENS (HUMAN).
FRA1_MOUSE FOS-RELATED ANTIGEN-1 - MUS MUSCULUS (MOUSE).
FRA1_RAT FOS-RELATED ANTIGEN 1 - RATTUS NORVEGICUS (RAT).
FRA2_CHICK FOS-RELATED ANTIGEN 2 - GALLUS GALLUS (CHICKEN).
FRA2_HUMAN FOS-RELATED ANTIGEN 2 - HOMO SAPIENS (HUMAN).
FRA2_MOUSE FOS-RELATED ANTIGEN 2 - MUS MUSCULUS (MOUSE).
FRA2_RAT FOS-RELATED ANTIGEN 2 - RATTUS NORVEGICUS (RAT).
O35285 FOS-LIKE ANTIGEN 1 (FOS-RELATED ANTIGEN 1) - MUS MUSCULUS (MOUSE).
O56223 COMPLETE GENOME - MURINE OSTEOSARCOMA VIRUS.
O88479 C-FOS PROTO-ONCOGENE PROTEIN - MESOCRICETUS AURATUS (GOLDEN HAMSTER).
Q62592 FBR-MURINE OSTEOSARCOMA PROVIRUS GENOME - RATTUS NORVEGICUS (RAT).
Q91639 FOS-RELATED ANTIGEN-2 - XENOPUS LAEVIS (AFRICAN CLAWED FROG).

Q62738 FOS-RELATED ANTIGEN 2 - RATTUS NORVEGICUS (RAT).

ATF3_MOUSE CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-3 (ACTIVATING FACTOR 3) (TRANSCRIPTION FACTOR LRG-21) - MUS MUSCULUS (MOUSE).
ATF3_RAT CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-3 (ACTIVATING FACTOR 3) (LIVER REGENERATION FACTOR 1) (LRF-1) - RATTUS NORVEGICUS (RAT).
Q62281 TI-241 - MUS MUSCULUS (MOUSE).

ATF3_HUMAN CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-3 (ACTIVATING FACTOR 3) - HOMO SAPIENS (HUMAN).
FRA_DROME TRANSCRIPTION FACTOR DFRA (FOS-RELATED ANTIGEN) (AP-1) (KAYAK PROTEIN) - DROSOPHILA MELANOGASTER (FRUIT FLY).
Scan History
OWL19_1 2 100 NSINGLE
OWL26_0 1 200 NSINGLE
SPTR37_9f 2 67 NSINGLE
Initial Motifs
Motif 1 width=18
Element Seqn Id St Int Rpt
PTVTAISTSPDLQWLVQP FOS_HUMAN 62 62 -
PTVTAISTSPDLQWLVQP FOS_AVINK 17 17 -
PTETAISTSPDLQWLVQP FOSX_MSVFR 38 38 -
PTINAITTSQDLQWMVQP FRA2_CHICK 48 48 -
PSINAVSGSQELQWMVQP FRA1_RAT 41 41 -
PSINTMSGSQELQWMVQP FRA1_HUMAN 39 39 -

Motif 2 width=17
Element Seqn Id St Int Rpt
EQLSPEEEEKRRIRRER FOS_HUMAN 130 50 -
EQLSPEEEEKRRIRRER FOS_AVINK 84 49 -
EQLSPEEEVKRRIRRER FOSX_MSVFR 106 50 -
EQLSPEEEEKRRIRRER FRA2_CHICK 117 51 -
EQISPEEEERRRVRRER FRA1_RAT 100 41 -
EQISPEEEERRRVRRER FRA1_HUMAN 98 41 -

Motif 3 width=17
Element Seqn Id St Int Rpt
NKMAAAKCRNRRRELTD FOS_HUMAN 147 0 -
NKMAAAKCRNRRRELTD FOS_AVINK 101 0 -
NKMAAAKCRNRRRELTD FOSX_MSVFR 123 0 -
NKLAAAKCRNRRRELTE FRA2_CHICK 134 0 -
NKLAAAKCRNRRKELTD FRA1_RAT 117 0 -
NKLAAAKCRNRRKELTD FRA1_HUMAN 115 0 -

Motif 4 width=22
Element Seqn Id St Int Rpt
LQAETDQLEDEKSALQTEIANL FOS_HUMAN 165 1 -
LQAETDQLEEEKSALQAEIANL FOS_AVINK 119 1 -
LQAETDQLEDEKSALQTEIANL FOSX_MSVFR 141 1 -
LQAETEVLEEEKSVLQKEIAEL FRA2_CHICK 152 1 -
LQAETDKLEDEKSGLQREIEEL FRA1_RAT 135 1 -
LQAETDKLEDEKSGLQREIEEL FRA1_HUMAN 133 1 -

Motif 5 width=24
Element Seqn Id St Int Rpt
LLKEKEKLEFILAAHRPACKIPDD FOS_HUMAN 186 -1 -
LLKEKEKLEFILAAHRPACKMPEE FOS_AVINK 140 -1 -
LLKEKEKLEFILAAHRPACKIPDD FOSX_MSVFR 162 -1 -
LQKEKEKLEFMLVAHSPVCKISPE FRA2_CHICK 173 -1 -
LQKQKERLELVLEAHRPICKIPEE FRA1_RAT 156 -1 -
LQKQKERLELVLEAHRPICKIPEG FRA1_HUMAN 154 -1 -
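
In the motif tables, St is the start position of the motif in the sequence
and Int is the interval from the preceding motif. The values in this entry
are consistent with Int = St - (previous St + previous width), with the
first motif's Int equal to its St. That relation is inferred from the
numbers shown here rather than stated in the record. A minimal sketch (the
function name intervals is hypothetical):

```python
# Motif widths for LEUZIPPRFOS, motifs 1..5, as given in the motif headers.
WIDTHS = [18, 17, 17, 22, 24]


def intervals(starts, widths=WIDTHS):
    """Recompute the Int column from the St column and the motif widths."""
    # Assumption: the first motif's interval is simply its start position.
    result = [starts[0]]
    for i in range(1, len(starts)):
        previous_end = starts[i - 1] + widths[i - 1]
        result.append(starts[i] - previous_end)
    return result


# St values for FOS_HUMAN across motifs 1..5.
print(intervals([62, 130, 147, 165, 186]))  # [62, 50, 0, 1, -1]
```

An Int of -1 between motifs 4 and 5 means the two motifs overlap by one
residue: the final leucine of motif 4 is the first leucine of motif 5.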
Final Motifs
Motif 1 width=18
Element Seqn Id St Int Rpt
PTVTAISTSPDLQWLVQP FOS_HUMAN 62 62 -
PTVTAISTSPDLQWLVQP O88479 62 62 -
PTVTAISTSPDLQWLVQP FOS_MOUSE 62 62 -
PTVTAISTSPDLQWLVQP FOS_RAT 62 62 -
PTVTAISTSPDLQWLVQP FOS_AVINK 17 17 -
PTVTAISTSPDLQWLVQP FOS_CHICK 62 62 -
PTETAISTSPDLQWLVQP FOSX_MSVFR 38 38 -
PTETAISTSPDLQWLVQP O56223 347 347 -
PTETAISTSPDLQWLVQP Q62592 348 348 -
PTVTATSTSPDLQWLVQP FOS_MSVFB 62 62 -
PTVTAISTSPDLQWMVQP FOS_FUGRU 57 57 -
PTINAITTSQDLQWMVQP FRA2_HUMAN 49 49 -
PTVTAISSCPDLQWMVQP FOS_CYPCA 49 49 -
PTVTAISTSPDLQWMVQP FOS_TETFL 56 56 -
PTINAITTSQDLQWMVQP FRA2_CHICK 48 48 -
PTVNAITTSQDLQWMVQP Q91639 52 52 -
PTINAITTSQDLQWMVQP FRA2_MOUSE 49 49 -
PTVTAITTSQDLQWLVQP FOSB_HUMAN 56 56 -
PTVTAITTSQDLQWLVQP FOSB_MOUSE 56 56 -
TINAITTTSQDLQWMVQP FRA2_RAT 50 50 -
PSINAVSGSQELQWMVQP FRA1_RAT 41 41 -
PSINTMSGSQELQWMVQP FRA1_HUMAN 39 39 -
LVPSIDSSSQELHWMVQP O35285 39 39 -
FVPSIDSSSQELHWMVQP FRA1_MOUSE 39 39 -

Motif 2 width=17
Element Seqn Id St Int Rpt
EQLSPEEEEKRRIRRER FOS_HUMAN 130 50 -
EQLSPEEEEKRRIRRER O88479 130 50 -
EQLSPEEEEKRRIRRER FOS_MOUSE 130 50 -
EQLSPEEEEKRRIRRER FOS_RAT 130 50 -
EQLSPEEEEKRRIRRER FOS_AVINK 84 49 -
EQLSPEEEEKRRIRRER FOS_CHICK 129 49 -
EQLSPEEEVKRRIRRER FOSX_MSVFR 106 50 -
EQLSPEEEVKRRIRRER O56223 415 50 -
EQLSPEEEVKRRIRRER Q62592 416 50 -
EQLSPEEEEKRRIRRER FOS_MSVFB 130 50 -
EQTTPEEEEKKRIRRER FOS_FUGRU 114 39 -
EQLSPEEEEKRRIRRER FRA2_HUMAN 117 50 -
EQLSPEEEEKKRVRRER FOS_CYPCA 106 39 -
EQTTPEEEEKKRIRRER FOS_TETFL 113 39 -
EQLSPEEEEKRRIRRER FRA2_CHICK 117 51 -
EQLSPEEEEKRRVRRER Q91639 121 51 -
EQLSPEEEEKRRIRRER FRA2_MOUSE 117 50 -
ETLTPEEEEKRRVRRER FOSB_HUMAN 148 74 -
ETLTPEEEEKRRVRRER FOSB_MOUSE 148 74 -
EQLSPEEEEKRRIRRER FRA2_RAT 118 50 -
EQISPEEEERRRVRRER FRA1_RAT 100 41 -
EQISPEEEERRRVRRER FRA1_HUMAN 98 41 -
EQISPEEEERRRVRRER O35285 98 41 -
EQISPEEEERRRVRRER FRA1_MOUSE 98 41 -

Motif 3 width=17
Element Seqn Id St Int Rpt
NKMAAAKCRNRRRELTD FOS_HUMAN 147 0 -
NKMAAAKCRNRRRELTD O88479 147 0 -
NKMAAAKCRNRRRELTD FOS_MOUSE 147 0 -
NKMAAAKCRNRRRELTD FOS_RAT 147 0 -
NKMAAAKCRNRRRELTD FOS_AVINK 101 0 -
NKMAAAKCRNRRRELTD FOS_CHICK 146 0 -
NKMAAAKCRNRRRELTD FOSX_MSVFR 123 0 -
NKMAAAKCRNRRRELTD O56223 432 0 -
NKMAAAKCRNRRRELTD Q62592 433 0 -
NKMAAAKCRNRRRELTD FOS_MSVFB 147 0 -
NKQAAAKCRNRRRELTD FOS_FUGRU 131 0 -
NKLAAAKCRNRRRELTE FRA2_HUMAN 134 0 -
NKMAAAKCRNRRRELTD FOS_CYPCA 123 0 -
NKQAAAKCRNRRRELTD FOS_TETFL 130 0 -
NKLAAAKCRNRRRELTE FRA2_CHICK 134 0 -
NKLAAAKCRNRRRELTD Q91639 138 0 -
NKLAAAKCRNRRRELTE FRA2_MOUSE 134 0 -
NKLAAAKCRNRRRELTD FOSB_HUMAN 165 0 -
NKLAAAKCRNRRRELTD FOSB_MOUSE 165 0 -
NKLAAAKCRNRRRELTE FRA2_RAT 135 0 -
NKLAAAKCRNRRKELTD FRA1_RAT 117 0 -
NKLAAAKCRNRRKELTD FRA1_HUMAN 115 0 -
NKLAAAKCRNRRKELTD O35285 115 0 -
NKLAAAKCRNRRKELTD FRA1_MOUSE 115 0 -

Motif 4 width=22
Element Seqn Id St Int Rpt
LQAETDQLEDEKSALQTEIANL FOS_HUMAN 165 1 -
LQAETDQLEDEKSALQTEIANL O88479 165 1 -
LQAETDQLEDEKSALQTEIANL FOS_MOUSE 165 1 -
LQAETDQLEDEKSALQTEIANL FOS_RAT 165 1 -
LQAETDQLEEEKSALQAEIANL FOS_AVINK 119 1 -
LQAETDQLEEEKSALQAEIANL FOS_CHICK 164 1 -
LQAETDQLEDEKSALQTEIANL FOSX_MSVFR 141 1 -
LQAETDQLEDEKSALQTEIANL O56223 450 1 -
LQAETDQLEDEKSALQTEIANL Q62592 451 1 -
LQAETDQLEDKKSALQTEIANL FOS_MSVFB 165 1 -
LQAETDQLEDEKSSLQNDIANL FOS_FUGRU 149 1 -
LQAETEELEEEKSGLQKEIAEL FRA2_HUMAN 152 1 -
LQAETDELEDEKSALQNDIANL FOS_CYPCA 141 1 -
LQAETDQLEAEKSSLQNDIANL FOS_TETFL 148 1 -
LQAETEVLEEEKSVLQKEIAEL FRA2_CHICK 152 1 -
LQAETEKLEQEKSGLQKEIADL Q91639 156 1 -
LQAETEELEEEKSGLQKEIAEL FRA2_MOUSE 152 1 -
LQAETDQLEEEKAELESEIAEL FOSB_HUMAN 183 1 -
LQAETDQLEEEKAELESEIAEL FOSB_MOUSE 183 1 -
LQTETEELEEEKSGLQKEIAEL FRA2_RAT 153 1 -
LQAETDKLEDEKSGLQREIEEL FRA1_RAT 135 1 -
LQAETDKLEDEKSGLQREIEEL FRA1_HUMAN 133 1 -
LQAETDKLEDEKSGLQREIEEL O35285 133 1 -
LQAETDKLEDEKSGLQREIEEL FRA1_MOUSE 133 1 -

Motif 5 width=24
Element Seqn Id St Int Rpt
LLKEKEKLEFILAAHRPACKIPDD FOS_HUMAN 186 -1 -
LLKEKEKLEFILAAHRPACKIPDD O88479 186 -1 -
LLKEKEKLEFILAAHRPACKIPDD FOS_MOUSE 186 -1 -
LLKEKEKLEFILAAHRPACKIPND FOS_RAT 186 -1 -
LLKEKEKLEFILAAHRPACKMPEE FOS_AVINK 140 -1 -
LLKEKEKLEFILAAHRPACKMPEE FOS_CHICK 185 -1 -
LLKEKEKLEFILAAHRPACKIPDD FOSX_MSVFR 162 -1 -
LLKEKEKLEFILAAHRPACKIPDD O56223 471 -1 -
LLKEKEKLEFILAAHRPACKIPDD Q62592 472 -1 -
LLKEKEKLEFILAAHRPACKIPDD FOS_MSVFB 186 -1 -
LLKEKERLEFILAAHQPICKIPSQ FOS_FUGRU 170 -1 -
LQKEKEKLEFMLVAHGPVCKISPE FRA2_HUMAN 173 -1 -
LLKEKERLEFILAAHKPICKIPSS FOS_CYPCA 162 -1 -
LLKEKERLEFILAAHQPICKIPSQ FOS_TETFL 169 -1 -
LQKEKEKLEFMLVAHSPVCKISPE FRA2_CHICK 173 -1 -
LQKEKDKLEFMLVAHSPVCKISTD Q91639 177 -1 -
LQKEKEKLEFMKVAHGPVCKISPE FRA2_MOUSE 173 -1 -
LQKEKERLEFVLVAHKPGCKIPYE FOSB_HUMAN 204 -1 -
LQKEKERLEFVLVAHKPGCKIPYE FOSB_MOUSE 204 -1 -
LQKEKEKLEFMLVAHGPVCKISPE FRA2_RAT 174 -1 -
LQKQKERLELVLEAHRPICKIPEE FRA1_RAT 156 -1 -
LQKQKERLELVLEAHRPICKIPEG FRA1_HUMAN 154 -1 -
LQKQKERLELVLEAHRPICKIPEG O35285 154 -1 -
LQKQKERLELVLEAHRLICKIPEG FRA1_MOUSE 154 -1 -