
==SPRINT==> PRINTS View




Identifier
CCAATSUBUNTA
Accession
PR00615
No. of Motifs
3
Creation Date
10-NOV-1996  (UPDATE 27-JUN-1999)
Title
CCAAT-binding transcription factor subunit A signature
Database References

PROSITE; PS00685 CBFA_NFYB
BLOCKS; BL00685
INTERPRO; IPR000947
Literature References
1. VUORIO, T., MAITY, S.N. AND DE CROMBRUGGHE, B.
Purification and molecular cloning of the "A" chain of a rat heterotrimeric
CCAAT-binding protein. Sequence identity with the yeast HAP3 transcription
factor.
J.BIOL.CHEM. 265 22480-22486 (1990).
 
2. LI, X-Y, MANTOVANI, R., HOOFT VAN HUIJSDUIJNEN, R., ANDRE, I.,
BENOIST, C. AND MATHIS, D.
Evolutionary variation of the CCAAT-binding transcription factor NF-Y.
NUCLEIC ACIDS RES. 20 1087-1091 (1992); ERRATUM: NUCLEIC ACIDS RES. 20 1841 (1992).
 
3. MAITY, S.N. AND DE CROMBRUGGHE, B.
Biochemical analysis of the B subunit of the heteromeric CCAAT-binding 
factor. A DNA-binding domain and a subunit interaction domain are specified
by two separate segments.
J.BIOL.CHEM. 267 8286-8292 (1992).
 
4. MULDER, W., SCHOLTEN, I.H., DE BOER, R.W. AND GRIVELL, L.A.
Sequence of the HAP3 transcription factor of Kluyveromyces lactis predicts
the presence of a novel 4-cysteine zinc-finger motif.
MOL.GEN.GENET. 245 96-106 (1994).

Documentation
The CCAAT-binding factor (CBF) is a mammalian transcription factor that
binds to a CCAAT motif in the promoters of a wide variety of genes,
including type I collagen and albumin [1,2]. The factor is a heteromeric
complex of A and B subunits, both of which are required for DNA-binding
[1,2]. The subunits can interact in the absence of DNA binding; conserved
regions in each subunit are important in mediating this interaction.
 
The A subunit can be split into 3 domains on the basis of sequence
similarity: a non-conserved N-terminal `A domain'; a highly conserved
central `B domain' involved in DNA-binding; and a C-terminal `C domain',
which contains a number of glutamine and acidic residues involved in
protein-protein interactions [2]. The A subunit shows striking similarity
to the HAP3 subunit of the yeast CCAAT-binding heterotrimeric transcription
factor [2,4]. The K. lactis HAP3 protein has been predicted to contain a
4-cysteine zinc finger, which is thought to be present in similar HAP3
and CBF subunit A proteins, in which the third cysteine is replaced by a
serine [4].
 
CCAATSUBUNTA is a 3-element fingerprint that provides a signature for the
CCAAT-binding transcription factor A subunit. The fingerprint was derived
from an initial alignment of 6 sequences: the motifs were drawn from the
full length of the B domain, motif 1 including the region encoded by PROSITE
pattern CBFA_NFYB (PS00685), which is involved in DNA-binding. Two
iterations on OWL28.2 were required to reach convergence, at which point
a true set comprising 10 sequences was identified.
 
An update on SPTR37_9f identified a true set of 44 sequences, and 3
partial matches.
Summary Information
44 codes involving 3 elements
3 codes involving 2 elements
Composite Feature Index
    3 |  44  44  44
    2 |   3   3   0
   ---+------------
      |   1   2   3
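One way to read the index, stated here as an assumption rather than as the database's own definition: each row is labelled with a number of matched elements, and the three columns give how many sequences in that row matched motif 1, 2 and 3. Under that reading, each row should sum to (number of codes) x (number of elements). The Python check below is only an illustrative sketch; the variable and function names are hypothetical.

```python
# Counts copied from Summary Information: {elements matched: number of codes}.
codes_by_elements = {3: 44, 2: 3}

# Composite Feature Index rows, read as {elements matched: [motif 1, motif 2, motif 3]}.
# This row/column interpretation is an assumption made for illustration.
composite_index = {3: [44, 44, 44], 2: [3, 3, 0]}


def row_is_consistent(elements, codes, per_motif_counts):
    """Return True if the per-motif counts add up to codes * elements."""
    return sum(per_motif_counts) == codes * elements


checks = {
    elements: row_is_consistent(elements, codes, composite_index[elements])
    for elements, codes in codes_by_elements.items()
}
print(checks)
```

If this reading is right, the second row also indicates that none of the 3 partial matches includes motif 3.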
True Positives
CBFA_HUMAN CBFA_MAIZE CBFA_MOUSE CBFA_PETMA
DBL_HUMAN HAP3_KLULA HAP3_YEAST O04027
O13068 O17286 O23310 O23633
O55078 O55079 O59848 O59864
O73744 O74336 O76256 O81130
O82248 PHP3_SCHPO Q00735 Q63091
TBAP_HUMAN TOP2_ARATH TOP2_BOMMO TOP2_CAEEL
TOP2_CANAL TOP2_DROME TOP2_PEA TOP2_PLAFK
TOP2_SCHPO TOP2_YEAST TP2A_CHICK TP2A_CRIGR
TP2A_HUMAN TP2A_MOUSE TP2A_PIG TP2A_RAT
TP2B_CHICK TP2B_CRILO TP2B_HUMAN TP2B_MOUSE
True Positive Partials
Codes involving 2 elements
DR1_ARATH O14348 TP2M_DICDI
Sequence Titles
CBFA_HUMAN  CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A (CBF-A) (NF-Y PROTEIN CHAIN B) (NF-YB) (CAAT-BOX DNA BINDING PROTEIN SUBUNIT B) - HOMO SAPIENS (HUMAN). 
CBFA_MAIZE CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A (CBF-A) (NF-Y PROTEIN CHAIN B) (NF-YB) (CAAT-BOX DNA BINDING PROTEIN SUBUNIT B) - ZEA MAYS (MAIZE).
CBFA_MOUSE CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A (CBF-A) (NF-Y PROTEIN CHAIN B) (NF-YB) (CAAT-BOX DNA BINDING PROTEIN SUBUNIT B) - MUS MUSCULUS (MOUSE), AND RATTUS NORVEGICUS (RAT).
CBFA_PETMA CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A (CBF-A) (NF-Y PROTEIN CHAIN B) (NF-YB) (CAAT-BOX DNA BINDING PROTEIN SUBUNIT B) - PETROMYZON MARINUS (SEA LAMPREY).
DBL_HUMAN PROTO-ONCOGENE DBL PRECURSOR [CONTAINS: MCF2] - HOMO SAPIENS (HUMAN).
HAP3_KLULA HAP3 TRANSCRIPTIONAL ACTIVATOR - KLUYVEROMYCES LACTIS (YEAST).
HAP3_YEAST HAP3 TRANSCRIPTIONAL ACTIVATOR (UAS2 REGULATORY PROTEIN A) - SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
O04027 F7G19.10 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O13068 DR1, COMPLETE CDS - XENOPUS LAEVIS (AFRICAN CLAWED FROG).
O17286 W10D9.4 PROTEIN - CAENORHABDITIS ELEGANS.
O23310 CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O23633 TRANSCRIPTION FACTOR - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O55078 DNA TOPOISOMERASE II ALPHA (EC 5.99.1.3) - CRICETULUS LONGICAUDATUS (LONG-TAILED HAMSTER) (CHINESE HAMSTER).
O55079 DNA TOPOISOMERASE II ALPHA (EC 5.99.1.3) - CRICETULUS LONGICAUDATUS (LONG-TAILED HAMSTER) (CHINESE HAMSTER).
O59848 HAPC - ASPERGILLUS ORYZAE.
O59864 TYPEII DNA TOPOISOMERASE - EMERICELLA NIDULANS (ASPERGILLUS NIDULANS).
O73744 NUCLEAR Y/CCAAT-BOX BINDING FACTOR B SUBUNIT NF-YB - XENOPUS LAEVIS (AFRICAN CLAWED FROG).
O74336 DNA TOPOISOMERASE II - SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
O76256 NUCLEAR FACTOR Y TRANSCRIPTION FACTOR SUBUNIT B HOMOLOG - SCHISTOSOMA MANSONI (BLOOD FLUKE).
O81130 CCAAT-BOX BINDING FACTOR HAP3 HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O82248 PUTATIVE CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A (CBF-A) - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
PHP3_SCHPO PHP3 TRANSCRIPTIONAL ACTIVATOR - SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
Q00735 PUTATIVE COMPONENT OF CCAAT BINDING COMPLEX HAPC - EMERICELLA NIDULANS (ASPERGILLUS NIDULANS).
Q63091 CCAAT BINDING TRANSCRIPTION FACTOR-B SUBUNIT - RATTUS NORVEGICUS (RAT).
TBAP_HUMAN TATA-BINDING PROTEIN-ASSOCIATED PHOSPHOPROTEIN (DOWN-REGULATOR OF TRANSCRIPTION 1) (DR1 PROTEIN) - HOMO SAPIENS (HUMAN).
TOP2_ARATH DNA TOPOISOMERASE II (EC 5.99.1.3) - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
TOP2_BOMMO DNA TOPOISOMERASE II (EC 5.99.1.3) (TOPOII) - BOMBYX MORI (SILK MOTH).
TOP2_CAEEL PROBABLE DNA TOPOISOMERASE II (EC 5.99.1.3) - CAENORHABDITIS ELEGANS.
TOP2_CANAL DNA TOPOISOMERASE II (EC 5.99.1.3) - CANDIDA ALBICANS (YEAST).
TOP2_DROME DNA TOPOISOMERASE II (EC 5.99.1.3) - DROSOPHILA MELANOGASTER (FRUIT FLY).
TOP2_PEA DNA TOPOISOMERASE II (EC 5.99.1.3) - PISUM SATIVUM (GARDEN PEA).
TOP2_PLAFK DNA TOPOISOMERASE II (EC 5.99.1.3) - PLASMODIUM FALCIPARUM (ISOLATE K1 / THAILAND).
TOP2_SCHPO DNA TOPOISOMERASE II (EC 5.99.1.3) - SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
TOP2_YEAST DNA TOPOISOMERASE II (EC 5.99.1.3) - SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
TP2A_CHICK DNA TOPOISOMERASE II, ALPHA ISOZYME (EC 5.99.1.3) - GALLUS GALLUS (CHICKEN).
TP2A_CRIGR DNA TOPOISOMERASE II, ALPHA ISOZYME (EC 5.99.1.3) - CRICETULUS GRISEUS (CHINESE HAMSTER).
TP2A_HUMAN DNA TOPOISOMERASE II, ALPHA ISOZYME (EC 5.99.1.3) - HOMO SAPIENS (HUMAN).
TP2A_MOUSE DNA TOPOISOMERASE II, ALPHA (EC 5.99.1.3) - MUS MUSCULUS (MOUSE).
TP2A_PIG DNA TOPOISOMERASE II, ALPHA (EC 5.99.1.3) - SUS SCROFA (PIG).
TP2A_RAT DNA TOPOISOMERASE II, ALPHA (EC 5.99.1.3) - RATTUS NORVEGICUS (RAT).
TP2B_CHICK DNA TOPOISOMERASE II, BETA ISOZYME (EC 5.99.1.3) - GALLUS GALLUS (CHICKEN).
TP2B_CRILO DNA TOPOISOMERASE II, BETA ISOZYME (EC 5.99.1.3) - CRICETULUS LONGICAUDATUS (LONG-TAILED HAMSTER) (CHINESE HAMSTER).
TP2B_HUMAN DNA TOPOISOMERASE II, BETA ISOZYME (EC 5.99.1.3) - HOMO SAPIENS (HUMAN).
TP2B_MOUSE DNA TOPOISOMERASE II, BETA ISOZYME (EC 5.99.1.3) - MUS MUSCULUS (MOUSE).

DR1_ARATH DR1 PROTEIN HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O14348 PUTATIVE TRANSCRIPTIONAL REPRESSOR C30D10.02 - SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
TP2M_DICDI DNA TOPOISOMERASE II, MITOCHONDRIAL PRECURSOR (EC 5.99.1.3) - DICTYOSTELIUM DISCOIDEUM (SLIME MOLD).
Scan History
OWL26_2 2 100 NSINGLE
SPTR37_9f 5 90 NSINGLE
Initial Motifs
Motif 1  width=19
Element Seqn Id St Int Rpt
VQECVSEFISFITSEASER CBFA_CHICK 84 84 -
VQECVSEFISFITSEASER CBFA_XENLA 1 1 -
VQECVSEFISFITSEASER CBFA_PETMA 87 87 -
VQECVSEFISFITSEASDK CBFA_MAIZE 63 63 -
MQECVSEFISFVTSEACDR HAP3_KLULA 54 54 -
VQDCVSEFISFVTGEASEQ PHP3_SCHPO 39 39 -

Motif 2 width=19
Element Seqn Id St Int Rpt
CTSGKRKTINGEDILLSLH HAP3_KLULA 73 0 -
CTQEKRKTITGEDVLLALN PHP3_SCHPO 58 0 -
CHQEKRKTINGEDILFAMS CBFA_CHICK 103 0 -
CHQEKRKTINGEDILFAMS CBFA_XENLA 20 0 -
CHQEKRKTINGEDILFAMS CBFA_PETMA 106 0 -
CQREKRKTINGDDLLWAMA CBFA_MAIZE 82 0 -

Motif 3 width=19
Element Seqn Id St Int Rpt
TLGFDSYVEPLKLYLQKFR CBFA_XENLA 39 0 -
TLGFEDYIEPLKVYLQKYR CBFA_MAIZE 101 0 -
ALGFENYAEVLKIYLAKYR HAP3_KLULA 92 0 -
TLGFENYAEVLKISLTKYR PHP3_SCHPO 77 0 -
TLGFDSYVEPLKLYLQKFR CBFA_CHICK 122 0 -
TLGFDSYVEPLKQYLQKYR CBFA_PETMA 125 0 -
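To get a feel for how conserved the seed alignment is, the six motif 1 elements above can be summarised column by column. The Python sketch below uses a plain majority consensus, which is a simple stand-in chosen for illustration and not the scoring method used to build or scan the fingerprint; the function names are hypothetical.

```python
from collections import Counter

# Motif 1 elements copied from the Initial Motifs table (width = 19).
motif1 = [
    "VQECVSEFISFITSEASER",  # CBFA_CHICK
    "VQECVSEFISFITSEASER",  # CBFA_XENLA
    "VQECVSEFISFITSEASER",  # CBFA_PETMA
    "VQECVSEFISFITSEASDK",  # CBFA_MAIZE
    "MQECVSEFISFVTSEACDR",  # HAP3_KLULA
    "VQDCVSEFISFVTGEASEQ",  # PHP3_SCHPO
]


def column_counts(elements):
    """Count residues in each aligned column of equal-width elements."""
    width = len(elements[0])
    if any(len(e) != width for e in elements):
        raise ValueError("all elements must have the same width")
    return [Counter(column) for column in zip(*elements)]


def consensus(elements):
    """Most frequent residue per column (first seen wins on ties)."""
    return "".join(c.most_common(1)[0][0] for c in column_counts(elements))


def invariant_positions(elements):
    """1-based columns where every element has the same residue."""
    return [i for i, c in enumerate(column_counts(elements), start=1) if len(c) == 1]


print(consensus(motif1))
print(invariant_positions(motif1))
```

The same helpers can be applied to the motif 2 and motif 3 elements, since all motifs in this entry share width 19.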
Final Motifs
Motif 1  width=19
Element Seqn Id St Int Rpt
LSEKGFQQISFVNSIATTK TP2B_MOUSE 312 312 -
LSEKGFQQISFVNSIATTK TP2B_CRILO 312 312 -
LSEKGFQQISFVNSIATTK TP2B_HUMAN 324 324 -
MSEKGFQQISFVNSIATSK TP2A_RAT 301 301 -
MSEKGFQQISFVNSIATSK TP2A_CRIGR 302 302 -
MSEKGFQQISFVNSIATSK O55079 302 302 -
MSEKGFQQISFVNSIATSK O55078 302 302 -
LSEKGFQQISFVNSIATTK TP2B_CHICK 329 329 -
VQECVSEFISFITSEASER CBFA_PETMA 87 87 -
MSEKGFQQISFVNSIATSK TP2A_PIG 303 303 -
MSERGFQQISFVNSIATSK TP2A_MOUSE 302 302 -
VQECVSEFISFITSEASER CBFA_HUMAN 86 86 -
VQECVSEFISFITSEASER CBFA_MOUSE 86 86 -
VQECVSEFISFITSEASER Q63091 48 48 -
VQECVSEFISFITSEASER O73744 85 85 -
MSEKGFQQISFVNSIATSK TP2A_HUMAN 303 303 -
MQECVSEFISFITSEASEK Q00735 75 75 -
MQECVSEFISFITSEASEK O59848 75 75 -
LSEKGFQQVSFVNSIATTK TP2A_CHICK 304 304 -
VQECVSEFISFITSEASDK CBFA_MAIZE 63 63 -
MQECVSEFISFVTSEACDR HAP3_KLULA 54 54 -
MQECVSELISFVTSEASDR HAP3_YEAST 69 69 -
VQECVSEFISFITGEASDK O23310 53 53 -
VSDISFQQISFVNSIATTM TOP2_YEAST 294 294 -
ISDMGFQQVSFVNSIATTK TOP2_BOMMO 312 312 -
PSDRGFQQVSFVNSIATYK TOP2_DROME 284 284 -
VQECVSEFISFITSEASDK O23633 53 53 -
VQECVSEFISFITSELPDK O76256 56 56 -
VQDCVSEFISFVTGEASEQ PHP3_SCHPO 39 39 -
LSEGQFQQVSFVNSIATIK TOP2_ARATH 306 306 -
LSDGQFQQVSFVNSIATIK TOP2_PEA 293 293 -
MQECVSEFISFVTGEASDK O82248 83 83 -
VSDGSFNQVSFVNSIATTS TOP2_CANAL 343 343 -
LSEKGFQQVSFVNSIATTK TOP2_CAEEL 337 337 -
VSDGQFKQVSFVNNISTIR TOP2_SCHPO 295 295 -
VSDGQFKQVSFVNNISTIR O74336 349 349 -
VSDGSFKQVSFVNSIATTS O59864 333 333 -
IQECVSEYISFVTGEANER O81130 61 61 -
SDGSQFQQVSFVNSICTTK TOP2_PLAFK 338 338 -
VQECATEFISFVTCEASEK O04027 35 35 -
AQECVSEFISFIASEAAEI O17286 93 93 -
VVNCCTEFIHLISSEANEI TBAP_HUMAN 39 39 -
VVNCCTEFIHLISSEANEI O13068 39 39 -
LQYCHSEWIIFRNAIENFA DBL_HUMAN 84 84 -

Motif 2 width=19
Element Seqn Id St Int Rpt
KHLTYNDFINKELILFSNS TP2B_MOUSE 700 369 -
KHLTYNDFINKELILFSNS TP2B_CRILO 700 369 -
KHLTYNDFINKELILFSNS TP2B_HUMAN 712 369 -
MYLTYNDFINKELILFSNS TP2A_RAT 689 369 -
TYLTYNDFINKELILFSNS TP2A_CRIGR 690 369 -
TYLTYNDFINKELILFSNS O55079 690 369 -
TYLTYNDFINKELILFSNS O55078 690 369 -
KHLTYNDFINKELILFSNS TP2B_CHICK 717 369 -
CHQEKRKTINGEDILFAMS CBFA_PETMA 106 0 -
TYLTYNDFINKELILFSNS TP2A_PIG 691 369 -
SYLTYNDFINKELILFSNS TP2A_MOUSE 690 369 -
CHQEKRKTINGEDILFAMS CBFA_HUMAN 105 0 -
CHQEKRKTINGEDILFAMS CBFA_MOUSE 105 0 -
CHQEKRKTINGEDILFAMS Q63091 67 0 -
CHQEKRKTINGEDILFAMS O73744 104 0 -
TYLTYNDFINKELILFSNS TP2A_HUMAN 691 369 -
CQQEKRKTVNGEDILFAMT Q00735 94 0 -
CQQEKRKTVNGEDILFAMT O59848 94 0 -
NYLTYNDFINKELVLFSNS TP2A_CHICK 692 369 -
CQREKRKTINGDDLLWAMA CBFA_MAIZE 82 0 -
CTSGKRKTINGEDILLSLH HAP3_KLULA 73 0 -
CAADKRKTINGEDILISLH HAP3_YEAST 88 0 -
CQREKRKTINGDDLLWAMT O23310 72 0 -
KEIPISDFINKELILFSLA TOP2_YEAST 668 355 -
KTVTYSDFVNLELVLFSNG TOP2_BOMMO 699 368 -
KSITYADFINLELVLFSNA TOP2_DROME 671 368 -
CQKEKRKTVNGDDLLWAMA O23633 72 0 -
CQTEKRKTINGEDILCAMN O76256 75 0 -
CTQEKRKTITGEDVLLALN PHP3_SCHPO 58 0 -
PKVTYSDFVNKELILFSMA TOP2_ARATH 680 355 -
KLINYKDFVNKELILFSRA TOP2_PEA 666 354 -
CHKEKRKTVNGDDICWAMA O82248 102 0 -
TEIPISDFINKEFILFSMS TOP2_CANAL 728 366 -
RFVTFKDFVNRELVLFSNL TOP2_CAEEL 726 370 -
PQIPIDDFINRELIQFSMA TOP2_SCHPO 667 353 -
PQIPIDDFINRELIQFSMA O74336 721 353 -
AKISYTDFINNELIQFSMA O59864 711 359 -
CQREQRKTITAEDILWAMS O81130 80 0 -
KDLSYYDFVNKELIYYSRY TOP2_PLAFK 715 358 -
CHRENRKTVNGDDIWWALS O04027 54 0 -
CNITKRKTITADDLLTAME O17286 112 0 -
CNKSEKKTISPEHVIQALE TBAP_HUMAN 58 0 -
CNKSEKKTISPEHVIQALE O13068 58 0 -
ANQEIDKFQSKEDAQKALQ DBL_HUMAN 346 243 -

Motif 3 width=19
Element Seqn Id St Int Rpt
SLTKEKVEELIKQRDTKGR TP2B_MOUSE 1141 422 -
SLTKEKVEELIKQRDTKGR TP2B_CRILO 1141 422 -
SLTKEKVEELIKQRDAKGR TP2B_HUMAN 1153 422 -
YLTKEKKDELCKQRDEKEQ TP2A_RAT 1132 424 -
YLTKEKKDELCKQRNEKEQ TP2A_CRIGR 1130 421 -
YLTKEKKDELCKQRNEKEQ O55079 1130 421 -
YLTKEKKDELCKQRNEKEQ O55078 1130 421 -
SLTKEKVEELIKHRDSKER TP2B_CHICK 1158 422 -
TLGFDSYVEPLKQYLQKYR CBFA_PETMA 125 0 -
YLTKEKKDELCKLRNEKEQ TP2A_PIG 1135 425 -
YLTKEKKDELCKQRNEKEQ TP2A_MOUSE 1132 423 -
TLGFDSYVEPLKLYLQKFR CBFA_HUMAN 124 0 -
TLGFDSYVEPLKLYLQKFR CBFA_MOUSE 124 0 -
TLGFDSYVEPLKLYLQKFR Q63091 86 0 -
RLGFDSYVEPLKLYLQKFR O73744 123 0 -
YLTKEKKDELCRLRNEKEQ TP2A_HUMAN 1135 425 -
SLGFENYAEALKIYLSKYR Q00735 113 0 -
SLGFENYAEALKIYLSKYR O59848 113 0 -
YLTKEKKDELCKQRDNKDK TP2A_CHICK 1127 416 -
TLGFEDYIEPLKVYLQKYR CBFA_MAIZE 101 0 -
ALGFENYAEVLKIYLAKYR HAP3_KLULA 92 0 -
ALGFENYAEVLKIYLAKYR HAP3_YEAST 107 0 -
TLGFEDYVEPLKVYLQKYR O23310 91 0 -
SLTKERYQKLLKQKQEKET TOP2_YEAST 1123 436 -
MLTKEKKDELLKQRDQKLT TOP2_BOMMO 1156 438 -
MLTEEKKNELLKQRDTKLS TOP2_DROME 1133 443 -
TLGFEDYLEPLKIYLARYR O23633 91 0 -
TLGFDNYIEPLRAFLVKFR O76256 94 0 -
TLGFENYAEVLKISLTKYR PHP3_SCHPO 77 0 -
SLTIEKVEELLADRDKMII TOP2_ARATH 1127 428 -
TLTLESVQKLLDEKTEKEK TOP2_PEA 1095 410 -
NLGFDDYAAQLKKYLHRYR O82248 121 0 -
SLTYERFMRIMQQRDQKEA TOP2_CANAL 1190 443 -
KLSEEEKNKLIKESEEKMA TOP2_CAEEL 1183 438 -
SLTYERYVELLKKKDEVMA TOP2_SCHPO 1105 419 -
SLTYERYVELLKKKDEVMA O74336 1159 419 -
SLTQERVEKLRRQIGEKEH O59864 1151 421 -
KLGFDNYVDPLTVFINRYR O81130 99 0 -
SLTLEKVEDLLTQLKEKER TOP2_PLAFK 1155 421 -
TLGLDNYADAVGRHLHKYR O04027 73 0 -
ATGFDNYAEPMRIFLQKYR O17286 131 0 -
SLGFGSYISEVKEVLQECK TBAP_HUMAN 77 0 -
SLGFGSYISEVKEVLQECK O13068 77 0 -
RLRLDSYLLKPVQRITKYQ DBL_HUMAN 621 256 -
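
The St and Int columns in the motif tables appear to be related by simple arithmetic. The Python sketch below assumes a convention inferred from the values listed in this entry, not taken from any format specification: Int for motif 1 equals its St, and Int for each later motif is the gap between the end of the previous motif and the start of the current one. The function name is hypothetical.

```python
WIDTH = 19  # every motif in this entry has width=19


def expected_intervals(starts, width=WIDTH):
    """Derive Int values from St values under the assumed convention."""
    intervals = [starts[0]]  # motif 1: Int equals St in this entry
    for previous, current in zip(starts, starts[1:]):
        intervals.append(current - (previous + width))
    return intervals


# (St values, Int values) for motifs 1-3, copied from the Final Motifs tables.
rows = {
    "TP2B_MOUSE": ([312, 700, 1141], [312, 369, 422]),
    "DBL_HUMAN": ([84, 346, 621], [84, 243, 256]),
    "CBFA_HUMAN": ([86, 105, 124], [86, 0, 0]),
}

for code, (starts, listed) in rows.items():
    print(code, expected_intervals(starts) == listed)
```

Under this convention, an Int of 0 for motifs 2 and 3 (as in the CBFA and HAP3 rows) means the three motifs are directly adjacent, whereas the topoisomerase II rows have gaps of several hundred residues between motifs.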