SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR01546

Identifier
YEAST73DUF  [View Relations]  [View Alignment]  
Accession
PR01546
No. of Motifs
13
Creation Date
05-MAY-2001
Title
Saccharomyces cerevisiae 73.5kDa hypothetical protein signature
Database References
Literature References
1. TIZON, B., RODRIGUEZ-TORRES, M., RODRIGUEZ-BELMONTE, E., CADAHIA, J.L.
AND CERDAN, E.
Identification of a putative methylenetetrahydrofolate reductase by sequence
analysis of a 6.8kb DNA fragment of yeast chromosome VII.
YEAST 12 1047-1051 (1996).

Documentation
The sequence of a 6.8kb DNA fragment from Saccharomyces cerevisiae 
chromosome VII has been analysed [1]. The sequence was found to contain
five open reading frames (ORFs) greater than 100 amino acids in length. One 
of these (a 73.5kDa protein) shares similarity with the 58.0kDa SPAC1D4.03C
from Schizosaccharomyces pombe, and with hypothetical proteins from Homo 
sapiens, Drosophila melanogaster, Caenorhabditis elegans and Fugu rubripes. 
 
The sequences are characterised by a variable N-terminal domain and a more
conserved C-terminal domain. They share no similarity with any other known, 
functionally or structurally characterised proteins. 
 
YEAST73DUF is a 13-element fingerprint that provides a signature for the
Saccharomyces cerevisiae 73.5kDa protein and related proteins. The 
fingerprint was derived from an initial alignment of 6 sequences: the motifs
were drawn from conserved regions spanning the C-terminal portion of the
alignment. A single iteration on SPTR39_15f was required to reach 
convergence, no further sequences being identified beyond the starting set.
Summary Information
6 codes involving 13 elements
0 codes involving 12 elements
0 codes involving 11 elements
0 codes involving 10 elements
0 codes involving 9 elements
0 codes involving 8 elements
0 codes involving 7 elements
0 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
136666666666666
120000000000000
110000000000000
100000000000000
90000000000000
80000000000000
70000000000000
60000000000000
50000000000000
40000000000000
30000000000000
20000000000000
12345678910111213
True Positives
O94949        Q20298        Q9VR38        Q9YGN1        
YAT3_SCHPO YGM4_YEAST
Sequence Titles
O94949      KIAA0872 PROTEIN - Homo sapiens (Human).      
Q20298 SIMILARITY TO PLASMODIUM YOELII RHOPTRY PROTEIN - Caenorhabditis elegans.
Q9VR38 CG11926 PROTEIN - Drosophila melanogaster (Fruit fly).
Q9YGN1 SAND PROTEIN - Fugu rubripes (Japanese pufferfish) (Takifugu rubripes).
YAT3_SCHPO HYPOTHETICAL 58.0 KDA PROTEIN C1D4.03C IN CHROMOSOME I - Schizosaccharomyces pombe (Fission yeast).
YGM4_YEAST HYPOTHETICAL 73.5 KDA PROTEIN IN MET13-RPS2 INTERGENIC REGION - Saccharomyces cerevisiae (Baker's yeast).
Scan History
SPTR39_15f 1  125  NSINGLE    
Initial Motifs
Motif 1  width=18
Element Seqn Id St Int Rpt
KHVFVLSEAGKPIYTRYG Q9YGN1 121 121 -
KHVFVLSEAGKPIYSRYG O94949 110 110 -
KHIFILSEAGKPIFSLHG Q9VR38 117 117 -
FQVFILSEFGKPIFVNND Q20298 631 631 -
KNFFIFTSAGKPIYCMHG YGM4_YEAST 182 182 -
RTYLIFSSSGKPVFSNIV YAT3_SCHPO 114 114 -

Motif 2 width=21
Element Seqn Id St Int Rpt
SEEALSSTMGVMMALVSFVES Q9YGN1 139 0 -
SVEALSATMGVMTALVSFVQS O94949 128 0 -
NEDKLATLFGVIQALVSFVQM Q9VR38 135 0 -
NEGEIVSLVALICAFVSRCQS Q20298 650 1 -
KDEQIMSYTGLVNTVISYFQV YGM4_YEAST 200 0 -
DDSIEPSTVGALQAIISSFEV YAT3_SCHPO 132 0 -

Motif 3 width=15
Element Seqn Id St Int Rpt
KVIFLAKSPLVLVGV Q9YGN1 173 13 -
KLVFLQQGPLLLVAM O94949 162 13 -
KFAFMQRSSLILVAA Q9VR38 169 13 -
HIQFLHKSPLIFCVV Q20298 684 13 -
RLTFLDKSPILLMAQ YGM4_YEAST 237 16 -
VIVVLSKNPLYLVGV YAT3_SCHPO 166 13 -

Motif 4 width=29
Element Seqn Id St Int Rpt
ELLRELQYIYYQIVSLLTLTQLNHIFQNK Q9YGN1 196 8 -
QLRGELLAVHAQIVSTLTRASVARIFAHK O94949 185 8 -
QLQLQLGDVYNQILSILTYSHMTKIFERR Q9VR38 192 8 -
QLDQQLEVLFEQICSILSKSQLENVYKKK Q20298 724 25 -
ELLNQLDFLYSYILSSLSERQLLRLFSKR YGM4_YEAST 260 8 -
YLLSELNLLYCQILTGVTAKAMQLTLNSR YAT3_SCHPO 190 9 -

Motif 5 width=13
Element Seqn Id St Int Rpt
QNYDLRRLLAGSE Q9YGN1 225 0 -
QNYDLRRLLAGSE O94949 214 0 -
KNFDLRRLLSGSE Q9VR38 221 0 -
DNYDLRKLLRGTD Q20298 754 1 -
ENFDLRNYLESTD YGM4_YEAST 289 0 -
PNFDLRRLIGSNE YAT3_SCHPO 219 0 -

Motif 6 width=22
Element Seqn Id St Int Rpt
LLSAVTCLPLSNSVRDVVSSSL Q9YGN1 254 16 -
LLGAVRCVPLARPLRDALGALL O94949 244 17 -
LTNSIRVFPLPTTIRSQITSAI Q9VR38 257 23 -
VDSSISAIPMNPSDREFLSTTM Q20298 784 17 -
LLNSLQCLPFNHSSRLKLQNVV YGM4_YEAST 321 19 -
TLNAISPLPLRSSFRDQLSQLL YAT3_SCHPO 249 17 -

Motif 7 width=14
Element Seqn Id St Int Rpt
AKAKNLVFSILLAG Q9YGN1 278 2 -
CTAPGLALSVLAVG O94949 268 2 -
SKIKNLVFAVLIAN Q9VR38 283 4 -
AKLDGALFGIMIAR Q20298 812 6 -
IPRGTLLYGLIIAP YGM4_YEAST 352 9 -
ETPKSLLFTFIAIR YAT3_SCHPO 273 2 -

Motif 8 width=17
Element Seqn Id St Int Rpt
DQFLHHIDLHLVMNLVG Q9YGN1 302 10 -
ECRLDPADLQLLLDWVG O94949 296 14 -
KYSIHPADLRLIFNLVE Q9VR38 307 10 -
KYMIHPRDLNIVINLVS Q20298 836 10 -
GHTLHTTDLHLLFCLIS YGM4_YEAST 377 11 -
KLLLHANDLYLLFLSLF YAT3_SCHPO 297 10 -

Motif 9 width=24
Element Seqn Id St Int Rpt
EGWTPICLPKFNTAGFFHAHISYL Q9YGN1 327 8 -
EAWAPVCLPRFNPDGFFYAYVARL O94949 320 7 -
ENWSPICLPKFDMNGYLHAHVSYL Q9VR38 332 8 -
QNWVPICLPRFNDTGFFYAYISYP Q20298 861 8 -
ELWVPICFPKFNSSGFLYCYIKFL YGM4_YEAST 404 10 -
EHWVPVCFPTLNPDAYIYIYSYFL YAT3_SCHPO 323 9 -

Motif 10 width=27
Element Seqn Id St Int Rpt
CLILVSTDREDFFNMSDCKQRFLERLT Q9YGN1 357 6 -
CLLLLGTQREAFHAMAACRRLVEDGMH O94949 349 5 -
CLLLLSVDRDAFFTLAEAKAKITEKLR Q9VR38 362 6 -
CIVLLSVKRDHFDGLKEVRQQIVTKLE Q20298 895 10 -
ALVLISAQKDAFFSLKSFSDELIIKLE YGM4_YEAST 438 10 -
VLIMGSSESGVFFEMQSVKCKVAQEIQ YAT3_SCHPO 351 4 -

Motif 11 width=15
Element Seqn Id St Int Rpt
LLAWVTNGFQLYLCF Q9YGN1 471 87 -
LLAWVTSKFELYTCL O94949 478 102 -
VLAWATGTYELYAIF Q9VR38 479 90 -
LFVWVTDLFSLYCIF Q20298 1012 90 -
GMAWVTPTFELYLIG YGM4_YEAST 594 129 -
LFTWSTASFDFHCIA YAT3_SCHPO 464 86 -

Motif 12 width=14
Element Seqn Id St Int Rpt
PLGTKAMAVSAVNK Q9YGN1 487 1 -
PLVTKAGAILVVTK O94949 494 1 -
PVVDKATVIKYVDK Q9VR38 495 1 -
PFVTATIAFQVVEK Q20298 1028 1 -
GIVDKRVLFKSARK YGM4_YEAST 611 2 -
ATTSSQLLIANVNK YAT3_SCHPO 480 1 -

Motif 13 width=21
Element Seqn Id St Int Rpt
KLLKWIRKEEDRLFILSPLTY Q9YGN1 500 -1 -
KLLRWVKKEEDRLFIRYPPKY O94949 507 -1 -
KLIKWIEKEYDVYFIRNHATF Q9VR38 508 -1 -
KLLKSLKSHEQRYFIINSTSF Q20298 1041 -1 -
KVANWCQKHESRLFISDGAVF YGM4_YEAST 624 -1 -
KILRWIRREENRLFIQTNLSF YAT3_SCHPO 493 -1 -
Final Motifs
Motif 1  width=18
Element Seqn Id St Int Rpt
KHVFVLSEAGKPIYTRYG Q9YGN1 121 121 -
KHVFVLSEAGKPIYSRYG O94949 110 110 -
KHIFILSEAGKPIFSLHG Q9VR38 117 117 -
FQVFILSEFGKPIFVNND Q20298 631 631 -
KNFFIFTSAGKPIYCMHG YGM4_YEAST 182 182 -
RTYLIFSSSGKPVFSNIV YAT3_SCHPO 114 114 -

Motif 2 width=21
Element Seqn Id St Int Rpt
SEEALSSTMGVMMALVSFVES Q9YGN1 139 0 -
SVEALSATMGVMTALVSFVQS O94949 128 0 -
NEDKLATLFGVIQALVSFVQM Q9VR38 135 0 -
NEGEIVSLVALICAFVSRCQS Q20298 650 1 -
KDEQIMSYTGLVNTVISYFQV YGM4_YEAST 200 0 -
DDSIEPSTVGALQAIISSFEV YAT3_SCHPO 132 0 -

Motif 3 width=15
Element Seqn Id St Int Rpt
KVIFLAKSPLVLVGV Q9YGN1 173 13 -
KLVFLQQGPLLLVAM O94949 162 13 -
KFAFMQRSSLILVAA Q9VR38 169 13 -
HIQFLHKSPLIFCVV Q20298 684 13 -
RLTFLDKSPILLMAQ YGM4_YEAST 237 16 -
VIVVLSKNPLYLVGV YAT3_SCHPO 166 13 -

Motif 4 width=29
Element Seqn Id St Int Rpt
ELLRELQYIYYQIVSLLTLTQLNHIFQNK Q9YGN1 196 8 -
QLRGELLAVHAQIVSTLTRASVARIFAHK O94949 185 8 -
QLQLQLGDVYNQILSILTYSHMTKIFERR Q9VR38 192 8 -
QLDQQLEVLFEQICSILSKSQLENVYKKK Q20298 724 25 -
ELLNQLDFLYSYILSSLSERQLLRLFSKR YGM4_YEAST 260 8 -
YLLSELNLLYCQILTGVTAKAMQLTLNSR YAT3_SCHPO 190 9 -

Motif 5 width=13
Element Seqn Id St Int Rpt
QNYDLRRLLAGSE Q9YGN1 225 0 -
QNYDLRRLLAGSE O94949 214 0 -
KNFDLRRLLSGSE Q9VR38 221 0 -
DNYDLRKLLRGTD Q20298 754 1 -
ENFDLRNYLESTD YGM4_YEAST 289 0 -
PNFDLRRLIGSNE YAT3_SCHPO 219 0 -

Motif 6 width=22
Element Seqn Id St Int Rpt
LLSAVTCLPLSNSVRDVVSSSL Q9YGN1 254 16 -
LLGAVRCVPLARPLRDALGALL O94949 244 17 -
LTNSIRVFPLPTTIRSQITSAI Q9VR38 257 23 -
VDSSISAIPMNPSDREFLSTTM Q20298 784 17 -
LLNSLQCLPFNHSSRLKLQNVV YGM4_YEAST 321 19 -
TLNAISPLPLRSSFRDQLSQLL YAT3_SCHPO 249 17 -

Motif 7 width=14
Element Seqn Id St Int Rpt
AKAKNLVFSILLAG Q9YGN1 278 2 -
CTAPGLALSVLAVG O94949 268 2 -
SKIKNLVFAVLIAN Q9VR38 283 4 -
AKLDGALFGIMIAR Q20298 812 6 -
IPRGTLLYGLIIAP YGM4_YEAST 352 9 -
ETPKSLLFTFIAIR YAT3_SCHPO 273 2 -

Motif 8 width=17
Element Seqn Id St Int Rpt
DQFLHHIDLHLVMNLVG Q9YGN1 302 10 -
ECRLDPADLQLLLDWVG O94949 296 14 -
KYSIHPADLRLIFNLVE Q9VR38 307 10 -
KYMIHPRDLNIVINLVS Q20298 836 10 -
GHTLHTTDLHLLFCLIS YGM4_YEAST 377 11 -
KLLLHANDLYLLFLSLF YAT3_SCHPO 297 10 -

Motif 9 width=24
Element Seqn Id St Int Rpt
EGWTPICLPKFNTAGFFHAHISYL Q9YGN1 327 8 -
EAWAPVCLPRFNPDGFFYAYVARL O94949 320 7 -
ENWSPICLPKFDMNGYLHAHVSYL Q9VR38 332 8 -
QNWVPICLPRFNDTGFFYAYISYP Q20298 861 8 -
ELWVPICFPKFNSSGFLYCYIKFL YGM4_YEAST 404 10 -
EHWVPVCFPTLNPDAYIYIYSYFL YAT3_SCHPO 323 9 -

Motif 10 width=27
Element Seqn Id St Int Rpt
CLILVSTDREDFFNMSDCKQRFLERLT Q9YGN1 357 6 -
CLLLLGTQREAFHAMAACRRLVEDGMH O94949 349 5 -
CLLLLSVDRDAFFTLAEAKAKITEKLR Q9VR38 362 6 -
CIVLLSVKRDHFDGLKEVRQQIVTKLE Q20298 895 10 -
ALVLISAQKDAFFSLKSFSDELIIKLE YGM4_YEAST 438 10 -
VLIMGSSESGVFFEMQSVKCKVAQEIQ YAT3_SCHPO 351 4 -

Motif 11 width=15
Element Seqn Id St Int Rpt
LLAWVTNGFQLYLCF Q9YGN1 471 87 -
LLAWVTSKFELYTCL O94949 478 102 -
VLAWATGTYELYAIF Q9VR38 479 90 -
LFVWVTDLFSLYCIF Q20298 1012 90 -
GMAWVTPTFELYLIG YGM4_YEAST 594 129 -
LFTWSTASFDFHCIA YAT3_SCHPO 464 86 -

Motif 12 width=14
Element Seqn Id St Int Rpt
PLGTKAMAVSAVNK Q9YGN1 487 1 -
PLVTKAGAILVVTK O94949 494 1 -
PVVDKATVIKYVDK Q9VR38 495 1 -
PFVTATIAFQVVEK Q20298 1028 1 -
GIVDKRVLFKSARK YGM4_YEAST 611 2 -
ATTSSQLLIANVNK YAT3_SCHPO 480 1 -

Motif 13 width=21
Element Seqn Id St Int Rpt
KLLKWIRKEEDRLFILSPLTY Q9YGN1 500 -1 -
KLLRWVKKEEDRLFIRYPPKY O94949 507 -1 -
KLIKWIEKEYDVYFIRNHATF Q9VR38 508 -1 -
KLLKSLKSHEQRYFIINSTSF Q20298 1041 -1 -
KVANWCQKHESRLFISDGAVF YGM4_YEAST 624 -1 -
KILRWIRREENRLFIQTNLSF YAT3_SCHPO 493 -1 -