PRINTS View

PR00840

Identifier
Y06768FAMILY
Accession
PR00840
No. of Motifs
6
Creation Date
15-MAR-1998  (UPDATE 03-JUL-1999)
Title
MG067/MG068/MG395 hypothetical protein family signature
Database References

INTERPRO; IPR002414
Literature References
1. FRASER, C.M., GOCAYNE, J.D., WHITE, O., ADAMS, M.D., CLAYTON, R.A.,    
FLEISCHMANN, R.D., BULT, C.J., KERLAVAGE, A.R., SUTTON, G., KELLEY, J.M.,
FRITCHMAN, J.L., WEIDMAN, J.F., SMALL, K.V., SANDUSKY, M., FUHRMANN, J.,
NGUYEN, D., UTTERBACK, T.R., SAUDEK, D.M., PHILLIPS, C.A., MERRICK, J.M.,
TOMB, J.F., DOUGHERTY, B.A., BOTT, K.F., HU, P.C., LUCIER, T.S.,
PETERSON, S.N., SMITH, H.O., HUTCHISON, C.A. AND VENTER, J.C.
The minimal gene complement of Mycoplasma genitalium. 
SCIENCE 270 397-403 (1995). 
 
2. HIMMELREICH, R., HILBERT, H., PLAGENS, H., PIRKL, E., LI, B.C.
AND HERRMANN, R.
Complete sequence analysis of the genome of the bacterium Mycoplasma
pneumoniae.
NUCLEIC ACIDS RES. 24 4420-4449 (1996).

Documentation
Mycoplasma genitalium has the smallest known genome of any free-living 
organism. Its complete genome sequence has been determined by whole-
genome random sequencing and assembly [1]. Only 470 putative coding regions
were identified, including genes for DNA replication, transcription and
translation, DNA repair, cellular transport and energy metabolism [1]. 
One family of hypothetical proteins, which includes the products of the
MG067/MG068/MG395 genes, has been shown to have homologues of similarly
unknown function in M. pneumoniae [2].
 
Y06768FAMILY is a 6-element fingerprint that provides a signature for 
the Mycoplasma MG067/MG068/MG395 family of hypothetical proteins. The
fingerprint was derived from an initial alignment of 5 sequences. The
motifs were drawn from short conserved regions spanning the full alignment
length; motif 5 lies in a region that bears some similarity to the
conserved serine active site motif of the V8 family of serine proteases
(cf. PROSITE pattern V8_SER (PS00673)). Two iterations on OWL30.0 were
required to reach convergence, at which point a true set comprising 9
sequences was identified. Several partial matches were also found, all
of which are fragments.
 
An update on SPTR37_9f identified a true set of 9 sequences and 10
partial matches.
Summary Information
9 codes involving 6 elements
0 codes involving 5 elements
1 codes involving 4 elements
8 codes involving 3 elements
1 codes involving 2 elements
Composite Feature Index
   6|  9  9  9  9  9  9
   5|  0  0  0  0  0  0
   4|  0  1  1  1  1  0
   3|  4  4  4  4  4  4
   2|  0  0  0  1  1  0
  --+------------------
    |  1  2  3  4  5  6
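
The Composite Feature Index can be cross-checked against the Summary Information. The sketch below assumes the usual reading of the index: each row is labelled with the number of elements matched, and each column gives how many sequences in that class matched motifs 1 to 6. Under that assumption, each row should sum to (elements matched) x (number of codes). The names `cfi`, `codes` and `cfi_is_consistent` are illustrative only and are not part of PRINTS.

```python
# Composite Feature Index as listed in this entry.
# Key: number of elements matched; value: match counts for motifs 1..6.
cfi = {
    6: [9, 9, 9, 9, 9, 9],
    5: [0, 0, 0, 0, 0, 0],
    4: [0, 1, 1, 1, 1, 0],
    3: [4, 4, 4, 4, 4, 4],
    2: [0, 0, 0, 1, 1, 0],
}

# Summary Information: number of codes (sequences) per element count.
codes = {6: 9, 5: 0, 4: 1, 3: 8, 2: 1}


def cfi_is_consistent(cfi, codes):
    """Return True if every row sums to n_elements * n_codes.

    Assumes the row/column interpretation stated in the lead-in.
    """
    return all(sum(row) == n * codes[n] for n, row in cfi.items())


print(cfi_is_consistent(cfi, codes))
```

For example, the 8 codes matching 3 elements account for 24 motif matches, which is the sum of the row labelled 3.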
True Positives
P75198 Q50335 Q50339 Y067_MYCGE
Y067_MYCPN Y068_MYCGE Y068_MYCPN Y395_MYCGE
YKDA_MYCCA
True Positive Partials
Codes involving 4 elements
P75203
Codes involving 3 elements
P75193 P75194 P75195 P75196
P75199 P75200 Q50336 Q50337
Codes involving 2 elements
Q50338
Sequence Titles
P75198 PUTATIVE LIPOPROTEIN - MYCOPLASMA PNEUMONIAE.
Q50335 PUTATIVE LIPOPROTEIN - MYCOPLASMA PNEUMONIAE.
Q50339 PUTATIVE LIPOPROTEIN - MYCOPLASMA PNEUMONIAE.
Y067_MYCGE HYPOTHETICAL LIPOPROTEIN MG067 PRECURSOR - MYCOPLASMA GENITALIUM.
Y067_MYCPN HYPOTHETICAL LIPOPROTEIN MG067 HOMOLOG PRECURSOR - MYCOPLASMA PNEUMONIAE.
Y068_MYCGE HYPOTHETICAL LIPOPROTEIN MG068 PRECURSOR - MYCOPLASMA GENITALIUM.
Y068_MYCPN HYPOTHETICAL LIPOPROTEIN MG068 HOMOLOG PRECURSOR - MYCOPLASMA PNEUMONIAE.
Y395_MYCGE HYPOTHETICAL LIPOPROTEIN MG395 PRECURSOR - MYCOPLASMA GENITALIUM.
YKDA_MYCCA HYPOTHETICAL 75.9 KD PROTEIN IN KDTB 5'REGION (ORFA) - MYCOPLASMA CAPRICOLUM.

P75203 MG068 HOMOLOG - MYCOPLASMA PNEUMONIAE.

P75193 MG068 HOMOLOG - MYCOPLASMA PNEUMONIAE.
P75194 MG067 HOMOLOG - MYCOPLASMA PNEUMONIAE.
P75195 PUTATIVE LIPOPROTEIN - MYCOPLASMA PNEUMONIAE.
P75196 MG067 HOMOLOG - MYCOPLASMA PNEUMONIAE.
P75199 MG068 HOMOLOG - MYCOPLASMA PNEUMONIAE.
P75200 MG395 HOMOLOG - MYCOPLASMA PNEUMONIAE.
Q50336 D02_ORF353V - MYCOPLASMA PNEUMONIAE.
Q50337 PUTATIVE LIPOPROTEIN - MYCOPLASMA PNEUMONIAE.

Q50338 D02_ORF157L - MYCOPLASMA PNEUMONIAE.
Scan History
OWL30_0 2 100 NSINGLE
SPTR37_9f 2 50 NSINGLE
Initial Motifs
Motif 1  width=20
Element Seqn Id St Int Rpt
IPEESDIYRKGYDLTFTLNF Y395_MYCGE 85 85 -
VPGKDDIYSKFYDLTFSLNF Y067_MYCPN 48 48 -
IPTQGDVYHDNYDLTFSLNF Y068_MYCPN 41 41 -
IPTVSDPYHINYDLTFSLNF Y068_MYCGE 44 44 -
TVPKNNFYEKFYDLTFALNF Y067_MYCGE 48 48 -

Motif 2 width=13
Element Seqn Id St Int Rpt
SYGTGWLIDWKGD Y395_MYCGE 111 6 -
EFGTGWLIDWKGD Y067_MYCPN 74 6 -
SYGTGWLIDWKGD Y068_MYCPN 67 6 -
TYGTGWLIDWKGD Y068_MYCGE 70 6 -
EFGTGWLIDWKGD Y067_MYCGE 74 6 -

Motif 3 width=21
Element Seqn Id St Int Rpt
YIATNLHVADGLRNIGDHWPY Y395_MYCGE 136 12 -
YIATNLHVADGLKNDQDYAPY Y067_MYCPN 121 34 -
YLATNLHVVDALRNPQDYEPY Y068_MYCPN 92 12 -
YLATNLHVIDALRNNNDYEPY Y068_MYCGE 95 12 -
YIATNLHLIDGLKNDHDYQPY Y067_MYCGE 122 35 -

Motif 4 width=21
Element Seqn Id St Int Rpt
YQQYGYGLMLDDTNLPGGSSG Y395_MYCGE 392 235 -
YKQFGYGTILWDTNFGGGSSG Y067_MYCPN 411 269 -
YQQYGKGLALANTNFSGGSSG Y068_MYCPN 374 261 -
YRQYGRGFALQNTNFRPGSSG Y068_MYCGE 347 231 -
YKLFGYGTILNNTNFPGGSSG Y067_MYCGE 400 257 -

Motif 5 width=18
Element Seqn Id St Int Rpt
GSAIFNNNQKINSIYFGV Y395_MYCGE 412 -1 -
GSAIFNQNKQINSIYFGA Y067_MYCPN 431 -1 -
GTLVLNQQKQISGVYFGV Y068_MYCPN 394 -1 -
GTLMLNNQKQIAGIYFGV Y068_MYCGE 367 -1 -
GSAVFNKEKQLTSIYFGS Y067_MYCGE 420 -1 -

Motif 6 width=19
Element Seqn Id St Int Rpt
YDLIFGDSNTKSFYAQFAK Y395_MYCGE 473 43 -
YDLIFGDVNTTNFYAQFAK Y067_MYCPN 484 35 -
YDIIFGNKDTKNYYAQFAK Y068_MYCPN 461 49 -
YDLIFGDSNTTNFYAKFAR Y068_MYCGE 423 38 -
YDLIFGDKNTIKFYAQFAK Y067_MYCGE 473 35 -
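
To illustrate what "short conserved regions" means in practice, the following minimal sketch marks the invariant columns of initial Motif 2 as listed above. The function name `invariant_columns` and the use of `.` for variable positions are assumptions made for this illustration; this is not how PRINTS itself scores motifs.

```python
# Initial Motif 2 elements, copied from the table above.
motif2 = [
    "SYGTGWLIDWKGD",  # Y395_MYCGE
    "EFGTGWLIDWKGD",  # Y067_MYCPN
    "SYGTGWLIDWKGD",  # Y068_MYCPN
    "TYGTGWLIDWKGD",  # Y068_MYCGE
    "EFGTGWLIDWKGD",  # Y067_MYCGE
]


def invariant_columns(seqs, variable="."):
    """Keep a residue where all sequences agree, else emit `variable`."""
    return "".join(
        col[0] if len(set(col)) == 1 else variable
        for col in zip(*seqs)  # iterate over alignment columns
    )


print(invariant_columns(motif2))  # prints ..GTGWLIDWKGD
```

Only the first two positions vary among the five seed sequences; the remaining eleven are identical.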
Final Motifs
Motif 1  width=20
Element Seqn Id St Int Rpt
IPEESDIYRKGYDLTFTLNF Y395_MYCGE 85 85 -
VPGKDDIYSKFYDLTFSLNF Y067_MYCPN 48 48 -
IPTQGDVYHDNYDLTFSLNF Y068_MYCPN 41 41 -
IPTVSDPYHINYDLTFSLNF Y068_MYCGE 44 44 -
VPSESDVFKHNYDLTFSLNF Q50339 71 71 -
TVPKNNFYEKFYDLTFALNF Y067_MYCGE 48 48 -
IPKVNTKYRPGYDLSFALKF Q50335 100 100 -
IPGSAEIDHANYDLTFFLGF P75198 40 40 -
TVSAEEIYKELYDRTFSIKF YKDA_MYCCA 188 188 -

Motif 2 width=13
Element Seqn Id St Int Rpt
SYGTGWLIDWKGD Y395_MYCGE 111 6 -
EFGTGWLIDWKGD Y067_MYCPN 74 6 -
SYGTGWLIDWKGD Y068_MYCPN 67 6 -
TYGTGWLIDWKGD Y068_MYCGE 70 6 -
IYGTGWLFDWKGD Q50339 97 6 -
EFGTGWLIDWKGD Y067_MYCGE 74 6 -
AYGTGWLIDWKDV Q50335 126 6 -
IHGTGWLIDWKEV P75198 78 18 -
GTGTGWLLDYHKY YKDA_MYCCA 220 12 -

Motif 3 width=21
Element Seqn Id St Int Rpt
YIATNLHVADGLRNIGDHWPY Y395_MYCGE 136 12 -
YIATNLHVADGLKNDQDYAPY Y067_MYCPN 121 34 -
YLATNLHVVDALRNPQDYEPY Y068_MYCPN 92 12 -
YLATNLHVIDALRNNNDYEPY Y068_MYCGE 95 12 -
YLATNLHVADALRNDQDYEPY Q50339 140 30 -
YIATNLHLIDGLKNDHDYQPY Y067_MYCGE 122 35 -
YLATNLHVADSLRNKDDYKPY Q50335 147 8 -
YLATNLHVIQALKNREDHPPY P75198 103 12 -
FIATNLHVLADFSNSLTDEQN YKDA_MYCCA 241 8 -

Motif 4 width=21
Element Seqn Id St Int Rpt
YQQYGYGLMLDDTNLPGGSSG Y395_MYCGE 392 235 -
YKQFGYGTILWDTNFGGGSSG Y067_MYCPN 411 269 -
YQQYGKGLALANTNFSGGSSG Y068_MYCPN 374 261 -
YRQYGRGFALQNTNFRPGSSG Y068_MYCGE 347 231 -
FQQYGYGLMLNDTNFPGGSSG Q50339 408 247 -
YKLFGYGTILNNTNFPGGSSG Y067_MYCGE 400 257 -
YQHHGYGLLLEDTDFPGGSSG Q50335 407 239 -
YQMYGKGIGITNGSLSRGASG P75198 352 228 -
ATFYGYQYNINFSSLYYGASG YKDA_MYCCA 539 277 -

Motif 5 width=18
Element Seqn Id St Int Rpt
GSAIFNNNQKINSIYFGV Y395_MYCGE 412 -1 -
GSAIFNQNKQINSIYFGA Y067_MYCPN 431 -1 -
GTLVLNQQKQISGVYFGV Y068_MYCPN 394 -1 -
GTLMLNNQKQIAGIYFGV Y068_MYCGE 367 -1 -
GSPLIGKDNKLNSIYFGV Q50339 428 -1 -
GSAVFNKEKQLTSIYFGS Y067_MYCGE 420 -1 -
GSPLFNQNKQINSIYFAA Q50335 427 -1 -
GSLVLNNKRQIVAIYFAS P75198 372 -1 -
GSLAYNEFGQMIGIYNNV YKDA_MYCCA 559 -1 -

Motif 6 width=19
Element Seqn Id St Int Rpt
YDLIFGDSNTKSFYAQFAK Y395_MYCGE 473 43 -
YDLIFGDVNTTNFYAQFAK Y067_MYCPN 484 35 -
YDIIFGNKDTKNYYAQFAK Y068_MYCPN 461 49 -
YDLIFGDSNTTNFYAKFAR Y068_MYCGE 423 38 -
YDLIFGDKNTKNYYAKFAK Q50339 481 35 -
YDLIFGDKNTIKFYAQFAK Y067_MYCGE 473 35 -
YDLIFGDSNTKKYYAQFAK Q50335 468 23 -
YDLIFGNSNTKKYYAQFAK P75198 420 30 -
YNLIDGTDKTKYKYQKSSF YKDA_MYCCA 610 33 -
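
The St (start) and Int (interval) columns in the motif tables are related by simple arithmetic. The sketch below assumes that Int is the gap between the end of the previous motif and the start of the current one, and that for motif 1 it equals the start position; this reading reproduces the listed values, including the -1 for motif 5, where the motif overlaps the end of motif 4 by one residue. The names `WIDTHS`, `STARTS` and `intervals` are illustrative only.

```python
# Motif widths 1..6, from the "width=" headers in this entry.
WIDTHS = [20, 13, 21, 21, 18, 19]

# Start positions (St) for two true positives, from the Final Motifs tables.
STARTS = {
    "Y395_MYCGE": [85, 111, 136, 392, 412, 473],
    "P75198": [40, 78, 103, 352, 372, 420],
}


def intervals(starts, widths):
    """Recompute the Int column from start positions and motif widths.

    Assumption: Int for motif 1 is its start; for later motifs it is
    start minus (previous start + previous width).
    """
    out = [starts[0]]
    for i in range(1, len(starts)):
        out.append(starts[i] - (starts[i - 1] + widths[i - 1]))
    return out


for seq_id, starts in STARTS.items():
    print(seq_id, intervals(starts, WIDTHS))
```

For Y395_MYCGE this yields 85, 6, 12, 235, -1, 43, matching the Int values listed for that sequence in Motifs 1 to 6.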