SPRINT Home UMBER Home Contents Standard Search Advanced Search Relation Search

==SPRINT==> PRINTS View



  selected as


PR00698

Identifier
TMPROTEINSRG  [View Relations]  [View Alignment]  
Accession
PR00698
No. of Motifs
7
Creation Date
08-APR-1997  (UPDATE 28-JUN-1999)
Title
C.elegans Srg family integral membrane protein signature
Database References

INTERPRO; IPR000609
Literature References
1. TROEMEL, E.R., CHOU, J.H., DWYER, N.D., COLBERT, H.A. AND BARGMANN, C.I.
Divergent seven transmembrane receptors are candidate chemosensory
receptors in C.elegans.
CELL 83 207-218 (1995).

Documentation
Animals recognise a wide variety of chemicals using their senses of taste
and smell. The nematode C.elegans has only 14 types of chemosensory neuron,
yet is able to respond to dozens of chemicals because each neuron detects
several stimuli. More than 40 highly divergent transmembrane proteins that
could contribute to this functional diversity have been described [1]. Most
of the candidate receptor genes are in clusters of similar genes; 11 of 
these appear to be expressed in small subsets of chemosensory neurons. A
single type of neuron can potentially express at least 4 different receptor
genes [1]. Some of these might encode receptors for water-soluble
attractants, repellents and pheromones, which may be divergent members
of the G protein-coupled receptor family [1].
 
Sequences of the srg family of C.elegans receptor-like proteins contain
7 hydrophobic, putative transmembrane, regions. These can be distinguished
from other 7TM proteins (especially those known to couple G proteins) by
their own characteristic TM signatures.
 
TMPROTEINSRG is a 7-element fingerprint that provides a signature for the
C.elegans srg family of integral membrane proteins. The fingerprint was
derived from an initial alignment of 10 sequences: the motifs were drawn
from the putative TM regions. A single iteration on OWL29.1 was required to
reach convergence, no further sequences being identified beyond the starting
set. Several partial matches were found: these include family members that
contain insertions within the putative TM domains; a number of fragments;
and other closely related C.elegans TM proteins.
 
An update on SPTR37_9f identified a true set of 12 sequences, and 10
partial matches.
Summary Information
  12 codes involving  7 elements
6 codes involving 6 elements
2 codes involving 5 elements
2 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
712121212121212
65566662
51212121
40122210
30000000
20000000
1234567
True Positives
O45150        P91275        P91535        SG10_CAEEL    
SG13_CAEEL SRG1_CAEEL SRG3_CAEEL SRG5_CAEEL
SRG6_CAEEL SRG7_CAEEL SRG8_CAEEL SRG9_CAEEL
True Positive Partials
Codes involving 6 elements
O17819 O17820 Q18428 SG11_CAEEL
SRG2_CAEEL SRG4_CAEEL
Codes involving 5 elements
Q22044 Q23035
Codes involving 4 elements
Q23034 Q94168
Sequence Titles
O45150      W02F12.7 PROTEIN - CAENORHABDITIS ELEGANS.    
P91275 COSMID F26B1 - CAENORHABDITIS ELEGANS.
P91535 COSMID ZC204 - CAENORHABDITIS ELEGANS.
SG10_CAEEL SRG-10 PROTEIN - CAENORHABDITIS ELEGANS.
SG13_CAEEL SRG-13 PROTEIN - CAENORHABDITIS ELEGANS.
SRG1_CAEEL SRG-1 PROTEIN - CAENORHABDITIS ELEGANS.
SRG3_CAEEL SRG-3 PROTEIN - CAENORHABDITIS ELEGANS.
SRG5_CAEEL SRG-5 PROTEIN - CAENORHABDITIS ELEGANS.
SRG6_CAEEL SRG-6 PROTEIN - CAENORHABDITIS ELEGANS.
SRG7_CAEEL SRG-7 PROTEIN - CAENORHABDITIS ELEGANS.
SRG8_CAEEL SRG-8 PROTEIN - CAENORHABDITIS ELEGANS.
SRG9_CAEEL SRG-9 PROTEIN - CAENORHABDITIS ELEGANS.

O17819 F15A4.4 PROTEIN - CAENORHABDITIS ELEGANS.
O17820 F15A4.7 PROTEIN - CAENORHABDITIS ELEGANS.
Q18428 C34C6.1 PROTEIN - CAENORHABDITIS ELEGANS.
SG11_CAEEL SRG-11 PROTEIN - CAENORHABDITIS ELEGANS.
SRG2_CAEEL SRG-2 PROTEIN - CAENORHABDITIS ELEGANS.
SRG4_CAEEL SRG-4 PROTEIN - CAENORHABDITIS ELEGANS.

Q22044 T01B7.2 PROTEIN - CAENORHABDITIS ELEGANS.
Q23035 COSMID T09D3 - CAENORHABDITIS ELEGANS.

Q23034 COSMID T09D3 - CAENORHABDITIS ELEGANS.
Q94168 COSMID C10G8 - CAENORHABDITIS ELEGANS.
Scan History
OWL29_1    1  100  NSINGLE    
SPTR37_9f 2 50 NSINGLE
Initial Motifs
Motif 1  width=27
Element Seqn Id St Int Rpt
ENLKLMGQLVYLIPSFILISKMIYVIQ SRG6_CAEEL 19 19 -
ENSKYWIQCLWLIPTLFLLVWIIITTR SRG7_CAEEL 26 26 -
EVLKFTAQIIYIITGIFLNSAVLGTIL CELF26B12 29 29 -
SILEYLVQATYLSVSAVLNSMIVYTIF CELZC204 30 30 -
ENLKYLTQLFYMVPGIIIHFRILSIML SRG5_CAEEL 24 24 -
ENIHYFYQFAYLFTAICINYRILYVIW SRG3_CAEEL 15 15 -
ENFKYFVQVAYLAPAVFLYSRILYVVW SRG8_CAEEL 23 23 -
ELLKYIIQVTLLSINFILNFLIIRVTM SG13_CAEEL 30 30 -
SLAMYGMQSSYLIVGAVLNVMIVYTVF SG10_CAEEL 31 31 -
ENLKLFLQLCYLTPSALFLSRVIYITA SRG1_CAEEL 21 21 -

Motif 2 width=24
Element Seqn Id St Int Rpt
FYMLYCADAIVGIYINTAEVIFGR SG10_CAEEL 67 9 -
FFIIYAADLIMGMYMSLSEILVGR SG13_CAEEL 68 11 -
FFIIFSMDSIASLTQLILDLFIQR SRG5_CAEEL 62 11 -
FFTLFSVDCIANISILITEGLFAR CELF26B12 67 11 -
YHPFFMVYSMVGLILVLLDIFITR SRG8_CAEEL 58 8 -
FFVLYAAEAVMNVYSCVIEVLFGR CELZC204 69 12 -
FYNLYSVDCFTSVLAMSNELIFTR SRG3_CAEEL 53 11 -
PYWILTADCVVSIILILLDLFVVR SRG7_CAEEL 63 10 -
FWLLYTMDLALSLLNLFFDIFYYR SRG6_CAEEL 58 12 -
FYTIFLADCVTGFILVNFSIFFTR SRG1_CAEEL 59 11 -

Motif 3 width=25
Element Seqn Id St Int Rpt
IYYPCFRYLQAFQILVQILFVANRA SRG1_CAEEL 107 24 -
LYFLLNHYCLAFKTLSQIAISFNRM CELZC204 117 24 -
IIMLLTHHVSICKSLLQVLLVLNRM CELF26B12 115 24 -
IIFSIYNYMRAAKSIIQIFLTVNRM SRG5_CAEEL 110 24 -
IYYCLLSYLIAIKPVIHIFIAVNRM SRG3_CAEEL 101 24 -
LYYPLLNYLHCAQPFIQIFLTTNRM SRG8_CAEEL 106 24 -
IFFTLSHYSQGFKTVSQVFLSFNRM SG13_CAEEL 116 24 -
MYYAALHYSLGFKTFSQIFMSFNRM SG10_CAEEL 115 24 -
ITYPLWFYFHVGKMVAQMSISFERM SRG6_CAEEL 106 24 -
IYFPIYNYARVFKTGSQCGMILSRL SRG7_CAEEL 111 24 -

Motif 4 width=26
Element Seqn Id St Int Rpt
WKINFSRILILNLIAPFFFIWNTIIS SRG8_CAEEL 143 12 -
WRKSLKYVISAVFLIPFCADWNIAIS CELF26B12 152 12 -
LRRHIPLFLTIICILPILVVWNTVIS SRG7_CAEEL 148 12 -
WKHGLTACVIMIIFVPFSIIWNILIS SRG6_CAEEL 144 13 -
WKKWLKSILTTMAISPCLWIWTIAIS SRG1_CAEEL 144 12 -
WQNILAPVLVSLFVLPLGVTWNILVS CELZC204 154 12 -
WKQILKPVLIITFILPLGVIWKILLS SG10_CAEEL 152 12 -
WKRILTPIIIVLFVLPIGIIWNVLIS SG13_CAEEL 153 12 -
WRRFIPVTIAFITLSPFLVIWNVIIS SRG5_CAEEL 147 12 -
WSQKLRIMLIVIFLAPFLVIWNVLIS SRG3_CAEEL 138 12 -

Motif 5 width=26
Element Seqn Id St Int Rpt
WANISILHLFHFTLCFVLVIIFFVAT SG10_CAEEL 198 20 -
WASLSMFQMIFMAISLTITVFTTSIT SRG5_CAEEL 193 20 -
WASLSLMQFTLIILTVLITMVTTTVT SRG3_CAEEL 184 20 -
WASMSLFLFIIRSAVVMITVVTTSIT SRG8_CAEEL 189 20 -
WASLSKLHLTYFIVSLILIIVISGVT SG13_CAEEL 199 20 -
WARSTLFFSILRLTSVITIVVATTTM SRG1_CAEEL 190 20 -
WFGTTAWQLTYMQISMAVTLLSNIVT SRG6_CAEEL 190 20 -
WVSLSKLHLTFIFVSISFILISSLLL SRG7_CAEEL 194 20 -
WASQSRFQLVFIIIALSFTFICTAIT CELF26B12 198 20 -
WANISFLHLFHCIPCLFLMIVFFLAS CELZC204 200 20 -

Motif 6 width=23
Element Seqn Id St Int Rpt
SRLLCRIWFAISTEYLLSACAFC SRG6_CAEEL 229 13 -
ERSLIIFTMTLGVETMLFAIAQI CELZC204 240 14 -
EKAISNATVIISIGFTFKVLFQI CELF26B12 235 11 -
ERTLCFASFYMSAAFFSAALFQS SRG5_CAEEL 233 14 -
ERALCIAAALISVGFLLEAITQS SRG3_CAEEL 224 14 -
EGTLCKACAANSICFLVPAVFEA SRG8_CAEEL 229 14 -
ERRVITNSIFIIVAFFFQAAFQS SRG7_CAEEL 232 12 -
EQTLTIATMVLSLEFSFLSVIQI SG13_CAEEL 239 14 -
ERSLTIVTMIMAVQTVTFASIQI SG10_CAEEL 238 14 -
ERRLCWASVYLSVCYLLPAIAEV SRG1_CAEEL 230 14 -

Motif 7 width=24
Element Seqn Id St Int Rpt
FDVLYVYSPIALILMNRQLRRDIF CELZC204 286 23 -
WDVLNVGSPLVMIFASGQLRTHAF SRG8_CAEEL 273 21 -
YDVMTVGYPLIFLNFAKEFRNHVF SRG7_CAEEL 277 22 -
WDGFNILSPVIMISMNKSLRKQVF SRG6_CAEEL 274 22 -
LDVIILISVRILPGLKSEVVRKLW SRG1_CAEEL 259 6 -
FDSLYVFSPIALIVMSRQLRKDIF SG10_CAEEL 284 23 -
YDLLNFSTTIIFISCNPKLRKMLL SG13_CAEEL 285 23 -
FDVLNVGSPIVMVLISGQLRYHVI SRG5_CAEEL 278 22 -
LDFLTVGSPIVMICVSRNLRTHIF CELF26B12 280 22 -
MDILFVGSPLVLLLVSDQFRGHVL SRG3_CAEEL 269 22 -
Final Motifs
Motif 1  width=27
Element Seqn Id St Int Rpt
ENIKYLLQAAYMVPPAFLYARILYVIW SRG9_CAEEL 25 25 -
ENLKYLTQLFYMVPGIIIHFRILSIML SRG5_CAEEL 24 24 -
ENIHYFYQFAYLFTAICINYRILYVIW SRG3_CAEEL 15 15 -
ENFKYFVQVAYLAPAVFLYSRILYVVW SRG8_CAEEL 23 23 -
ELLKYIIQVTLLSINFILNFLIIRVTM SG13_CAEEL 30 30 -
SLAMYGMQSSYLIVGAVLNVMIVYTVF SG10_CAEEL 31 31 -
SILEYLVQATYLSVSAVLNSMIVYTIF P91535 30 30 -
EVLKFTAQIIYIITGIFLNSAVLGTIL P91275 29 29 -
ENLKLFLQLCYLTPSALFLSRVIYITA SRG1_CAEEL 21 21 -
ENLKLMGQLVYLIPSFILISKMIYVIQ SRG6_CAEEL 19 19 -
ENSKYWIQCLWLIPTLFLLVWIIITTR SRG7_CAEEL 26 26 -
ELLKYGIQFVYFIVGLGFHFAVIKVLH O45150 87 87 -

Motif 2 width=24
Element Seqn Id St Int Rpt
FFVIYSMDSIVGFILLLLDIFITR SRG9_CAEEL 63 11 -
FFIIFSMDSIASLTQLILDLFIQR SRG5_CAEEL 62 11 -
FYNLYSVDCFTSVLAMSNELIFTR SRG3_CAEEL 53 11 -
YHPFFMVYSMVGLILVLLDIFITR SRG8_CAEEL 58 8 -
FFIIYAADLIMGMYMSLSEILVGR SG13_CAEEL 68 11 -
FYMLYCADAIVGIYINTAEVIFGR SG10_CAEEL 67 9 -
FFVLYAAEAVMNVYSCVIEVLFGR P91535 69 12 -
FFTLFSVDCIANISILITEGLFAR P91275 67 11 -
FYTIFLADCVTGFILVNFSIFFTR SRG1_CAEEL 59 11 -
FWLLYTMDLALSLLNLFFDIFYYR SRG6_CAEEL 58 12 -
PYWILTADCVVSIILILLDLFVVR SRG7_CAEEL 63 10 -
FLKLYYVDSILSVLIILLDLVLIR O45150 124 10 -

Motif 3 width=25
Element Seqn Id St Int Rpt
IYYPLLNYLHCAQPLIQIFLTLNRM SRG9_CAEEL 111 24 -
IIFSIYNYMRAAKSIIQIFLTVNRM SRG5_CAEEL 110 24 -
IYYCLLSYLIAIKPVIHIFIAVNRM SRG3_CAEEL 101 24 -
LYYPLLNYLHCAQPFIQIFLTTNRM SRG8_CAEEL 106 24 -
IFFTLSHYSQGFKTVSQVFLSFNRM SG13_CAEEL 116 24 -
MYYAALHYSLGFKTFSQIFMSFNRM SG10_CAEEL 115 24 -
LYFLLNHYCLAFKTLSQIAISFNRM P91535 117 24 -
IIMLLTHHVSICKSLLQVLLVLNRM P91275 115 24 -
IYYPCFRYLQAFQILVQILFVANRA SRG1_CAEEL 107 24 -
ITYPLWFYFHVGKMVAQMSISFERM SRG6_CAEEL 106 24 -
IYFPIYNYARVFKTGSQCGMILSRL SRG7_CAEEL 111 24 -
SILFIEQYLQFVKSLIFCFMVVNRA O45150 171 23 -

Motif 4 width=26
Element Seqn Id St Int Rpt
WSKNLSFIVAFVSLSPFLIIWNTIIS SRG9_CAEEL 148 12 -
WRRFIPVTIAFITLSPFLVIWNVIIS SRG5_CAEEL 147 12 -
WSQKLRIMLIVIFLAPFLVIWNVLIS SRG3_CAEEL 138 12 -
WKINFSRILILNLIAPFFFIWNTIIS SRG8_CAEEL 143 12 -
WKRILTPIIIVLFVLPIGIIWNVLIS SG13_CAEEL 153 12 -
WKQILKPVLIITFILPLGVIWKILLS SG10_CAEEL 152 12 -
WQNILAPVLVSLFVLPLGVTWNILVS P91535 154 12 -
WRKSLKYVISAVFLIPFCADWNIAIS P91275 152 12 -
WKKWLKSILTTMAISPCLWIWTIAIS SRG1_CAEEL 144 12 -
WKHGLTACVIMIIFVPFSIIWNILIS SRG6_CAEEL 144 13 -
LRRHIPLFLTIICILPILVVWNTVIS SRG7_CAEEL 148 12 -
QSCIIPHVIVFCILCPLLGVWTAFLS O45150 208 12 -

Motif 5 width=26
Element Seqn Id St Int Rpt
WADISLFLFLVRSVAVIITVASTVIM SRG9_CAEEL 194 20 -
WASLSMFQMIFMAISLTITVFTTSIT SRG5_CAEEL 193 20 -
WASLSLMQFTLIILTVLITMVTTTVT SRG3_CAEEL 184 20 -
WASMSLFLFIIRSAVVMITVVTTSIT SRG8_CAEEL 189 20 -
WASLSKLHLTYFIVSLILIIVISGVT SG13_CAEEL 199 20 -
WANISILHLFHFTLCFVLVIIFFVAT SG10_CAEEL 198 20 -
WANISFLHLFHCIPCLFLMIVFFLAS P91535 200 20 -
WASQSRFQLVFIIIALSFTFICTAIT P91275 198 20 -
WARSTLFFSILRLTSVITIVVATTTM SRG1_CAEEL 190 20 -
WFGTTAWQLTYMQISMAVTLLSNIVT SRG6_CAEEL 190 20 -
WVSLSKLHLTFIFVSISFILISSLLL SRG7_CAEEL 194 20 -
WITVSQFSVIISSITIVTVCICSVIS O45150 254 20 -

Motif 6 width=23
Element Seqn Id St Int Rpt
ERTLCLACVIHSICFMVPSFFEA SRG9_CAEEL 234 14 -
ERTLCFASFYMSAAFFSAALFQS SRG5_CAEEL 233 14 -
ERALCIAAALISVGFLLEAITQS SRG3_CAEEL 224 14 -
EGTLCKACAANSICFLVPAVFEA SRG8_CAEEL 229 14 -
EQTLTIATMVLSLEFSFLSVIQI SG13_CAEEL 239 14 -
ERSLTIVTMIMAVQTVTFASIQI SG10_CAEEL 238 14 -
ERSLIIFTMTLGVETMLFAIAQI P91535 240 14 -
EKAISNATVIISIGFTFKVLFQI P91275 235 11 -
ERRLCWASVYLSVCYLLPAIAEV SRG1_CAEEL 230 14 -
SRLLCRIWFAISTEYLLSACAFC SRG6_CAEEL 229 13 -
ERRVITNSIFIIVAFFFQAAFQS SRG7_CAEEL 232 12 -
EQSLTASALAMSIFYVFALSMNI O45150 294 14 -

Motif 7 width=24
Element Seqn Id St Int Rpt
WDVLNVGSPLIMIFVSGQLRHHVL SRG9_CAEEL 278 21 -
FDVLNVGSPIVMVLISGQLRYHVI SRG5_CAEEL 278 22 -
MDILFVGSPLVLLLVSDQFRGHVL SRG3_CAEEL 269 22 -
WDVLNVGSPLVMIFASGQLRTHAF SRG8_CAEEL 273 21 -
YDLLNFSTTIIFISCNPKLRKMLL SG13_CAEEL 285 23 -
FDSLYVFSPIALIVMSRQLRKDIF SG10_CAEEL 284 23 -
FDVLYVYSPIALILMNRQLRRDIF P91535 286 23 -
LDFLTVGSPIVMICVSRNLRTHIF P91275 280 22 -
LDVIILISVRILPGLKSEVVRKLW SRG1_CAEEL 259 6 -
WDGFNILSPVIMISMNKSLRKQVF SRG6_CAEEL 274 22 -
YDVMTVGYPLIFLNFAKEFRNHVF SRG7_CAEEL 277 22 -
FDIILVCPPVIMLCLNVRLRINVF O45150 340 23 -