
==SPRINT==> PRINTS View



PR00011

Identifier
EGFLAMININ
Accession
PR00011
No. of Motifs
4
Creation Date
11-SEP-1991  (UPDATE 24-JUN-1999)
Title
Type III EGF-like signature
Database References

PROSITE; PS00022 EGF
BLOCKS; BL00022
PFAM; PF00053 laminin_EGF
INTERPRO; IPR002049
Literature References
1. SASAKI, M., KATO, S., MARTIN, G.R. AND YAMADA, Y.
Sequence of the cDNA encoding the laminin B1 chain reveals a multidomain 
protein containing cysteine-rich repeats.
PROC.NATL.ACAD.SCI.U.S.A. 84 935-939 (1987).
 
2. SASAKI, M. AND YAMADA, Y.
The laminin B2 chain has a multidomain structure homologous 
to the B1 chain.
J.BIOL.CHEM. 262(35) 17111-17117 (1987).
 
3. PANAYOTOU, G., END, P., AUMAILLEY, M., TIMPL, R. AND ENGEL, J.
Domains of laminin with growth-factor activity.
CELL 56 93-101 (1989).

Documentation
Laminins are extracellular proteins whose primary sequences contain regions
of EGF-like repeats: these differ from both the type I EGF repeats of 
epidermal and transforming growth factors (see EGFTGF), and the type II 
EGF-like repeats occurring in coagulation factors, notch proteins, etc. 
(see EGFBLOOD). The difference lies primarily in the exact spacing of the 
Cys residues, which warrants a further class of EGF-like motifs, the 
so-called type III repeats.
 
The laminins contain an A, a B1 and a B2 chain, each of which displays the 
repeated type III EGF-like motif [1-3]. These domains have been shown to
possess growth factor activity [3] and, in view of the clear relationship 
between their amino acid sequences, it is believed they may share a common 
structural framework with the type I EGF repeat.
 
EGFLAMININ is a 4-element fingerprint that provides a signature for type 
III EGF-like repeats. The fingerprint was derived from an initial alignment
of 4 sequences: the motifs include a number of cysteines believed to be
involved in disulphide bond formation, with motifs 1 and 2 spanning the
region encoded by PROSITE pattern EGF (PS00022). Four iterations on OWL12.0
were required to reach convergence, at which point a true set of 12
sequences was identified.
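
The cysteine spacing that characterises the repeat can be read directly from
the motif elements. The following is a minimal Python sketch, not part of the
PRINTS entry: the helper name cysteine_spacings is hypothetical, and the
input is the four motif 1 elements of the initial alignment listed under
Initial Motifs.

```python
def cysteine_spacings(element: str) -> list[int]:
    """Return the gaps, in residues, between successive cysteines."""
    positions = [i for i, residue in enumerate(element) if residue == "C"]
    return [b - a for a, b in zip(positions, positions[1:])]


# Motif 1 elements from the initial alignment of this entry.
motif1 = {
    "LMB1_HUMAN": "CDKATGQCLCLPNVIGQNC",
    "LMB1_MOUSE": "CDKATGQCSCLPNVIGQNC",
    "A28783": "CDRFTGQCPCLPNVQGVRC",
    "LMB1_DROME": "CDRFTGQCPCLPNVQGVRC",
}

spacings = {name: cysteine_spacings(seq) for name, seq in motif1.items()}
for name, gaps in spacings.items():
    print(name, gaps)
```

For all four elements the gaps between successive cysteines are 7, 2 and 9
residues.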
 
An update on SPTR37_9f identified a true set of 22 sequences, and 6
partial matches.
Summary Information
22 codes involving 4 elements
4 codes involving 3 elements
2 codes involving 2 elements
Composite Feature Index
   4|  22   22   22   22
   3|   4    2    3    3
   2|   1    0    1    2
  --+---------------------
    |   1    2    3    4
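
The summary counts and the index can be cross-checked against each other.
The sketch below is illustrative only and assumes one reading of the index:
each row is taken to give, for the codes matching that number of elements,
how many of them contain motifs 1, 2, 3 and 4 in turn. Under that
assumption, each row must sum to (number of elements) x (number of codes).
The names summary, cfi and row_is_consistent are hypothetical.

```python
# Codes matching N elements, from Summary Information.
summary = {4: 22, 3: 4, 2: 2}

# Index rows under the assumed reading: hits per motif (1, 2, 3, 4).
cfi = {
    4: [22, 22, 22, 22],
    3: [4, 2, 3, 3],
    2: [1, 0, 1, 2],
}


def row_is_consistent(n_elements: int, n_codes: int, row: list[int]) -> bool:
    """A code matching n_elements motifs contributes n_elements hits in total."""
    return sum(row) == n_elements * n_codes


checks = {n: row_is_consistent(n, summary[n], cfi[n]) for n in summary}
print(checks)
```

All three rows pass: 88 hits for the 22 full matches, 12 for the 4
three-element partials, and 4 for the 2 two-element partials.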
True Positives
LMA1_HUMAN LMA1_MOUSE LMA2_HUMAN LMA2_MOUSE
LMA_DROME LMB1_DROME LMB1_HUMAN LMB1_MOUSE
LMB2_HUMAN LMB2_MOUSE LMB2_RAT LMB3_HUMAN
LMB3_MOUSE LMG1_HUMAN LMG1_MOUSE LML1_CAEEL
LML2_CAEEL O44565 O57484 O75445
P91904 Q93022
True Positive Partials
Codes involving 3 elements
AGRI_CHICK LMG1_DROME O14947 O88281
Codes involving 2 elements
AGRI_RAT LMG2_HUMAN
Sequence Titles
LMA1_HUMAN LAMININ ALPHA-1 CHAIN PRECURSOR (LAMININ A CHAIN) - HOMO SAPIENS (HUMAN).
LMA1_MOUSE LAMININ ALPHA-1 CHAIN PRECURSOR (LAMININ A CHAIN) - MUS MUSCULUS (MOUSE).
LMA2_HUMAN LAMININ ALPHA-2 CHAIN PRECURSOR (LAMININ M CHAIN) (MEROSIN HEAVY CHAIN) - HOMO SAPIENS (HUMAN).
LMA2_MOUSE LAMININ ALPHA-2 CHAIN PRECURSOR (LAMININ M CHAIN) (MEROSIN HEAVY CHAIN) - MUS MUSCULUS (MOUSE).
LMA_DROME LAMININ ALPHA CHAIN PRECURSOR - DROSOPHILA MELANOGASTER (FRUIT FLY).
LMB1_DROME LAMININ BETA-1 CHAIN PRECURSOR (LAMININ B1 CHAIN) - DROSOPHILA MELANOGASTER (FRUIT FLY).
LMB1_HUMAN LAMININ BETA-1 CHAIN PRECURSOR (LAMININ B1 CHAIN) - HOMO SAPIENS (HUMAN).
LMB1_MOUSE LAMININ BETA-1 CHAIN PRECURSOR (LAMININ B1 CHAIN) - MUS MUSCULUS (MOUSE).
LMB2_HUMAN LAMININ BETA-2 CHAIN PRECURSOR (S-LAMININ) - HOMO SAPIENS (HUMAN).
LMB2_MOUSE LAMININ BETA-2 CHAIN PRECURSOR - MUS MUSCULUS (MOUSE).
LMB2_RAT LAMININ BETA-2 CHAIN PRECURSOR (S-LAMININ) (LAMININ CHAIN B3) - RATTUS NORVEGICUS (RAT).
LMB3_HUMAN LAMININ BETA-3 CHAIN PRECURSOR (LAMININ B1K CHAIN) (KALININ B1 CHAIN) - HOMO SAPIENS (HUMAN).
LMB3_MOUSE LAMININ BETA-3 CHAIN PRECURSOR (KALININ B1 CHAIN) - MUS MUSCULUS (MOUSE).
LMG1_HUMAN LAMININ GAMMA-1 CHAIN PRECURSOR (LAMININ B2 CHAIN) - HOMO SAPIENS (HUMAN).
LMG1_MOUSE LAMININ GAMMA-1 CHAIN PRECURSOR (LAMININ B2 CHAIN) - MUS MUSCULUS (MOUSE).
LML1_CAEEL LAMININ-LIKE PROTEIN C54D1.5 PRECURSOR - CAENORHABDITIS ELEGANS.
LML2_CAEEL LAMININ-LIKE PROTEIN K08C7.3 PRECURSOR - CAENORHABDITIS ELEGANS.
O44565 W03F8.5 PROTEIN - CAENORHABDITIS ELEGANS.
O57484 LAMININ BETA 2-LIKE CHAIN - GALLUS GALLUS (CHICKEN).
O75445 USHER SYNDROME TYPE IIA PROTEIN - HOMO SAPIENS (HUMAN).
P91904 LAMININ ALPHA - CAENORHABDITIS ELEGANS.
Q93022 LAMININ ALPHA 2 CHAIN - HOMO SAPIENS (HUMAN).

AGRI_CHICK AGRIN PRECURSOR - GALLUS GALLUS (CHICKEN).
LMG1_DROME LAMININ GAMMA-1 CHAIN PRECURSOR (LAMININ B2 CHAIN) - DROSOPHILA MELANOGASTER (FRUIT FLY).
O14947 LAMININ-5 BETA3 CHAIN - HOMO SAPIENS (HUMAN).
O88281 MEGF6 - RATTUS NORVEGICUS (RAT).

AGRI_RAT AGRIN PRECURSOR - RATTUS NORVEGICUS (RAT).
LMG2_HUMAN LAMININ GAMMA-2 CHAIN PRECURSOR - HOMO SAPIENS (HUMAN).
Scan History
OWL12_0 4 50 NSINGLE
OWL17_1 1 20 NSINGLE
OWL18_0 1 50 NSINGLE
OWL19_1 1 60 NSINGLE
OWL26_0 1 100 NSINGLE
SPTR37_9f 3 120 NSINGLE
Initial Motifs
Motif 1 width=19
Element Seqn Id St Int Rpt
CDKATGQCLCLPNVIGQNC LMB1_HUMAN 1047 1047 -
CDKATGQCSCLPNVIGQNC LMB1_MOUSE 1047 1047 -
CDRFTGQCPCLPNVQGVRC A28783 1059 1059 -
CDRFTGQCPCLPNVQGVRC LMB1_DROME 1057 1057 -

Motif 2 width=19
Element Seqn Id St Int Rpt
CNEFTGQCQCMPGFGGRTC LMB1_MOUSE 1096 30 -
CNSYTGQCQCKPGFGGRAC A28783 1108 30 -
CNEFTGQCQCMPGFGGRTC LMB1_HUMAN 1096 30 -
CNSYTGQCQCKPGFGGRAC LMB1_DROME 1106 30 -

Motif 3 width=29
Element Seqn Id St Int Rpt
NQCQAHYWGNPNEKCQPCECDQFGAADFQ A28783 1127 0 -
SECQELFWGDPDVECRACDCDPRGIETPQ LMB1_HUMAN 1115 0 -
SECQELFWGDPDVECRACDCDPRGIETPQ LMB1_MOUSE 1115 0 -
NQCQAHYWGNPNEKCQPCECDQFGAADFQ LMB1_DROME 1125 0 -

Motif 4 width=19
Element Seqn Id St Int Rpt
CDQSTGQCVCVEGVEGPRC LMB1_MOUSE 1144 0 -
CDRETGNCVCHEGIGGYKC LMB1_DROME 1154 0 -
CDQSTGQCVCVEGVEGPRC LMB1_HUMAN 1144 0 -
CDRETGNCVCHEGIGGYKC A28783 1156 0 -
Final Motifs
Motif 1 width=19
Element Seqn Id St Int Rpt
CDPSTGQCPCLPHVQGLSC LMB2_RAT 1061 1061 -
CDPSTGQCPCLPHVQGLNC LMB2_MOUSE 1059 1059 -
CDPSSGQCPCLPNVQGPSC LMB2_HUMAN 1058 1058 -
CDKATGQCLCLPNVIGQNC LMB1_HUMAN 1047 1047 -
CDKATGQCSCLPNVIGQNC LMB1_MOUSE 1047 1047 -
CDQRSGQCHCLPHVEGQSC O57484 1052 1052 -
CDPVTGQCVCKEHVQGERC LMB3_HUMAN 392 392 -
CDRLTGQCVCKEYVQGERC LMB3_MOUSE 389 389 -
CDPKTGRCICPPNTIGEKC LMA2_HUMAN 1023 1023 -
CDPKTGRCICPPNTIGEKC Q93022 1023 1023 -
CDPKTGQCICPPNTTGEKC LMA2_MOUSE 1019 1019 -
CDRFTGQCPCLPNVQGVRC LMB1_DROME 1057 1057 -
CNPVTGQCECLPHVTGQDC LMG1_HUMAN 898 898 -
CDITSGQCKCRPRVTGLRC LML2_CAEEL 821 821 -
CDITSGQCKCRPRVTGLRC P91904 821 821 -
CNPVTGQCQCLPHVSGRDC LMG1_MOUSE 896 896 -
CDPETGECVCPPHTQGGKC LMA1_HUMAN 1007 1007 -
CNQQDGQCDCLPNVIGIQC LML1_CAEEL 894 894 -
CDPASGECLCPPHTQGLKC LMA1_MOUSE 1014 1014 -
CHQNSGQCKCKANVIGLRC O75445 713 713 -
CDKYTGMCTCKRLVTGENC O44565 498 498 -
CTSKSGQCPCKPHTQGRRC LMA_DROME 746 746 -

Motif 2 width=19
Element Seqn Id St Int Rpt
CNEFTGQCHCHAGFGGRTC LMB2_RAT 1110 30 -
CNEFTGQCHCHAGFGGRTC LMB2_MOUSE 1108 30 -
CNEFTGQCHCRAGFGGRTC LMB2_HUMAN 1107 30 -
CNEFTGQCQCMPGFGGRTC LMB1_HUMAN 1096 30 -
CNEFTGQCQCMPGFGGRTC LMB1_MOUSE 1096 30 -
CNQFTGQCSCRPGFGGRTC O57484 1101 30 -
CNQFTGQCPCREGFGGLMC LMB3_HUMAN 493 82 -
CNQFTGQCPCREGFGGLTC LMB3_MOUSE 490 82 -
CNVNTGQCNCHPKFSGAKC LMA2_HUMAN 1072 30 -
CNVNTGQCNCHPKFSGAKC Q93022 1072 30 -
CNVNTGQCSCHPKFSGMKC LMA2_MOUSE 1068 30 -
CNSYTGQCQCKPGFGGRAC LMB1_DROME 1106 30 -
CDIRTGQCECQPGITGQHC LMG1_HUMAN 947 30 -
CDERTGQCFCPPHVEGQTC LML2_CAEEL 1469 629 -
CDERTGQCFCPPHVEGQTC P91904 1469 629 -
CDIRTGQCECQPGITGQHC LMG1_MOUSE 945 30 -
CDVVTGHCQCKSKFGGRAC LMA1_HUMAN 1056 30 -
CDVNTGQCQCKPGVTGQRC LML1_CAEEL 943 30 -
CDVLSGHCPCKKGFGGQSC LMA1_MOUSE 1063 30 -
CNPHSGQCECKKEAKGLQC O75445 764 32 -
CEITTGQCKCREGFSGRRC O44565 548 31 -
CDPHTGHCACKSGVTGRQC LMA_DROME 2028 1263 -

Motif 3 width=29
Element Seqn Id St Int Rpt
SECQELHWGDPGLQCRACDCDPRGIDKPQ LMB2_RAT 1129 0 -
SECQELYWGDPGLQCRACDCDPRGIDKPQ LMB2_MOUSE 1127 0 -
SECQELHWGDPGLQCHACDCDSRGIDTPQ LMB2_HUMAN 1126 0 -
SECQELFWGDPDVECRACDCDPRGIETPQ LMB1_HUMAN 1115 0 -
SECQELFWGDPDVECRACDCDPRGIETPQ LMB1_MOUSE 1115 0 -
ANCQEQHWGDPRLQCRACDCDPRGIASTQ O57484 1120 0 -
RQCPDRTYGDVATGCRACDCDFRGTEGPG LMB3_HUMAN 517 5 -
RQCPDQTYGHVPTGCRACDCDFRGTEGPA LMB3_MOUSE 514 5 -
APGYTGSPGNPGGSCQECECDPYGSLPVP LMA2_HUMAN 1510 419 -
APGYTGSPGNPGGSCQECECDPYGSLPVP Q93022 1510 419 -
APGYTGSPSSPGGSCQECECDPYGSLPVP LMA2_MOUSE 1506 419 -
NQCQAHYWGNPNEKCQPCECDQFGAADFQ LMB1_DROME 1125 0 -
ERCEVNHFGFGPEGCKPCDCHPEGSLSLQ LMG1_HUMAN 966 0 -
CKCKENVYGGRCEACKAGTFDLSAENPLG LML2_CAEEL 1573 85 -
CKCKENVYGGRCEACKAGTFDLSAENPLG P91904 1573 85 -
ERCETNHFGFGPEGCKPCDCHHEGSLSLQ LMG1_MOUSE 964 0 -
CPCKENVFGPQCNECREGTFALRADNPLG LMA1_HUMAN 1118 43 -
DRCADYHFGFSANGCQPCDCEYIGSENQQ LML1_CAEEL 962 0 -
CSCKENVVGPQCSKCQAGTFALRGDNPQG LMA1_MOUSE 1125 43 -
DTCRENFYGLDVTNCKACDCDTAGSLPGT O75445 783 0 -
DQCAIGTYGFGPSGCKKCDCDAVGSLGND O44565 828 261 -
DRCAVDHWKYEKDGCTPCNCNQGYSRGFG LMA_DROME 2047 0 -

Motif 4 width=19
Element Seqn Id St Int Rpt
CHRSTGHCSCRPGVSGVRC LMB2_RAT 1158 0 -
CHRSTGHCSCRPGVSGVRC LMB2_MOUSE 1156 0 -
CHRFTGHCSCRPGVSGVRC LMB2_HUMAN 1155 0 -
CDQSTGQCVCVEGVEGPRC LMB1_HUMAN 1144 0 -
CDQSTGQCVCVEGVEGPRC LMB1_MOUSE 1144 0 -
CHHGSGHCDCRPGISGVRC O57484 1149 0 -
CDKASGRCLCRPGLTGPRC LMB3_HUMAN 546 0 -
CDKASGRCLCRPGFTGPRC LMB3_MOUSE 543 0 -
CDPVTGFCTCRPGATGRKC LMA2_HUMAN 1539 0 -
CDPVTGFCTCRPGATGRKC Q93022 1539 0 -
CDRVTGLCTCRPGATGRKC LMA2_MOUSE 1535 0 -
CDRETGNCVCHEGIGGYKC LMB1_DROME 1154 0 -
QCKDDGRCECREGFVGNRC LMG1_HUMAN 994 -1 -
CNVENGQCTCRPGATGMRC LML2_CAEEL 2048 446 -
CNVENGQCTCRPGATGMRC P91904 2048 446 -
QCKDDGRCECREGFVGNRC LMG1_MOUSE 992 -1 -
CDRTSGQCVCRLGASGLRC LMA1_HUMAN 1521 374 -
CDVNSGQCLCKENVEGRRC LML1_CAEEL 991 0 -
CDRASGQCVCKPGATGLHC LMA1_MOUSE 1528 374 -
CNKSTGQCPCKLGVTGLRC O75445 866 54 -
CNIFDGQCQCKPGRGGRKC O44565 1126 269 -
CNPNTGKCQCLPGVIGDRC LMA_DROME 2076 0 -
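
In the motif tables, St is the start position of an element in its sequence.
Int appears to be the number of residues between the end of the previous
motif and the start of the current one, and for motif 1 the tabulated Int
equals St. This reading is inferred from the values in this entry rather than
taken from PRINTS documentation, and the function name intervals is
hypothetical. A minimal Python sketch that recomputes the Int column from the
St column and the motif widths:

```python
# Widths of motifs 1-4 as given in the motif headers of this entry.
WIDTHS = [19, 19, 29, 19]


def intervals(starts: list[int], widths: list[int] = WIDTHS) -> list[int]:
    """Recompute the Int column from St values under the assumed definition."""
    result = [starts[0]]  # for motif 1 the table lists Int equal to St
    for prev_start, prev_width, start in zip(starts, widths, starts[1:]):
        result.append(start - (prev_start + prev_width))
    return result


# St values for motifs 1-4, taken from the Final Motifs tables.
print(intervals([1047, 1096, 1115, 1144]))  # LMB1_HUMAN
print(intervals([898, 947, 966, 994]))      # LMG1_HUMAN
print(intervals([498, 548, 828, 1126]))     # O44565
```

The recomputed values match the tabulated Int column for these three
sequences. A value of -1, as for motif 4 of LMG1_HUMAN, means the element
overlaps the preceding one by a single residue.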