PR00007

Identifier
COMPLEMNTC1Q
Accession
PR00007
No. of Motifs
4
Creation Date
24-OCT-1994  (UPDATE 07-JUN-1999)
Title
Complement C1Q domain signature
Database References

PROSITE; PS01113 C1Q
PFAM; PF00386 C1q
INTERPRO; IPR001073
Literature References
1. SELLAR, G.C., BLAKE, D.J. AND REID, K.B.
Characterization and organization of the genes encoding the A-, B- and C-
chains of human complement subcomponent C1q. The complete derived amino
acid sequence of human C1q.
BIOCHEM.J. 274 481-490 (1991).
 
2. PETRY, F., REID, K.B.M. AND LOOS, M.
Molecular cloning and characterization of the complementary DNA coding for
the B-chain of murine C1q.
FEBS LETT. 258 89-93 (1989).
 
3. MURAGAKI, Y., JACENKO, O., APTE, S., MATTEI, M.G., NINOMIYA, Y. AND
OLSEN, B.R.
The alpha 2(VIII) collagen gene. A novel member of the short chain collagen
family located on the human chromosome 1.
J.BIOL.CHEM. 266 7721-7727 (1991).

Documentation
The complement C1q protein comprises 18 polypeptide chains: 6 A, 6 B and
6 C. These share the same topology, each possessing a small, globular
N-terminal domain, a collagen-like Gly/Pro-rich central region, and a
conserved C-terminal region, the so-called C1q domain. Electron microscopy
(EM) has shown that 3 subunits (A, B and C) come together, joined via a
triple helix in the collagen-like region [1]; 6 such assemblies associate
to form a hexameric structure with 6 globular heads linked by 6
collagen-like stalks to a fibril-like central region [1]. The stalks
themselves interact with two C1r and two C1s proenzymes to form the C1
protein, the first component of the serum complement system [1].
 
The C1q protein is produced in collagen-producing cells [2] and shows
sequence and structural similarity to collagens VIII and X [1,3],
indicating a possible evolutionary relationship. Additionally, some
carbohydrate-binding proteins, such as mannan-binding protein, lung
surfactant protein SP-A and conglutinin, also have collagen-like regions
with globular domains. These globular domains are distinct from C1q at the
sequence level, but appear structurally similar when viewed by EM [1]. In
fact, mannan-binding protein can mimic C1q activity by activating the
complex of two C1r and two C1s proenzymes after interaction with suitable
carbohydrate ligands [1]. This process may be important for
antibody-independent complement activation in infections where there is
little or no antibody response [1].
 
COMPLEMNTC1Q is a 4-element fingerprint that provides a signature for the 
complement C1q domain. The fingerprint was derived from an initial alignment
of 6 sequences: the motifs were drawn from conserved regions spanning
virtually the full domain length. Four iterations on OWL24.0 were 
required to reach convergence, at which point a true set comprising 26
sequences was identified. Three partial matches were also found: S15826 and
CHKCX are collagen X fragments, and C41752 is a series of fragments from
the chipmunk hibernation-specific protein HP-20, which is similar to the
complement C1q, collagen VIII and collagen X proteins.
 
An update on SPTR37_9f identified a true set of 25 sequences.
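
The iterative scanning described above (rescan the database with the enlarged motif set until the set of matching sequences stops changing) can be sketched as a fixed-point loop. This is an illustration only, not the PRINTS scanning software: `iterate_to_convergence`, `scan`, `toy_scan` and the `similar` table are hypothetical names with invented data, and counting the final confirming scan as an iteration is an assumption.

```python
from typing import Callable, FrozenSet, Iterable, Tuple


def iterate_to_convergence(
    seed: Iterable[str],
    scan: Callable[[FrozenSet[str]], Iterable[str]],
    max_iterations: int = 20,
) -> Tuple[FrozenSet[str], int]:
    """Rescan until the matched set no longer changes.

    `scan` is a hypothetical stand-in for a database search: it takes the
    current set of sequence codes and returns the codes matched on rescanning.
    """
    current = frozenset(seed)
    for iteration in range(1, max_iterations + 1):
        found = frozenset(scan(current))
        if found == current:  # nothing gained or lost: converged
            return current, iteration
        current = found
    raise RuntimeError("no convergence within max_iterations")


# Toy data (invented): each code pulls in the codes listed against it.
similar = {"seqA": {"seqB"}, "seqB": {"seqC"}, "seqC": set()}


def toy_scan(members: FrozenSet[str]) -> FrozenSet[str]:
    hits = set(members)
    for code in members:
        hits |= similar.get(code, set())
    return frozenset(hits)


true_set, n_iterations = iterate_to_convergence({"seqA"}, toy_scan)
print(sorted(true_set), n_iterations)
```

In the toy run the set grows on the first two scans and the third scan confirms that nothing further is added.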
Summary Information
25 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
   4 | 25  25  25  25
   3 |  0   0   0   0
   2 |  0   0   0   0
  ---+----------------
     |  1   2   3   4
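
As read here, the composite feature index tabulates, for each number of elements matched (rows), how many sequences contain each motif (columns). A minimal sketch of that bookkeeping follows, assuming each sequence's match is recorded as the set of motif numbers it contains. The function `composite_feature_index` and the `matches` mapping are illustrative names, and the sequence codes are placeholders.

```python
from typing import Dict, List, Set


def composite_feature_index(
    matches: Dict[str, Set[int]], n_motifs: int
) -> Dict[int, List[int]]:
    """Rows keyed by number of motifs matched (n_motifs down to 2).

    Each row counts, per motif, the sequences in that row that include it.
    """
    table = {k: [0] * n_motifs for k in range(n_motifs, 1, -1)}
    for motifs in matches.values():
        k = len(motifs)
        if k in table:
            for m in motifs:
                table[k][m - 1] += 1
    return table


# 25 placeholder codes, each matching all 4 motifs, as in the summary above.
matches = {f"seq{i}": {1, 2, 3, 4} for i in range(25)}
cfi = composite_feature_index(matches, n_motifs=4)
for k, row in cfi.items():
    print(k, row)
```

With every sequence matching all four motifs, the top row holds 25 in each column and the rows for 3 and 2 elements are all zero.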
True Positives
ACR3_HUMAN ACR3_MOUSE C1QA_HUMAN C1QA_MOUSE
C1QB_HUMAN C1QB_MOUSE C1QB_RAT C1QC_HUMAN
C1QC_MOUSE CA18_HUMAN CA18_MOUSE CA18_RABIT
CA1A_BOVIN CA1A_CHICK CA1A_HUMAN CA1A_MOUSE
CERB_HUMAN CERL_RAT COLE_LEPMA ECM_HUMAN
HP20_TAMAS HP25_TAMAS HP27_TAMAS O75973
O88992
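
Most of the true-positive codes follow a NAME_SPECIES pattern, so the species spread of the fingerprint can be tallied directly from the list. A small sketch is given below. The helper name `species_counts` is illustrative, and codes without an underscore (O75973, O88992) are grouped under a placeholder key.

```python
from collections import Counter

TRUE_POSITIVES = """
ACR3_HUMAN ACR3_MOUSE C1QA_HUMAN C1QA_MOUSE
C1QB_HUMAN C1QB_MOUSE C1QB_RAT C1QC_HUMAN
C1QC_MOUSE CA18_HUMAN CA18_MOUSE CA18_RABIT
CA1A_BOVIN CA1A_CHICK CA1A_HUMAN CA1A_MOUSE
CERB_HUMAN CERL_RAT COLE_LEPMA ECM_HUMAN
HP20_TAMAS HP25_TAMAS HP27_TAMAS O75973
O88992
""".split()


def species_counts(codes):
    """Count codes per species suffix; accession-only codes have no suffix."""
    counts = Counter()
    for code in codes:
        _, sep, species = code.partition("_")
        counts[species if sep else "(accession only)"] += 1
    return counts


counts = species_counts(TRUE_POSITIVES)
for species, n in counts.most_common():
    print(f"{species:18s}{n}")
```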
Sequence Titles
ACR3_HUMAN 30 KD ADIPOCYTE COMPLEMENT-RELATED PROTEIN PRECURSOR (ACRP30) (ADIPOSE MOST ABUNDANT GENE TRANSCRIPT 1) - HOMO SAPIENS (HUMAN).
ACR3_MOUSE 30 KD ADIPOCYTE COMPLEMENT-RELATED PROTEIN PRECURSOR (ACRP30) (ADIPOCYTE SPECIFIC PROTEIN ADIPOQ) - MUS MUSCULUS (MOUSE).
C1QA_HUMAN COMPLEMENT C1Q SUBCOMPONENT, A CHAIN PRECURSOR - HOMO SAPIENS (HUMAN).
C1QA_MOUSE COMPLEMENT C1Q SUBCOMPONENT, A CHAIN PRECURSOR - MUS MUSCULUS (MOUSE).
C1QB_HUMAN COMPLEMENT C1Q SUBCOMPONENT, B CHAIN PRECURSOR - HOMO SAPIENS (HUMAN).
C1QB_MOUSE COMPLEMENT C1Q SUBCOMPONENT, B CHAIN PRECURSOR - MUS MUSCULUS (MOUSE).
C1QB_RAT COMPLEMENT C1Q SUBCOMPONENT, B CHAIN PRECURSOR - RATTUS NORVEGICUS (RAT).
C1QC_HUMAN COMPLEMENT C1Q SUBCOMPONENT, C CHAIN PRECURSOR - HOMO SAPIENS (HUMAN).
C1QC_MOUSE COMPLEMENT C1Q SUBCOMPONENT, C CHAIN PRECURSOR - MUS MUSCULUS (MOUSE).
CA18_HUMAN COLLAGEN ALPHA 1(VIII) CHAIN PRECURSOR (ENDOTHELIAL COLLAGEN) - HOMO SAPIENS (HUMAN).
CA18_MOUSE COLLAGEN ALPHA 1(VIII) CHAIN PRECURSOR - MUS MUSCULUS (MOUSE).
CA18_RABIT COLLAGEN ALPHA 1(VIII) CHAIN PRECURSOR (ENDOTHELIAL COLLAGEN) - ORYCTOLAGUS CUNICULUS (RABBIT).
CA1A_BOVIN COLLAGEN ALPHA 1(X) CHAIN PRECURSOR - BOS TAURUS (BOVINE).
CA1A_CHICK COLLAGEN ALPHA 1(X) CHAIN PRECURSOR - GALLUS GALLUS (CHICKEN).
CA1A_HUMAN COLLAGEN ALPHA 1(X) CHAIN PRECURSOR - HOMO SAPIENS (HUMAN).
CA1A_MOUSE COLLAGEN ALPHA 1(X) CHAIN PRECURSOR - MUS MUSCULUS (MOUSE).
CERB_HUMAN CEREBELLIN 1 PRECURSOR (PRECEREBELLIN) - HOMO SAPIENS (HUMAN).
CERL_RAT CEREBELLIN-LIKE GLYCOPROTEIN - RATTUS NORVEGICUS (RAT).
COLE_LEPMA INNER EAR-SPECIFIC COLLAGEN PRECURSOR (SACCULAR COLLAGEN) - LEPOMIS MACROCHIRUS (BLUEGILL).
ECM_HUMAN ENDOTHELIAL CELL MULTIMERIN PRECURSOR - HOMO SAPIENS (HUMAN).
HP20_TAMAS HIBERNATION-ASSOCIATED PLASMA PROTEIN HP-20 PRECURSOR (HIBERNATOR-SPECIFIC BLOOD COMPLEX, 20 KD SUBUNIT) - TAMIAS ASIATICUS (CHIPMUNK).
HP25_TAMAS HIBERNATION-ASSOCIATED PLASMA PROTEIN HP-25 PRECURSOR (HIBERNATOR-SPECIFIC BLOOD COMPLEX, 25 KD SUBUNIT) - TAMIAS ASIATICUS (CHIPMUNK).
HP27_TAMAS HIBERNATION-ASSOCIATED PLASMA PROTEIN HP-27 PRECURSOR (HIBERNATOR-SPECIFIC BLOOD COMPLEX, 27 KD SUBUNIT) - TAMIAS ASIATICUS (CHIPMUNK).
O75973 C1Q-RELATED FACTOR - HOMO SAPIENS (HUMAN).
O88992 C1Q-RELATED FACTOR - MUS MUSCULUS (MOUSE).
Scan History
OWL24_0 4 100 NSINGLE
OWL28_0 2 100 NSINGLE
SPTR37_9f 2 27 NSINGLE
Initial Motifs
Motif 1 width=27
Element Seqn Id St Int Rpt
YPEANALVRFNSVVTNPQGHYNPSTGK C1QC_MOUSE 132 132 -
PLRPNQVIRFEKVITNANENYEPRNGK C1QB_MOUSE 131 131 -
PMTLGNVVIFDKVLTNQESPYQNHTGR S19018 124 124 -
PPMGGNVVIFDTVITNQEEPYQNHSGR C1QA_HUMAN 124 124 -
PPAPNSLIRFNAVLTNPQGDYDTSTGK C1QC_HUMAN 131 131 -
PLRRDQTIRFDHVITNMNNNYEPRSGK C1QB_HUMAN 131 131 -

Motif 2 width=20
Element Seqn Id St Int Rpt
FTCKVPGLYYFTYHASSRGN C1QB_MOUSE 158 0 -
FICAVPGFYYFNFQVISKWD S19018 151 0 -
FTCKVPGLYYFVYHASHTAN C1QC_HUMAN 158 0 -
FTCKVPGLYYFTYHASSRGN C1QB_HUMAN 158 0 -
FTCEVPGLYYFVYYTSHTAN C1QC_MOUSE 159 0 -
FVCTVPGYYYFTFQVLSQWE C1QA_HUMAN 151 0 -

Motif 3 width=22
Element Seqn Id St Int Rpt
QVLAGGTVLQLRRGDEVWIEKD S19018 199 28 -
QVNSGGVLLRLQVGEEVWLAVN C1QC_HUMAN 201 23 -
QVTTGGMVLKLEQGENVFLQAT C1QB_HUMAN 204 26 -
QVSSGGALLRLQRGDEVWLSVN C1QC_MOUSE 202 23 -
QVTTGGVVLKLEQEEVVHLQAT C1QB_MOUSE 206 28 -
QVVSGGMVLQLQQGDQVWVEKD C1QA_HUMAN 199 28 -

Motif 4 width=11
Element Seqn Id St Int Rpt
ANSIFTGFLLF C1QB_MOUSE 238 10 -
ADSVFSGFLIF C1QA_HUMAN 232 11 -
SDSVFSGFLLF C1QC_HUMAN 233 10 -
ADSIFSGFLIF S19018 232 11 -
ANSIFSGFLLF C1QB_HUMAN 236 10 -
SNSVFSGFLLF C1QC_MOUSE 234 10 -
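
In the motif tables, St gives the start position of each element and Int the interval from the preceding element. The tabulated values are consistent with Int being the gap between the end of the previous motif and the start of the current one, and with Int equal to St for the first motif. A short check of that reading, using the C1QB_MOUSE and C1QC_HUMAN rows above, is sketched below. The function name `intervals` is illustrative, and the formula is inferred from the listed numbers rather than taken from a PRINTS specification.

```python
from typing import List

WIDTHS = [27, 20, 22, 11]  # widths of motifs 1-4


def intervals(starts: List[int], widths: List[int]) -> List[int]:
    """Gap before each motif: start minus the end of the previous motif.

    For the first motif the tabulated interval equals its start position.
    """
    gaps = [starts[0]]
    for i in range(1, len(starts)):
        prev_end = starts[i - 1] + widths[i - 1]
        gaps.append(starts[i] - prev_end)
    return gaps


c1qb_mouse = intervals([131, 158, 206, 238], WIDTHS)
c1qc_human = intervals([131, 158, 201, 233], WIDTHS)
print(c1qb_mouse)  # [131, 0, 28, 10]
print(c1qc_human)  # [131, 0, 23, 10]
```

Both results reproduce the Int column for those sequences, including the zero interval that places Motif 2 immediately after Motif 1.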
Final Motifs
Motif 1 width=27
Element Seqn Id St Int Rpt
YPAIGTPIPFDKILYNKQQHYDPRTGI CA1A_BOVIN 556 556 -
YPAIGTPIPFDKILYNRQQHYDPRTGI CA1A_HUMAN 562 562 -
FPPVGAPVKFDKLLYNGRQNYNPQTGI CA18_MOUSE 625 625 -
YPAVGAPIPFDEILYNRQQHYDPRSGI CA1A_MOUSE 562 562 -
YPGATVPIKFDKILYNRQQHYDPRTGI CA1A_CHICK 556 556 -
FPPVGAPIKFDRLLYNGRQNYNPQTGI CA18_RABIT 626 626 -
FPPVGGPVKFNKLLYNGRQNYNPQTGI CA18_HUMAN 626 626 -
VTVPNVPIRFTKIFYNQQNHYDGSTGK ACR3_MOUSE 126 126 -
VTIPNMPIRFTKIFYNQQNHYDGSTGK ACR3_HUMAN 123 123 -
PPAPNSLIRFNAVLTNPQGDYDTSTGK C1QC_HUMAN 131 131 -
PLRRDQTIRFDHVITNMNNNYEPRSGK C1QB_HUMAN 131 131 -
YPEANALVRFNSVVTNPQGHYNPSTGK C1QC_MOUSE 132 132 -
PLRPNQVIRFEKVITNANENYEPRNGK C1QB_MOUSE 131 131 -
ALRPNQAIRFEKVITNVNDNYEPRSGK C1QB_RAT 131 131 -
FPPPSLPVKFDKVFYNGEGHWDPTLNK COLE_LEPMA 292 292 -
PHEGYEVLKFDDVVTNLGNNYDAASGK O75973 139 139 -
PHEGYEVLKFDDVVTNLGNNYDAASGK O88992 139 139 -
PPMGGNVVIFDTVITNQEEPYQNHSGR C1QA_HUMAN 124 124 -
PMTLGNVVIFDKVLTNQESPYQNHTGR C1QA_MOUSE 124 124 -
PPAPSQPVIFKEALHDAQGHFDLATGV HP27_TAMAS 100 100 -
LPPPSEPVVFTEVLYNTQRDLKESTGV HP20_TAMAS 82 82 -
MSNRTMTIYFDQVLVNIGNHFDLASSI CERL_RAT 108 108 -
PPEPFQPIVFKEALYNQEGHFNMATGE HP25_TAMAS 100 100 -
MSNRTMIIYFDQVLVNIGNNFDSERST CERB_HUMAN 77 77 -
GMTIPGPILFNNLDVNYGASYTPRTGK ECM_HUMAN 1110 1110 -

Motif 2 width=20
Element Seqn Id St Int Rpt
FTCKIPGIYYFSYHIHVKGT CA1A_BOVIN 583 0 -
FTCQIPGIYYFSYHVHVKGT CA1A_HUMAN 589 0 -
FTCEVPGVYYFAYHVHCKGG CA18_MOUSE 652 0 -
FTCKIPGIYYFSYHVHVKGT CA1A_MOUSE 589 0 -
FTCRIPGLYYFSYHVHAKGT CA1A_CHICK 583 0 -
FTCEVPGVYYFAYHVHCKGG CA18_RABIT 653 0 -
FTCEVPGVYYFAYHVHCKGG CA18_HUMAN 653 0 -
FYCNIPGLYYFSYHITVYMK ACR3_MOUSE 153 0 -
FHCNIPGLYYFAYHITVYMK ACR3_HUMAN 150 0 -
FTCKVPGLYYFVYHASHTAN C1QC_HUMAN 158 0 -
FTCKVPGLYYFTYHASSRGN C1QB_HUMAN 158 0 -
FTCEVPGLYYFVYYTSHTAN C1QC_MOUSE 159 0 -
FTCKVPGLYYFTYHASSRGN C1QB_MOUSE 158 0 -
FTCKVPGLYYFTYHASSRGN C1QB_RAT 158 0 -
FNVTYPGVYLFSYHITVRNR COLE_LEPMA 319 0 -
FTCNIPGTYFFTYHVLMRGG O75973 166 0 -
FTCNIPGTYFFTYHVLMRGG O88992 166 0 -
FVCTVPGYYYFTFQVLSQWE C1QA_HUMAN 151 0 -
FICAVPGFYYFNFQVISKWD C1QA_MOUSE 151 0 -
FTCPVPGLYQFGFHIEAVQR HP27_TAMAS 127 0 -
FNCVEPGNYHFSFDVELYHC HP20_TAMAS 109 0 -
FVAPRKGIYSFSFHVVKVYN CERL_RAT 135 0 -
FSCVLPGVYNFGFDIRLFQS HP25_TAMAS 127 0 -
FIAPRKGIYSFNFHVVKVYN CERB_HUMAN 104 0 -
FRIPYLGVYVFKYTIESFSA ECM_HUMAN 1137 0 -

Motif 3 width=22
Element Seqn Id St Int Rpt
DQASGSAVIDLTENDQVWLQLP CA1A_BOVIN 628 25 -
DQASGSAIIDLTENDQVWLQLP CA1A_HUMAN 634 25 -
DQASGSAVLLLRPGDQVFLQNP CA18_MOUSE 697 25 -
DQASGSAIMELTENDQVWLQLP CA1A_MOUSE 634 25 -
DQASGSAVIDLMENDQVWLQLP CA1A_CHICK 628 25 -
DQASGSAVLLLRPGDRVFLQMP CA18_RABIT 698 25 -
DQASGSAVLLLRPGDRVFLQMP CA18_HUMAN 698 25 -
DQASGSVLLHLEVGDQVWLQVY ACR3_MOUSE 198 25 -
DQASGSVLLHLEVGDQVWLQVY ACR3_HUMAN 195 25 -
QVNSGGVLLRLQVGEEVWLAVN C1QC_HUMAN 201 23 -
QVTTGGMVLKLEQGENVFLQAT C1QB_HUMAN 204 26 -
QVSSGGALLRLQRGDEVWLSVN C1QC_MOUSE 202 23 -
QVTTGGVVLKLEQEEVVHLQAT C1QB_MOUSE 206 28 -
QVTTGGVVLKLEQEEVVHLQAT C1QB_RAT 206 28 -
DQASNLALLHLTDGDQVWLETL COLE_LEPMA 364 25 -
DYASNSVILHLDAGDEVFIKLD O75973 214 28 -
DYASNSVILHLDAGDEVFIKLD O88992 214 28 -
QVVSGGMVLQLQQGDQVWVEKD C1QA_HUMAN 199 28 -
QVLAGGTVLQLRRGDEVWIEKD C1QA_MOUSE 199 28 -
EHISGTAILQLGMEDRVWLENK HP27_TAMAS 171 24 -
ENASGAMIMPLRQGDKVWLEAD HP20_TAMAS 153 24 -
EAASNGVLLLMEREDKVHLKLE CERL_RAT 182 27 -
KHAMGSVIMALGKGDKVWLESK HP25_TAMAS 171 24 -
EAASNGVLIQMEKGDRAYLKLE CERB_HUMAN 151 27 -
RVLTGDALLELNYGQEVWLRLA ECM_HUMAN 1185 28 -

Motif 4 width=11
Element Seqn Id St Int Rpt
VHSSFSGFLVA CA1A_BOVIN 662 12 -
VHSSFSGFLVA CA1A_HUMAN 668 12 -
VHSSFSGYLLY CA18_MOUSE 731 12 -
VHSSFSGFLVA CA1A_MOUSE 668 12 -
VHSSFSGFLFA CA1A_CHICK 662 12 -
VHSSFSGYLLY CA18_RABIT 732 12 -
VHSSFSGYLLY CA18_HUMAN 732 12 -
NDSTFTGFLLY ACR3_MOUSE 233 13 -
NDSTFTGFLLY ACR3_HUMAN 230 13 -
SDSVFSGFLLF C1QC_HUMAN 233 10 -
ANSIFSGFLLF C1QB_HUMAN 236 10 -
SNSVFSGFLLF C1QC_MOUSE 234 10 -
ANSIFTGFLLF C1QB_MOUSE 238 10 -
ANSIFTGFLLF C1QB_RAT 238 10 -
DDSTFSGFLLY COLE_LEPMA 397 11 -
KYSTFSGFIIY O75973 246 10 -
KYSTFSGFIIY O88992 246 10 -
ADSVFSGFLIF C1QA_HUMAN 232 11 -
ADSIFSGFLIF C1QA_MOUSE 232 11 -
VQAVFSGFLIH HP27_TAMAS 203 10 -
VVIYFSGFLIS HP20_TAMAS 185 10 -
KYSTFSGFLVF CERL_RAT 212 8 -
THIVFFGYLLY HP25_TAMAS 203 10 -
KYSTFSGFLVF CERB_HUMAN 181 8 -
PVTTFSGYLLY ECM_HUMAN 1216 9 -
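
Comparing the initial and final versions of Motif 4 shows how a motif broadens as more distant family members are added. The sketch below marks the columns occupied by a single residue in every element. The function name `invariant_pattern` and the use of '.' for variable columns are illustrative choices, not PRINTS notation.

```python
def invariant_pattern(elements):
    """Return the residue at fully conserved columns and '.' elsewhere."""
    assert len({len(e) for e in elements}) == 1, "elements must share one width"
    return "".join(
        col[0] if len(set(col)) == 1 else "." for col in zip(*elements)
    )


INITIAL_MOTIF_4 = """
ANSIFTGFLLF ADSVFSGFLIF SDSVFSGFLLF ADSIFSGFLIF ANSIFSGFLLF SNSVFSGFLLF
""".split()

FINAL_MOTIF_4 = """
VHSSFSGFLVA VHSSFSGFLVA VHSSFSGYLLY VHSSFSGFLVA VHSSFSGFLFA
VHSSFSGYLLY VHSSFSGYLLY NDSTFTGFLLY NDSTFTGFLLY SDSVFSGFLLF
ANSIFSGFLLF SNSVFSGFLLF ANSIFTGFLLF ANSIFTGFLLF DDSTFSGFLLY
KYSTFSGFIIY KYSTFSGFIIY ADSVFSGFLIF ADSIFSGFLIF VQAVFSGFLIH
VVIYFSGFLIS KYSTFSGFLVF THIVFFGYLLY KYSTFSGFLVF PVTTFSGYLLY
""".split()

initial_pattern = invariant_pattern(INITIAL_MOTIF_4)
final_pattern = invariant_pattern(FINAL_MOTIF_4)
print(initial_pattern)  # ..S.F.GFL.F
print(final_pattern)    # ....F.G....
```

Six columns are invariant across the six seed elements, while only the F at position 5 and the G at position 7 remain invariant across all 25 final elements.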