
==SPRINT==> PRINTS View




Identifier
CD4TCANTIGEN
Accession
PR00692
No. of Motifs
6
Creation Date
11-APR-1997  (UPDATE 06-JUN-1999)
Title
T-cell surface glycoprotein CD4 signature
Database References

INTERPRO; IPR000973
PDB; 1CHD
SCOP; 1CHD
CATH; 1CHD
Literature References
1. RYU, S.E., KWONG, P.D., TRUNEH, A., PORTER, T.G., ARTHOS, J., 
ROSENBERG, M., DAI, X.P., XUONG, N.H., AXEL, R., SWEET, R.W.
AND HENDRICKSON, W.A.
Crystal structure of an HIV-binding recombinant fragment of human CD4.
NATURE 348(6300) 419-426 (1990).
 
2. WANG, J.H., YAN, Y.W., GARRETT, T.P., LIU, J.H., RODGERS, D.W.,
GARLICK, R.L., TARR, G.E., HUSAIN, Y., REINHERZ, E.L. AND HARRISON, S.C.
Atomic structure of a fragment of human CD4 containing two
immunoglobulin-like domains.
NATURE 348(6300) 411-418 (1990).

Documentation
The CD4 glycoprotein on the surface of T cells participates in the immune
response and serves as the receptor for HIV. The structure of a soluble
fragment of CD4 has been determined to 2.3 Å resolution and reveals that
the molecule has two intimately associated immunoglobulin-like domains
connected by a continuous beta strand [1,2]. Residues implicated in HIV
recognition reside in domain D1. Domain D2 is distinguished by a variation
on the beta-strand topology of antibody domains that results in a truncated
beta-barrel with a non-standard intra-sheet disulphide bond [1,2]. The
binding sites for monoclonal antibodies, class II major histocompatibility
complex molecules, and HIV gp120 can be mapped onto the molecular surface.
 
CD4TCANTIGEN is a 6-element fingerprint that provides a signature for 
T-cell surface glycoprotein CD4. The fingerprint was derived from an
initial alignment of 8 sequences: the motifs were drawn from conserved
regions spanning the full alignment length. Two iterations on OWL29.2
were required to reach convergence, at which point a true set comprising
20 sequences was identified.
 
An update on SPTR37_9f identified a true set of 15 sequences.
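
To illustrate how a motif drawn from an alignment can be used to recognise further family members, the following is a minimal Python sketch that builds a residue-count profile from the eight Motif 4 rows listed under Initial Motifs in this entry and slides it along a query sequence. The function names (`build_profile`, `score_window`, `best_hit`), the simple count-based score, and the `GGGG` flanks in the usage note are assumptions made for illustration; this is not the scanning algorithm used to build PRINTS.

```python
from collections import Counter

# Motif 4 rows copied from the Initial Motifs table of this entry.
INITIAL_MOTIF_4 = [
    "QALPQYAGSGNLTLALE",  # CD4_MACMU
    "QALPHYAGSGNLTLALE",  # CD4_ERYPA
    "QALPQYAGSGNLTLALE",  # CD4_CERTO
    "QALPQYAGSGNLTLALE",  # CD4_CERAE
    "QALHRYAGSGNLSLTLD",  # CD4_RABIT
    "QVSLQFAGSGNLTLTLD",  # CD4_MOUSE
    "QVLSRYAGSGILTLNLA",  # CD4_CANFA
    "QALPQYAGSGNLTLALE",  # CD4_PANTR
]


def build_profile(motif_rows):
    """Count the residues seen in each column of an ungapped motif."""
    width = len(motif_rows[0])
    if any(len(row) != width for row in motif_rows):
        raise ValueError("all motif rows must have the same width")
    return [Counter(row[i] for row in motif_rows) for i in range(width)]


def score_window(profile, window):
    """Sum, over columns, how often the window's residue occurs in the motif."""
    return sum(column[residue] for column, residue in zip(profile, window))


def best_hit(profile, sequence):
    """Return (score, 0-based offset) of the best-scoring window."""
    width = len(profile)
    return max(
        (score_window(profile, sequence[i:i + width]), i)
        for i in range(len(sequence) - width + 1)
    )


profile = build_profile(INITIAL_MOTIF_4)
```

As a usage example, `best_hit(profile, "GGGG" + "NVLSRYAGSGNLTLVLD" + "GGGG")` locates the P79355 Motif 4 element (taken from the Final Motifs table) at offset 4, even though that sequence was not part of the initial alignment; the glycine flanks are made up for the demonstration.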
Summary Information
15 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
6 | 15 15 15 15 15 15
5 |  0  0  0  0  0  0
4 |  0  0  0  0  0  0
3 |  0  0  0  0  0  0
2 |  0  0  0  0  0  0
--+------------------
  |  1  2  3  4  5  6
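
The index can be read as follows: each row is a number of matched elements, each column is a motif number, and each cell counts the sequences that match that many elements and include that motif. The Python sketch below reproduces the table from the True Positives list under that reading. The function name `composite_feature_index` and the `matches` mapping are hypothetical, and the assumption that every true positive matches all six motifs is taken from the Summary Information of this entry.

```python
def composite_feature_index(matches, n_motifs):
    """Tabulate motif usage by number of elements matched.

    matches: dict mapping a sequence code to the set of 1-based motif
             numbers that the sequence matches (hypothetical input format).
    Returns a dict {elements_matched: [count for motif 1..n_motifs]}.
    """
    table = {k: [0] * n_motifs for k in range(n_motifs, 1, -1)}
    for motifs in matches.values():
        k = len(motifs)
        if k in table:
            for m in motifs:
                table[k][m - 1] += 1
    return table


# The 15 true positives listed in this entry.
TRUE_POSITIVES = [
    "CD4_CANFA", "CD4_HUMAN", "CD4_MACFA", "CD4_MACFU", "CD4_MACMU",
    "CD4_MACNE", "CD4_MOUSE", "CD4_PANTR", "CD4_RABIT", "CD4_RAT",
    "CD4_SAISC", "P79355", "Q28217", "Q29617", "Q61396",
]

# Per the Summary Information, all 15 codes involve all 6 elements.
matches = {code: {1, 2, 3, 4, 5, 6} for code in TRUE_POSITIVES}

cfi = composite_feature_index(matches, n_motifs=6)
for elements, row in cfi.items():
    print(elements, "|", " ".join(f"{count:2d}" for count in row))
```

A sequence matching only some of the motifs would add counts to a lower row, and only in the columns of the motifs it actually contains.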
True Positives
CD4_CANFA CD4_HUMAN CD4_MACFA CD4_MACFU
CD4_MACMU CD4_MACNE CD4_MOUSE CD4_PANTR
CD4_RABIT CD4_RAT CD4_SAISC P79355
Q28217 Q29617 Q61396
Sequence Titles
CD4_CANFA T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - CANIS FAMILIARIS (DOG).
CD4_HUMAN T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - HOMO SAPIENS (HUMAN).
CD4_MACFA T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - MACACA FASCICULARIS (CRAB EATING MACAQUE) (CYNOMOLGUS MONKEY).
CD4_MACFU T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - MACACA FUSCATA FUSCATA (JAPANESE MACAQUE).
CD4_MACMU T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - MACACA MULATTA (RHESUS MACAQUE).
CD4_MACNE T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - MACACA NEMESTRINA (PIG-TAILED MACAQUE).
CD4_MOUSE T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) (T-CELL DIFFERENTIATION ANTIGEN L3T4) - MUS MUSCULUS (MOUSE).
CD4_PANTR T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - PAN TROGLODYTES (CHIMPANZEE).
CD4_RABIT T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - ORYCTOLAGUS CUNICULUS (RABBIT).
CD4_RAT T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) (W3/25 ANTIGEN) - RATTUS NORVEGICUS (RAT).
CD4_SAISC T-CELL SURFACE GLYCOPROTEIN CD4 PRECURSOR (T-CELL SURFACE ANTIGEN T4/LEU-3) - SAIMIRI SCIUREUS (COMMON SQUIRREL MONKEY).
P79355 CD4 ANTIGEN PRECURSOR - FELIS SILVESTRIS CATUS (CAT).
Q28217 CD4 - CERCOPITHECUS AETHIOPS (GREEN MONKEY) (GRIVET).
Q29617 CD4 PRECURSOR - MACACA MULATTA (RHESUS MACAQUE).
Q61396 T-CELL DIFFERENTIATION ANTIGEN - MUS MUSCULUS (MOUSE).
Scan History
OWL29_2 2 50 NSINGLE
SPTR37_9f 2 17 NSINGLE
Initial Motifs
Motif 1 width=21
Element Seqn Id St Int Rpt
TAYKSEGESAEFSFPLNFAEE CD4_MOUSE 214 214 -
IVYKKEGEQVEFSFPLAFTVE CD4_PANTR 210 210 -
TVYKKEGEQVEFSFPLAFTLE CD4_MACMU 210 210 -
TVYKKEGEQVEFSFPLAFTLE CD4_CERAE 183 183 -
TVYKKEGEQVEFSFPLAFTLE CD4_CERTO 183 183 -
TVYKKEGEQVEFSFPLAFTLE CD4_ERYPA 183 183 -
TVYKKEGEQVEFSFPLNFEDE CD4_RABIT 215 215 -
TFYAREGDQVEFSFPLSFEDE CD4_CANFA 218 218 -

Motif 2 width=18
Element Seqn Id St Int Rpt
ELWWQAERASSSKSWITF CD4_MACMU 237 6 -
ELMWQVDGASSAQSWVSF CD4_RABIT 240 4 -
ELMWKAEKDSFFQPWISF CD4_MOUSE 239 4 -
ELWWQAERASSSKSWITF CD4_CERAE 210 6 -
ELWWQAERASSSKSWITF CD4_CERTO 210 6 -
ELWWQAERASSSKSWITF CD4_ERYPA 210 6 -
ELRWQAQGASSSLLWISF CD4_CANFA 243 4 -
ELWWQAERASSSKSWITF CD4_PANTR 237 6 -

Motif 3 width=16
Element Seqn Id St Int Rpt
KLQMGKKLPLNLTLPQ CD4_CERAE 244 16 -
KLQMGKKLPLHLTLPQ CD4_CERTO 244 16 -
KLQMGEKLPLHLTLPQ CD4_ERYPA 244 16 -
KLQMGKKLPLHLTLPQ CD4_MACMU 271 16 -
KLQMGKKLPLHLTLPQ CD4_PANTR 271 16 -
KLQMKESLPLRFTLPQ CD4_CANFA 277 16 -
KLQLKETLPLTLKIPQ CD4_MOUSE 273 16 -
KIQMSKGLPLSLTLPQ CD4_RABIT 274 16 -

Motif 4 width=17
Element Seqn Id St Int Rpt
QALPQYAGSGNLTLALE CD4_MACMU 286 -1 -
QALPHYAGSGNLTLALE CD4_ERYPA 259 -1 -
QALPQYAGSGNLTLALE CD4_CERTO 259 -1 -
QALPQYAGSGNLTLALE CD4_CERAE 259 -1 -
QALHRYAGSGNLSLTLD CD4_RABIT 289 -1 -
QVSLQFAGSGNLTLTLD CD4_MOUSE 288 -1 -
QVLSRYAGSGILTLNLA CD4_CANFA 292 -1 -
QALPQYAGSGNLTLALE CD4_PANTR 286 -1 -

Motif 5 width=16
Element Seqn Id St Int Rpt
LHQEVNLVVMRATQFQ CD4_CERTO 281 5 -
LHQEVNLVVMRATQFQ CD4_MACMU 308 5 -
LHQEVNLVVMRATQLQ CD4_PANTR 308 5 -
LYQEVNLVVMRANSSQ CD4_CANFA 312 3 -
LHQEVNLVVMKVAQLN CD4_MOUSE 308 3 -
LHQQVSLVMLKVTQVK CD4_RABIT 309 3 -
LHQEVNLVVMRATQFQ CD4_ERYPA 281 5 -
LHQEVNLVVMRATQFQ CD4_CERAE 281 5 -

Motif 6 width=18
Element Seqn Id St Int Rpt
KAVWVLNPEEGMWQCLLS CD4_ERYPA 329 32 -
KMVQVLDPKAGTWQCLLS CD4_RABIT 356 31 -
KVVQVVAPETGLWQCLLS CD4_MOUSE 356 32 -
KLVWVVDPEGGTWQCLLS CD4_CANFA 360 32 -
KAVWVLNPEAGMWQCLLS CD4_PANTR 356 32 -
KAVWVLNPEAGMWQCLLS CD4_MACMU 356 32 -
KAVWVLNPEAGMWQCLLS CD4_CERTO 329 32 -
KAVWVLNPEEGMWQCLLS CD4_CERAE 329 32 -
Final Motifs
Motif 1 width=21
Element Seqn Id St Int Rpt
TVYKKEGEQVEFSFPLAFTLE CD4_MACMU 210 210 -
TVYKKEGEQVEFSFPLAFTLE CD4_MACNE 210 210 -
TVYKKEGEQVEFSFPLAFTLE CD4_MACFU 210 210 -
TVYKKEGEQVEFSFPLAFTLE Q29617 210 210 -
TVYKKEGEQVEFSFPPAFTLE CD4_MACFA 210 210 -
IVYKKEGEQVEFSFPLAFTVE CD4_HUMAN 210 210 -
IVYKKEGEQVEFSFPLAFTVE CD4_PANTR 210 210 -
TVYKKEGEQVEFSFPLAFTLE Q28217 210 210 -
TVYKKEGEQVEFSFPLAFAAE CD4_SAISC 209 209 -
TFYAREGDQVEFSFPLSFEDE CD4_CANFA 218 218 -
TAYKSEGESAEFSFPLNFAEE CD4_MOUSE 214 214 -
TAYKSEGESAEFSFPLNFAEE Q61396 214 214 -
TVYKKEGEQVEFSFPLNFEDE CD4_RABIT 215 215 -
TAYKSEGESAEFSFPLNLGEE CD4_RAT 213 213 -
TVYAKEGEQVEFSFPLNFEDE P79355 229 229 -

Motif 2 width=18
Element Seqn Id St Int Rpt
ELWWQAERASSSKSWITF CD4_MACMU 237 6 -
ELWWQAERASSSKSWITF CD4_MACNE 237 6 -
ELWWQAERASSSKSWITF CD4_MACFU 237 6 -
ELWWQAERASSPKSWITF Q29617 237 6 -
ELWWQAERASSSKSWITF CD4_MACFA 237 6 -
ELWWQAERASSSKSWITF CD4_HUMAN 237 6 -
ELWWQAERASSSKSWITF CD4_PANTR 237 6 -
ELWWQAERASSSKSWITF Q28217 237 6 -
ELCWQAERASSSKSWITF CD4_SAISC 236 6 -
ELRWQAQGASSSLLWISF CD4_CANFA 243 4 -
ELMWKAEKDSFFQPWISF CD4_MOUSE 239 4 -
ELMWKAEKDSFFQPWISF Q61396 239 4 -
ELMWQVDGASSAQSWVSF CD4_RABIT 240 4 -
ELRWKAEKAPSSQSWITF CD4_RAT 238 4 -
NLRWKAEGAPSSLLWISF P79355 254 4 -

Motif 3 width=16
Element Seqn Id St Int Rpt
KLQMGKKLPLHLTLPQ CD4_MACMU 271 16 -
KLQMGKKLPLHLTLPQ CD4_MACNE 271 16 -
KLQMGKKLPLHLTLPQ CD4_MACFU 271 16 -
KLQMGKKLPLHLTLPQ Q29617 271 16 -
KLQMGKKLPLHLTLPQ CD4_MACFA 271 16 -
KLQMGKKLPLHLTLPQ CD4_HUMAN 271 16 -
KLQMGKKLPLHLTLPQ CD4_PANTR 271 16 -
KLQMGKKLPLNLTLPQ Q28217 271 16 -
KLRMGKKLPLHLTLAQ CD4_SAISC 270 16 -
KLQMKESLPLRFTLPQ CD4_CANFA 277 16 -
KLQLKETLPLTLKIPQ CD4_MOUSE 273 16 -
KLQLKETLPLTLKIPQ Q61396 273 16 -
KIQMSKGLPLSLTLPQ CD4_RABIT 274 16 -
KFQLSETLPLTLQIPQ CD4_RAT 272 16 -
KLQMMDSLPLRFTLPN P79355 288 16 -

Motif 4 width=17
Element Seqn Id St Int Rpt
QALPQYAGSGNLTLALE CD4_MACMU 286 -1 -
QALPQYAGSGNLTLALD CD4_MACNE 286 -1 -
QALPQYAGSGNLTLALE CD4_MACFU 286 -1 -
QALPQYAGSGNLTLALE Q29617 286 -1 -
QALPQYAGSGNLTLALE CD4_MACFA 286 -1 -
QALPQYAGSGNLTLALE CD4_HUMAN 286 -1 -
QALPQYAGSGNLTLALE CD4_PANTR 286 -1 -
QALPQYAGSGNLTLALE Q28217 286 -1 -
QALPQYAGSGNFTLALK CD4_SAISC 285 -1 -
QVLSRYAGSGILTLNLA CD4_CANFA 292 -1 -
QVSLQFAGSGNLTLTLD CD4_MOUSE 288 -1 -
QVSLQFAGSGNLTLTLD Q61396 288 -1 -
QALHRYAGSGNLSLTLD CD4_RABIT 289 -1 -
QVSLQFAGSGNLTLTLD CD4_RAT 287 -1 -
NVLSRYAGSGNLTLVLD P79355 303 -1 -

Motif 5 width=16
Element Seqn Id St Int Rpt
LHQEVNLVVMRATQFQ CD4_MACMU 308 5 -
LHQEVNLVVMRATQFQ CD4_MACNE 308 5 -
LHQEVNLVVMRAAQFQ CD4_MACFU 308 5 -
LHQEVNLVVMRATQFQ Q29617 308 5 -
LHQEVNLVVMRATQFQ CD4_MACFA 308 5 -
LHQEVNLVVMRATQLQ CD4_HUMAN 308 5 -
LHQEVNLVVMRATQLQ CD4_PANTR 308 5 -
LHQEVNLVVMRATQFQ Q28217 308 5 -
LHQEVNLVVMRVTQLQ CD4_SAISC 307 5 -
LYQEVNLVVMRANSSQ CD4_CANFA 312 3 -
LHQEVNLVVMKVAQLN CD4_MOUSE 308 3 -
LHQEVNLVVMKVAQLN Q61396 308 3 -
LHQQVSLVMLKVTQVK CD4_RABIT 309 3 -
LYQEVNLVVMKVTQPD CD4_RAT 307 3 -
LQQEVKLVVMRVTQSG P79355 323 3 -

Motif 6 width=18
Element Seqn Id St Int Rpt
KAVWVLNPEAGMWQCLLS CD4_MACMU 356 32 -
KAVWVLNPEAGMWQCLLS CD4_MACNE 356 32 -
KAVWVLNPEAGMWQCLLS CD4_MACFU 356 32 -
KAVWVLNPEAGMWQCLLS Q29617 356 32 -
KAVWVLNPEAGMWQCLLS CD4_MACFA 356 32 -
KAVWVLNPEAGMWQCLLS CD4_HUMAN 356 32 -
KAVWVLNPEAGMWQCLLS CD4_PANTR 356 32 -
KAVWVLNPEEGMWQCLLS Q28217 356 32 -
KAVWVLNPEPGAWQCLLS CD4_SAISC 355 32 -
KLVWVVDPEGGTWQCLLS CD4_CANFA 360 32 -
KVVQVVAPETGLWQCLLS CD4_MOUSE 356 32 -
KVVQVVAPETGLWQCLLS Q61396 356 32 -
KMVQVLDPKAGTWQCLLS CD4_RABIT 356 31 -
KVIQVQAPEAGVWQCLLS CD4_RAT 356 33 -
KMVRVEDAEAGTWQCLLS P79355 371 32 -
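
The St and Int columns in the motif tables appear to be related: for every motif after the first, Int equals the start of that motif minus the end of the preceding one (its St plus its width), so a value of -1 means the two motifs overlap by one residue. For the first motif, Int is listed as equal to St. This reading is inferred from the numbers in this entry rather than stated by it. The Python sketch below checks it for CD4_HUMAN using values copied from the Final Motifs tables; the function name `implied_intervals` is hypothetical.

```python
# (width, St, Int) for CD4_HUMAN, one tuple per motif, from Final Motifs.
CD4_HUMAN = [
    (21, 210, 210),
    (18, 237, 6),
    (16, 271, 16),
    (17, 286, -1),
    (16, 308, 5),
    (18, 356, 32),
]


def implied_intervals(rows):
    """Recompute the Int column from widths and start positions.

    The first value is taken to be the first motif's St, matching how
    the tables list it; later values are the gaps between motifs.
    """
    gaps = [rows[0][1]]
    for (prev_width, prev_start, _), (_, start, _) in zip(rows, rows[1:]):
        gaps.append(start - (prev_start + prev_width))
    return gaps


listed = [row[2] for row in CD4_HUMAN]
computed = implied_intervals(CD4_HUMAN)
print(computed == listed)
```

The same check can be applied to any other sequence in the tables by substituting its widths, St values, and Int values.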