Identifier
ERYTHCRUORIN
Accession
PR00611
No. of Motifs
4
Creation Date
30-SEP-1996  (UPDATE 14-JUN-1999)
Title
Erythrocruorin family signature 
Database References
PRINTS; PR01907 WORMGLOBIN
PROSITE; PS01033 GLOBIN
BLOCKS; BL01033
INTERPRO; IPR002336
PDB; 1ECA; 1ASH
SCOP; 1ECA; 1ASH
CATH; 1ECA; 1ASH
Literature References
1. COWAN, J.A.
Inorganic Biochemistry: An Introduction.
VCH PUBLISHERS, 1993, NEW YORK.
 
2. KAIM, W. AND SCHWEDERSKI, B.
Bioinorganic Chemistry: Inorganic Elements in the Chemistry of Life.
WILEY, 1991, CHICHESTER.
 
3. KAPP, O.H., MOENS, L., VANFLETEREN, J., TROTMAN, C.N.A., SUZUKI, T.
AND VINOGRADOV, S.N.
Alignment of 700 globin sequences: Extent of amino acid substitution
and its correlation with variation in volume.
PROTEIN SCI. 4 2179-2190 (1995).
 
4. MOENS, L., VANFLETEREN, J., VAN DE PEER, Y., PEETERS, K., KAPP, O., 
CZELUZNIAK, J., GOODMAN, M., BLAXTER, M. AND VINOGRADOV, S.N.
Globins in nonvertebrate species: dispersal by horizontal gene transfer
and evolution of the structure-function relationships.
MOL.BIOL.EVOL. 13 324-333 (1996).
 
5. VINOGRADOV, S.N.
The structure of invertebrate extracellular hemoglobins (erythrocruorins
and chlorocruorins).
COMP.BIOCHEM.PHYSIOL.[B] 82 1-15 (1985).
 
6. GOLDBERG, D.E.
The enigmatic oxygen-avid hemoglobin of Ascaris.
BIOESSAYS 17 177-182 (1995).
 
7. TROTMAN, C.N., MANNING, A.M., BRAY, J.A., JELLIE, A.M., MOENS, L.
AND TATE, W.P.
Interdomain linkage in the polymeric hemoglobin molecule of Artemia.
J.MOL.EVOL. 38 628-636 (1994).

Documentation
Globins are haem-containing proteins involved in dioxygen binding and/or 
transport [1,2]. At present, more than 700 globin sequences are known [3].
It has been proposed that all globins have evolved from a family of 
ancestral, approximately 17kDa haemoproteins that displayed the globin 
fold and functioned as redox proteins [4]. The globin superfamily includes
vertebrate haemoglobins (Hb); vertebrate myoglobins (Mb); invertebrate 
globins; plant leghaemoglobins; and bacterial flavohaemoglobins. 
 
The function of haemoglobins (Hbs) is the transport of dioxygen in the blood.
Erythrocruorins (Ec) are extracellular Hbs found freely dissolved in the 
blood of annelids and arthropods. Ec molecules exist as aggregates of up to
200 small globin-like subunits, some of which are disulphide-bonded and not
all of which contain haem [5]. Nematodes (e.g., Ascaris) possess an octameric
Hb, each subunit containing two globin-like domains. Ascaris Hb binds
oxygen four orders of magnitude more tightly than does human Hb [6]. The
brine shrimp Artemia has evolved the longest known concatenation of globin
domains: each subunit contains 9 globin-like domains, connected by linking
peptides [7]. Artemia possesses three types of dimeric haemoglobin: HbI
(alpha+alpha); HbII (alpha+beta); and HbIII (beta+beta). 
 
The 3D structures of a number of Ecs are known. The protein is largely 
alpha-helical, eight conserved helices (A to H) providing the scaffold for 
a well-defined haem-binding pocket. The imidazole ring of the "proximal" His 
residue provides the fifth haem iron ligand; the other axial haem iron 
position remains essentially free for O(2) coordination. Many Ecs lack
the "distal" His and Val residues that are conserved in vertebrate globins.
 
ERYTHCRUORIN is a 4-element fingerprint that provides a signature for the 
erythrocruorins. The fingerprint was derived from an initial alignment
of 17 sequences (both N- and C-terminal globin-like domains of GLB_ASCSU 
and GLB_PSEDC were used to construct the alignment): motif 1 spans helices B
and C; motif 2 corresponds to helix F, and includes the invariant proximal
His residue; motif 3 includes helix G; and motif 4 spans helix H. Three
iterations on OWL28.2 were required to reach convergence, at which point a
true set comprising 62 sequences was identified. Numerous partial matches
were also found, most of which are members of the globin superfamily.
 
An update on SPTR37_9f identified a true set of 64 sequences, and 104
partial matches.
Summary Information
64 codes involving 4 elements
25 codes involving 3 elements
79 codes involving 2 elements
Composite Feature Index
    4 |  64  64  64  64
    3 |  24  24  14  13
    2 |  53  42  22  41
   ---+----------------
      |   1   2   3   4
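
The summary counts and the Composite Feature Index can be cross-checked against each other. The sketch below assumes the index is read as one row per number of matched elements (4, 3, 2) and one column per motif (1 to 4). That reading and all variable names are assumptions made for illustration, not part of the PRINTS entry. Under that reading, a code that matches n elements contributes n hits to its row, so each row should sum to n times the number of codes.

```python
# Counts copied from the Summary Information section.
codes_by_elements = {4: 64, 3: 25, 2: 79}

# Composite Feature Index, read as rows = number of matched elements,
# columns = motifs 1..4 (this reading is an assumption).
cfi = {
    4: [64, 64, 64, 64],
    3: [24, 24, 14, 13],
    2: [53, 42, 22, 41],
}


def row_is_consistent(n_elements, n_codes, row):
    """Each code matching n_elements motifs adds n_elements hits to its row."""
    return sum(row) == n_elements * n_codes


row_ok = {n: row_is_consistent(n, codes_by_elements[n], cfi[n]) for n in cfi}

# Partial matches are the codes that hit fewer than all 4 elements.
partials = codes_by_elements[3] + codes_by_elements[2]

print(row_ok)
print(partials)
```

The partial total computed this way can be compared with the number of partial matches quoted for the SPTR37_9f update in the Documentation.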
True Positives
GLB1_LUMTE GLB1_PHESE GLB2_CHITH GLB2_LUMTE
GLB2_NIPBR GLB2_TYLHE GLB3_CHITH GLB3_CHITP
GLB3_LUMTE GLB3_TYLHE GLB4_CHITH GLB4_TYLHE
GLB6_CHITH GLB7_CHITH GLB9_CHITH GLBC_CAUAR
GLBC_CHITH GLBD_CHITH GLBE_CHITH GLBF_CHITH
GLBH_CAEEL GLBH_CHITH GLBH_CHITP GLBI_CHITH
GLBI_CHITP GLBK_CHITH GLBP_CHITH GLBT_CHITH
GLBV_CHITP GLBZ_CHITH GLBZ_CHITP GLB_APLJU
GLB_APLKU GLB_APLLI GLB_BURLE GLB_CERRH
GLB_DOLAU GLB_NASMU GLB_TUBTU O02004
O02368 O02369 O02370 O02567
O07944 O61233 P91592 P91593
P91594 P91595 P91600 P92191
Q25215 Q25216 Q25217 Q25218
Q25219 Q27302 Q27303 Q27430
Q94442 Q94443 Q94444 Q94445
True Positive Partials
Codes involving 3 elements
GLB1_CHITH GLB1_GLYDI GLB1_PARCH GLB3_LAMSP
GLB4_LUMTE GLB8_CHITH GLBA_ANATR GLBA_SCAIN
GLBC_NIPBR GLBH_TRICO GLBW_CHITH GLBW_CHITP
GLBX_CHITH GLBY_CHITP GLB_PSEDC HBB2_XENLA
HBF1_URECA O02480 O61234 Q17154
Q17155 Q25689 Q26506 Q27126
Q93101
Codes involving 2 elements
DCOR_NEUCR GLB1_LUCPE GLB2_LUCPE GLB3_LUCPE
GLB4_GLYDI GLB7_ARTSX GLBB_RIFPA GLBB_SCAIN
GLBD_CAUAR GLBM_ANATR GLB_ASCSU GLB_BUSCA
GLB_ISOHY GLB_PAREP HBA3_PLEWA HBA3_RANCA
HBA3_XENLA HBA3_XENTR HBA4_XENLA HBA5_XENLA
HBAM_RANCA HBA_LIOMI HBA_TRAST HBB_LATCH
HBP2_CASGL HBPL_PARAD HBPL_TRETO HD_FUGRU
HD_HUMAN HD_MOUSE HD_RAT HMPA_ALCEU
HYPF_AZOCH LGBA_PHAVU O04939 O07407
O21882 O24520 O30764 O30765
O30766 O55574 O61603 O76242
O76243 O77003 O81116 O82467
O83423 O85168 O86363 P89459
P96645 POLG_PVYHU PPS1_BACSU Q03972
Q09164 Q17153 Q17156 Q17157
Q17286 Q20798 Q24367 Q24409
Q42785 Q43236 Q43296 Q50585
Q54296 Q54299 Q55179 Q65553
Q85265 Q85438 Q89815 Q94543
RYNR_PIG VP7_RDV Y06B_MYCTU
Sequence Titles
GLB1_LUMTE  GLOBIN I, EXTRACELLULAR (ERYTHROCRUORIN) (GLOBIN D) - LUMBRICUS TERRESTRIS (COMMON EARTHWORM). 
GLB1_PHESE GLOBIN I, EXTRACELLULAR (ERYTHROCRUORIN) - PHERETIMA SIEBOLDI (EARTHWORM).
GLB2_CHITH GLOBIN CTT-II BETA PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLB2_LUMTE GLOBIN II, EXTRACELLULAR (ERYTHROCRUORIN) (GLOBIN AIII) (GLOBIN B) - LUMBRICUS TERRESTRIS (COMMON EARTHWORM).
GLB2_NIPBR MYOGLOBIN (GLOBIN, BODY WALL ISOFORM) - NIPPOSTRONGYLUS BRASILIENSIS.
GLB2_TYLHE GLOBIN IIA, EXTRACELLULAR (ERYTHROCRUORIN) - TYLORRHYNCHUS HETEROCHAETUS (MARINE WORM).
GLB3_CHITH GLOBIN CTT-III PRECURSOR (ERYTHROCRUORIN III) - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLB3_CHITP GLOBIN CTP-III (ERYTHROCRUORIN III) - CHIRONOMUS THUMMI PIGER (MIDGE).
GLB3_LUMTE GLOBIN III, EXTRACELLULAR PRECURSOR (ERYTHROCRUORIN) (GLOBIN C) - LUMBRICUS TERRESTRIS (COMMON EARTHWORM).
GLB3_TYLHE GLOBIN IIB, EXTRACELLULAR (ERYTHROCRUORIN) - TYLORRHYNCHUS HETEROCHAETUS (MARINE WORM).
GLB4_CHITH GLOBIN CTT-IV PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLB4_TYLHE GLOBIN IIC, EXTRACELLULAR (ERYTHROCRUORIN) - TYLORRHYNCHUS HETEROCHAETUS (MARINE WORM).
GLB6_CHITH GLOBIN CTT-VI PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLB7_CHITH GLOBIN CTT-VIIA - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLB9_CHITH GLOBIN CTT-IX PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBC_CAUAR GLOBIN C, COELOMIC - CAUDINA ARENICOLA (SEA CUCUMBER) (MOLPADIA ARENICOLA).
GLBC_CHITH GLOBIN CTT-VIIB-3 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBD_CHITH GLOBIN CTT-VIIB-4 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE), AND CHIRONOMUS THUMMI PIGER (MIDGE).
GLBE_CHITH GLOBIN CTT-VIIB-5/CTT-VIIB-9 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE), AND CHIRONOMUS THUMMI PIGER (MIDGE).
GLBF_CHITH GLOBIN CTT-VIIB-6 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBH_CAEEL PUTATIVE GLOBIN-LIKE PROTEIN - CAENORHABDITIS ELEGANS.
GLBH_CHITH GLOBIN CTT-VIIB-7 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBH_CHITP GLOBIN CTT-VIIB-7 PRECURSOR - CHIRONOMUS THUMMI PIGER (MIDGE).
GLBI_CHITH GLOBIN CTT-VIIB-8 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBI_CHITP GLOBIN CTT-VIIB-8 PRECURSOR - CHIRONOMUS THUMMI PIGER (MIDGE).
GLBK_CHITH GLOBIN CTT-VIIB-10 PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBP_CHITH GLOBIN CTT-E/E' PRECURSOR - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBT_CHITH GLOBIN CTT-IIIA - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBV_CHITP GLOBIN CTT-V PRECURSOR (HBV) - CHIRONOMUS THUMMI PIGER (MIDGE).
GLBZ_CHITH GLOBIN CTT-Z PRECURSOR (HBZ) - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBZ_CHITP GLOBIN CTT-Z PRECURSOR (HBZ) - CHIRONOMUS THUMMI PIGER (MIDGE).
GLB_APLJU GLOBIN (MYOGLOBIN) - APLYSIA JULIANA (SEA HARE).
GLB_APLKU GLOBIN (MYOGLOBIN) - APLYSIA KURODAI (KURODA'S SEA HARE).
GLB_APLLI GLOBIN (MYOGLOBIN) - APLYSIA LIMACINA (SLUG SEA HARE).
GLB_BURLE GLOBIN (MYOGLOBIN) - BURSATELLA LEACHII (RAGGED SEA HARE).
GLB_CERRH GLOBIN (MYOGLOBIN) - CERITHIDEA RHIZOPHORARUM (WATER SNAIL) (HORN SHELL).
GLB_DOLAU GLOBIN (MYOGLOBIN) - DOLABELLA AURICULARIA (SEA HARE).
GLB_NASMU GLOBIN (MYOGLOBIN) - NASSA MUTABILIS (SEA SNAIL).
GLB_TUBTU GLOBIN, EXTRACELLULAR MONOMERIC - TUBIFEX TUBIFEX (SLUDGE WORM).
O02004 HEMOGLOBIN - DAPHNIA MAGNA.
O02368 GLOBIN VIIA.1 - CHIRONOMUS THUMMI THUMMI (MIDGE).
O02369 GLOBIN XII - CHIRONOMUS THUMMI THUMMI (MIDGE).
O02370 GLOBIN XI - CHIRONOMUS THUMMI THUMMI (MIDGE).
O02567 MYOGLOBIN - APLYSIA JULIANA (SEA HARE).
O07944 PRISTINAMYCIN I SYNTHASE 3 AND 4 - STREPTOMYCES PRISTINAESPIRALIS.
O61233 HEMOGLOBIN CHAIN D1 PRECURSOR - LUMBRICUS TERRESTRIS (COMMON EARTHWORM).
P91592 GLOBIN CPA 3-1 - CHIRONOMUS PALLIDIVITTATUS (MIDGE).
P91593 GLOBIN CPA F - CHIRONOMUS PALLIDIVITTATUS (MIDGE).
P91594 GLOBIN CPA 3-2 - CHIRONOMUS PALLIDIVITTATUS (MIDGE).
P91595 GLOBIN CPA E - CHIRONOMUS PALLIDIVITTATUS (MIDGE).
P91600 GLOBIN CTT 3-1 - CHIRONOMUS THUMMI THUMMI (MIDGE).
P92191 GLOBIN CPA 4-2 - CHIRONOMUS PALLIDIVITTATUS (MIDGE).
Q25215 KC HBVIIIB-A PRECURSOR - KIEFFERULUS CORNISHI.
Q25216 KC HBVIIB-B PRECURSOR - KIEFFERULUS CORNISHI.
Q25217 KC HBVIIB-E PRECURSOR - KIEFFERULUS CORNISHI.
Q25218 KC HBVIIB-G PRECURSOR - KIEFFERULUS CORNISHI.
Q25219 KC HBVIIB-H PRECURSOR - KIEFFERULUS CORNISHI.
Q27302 GLOBIN - CAENORHABDITIS BRIGGSAE.
Q27303 KC HBVIIB-C PRECURSOR - KIEFFERULUS CORNISHI.
Q27430 GLOBIN - CAENORHABDITIS REMANEI.
Q94442 TENTANS ORF'S (A-E) FOR HEMOGLOBIN PRECURSOR (A-E) - CHIRONOMUS TENTANS (MIDGE).
Q94443 TENTANS ORF'S (A-E) FOR HEMOGLOBIN PRECURSOR (A-E) - CHIRONOMUS TENTANS (MIDGE).
Q94444 TENTANS ORF'S (A-E) FOR HEMOGLOBIN PRECURSOR (A-E) - CHIRONOMUS TENTANS (MIDGE).
Q94445 TENTANS ORF'S (A-E) FOR HEMOGLOBIN PRECURSOR (A-E) - CHIRONOMUS TENTANS (MIDGE).

GLB1_CHITH GLOBIN CTT-I/CTT-IA PRECURSOR (ERYTHROCRUORIN) - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLB1_GLYDI Globin, major monomeric component - Glycera dibranchiata (Bloodworm).
GLB1_PARCH GLOBIN I - PARACAUDINA CHILENSIS (SEA CUCUMBER).
GLB3_LAMSP GIANT HEMOGLOBIN AIII CHAIN - LAMELLIBRACHIA SP. (DEEP-SEA GIANT TUBE WORM).
GLB4_LUMTE GLOBIN IV, EXTRACELLULAR (ERYTHROCRUORIN) (GLOBIN A) - LUMBRICUS TERRESTRIS (COMMON EARTHWORM).
GLB8_CHITH GLOBIN CTT-VIII - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBA_ANATR GLOBIN I ALPHA CHAIN - ANADARA TRAPEZIA (ARK CLAM).
GLBA_SCAIN GLOBIN II, A CHAIN (HBII-A) - SCAPHARCA INAEQUIVALVIS (ARK CLAM).
GLBC_NIPBR GLOBIN, CUTICULAR ISOFORM PRECURSOR - NIPPOSTRONGYLUS BRASILIENSIS.
GLBH_TRICO GLOBIN-LIKE HOST-PROTECTIVE ANTIGEN PRECURSOR - TRICHOSTRONGYLUS COLUBRIFORMIS.
GLBW_CHITH GLOBIN CTT-W PRECURSOR (HBW) - CHIRONOMUS THUMMI THUMMI (MIDGE).
GLBW_CHITP GLOBIN CTT-W PRECURSOR (HBW) - CHIRONOMUS THUMMI PIGER (MIDGE).
GLBX_CHITH Globin CTT-X - Chironomus thummi thummi (Midge).
GLBY_CHITP GLOBIN CTT-Y PRECURSOR (HBY) - CHIRONOMUS THUMMI PIGER (MIDGE).
GLB_PSEDC EXTRACELLULAR GLOBIN PRECURSOR - PSEUDOTERRANOVA DECIPIENS (COD WORM).
HBB2_XENLA HEMOGLOBIN BETA-2 CHAIN (MINOR) (LARVAL BETA-II-GLOBIN) (B2G) - XENOPUS LAEVIS (AFRICAN CLAWED FROG).
HBF1_URECA HEMOGLOBIN F-I - URECHIS CAUPO (INNKEEPER WORM) (SPOONWORM).
O02480 HEMOGLOBIN B CHAIN - SCAPHARCA INAEQUIVALVIS (ARK CLAM).
O61234 HEMOGLOBIN CHAIN D2 PRECURSOR - LUMBRICUS TERRESTRIS (COMMON EARTHWORM).
Q17154 TWO-DOMAIN CHAIN OF THE POLYMERIC HEMOGLOBIN (INTRACELLULAR) - BARBATIA LIMA.
Q17155 ALPHA CHAIN OF THE TETRAMERIC HEMOGLOBIN (INTRACELLULAR) - BARBATIA LIMA.
Q25689 'HEMOGLOBIN, ABNORMAL' PRECURSOR - PSEUDOTERRANOVA DECIPIENS (COD WORM).
Q26506 A POLYPEPTIDE CHAIN OF CHLOROCRUORIN PRECURSOR - SABELLASTARTE INDICA.
Q27126 F-I HEMOGLOBIN - URECHIS CAUPO (INNKEEPER WORM) (SPOONWORM).
Q93101 Nerve myoglobin - Aphrodite aculeata.

DCOR_NEUCR ORNITHINE DECARBOXYLASE (EC 4.1.1.17) (ODC) - NEUROSPORA CRASSA.
GLB1_LUCPE HEMOGLOBIN I (HB I) - LUCINA PECTINATA (CLAM).
GLB2_LUCPE HEMOGLOBIN II (HB II) - LUCINA PECTINATA (CLAM).
GLB3_LUCPE HEMOGLOBIN III (HB III) - LUCINA PECTINATA (CLAM).
GLB4_GLYDI Globin, monomeric component M-IV (GMH4) - Glycera dibranchiata (Bloodworm).
GLB7_ARTSX GLOBIN E7, EXTRACELLULAR - ARTEMIA SP. (BRINE SHRIMP).
GLBB_RIFPA GIANT HEMOGLOBINS B CHAIN - RIFTIA PACHYPTILA (TUBE WORM).
GLBB_SCAIN GLOBIN II, B CHAIN (HBII-B) - SCAPHARCA INAEQUIVALVIS (ARK CLAM).
GLBD_CAUAR GLOBIN D, COELOMIC - CAUDINA ARENICOLA (SEA CUCUMBER) (MOLPADIA ARENICOLA).
GLBM_ANATR GLOBIN, MINOR - ANADARA TRAPEZIA (ARK CLAM).
GLB_ASCSU EXTRACELLULAR GLOBIN PRECURSOR - ASCARIS SUUM (PIG ROUNDWORM) (ASCARIS LUMBRICOIDES).
GLB_BUSCA GLOBIN (MYOGLOBIN) - BUSYCON CANALICULATUM (CHANNELED WHELK).
GLB_ISOHY GLOBIN (MYOGLOBIN) - ISOPARORCHIS HYPSELOBAGRI.
GLB_PAREP GLOBIN-3 (MYOGLOBIN) - PARAMPHISTOMUM EPICLITUM.
HBA3_PLEWA HEMOGLOBIN ALPHA CHAIN, LARVAL - PLEURODELES WALTLII (IBERIAN RIBBED NEWT).
HBA3_RANCA HEMOGLOBIN ALPHA-III CHAIN, LARVAL - RANA CATESBEIANA (BULL FROG).
HBA3_XENLA HEMOGLOBIN ALPHA-3 CHAIN (ALPHA-T3) - XENOPUS LAEVIS (AFRICAN CLAWED FROG).
HBA3_XENTR HEMOGLOBIN ALPHA-3 CHAIN (LARVAL) - XENOPUS TROPICALIS (WESTERN CLAWED FROG) (SILURANA TROPICALIS).
HBA4_XENLA HEMOGLOBIN ALPHA-4 CHAIN (ALPHA-T4) - XENOPUS LAEVIS (AFRICAN CLAWED FROG).
HBA5_XENLA HEMOGLOBIN ALPHA-5 CHAIN (ALPHA-T5) - XENOPUS LAEVIS (AFRICAN CLAWED FROG).
HBAM_RANCA HEMOGLOBIN ALPHA-TYPE CHAIN, HEART MUSCLE - RANA CATESBEIANA (BULL FROG).
HBA_LIOMI HEMOGLOBIN ALPHA CHAIN - LIOPHIS MILIARIS.
HBA_TRAST HEMOGLOBIN ALPHA CHAIN - TRAGELAPHUS STREPSICEROS (GREATER KUDU).
HBB_LATCH HEMOGLOBIN BETA CHAIN - LATIMERIA CHALUMNAE (LATIMERIA) (COELACANTH).
HBP2_CASGL Hemoglobin-2 (Hemoglobin II) - Casuarina glauca (Swamp oak).
HBPL_PARAD Non-legume hemoglobin - Parasponia andersonii.
HBPL_TRETO HEMOGLOBIN - TREMA TOMENTOSA.
HD_FUGRU HUNTINGTIN (HUNTINGTON'S DISEASE PROTEIN HOMOLOG) (HD PROTEIN) - FUGU RUBRIPES (JAPANESE PUFFERFISH) (TAKIFUGU RUBRIPES).
HD_HUMAN HUNTINGTIN (HUNTINGTON'S DISEASE PROTEIN) (HD PROTEIN) - HOMO SAPIENS (HUMAN).
HD_MOUSE HUNTINGTIN (HUNTINGTON'S DISEASE PROTEIN HOMOLOG) (HD PROTEIN) - MUS MUSCULUS (MOUSE).
HD_RAT HUNTINGTIN (HUNTINGTON'S DISEASE PROTEIN HOMOLOG) (HD PROTEIN) - RATTUS NORVEGICUS (RAT).
HMPA_ALCEU FLAVOHEMOPROTEIN (HEMOGLOBIN-LIKE PROTEIN) (FLAVOHEMOGLOBIN) - ALCALIGENES EUTROPHUS.
HYPF_AZOCH TRANSCRIPTIONAL REGULATORY PROTEIN HYPF - AZOTOBACTER CHROOCOCCUM MCD 1.
LGBA_PHAVU Leghemoglobin A - Phaseolus vulgaris (Kidney bean) (French bean).
O04939 LEGHEMOGLOBIN - PHASEOLUS VULGARIS (KIDNEY BEAN) (FRENCH BEAN).
O07407 ALCOHOL DEHYDROGENASE - MYCOBACTERIUM TUBERCULOSIS.
O21882 HYPOTHETICAL 104.5 KD PROTEIN - BACTERIOPHAGE SK1.
O24520 LEGHEMOGLOBIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O30764 POLYKETIDE SYNTHASE MODULES 1 AND 2 - STREPTOMYCES CAELESTIS.
O30765 POLYKETIDE SYNTHASE MODULE 3 - STREPTOMYCES CAELESTIS.
O30766 POLYKETIDE SYNTHASE MODULES 4 AND 5 - STREPTOMYCES CAELESTIS.
O55574 VP80 - LEUCANIA SEPARATA NUCLEAR POLYHEDROSIS VIRUS (LSNPV).
O61603 EYELID - DROSOPHILA MELANOGASTER (FRUIT FLY).
O76242 NEURAL GLOBIN - CEREBRATULUS LACTEUS (MILKY RIBBON WORM).
O76243 BODY WALL GLOBIN - CEREBRATULUS LACTEUS (MILKY RIBBON WORM).
O77003 MYOGLOBIN - BIOMPHALARIA GLABRATA (BLOODFLUKE PLANORB).
O81116 LEGHEMOGLOBIN - TREMA ORIENTALIS.
O82467 PEROXISOMAL TARGETING SIGNAL TYPE 1 RECEPTOR - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O83423 HYPOTHETICAL 124.0 KD PROTEIN - TREPONEMA PALLIDUM.
O85168 SYRINGOMYCIN SYNTHETASE - PSEUDOMONAS SYRINGAE (PV. SYRINGAE).
O86363 HYPOTHETICAL 43.3 KD PROTEIN - MYCOBACTERIUM TUBERCULOSIS.
P89459 VERY LARGE TEGUMENT PROTEIN - HERPES SIMPLEX VIRUS (TYPE 2).
P96645 YDDH PROTEIN - BACILLUS SUBTILIS.
POLG_PVYHU GENOME POLYPROTEIN [CONTAINS: N-TERMINAL PROTEIN (P1); HELPER COMPONENT PROTEINASE (EC 3.4.22.-) (HC-PRO); PROTEIN P3; 6 KD PROTEIN 1 (6K1); CYTOPLASMIC INCLUSION PROTEIN (CI); 6 KD PROTEIN 2 (6K2); GENOME-LINKED PROTEIN (VPG); NUCLEAR INCLUSION PROTEIN A (NI-A) (NIA) (EC 3.4.22.-) (49 KD PROTEINASE) (49 KD-PRO); NUCLEAR INCLUSION PROTEIN B (NI-B) (NIB) (RNA-DIRECTED RNA POLYMERASE) (EC 2.7.7.48); COAT PROTEIN (CP)] - POTATO VIRUS Y (STRAIN HUNGARIAN) (PVY).
PPS1_BACSU PEPTIDE SYNTHETASE 1 - BACILLUS SUBTILIS.
Q03972 LEGHEMOGLOBIN 2 - PHASEOLUS VULGARIS (KIDNEY BEAN) (FRENCH BEAN).
Q09164 CYCLOSPORIN SYNTHETASE (CYSYN) (EC 6.-.-.-) - TOLYPOCLADIUM INFLATUM.
Q17153 HEMOGLOBIN (2 DOMAIN) - BARBATIA LIMA.
Q17156 BETA CHAIN OF THE TETRAMERIC HEMOGLOBIN (INTRACELLULAR) - BARBATIA LIMA.
Q17157 DELTA CHAIN OF THE HOMODIMERIC HEMOGLOBIN (INTRACELLULAR) - BARBATIA LIMA.
Q17286 HEMOGLOBIN (HETERODIMERIC) - BARBATIA VIRESCENS.
Q20798 F55A11.3 PROTEIN - CAENORHABDITIS ELEGANS.
Q24367 INSCUTEABLE - DROSOPHILA MELANOGASTER (FRUIT FLY).
Q24409 MUSASHI - DROSOPHILA MELANOGASTER (FRUIT FLY).
Q42785 LEGHEMOGLOBIN - GLYCINE MAX (SOYBEAN).
Q43236 LEGHEMOGLOBIN - VIGNA UNGUICULATA (COWPEA).
Q43296 LEGHEMOGLOBIN - VIGNA UNGUICULATA (COWPEA).
Q50585 HYPOTHETICAL 122.4 KD PROTEIN CY19G5.06 - MYCOBACTERIUM TUBERCULOSIS.
Q54296 POLYKETIDE SYNTHASE - STREPTOMYCES HYGROSCOPICUS.
Q54299 POLYKETIDE SYNTHASE - STREPTOMYCES HYGROSCOPICUS.
Q55179 HYPOTHETICAL 57.1 KD PROTEIN - SYNECHOCYSTIS SP. (STRAIN PCC 6803).
Q65553 UL36 - BOVINE HERPESVIRUS 1.
Q85265 POLYPROTEIN - POTATO VIRUS Y.
Q85438 NONSTRUCTURAL PROTEIN - RICE DWARF VIRUS (RDV).
Q89815 UL37 - BOVINE HERPESVIRUS 1.
Q94543 NEM (NEM) - DROSOPHILA MELANOGASTER (FRUIT FLY).
RYNR_PIG RYANODINE RECEPTOR, SKELETAL MUSCLE (SKELETAL MUSCLE CALCIUM RELEASE CHANNEL) - SUS SCROFA (PIG).
VP7_RDV NONSTRUCTURAL PROTEIN PNS7 - RICE DWARF VIRUS (RDV).
Y06B_MYCTU HYPOTHETICAL 139.6 KD PROTEIN CY338.11C PRECURSOR - MYCOBACTERIUM TUBERCULOSIS.
Scan History
OWL28_2 3 1370 NSINGLE
SPTR37_9f 3 1000 NSINGLE
Initial Motifs
Motif 1 width=23
Element Seqn Id St Int Rpt
GNAFFRYFFTNFPDLRVYFKGAE GLBH_CAEEL 31 31 -
SQAIWRATFAQVPESRSLFKRVH GLB2_LUMTE 30 30 -
GRLLLTKLAKDIPDVNDLFKRVD GLB3_LUMTE 53 53 -
GIALWKSMFAQDNDARDLFKRVH GLB2_TYLHE 31 31 -
GLDFLVALFEKFPDSANFFADFK GLB_APLLI 25 25 -
SVGILYAVFKADPTIQAAFPQFV GLBZ_CHITH 35 35 -
EVEILAAVFAAYPDIQNKFSQFA GLBE_CHITH 38 38 -
GLELFTKYFHENPQMMFIFGYSG GLP2_GLYDI 28 28 -
VRAVFDDLFKHYPTSKALFERVK GLB4_LUMTE 35 35 -
GREFYKYFFTNHQDLRKYFKGAE GLBH_TRICO 44 44 -
GIDLYKHMFENYPPLRKYFKNRE GLB_ASCSU 44 44 -
GIDLYKHMFENYPSMREAFKDRE GLB_ASCSU 193 193 -
GIDLYKHMFEHYPAMKKYFKHRE GLB_PSEDC 44 44 -
GIDLYKHMFEHYPHMRKAFKGRE GLB_PSEDC 193 193 -
GLELWKGILREHPEIKAPFSRVR GLB1_LUMTE 28 28 -
GQAIFQELFALDPNAKGVFGRVN GLB3_TYLHE 32 32 -
SLHFWKEFLHDHPDLVSLFKRVQ GLB1_PHESE 28 28 -

Motif 2 width=24
Element Seqn Id St Int Rpt
MDNAKQMAGTLHALGVRHKGFGDI GLP2_GLYDI 79 28 -
LDDTLVLQSHLGHLADQHIQRKGV GLB4_LUMTE 84 26 -
YDNEMIFRAFVRDTIDRHVDRGLD GLBH_TRICO 97 30 -
YDDRETFNAYTRELLDRHARDHVH GLB_ASCSU 97 30 -
YDDEETFHMYVHELMERHERLGVQ GLB_ASCSU 246 30 -
YDDRETFDAYVGELMARHERDHVK GLB_PSEDC 97 30 -
YDDEPTFDYFVDALMDRHIKDDIH GLB_PSEDC 246 30 -
TSNAAAVNSLVSKLGDDHKARGVS GLBE_CHITH 94 33 -
IDDLPNIGKHVDALVATHKPRGVT GLBZ_CHITH 85 27 -
AANAGKMSAMLSQFAKEHVGFGVG GLB_APLLI 78 30 -
LTDEPVLNAQLEHLRQQHIKLGIT GLB2_TYLHE 80 26 -
LDDPPALDAALDHLAHQHEVREGV GLB3_LUMTE 102 26 -
LDQPATLKEELDHLQVQHEGRKIP GLB2_LUMTE 79 26 -
YTNEEVFKGYVRETINRHRIYKMD GLBH_CAEEL 84 30 -
LDEDDTFTVQLAHLKAQHTERGTK GLB1_PHESE 77 26 -
LEDPKALQEELKHLARQHRERSGV GLB3_TYLHE 81 26 -
LDTPDMLAAQLAHLKVQHVERNLK GLB1_LUMTE 77 26 -
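
The Documentation states that motif 2 includes the invariant proximal His residue. A minimal sketch of how such an invariant column could be located is given below, using five elements copied from the motif 2 table above. The function name `invariant_columns` and the choice of subset are hypothetical, and column indices are zero-based.

```python
def invariant_columns(elements):
    """Return (index, residue) for every column in which all elements agree."""
    width = len(elements[0])
    if any(len(e) != width for e in elements):
        raise ValueError("all motif elements must have the same width")
    # zip(*elements) walks the alignment column by column.
    return [(i, col[0]) for i, col in enumerate(zip(*elements)) if len(set(col)) == 1]


# Five of the initial motif 2 elements (width 24), copied from the table.
motif2_subset = [
    "MDNAKQMAGTLHALGVRHKGFGDI",  # GLP2_GLYDI
    "LDDTLVLQSHLGHLADQHIQRKGV",  # GLB4_LUMTE
    "YDNEMIFRAFVRDTIDRHVDRGLD",  # GLBH_TRICO
    "TSNAAAVNSLVSKLGDDHKARGVS",  # GLBE_CHITH
    "AANAGKMSAMLSQFAKEHVGFGVG",  # GLB_APLLI
]

conserved = invariant_columns(motif2_subset)
print(conserved)
```

A column that is invariant in a small subset is only a candidate. Confirming it would mean running the same check over every element of the motif.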

Motif 3 width=17
Element Seqn Id St Int Rpt
HWTDFWKLFEEFLEKKS GLB_ASCSU 274 4 -
LWKEFWSIYQKFLESKG GLBH_TRICO 123 2 -
FFPALGMCLLDAMEEKV GLP2_GLYDI 106 3 -
YFRGIGEAFARVLPQVL GLB4_LUMTE 111 3 -
FFDIFLKHLLHVLGDRL GLB1_LUMTE 103 2 -
QFGEFRTALVAYLQANV GLBE_CHITH 120 2 -
QFNNFRAAFIAYLKGHV GLBZ_CHITH 111 2 -
QFENVRSMFPGFVASVA GLB_APLLI 104 2 -
MFNLMRTGLAYVLPAQL GLB2_TYLHE 106 2 -
HFKKFGEILATGLPQVL GLB3_LUMTE 129 3 -
YFDAFKTAILHVVAAQL GLB2_LUMTE 105 2 -
LWMAFFTVFTGYLESVG GLBH_CAEEL 110 2 -
YFDLFGTQLFDILGDKL GLB1_PHESE 103 2 -
YFDEMEKALLKVLPQVS GLB3_TYLHE 108 3 -
QWHEFWKLFAEYLNEKS GLB_PSEDC 274 4 -
VWNHFWEHFIEFLGSKT GLB_PSEDC 125 4 -
VWTDFWKLFEEYLGKKT GLB_ASCSU 125 4 -

Motif 4 width=15
Element Seqn Id St Int Rpt
TKQAWHEIGREFAKE GLB_ASCSU 147 5 -
TKHAWAVIGKEFAYE GLB_ASCSU 296 5 -
TKHAWQEIGKEFSHE GLB_PSEDC 147 5 -
VAAAWNKALDNTFAI GLBE_CHITH 142 5 -
EKHAWSTIGEDFAHE GLB_PSEDC 298 7 -
QKAAFDAIGTRFNDE GLBH_TRICO 146 6 -
VEAAWGATFDAFFGA GLBZ_CHITH 133 5 -
ADAAWTKLFGLIIDA GLB_APLLI 126 5 -
DKEAWAACWDEVIYP GLB2_TYLHE 127 4 -
DALAWKSCLKGILTK GLB3_LUMTE 149 3 -
DREAWDACIDHIEDG GLB2_LUMTE 126 4 -
QKAAWMALGKEFNAE GLBH_CAEEL 132 5 -
DQAAWRDCYAVIAAG GLB1_PHESE 124 4 -
NSGAWDRCFTRIADV GLB3_TYLHE 128 3 -
DFGAWHDCVDQIIDG GLB1_LUMTE 124 4 -
WAAAYREISDALVAG GLP2_GLYDI 130 7 -
NVDAWNRCFHRLVAR GLB4_LUMTE 131 3 -
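
The St and Int columns in the motif tables appear to be related: the values are consistent with Int being the gap between the end of the previous motif and the start of the current one, with Int equal to St for motif 1. This interpretation is inferred from the numbers in the tables rather than stated in the entry. The sketch below recomputes Int from St and the motif widths for two sets of rows copied from the initial motifs. The function name is hypothetical.

```python
# Motif widths as given in the "Motif N width=..." headers.
WIDTHS = [23, 24, 17, 15]


def intervals_from_starts(starts, widths):
    """Recompute Int values, assuming Int = St - (previous St + previous width)."""
    intervals = [starts[0]]  # for motif 1 the tables list Int equal to St
    for i in range(1, len(starts)):
        previous_end = starts[i - 1] + widths[i - 1]
        intervals.append(starts[i] - previous_end)
    return intervals


# St and Int values copied from the initial motif tables.
glbh_caeel_st, glbh_caeel_int = [31, 84, 110, 132], [31, 30, 2, 5]
# Second (C-terminal) globin-like domain of GLB_ASCSU.
glb_ascsu_st, glb_ascsu_int = [193, 246, 274, 296], [193, 30, 4, 5]

caeel_computed = intervals_from_starts(glbh_caeel_st, WIDTHS)
ascsu_computed = intervals_from_starts(glb_ascsu_st, WIDTHS)

print(caeel_computed == glbh_caeel_int)
print(ascsu_computed == glb_ascsu_int)
```

For the second GLB_ASCSU domain, the motif 1 interval is listed as 193 and so equals its start position. It is not measured from the end of the first domain's motif 4.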
Final Motifs
Motif 1 width=23
Element Seqn Id St Int Rpt
VVARLAAHLAGRPDLADKVTVDA O07944 2053 2053 -
EADILYAVFKAYPDIQAKFPQFA Q25219 38 38 -
EADILYAVFKAYPDIQAKFPQFA Q25217 38 38 -
EVEILAAVFAAYPDIQNKFPQFA GLBI_CHITP 38 38 -
EVDILAAVFAAYPDIQAKFPQFA GLBC_CHITH 38 38 -
EVEILAAVFAAYPDIQNKFSQFA GLBK_CHITH 38 38 -
EVDILAAVFAAYPDIQAKFPQFA GLBH_CHITP 38 38 -
EVDILAAVFAAYPDIQAKFPQFA GLBH_CHITH 38 38 -
EVEILAAVFAAYPDIQNKFSQFA GLBF_CHITH 38 38 -
EVEILAAVFAAYPDIQNKFSQFA GLBE_CHITH 38 38 -
EVEILAAVFAAYPDIQNKFPQFA GLBI_CHITH 38 38 -
EVDILAAVFAAYPDIQAKFPQFA GLBD_CHITH 38 38 -
EVDILAAVFTANPDIQARFPQFA Q94445 38 38 -
EADILYAVFKAYPDIQAKFPQFA Q25215 38 38 -
EVDILAAIFAANPDIQARFSQFA Q94442 40 40 -
EVDILYAVFKAYPDIQNKFSQFA Q27303 38 38 -
EVDILYAVFKAYPDIQNKFSQFA Q25216 38 38 -
EVDILAAVFKAYPDIQAKFPQFA GLBV_CHITP 38 38 -
EVEILAAVFTAYPDIQARFPQFA GLB7_CHITH 22 22 -
EVEILAAVFTAYPDIQARFPQFA O02368 38 38 -
EVDILYTVFKAYPDIQARFPQFA GLBZ_CHITP 38 38 -
SVGILYAVFKADPTIQAAFPQFV GLBZ_CHITH 35 35 -
EVDILAAVFSDHPDIQARFPQFA GLB9_CHITH 38 38 -
EVDILYAVFKAYPDIMAKFPQFA GLB6_CHITH 37 37 -
EVDILAAVFKDHPDIQARFPQFA Q94444 38 38 -
EVDILYYIFKANPDIMAKFPQFA GLB2_CHITH 37 37 -
SVGILYAVFKADPTIQAAFPQFV GLBP_CHITH 35 35 -
EVDILYYIFKANPDIMAKFPQFV Q94443 37 37 -
PVGILYAVFKADPSIMAKFTQFA GLB3_CHITP 20 20 -
PVGILYAVFKADPSIMAKFTQFA GLB3_CHITH 35 35 -
PVGILYAVFKADPSIMAKFTQFA P91600 35 35 -
STGILYAVFKADSSIQAAFPQFV P91595 35 35 -
AVGILYAVFKADPSIQAKFTQFA GLB4_CHITH 35 35 -
EVDILYSIFAANPDIQARFPQFA O02369 39 39 -
SVGILYAVFKADPSIQAKFSQFA P92191 35 35 -
EVDILYAIFKANPDIQARFPQFA O02370 39 39 -
PVGILYAVFKADPSIMAKFTQFA P91592 35 35 -
SVGILYAVFKADPSIQTKFTQFA P91594 35 35 -
PVGILYACLKADPSIQEKFPQFA P91593 38 38 -
GVEILYFFLNKFPGNFPMFKKLG GLBT_CHITH 28 28 -
GDNFLIALFEAFPDSANFFGDFK GLB_BURLE 25 25 -
GLDFLVALFEKFPDSANFFADFK GLB_APLLI 25 25 -
GDNFLIALFEAYPDSPNFFADFK GLB_DOLAU 25 25 -
GASFLVALFTQFPESANFFNDFK GLB_APLJU 25 25 -
GDSFLVALFTQFPESANFFNDFK O02567 26 26 -
GDAFLLSLFEKFPNNANYFADFK GLB_APLKU 25 25 -
GATLFSLLFKQFPDTRNYFTHFG GLB_CERRH 28 28 -
SAAMFGLLFEKYPDTKKHFKTFD GLB_NASMU 28 28 -
GRLLFEELFEIDGATKGLFKRVN GLB4_TYLHE 32 32 -
GIALWKSMFAQDNDARDLFKRVH GLB2_TYLHE 31 31 -
GKDFYKFFFTNHPDLRKYFKGAE GLB2_NIPBR 26 26 -
GLKLWNSIFRDAPEIRGLFKRVD GLB_TUBTU 28 28 -
APQVLFRFVKAHPEYQKMFSKFA O02004 70 70 -
VTDVFIRIFAYDPSAQNKFPQMA GLBC_CAUAR 35 35 -
GRLLLTKLAKDIPDVNDLFKRVD GLB3_LUMTE 53 53 -
GNGFYQYFFTNFPDLRVYFKGAE Q27430 31 31 -
SQAIWRATFAQVPESRSLFKRVH GLB2_LUMTE 30 30 -
GNAFFRYFFTNFPDLRVYFKGAE GLBH_CAEEL 31 31 -
GNGFYQYFFTNFPDLRVYFKGAE Q27302 31 31 -
SLHFWKEFLHDHPDLVSLFKRVQ GLB1_PHESE 28 28 -
GQAIFQELFALDPNAKGVFGRVN GLB3_TYLHE 32 32 -
EADILYAVFKAYPDIQAKFPQFA Q25218 38 38 -
GLELWKGILREHPEIKAPFSRVR GLB1_LUMTE 28 28 -
GLELWRDIIDDHPEIKAPFSRVR O61233 46 46 -

Motif 2 width=24
Element Seqn Id St Int Rpt
IDDLPNIGKHVDALVATHKPRGVT P91595 85 27 -
ESNLAAVNNLVSKLGADHKARGVT Q25218 94 33 -
ESNLAAVNNLVSKLGADHKARGVT Q25219 94 33 -
AANLSAVYNLVSKLGADHKARGVT Q25217 94 33 -
ESNASAVNSLVSKLGDDHKARGVS GLBI_CHITP 94 33 -
ESNASAVNSLVSKLGDDHKARGVS GLBC_CHITH 94 33 -
ESNASAVNSLVSKLGDDHKARGVS GLBK_CHITH 94 33 -
QANLSAVYALVSKLGVDHKARGIS GLBH_CHITP 94 33 -
QANLSAVYALVSKLGVDHKARGIS GLBH_CHITH 94 33 -
DSNAAAVNSLVSKLGDDHKARGVS GLBF_CHITH 94 33 -
TSNAAAVNSLVSKLGDDHKARGVS GLBE_CHITH 94 33 -
ESNASAVNSLVSKLGDDHKARGVS GLBI_CHITH 94 33 -
ASNAAAVEGLLNKLGSDHKARGVS GLBD_CHITH 94 33 -
ESNAPAVQTLVGQLAASHKARGIS Q94445 94 33 -
ASNLGAINNIVSKLGADHNGRGVT Q25215 94 33 -
AANAPALQTLVGQLAASHKARGIP Q94442 96 33 -
EANAGAIQNIVSKFGADHNARGVT Q27303 94 33 -
EANAVAIQNIVSKFGADHNARGVT Q25216 94 33 -
EANLSAVYGLVKKLGVDHKNRGIT GLBV_CHITP 94 33 -
ESNAPAVQTLVGQLAASHKARGIS GLB7_CHITH 78 33 -
ESNAPAVQTLVGQLAASHKARGIS O02368 94 33 -
ESNLSAIYGLISKMGTDHKNRGIT GLBZ_CHITP 94 33 -
IDDLPNIGKHVDALVATHKPRGVT GLBZ_CHITH 85 27 -
ESNAPAMATLINELSTSHHNRGIT GLB9_CHITH 94 33 -
DANIPAIQNLAKELATSHKPRGVS GLB6_CHITH 93 33 -
EANRPAMNTLTNELATNHHNRGIS Q94444 94 33 -
SANMPAMETLIKDMAANHKARGIP GLB2_CHITH 93 33 -
IDDLPNIGKHVDALVATHKPRGVT GLBP_CHITH 85 27 -
EANRPAMVTLINEMAANHKARKIP Q94443 93 33 -
IGELPNIEADVNTFVASHKPRGVT GLB3_CHITP 70 27 -
IGELPNIEADVNTFVASHKPRGVT GLB3_CHITH 85 27 -
IGELPNIDGDVNTFVASHKPRGVT P91600 85 27 -
IGDLPNIDGDVTTFVASHTPRGVT GLB4_CHITH 85 27 -
DSGVSAAKTLINEVAASHKGRGVS O02369 95 33 -
VGDLPNISGDVDTFVASHKPRGAT P92191 85 27 -
ESGISAAKTLINALGASHRGRGIS O02370 95 33 -
IGDLPSIEGDVDTFVTSHKPRGVT P91592 85 27 -
ISELPNIDADVDAFVATHKPRSVT P91594 85 27 -
IYELPDMERDVDTFVASHKPRGIT P91593 88 27 -
GSDMGGAKALLNQLGTSHKAMGIT GLBT_CHITH 81 30 -
AADAGKMAGMLDQFSKEHVGFGVG GLB_BURLE 78 30 -
AANAGKMSAMLSQFAKEHVGFGVG GLB_APLLI 78 30 -
AADAGKMAAMLDQFSKEHAGFGVG GLB_DOLAU 78 30 -
AADAGKMGSMLQQFATEHAGFGVG GLB_APLJU 78 30 -
AADAGKMGSMLQQFATEHAGFGVG O02567 79 30 -
AADAGKMSAMLSQFASEHVGFGVG GLB_APLKU 78 30 -
MDDADCMNGLALKLSRNHIQRKIG GLB_CERRH 81 30 -
VDDGECVLGLAKKLSRNHTARGVT GLB_NASMU 81 30 -
LGDSDTLNSLIDHLAEQHKARAGF GLB4_TYLHE 81 26 -
LTDEPVLNAQLEHLRQQHIKLGIT GLB2_TYLHE 80 26 -
FDNEDVFRAFCRETIDRHVGRGLD GLB2_NIPBR 79 30 -
LDDQAAFDAQLAHLKSQHAERNIK GLB_TUBTU 77 26 -
LFSQELMANQLNALGGAHQPRGAT O02004 123 30 -
ELDSDILPELLATLARTHDLNKVG GLBC_CAUAR 87 29 -
LDDPPALDAALDHLAHQHEVREGV GLB3_LUMTE 102 26 -
YTNEEVFKAYVRETVNRHRIYKMD Q27430 84 30 -
LDQPATLKEELDHLQVQHEGRKIP GLB2_LUMTE 79 26 -
YTNEEVFKGYVRETINRHRIYKMD GLBH_CAEEL 84 30 -
FTNEEVFKAYVRETINRHRIYKMD Q27302 84 30 -
LDEDDTFTVQLAHLKAQHTERGTK GLB1_PHESE 77 26 -
LEDPKALQEELKHLARQHRERSGV GLB3_TYLHE 81 26 -
LDTPDMLAAQLAHLKVQHVERNLK GLB1_LUMTE 77 26 -
LDTPDMLAAQLAHLKVQHVERNLK O61233 95 26 -
TLDTAALRAALADVTARHEALRTV O07944 2519 443 -

Motif 3 width=17
Element Seqn Id St Int Rpt
QFNNFRAAFIGYLKGHV P91595 111 2 -
QFGEFRTALVAYLQAHV Q25218 120 2 -
QFGEFRTALVAYLQAHV Q25219 120 2 -
QFGEFRTALVAYLQAHV Q25217 120 2 -
QFGEFRTALVAYLQANV GLBI_CHITP 120 2 -
QFGEFRTALVAYLSNHV GLBC_CHITH 120 2 -
QFGEFRTALVAYLQANV GLBK_CHITH 120 2 -
QFGEFRTALVSYLQAHV GLBH_CHITP 120 2 -
QFGEFRTALVSYLQAHV GLBH_CHITH 120 2 -
QFGEFRTALVAYLQANV GLBF_CHITH 120 2 -
QFGEFRTALVAYLQANV GLBE_CHITH 120 2 -
QFGEFRTALVAYLSNHV GLBI_CHITH 120 2 -
QFGEFRTALVSYLSNHV GLBD_CHITH 120 2 -
QFNEFRASLVSYLQANV Q94445 120 2 -
QFGEFRTALMAYLQAHV Q25215 120 2 -
QFGEFRTSLVAYLQANV Q94442 122 2 -
QFGEFRTALFAYLQAHV Q27303 120 2 -
QFGEFRTALFAYLQAHV Q25216 120 2 -
QFNEFKTALISYLSSHV GLBV_CHITP 120 2 -
QFNEFRAGLVSYVSSNV GLB7_CHITH 104 2 -
QFNEFRAGLVSYVSSNV O02368 120 2 -
QFNEFRTALVSYISSNV GLBZ_CHITP 120 2 -
QFNNFRAAFIAYLKGHV GLBZ_CHITH 111 2 -
QFNEFRSSLVSYLSSHA GLB9_CHITH 120 2 -
QFTEFRTALFTYLKAHI GLB6_CHITH 119 2 -
QFNEFRASMTSYLSHHT Q94444 120 2 -
QFNEFRASLVSYLQSKV GLB2_CHITH 119 2 -
QFNNFRAAFIAYLKGHV GLBP_CHITH 111 2 -
QFNEFRASLVSYLQSHV Q94443 119 2 -
QLNNFRAGFVSYMKAHT GLB3_CHITP 96 2 -
QLNNFRAGFVSYMKAHT GLB3_CHITH 111 2 -
QLNNFRAGFVSYMKAHT P91600 111 2 -
QLNNFRAGFVSYMKAHT GLB4_CHITH 111 2 -
QFNAFRVSLTAYLADHV O02369 121 2 -
QLNNFRSAFVSYMKAHT P92191 111 2 -
QFNEFRASLITYLSQNV O02370 121 2 -
QLNNFRAGFVSYMKAHT P91592 111 2 -
QLNNFRAGFVGYMKAHT P91594 111 2 -
QLDNFRAGFVTYMKAHT P91593 114 2 -
QFDQFRQALTELLGNLG GLBT_CHITH 107 2 -
QFENVRSMFPGFVSSVA GLB_BURLE 104 2 -
QFENVRSMFPGFVASVA GLB_APLLI 104 2 -
QFQNVSAMFPGFVASIA GLB_DOLAU 104 2 -
QFQNVRSMFPGFVASLS GLB_APLJU 104 2 -
QFQNVRSMFPGFVASLS O02567 105 2 -
QFENVRSMFPAFVASLS GLB_APLKU 104 2 -
RFGEMRQVFPNFLDEAL GLB_CERRH 107 2 -
DFKLMRSIFGEFLDKAT GLB_NASMU 107 2 -
YFKEFGKALNHVLPEVA GLB4_TYLHE 108 3 -
MFNLMRTGLAYVLPAQL GLB2_TYLHE 106 2 -
LWKAFWSVWVAFLESKG GLB2_NIPBR 105 2 -
FVNELLAVLPDYLGTKL GLB_TUBTU 107 6 -
MFEQFGGILEEVLAEEL O02004 149 2 -
HYNLFAKVLMEALQAEL GLBC_CAUAR 113 2 -
HFKKFGEILATGLPQVL GLB3_LUMTE 129 3 -
LWMAFFTVFTGYLGSTG Q27430 110 2 -
YFDAFKTAILHVVAAQL GLB2_LUMTE 105 2 -
LWMAFFTVFTGYLESVG GLBH_CAEEL 110 2 -
LWMAFFTVFTGYLESTG Q27302 110 2 -
YFDLFGTQLFDILGDKL GLB1_PHESE 103 2 -
YFDEMEKALLKVLPQVS GLB3_TYLHE 108 3 -
FFDIFLKHLLHVLGDRL GLB1_LUMTE 103 2 -
FFDIFLKHLLHVLGDRL O61233 121 2 -
LATEGRASLFMVLQAAF O07944 2722 179 -

Motif 4 width=15
Element Seqn Id St Int Rpt
DFGAWHDCVDQIIDG O61233 142 4 -
VAAAWNQALDNTFAI Q25218 142 5 -
VAAAWNQALDNTFAI Q25219 142 5 -
VAAAWNHALDNTYAV Q25217 142 5 -
VAAAWNKALDNTFAI GLBI_CHITP 142 5 -
VAAAWNKALDNTYAI GLBC_CHITH 142 5 -
VAAAWNKALDNTFAI GLBK_CHITH 142 5 -
VAAAWNHALDNTYAV GLBH_CHITP 142 5 -
VAAAWNHALDNTYAV GLBH_CHITH 142 5 -
VAAAWNKALDNTFAI GLBF_CHITH 142 5 -
VAAAWNKALDNTYAI GLBI_CHITH 142 5 -
VAAAWNKALDNTMAV GLBD_CHITH 142 5 -
VAAAWTQGLDNIYGL Q94445 142 5 -
VAAAWNHALDNTMEI Q25215 142 5 -
VAAAWNQALDNLFFV Q94442 144 5 -
VAAAWNQAVDNTFTI Q27303 142 5 -
VAAAWNQAVDNVFVV Q25216 142 5 -
VAAAWEHALENTYTV GLBV_CHITP 142 5 -
AESAWTAGLDNIFGL GLB7_CHITH 126 5 -
AESAWTAGLDNIFGL O02368 142 5 -
VAAAWTHALDNVYTA GLBZ_CHITP 142 5 -
VEAAWGATFDAFFGA GLBZ_CHITH 133 5 -
TADAWTHGLDNIFGM GLB9_CHITH 142 5 -
TETAWTLALDTTYAM GLB6_CHITH 141 5 -
TAAAWTHGLDNIFDA Q94444 142 5 -
LGAAWTQGLDNVFNM GLB2_CHITH 141 5 -
VEAAWGATFDAFFGA GLBP_CHITH 133 5 -
LGAAWTQGLDNAFTM Q94443 141 5 -
AEAAWGATLDTFFGM GLB3_CHITP 117 4 -
AEAAWGATLDTFFGM GLB3_CHITH 132 4 -
AEAAWGATLDTFFGM P91600 132 4 -
VEAAWGATFDAFFGA P91595 133 5 -
LAAAARHCFDLTTEL O07944 3621 882 -
VAAAWNKALDNTFAI GLBE_CHITH 142 5 -
AEAAWGATLDAFFGM GLB4_CHITH 132 4 -
VAQAWEKGLDNVYFV O02369 143 5 -
SESAWGATLDAFFGA P92191 132 4 -
VAQAWEKGFNNVYFI O02370 143 5 -
SESAWGATLDTFFGM P91592 132 4 -
AESAWGATLDTFFGA P91594 132 4 -
SESAWGASLDNFFGM P91593 135 4 -
NIGAWNATVDLMFHV GLBT_CHITH 127 3 -
ADAAWGKLFGLIIDA GLB_BURLE 126 5 -
ADAAWTKLFGLIIDA GLB_APLLI 126 5 -
ADAAWGKLFGLIIDA GLB_DOLAU 126 5 -
ADAAWNSLFGLIISA GLB_APLJU 124 3 -
GDAAWNSLFGLIISA O02567 125 3 -
ADDAWNKLFGLIVAA GLB_APLKU 124 3 -
VKGAWDALLAYLQDN GLB_CERRH 131 7 -
MKSAWDALLGVLIEN GLB_NASMU 131 7 -
NPEAWNHCFDGLVDV GLB4_TYLHE 128 3 -
DKEAWAACWDEVIYP GLB2_TYLHE 127 4 -
QKAAWDKLGTVFNDE GLB2_NIPBR 127 5 -
DFKAWSECLGVITGA GLB_TUBTU 124 0 -
ARQAWKNGLAALVAG O02004 173 7 -
TRDAWAKAFSVVQAV GLBC_CAUAR 137 7 -
DALAWKSCLKGILTK GLB3_LUMTE 149 3 -
QKAAWMALGKEFNAE Q27430 132 5 -
DREAWDACIDHIEDG GLB2_LUMTE 126 4 -
QKAAWMALGKEFNAE GLBH_CAEEL 132 5 -
QKAAWMALGKEFNAE Q27302 132 5 -
DQAAWRDCYAVIAAG GLB1_PHESE 124 4 -
NSGAWDRCFTRIADV GLB3_TYLHE 128 3 -
DFGAWHDCVDQIIDG GLB1_LUMTE 124 4 -