FIELD OF THE INVENTION
This invention relates generally to the field of signal transduction between cells, via cytokines and their receptors. More specifically, it relates to enzymatic activity that cleaves and releases the receptor for TNF found on the cell surface,and the consequent biological effects. Certain embodiments of this invention are compositions that affect such enzymatic activity, and may be included in medicaments for disease treatment.
BACKGROUND OF THE INVENTION
Cytokines play a central role in the communication between cells. Secretion of a cytokine from one cell in response to a stimulus can trigger an adjacent cell to undergo an appropriate biological response--such as stimulation, differentiation,or apoptosis. It is hypothesized that important biological events can be influenced not only by affecting cytokine release from the first cell, but also by binding to receptors on the second cell, which mediates the subsequent response. The inventiondescribed in this patent application provides new compounds for affecting signal transduction from tumor necrosis factor.
The cytokine known as tumor necrosis factor (TNF or TNF-.alpha.) is structurally related to lymphotoxin (LT or TNF-.beta.). They have about 40 percent amino acid sequence homology (Old, Nature 330:602-603, 1987). These cytokines are released bymacrophages, monocytes and natural killer cells and play a role in inflammatory and immunological events. The two cytokines cause a broad spectrum of effects both in vitro and in vivo, including: (i) vascular thrombosis and tumor necrosis; (ii)inflammation; (iii) activation of macrophages and neutrophils; (iv) leukocytosis; (v) apoptosis; and (vi) shock. TNF has been associated with a variety of disease states including various forms of cancer, arthritis, psoriasis, endotoxic shock, sepsis,autoimmune diseases, infections, obesity, and cachexia. TNF appears to play a role in the three factors contributing to body weight control: intake, expenditure, and storage of energy (Rothwell, Int. J. Obesity 17:S98-S101, 1993). In septicemia,increased endotoxin concentrations appear to raise TNF levels (Beutler et al. Science 229:869-871, 1985).
Attempts have been made to alter the course of a disease by treating the patient with TNF inhibitors, with varying degrees of success. For example, the TNF inhibitor dexanabinol provided protection against TNF mediated effects followingtraumatic brain injury (Shohami et al. J. Neuroimmun. 72:169-77, 1997). Some improvement in Crohn's disease was afforded by treatment with anti-TNF antibodies (Neurath et al., Eur. J. Immun. 27:1743-50, 1997).
Human TNF and LT mediate their biological activities by binding specifically to two distinct glycoprotein plasma membrane receptors (55 kDa and 75 kDa in size, known as p55 and p75 TNF-R, respectively). The two receptors share 28 percent aminoacid sequence homology in their extracellular domains, which are composed of four repeating cysteine-rich regions (Tartaglia and Goeddel, Immunol. Today 13:151-153, 1992). However, the receptors lack significant sequence homology in their intracellulardomains, and mediate different intracellular responses to receptor activation. In accordance with the different activities of TNF and LT, most human cells express low levels of both TNF receptors: about 2,000 to 10,000 receptors per cell (Brockhaus etal., Proc. Natl. Acad. Sci. USA 87:3127-3131, 1990).
Expression of TNF receptors on both lymphoid and non-lymphoid cells can be influenced experimentally by many different agents, such as bacterial lipopolysaccharide (LPS), phorbol myristate acetate (PMA; a protein kinase C activator),interleukin-1 (IL-1), interferon-gamma (IFN-.gamma.) and IL-2 (Gatanaga et al. Cell Immunol. 138:1-10, 1991; Yui et al. Placenta 15:819-835, 1994). It has been shown that complexes of human TNF bound to its receptor are internalized from the cellmembrane, and then the receptor is either degraded or recycled (Armitage, Curr. Opin. Immunol. 6:407-413, 1994). It has been proposed that TNF receptor activity can be modulated using peptides that bind intracellularly to the receptor, or which bindto the ligand binding site, or that affect receptor shedding. See for example patent publications WO 95/31544, WO 95/33051, WO 96/01642, and EP 568 925.
TNF binding proteins (TNF-BP) have been identified at elevated levels in the serum and urine of febrile patients, patients with renal failure, and cancer patients, and even certain healthy individuals. Human brain and ovarian tumors producedhigh serum levels of TNF-BP These molecules have been purified, characterized, and cloned (Gatanaga et al., Lymphokine Res. 9:225-229, 1990a; Gatanaga et al., Proc. Natl. Acad. Sci USA 87:8781-8784, 1990b). Human TNF-BP consists of 30 kDa and 40 kDaproteins which are identical to the N-terminal extracellular domains of p55 and p75 TNF receptors, respectively (U.S. Pat. No. 5,395,760; EP 418,014). Such proteins have been suggested for use in treating endotoxic shock. Mohler et al. J. Immunol. 151:1548-1561, 1993.
There are several mechanisms possible for the production of secreted proteins resembling membrane bound receptors. One involves translation from alternatively spliced mRNAs lacking transmembrane and cytoplasmic regions. Another involvesproteolytic cleavage of the intact membrane receptors, followed by shedding of the cleaved receptor from the cell. The soluble form of p55 and p75 TNF-R do not appear to be generated from mRNA splicing, since only full length receptor mRNA has beendetected in human cells in vitro (Gatanaga et al., 1991). Carboxyl-terminal sequencing and mutation studies on human p55 TNF-R indicates that a cleavage site may exist between residues Asn 172 and Val 173 (Gullberg et al. Eur. J. Cell. Biol. 58:307-312, 1992).
There are reports that a specific metalloprotease inhibitor, TNF-.alpha. protease inhibitor (TAPI) blocks the shedding of soluble p75 and p55 TNF-R (Crowe et al. J. Exp. Med. 181:1205-1210, 1995; Mullberg et al. J. Immunol 155:5198-5205,1995). The processing of pro-TNF on the cell membrane to release the TNF ligand appears to be dependent on a matrix metalloprotease like enzyme (Gearing et al. Nature 370:555-557, 1994). This is a family of structurally related matrix-degrading enzymesthat play a major role in tissue remodeling and repair associated with development and inflammation (Birkedal-Hansen et al. Crit. Rev. Oral Biol. Med. 4:197-250, 1993). The enzymes have Zn.sup.2+ in their catalytic domains, and Ca.sup.2+ stabilizestheir tertiary structure significantly.
In European patent application EP 657536A1, Wallach et al. suggest that it would be possible to obtain an enzyme that cleaves the 55,000 kDa TNF receptor by finding a mutated form of the receptor that is not cleaved by the enzyme, but still bindsto it. The only proposed source for the enzyme is a detergent extract of membranes for cells that appear to have the protease activity. If it were possible to obtain an enzyme according to this scheme, then the enzyme would presumably comprise amembrane spanning region. The patent application does not describe any protease that was actually obtained.
In a previous patent application in the present series (International Patent Publication WO 9820140), methods are described for obtaining an isolated enzyme that cleaves both the p55 and p75 TNF-R from cell surfaces. A convenient source is theculture medium of cells that have been stimulated with phorbol myristate acetate (PMA). The enzyme activity was given the name TRRE (TNF receptor releasing enzyme). In other studies, TRRE was released immediately upon PMA stimulation, indicating thatit is presynthesized in an inactive form to be rapidly converted to the active form upon stimulation. Evidence for direct cleavage of TNF-R is that the shedding begins very quickly (.about.5 min) with maximal shedding within 30 min. TRRE is specific forthe TNF-R, and does not cleave IL-1 receptors, CD30, ICAM-1 or CD11b. TRRE activity is enhanced by adding Ca.sup.++ or Zn.sup.++, and inhibited by EDTA and phenantroline.
Given the involvement of TNF in a variety of pathological conditions, it is desirable to obtain a variety of factors that would allow receptor shedding to be modulated, thereby controlling the signal transduction from TNF at a disease site.
SUMMARY OF THE INVENTION
This disclosure provides new compounds that promote enzymatic cleavage and release of TNF receptors from the cell surface. Nine new DNA clones have been selected after repeat screening in an assay that tests the ability to enhance receptorrelease. The polynucleotide sequences of this invention and the proteins encoded by them have potential as diagnostic aids, and therapeutic compounds that can be used to adjust TNF signal transduction in a beneficial way.
One embodiment of the invention is an isolated polynucleotide comprising a nucleotide sequence with the following properties: a) the sequence is expressed at the mRNA level in Jurkat T cells; b) when COS-1 cells expressing TNF-receptor aregenetically transformed to express the sequence, the cells have increased enzymatic activity for cleaving and releasing the receptor. If a polynucleotide sequence is expressed in Jurkat cells, then it can be found in the Jurkat cell expression librarydeposited with the ATCC (Accession No. TIB-152). It is recognized that the polynucleotide can be obtained from other cell lines, or produced by recombinant techniques.
Included are polynucleotides in which the nucleotide sequence is contained in any of SEQ. ID NOS:1-10. Also embodied are polynucleotides comprising at least 30 and preferably more consecutive nucleotides in said nucleotide sequence, or at least50 consecutive nucleotides that are homologous to said sequence at a significant level, preferably at the 90% level or more. Also included antisense and ribozyme polynucleotides that inhibit the expression of a TRRE modulator.
Another embodiment of the invention is isolated polypeptides comprising an amino acid sequence encoded by a polynucleotide of this invention. Non-limiting examples are sequences shown in SEQ. ID NOS: 147-158. Fragments and fusion proteins areincluded in this invention, and preferably comprise at least 10 consecutive residues encoded by a polynucleotide of this invention, or at least 15 consecutive amino acids that are homologous at a significant level, preferably at least 80%. Preferredpolypeptides promote cleavage and release of TNF receptors from the cell surface, especially COS-1 cells genetically transformed to express TNF receptor. The polypeptides may or may not have a membrane spanning domain, and may optionally be produced bya process that involves secretion from a cell. Included are species homologs with the desired activity, and artificial mutants with additional beneficial properties.
Another embodiment of this invention is an antibody specific for a polypeptide of this invention. Preferred are antibodies that bind a TRRE modulator protein, but not other substances found in human tissue samples in comparable amounts.
Another embodiment of the invention is an assay method of determining altered TRRE activity in a cell or tissue sample, using a polynucleotide or antibody of this invention to detect the presence br absence of the corresponding TRRE modulator. The assay method can optionally be used for the diagnosis or evaluation of a clinical condition relating to abnormal TNF levels or TNF signal transduction.
Another embodiment of the invention is a method for increasing or decreasing signal transduction from a cytokine into a cell (including but not limited to TNF), comprising contacting the cell with a polynucleotide, polypeptide, or antibody ofthis invention.
A further embodiment of the invention is a method for screening polynucleotides for an ability to modulate TRRE activity. The method involves providing cells that express both TRRE and the TNF-receptor; genetically altering the cells with thepolynucleotides to be screened; cloning the cells; and identifying clones with the desired activity.
Yet another embodiment of the invention is a method for screening substances for an ability to affect TRRE activity. This typically involves incubating cells expressing TNF receptor with a TRRE modulator of this invention in the presence orabsence of the test substance; and measuring the effect on shedding of the TNF receptor.
The products of this invention can be used in the preparation of a medicament for treatment of the human or animal body. The medicament contains a clinically effective amount for treatment of a disease such as heart failure, cachexia,inflammation, endotoxic shock, arthritis, multiple sclerosis, sepsis, and cancer. These compositions can be used for administration to a subject suspected of having or being at risk for the disease, optionally in combination with other forms oftreatment appropriate for their condition.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a schematic representation of plasmid pCDTR2. This plasmid expresses p75 TNF-R, the .about.75 kDa form of the TNF receptor. PCMV stands for cytomegalovirus; BGHpA stands for bovine growth hormone polyadenylation signal.
FIG. 2 is a line depicting the levels of p75 TNF-R detected on COS-1 cells genetically altered to express the receptor. Results from the transformed cells, designated C75R (.circle-solid., upward swooping line) is compared with that from theparental COS-1 cells (.box-solid., baseline). The receptor number was calculated by Scatchard analysis (inset).
FIG. 3 is a survival graph, showing that TRRE decreases mortality in mice challenged with lipopolysaccharide (LPS) to induce septic peritonitis. (.diamond-solid.) LPS alone; (.box-solid.) LPS plus control buffer; (.circle-solid.) LPS plus TRRE(2,000 U); (.tangle-solidup.) LPS plus TRRE (4,000 U).
FIG. 4 is a half-tone reproduction of a bar graph, showing the effect of 9 new clones on TRRE activity on C75R cells (COS-1 cells transfected to express the TNF-receptor. Each of the 9 clones increases TRRE activity by over 2-fold.
FIG. 5 is a survival graph, showing the ability of 4 new expressed to save mice challenged with LPS. (.diamond-solid.) saline; (.box-solid.) BSA; (.DELTA.) Mey-3 (100 .mu.g); (X) Mey-3 (10 .mu.g); (*) Mey-5 (10 .mu.g); (.circle-solid.) Mey-8 (10.mu.g).
DETAILED DESCRIPTION OF THE INVENTION
It has been discovered that certain cells involved in the TNF transduction pathway express enzymatic activity that causes TNF receptors to be shed from the cell surface. Enzymatic activity for cleaving and releasing TNF receptors has been giventhe designation TRRE. Phorbol myristate acetate induces release of TRRE from cells into the culture medium. An exemplary TRRE protein had been purified from the supernatant of TNF-1 cells (Example 2). The protease bears certain hallmarks of themetalloprotease family, and is released rapidly from the cell upon activation.
In order to elucidate the nature of this protein, functional cloning was performed. Jurkat cells were selected as being a good source of TRRE. The cDNA from a Jurkat library was expressed, and cell supernatant was tested for an ability torelease TNF receptors from cell surfaces. Cloning and testing of the expression product was conducted through several cycles, and nine clones were obtained that more than doubled TRRE activity in the assay (FIG. 4). At the DNA level, all 9 clones haddifferent sequences.
Protein expression products from the clones have been tested in a lipopolysaccharide animal model for sepsis. Protein from three different clones successfully rescued animals from a lethal dose of LPS (FIG. 5). This points to an important rolefor these molecules in the management of pathological conditions mediated by TNF.
The number of new TRRE promoting clones obtained from the expression library was surprising. The substrate specificity of the TRRE isolated in Example 2 distinguishes the 75 kDa and 55 kDa TNF receptors from other cytokine receptors and cellsurface proteins. There was little reason beforehand to suspect that cells might have nine different proteases for the TNF receptor. It is possible that one of the clones encodes the TRRE isolated in Example 2, or a related protein. It is possiblethat some of the other clones have proteolytic activity to cleave TNF receptors at the same site, or at another site that causes release of the soluble form from the cell. It is a hypothesis of this disclosure that some of the clones may not haveproteolytic activity themselves, but play a role in promoting TRRE activity in a secondary fashion.
This possibility is consistent with the observations made, because there is an endogenous level of TRRE activity in the cells used in the assay. The cleavage assay involves monitoring TNF receptor release from C75 cells, which are COS-1 cellsgenetically altered to express p75 TNF-R. The standard assay is conducted by contacting the transformed cells with a fluid believed to contain TRRE. The level of endogenous TRRE activity is evident from the rate of spontaneous release of the receptoreven when no exogenous TRRE is added (about 200 units). Accordingly, accessory proteins that promote TRRE activity would increase the activity measured in the assay. Many mechanisms of promotion are possible, including proteins that activate a zymogenform of TRRE, proteins that free TRRE from other cell surface components, or proteins that stimulate secretion of TRRE from inside the cell. It is not necessary to understand the mechanism in order to use the products of this invention in most of theembodiments described.
It is anticipated that several of the clones will have activity not just for promoting TNF receptor cleavage, but also having an effect on other surface proteins. To the extent that cleavage sequences or accessory proteins are shared betweendifferent receptors, certain clones would promote phenotypic change (such as receptor release) for the family of related substrates.
This disclosure provides polypeptides that promote TRRE activity, polynucleotides that encode such polypeptides, and antibodies that bind such peptides. The binding of TNF to its receptor mediates a number of biological effects. Cleavage of theTNF-receptor by TRRE diminishes signal transduction by TRRE. Potentiators of TRRE activity have the same effect. Thus, the products of this invention can be used to modulate signal transduction by cytokines, which is of considerable importance in themanagement of disease conditions that are affected by cytokine action. The products of this invention can also be used in diagnostic methods, to determine when signal transduction is being inappropriately affected by abnormal TRRE activity. The assaysystems described in this disclosure provide a method for screening additional compounds that can influence TRRE activity, and thus the signal transduction from TNF.
Based on the summary of the invention, and guided by the illustrations in the example section, one skilled in the art will readily know what techniques to employ in the practice of the invention. The following detailed description is providedfor the additional convenience of the reader.
Definitions and Basic Techniques
As used in this disclosure, "TRRE activity" refers to the ability of a composition to cleave and release TNF receptors from the surface of cells expressing them. A preferred assay is cleavage from transfected COS-1 cells, as described in Example1. However, TRRE activity can be measured on any cells that bear TNF receptors of the 55 kDa or 75 kDa size. Other features of the TRRE enzyme obtained from PMA induction of THP-1 cells (exemplified in Example 2) need not be a property of the TRREactivity measured in the assay.
Unit activity of TRRE is defined as 1 pg of soluble p75 TNF-R released from cell surface in a standard assay, after correction for spontaneous release. The measurement of TRRE activity is explained further in Example 1.
A "TRRE modulator" is a compound that has the property of either increasing or decreasing TRRE activity for processing TNF on the surface of cells. Those that increase TRRE activity may be referred to as TRRE promoters, and those that decreaseTRRE activity may be referred to as TRRE inhibitors. TRRE promoters include compounds that have proteolytic activity for TNF-R, and compounds that augment the activity of TNF-R proteases. The nine polynucleotide clones described in Example 5, and theirprotein products, are exemplary TRRE promoters. Inhibitors of TRRE activity can be obtained using the screening assays described below.
The term "polynucleotide" refers to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three-dimensional structure, and may perform any function, knownor unknown. The following are non-limiting examples of polynucleotides: a gene or gene fragment, exons, introns, (mRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, nucleic acid probes, and primers. Apolynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The term polynucleotide refersinterchangeably to double-and single-stranded molecules. Unless otherwise specified or required, any embodiment of the invention described herein that is a polynucleotide encompasses both the double-stranded form, and each of two complementarysingle-stranded forms known or predicted to make up the double-stranded form.
"Hybridization" refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. Hybridization reactions can be performed under conditions ofdifferent "stringency". Relevant conditions include temperature, ionic strength, and the presence of additional solutes in the reaction mixture such as formamide. Conditions of increasing stringency are 30.degree. C. in 10.times.SSC (0.15M NaCl, 15 mMcitrate buffer); 40.degree. C. in 6.times.SSC; 50.degree. C. in 6..times.SSC 60.degree. C. in 6.times.SSC, or at about 40.degree. C. in 0.5.times.SSC, or at about 30.degree. C. in 6..times..SSC containing 50% formamide. SDS and a source offragmented DNA (such as salmon sperm) are typically also present during hybridization. Higher stringency requires higher minimum complementarity between hybridizing elements for a stable hybridization complex to form. See "Molecular Cloning: ALaboratory Manual", Second Edition (Sambrook, Fritsch & Maniatis, 1989).
It is understood that purine and pyrimidine nitrogenous bases with similar structures can be functionally equivalent in terms of Watson-Crick base-pairing; and the inter-substitution of like nitrogenous bases, particularly uracil and thymine, orthe modification of nitrogenous bases, such as by methylation, does not constitute a material substitution.
The percentage of sequence identity for polynucleotides or polypeptides is calculated by aligning the sequences being compared, and then counting the number of shared residues at each aligned position. No penalty is imposed for the presence ofinsertions or deletions, but are permitted only where required to accommodate an obviously increased number of amino acid residues in one of the sequences being aligned. When one of the sequences being compared is indicated as being "consecutive", thenno gaps are permitted in that sequence during the comparison. The percentage identity is given in terms of residues in the test sequence that are identical to residues in the comparison or reference sequence.
As used herein, "expression" of a polynucleotide refers to the production of an RNA transcript. Subsequent translation into protein or other effector compounds may also occur, but is not required unless specified.
"Genetic alteration" refers to a process wherein a genetic element is introduced into a cell other than by mitosis or meiosis. The element may be heterologous to the cell, or it may be an additional copy or improved version of an element alreadypresent in the cell. Genetic alternation may be effected, for example, by transducing a cell with a recombinant plasmid or other polynucleotide through any process known in the art, such as electroporation, calcium phosphate precipitation, or contactingwith a polynucleotide-liposome complex. Genetic alteration may also be effected, for example, by transduction or infection with a DNA or RNA virus or viral vector. It is preferable that the genetic alteration is inheritable by progeny of the cell, butthis is not generally required unless specified.
The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted bynon-amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labelingcomponent.
A "fusion polypeptide" is a polypeptide comprising regions in a different position in the sequence than occurs in nature. The regions can normally exist in separate proteins and are brought together in the fusion polypeptide; they can normallyexist in the same protein but are placed in a new arrangement in the fusion polypeptide; or they can be synthetically arranged. A "functionally equivalent fragment" of a polypeptide varies from the native sequence by addition, deletion, or substitutionof amino acid residues, or any combination thereof, while preserving a functional property of the fragment relevant to the context in which it is being used. Fusion peptides and functionally equivalent fragments are included in the definition ofpolypeptides used in this disclosure.
It is understood that the folding and the biological function of proteins can accommodate insertions, deletions, and substitutions in the amino acid sequence. Some amino acid substitutions are more easily tolerated. For example, substitution ofan amino acid with hydrophobic side chains, aromatic side chains, polar side chains, side chains with a positive or negative charge, or side chains comprising two or fewer carbon atoms, by another amino acid with a side chain of like properties can occurwithout disturbing the essential identity of the two sequences. Methods for determining homologous regions and scoring the degree of homology are described in Altschul et al. Bull. Math. Bio. 48:603-616, 1986; and Henikoff et al. Proc. Natl. Acad. Sci. USA 89:10915-10919, 1992. Substitutions that preserve the functionality of the polypeptide, or confer a new and beneficial property (such as enhanced activity, stability, or decreased immunogenicity) are especially preferred.
An "antibody" (interchangeably used in plural form) is an immunoglobulin molecule capable of specific binding to a target, such as a polypeptide, through at least one antigen recognition site, located in the variable region of the immunoglobulinmolecule. As used herein, the term encompasses not only intact antibodies, but also antibody equivalents that include at least one antigen combining site of the desired specificity. These include but are not limited to enzymatic or recombinantlyproduced fragments antibody, fusion proteins, humanized antibodies, single chain variable regions, diabodies, and antibody chains that undergo antigen-induced assembly.
An "isolated" polynucleotide, polypeptide, protein, antibody, or other substance refers to a preparation of the substance devoid of at least some of the other components that may also be present where the substance or a similar substancenaturally occurs or is initially obtained from. Thus, for example, an isolated substance may be prepared by using a purification technique to enrich it from a source mixture. Enrichment can be measured on an absolute basis, such as weight per volume ofsolution, or it can be measured in relation to a second, potentially interfering substance present in the source mixture. Increasing enrichments of the embodiments of this invention are increasingly more preferred. Thus, for example, a 2-foldenrichment is preferred, 10-fold enrichment is more preferred, 100-fold enrichment is more preferred, 1000-fold enrichment is even more preferred. A substance can also be provided in an isolated state by a process of artificial assembly, such as bychemical synthesis or recombinant expression.
A "host cell" is a cell which has been genetically altered, or is capable of being transformed, by administration of an exogenous polynucleotide.
The term "clinical sample" encompasses a variety of sample types obtained from a subject and useful in an in vitro procedure, such as a diagnostic test. The definition encompasses solid tissue samples obtained as a surgical removal, a pathologyspecimen, or a biopsy specimen, cells obtained from a clinical subject or their progeny obtained from culture, liquid samples such as blood, serum, plasma, spinal fluid, and urine, and any fractions or extracts of such samples that contain a potentialindication of the disease.
Unless otherwise indicated, the practice of the invention will employ conventional techniques of molecular biology, microbiology, recombinant DNA, and immunology, within the skill of the art. Such techniques are explained in the standardliterature, such as: "Molecular Cloning: A Laboratory Manual", Second Edition (Sambrook, Fritsch & Maniatis, 1989), "Oligonucleotide Synthesis" (M. J. Gait, ed., 1984), "Animal Cell Culture" (R. I. Freshney, ed., 1987); the series "Methods in Enzymology"(Academic Press, Inc.); "Handbook of Experimental Immunology" (D. M. Weir & C. C. Blackwell, Eds.), "Gene Transfer Vectors for Mammalian Cells" (J. M. Miller & M. P. Calos, eds., 1987), "Current Protocols in Molecular Biology" (F. M. Ausubel et al.,eds., 1987); and "Current Protocols in Immunology" (J. E. Coligan et al., eds., 1991). The reader may also choose to refer to a previous patent application relating to TRRE, International Patent Application WO 98020140.
For purposes of prosecution in the U.S., and in other jurisdictions where allowed, all patents, patent applications, articles and publications indicated anywhere in this disclosure are hereby incorporated herein by reference in their entirety.
Polynucleotides
Polynucleotides of this invention can be prepared by any suitable technique in the art. Using the data provided in this disclosure, sequences of less than .about.50 base pairs are conveniently prepared by chemical synthesis, either through acommercial service or by a known synthetic method, such as the triester method or the phosphite method. A preferred method is solid phase synthesis using mononucleoside phosphoramidite coupling units (Hirose et al., Tetra. Left. 19:2449-2452, 1978;U.S. Pat. No. 4,415,732).
For use in antisense therapy, polynucleotides can be prepared by chemistry that produce more stable in pharmaceutical preparations. Non-limiting examples include thiol-derivatized nucleosides (U.S. Pat. No. 5,578,718), and oligonucleotideswith modified backbones (U.S. Pat. Nos. 5,541,307 and 5,378,825).
Polynucleotides of this invention can also be obtained by PCR amplification of a template with the desired sequence. Oligonucleotide primers spanning the desired sequence are annealed to the template, elongated by a DNA polymerase, and thenmelted at higher temperature so that the template and elongated oligonucleotides dissociate. The cycle is repeated until the desired amount of amplified polynucleotide is obtained (U.S. Pat. Nos. 4,683,195 and 4,683,202). Suitable templates includethe Jurkat T cell library and other human or animal expression libraries that contain TRRE modulator encoding sequences. The Jurkat T cell library is available from the American Type Culture Collection, 10801 University Blvd., Manassas Va. 20110,U.S.A. (ATCC #TIB-152). Mutations and other adaptations can be performed during amplification by designing suitable primers, or can be incorporated afterwards by genetic splicing.
Production scale amounts of large polynucleotides are most conveniently obtained by inserting the desired sequence into a suitable cloning vector and reproducing the clone. Techniques for nucleotide cloning are given in Sambrook, Fritsch &Maniatis (supra) and in U.S. Pat. No. 5,552,524. Exemplary cloning and expression methods are illustrated in Example 6.
Preferred polynucleotide sequences are 50%, 70%, 80%, 90%, or 100% identical to one of the sequences exemplified in this disclosure; in order if increasing preference. The length of consecutive residues in the identical or homologous sequencecompared with the exemplary sequence can be about 15, 30, 50, 75, 100, 200 or 500 residues in order of increasing preference, up to the length of the entire clone. Nucleotide changes that cause a conservative substitution or retain the function of theencoded polypeptide (in terms of hybridization properties or what is encoded) are especially preferred substitutions.
The polynucleotides of this can be used to measure altered TRRE activity in a cell or tissue sample. This involves contacting the sample with the polynucleotide under conditions that permit the polynucleotide to hybridize specifically withnucleic acid that encodes a modulator of TRRE activity, if present in the sample, and determining polynucleotide that has hybridized as a result of step a). Specificity of the test can be provided in one of several ways. One method involves the use ofa specific probe--a polynucleotide of this invention with a sequence long enough and of sufficient identity to the sequence being detected, so that it binds the target and not other nucleic acid that might be present in the sample. The probe istypically labeled (either directly or through a secondary reagent) so that it can be subsequently detected. Suitable labels include .sup.32P and .sup.33P, chemiluminescent and fluorescent reagents. After the hybridization reaction, unreacted probe iswashed away so that the amount of hybridized probe can be determined. Signal can be amplified using branched probes (U.S. Pat. No. 5,124,246). In another method, the polynucleotide is a primer for a PCR reaction. Specificity is provided by theability of the paired probes to amplify the sequence of interest. After a suitable number of PCR cycles, the amount of amplification product present correlates with the amount of target sequence originally present in the sample.
Such tests are useful both in research, and in the diagnosis or assessment of a disease condition. For example, TNF activity plays a role in eliminating tumor cells (Example 4), and a cancer may evade the elimination process by activating TRREactivity in the diseased tissue. Hence, under some conditions, high expression of TRRE modulators may correlate with progression of cancer. Diagnostic tests are also of use in monitoring therapy, such as when gene therapy is performed to increase TRREactivity.
Polynucleotides of this invention can also be used for production of polypeptides and the preparation of medicaments, as explained below.
Polypeptides
Short polypeptides of this invention can be prepared by solid-phase chemical synthesis. The principles of solid phase chemical synthesis can be found in Dugas & Penney, Bioorganic Chemistry, Springer-Verlag NY pp 54-92 (1981), and U.S. Pat. No. 4,493,795. Automated solid-phase peptide synthesis can be performed using devices such as a PE-Applied Biosystems 430A peptide synthesizer (commercially available from Applied Biosystems, Foster City Calif.).
Longer polypeptides are conveniently obtained by expression cloning. A polynucleotide encoding the desired polypeptide is operably linked to control elements for transcription and translation, and then transfected into a suitable host cell. Expression may be effected in procaryotes such as E. coli (ATCC Accession No. 31446 or 27325), eukaryotic microorganisms such as the yeast Saccharomyces cerevisiae, or higher eukaryotes, such as insect or mammalian cells. A number of expression systemsare described in U.S. Pat. No. 5,552,524. Expression cloning is available from such commercial services as Lark Technologies, Houston Tex. The production of protein from 4 exemplary clones of this invention in insect cells is illustrated in Example6. The protein is purified from the producing host cell by standard methods in protein chemistry, such as affinity chromatography and HPLC. Expression products are optionally produced with a sequence tag to facilitate affinity purification, which cansubsequently be removed.
Preferred sequences are 40%, 60%, 80%, 90%, or 100% identical to one of the sequences exemplified in this disclosure; in order if increasing preference. The length of the identical or homologous sequence compared with the native humanpolynucleotide can be about 7, 10, 15, 20, 30, 50 or 100 residues in order of increasing preference, up to the length of the entire encoding region.
Polypeptides can be tested for an ability to modulate TRRE in a TNF-R cleavage assay. The polypeptide is contacted with the receptor (preferably expressed on the surface of a cell, such as a C75 cell), and the ability of the polypeptide toincrease or decrease receptor cleavage and release is determined. Cleavage of TNF-R by exemplary polypeptides of this invention is illustrated in Example 7.
Polypeptides of this invention can be used as immunogens for raising antibody. Large proteins will raise a cocktail of antibodies, while short peptide fragments will raise antibodies against small region of the intact protein. Antibody clonescan be mapped for protein binding site by producing short overlapping peptides of about 10 amino acids in length. Overlapping peptides can be prepared on a nylon membrane support by standard F-Moc chemistry, using a SPOTS.TM. kit from Genosys accordingto manufacturer's directions.
Polypeptides of this invention can also be used to affect TNF signal transduction, as explained below.
Antibodies
Polyclonal antibodies can be prepared by injecting a vertebrate with a polypeptide of this invention in an immunogenic form. Immunogenicity of a polypeptide can be enhanced by linking to a carrier such as KLH, or combining with an adjuvant, suchas Freund's adjuvant. Typically, a priming injection is followed by a booster injection is after about 4 weeks, and antiserum is harvested a week later. Unwanted activity cross-reacting with other antigens, if present, can be removed, for example, byrunning the preparation over adsorbants made of those antigens attached to a solid phase, and collecting the unbound fraction. If desired, the specific antibody activity can be further purified by a combination of techniques, which may include protein,A chromatography, ammonium sulfate precipitation, ion exchange chromatography, HPLC, and immunoaffinity chromatography using the immunizing polypeptide coupled to a solid support. Antibody fragments and other derivatives can be prepared by standardimmunochemical methods, such as subjecting the antibody to cleavage with enzymes such as papain or pepsin.
Production of monoclonal antibodies is described in such standard references as Harrow & Lane (1988), U.S. Pat. Nos. 4,491,632, 4,472,500 and 4,444,887, and Methods in Enzymology 73B:3 (1981). Briefly, a mammal is immunized, andantibody-producing cells (usually splenocytes) are harvested. Cells are immortalized by fusion with a non-producing myeloma, transfecting with Epstein Barr Virus, or transforming with oncogenic DNA. The treated cells are cloned and cultured, and theclones are selected that produce antibody of the desired specificity.
Other methods of obtaining specific antibody molecules (optimally in the form of single-chain variable regions) involve contacting a library of immunocompetent cells or viral particles with the target antigen, and growing out positively selectedclones. Immunocompetent phage can be constructed to express immunoglobulin variable region segments on their surface. See Marks et al., New Eng. J. Med. 335:730, 1996, International Patent Applications WO 9413804, WO 9201047, WO 90 02809, andMcGuiness et al., Nature Biotechnol. 14:1449, 1996.
The antibodies of this invention are can be used in immunoassays for TRRE modulators. General techniques of immunoassay can be found in "The Immunoassay Handbook", Stockton Press NY, 1994; and "Methods of Immunological Analysis", Weinheim: VCHVerlags gesellschaft mbH, 1993). The antibody is combined with a test sample under conditions where the antibody will bind specifically to any modulator that might be present, but not any other proteins liable to be in the sample. The complex formedcan be measured in situ (U.S. Pat. Nos. 4,208,479 and 4,708,929), or by physically separating it from unreacted reagents (U.S. Pat. No. 3,646,346). Separation assays typically involve labeled TRRE reagent (competition assay), or labeled antibody(sandwich assay) to facilitate detection and quantitation of the complex. Suitable labels are radioisotopes such as .sup.125I, enzymes such as .beta.-galactosidase, and fluorescent labels such as fluorescein. Antibodies of this invention can also beused to detect TRRE modulators in fixed tissue sections by immunohistology. The antibody is contacted with the tissue, unreacted antibody is washed away, and then bound antibody is detected--typically using a labeled anti-immunoglobulin reagent. Immunohistology will show not only whether the modulator is present, but where it is located in the tissue.
Detection of TRRE modulators is of interest for research purposes, and for clinical use. As indicated earlier, high expression of TRRE modulators may correlate with progression of cancer. Diagnostic tests are also of use in monitoring TRREmodulators that are administered in the course of therapy.
Antibodies of this invention can also be used for preparation of medicaments. Antibodies with therapeutic potential include those that affect TRRE activity--either by promoting clearance of a TRRE modulator, or by blocking its physiologicalaction. Antibodies can be screened for desirable activity according to assays described in the next section.
Screening Assays
This invention provides a number of screening methods for selecting and developing products that modulate TRRE, and thus affect TNF signal transduction.
One screening method is for polynucleotides that have an ability to modulate TRRE activity. To do this screening, cells are obtained that express both TRRE and the TNF receptor. Suitable cell lines can be constructed from any cell thatexpresses a level of functional TRRE activity. These cells are identifiable by testing culture supernatant for an ability to release membrane-bound TNF-R. The level of TRRE expression should be moderate, so that an increase in activity can be detected. The cells can then be genetically altered to express either p55 or p75 TNF-R, illustrated in Example 1. Exemplary is the C75R line: COS-1 cells genetically altered to express the 75 kDa form of the TNF-R. Release of TNF-R from the cell can be measuredeither by testing residual binding of labeled TNF ligand to the cell, or by immunoassay of the supernatant for released receptor (Example 1).
The screening assay is conducted by contacting the cells expressing TRRE and TNF-R with the polynucleotides to be screened. The effect of the polynucleotide on the enzymatic release of TNF-R from the cell is determined, and polynucleotides withdesirable activity (either promoting or inhibiting TRRE activity) are selected. In a variation of this method, cells expressing TRRE activity but not TNF-R (such as untransfected COS-1 cells) are contacted with the test polynucleotide. Then the culturemedium is collected, and used to assay for TRRE activity using a second cell expressing TNF-R (such as C75 cells).
This type of screening assay is useful for the selection of polynucleotides from an expression library believed to contain encoding sequences for TRRE modulators. The Jurkat cell expression library (ATCC Accession No. TIB-152) is exemplary. Other cells from which suitable libraries can be constructed are those known to express high levels of TRRE, especially after PMA stimulation, such as THP-1, U-937, HL-60, ME-180, MRC-5, Raji, K-562, and normal human monocytes. The screening involvesexpressing DNA from the library in the selected cell line being used for screening. Wells with the desired activity are selected, and the DNA is recovered, optionally after replication or cloning of the cells. Repeat cycles of functional screening andselection can lead to identification of new polynucleotide clones that promote or inhibit TRRE activity. This is illustrated below in Example 5. Further experiments can be performed on the selected polynucleotides to determine it modulates TRREactivity inside the cell, or through the action of a protein product. A long open reading frame suggests a role for a protein product, and examination of the amino acid sequence for a signal peptide and a membrane spanning region can help determinewhether the protein is secreted from the cell or expressed in the surface membrane.
This type of screening is also useful for further development of the polynucleotides of this invention. For example, expression constructs can be developed that encode functional peptide fragments, fusion proteins, and other variants. Theminimum size of polynucleotide sequence that still encodes TRRE modulation activity can be determined by removing part of the sequence and then using the screening assay to determine whether the activity is still present. Mutated and extended sequencescan be tested in the same way.
This type of screening assay is also useful for developing compounds that affect TRRE activity by interfering with mRNA that encode a TRRE modulator. Of particular interest are ribozymes and antisense oligonucleotides. Ribozymes areendoribonucleases that catalyze cleavage of RNA at a specific site. They comprise a polynucleotide sequence that is complementary to the cleavage site on the target, and additional sequence that provide the tertiary structure to effect the cleavage. Construction of ribozymes is described in U.S. Pat. Nos. 4,987,071 and 5,591,610. Antisense oligonucleotides that bind mRNA comprise a short sequence complementary to the mRNA (typically 8-25 bases in length). Preferred chemistry for constructingantisense oligonucleotides is outlined in an earlier section. Specificity is provided both by the complementary sequence, and by features of the chemical structure. Antisense molecules that inhibit expression of cell surface receptors are described inU.S. Pat. Nos. 5,135,917 and 5,789,573. Screening involves contacting the cell expressing TRRE activity and TNF-R with the compound and determining the effect on receptor release. Ribozymes and antisense molecules effective in altering expression ofa TRRE promoter would decrease TNF-R release. Ribozymes and antisense molecules effective in altering expression of a TRRE inhibitor would increase TNF-R release.
Another screening method described in this disclosure is for testing the ability of polypeptides to modulate TRRE activity (Example 7). Cells expressing both TNF-R and a moderate level of TRRE activity are contacted with the test polypeptides,and the rate of receptor release is compared with the rate of spontaneous release. An increased rate of release indicates that the polypeptide is a TRRE promoter, while a decreased rate indicates that the polypeptide is a TRRE inhibitor. This assay canbe used to test the activity of new polypeptides, and develop variants of polypeptides already known to modulate TRRE. The minimum size of polypeptide sequence that still encodes TRRE modulation activity can be determined by making a smaller fragment ofthe polypeptide and then using the screening assay to determine whether the activity is still present. Mutated and extended sequences can be tested in the same way.
Another screening method embodied in this invention is a method for screening substances that interfere with the action of a TRRE modulator at the protein level. The method involves incubating cells expressing TNF receptor (such as C75R cells)with a polypeptide of this invention having TNF promoting activity. There are two options for supplying the TRRE modulator in this assay. In one option, the polypeptide is added to the medium of the cells as a reagent, along with the substance to betested. In another option, the cells are genetically altered to express the TRRE modulator at a high level, and the assay requires only that the test substance be contacted with the cells. This option allows for high throughput screening of a number oftest compounds.
Either way, the rate of receptor release is compared in the presence and absence of the test substance, to identify compounds that enhance or diminish TRRE activity. Parallel experiments should be conducted in which the activity of the substanceon receptor shedding is tested in the absence of added polypeptide (using cells that don't express the polypeptide). This will determine whether the activity of the test substance occurs via an effect on the TRRE promoter being added, or through someother mechanism.
This type of screening assay is useful for identifying antibodies that affect the activity of a TRRE modulator. Antibodies are raised against a TRRE modulator as described in the previous section. If the antibody decreases TRRE activity in thescreening assay, then it has therapeutic potential to lower TRRE activity in vivo. Screening of monoclonal antibodies using this assay can also help identify binding or catalytic sites in the polypeptide.
This type of screening assay is also useful for high throughput screening of small molecule compounds that have the ability to affect the level of TNF receptors' on a cell, by way of its influence on a TRRE modulator. Small molecule compoundsthat have the desired activity are often preferred for pharmaceutical compositions, because they are often more stable and less expensive to produce.
Medicaments and their Use
As described earlier, a utility of certain products embodied in this invention is to affect signal transduction from cytokines (particularly TNF). Products that promote TRRE activity have the effect of decreasing TNF receptors on the surface ofcells, which would decrease signal transduction from TNF. Conversely, products that inhibit TRRE activity prevent cleavage of TNF receptors, increasing signal transduction.
The ability to affect TNF signal transduction is of considerable interest in the management of clinical conditions in which TNF signaling contributes to the pathology of the condition. Such conditions include: Heart failure. IL-1.beta. and TNFare believed to be central mediators for perpetuating the inflammatory process, recruiting and activating inflammatory cells. The inflammation depress cardiac function in congestive heart failure, transplant rejection, myocarditis, sepsis, and burnshock. Cachexia. The general weight loss and wasting occurring in the course of chronic diseases, such as cancer. TNF is believed to affect appetite, energy expenditure, and metabolic rate. Crohn's disease. The inflammatory process mediated by TNFleads to thickening of the intestinal wall, ensuing from lymphedema and lymphocytic infiltration. Endotoxic shock. The shock induced by release of endotoxins from gram-negative bacteria, such as E. coli, involves TNF-mediated inflammation Arthritis. TNF promotes expression of nitric oxide synthetase, believed to be involved in disease pathogenesis. Other conditions of interest are multiple sclerosis, sepsis, inflammation brought on by microbe infection, and diseases that have an autoimmuneetiology, such as Type I Diabetes.
Polypeptides of this invention that promote TRRE activity can be administered with the objective of decreasing or normalizing TNF signal transduction. For example, in congestive heart failure or Crohn's disease, the polypeptide is given atregular intervals to lessen the inflammatory sequelae. The treatment is optionally in combination with other agents that affect TNF signal transduction (such as antibodies to TNF or receptor antagonists) or that lessen the extent of inflammation inother ways.
Polynucleotides of this invention can also be used to promote TRRE activity by gene therapy. The encoding sequence is operably linked to control elements for transcription and translation in human cells. It is then provided in a form that willpromote entry and expression of the encoding sequence in cells at the disease site. Forms suitable for local injection include naked DNA, polynucleotides packaged with cationic lipids, and polynucleotides in the form of viral vectors (such as adenovirusand AAV constructs). Methods of gene therapy known to the practitioner skilled in the art will include those outlined in U.S. Pat. Nos. 5,399,346, 5,827,703, and 5,866,696.
The ability to affect TNF signal transduction is also of interest where TNF is thought to play a beneficial role in resolving the disease. In particular, TNF plays a beneficial role in the necrotizing of solid tumors. Accordingly, products ofthis invention can be administered to cancer patients to inhibit TRRE activity, thereby increasing TNF signal transduction and improve the beneficial effect.
Embodiments of the invention that inhibit TRRE activity include antisense polynucleotides. A method of conferring long-standing inhibitory activity is to administer antisense gene therapy. A genetic construct is designed that will express RNAinside the cell which in turn will decrease the transcription of the target gene (U.S. Pat. No. 5,759,829). In humans, a more frequent form of antisense therapy is to administer the effector antisense molecule directly, in the form of a short stablepolynucleotide fragment that is complementary to a segment of the target mRNA (U.S. Pat. Nos. 5,135,917 and 5,789,573)--in this case, the transcript that encodes the TRRE modulator. Another embodiment of the invention that inhibits TRRE areribozymes, constructed as described in an earlier section. The function of ribozymes in inhibiting mRNA translation is described in U.S. Pat. Nos. 4,987,071 and 5,591,610.
Once a product of this invention is found to have suitable TRRE modulation activity in the in vitro assays described in this disclosure, it is preferable to also test its effectiveness in an animal model of a TNF mediated disease process. Example 3 describes an LPS model for sepsis that can be used to test promoters of TRRE activity. Example 4 describes a tumor necrosis model, in which TRRE inhibitors could be tested for an ability to enhance necrotizing activity. Those skilled in theart will know of other animal models suitable for testing effects on TNF signal transduction or inflammation. Other illustrations are the cardiac ischemia reperfusion models of Weyrich et al. (J. Clin. Invest. 91:2620, 1993) and Garcia-Criado et al.(J. Am. Coll. Surg. 181:327, 1995); the pulmonary ischemia reperfusion model of Steinberg et al. (J. Heart Lung Transplant. 13:306, 1994), the lung inflammation model of International Patent Application WO 9635418; the bacterial peritonitis model ofSharar et al. (J. Immunol. 151:4982, 1993), the colitis model of Meenan et al. (Scand. J. Gastroenterol. 31:786, 1996), and the diabetes model of von Herrath et al. (J. Clin. Invest 98:1324, 1996). Models for septic shock are described in Mack et al.J. Surg. Res. 69:399, 1997; and Seljelid et al. Scand. J. Immunol. 45:683-7.
For use as an active ingredient in a pharmaceutical preparation, a polypeptide, polynucleotide, or antibody of this invention is generally purified away from other reactive or potentially immunogenic components present in the mixture in whichthey are prepared. Typically, each active ingredient is provided in at least about 90% homogeneity, and more preferably 95% or 99% homogeneity, as determined by functional assay, chromatography, or SDS polyacrylamide gel electrophoresis. The activeingredient is then compounded into a medicament in accordance with generally accepted procedures for the preparation of pharmaceutical preparations, such as described in Remington's Pharmaceutical Sciences 18th Edition (1990), E. W. Martin ed., MackPublishing Co., Pennsylvania. Steps in the compounding of the medicament depend in part on the intended use and mode of administration, and may include sterilizing, mixing with appropriate non-toxic and non-interfering excipients and carriers, dividinginto dose units, and enclosing in a delivery device. The medicament will typically be packaged with information about its intended use.
Mode of administration will depend on the nature of the condition being treated. For conditions that are expected to require moderate dosing and that are at well perfused sites (such as cardiac failure), systemic administration is acceptable. For example, the medicament may be formulated for intravenous administration, intramuscular injection, or absorption sublingually or intranasally. Where it is possible to administer the active ingredient locally, this is usually preferred. Localadministration will both enhance the concentration of the active ingredient at the disease site, and minimize effects on TNF receptors on other tissues not involved in the disease process. Conditions that lend themselves to administration directly atthe disease site include cancer and rheumatoid arthritis. Solid tumors can be injected directly when close to the skin, or when they can be reached by an endoscopic procedure. Active ingredients can also be administered to a tumor site during surgicalresection, being implanted in a gelatinous matrix or in a suitable membrane such as Gliadel.RTM. (Guilford Sciences). Where direct administration is not possible, the administration may be given through an arteriole leading to the disease site. Alternatively, the pharmaceutical composition may be formulated to enhance accumulation of the active ingredient at the disease site. For example, the active ingredient can be encapsulated in a liposome or other matrix structure that displays anantibody or ligand capable of binding a cell surface protein on the target cell. Suitable targeting agents include antibodies against cancer antigens, ligands for tissue-specific receptors (e.g., serotonin for pulmonary targeting). For compositionsthat decrease TNF signal transduction, an appropriate targeting molecule may be the TNF ligand, since the target tissue may likely display an unusually high density of the TNF receptor.
Effective amounts of the compositions of the present invention are those that alter TRRE activity by at least about 10%, typically by at least about 25%, more preferably by about 50% or 75%. Where near complete ablation of TRRE activity isdesirable, preferred compositions decrease TRRE activity by at least 90%. Where increase of TRRE activity is desirable, preferred compositions increase TRRE activity by at least 2-fold. A minimum effective amount of the active compound will depend onthe disease being treated, which of the TRRE modulators is selected for use, and whether the administration will be systemic or local. For systemic administration, an effective amount of activity will generally be an amount of the TRRE modulator thatcan cause a change in the enzyme activity by 100 to 50,000 Units--typically about 10,000 Units. The mass amount of protein, nucleic acid, or antibody is chosen accordingly, based on the specific activity of the active compound in Units per gram.
The following examples provided as a further guide to the practitioner, and are not intended to limit the invention in any way.
EXAMPLES
Example 1
Assay System for TRRE Activity
This Example illustrates an assay system that measures TRRE activity on the human TNF-R in its native conformation in the cell surface membrane.
Membrane-associated TNF-R was chosen as the substrate, as having microenvironment similar to that of the substrate for TRRE in vivo. Membrane-associated TNF-R also requires more specific activity, which would differentiate less-specificproteases. Cells expressing an elevated level of the p75 form of TNF-R were constructed by cDNA transfection into monkey COS-1 cells which express little TNF-R of either the 75 kDa or 55 kDa size.
The procedure for constructing these cells was as follows: cDNA of human p75 TNF-R was cloned from a .lamda.gt10 cDNA library derived from human monocytic U-937 cells (Clontech Laboratories, Palo Alto, Calif.). The first 300 bp on both 5' and 3'ends of the cloned fragment was sequenced and compared to the reported cDNA sequence of human p75 TNF-R. The cloned sequence was a 2.3 kb fragment covering positions 58-2380 of the reported p75 TNF-R sequence, which encompasses the full length of the p75TNF-R-coding sequence from positions 90-1475. The 2.3 kb p75 TNF-R cDNA was then subcloned into the multiple cloning site of the pcDNA3 eukaryotic expression vector. The orientation of the p75 TNF-R cDNA was verified by restriction endonucleasemapping.
FIG. 1 illustrates the final 7.7 kb construct, pCDTR2. It carries the neomycin-resistance gene for the selection of transfected cells in G418, and the expression of the p75 TNF-R is driven by the cytomegalovirus promoter. The pCDTR2 was thentransfected into monkey kidney COS-1 cells (ATCC CRL-1650) using the calcium phosphate-DNA precipitation method. The selected clone in G418 medium was identified and subcultured. This clone was given the designation C75R.
To determine the level of p75 TNF-R expression on C75R cells, 2.times.10.sup.5 cells/well were plated into a 24-well culture plate and incubated for 12 to 16 hours in 5% CO.sub.2 at 37.degree. C. They were then incubated with 2-30 ng .sup.125Ihuman recombinant TNF (radiolabeled using the chloramine T method) in the presence or absence of 100-fold excess of unlabeled human TNF at 4.degree. C. for 2 h. After three washes with ice-cold PBS, cells were lysed with 0.1N NaOH and boundradioactivity was determined in a Pharmacia Clinigamma counter (Uppsala, Sweden).
FIG. 2 shows the results obtained. C75R had a very high level of specific binding of radiolabeled .sup.125I-TNF, while parental COS-1 cells did not. The number of TNF-R expressed on C75R was determined to be 60,000-70,000 receptors per cell byScatchard analysis (FIG. 2, inset). The Kd value calculated was 5.6.times.10.sup.-10 M. This Kd value was in close agreement to the values previously reported for native p75 TNF-R.
TRRE was obtained by PHA stimulation of THP-1 cells (WO 9802140). THP-1 cells (ATCC 45503) growing in logarithmic phase were collected and resuspended to 1.times.10.sup.6 cells/ml of RPMI-1640 supplemented with 1% FCS and incubated with10.sup.-6 M PMA for 30 min in 5% CO.sub.2 at 37.degree. C. The cells were collected and washed once with serum-free medium to remove PMA and resuspended in the same volume of RPMI-1640 with 1% FCS. After 2 hours incubation in 5% CO.sub.2 at 37.degree. C., the cell suspension was collected, centrifuged, and the cell-free supernatant was collected as the source of TRRE.
In order to measure the effect of TRRE on membrane-bound TNF-R in the COS-1 cell constructs, the following experiment was performed. C75R cells were seeded at a density of 2.times.10.sup.5 cells/well in a 24-well cell culture plate and incubatedfor 12 to 16 hours at 37.degree. C. in 5% CO.sub.2. The medium in the wells was aspirated, replaced with fresh medium alone or with TRRE medium, and incubated for 30 min at 37.degree. C., The medium was then replaced with fresh medium containing 30ng/ml .sup.125I-labeled TNF. After 2 hours at 4.degree. C., the cells were lysed with 0.1 N NaOH and the level of bound radioactivity was measured. The level of specific binding of C75R by .sup.125I-TNF was significantly decreased after incubationwith TRRE. The radioactive count was 1,393 cpm on the cells incubated with TRRE compared to 10,567 cpm on the cells not treated with TRRE, a loss of 87% of binding capacity.
In order to determine the size of the p75 TNF-R cleared from C75R by TRRE, the following experiment was performed. 15.times.10.sup.6 C75R cells were seeded in a 150 mm cell culture plate and incubated at 37.degree. C. in 5% CO.sub.2 for 12 to16 hours. TRRE medium was incubated with C75R cells in the 150 mm plate for 30 min and the resulting supernatant was collected and centrifuged. The concentrated sample was applied to 10% acrylamide SDS-PAGE and electrophoretically transferred to apolyvinylidene difluoride membrane (Immobilon). Immunostaining resulted in a single band of 40 kDa, similar to the size found in biological fluids. Thus, transfected COS-1 cells expressed high levels of human p75 TNF-R in a form similar to nativeTNF-R.
The following assay method was adopted for routine measurement of TRRE activity. C75R cells and COS-1 cells were seeded into 24-well culture plates at a density of 2.5.times.10.sup.5 cells/ml/well and incubated overnight (for 12 to 16 hours) in5% CO.sub.2 at 37.degree. C. After aspirating the medium in the well, 300 .mu.l of TRRE medium was incubated in each well of both the C75R and COS-1 plates for 30 min in 5% CO.sub.2 at 37.degree. C. (corresponding to A and C mentioned below,respectively). Simultaneously, C75R cells in 24-well plates were also incubated with 300 .mu.l of fresh medium or buffer. The supernatants were collected, centrifuged, and then assayed for the concentration of soluble p75 TNF-R by ELISA.
ELISA assay for released TNF-R (WO 9802140) was performed as follows: Polyclonal antibodies to human p75 TNF-R were generated by immunization of New Zealand white female rabbits (Yamamoto et al. Cell. Immunol. 38:403-416, 1978). The IgGfraction of the immunized rabbit serum was purified using a protein G (Pharmacia Fine Chemicals, Uppsala, Sweden) affinity column (Ey et al. (1978) Immunochemistry 15:429-436, 1978). The IgG fraction was then labeled with horseradish peroxidase (SigmaChemical Co., St. Louis, Mo.) (Tijssen and Kurstok, Anal. Biochem. 136:451-457, 1984). In the first step of the assay, 5 .mu.g of unlabeled IgG in 100 .mu.l of 0.05 M carbonate buffer (pH 9.6) was bound to a 96-well ELISA microplate (Corning, Corning,N.Y.) by overnight incubation at 4.degree. C. Individual wells were washed three times with 300 .mu.l of 0.2% Tween-20 in phosphate buffered saline (PBS). The 100 .mu.l of samples and recombinant receptor standards were added to each well and incubatedat 37.degree. C. for 1 to 2 hours. The wells were then washed in the same manner, 100 .mu.l of horseradish peroxidase-labeled IgG added and incubated for 1 hour at 37.degree. C. The wells were washed once more and the color was developed for 20minutes (min) at room temperature with the substrates ABTS (Pierce, Rockford, Ill.) and 30% H.sub.2O.sub.2 (Fisher Scientific, Fair Lawn, N.J.). Color development was measured at 405 nm.
When C75R cells were incubated with TRRE medium, soluble p75 TNF-R was released into the supernatant which was measurable by ELISA. The amount of receptors released corresponded to the amount of TRRE added There was also a level of spontaneousTNF-R release in C75R cells incubated with just medium alone. It is hypothesized that this is due to an endogenous source of proteolytic enzyme, a homolog of the human TRRE of monkey origin.
The following calculations were performed. A=(amount of soluble p75 TNF-R in a C75R plate treated with the TRRE containing sample); i.e. the total amount of sTNF-R in a C75R plate. B=(amount of soluble p75 TNF-R spontaneously released in a C75Rplate treated with only medium or buffer containing the same reagent as the corresponding samples but without exogenous TRRE); i.e. the spontaneous release of sTNF-R from C75R cells. C=(amount of soluble p75 TNF-R in a COS-1 plate treated with the TRREsample or the background level of soluble p75 TNF-R released by THP-1.); i.e. the degraded value of transferred (pre-existing) sTNF-R in the TRRE sample during 30 min incubation in a COS-1 plate. This corresponds to the background level of sTNF-Rdegraded in a C75R plate. The net release of soluble p75 TNF-R produced only by TRRE activity existing in the initial sample is calculated as follows: (Net release of soluble p75 TNF-R only by TRRE)=A-B-C.
Unit activity of TRRE was defined as follows: 1 pg of soluble p75 TNF-R net release (A-B-C) in the course of the assay is one unit (U) of TRRE activity.
Using this assay, the time course of receptor shedding by TRRE was measured in the following experiment. TRRE-medium was incubated with C75R and COS-1 cells for varying lengths of time. The supernatants were then collected and assayed for thelevel of soluble p75 TNF-R by ELISA and the net TRRE activity was calculated. Detectable levels of soluble receptor were released by TRRE within 5 min and increased up to 30 min. Longer incubation times showed that the level of TRRE remained relativelyconstant after 30 min, presumably from the depletion of substrates. Therefore, 30 min was determined to be the optimal incubation time.
The induction patterns of TRRE and known MMPs by PMA stimulation are quite different. In order to induce MMPs, monocytic U-937 cells, fibrosarcoma HT-1080 cells, or peritoneal exudate macrophages (PEM) usually have to be stimulated for one tothree days with LPS or PMA. On the other hand, as compared with this prolonged induction, TRRE is released very quickly in culture supernatant following 30 min of PMA-stimulation. The hypothesis that TRRE and sTNF-R form a complex in vitro wasconfirmed by the experiment that 25% TRRE activity was recovered from soluble p75 TNF-R affinity column. This means that free TRRE has the ability to bind to its catalytic product, sTNF-R. The remaining 75% which did not combine to the affinity columnmay already be bound to sTNF-R or may not have enough affinity to bind to sTNF-R even though it is in a free form.
Example 2
Characterization of TRRE Obtained from THP-1 Cells
TRRE obtained by PHA stimulation of THP-1 cells was partially purified from the culture medium (WO 9802140). First, protein from the medium was concentrated by 100% saturated ammonium sulfate precipitation at 4.degree. C. The precipitate waspelleted by centrifugation at 10,000.times.g for 30 min and resuspended in PBS in approximately twice the volume of the pellet. This solution was then dialyzed at 4.degree. C. against 10 mM Tris-HCl, 60 mM NaCl, pH 7.0. This sample was loaded on ananion-exchange chromatography, Diethylaminoethyl (DEAE)-Sephadex A-25 column (Pharmacia Biotech) (2.5.times.10 cm) previously equilibrated with 50 mM Tris-HCl, 60 mM NaCl, pH 8.0. TRRE was then eluted with an ionic strength linear gradient of 60 to 250mM NaCl, 50 mM Tris-HCl, pH 8.0. Each fraction was measured for absorbance at 280 nm and assayed for TRRE activity. The DEAE fraction with the highest specific activity (the highest value of TRRE units/A280) was pooled and used in the characterizationsof TRRE described in this example.
In the next experiment, the substrate specificity of the enzyme was elucidated using immunohistochemical techniques. Fluorescein isothiocyanate (FITC)-conjugated anti-CD54, FITC-conjugated goat anti-rabbit and mouse antibodies, mouse monoclonalanti-CD30, anti-CD11b and anti-IL-1R (Serotec, Washington D.C.) were used. Rabbit polyclonal anti-p55 and p75 TNF-R were obtained according to Yamamoto et al. (1978) Cell Immunol. 38:403-416. THP-1 cells were treated for 30 min with 1,000 and/or 5,000U/ml of TRRE eluted from the DEAE-Sephadex column, and then transferred to 12.times.75 mm polystyrene tubes (Fischer Scientific, Pittsburgh, Pa.) at 1.times.10.sup.5 cells/100 .mu.l/tube. The cells were then pelleted by centrifugation at 350.times.g for5 min at 4.degree. C. and stained directly with 10 .mu.l FITC-conjugated anti-CD54 (diluted in cold PBS/0.5% sodium aside), indirectly with FITC-conjugated anti-mouse antibody after treatment of mouse monoclonal anti-CD11b, IL-1R and CD30 and alsoindirectly with FITC-conjugated anti-rabbit antibody after treatment of rabbit polyclonal anti-p55 and p75 TNF-R.
THP-1 cells stained with each of the antibodies without treatment of TRRE were used as negative controls. The tubes were incubated for 45 min at 4.degree. C., agitated every 15 min, washed twice with PBS/2% FCS, repelleted and then resuspendedin 200 .mu.l of 1% paraformaldehyde. These labeled THP-1 cells were analyzed using a fluorescence activated cell sorter (FACS) (Becton-Dickinson, San Jose, Calif.) with a 15 mW argon laser with an excitation of 488 nm. Fluorescent signals were gated onthe basis of forward and right angle light scattering to eliminate dead cells and aggregates from analysis. Gated signals (10.sup.4) were detected at 585 BP filter and analyzed using Lysis II software. Values were expressed as percentage of positivecells, which was calculated by dividing mean channel fluorescence intensity (MFI) of stained THP-1 cells treated with TRRE by the MFI of the cells without TRRE treatment (negative control cells).
To test the in vitro TNF cytolytic assay by TRRE treatment the L929 cytolytic assay was performed according to the method described by Gatanaga et al. (1990b). Briefly, L929 cells, an adherent murine fibroblast cell line, were plated (70,000cells/0.1 ml/well in a 96-well plate) overnight. Monolayered L929 cells were pretreated for 30 min with 100, 500 or 2,500 U/ml of partially-purified TRRE and then exposed to serial dilutions of recombinant human TNF for 1 hour. After washing the platewith RPMI-1640 with 10% FCS to remove the TRRE and TNF, the cells were incubated for 18 hours in RPMI-1640 with 10% FCS containing 1 .mu.g/ml actinomycin D at 37.degree. C. in 5% CO.sub.2. Culture supernatants were then aspirated and 50 .mu.l of 1%crystal violet solution was added to each well. The plates were incubated for 15 min at room temperature. After the plates were washed with tap water and air-dried, the cells stained with crystal violet were lysed by 100 .mu.L per well of 100 mM HCl inmethanol. The absorbance at 550 nm was measured using an EAR 400 AT plate reader (SLT-Labinstruments, Salzburg, Austria).
To investigate whether TRRE also truncates the .about.55 kDa size of TNF-R, partially-purified TRRE was applied to THP-1 cells which express low levels of both p55 and p75 TNF-R (approximately 1,500 receptors/cell by Scatchard analysis). TRREeluate from the DEAE-Sephadex column was added to THP-1 cells (5.times.10.sup.6 cells/ml) at a final TRRE concentration of 1,000 U/ml for 30 min. The concentration of soluble p55 and p75 TNF-R in that supernatant was measured by soluble p55 and p75 TNF-RELISA. TRRE was found to truncate both human p55 and p75 TNF-R on THP-1 cells and released 2,382 and 1,662 pg/ml soluble p55 and p75 TNF-R, respectively.
Therefore, TRRE obtained by PHA stimulation of THP-1 cells is capable of enzymatically cleaving and releasing human p75 TNF-R on C75R cells, and both human p55 and p75 TNF-R on THP-1 cells.
Partial inhibition of TRRE activity was obtained by chelating agents such as 1,10-phenanthroline, EDTA and EGTA (% TRRE activity remaining were 41%, 67% and 73%, respectively, at 2 mM concentration). On the other hand, serine protease inhibitorssuch as PMSF, AEBSF and 3,4-DCI, and serine and cysteine protease inhibitors such as TLCK and TPCK had no effect on the inhibition of TRRE. TRRE was slightly activated in the presence of Mn.sup.2+, Ca.sup.2+, Mg.sup.2+, and Co.sup.2+ (% TRRE activitiesremaining were 157%, 151%, 127%, and 123%, respectively), whereas partial inhibition occurred in the presence of Zn.sup.2+ and Cu.sup.2+ (% TRRE activities remaining were 23% and 47%, respectively) (WO 9802140).
TRRE fractions from the most active DEAE fraction (60 mM to 250 mM NaCl) can be purified further. In one method (WO 9802140), the fractions were concentrated to 500 .mu.L with a Centriprep-10 filter (10,000 MW cut-off membrane) (Amicon). Thisconcentrated sample was applied to 6% PAGE under non-denaturing native conditions. The gel was sliced horizontally into 5 mm strips and each was eluted into 1 ml PBS. The eluates were then tested according to the assay (Example 1) for TRRE activity.
Example 3
TRRE Activity Alleviates Septic Shock
The following protocol was used to test the effects of TRRE in preventing mortality in a model for septic shock. Mice were injected with lethal or sublethal levels of LPS, and then with a control buffer or TRRE. Samples of peripheral blood werethen collected at intervals to establish if TRRE blocked TNF-induced production of other cytokines in the bloodstream. Animals were assessed for the ability of TRRE to block the clinical effects of shock, and then euthanized and tissues examined byhistopathological methods.
Details were as follows: adult Balb/c mice, were placed in a restraining device and injected intravenously via the tail vein with a 0.1 ml solution containing 10 ng to 10 mg of LPS in phosphate buffer saline (PBS). These levels of LPS inducemild to lethal levels of shock in this strain of mice. Shock results from changes in vascular permeability, fluid loss, and dehydration, and is often accompanied by symptoms including lethargy, a hunched, stationary position, rumpled fur, cessation ofeating, cyanosis, and, in serious cases, death within 12 to 24 hours. Control mice received an injection of PBS. Different amounts (2,000 or 4,000 U) of purified human TRRE were injected IV in a 0.1 ml volume within an hour prior to or after LPSinjection. Serum (0.1 ml) was collected with a 27 gauge needle and 1 ml syringe IV from the tail vein at 30, 60 and 90 minutes after LPS injection. This serum was heparinized and stored frozen at -20.degree. C. Samples from multiple experiments weretested by ELISA for the presence of sTNF-R, TNF, IL-8 and IL-6. Animals were monitored over the next 12 hours for the clinical effects of shock. Selected animals were euthanized at periods from 3 to 12 hours after treatment, autopsied and variousorgans and tissues fixed in formalin, imbedded in paraffin, sectioned and stained by hematoxalin-eosin (H and E). Tissue sections were subjected to histopathologic and immunopathologic examination.
FIG. 3 shows the results obtained. (.diamond-solid.) LPS alone; (.box-solid.) LPS plus control buffer; (.circle-solid.) LPS plus TRRE (2,000 U); (.tangle-solidup.) LPS plus TRRE (4,000 U).
Mice injected with LPS alone or LPS and a control buffer died shortly after injection. 50% of the test animals were dead after 8 hours (LPS) or 9 hours (LPS plus control buffer), and 100% of the animals were dead at 15 hours. In contrast,animals treated with TRRE obtained as described in Example 1 did much better. When injections of LPS were accompanied by injections of a 2,000 U of TRRE, death was delayed and death rates were lower. Only 40% of the animals were dead at 24 hours. When4,000 U of TRRE was injected along with LPS, all of the animals had survived at 24 hours. Thus, TRRE is able to counteract the mortality induced by LPS in test animals.
Example 4
TRRE Activity Decreases Tumor Necrotizing Activity
The following protocol was followed to test the effects of TRRE on tumor necrosis in test animals in which tumors were produced, and in which TNF was subsequently injected.
On Day 0, cutaneous Meth A tumors were produced on the abdominal wall of fifteen BALB/c mice by intradermal injection of 2.times.20.sup.5 Meth A tumor cells. On Day 7, the mice were divided into three groups of five mice each and treated asfollows: Group 1: Injected intravenously with TNF (1 .mu.g/mouse). Group 2: Injected intravenously with TNF (1 .mu.g/mouse) and injected intratumorally with TRRE obtained as in Example 1 (400 units/mouse, 6, 12 hours after TNF injection). Group 3:Injected intravenously with TNF (1 .mu.g/mouse) and injected intratumorally with control medium (6, 12 hours after TNF injection).
On Day 8, tumor necrosis was measured with the following results: Group 1:100% of necrosis ( 5/5); Group 2: 20% (1/5); Group 3: 80% (4/5). Injections of TRRE greatly reduced the ability of TNF to induce necrosis in Meth A tumors in BALB/c mice.
Since adding TRRE activity ablates the beneficial necrotizing activity of TNF, blocking endogenous TRRE activity would promote the beneficial effects of TNF.
Example 5
Nine New Polynucleotide Clones that Affect TRRE Activity
A number of cells have been found to express high levels of TRRE activity, especially after PMA stimulation. These include the cell lines designated THP-1, U-937, HL-60, ME-180, MRC-5, Raji, K-562. Jurkat cells have a high TRRE activity (850TRRE U/mL at 10.sup.-2 PMA). In this experiment, the expression library of the Jurkat T cell (ATCC #TIB-152) was obtained and used to obtain 9 polynucleotide clones that augment TRRE activity.
Selection of expression sequences in the library was done by repeated cycles of transfection into COS-1 cells, followed by assaying of the supernatant as in Example 1 for the presence of activity cleaving and releasing the TNF receptor. Standardtechniques were used in the genetic manipulation. Briefly, the DNA of 10.sup.6 Jurkat cells was extracted using an InVitrogen plasmid extraction kit according to manufacturer's directions. cDNA was inserted in the ZAP Express.TM./EcoR/vector (cat. no.938201, Stratagene, La Jolla Calif. The library was divided into 48 groups of DNA and transformed into COS-1 cells using the CaCl transfection method. Once the cells were grown out, the TRRE assay was performed, and five positive groups were selected. DNA from each of these five groups was obtained, and transfected into E coli, with 15 plates per group. DNA was prepared from these cells and then transfected into COS-1 cells once more. The cells were grown out, and TRRE activity was tested again. Two positive groups were selected and transfected into E. coli, yielding 98 colonies. DNA was prepared from 96 of these colonies and transfected into COS-1 cells. The TRRE activity was performed again, and nine clones were found to substantiallyincrease TRRE activity in the assay. These clones were designated 2-8, 2-9, 2-14, 2-15, P2-2, P2-10, P2-13, P2-14, and P2-15.
FIG. 4 is a bar graph showing the TRRE activity observed when the 9 clones were tested with C75 cells in the standard assay (Example 1).
These nine clones were then sequenced according to the following procedure: 1. Plasmid DNA was prepared using a modified alkaline lysis procedure. 2. DNA sequencing was performed using DyeDeoxy termination reactions (ABI). Base-specificfluorescent dyes were used as labels. 3. Sequencing reactions were analyzed on 5.75% Long Ranger.TM. gels by an ABI 373A-S or on 5.0% Long Ranger.TM. gels by an ABI 377 automated sequencer. 4. Subsequent data analysis was performed usingSequencher.TM. 3.0 software. Standard primers T7X, T3X, -40, -48 Reverse, and BK Reverse (BKR) were used in sequencing reactions. For each clone, several additional internal sequencing primers (listed below) were synthesized.
NCBI BLAST (Basic Local Alignment Search Tool) sequence analysis (Altschul et al. (1990) J. Mol. Biol. 215:403-410) was performed to determine if other sequences were significantly similar to these sequences. Both the DNA sequences of theclones and the corresponding ORFs (if any) were compared to sequences available in databases.
The following clones were obtained and sequenced:
TABLE-US-00001 TABLE 1 DNA sequences affecting TRRE activity Related Approx. sequences Sequence SEQ ID Length Expression (potential Clone Designation NO: (bp) Designation homology) 2-9 AIM2 1 4,047 -- 2-8 AIM3T3 2 739 M. musculus 45S (partialpre-rRNA sequence) gene AIM3T7 3 233 (partial sequence) 2-14 AIM4 4 2,998 Mey3 human arfaptin 2 and others (see below) 2-15 AIM5 5 4,152 -- P2-2 AIM6 6 3,117 Mey5 -- P2-10 AIM7 7 3,306 Mey6 Human Insulin- like Growth factor II Receptor P1-13 AIM8 8 4,218-- P2-14 AIM9 9 1,187 Mey8 -- P2-15 AIM10 10 3,306 E1b-55kDa- associated protein
Clone 2-9 (AIM2): The internal primers used for sequencing are shown in SEQ. ID NOS:11-38. The sequence of AIM2 is presented in SEQ ID NO:1. The complementary strand of the AIM2 sequence is SEQ ID NO:147. The longest open reading frame (ORF)in the AIM2 sequence is 474 AA long and represented in SEQ ID NO:148.
Clone 2-8 (AIM3): Two partial sequences of length 739 and 233 were obtained and designated AIM3T3 and AIM3T7. The internal primers used for sequencing are shown in SEQ. ID NOS:39-46. The sequences of AIM3T3 and AIM3T7 are presented in SEQ IDNOs:2 and 3, respectively. The BLAST search revealed that the AIM3T3 sequence may be homologous to the mouse (M. musculus) 28S ribosomal RNA (Hassouna et al. Nucleic Acids Res. 12:3563-3583, 1984) and the M. musculus 45S pre-rRNA genes (Accession No.X82564. The complementary sequence of the AIM3T3 sequence showed 99% similarity over 408 bp beginning with nt 221 of SEQ ID NO:2 to the former and 97% similarity over the same span to the latter.
Clone 2-14 (AIM4). The internal primers used for sequencing are shown in SEQ. ID NOS:14-65. The sequence of AIM4 is presented in SEQ ID NO:4. The complementary strand of the AIM4 sequence is SEQ ID NO:149. The longest ORF in the AIM4sequence is 236 AA long and represented in SEQ ID NO:150. AIM4 has significant alignments to human sequences arfaptin 2, ADE2H1 mRNA showing homologies to SAICAR synthetase, polypyrimidine tract binding protein (heterogeneous nuclear ribonucleoprotein1) mRNA, several PTB genes for polypirimidine tract binding proteins, mRNA for por1 protein. Human arfaptin 2 is a putative target protein of ADP-ribosylation factor that interacts with RAC1 by binding directly to it. RAC1 is involved in membraneruffling. Arfaptin 2 has possible transmembrane segments, potential CK2 phosphorylation sites, PKC phosphorylation site and RGD cell attachment sequence.
Clone 2-15 (AIM5): The internal primers used for sequencing are shown in SEQ. ID NOS:66-80. The sequence of AIM5 is presented in SEQ ID NO:5. The BLAST search revealed that the AIM5 sequence displays some similarity to Human Initiation Factor5A (eIF-5A) Koettnitz et al. (1995) Gene 159:283-284, 1995 and Human Initiation Factor 4D (eIF 4D) Smit-McBride et al. (1989) J. Biol. Chem. 264:1578-1583,1989.
Clone P2-2 (AIM6): The internal primers used for sequencing are shown in SEQ. ID NOS:81-93. The sequence of AIM6 is presented in SEQ ID NO:6. The longest ORF in the AIM6 sequence is 1038 AA long and represented in SEQ ID NO:151.
Clone P2-10 (AIM7): The internal primers used for sequencing are shown in SEQ. ID NOS:94-106. The sequence of AIM7 is presented as SEQ ID NO:7. The longest ORF in the AIM7 sequence is 849 M long and represented in SEQ ID NO:152. The BLASTsearch revealed that this clone may be related to the Human Insulin-like Growth Factor II Receptor (Morgan et al. Nature 329:301-307, 1987 or the Human Cation-independent Mannose 6-Phosphate Receptor mRNA (Oshima et al. J. Biol. Chem. 263:2553-2562,1988). The AIM7 sequence showed roughly 99% identity to both sequences over 2520 nucleotides beginning with nt 12 of SEQ ID NO:7 and 99% similarity to the latter over the same span.
Clone P2-13 (AIM8): The internal primers used for sequencing are shown in SEQ. ID NOS:107-118. The sequence of AIM8 is presented as SEQ ID NO:8. The longest ORF in the AIM8 sequence is 852 AA long and represented in SEQ ID NO:153.
Clone P2-14 (AIM9): The internal primers used for sequencing are shown in SEQ. ID NOS:119-124. The sequence of AIM9 is presented as SEQ ID NO:9. The longest ORF was about 149 amino acids in length.
Clone P2-15 (AIM10): The internal primers used for sequencing are shown in SEQ. ID NOS:125-146. The sequence of AIM10 is presented as SEQ ID NO:10. The longest ORF in the AIM10 sequence is 693 AA long and represented in SEQ ID NO:154. Sequence 10 on BLASTN search of non-redundant databases at NCBI aligns with Human mRNA for E1b-55 kDa-associated protein, locus HSA7509 (Accession AJ007509, NID g3319955).
Clonal DNA may be directly injected into test animals in order to test the ability of these nucleic acids to induce TRRE activity, counteract septic shock and/or affect tumor necrosis, as is described in detail in Examples 3 and 4. Alternatively, proteins or RNA can be generated from the clonal DNA for similar testing.
Example 6
Expression of Newly Obtained Clones
Example 5 describes 9 new clones which enhance TRRE activity in a cell surface assay system. The clones were obtained in the pBK-CMB Phagmid vector.
The following work was done on contract through the commercial laboratory Lark Technologies, Houston, Tex. The clones were removed from shuttle vectors and inserted into expression vectors in the following manner. Recombinant plasmid (pBK-CMVcontaining insert) was digested with appropriate restriction enzyme(s) such as Spe I, Xba I, EcoR I or others, as appropriate. The Baculovirus Transfer Vector (pAcGHLT-A Baculovirus Transfer Vector, PharMingen, San Diego, Calif., Cat. No. 21460P) wasalso cut with appropriate restriction enzyme(s) within or near the multiple cloning site to receive the insert removed from the shuttle vector.
The fragment of interest being sublconed was isolated from the digest using Low-Melting agarose electrophoresis and purified from the gel using a Qiaquick Gel Extraction Kit following Lark SOP MB 020602. If necessary, the receiving vector wastreated with alkaline phosphatase according to Lark SOP MB 090201. The fragment was ligated into the chosen site of the vector pAcGHLT-A. The recombinant plasmid was transformed into E. coli XL1 Blue MRF' cells and the transformed bacterial cells wereselected on LB agar plates containing ampicillin (100 .mu.g/ml). Ampicillin resistant colonies were picked and grown on LB broth containing ampicillin for plasmid preparation.
Plasmid DNA was prepared using Alkaline Minilysate Procedure (Lark SOP MB 010802 and digested with appropriate restriction enzyme(s). Selected subclones were confirmed to be of the correct size. Subclones were digested with other appropriaterestriction enzyme(s) to ascertain correct orientation of the insert by confirming presence of fragments of proper size(s). A subclone was grown in 100 ml of LB broth containing ampicillin (100 .mu.g/ml) and the plasmid DNA prepared using Qiagen MidiPlasmid Preparation Kit (Lark SOP MB 011001). The DNA concentration was determined by measuring the absorbance at 260 nm and the DNA sample was verified to be originated from correct subclone by restriction digestion.
Thus were produced the expression constructs for Mey3, Mey5, Mey6, Mey8 now with the coding sequence of interest fused to GST gene with polyhistitidine tag, protein kinase A site and thrombin cleavage site. The GST gene and now the fusionprotein are under the polyhedrin promotor. PharMingen (San Diego, Calif.) incorporated the vector with insert into functional baculovirus particles by co-inserting the transfer vector (pAcGHLT) into susceptible insect cell line S along with linearizedvirus DNA (PharMingen, San Diego, Calif., BaculoGold viral DNA, Cat. No. 21100D). The functional virus particles were grown again on the insect cells to generate a high titer stock. Protein production was then done by infecting a large culture ofcells in Tini cell. The cells were harvested when the protein yield reached a maximum and before the virus killed the cells. Fusion proteins were collected on a glutatione-agarose column, washed and released with glutathionine.
Proteins collected from the affinity column were quantified by measuring OD.sub.280 and were assayed on gels using SDS-PAGE and Western blotting with labeled anti-GST (PharMingen, San Diego, Calif., mAbGST Cat. No. 21441A) to confirm that allthe bands present included the GST portion.
Four of the ten sequences have been cloned, expressed in bacculovirus infected insect cells, and then purified.
TABLE-US-00002 TABLE 2 Expressed protein from Jurkat library clones Amount of protein Name Sequence in insert (mg/mL) Mey3 AIM4 4.7, 5.0 Mey5 AIM6 1.36, 1.50 Mey6 AIM7 0.33 Mey8 AIM9 1.53
Gels indicated the presence of the GST protein in addition to larger proteins that were also positive with the anti-GST antibody in Western analyses. Mey3 repeatedly exhibited the presence of proteins around 32 kDa, 56 kDa, bands around 60-70kDa and another larger than 70 kDa. Mey5 consistently had proteins migrating as approximately 34 kDa, 38 kDa, 58 kDa, around 60-70 kDa, and others larger than 70 kDa. Mey6 had protein bands around 34 kDa, 56 kDa, 58 kDa, and bands around 60-70 kDa. Mey8 had protein bands around 36 kDa, 58 kDa and bands around 60-70 kDa. All of the indicated bands were positive for GST. The bands may represent the desired fusion protein or degradation/cleavage product generated during growth and purification.
Example 7
Assay of Expression Products for Effect on TNF-R Cleaving Activity
The following method was used to measure TRRE activity of Mey 3, 5, 6 and 8. C75R cells and COS-1 cells were seeded into 24-well culture plates at a density of 2.5.times.10.sup.5 cells/ml/well and incubated overnight (for 12 to 16 hours) in 5%CO.sub.2 at 37.degree. C. After aspirating the medium in the well, 300 .mu.l of 1 .mu.g of Mey 3, 5 and 8 were incubated in each well of both the C75R and COS-1 plates for 30 min in 5% CO.sub.2 at 37.degree. C. (corresponding to A and C mentionedbelow, respectively). Simultaneously, C75R cells in 24-well plates were also incubated with 300 .mu.l of fresh medium or buffer (corresponding to B mentioned below). The supernatants were collected, centrifuged, and then assayed for the concentrationof soluble p75 TNF-R by ELISA as described in Example 1.
The following results were obtained:
TABLE-US-00003 TABLE 3 Enzymatic activity of expressed clones TNF-receptor releasing activity Clone No. U/mg Mey-3 341 Mey-5 671 Mey-6 452 Mey-8 191
Example 8
Effectiveness of Expression Products in Treating Septic Shock
The protocol outlined in Example 3 was used to test the effects of the expression products from the new clones in preventing mortality in the septic shock model.
Different amounts of recombinant Mey 3, 5, and 8 (10-100 ug/mouse) were injected i.v. in a 0.05 ml volume within an hour prior to or after injection of a lethal dose of LPS. Serum (0.1 ml) was collected using a 27 gauge needle and 1 ml syringefrom the tail vein at 30, 60 and 90 minutes after LPS injection. This serum was heparinized and stored frozen at -20.degree. C. Samples from multiple experiments were tested by ELISA for the presence of solubilized TNR-R, the TNR ligand, IL-8, andIL-6. Animals were monitored over the next 12 hours for the clinical effects of shock. Selected animals were euthanized from 3 to 12 hours after treatment, autopsied and various organs and tissues fixed in formalin, imbedded in paraffin, sectioned andstained by hematoxalin-eosin (H and E). Tissue sections were subjected to histopathologic and immunopathologic examination.
FIG. 5 shows the results obtained. (.diamond-solid.) saline; (.box-solid.) BSA; (.DELTA.) Mey-3 (100 .mu.g); (X) Mey-3 (10 .mu.g); (*) Mey-5 (10 .mu.g); (.circle-solid.) Mey-8 (10 .mu.g).
Mice injected with LPS alone or LPS, a control buffer or control protein (BSA) died rapidly. All of the animals in this group were dead at 24 hours. In contrast, when injections of LPS were accompanied by injections of a 10-100 ug of Mey 3, 5and 8, death was delayed and death rates were lower. None of the animal were dead at 24 hours that had been treated with Mey 3 and Mey 5. Only 66% of the animals were dead at 24 hours that had been treated with Mey 8. Thus, Mey 3, 5 and 8 were able tocounteract the mortality induced by LPS in test animals.
>
7 base pairs nucleic acid double linear Genomic DNA TTTTG CTTTCCTTCC CCGGGAAAGG CCGGGGCCAG AGACCCGCAC TCGGACCAGG 6GCTGC GGGGCCAGAG TGGGCTGGGGAGGGCTGGGA GGGCGTCTGG GGCCGGCTCC AGGCTGG GGGCCGCCAG CTCCGGGAAG GCAGTCCTGG CCTGCGGATG GGGCCGCGCG GGCCCGG CGGGGCGGCC TCGGGAGGCG TCCAGGCTGC GGGAGCGGGA GGAGCGGCCG 24GCGCC AGCGCCGTGG GTGGAGGTCG CCGTCCCTCC TGAGGGGCAG CCAGTGCGTT 3ACCCGG GAGCAGAGCC CGCGCCTCCC CAGCGGCCTC CCCGGGGGTC TCACCGGGTC 36AGAGC GGAGGCCCCG GCTCCGCAGA AACCCGGGGC GGCCGCGGGG AAGCAGCGCC 42GCGTC GGAGGAGCCC CCAGAAGGAC CTCGCGCCTT CCCGCCGGGC TCCGACCGCC 48TCGGT GCGGGACGGC CCAGGCCGCC AGGACCCCCAAGCGCAGCTC AGTCTGCGGG 54ACCCA GAGGCCAGCA GCAGAGGACG GGGCCGGGGC CGGGAGAGGG CGGGGAGGGC 6CTGGGA GGTCAAGGCC AGGGCTAGAC TTTCAGGGTC ATGGCCTGGC CCCTCATCCC 66AGGTG AGGGGGCTCT GTGAGCAGAG GGGGCCCCGG TGGAGAAGGC GCTGCTAGCC 72CGGGGCAGGAGCCCA GGTGGGGACT TAAGGGTGGC TGAAGGGACC CTCAGGCTGC 78TAGGG AGGGAAGCTA GGGGTGTGGC TTGGGGAGGT GCTGGGGGAC CGCGGGCGCC 84TTCTG AAGCCGAATG TGCTGCCGGA GTCCCCAGTG ACCTAGAAAT CCATTTCAAG 9TCAGGA GTTTCAGGTG GAGACAAAGG CCAGGCCCAG GTGAAAATGTGGCAGTGACA 96TGGGG TGAGAACCAC GGAGAGAGGA AGTCCCCGAG GCGGATGATG GGACAGAGAG GGGACCAG AATTTTTTAA AACGCATCTG AGATGCGTTT GGCAGACTCA TAGTTGTTTT TTTCACGG AGAAAGTGTG GGCAGAAGCC AGCTCTAAAG CCCAGGCTGC CCAGCCTGCA GGCAGAGC TGACGGAAGGCCAGGGCAGA GCCTTCCCTC CCTGTCACAG ACATGAGCCC GAGATCTG GAATGAGGCA GATGTGCCCA GGGAAAGCTG ATCCGCCCCG ACCCAGGGCC CCGGGTGC CCCTTTGAGC GTGGAATCGT TGCCAGGTCA TGGCTCCCTG CTATCGAACA GGACACGG GTCGTGTGCT GCACCTGGCA GTTGCAGGAC CGACACCCACAATGCCTTAA GGTGATGA CTGCCTTCCA GGGGCCTGGC TGGCTGACAC TTTGCATGGC TCCTGGAGAA GGGATTGA GTGGAGTCCA CGGGTCATGG CCACGTCCTG GGTGCTGCCT CTGAGGCAGG CCGGCTGG GGTGAGAAGG GGCTGGAGAC AGGTTCCTGC CAGTTCAGCC TCTAACCGGT TCTTCATG CCTAGGAACCCACTGGGGGC TTATGAAACT GCAGGTGGCT GAGTCCTTGC TGGGGTCT CTCCTTCAGG AGGTCTGGGT GGGGCCGGAG ACTGTACCCC ACAAAGGGTC AGGTGAGG CGGATGTGGC CTGGCGCTGT GTGGCTCTGG ACCTAGTCCT TGGGCTTGGG GGCGCCCA GGGCCTGGGC TTGAGACAGC TGTGACGCAG GCAAGCCATTTACCCCGTTT GGGGACAT TACATCTTCC TAGCTTGGAA CACACAGGCA GCCAGGGTTG TTATCCACAT CTCCTCCA TGTTCTTCTC TTGAGAACTT TTACCAGGTA TGTCAGGAGC TGGGCTCCAC GGGAGACT CAAGTGGAAA GCCCTCATCC TTGTCCTCCA GGAGACAGGA AAACCTATGG ACAATTCC AGGGACAAGAGCGATGCATG TGAGGTGTGG CAAATCTCAC TGTTCAACTG 2AAATCAG AGACAGCTTC CTGGAGGCAG TGACACCTGG ACAGGCTTCT CCACAGGAGG 2CGAGTGA GAGAAGCCAA CTGGGATGGA CCCATCATGT AGGGGGAACA GTGCGCGCAG 2CAACAAC CACCCCCACC CTAGGCCCAG AGCTCACGGA GAGAGCTGGGCCTCTCGGGG 222ACATA GTTCCCTGCT GGATCTTAGG TCTTGTCCTT GGGCAGCTCT GCTGAGACCT 228CCTGT TCCAGGCTGC ACCAAGGTTT TGTGACTATT GGTCTGGGGT TGTTTTGCAG 234GAAGT GTTCTGTTGT AAAACAGGCA CTTGATTTGC TGGAAGGAAT GCTGTTTGTT 24CTGCGA CAAACATTGAGCAGCATTTA GTGGGCGGTT TATATCTTGT GGAGTAATGG 246TTTGA AGTCTGTCCT GGGTACTGCA CATTAAAAGG AATATCATTT TCTGAAACAT 252TTTTC CACACCAGAA ATCATATCCT CTTGCTGGTC CATGTCTGAA GACCTTACAC 258AGTCT TAATGTAAGT TTAGTAGAGT CCTTGGATGG AGAACTAATTATATCATACA 264GCTTT CTCACTCTGC TCTTTTTCAT CCTTGCCTAA TTTCATTTTC TTCTGCTTCT 27TTTTCT TTCTGGAGAA TCTAGCAAGA TATCTGGTGG AACATCTCGA GGTGATGAAC 276AGAGA CTGAGATTGT AGGATTAAAG GTGGTCTTGA GCCTTTAGGA GTTCCTTCAC 282GCAGG GGAGCATACTGGCTGTGGAG ATCTCAAGGG AAAAGATGCA GCATTCCTCA 288GAAGA ATCTCCATCG TCACTACTTA GCCTGTGCAC CATGTGTAGG TAGTCCTCAC 294CCATG TCTAGGATTA TCAGCATGAT GATTAGCTGA ATTGCCAGAC AACGGACCAG 3CTTTATT ATCATGTATG TTTCTCAAAC CACCTGCAAC AATGGGACTTGATACCGATG 3GTTGCAT CTGTGGATGT GTTGTGTAAC TTGAAGGATG GGAATATGGC ATGTATCCTG 3GGCTTTG TGGGGCGTAT GGACTAGGCA CTGGGCTATT TTGCTGTGGC ATAAATCTGT 3CAGAGCT TGTCTGTGGT GGCACAAACC GGCTGGAGGG GCTATGTGAG ATAGTGGTTT 324TAATT GGAAGATGCAGGACTACTGT GCATGGAATT CTGAGAAAGT TTATACTGAG 33CATCAT TCCACTTTGT ACATATCTGT TCTGCATGCT TTTCTCCCTG AAAACATTAG 336CTTGC CAGGACGGCC TGCAACAAGA CTGGTATGTC ACCTTCTGGG TCATCACTGC 342TTATC TTTCAACTCT ATGTGATCTG TTGATACCTG GTTGAGGCTATGGACAAGCT 348ACCAA ATTGTCATCC CTACAAGCCA AAAGGCAGTT CACCTCTTCT GCTATTCGTG 354AAGAG AAGGCTCTTT GTAGTTGTAG CAGGTAAAGG AGATGGAAGA GGCAGCTGGT 36GAGGTC TGTGAGACTA GCAATCCCCG CAAGAGTAGT AATGGGGACA TGGGGCATAT 366TTCAT CCTGAATTTCTGGAATGGTG TTGCCTATAA AAGTACTTAG TTCAGGTGCC 372TCATT ACTTCCCATT TCCCAAACAC TGGGCGAATC GGCGTCTGAA TCCAAGGGGA 378AGGCC GCTGTGGCGA GAGACTATAA TCCGGGCCGG GAGGGGGGGC GGCTACGGCT 384TCCGT CTCCTCAGTG CGGGGAACAT GTAGAGCCGG GGGGAGACCAGCCGAGAAGA 39TCGTTG CTTCTTCTTC CTCCTCCTCC TCCTTCTCCC ACATAGAAAC ACTCACAAAC 396ACCAC GGGCCCGAGC TACCGGGGGG GCATCGCCGC GGGCCCGGGA ACCAATTCTC 4TCGGCGG GGGCGTCCTT TGGATCC 4 base pairs nucleic acid double linear Genomic DNA 2GGATCCAAAG GTCAAACTCC CCACCTGGCA CTGTCCCCGG AGCGGGTCGC GCCCGGCCGG 6GGCCG GGCGCTTGGC GCCAGAAGCG AGAGCCCCTC GGGGCTCGCC CCCCCGCCTC GGGTCAG TGAAAAAACG ATCAGAGTAG TGGTATTTCA CCGGCGGCCC GCAGGGCCGG ACCCCGC CCCGGGCCCC TCGCGGGGAC ACCGGGGGGGCGCCGGGGGC CTCCCACTTA 24CACCT CTCATGTCTC TTCACCGTGC CAGACTAGAG TCAAGCTCAA CAGGGTCTTC 3CCCGCT GATTCCGCCA AGCCCGTTCC CTTGGCTGTG GTTTCGCTGG ATAGTAGGTA 36AGTGG GAATCTCGTT CATCCATTCA TGCGCGTCAC TAATTAGATG ACGAGGCATT 42ACCTTAAGAGAGTCA TAGTTACTCC CGCCGTTTAC CCGCGCTTCA TTGAATTTCT 48TTGAC ATTCAGAGCA CTGGGCAGAA ATCACATCGC GTCAACACCC GCCGCGGGCC 54GATGC TTTGTTTTAA TTAAACAGTC GGATTCCCCT GGTCCGCACC AGTTCTAAGT 6TGCTAG GCGCCGGCCG AAGCGAGGCG CCGCGCGGAA CCGCGGCCCCCGGGGCGGAC 66GGGGG GACCGGGCCG CGGCCCCTCC GCCGCCTGCC GCCGCCGCCG CCGCCGCGCG 72GAAGA AGGGGGAAA 739 233 base pairs nucleic acid double linear Genomic DNA 3 CAAGAGTGGC GGCCGCAGCA GGCCCCCCGG GTGCCCGGGC CCCCCTCGAG GGGGACAGTG 6GCCGCGGGGGCCCCG CGGCGGGCCG CCGCCGGCCC CTGCCGCCCC GACCCTTCTC CCGCCGC CGCCCCCACG CGGCGCTCCC CCGGGGAGGG GGGAGGACGG GGAGCGGGGG GAGAGAG AGAGAGAGGG CGCGGGGTGG CTCGTGCCGA ATTCAAAAAG CTT 233 2998 base pairs nucleic acid double linear Genomic DNA 4GGATCCAAAG AATTCGGCAC GAGGTAGTCA CGGCTCTTGT CATTGTTGTA CTTGACGTTG 6GGTGA GCTTGGAAAA GTCGATGCGC AGCGTGCAGC AGGCGTTGTA GATGTTCTGC TCCAGCG ACAGCTTGGC GTGCTGGGCG CTCACGGGGT CCGCATACTG CAGCAGGGCC AACTGGT TGTTCTTGGT GAAGGTGATG ATCTTCAACACTGTGCCGAA CTTGGAGAAA 24GTGCA GCACATCCAG GGTCACAGGG TAGAAGAGGT TCTCCACGAT GATCCTGAGC 3GGCTCT GCCCGGCCAT CGCCATCCCT GCATCCACGG CCGCCGCCGA GGCAGCCAAG 36GTTCC CCGACTGGAC CGAGTTCACC GCCTGCAGGG CCGCCTGGGC CCGCGCCTGG 42AGAGCTGTCGGTCTT CAGCTCCTTG TGGTTGGAGA ACTGGATGTA GATGGGCTGG 48CAGCA CAGGGGTCAC CGAGGTGTAG TAGTTCACCA TGGTATTGGC AGCCTCCTCC 54CATCT CGATGAAGGC CTGGTTTTTC CCCTTCAGCA TCAGGAGGTT GGTGACCTTC 6AGGGCA GCCCCAGGGA GATGACTTCC CCCTCCGTGA CGTCGATGGGGAGCTTCCGG 66GATCA CTCTAGAGGG GACGCCTGCA CTTCGGCTGT CACCTTTGAA CTTCTTGCTG 72TCCGT TTGCTGCAGA AGCCGAGTTG CTGCTCATGA TAAACGGTCC GTTAGTGACA 78AGAGA AAAGCTCGTC AGATCCCCGC TTTGTACCAA CGGCTATATC TGGGACAATG 84CATGG CACACAGAGCAGACCCGCGG GGGACGGAGT GGAGGCGCCG GAATCCTGGA 9GAGCTG CAGATTGAGT TGCTGCGTGA GACGAAGCGC AAGTATGAGA GTGTCCTGCA 96GCCGG GCACTGACAG CCCACCTCTA CAGCCTGCTG CAGACCCAGC ATGCACTGGG ATGCCTTT GCTGACCTCA GCCAGAAGTC CCCAGAGCTT CAGGAGGAAT TTGGCTACAACAGAGACA CAGAAACTAC TATGCAAGAA TGGGGAAACG CTGCTAGGAG CCGTGAACTT TTGTCTCT AGCATCAACA CATTGGTCAC CAAGACCATG GAAGACACGC TCATGACTGT AACAGTAT GAGGCTGCCA GGCTGGAATA TGATGCCTAC CGAACAGACT TAGAGGAGCT GTCTAGGC CCCCGGGATG CAGGGACACGTGGTCGACTT GAGAGTGCCC AGGCCACTTT AGGCCCAT CGGGACAAGT ATGAGAAGCT GCGGGGAGAT GTGGCCATCA AGCTCAAGTT TGGAAGAA AACAAGATCA AGGTGATGCA CAAGCAGCTG CTGCTCTTCC ACAATGCTGT CCGCCTAC TTTGCTGGGA ACCAGAAACA GCTGGAGCAG ACCCTGCAGC AGTTCAACAT AGCTGCGG CCTCCAGGAG CTGAGAAACC CTCCTGGCTA GAGGAGCAGT GAGCTGCTCC GCCCAACT TGGCTATCAA GAAAGACATT GGGAAGGGCA GCCCCAGGGT GTGGGAGATT ACATGGTA CATCCTTTGT CACTTGCCCT CTGGCTTGGG CTCCTTTTTC TGGCTGGGGC GACACCAG TTTTGCCCAC ATTGCTATGGTGGGAAGAGG GCCTGGAGGC CCAGAAGTTG GCCCTGTC TATCTTCCTG GCCACAGGGC TTCATTCCCA GATCTTTTCC TTCCACTTCA GCCAACGG CTATGACAAA ACCACTCCCT GGCCAATGGC ATCACTCTTC AGGCTGGGGT GCTCCCTG ACCAATGACA GAGCCTGAAA ATGCCCTGTC AGCCAATGGC AGCTCTTCTC ACTCCCCT GGGCCAATGA TGTTGCGTCT AATACCCTTT GTCTCTCCTC TATGCGTGCC TTGCAGAG AAGGGGACTG GGACCAAAGG GGTGGGGATA ATGGGGAGCC CCATTGCTGG 2TGCATCT GAATAGGCCT ACCCTCACCA TTTATTCACT AATACATTTT ATTTGTGTTC 2AATTTAA AATTACCTTT TCATCTTGCTTGATTTTCCT TCAGCTAAAT TAGAAATTTG 2TTTTTCC CCTAAAAAAT TCAATGGCAT TCTTTCTTAT AAATTACATT CTCTGATTTT 222CAGCC TGCTTCAAGG AAATCCATGT GTTCAAAATG CTTGCTCGCA GTTTGCTCCA 228AATGG TTGCTTAACC CAAATATCTG AGCAGCAAAT TGAGCTGATC CTTCTGGAGA 234CGGTT GAACAGCCAA GACCACTGGG TAGTCGAAGA GAAGACCACA CATCCTGAAC 24CAGTCT GGTGTGAGGG GAGGACAGCT GATAACTGGA TATGCAGTGT TCCCAGACAT 246GTCCC AAACCATTAC TTCTGCCTGC CACTGCCACA AATACAGTAG GAATGCCATC 252CATAC TCAGCTTTAA TCCTCAGAGTTTCATCTGGT CCTTTATGCG CAGATGTTAC 258GTTCA CATGGAATGC CAAAATTTCC ACAGGCCTTC TTGATTTTTT CACAGTGACC 264CAGAA GTAGAGCCCA TCAACACTAC AACCCTGCAC TGACTTTCTG ATTTCAAAAG 27TCTACT CTCTCTGCAA CCCACTCAAA GTTTTTCTTT ACCATTTGGA GCCCTTCAGG 276CTTCT TTGAGGTCCC GATAAGACTG TTTGTCTTTC TGTTGGCTTC GATCTCCTGA 282AGAGT CTCCAGGAAT CATTGTCAAT AACATCAGCA AGAACAATTT CTTTGGTGGT 288CAACA CCAAATTCAA TCTTCATATC AACCAGTGTA CAATTCTGGG GCAACCAGGA 294CCAGT ATTTCAAATA TAGCCTGTGTAGCATCTCGT GCCGAATTCA AAAAGCTT 2998 4e pairs nucleic acid double linear Genomic DNA 5 AAGCTTTTTG TGAAAACCCT AGGATATGTC CCCTCCCTCA CCACACCCAA CCCCCCGCCC 6CCAGG ACATGACGAT GCCTCACACA CACACACACA CACACATACA CACAAGGCCG GCTGCAC GCAGGAACATGGGCTGCACT CACGACAACA TTGAAAAAAT ATACATTATA GTACACC CGGGGCCCCC ACGTCCCCTC CCGTCCCCGC AGCCTGGCCA CACCAGGTCA 24GAGGG GCCGGGGCTG CAGGACCTCA GGACTGCAAG GGCAGGAAGG GAAACAGGAC 3AAGGAA GGAAGTTGGA AAGGAGGGAG AAATGGGGTC CCCAGACTGA AATGGAAATG36GGGCG ATCATAAGAG AAGCAGGGAC GATGGTCCAG CTGAGGGAGC CCTGCAGAGG 42AAGCT TCCCATGGAC AGGAGAGAGA AGGGAAGGGG AGAGGAGAGG GTTTCCTTCA 48ACCCC CAGCCCCAGC CCCAGCCCCA GCCATTGCAA TCGTCACCCT CTCCCCAACA 54AGTGC TAAGGGGGCA GCTGCCATTGGGGGTAGAAA GGCAGCTGAA GTCCAGCCCA 6CCAACC CAGCCAGCCC CAGTGCAAGG GGCACACCAG GAGCATGACA GCCCAGAAGT 66ATGGG GGGCCGGGGG AGGGGCAGGG CGGACTCCAG AGGGCCCGCT GGGGTTTTGA 72AAGGA GGACTGGTTC TGAAGCCTCT CTCCCTCTTG GTCTCTGTGT TCCCAGAAAG 78CTCCC ATGTCTGGAG TGTCTGTTTC ACCAGGGCAG AATTCCCCCT CTGCGTGGGG 84TGTAG GCCTTAGTAG CGGTGTGGGG GGGTCTCGAT GATGCGTCTC TCGTCGCTGC 9GGAATC GGCCACCTCC GAGTCACTGC TGTCCTCATC CTCCTGCTGG CCCCCAACAG 96GTCAC ACAGGACTGC CGATTCTGGT AGGACTCCATGGGGTTCACA ATGATGGTGA GCTGAGTC ATCCCAGAAG AGGTCTGGGT CCTTGGGGTC ACTGGAGGCC CCTGGAGGCC CCGGCCCC TGAGACGCGG CGGTGAAGGG AATGGATGCG CACCAGGCCC AGGACGACCA AGCACCAG GAAGCCCACG CACACCACAA TGATGAGGGT TGCGGCGCTG GGTATCATGG TTTCTGTGGGAGCTGGCT AGGCTGTGTC CAGCCATCTC AGGCGGGGGC TGGTGACCAC TGCAGGAA CTGCTGGGAG CTGAGCACGT GGCTGGGGTG GGCAACCCGG TTCATGCTGT AGGACATT GACCTCCACG ATGAATTCAT TGCTGGAGTA ACGGCCATTC ATTTCCGAGC GAAAGCCG GAACTTCCTG GTGTAGAGGG CAGCTCCGTGTCGCAGCCGA TAACGAGCCT CTCAGGAT CTCTTCATAC ACAGTGATGC TCTCCACCCC AGCAATAGTG AGGTAGGCAG GTGTTGGT GAGCTCCAGC CCCCGCTGCT GCAGAGAGGT TGTGTCCAGG AGCAGGCTTT CGCTCGGG ATCCAGGTCA TCCCCCACCA GAGAAATTTC ACAGCCATCC AGGTTGTGCA ATCTCATCCGACATGCGT GTGTCTGTCA CTGTGCCCTG CCAACTCTCA TCCTTTTTGG TCCACCTG GTGAGAAATG GAGCAGGTGA TTTGAAGATC AGGGAACAAA GGGACGCCGT GTTCCCTC AAAGTCCACA GCTGGGCGGG CAAAATGAGC AGTGCCACTC AGCAGGATCT GGGGCGTC AGGCTGAAGG ACGACCACGT AGCCCTCCACTTCAGGGATG GAGACGCAGG TCTTCGCT GAAGCACTTG ACAGCAGTGG TGAGGCGCAG GGGCCTGACG CCGGGCGTGG AAGCGCAG AGTGTTCATG TAAGCCACAT GCTGCAGGGC ATGGTTGAAG GTCTCCACAT TCCCCCTC CAGGGTGAGC AGGGACTGTG AGGGGTTCAC GTGGACCTTC ATGCCTTTGC 2GGCTCTCGAAATCCCTA TAGTCCAGCC CCTCCCGACA TGCATAGAGG CACTCGATGA 2CGCGGCT CTCCAGGCGA CCTGAGCGCA CGCTGAAACC AGCCAGGTAG CCATGGAAGT 2GGTGGAT CGACAAAGGG TCTCCTTGGG TGGTGTCTGT ACTGTTGTCT CCCTTTTCCT 222TTGTT CTTCTCCTCA GTCCAGCAGG CCCCAATCATGAGAGCAGGC TCCCTTCGGG 228TGGAT GAGGCCATTG TCATGGATGA GGGCAGGGTC GAAGGAGATG CCGTCGGTAT 234GTGAC TGTGGGGAAC TCGAGGTTCA GAGCGTAGTG GTGCCACTCA TCATCACAGA 24CTCCAG CTTCCAGAGG AACTTGACTG GGCGGGCACT CTCAAGCAGG GGCCAGTAGA 246GCAATCCTACAGCCG TGGACAGTCA GCGAGTAGTG AGAGAAGCCG TCCTCATTCT 252GTGTT ACATACGATG GTTTCCTCTT CCTTCTTGCC CTTGTTGGGA GTTACGCCAT 258ATCCA GAAGGACAGG GTGAAGTGGT CACTGAGGCT GTCCTGGGGC CCAGAGCCCA 264CTGGG GCCACCCAGG GGCACCTGCA CAGCCTGGGTGCCATTGAAC CAGTAGATCA 27GCTGTC CTGGCTGTAG TGCACCGAGA GTCCTGCTGT CCAGTTGGCA TTGGGGCCAG 276GGCAA CAGATCCACT TCCCCAGTGG CAGCACCACA GAGTTTCCGC AGCGCCCGCT 282TAGTT GTCACGGTCA CAGCCCTTGG CCACATGGCT GGTCTGCAGC TCTATGGTGG 288ATGTTCCAGAGTGGT TCATCACAGG TCTCCAGGCG GATACCAGGG AACAAAGCCA 294CCAGC ACCTGGTGCA TATTCGATCC TTTTGTTCCA GCCTTGCCAG CTGGGTTTAC 3TGGGCTT CACCTGAATC TCCACCTCAG CATCATCTGC TGCCCGCTTC TTCCCACAGT 3AAGCTGT CACTGTAAAC TTATAGAGCC TCTCACCACTGTACTGCAGC TTCTCTGTGT 3CAATGTT CCCGTCATTG TCAATGAGGA AAGGGGTGTT GGGTGTGAGA ATCTCATAGT 3AGATCTG GCTGTACTGG GGGGAGCAGT CACCGTCAAT GGCTTCCACC CGCAGGATGC 324TACAG CTTCCCCTCT GTCACAGCCG CACGATACAG CCGTTCCACA AACACTGGGG 33CTCGTTCACATCGTTG ACCCGCACAT GCACAGTGGC CTTGTGGGAC TTCTTGGTGT 336CCGTC GGGGCCCTCG CCACAGTCAT AGGCCTGGAT GGTGAAGGTG TGTTCCTTCT 342TCGCA GTCCACAGGC TCCTTGGCCC GGATCAGCCC CTCTCCTGTC GCCTTGTCAA 348ACAGC CTCAAAGGGC ACCCCAGACC CATGGAGCCGGAAGCCGCAG ATCTCACCTG 354CGCAG CGGGGCATCC TTGTCCAAGG CAAAGAGTGG TGGATTCAGT AGGACCGTGT 36ATTCTC CATGACGATG CCCTGGTACT CTGCCTCAAT CCATGGCTTG TGCTTGTTGG 366TTACA GGAGCAGGAC GCGAGCAGAG AGGCCAGCAG AAGGGGCAGC AGCAGGAGGG 372GTGCGGCGTGGGGCA GGGCAGGGCC AGGCGTTTGC CTCCCCTGGG AGCCTCCAGC 378GATTC CACCTTGCGG GAGGGATACA GGGGGGGAAA ACCAAAATAA AACGTCAAAT 384GTGTA GGAGGAGTCC AGCTTAGGAC CGGGCCAGAG CCAGGCCAGG CTCGGGGAGG 39CTCTGC AGGTTCAGAG GATCACTGCT GCCACCACCGCCACCCTGGG AGCCAGTTAT 396CATGG CCTTGATTGC AACAGCTGCC TCCTCTGTCA TGGCAGACAG CACCGTGATC 4ATCTCTT CTCCACAGTC GTACTTCTGC TCAATCTCCT TGCCAAGGTC TCCCTCAGGG 4CGAAGGT CCTCTCGTAC CTCCCCGCTG TCCTGGAGCA GTGATAGGTA CCCATCCTGG 4TTTGGAT CC47 base pairs nucleic acid double linear Genomic DNA 6 GGATCCAAAG ATTCGGCACG AGTGGCCACA TCATGAACCT CCAGGCCCAG CCCAAGGCTC 6AAGCG GAAGCGTTGC CTCTTTGGGG GCCAGGAACC AGCTCCCAAG GAGCAGCCCC CCCTGCA GCCCCCCCAG CAGTCCATCA GAGTGAAGGAGGAGCAGTAC CTCGGGCACG GTCCAGG AGGGGCAGTC TCCACCTCTC AGCCTGTGGA ACTGCCCCCT CCTAGCAGCC 24CTGCT GAACTCTGTG GTGTATGGGC CTGAGCGGAC CTCAGCAGCC ATGCTGTCCC 3GGTGGC CTCAGTAAAG TGGCCCAACT CTGTGATGGC TCCAGGGCGG GGCCCGGAGC 36GGAGGTGGGGGTGTC AGTGACAGCA GCTGGCAGCA GCAGCCAGGC CAGCCTCCAC 42TCAAC ATGGAACTGC CACAGTCTGT CCCTCTACAG TGCAACCAAG GGGAGCCCGC 48GGAGT GGGAGTCCCG ACTTACTATA ACCACCCTGA GGCACTGAAG CGGGAGAAAG 54GGCCC ACAGCTGGAC CGCTATGTGC GACCAATGAT GCCACAGAAGGTGCAGCTGG 6AGGGCG GCCCCAGGCA CCCCTGAATT CTTTCCACGC AGCCAAGAAA CCCCCAAACC 66CTGCC CCTGCAACCC TTCCAGCTGG CATTCGGCCA CCAGGTGAAC CGGCAGGTCT 72CAGGG CCCACCGCCC CCAAACCCGG TGGCTGCCTT CCCTCCACAG AAGCAGCAGC 78CAGCA ACCACAGCAGCAGCAGCAGC AGCAGCAGGC AGCCCTACCC CAGATGCCGC 84GAGAA CTTCTATTCC ATGCCACAGC AACCCTCGCA GCAACCCCAG GACTTTGGCC 9GCCAGC TGGGCCACTG GGACAGTCCC ACCTGGCTCA CCACAGCATG GCACCCTACC 96CCCCC CAACCCAGAT ATGAACCCAG AACTGCGCAA GGCCCTTCTG CAGGACTCAGCCGCAGCC AGCGCTACCT CAGGTCCAGA TCCCCTTCCC CCGCCGCTCC CGCCGCCTCT AAGGAGGG TATCCTGCCT CCCAGCGCCC TGGATGGGGC TGGCACCCAG CCTGGGCAGG GCCACTGG CAACCTGTTC CTACATCACT GGCCCCTGCA GCAGCCGCCA CCTGGCTCCC GGGCAGCC CCATCCTGAA GCTCTGGGATTCCCGCTGGA GCTGAGGGAG TCGCAGCTAC CCTGATGG GGAGAGACTA GCACCCAATG GCCGGGAGCG AGAGGCTCCT GCCATGGGCA GAGGAGGG CATGAGGGCA GTGAGCACAG GGGACTGTGG GCAGGTGCTA CGGGGCGGAG ATCCAGAG CACGCGACGG AGGCGCCGGG CATCCCAGGA GGCCAATTTG CTGACCCTGG CAGAAGGC TGTGGAGCTG GCCTCACTGC AGAATGCAAA GGATGGCAGT GGTTCTGAAG AAGCGGAA AAGTGTATTG GCCTCAACTA CCAAGTGTGG GGTGGAGTTT TCTGAGCCTT TTAGCCAC CAAGCGAGCA CGAGAAGACA GTGGGATGGT ACCCCTCATC ATCCCAGTGT GTGCCTGT GCGAACTGTG GACCCAACTGAGGCAGCCCA GGCTGGAGGT CTTGATGAGG GGGAAGGG TCTTGAACAG AACCCTGCTG AGCACAAGCC ATCAGTCATC GTCACCCGCA CGGTCCAC CCGAATCCCC GGGACAGATG CTCAAGCTCA GGCGGAGGAC ATGAATGTCA TTGGAGGG GGAGCCTTCC GTGCGGAAAC CAAAGCAGCG GCCCAGGCCC GAGCCCCTCA ATCCCCAC CAAGGCGGGC ACTTTCATCG CCCCTCCCGT CTACTCCAAC ATCACCCCAT CAGAGCCA CCTGCGCTCT CCCGTGCGCC TAGCTGACCA CCCCTCTGAG CGGAGCTTTG CTACCTCC CTACACGCCG CCCCCCATCC TCAGCCCTGT GCGGGAAGGC TCTGGCCTCT 2TCAATGC CATCATATCA ACCAGCACCATCCCTGCCCC TCCTCCCATC ACGCCTAAGA 2CCCATCG CACGCTGCTC CGGACTAACA GTGCTGAAGT AACCCCGCCT GTCCTCTCTG 2TGGGGGA GGCCACCCCA GTGAGCATCG AGCCACGGAT CAACGTGGGC TCCCGGTTCC 222GAAAT CCCCTTGATG AGGGACCGTG CCCTGGCAGC TGCAGATCCC CACAAGGCTG 228GTGTG GCAGCCATGG GAGGACCTAG AGAGCAGCCG GGAGAAGCAG AGGCAAGTGG 234CTGCT GACAGCCGCC TGCTCCAGCA TTTTCCCTGG TGCTGGCACC AACCAGGAGC 24CCTGCA CTGTCTGCAC GAATCCAGAG GAGACATCCT GGAAACGCTG AATAAGCTGC
246AAGAA GCCCCTGCGG CCCCACAACC ATCCGCTGGC AACTTATCAC TACACAGGCT 252CAGTG GAAGATGGCC GAGAGGAAGC TGTTCAACAA AGGCATTGCC ATCTACAAGA 258TTCTT CCTGGTGCAG AAGCTGATCC AGACCAAGAC CGTGGCCCAG TGCGTGGAGT 264TACAC CTACAAGAAGCAGGTGAAAA TCGGCCGCAA TGGGACTCTA ACCTTTGGGG 27GGATAC GAGCGATGAG AAGTCGGCCC AGGAAGAGGT TGAAGTGGAT ATTAAGACTT 276AAGTT CCCAAGGGTG CCTCTTCCCA GAAGAGAGTC CCCAAGTGAA GAGAGGCTGG 282AAGAG GGAGGTGAAG GAGCCCAGGA AGGAGGGGGA GGAGGAGGTGCCAGAGATCC 288AAGGA GGAGCAGGAA GAGGGGCGAG AGCGCAGCAG GCGGGCAGCG GCAGTCAAAG 294CAGAC ACTACAGGCC AATGAGTCGG CCAGTGACAT CCTCATCCTC CGGAGCCACG 3CCAACGC CCCTGGGTCT GCCGGTGGCC AGGCCTCGGA GAAGCCAAGG GAAGGGACAG 3AGTCACG AAGGGCACTACCTTTTTCAG AAAAAAAAAA AAAAAAACAA AAAGCTT 36 base pairs nucleic acid double linear Genomic DNA 7 GAATTCGGCA CGAGGTCAGT TTCCTGTGGA ACACAGAGGC TGCCTGTCCC ATTCAGACAA 6GATAC AGACCAGGCT TGCTCTATAA GGGATCCCAA CAGTGGATTT GTGTTTAATC ATCCGCTAAACAGTTCG CAAGGATATA ACGTCTCTGG CATTGGGAAG ATTTTTATGT ATGTCTG CGGCACAATG CCTGTCTGTG GGACCATCCT GGGAAAACCT GCTTCTGGCT 24GCAGA AACCCAAACT GAAGAGCTCA AGAATTGGAA GCCAGCAAGG CCAGTCGGAA 3GAAAAG CCTCCAGCTG TCCACAGAGG GCTTCATCAC TCTGACCTACAAAGGGCCTC 36GCCAA AGGTACCGCT GATGCTTTTA TCGTCCGCTT TGTTTGCAAT GATGATGTTT 42GGGCC CCTCAAATTC CTGCATCAAG ATATCGACTC TGGGCAAGGG ATCCGAAACA 48TTTGA GTTTGAAACC GCGTTGGCCT GTGTTCCTTC TCCAGTGGAC TGCCAAGTCA 54CTGGC TGGAAATGAGTACGACCTGA CTGGCCTAAG CACAGTCAGG AAACCTTGGA 6TGTTGA CACCTCTGTC GATGGGAGAA AGAGGACTTT CTATTTGAGC GTTTGCAATC 66CCTTA CATTCCTGGA TGCCAGGGCA GCGCAGTGGG GTCTTGCTTA GTGTCAGAAG 72AGCTG GAATCTGGGT GTGGTGCAGA TGAGTCCCCA AGCCGCGGCG AATGGATCTT78ATCAT GTATGTCAAC GGTGACAAGT GTGGGAACCA GCGCTTCTCC ACCAGGATCA 84GAGTG TGCTCAGATA TCGGGCTCAC CAGCATTTCA GCTTCAGGAT GGTTGTGAGT 9GTTTAT CTGGAGAACT GTGGAAGCCT GTCCCGTTGT CAGAGTGGAA GGGGACAACT 96GTGAA AGACCCAAGG CATGGCAACTTGTATGACCT GAAGCCCCTG GGCCTCAACG ACCATCGT GAGCGCTGGC GAATACACTT ATTACTTCCG GGTCTGTGGG AAGCTTTCCT GACGTCTG CCCCACAAGT GACAAGTCCA AGGTGGTCTC CTCATGTCAG GAAAAGCGGG CCGCAGGG ATTTCACAAA GTGGCAGGTC TCCTGACTCA GAAGCTAACT TATGAAAATG TTGTTAAA AATGAACTTC ACGGGGGGGG ACACTTGCCA TAAGGTTTAT CAGCGCTCCA GCCATCTT CTTCTACTGT GACCGCGGCA CCCAGCGGCC AGTATTTCTA AAGGAGACTT GATTGTTC CTACTTGTTT GAGTGGCGAA CGCAGTATGC CTGCCCACCT TTCGATCTGA GAATGTTC ATTCAAAGAT GGGGCTGGCAACTCCTTCGA CCTCTCGTCC CTGTCAAGGT AGTGACAA CTGGGAAGCC ATCACTGGGA CGGGGGACCC GGAGCACTAC CTCATCAATG TGCAAGTC TCTGGCCCCG CAGGCTGGCA CTGAGCCGTG CCCTCCAGAA GCAGCCGCGT CTGCTGGG TGGCTCCAAG CCCGTGAACC TCGGCAGGGT AAGGGACGGA CCTCAGTGGA GATGGCAT AATTGTCCTG AAATACGTTG ATGGCGACTT ATGTCCAGAT GGGATTCGGA AAGTCAAC CACCATCCGA TTCACCTGCA GCGAGAGCCA AGTGAACTCC AGGCCCATGT ATCAGCGC CGTGGAGGAC TGTGAGTACA CCTTTGCCTG GCCCACAGCC ACAGCCTGTC ATGAAGAG CAACGAGCAT GATGACTGCCAGGTCACCAA CCCAAGCACA GGACACCTGT GATCTGAG CTCCTTAAGT GGCAGGGCGG GATTCACAGC TGCTTACAGC GAGAAGGGGT GTTTACAT GAGCATCTGT GGGGAGAATG AAAACTGCCC TCCTGGCGTG GGGGCCTGCT GGACAGAC CAGGATTAGC GTGGGCAAGG CCAACAAGAG GCTGAGATAC GTGGACCAGG 2TGCAGCT GGTGTACAAG GATGGGTCCC CTTGTCCCTC CAAATCCGGC CTGAGCTATA 2GTGTGAT CAGTTTCGTG TGCAGGCCTG AGGCCGGGCC AACCAATAGG CCCATGCTCA 2CCCTGGA CAAGCAGACA TGCACTCTCT TCTTCTCCTG GCACACGCCG CTGGCCTGCG 222GCGAC CGAATGTTCC GTGAGGAATGGAAGCTCTAT TGTTGACTTG TCTCCCCTTA 228CGCAC TGGTGGTTAT GAGGCTTATG ATGAGAGTGA GGATGATGCC TCCGATACCA 234GATTT CTACATCAAT ATTTGTCAGC CACTAAATCC CATGCACGGA GTGCCCTGTC 24CGGAGC CGCTGTGTGC AAAGTTCCTA TTGATGGTCC CCCCATAGAT ATCGGCCGGG 246GGACC ACCAATACTC AATCCAATAG CAAATGAGAT TTACTTGAAT TTTGAAAGCA 252CCTTG CCAGGAATTC AGTTGTAAAT AAAATTGAAC CTGCTCAACA GCTGAGGGAG 258AAATG ATGGGTCCAT ATCCTGGTGC ATTGTCATAC AATTCAAACA ATGGTGCAGC 264GCTTG TAATTTTTAG GGACTGCAAACAAGGCTTTT TCTTGAAGCT GAACCAGAAA 27TTCTTA TGTTCCTTAG GCTTTGTAAT ATGTGCAGGA ATATATGGAT ACTGAGGAGG 276AATTT GGTCTCCACC AGTTACCAAT GCAATCGTCA ATGACCCAGT CTTGCAAAAC 282CCTGA CGACCCAGTA TCTCTGTCAT TAAGCGTTTT AGTCCTTCAA CTTCATCTTC 288GGTTA AGTTCACCAC CAGGTAGTTT GAAGAAAGTT GTTCCCAGCT GCAGCAGTAA 294GGGGT AGCCGGTGCT CATGTACAAT CAGAACCCCT TCTACAGTCC TCCTCATTCC 3TTTATCA AATTCTTCCC TCATGCGCTG AAATCTGGCT GCAACAGAGC TGTCCTTCTC 3GAGGGGC TCTTTTGTAC CAAAAGTATAATTGGTAAGA GGGTACAGGT TGATGGTGCG 3CAGGGTG AGGGGCTTCG TCTGCTGGAT GTACTTGTTG CCGAACTGAG TGACCCCCCG 3CCAGCCG GTCTGCGAGC GATTGGGCGG TACCACAGAC ATGCTGGCGA GCTCCGGCGC 324GCGAG CAGAAAGTGG CAGGCAGGGT AGACTTTCCC CGTGCGGGAA GCCTCGTGCC 33TC 33 base pairs nucleic acid double linear Genomic DNA 8 GAATTCGGCA CGAGAATGGA TCAACCTCAA CAACACGTTA AAGCTAGACG AAAGAAGTAA 6AGTGT ATGAGTCTCA CATGAAATAC CCGGATGTAA ATCCAAAGAA ACAGGAAGCA TGGTGGT TGCCAGGGAC AAGGGCGGTG GGAGGAGAAAATGGAGAGTA ACGGGACTTT TTTGGAG TGATGAGAAT GTTTTGGAGC TAGATAGAAG TGGTGGTTGT ACACCATTGT 24TACTA CCACTTAATT GTTCACTTAA AAAGTTAATT TATGTGAATT GCATCTTAAT 3AACAAG GATAACATTC CAACTCCTGG ACATTATCCT TCCTTTCCAT TTGATGTCAG 36TGTTAGAATTCTCAT CCGGTTTGGT CACTGCACTT AAGATGTGGA GAAATTAGGA 42AGTTA AGAGGAAGGA TAACACTGAT TAAGGTAGTG CTTTTCTAGG TTTCCCCTAA 48TTAAC AGATGGATAG TGGCACCACT TACGAGATGG AAAAACCAGC GGAAGGAAGA 54GGGAG AAGTTAAGTT TGTCTTGGGC CTGTGTTTTG CAACCTGAGTGTAAAAGACA 6TTAAGT CTTCAGTGGC GAAACACTAA AACTAGAAAT GGATCAGAAT TTTATCTTTG 66GACTT CTCAAGGATG GTCTTGTCAC TTCAGTGCCT GGTCAAATGA CAAGATGGGC 72TTTCC TGAAGGTCCA AGCACCTGAA CGTGGCAGGG TGACCCGATT CCGATTTGCT 78CAATC CTAGTTCATGCCTATTGTCC CTCATGTAAT TAATATCACT CTCAAAATGT 84TTTGT GCAATAAATT CTGCAACGTG ATGGCGCGAC TCTCGCGGCC CGAGCGGCCG 9TTGTCT TCGAGGAAGA GGACCTCCCC TATGAGGAGG AAATCATGCG GAACCAATTC 96CAAAT GCTGGCTTCA CTACATCGAG TTCAAACAGG GCGCCCCGAA GCCCAGGCTCTCAGCTAT ACGAGCGGGC ACTCAAGCTG CTGCCCTGCA GCTACAAACT CTGGTACCGA CCTGAAGG CGCGTCGGGC ACAGGTGAAG CATCGCTGTG TGACCGACCC TGCCTATGAA TGTCAACA ACTGTCATGA GAGGGCCTTT GTGTTCATGC ACAAGATGCC TCGTCTGTGG AGATTACT GCCAGTTCCT CATGGACCAGGGGCGCGTCA CACACACCCG CCGCACCTTC CCGTGCCC TCCGGGCACT GCCCATCACG CAGCACTCTC GAATTTGGCC CCTGTATCTG CTTCCTGC GCTCACACCC ACTGCCTGAG ACAGCTGTGC GAGGCTATCG GCGCTTCCTC GCTGAGTC CTGAGAGTGC AGAGGAGTAC ATTGAGTACC TCAAGTCAAG TGACCGGCTG TGAGGCCG CCCAGCGCCT GGCCACCGTG GTGAACGACG AGCGTTTCGT GTCTAAGGCC CAAGTCCA ACTACCAGCT GTGGCACGAG CTGTGCGACC TCATCTCCCA GAATCCGGAC GGTACAGT CCCTCAATGT GGACGCCATC ATCCGCGGGG GCCTCACCCG CTTCACCGAC GCTGGGCA AGCTCTGGTG TTCTCTCGCCGACTACTACA TCCGCAGCGG CCATTTCGAG GGCTCGGG ACGTGTACGA GGAGGCCATC CGGACAGTGA TGACCGTGCG GGACTTCACA GGTGTTTG ACAGCTACGC CCAGTTCGAG GAGAGCATGA TCGCTGCAAA GATGGAGACC CTCGGAGC TGGGGCGCGA GGAGGAGGAT GATGTGGACC TGGAGCTGCG CCTGGCCCGC CGAGCAGC TCATCAGCCG GCGGCCCCTG CTCCTCAACA GCGTCTTGCT GCGCCAAAAC ACACCACG TGCACGAGTG GCACAAGCGT GTCGCCCTGC ACCAGGGCCG CCCCCGGGAG CATCAACA CCTACACAGA GGCTGTGCAG ACGGTGGACC CCTTCAAGGC CACAGGCAAG 2CACACTC TGTGGGTGGC GTTTGCCAAGTTTTATGAGG ACAACGGACA GCTGGACGAT 2CGTGTCA TCCTGGAGAA GGCCACCAAG GTGAACTTCA AGCAGGTGGA TGACCTGGCA 2GTGTGGT GTCAGTGCGG AGAGCTGGAG CTCCGACACG AGAACTACGA TGAGGCCTTG 222GCTGC GAAAGGCCAC GGCGCTGCCT GCCCGCCGGG CCGAGTACTT TGATGGTTCA 228CGTGC AGAACCGCGT GTACAAGTCA CTGAAGGTCT GGTCCATGCT CGCCGACCTG 234GAGCC TCGGCACCTT CCAGTCCACC AAGGCCGTGT ACGACCGCAT CCTGGACCTG 24TCGCAA CACCCCAGAT CGTCATCAAC TATGCCATGT TCCTGGAGGA GCACAAGTAC 246GGAGA GCTTCAAGGC GTACGAGCGCGGCATCTCGC TGTTCAAGTG GCCCAACGTG 252CATCT GGAGCACCTA CCTGACCAAA TTCATTGCCC GCTATGGGGG CCGCAAGCTG 258GGCAC GGGACCTGTT TGAACAGGCT CTGGACGGCT GCCCCCCAAA ATATGCCAAG 264GTACC TGCTGTACGC ACAGCTGGAG GAGGAGTGGG GCCTGGCCCG GCATGCCATG 27TGTACG AGCGTGCCAC CAGGGCCGTG GAGCCCGCCC AGCAGTATGA CATGTTCAAC 276CATCA AGCGGGCGGC CGAGATCTAT GGGGTCACCC ACACCCGCGG CATCTACCAG 282CATTG AGGTGCTGTC GGACGAGCAC GCGCGTGAGA TGTGCCTGCG GTTTGCAGAC 288GTGCA AGCTCGGGGA GATTGACCGCGCCCGGGCCA TCTACAGCTT CTGCTCCCAG 294TGACC CCCGGACGAC CGGCGCGTTC TGGCAGACGT GGAAGGACTT TGAGGTCCGG 3GGCAATG AGGACACCAT CAAGGAAATG CTGCGTATCC GGCGCAGCGT GCAGGCCACG 3AACACGC AGGTCAACTT CATGGCCTCG CAGATGCTCA AGGTCTCGGG CAGTGCCACG 3ACCGTGT CTGACCTGGC CCCTGGGCAG AGTGGCATGG ACGACATGAA GCTGCTGGAA 3CGGGCAG AGCAGCTGGC GGCTGAGGCG GAGCGTGACC AGCCCTTGCG CGCCCAGAGC 324CCTGT TCGTGAGGAG TGACGCCTCC CGGGAGGAGC TGGCAGAGCT GGCACAGCAG 33ACCCCG AGGAGATCCA GCTGGGCGAGGACGAGGACG AGGACGAGAT GGACCTGGAG 336CGAGG TTCGGCTGGA GCAGCAGAGC GTGCCAGCCG CAGTGTTTGG GAGCCTGAAG 342CTGAC CCGTCCCCTC GTGCCGAATT CGGCACGAGC AAGACCAGCC CCCAGATCAT 348TCAAA GGTTTTCCCT CGAAGTCACA AATGTTTCAA GGAATCTCAA ATTTTACAAA 354AAGTG TGGGCATTGG TGGCCTGTGG CTGTGTCCTC TCTCTGTAGC TGTTTTCTCC 36ATCCCT GAAAGGAAGT TGAGCCTGCT CCTCCATCCG CAGACCTCCC TTTCCAGCGC 366GCATG GGGTGCTGTG AGGGCAGCAT GCTAGGTGTG ACCGTGCTCC TGGCCTCCAG 372TGTCC CTCTGTCCTC TAGCCCACTAAGGCCCTGGC CCATTTGTGC TAAACAGGCA 378ACCTA GAAAGAGCAG ACAATCTCTC TGGGTCACCA GTCTGGCTAG GAGCTGGTCT 384CTGGG ATCCAGGCCT TCTCCCCTGC CCATGTGAAT TCCCAGGGGC AGAGCCTGAA 39TGAACA CAGCACTGGC CAAAGAGATG TCACCGTGGG AACCGAGGCT CTCTTCTCCT 396CTGCT TTCGTGGGTT CAGAGTAGCT GAGGCTTGTC TGAGAGGAGT TGGAGTGCTG 4TTCACCC TGGTTGGTGT GCTTTGCTTT GAGGGCACTT AGAAAGCCCA GCCCAGCCCT 4TCCTGCC CTGCACACAG CGGAGCGACT TTTCTAGGTA TGCTCTTGAT TTCTGCAGAA 4GCAGGTG GCATGGAGCC AAGAGGAAGTGTGACTGAAA CTGTCCACTC ATAGCCCGGC 42GTATTG AGAGGGCT 42 base pairs nucleic acid double linear Genomic DNA 9 GAGCTCGCGC GCCTGCAGGT CGACACTAGT GGATCCAAAG AATTCGGCAC GAGGGAAACT 6GTGTA CGAGTGGAGG ACAGGGACAG AGCCCTCTGT GGTGGAACGA CCCCACCTCGAGCTTCC TGAGCAGGTG GCAGAAGATG CGATTGACTG GGGCGACTTT GGGGTAGAGG TGTCTGA GGGGACTGAC TCTGGCATCT CTGCCGAGGC TGCTGGAATC GACTGGGGCA 24CCGGA ATCAGATTCA AAGGATCCTG GAGGTGATGG GATAGACTGG GGAGACGATG 3TGCTTT GCAGATCACA GTGCTGGAAGCAGGAACCCA GGCTCCAGAA GGTGTTGCCA 36CCAGA TGCCCTGACA CTGCTTGAAT ACACTGAGAC CCGGAATCAG TTCCTTGATG 42ATGGA GCTTGAGATC TTCTTAGCCC AGAGAGCAGT GGAGTTGAGT GAGGAGGCAG 48CTGTC TGTGAGCCAG TTCCAGCTGG CTCCAGCCAT CCTGCAGGGC CAGACCAAAG 54ATGGT TACCATGGTG TCAGTGCTGG AGGATCTGAT TGGCAAGCTT ACCAGTCTTC 6GCAACA CCTGTTTATG ATCCTGGCCT CACCAAGGTA TGTGGACCGA GTGACTGAAT 66CAGCA AAAGCTGAAG CAGTCCCAGC TGCTGGCTTT GAAGAAAGAG CTGATGGTGC 72CAGCA GGAGGCACTT GAGGAGCAGG CGGCTCTGGAGCCTAAGCTG GACCTGCTAC 78AAGAC CAAGGAGCTG CAGAAGCTGA TTGAAGCTGA CATCTCCAAG AGGTACAGCG 84CCTGT GAACCTGATG GGAACCTCTC TGTGACACCC TCCGTGTTCT TGCCTGCCCA 9CTCCGC TTTTGGGATG AAGATGATAG CCAGGGCTGT TGTTTTGGGG CCCTTCAAGG 96GACCAGGCTGACTGG AAGATGGAAA GCCACAGGAA GGAAGCGGCA CCTGATGGTG CTTGGCAC TCTCCATGTT CTCTACAAGA AGCTGTGGTG ATTGGCCCTG TGGTCTATCA CGAAAACC ACAGATTCTC CTTCTAGTTA GTATAGCGCA AAAAGCTTCT CGAGAGTACT TAGAGCGG CCGCGGGCCC ATCGATTTTC CACCCGGGTG GGGTACC pairs nucleic acid double linear Genomic DNA CACTAA AGGGAACAAA AGCTGGAGCT CGCGCGCCTG CAGGTCGACA CTAGTGGATC 6TTCGT TACGCCAAGC TCGAAATTAA CTCTGGGCTG ACCCATAAAC ATTTGTCTGA AGGATAT AGTTGCGTTT CTTGCGGGCA GCAATCTGGATGAGGCGGTT GAGGCACTGG GCCTGCT GGATCAGGAC ATCCCAGCGG CCAGCATAGT TCCGCTGCCG GCGTAGGCCC 24CCGCA TCTTATCCAT GATGGCATTG GTACCCAGGA TGTTGTACTT CTTGGAAGGG 3AGGCTG CATGTTTGAT GGCCCATGTG GTCTTGCCAG CAGCAGGCAG GCCCACCATC 36AATCTCACATTCTGC CTTGCTCTTT GGTCCAACGG TGCCCCGGAT ACGCTCACTA 42AAGGT GCTGGATGAA GGTAAACCCC GGGAGGACAG AACAGTAGGG CTCTGCTCTC 48GAAGT TGAACTCCAC TGCGCAATTC TTCACCAGGA CATGAGGATA GAGGGCCTGA 54CAAGG CTTCCTTCTG GATTCGGAAA GCAATGCCCA TCCACTTTCCATTCTTGGTA 6ACAGTT CCACGTCATT TCCACATTCA AAATCCGCAA AGCAGCCAAT CACCGGAGAG 66CGGTG CTAGGAGAGC GGCTGGGCCC GCAGACTGGG GGGAAAGCTC CGCAGCCGCA 72CCCCA GGATCAGGCC CCGCGTGGCC TGGAGAAGCC CAGTCTGGGC TGGAGCGGGA 78ACAGT GTGGCCTTGCGTTCGCCCCC GGGAGCGCTG CGAGTGTCGC GGCCTCGGGT 84TGCTG AGCACCAATA CCTCACGGTT GCCAACCTGG GGTTTTAGCT CCCTTGGTTT 9CCCCTA GGGGCGGGTG GGGGCACGGG AGGAAGGATG GGCCAGCTGG GTGCAATCCT 96AAGCC AGCCATTCCT TGATTTCTTA GAATTAACTA AACGGTCGCG CCGGAGGCCGGGGGCCGG AGCGGAGCAG CCGCGGCTGA GGTTCCCGAG TCGGCCGCTC GGGGCTGCGC CGCCGCCG GGACCCCGGC CTCTGGCCGC GCCGGCTCCG GCCTCCGGGG GGGCCGGGGC CCGGGACA TGGTGCCAGT CGCACCCCTT CCCCGCCGCC GCTGAGCTCG CCGGCCGCGC GGGCTGGG ACGTCCGAGC GGGAAGATGTTTTCCGCCCT GAAGAAGCTG GTGGGGTCGG CAGGCCCC GGGCCGGGAC AAGAACATCC CCGCCGGGCT GCAGTCCATG AACCAGGCGT CAGAGGCG CTTCGCCAAG GGGGTGCAGT ACAACATGAA GATAGTGATC CGGGGAGACA AACACGGG CAAGACAGCG CTGTGGCACC GCCTGCAGGG CCGGCCGTTC GTGGAGGAGT ATCCCCAC ACAGGAGATC CAGGTCACCA GCATCCACTG GAGCTACAAG ACCACGGATG ATCGTGAA GGTTGAAGTC TGGGATGTAG TAGACAAAGG AAAATGCAAA AAGCGAGGCG GGCTTAAA GATGGAGAAC GACCCCCAGG AGNCGGAGTC TGAAATGGCC CTGGATGCTG TTCCTGGA CGTGTACAAG AACTGCAACGGGGTGGTCAT GATGTTCGAC ATTACCAAGC TGGACCTT CAATTACATT CTCCGGGAGC TTCCAAAAGT GCCCACCCAC GTGCCAGTGT GTGCTGGG GAACTACCGG GACATGGGCG AGCACCGAGT CATCCTGCCG GACGACGTGC GACTTCAT CGACAACCTG GACAGACCTC CAGGTTCCTC CTACTTCCGC TATGCTGAGT TCCATGAA GAACAGCTTC GGCCTAAAGT ACCTTCATAA GTTCTTCAAT ATCCCATTTT CAGCTTCA GAGGGAGACG CTGTTGCGGC AGCTGGAGAC GAACCAGCTG GACATGGACG ACGCTGGA GGAGCTGTCG GTGCAGCAGG AGACGGAGGA CCAGAACTAC GGCATCTTCC 2AGATGAT GGAGGCTCGC AGCCGTGGCCATGCGTCCCC ACTGGCGGCC AACGGGCAGA 2CATCCCC GGGCTCCCAG TCACCAGTCC TGCCTGCACC CGCTGTGTCC ACGGGGAGCT 2GCCCCGG CACACCCCAG CCCGCCCCAC AGCTGCCCCT CAATGCTGCC CCACCATCCT 222CCCCC TGTACCACCC TCAGAGGCCC TGCCCCCACC TGCGTGCCCC TCAGCCCCCG 228CGGCG CAGCATCATC TCTAGGCTGT TTGGGACGTC ACCTGCCACC GAGGCAGCCC 234CCTCC AGAGCCAGTC CCGGCCGCAC AGGGCCCAGC AACGGTCCAG AGTGTGGAGG 24TGTTCC TGACGACCGC CTGGACCGCA GCTTCCTGGA AGACACAACC CCCGCCAGGG 246AAGAA GGTGGGGGCC AAGGCTGCCCAGCAGGACAG TGACAGTGAT GGGGAGGCCC 252GGCAA CCCGATGGTG GCAGGGTTCC AGGACGATGT GGACCTCGAA GACCAGCCAC 258AGTCC CCCGCTGCCT GCAGGCCCCG TCCCCAGTCA AGACATCACT CTTTCGAGTG 264GAAGC AGAAGTGGCA GCTCCCACAA AAGGCCCTGC CCCAGCTCCC CAGCAGTGCT 27GCCAGA GACCAAGTGG TCCTCCATAC CAGCTTCGAA GCCACGGAGG GGGACAGCTC 276AGGAC CGCAGCACCC CCCTGGCCAG GCGGTGTCTC TGTTCGCACA GGTCCGGAGA 282AGCAG CACCAGGCCC CCTGCTGAGA TGGAGCCGGG GAAGGGTGAG CAGGCCTCCT 288GAGAG TGACCCCGAG GGACCCATTGCTGCACAAAT GCTGTCCTTC GTCATGGATG 294GACTT TGAGAGCGAG GGATCAGACA CACAGCGCAG GGCGGATGAC TTTCCCGTGC 3ATGACCC CTCCGATGTG ACTGACGAGG ATGAGGGCCC TGCCGAGCCG CCCCCACCCC 3AGCTCCC TCTCCCCGCC TTCAGACTGA AGAATGACTC GGACCTCTTC GGGCTGGGGC 3AGGAGGC CGGACCCAAG GAGAGCAGTG AGGAAGGTAA GGAGGGCAAA ACCCCCTCTA 3AGAAGAA AAAAAAAACA AAAAGCTTCT CGAGAGTACT TCTAGAGCGG CCGCGGGCCC 324TTTTC CACCCGGGTG GGGTACCAGG TAAGTGTACC CAATTCGCCC TATAGTGAGT 33TT 33ase pairs nucleic acidsingle linear GGGCCA GAGTGGGCTG 2se pairs nucleic acid single linear TCCTGG CCTGCGGATG 2se pairs nucleic acid single linear ACAGGA GAATTGGTTC 2se pairs nucleic acid single linear GGGTTC GGTGCGGGAC 2se pairs nucleic acid single linear CGGGTG TTTGTGAGTG 2se pairs nucleic acid single linear TTCCGT CTCCTCAGTG 2se pairs nucleic acid single linear TGCTAG TCTCACAGAC 2se pairs nucleic acid single linear GGGTGG CTGAAGGGAC 2se pairs nucleic acid single linear TCCCTC CCTGTCACAG 2se pairs nucleic acid single linear 2GGGTG TTTGTGAGTG 2se pairs nucleic acid single linear 2ATTCC AGAAATTCAG 2se pairs nucleicacid single linear 22 AAACTGCAGG TGGCTGAGTC 2se pairs nucleic acid single linear 23 GTCCTAATGT TTTCAGGGAG 2se pairs nucleic acid single linear 24 AAAACCTATG GTTACAATTC 2se pairs nucleic acid single linear 25 TCCTAGACAT GGTTCAAGTG 2se pairs nucleic acid single linear 26 GATATAATTA GTTCTCCATC 2se pairs nucleic acid single linear 27 ATGCCTGTTC CAGGCTGCAC 2se pairs nucleic acid single linear 28 GGACGGCGAC CTCCACCCAC 2se pairs nucleic acid single linear 29GGGCTCCTCC GACGCCTGAG 2se pairs nucleic acid single linear 3AGCCC TGGCCTTGAC 2se pairs nucleic acid single
linear 3TGGGG ACTCCGGCAG 2se pairs nucleic acid single linear 32 CAGCTTTCCC TGGGCACATG 2se pairs nucleic acid single linear 33 CACAGCTGTC TCAAGCCCAG 2se pairs nucleic acid single linear 34 ACTGTTCCCC CTACATGATG 2se pairs nucleic acid single linear 35 ATCATATCCT CTTGCTGGTC 2se pairs nucleic acid single linear 36 GTTCCCAGAG CTTGTCTGTG 2se pairs nucleic acid single linear 37 GTTTGGCAGA CTCATAGTTG 2se pairs nucleic acid single linear 38TAGCAGGGAG CCATGACCTG 2se pairs nucleic acid single linear 39 CTTGGCGCCA GAAGCGAGAG 2se pairs nucleic acid single linear 4CTCTC TCTCTCTCTC 2se pairs nucleic acid single linear 4GCTGA TTCCGCCAAG 2se pairs nucleicacid single linear 42 CTTTTTGAAT TCGGCACGAG 2se pairs nucleic acid single linear 43 CCCCTGGTCC GCACCAGTTC 2se pairs nucleic acid single linear 44 GAGAAGGGTC GGGGCGGCAG 2se pairs nucleic acid single linear 45 AAATCACATC GCGTCAACAC 2se pairs nucleic acid single linear 46 TAAGAGAGTC ATAGTTACTC 2se pairs nucleic acid single linear 47 GCTCTAGAAG TACTCTCGAG 2se pairs nucleic acid single linear 48 ACTCTGGCCA TCAGGAGATC 2se pairs nucleic acid single linear 49CAGGCGTTGT AGATGTTCTG 2se pairs nucleic acid single linear 5CAGGC AGAAGTAATG 2se pairs nucleic acid single linear 5GAGAA CTGGATGTAG 2se pairs nucleic acid single linear 52 CTATTCAGAT GCAACGCCAG 2se pairs nucleicacid single linear 53 CCATGGCACA CAGAGCAGAC 2se pairs nucleic acid single linear 54 GCTACCATGC AGAGACACAG 2se pairs nucleic acid single linear 55 CAGGCTGACA AGAAAATCAG 2se pairs nucleic acid single linear 56 GGCACGCATA GAGGAGAGAC 2se pairs nucleic acid single linear 57 TGGGTGATGC CTTTGCTGAC 2se pairs nucleic acid single linear 58 AAAACAAGAT CAAGGTGATG 2se pairs nucleic acid single linear 59 TTGCCCACAT TGCTATGGTG 2se pairs nucleic acid single linear 6AGATC AGAAGTAGAG 2se pairs nucleic acid single linear 6GGGCC AATGATGTTG 2se pairs nucleic acid single linear 62 TCTTCCCACC ATAGCAATG ase pairs nucleic acid single linear 63 TGGTCTTGGT GACCAATGTG 2se pairs nucleicacid single linear 64 ACACCTCGGT GACCCCTGTG 2se pairs nucleic acid single linear 65 TCTCCAAGTT CGGCACAGTG 2se pairs nucleic acid single linear 66 ACATGGGCTG CACTCACGAC 2se pairs nucleic acid single linear 67 GATCCTCTGA ACCTGCAGAG 2se pairs nucleic acid single linear 68 GGAAATGAGG TGGGGCGATC 2se pairs nucleic acid single linear 69 CTTTGCCTTG GACAAGGATG 2se pairs nucleic acid single linear 7TGCCA TTGGGGGTAG 2se pairs nucleic acid single linear 7AAGCC ATTGACGGTG 2se pairs nucleic acid single linear 72 TGCGTCTCTC GTCGCTGCTG 2se pairs nucleic acid single linear 73 GCGGAAACTC TGTGGTGCTG 2se pairs nucleic acid single linear 74 AGGATTGCCT TCCTCTACTG 2se pairs nucleicacid single linear 75 TGTCTGTTTC ACCAGGGCAG 2se pairs nucleic acid single linear 76 CCAGTGCCTC TATGCATGTC 2se pairs nucleic acid single linear 77 AGGAAGCCCA CGCACACCAC 2se pairs nucleic acid single linear 78 CCCTTTGTTC CCTGATCTTC 2se pairs nucleic acid single linear 79 CGCTCGGGAT CCAGGTCATC 2se pairs nucleic acid single linear 8GTTCA GAGCGTAGTG 2se pairs nucleic acid single linear 8GATCT CTGGCACCTC 2se pairs nucleic acid single linear 82CCATCAGAGT GAAGGAGGAG 2se pairs nucleic acid single linear 83 CCATCTTCCA CTGGTCAGAG 2se pairs nucleic acid single linear 84 CTCCTTCTCT TGGATCTCTG 2se pairs nucleic acid single linear 85 TTACTTCAGC ACTGTTAGTC 2se pairs nucleicacid single linear 86 AGGGAGGTAG CTCAAAGCTC 2se pairs nucleic acid single linear 87 TGGGTCCACA GTTCGCACAG 2se pairs nucleic acid single linear 88 CAACTCTGTG ATGGCTCCAG 2se pairs nucleic acid single linear 89 AGCAGGGTTC TGTTCAAGAC 2se pairs nucleic acid single linear 9GGGTG CTAGTCTCTC 2se pairs nucleic acid single linear 9ATGCT GTCCCAGCAG 2se pairs nucleic acid single linear 92 CTGGACCTGA GGTAGCGCTG 2se pairs nucleic acid single linear 93ATAACCACCC TGAGGCACTG 2se pairs nucleic acid single linear 94 CCTGCAGGTC GACACTAGTG 2se pairs nucleic acid single linear 95 AATTGGAATG AGGAGGACTG 2se pairs nucleic acid single linear 96 GCTCTAGAAG TACTCTCGAG 2se pairs nucleicacid single linear 97 ATTGTATGAC AATGCACCAG 2se pairs nucleic acid single linear 98 TCCACAGAGG GCTTCATCAC 2se pairs nucleic acid single linear 99 CCTGACTGGC CTAAGCACAG 2se pairs nucleic acid single linear CCTCATA ACCACCAGTG 2se pairs nucleic acid single linear CAACGGT GACAAGTGTG 2se pairs nucleic acid single linear TACACCA GCTGCAGGTC 2se pairs nucleic acid single linear TGTGGTG CAGATGAGTC 2se pairs nucleic acid single linear ACACTCT TATAGCTCAG 2se pairs nucleic acid single linear GGAAGCT TTCCTCAGAC 2se pairs nucleic acid single linear TGAACAT GGGCCTGGAG 2se pairs nucleic acid single linear TGTGGAT GTACTACCAC 2se pairsnucleic acid single linear GTTTTGC AACCTGAGTG 2se pairs nucleic acid single linear GTGGCAC CACTTACGAG 2se pairs nucleic acid single linear TCTGCAA CGTGATGGCG 2se pairs nucleic acid single linear AAGATGCCTCGTCTGTG 2se pairs nucleic acid single linear CCGGACA AGGTACAGTC 2se pairs nucleic acid single linear CGAGTGG CACAAGCGTG 2se pairs nucleic acid single linear AGCGTGT GGTGTCAGTG 2se pairs nucleic acidsingle linear TTGAACA GGCTCTGGAC 2se pairs nucleic acid single linear CATGGCA ATGAGGACAC 2se pairs nucleic acid single linear ACGAGAT GGACCTCCAG 2se pairs nucleic acid single linear TCTGTCC TCTAGCCCAC 2se pairs nucleic acid single linear TGAGGGG ACTGACTCTG 2se pairs nucleic acid single linear GTGAGGA GGCAGATGTC 2se pairs nucleic acid single linear CTTTGAA GAAAGAGCTG 2se pairs nucleic acid single linear AAAGACC AGGCTGACTG 2se pairs nucleic acid single linear AGCTCCT TGGTCTTCTC 2se pairs nucleic acid single linear TCACAGT CCCAAGGCTC 2se pairs nucleic acid single linear TGGATGA GGCGGTTGAG 2se pairsnucleic acid single linear CACTCTC CGACGAGGAG 2se pairs nucleic acid single linear TCCAAAG TTCGTCTCTG 2se pairs nucleic acid single linear TGTGTGT CTGATCCCTC 2se pairs nucleic acid single linear AAGGTAAACCCCGGGAG 2se pairs nucleic acid single linear TCTCTGG CTCTGAGCAC 2se pairs nucleic acid single linear TGGAGAA GCCCAGTCTG 2se pairs nucleic acid single linear ACTCTGG ACCGTTGCTG 2se pairs nucleic acidsingle linear GCTCCGC AGCCGCAGTG 2se pairs nucleic acid single linear TCCAGGA AGCTGCGGTC 2se pairs nucleic acid single linear GGTGGGG CAGCATTGAG 2se pairs nucleic acid single linear ACCAGTG GTGCCTGCAG 2se pairs nucleic acid single linear TCACGGT TGCCAACCTG 2se pairs nucleic acid single linear AACAGCG TCTCCCTCTG 2se pairs nucleic acid single linear ACCTTCA TAAGTTCTTC 2se pairs nucleic acid single linear CAGACTT CAACCTTCAC 2se pairs nucleic acid single linear CATCTTC CCGGTCGGAC 2se pairs nucleic acid single linear GAGCACC TTTACCTCAC 2se pairs nucleic acid single linear GTCCGTC CGGGAAGATG 2se pairsnucleic acid single linear CAGGAGA TGCAGGTCAC 2se pairs nucleic acid single linear TCTTCCA TGAAGAACAG 2se pairs nucleic acid single linear GTGAGGA AGGTAAGGAG 2base pairs nucleic acid double linear Genomic DNACoding Sequence 378...7 GGATCCAAAG GACGCCCCCG CCGACAGGAG AATTGGTTCC CGGGCCCGCG GCGATGCCCC 6TAGCT CGGGCCCGTG GTCGGGTGTT TGTGAGTGTT TCTATGTGGG AGAAGGAGGA GGAGGAA GAAGAAGCAA CGATTTGTCT TCTCGGCTGG TCTCCCCCCG GCTCTACATG CCCGCACTGAGGAGACG GAAGAGGAGC CGTAGCCGCC CCCCCTCCCG GCCCGGATTA 24TCTCG CCACAGCGGC CTCGGCCTCC CCTTGGATTC AGACGCCGAT TCGCCCAGTG 3GGAAAT GGGAAGTAAT GACAGCTGGC ACCTGAACTA AGTACTTTTA TAGGCAACAC 36CAGAA ATTCAGG ATG AAT GGG GAT ATG CCC CAT GTC CCCATT ACT 4Asn Gly Asp Met Pro His Val Pro Ile Thr ACT CTT GCG GGG ATT GCT AGT CTC ACA GAC CTC CTG AAC CAG CTG CCT 458 Thr Leu Ala Gly Ile Ala Ser Leu Thr Asp Leu Leu Asn Gln Leu Pro 5 CTT CCA TCT CCT TTA CCT GCT ACA ACT ACA AAG AGCCTT CTC TTT AAT 5Pro Ser Pro Leu Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn 3 GCA CGA ATA GCA GAA GAG GTG AAC TGC CTT TTG GCT TGT AGG GAT GAC 554 Ala Arg Ile Ala Glu Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp 45 5T TTG GTT TCA CAGCTT GTC CAT AGC CTC AAC CAG GTA TCA ACA GAT 6Leu Val Ser Gln Leu Val His Ser Leu Asn Gln Val Ser Thr Asp 6 75 CAC ATA GAG TTG AAA GAT AAC CTT GGC AGT GAT GAC CCA GAA GGT GAC 65le Glu Leu Lys Asp Asn Leu Gly Ser Asp Asp Pro Glu GlyAsp 8 ATA CCA GTC TTG TTG CAG GCC GTC CTG GCA AGG AGT CCT AAT GTT TTC 698 Ile Pro Val Leu Leu Gln Ala Val Leu Ala Arg Ser Pro Asn Val Phe 95 AGG GAG AAA AGC ATG CAG AAC AGA TAT GTA CAA AGT GGA ATG ATG ATG 746 Arg Glu Lys Ser Met Gln AsnArg Tyr Val Gln Ser Gly Met Met Met CAG TAT AAA CTT TCT CAG AAT TCC ATG CAC AGT AGT CCT GCA TCT 794 Ser Gln Tyr Lys Leu Ser Gln Asn Ser Met His Ser Ser Pro Ala Ser AAT TAT CAA CAA ACC ACT ATC TCA CAT AGC CCC TCC AGC CGGTTT 842 Ser Asn Tyr Gln Gln Thr Thr Ile Ser His Ser Pro Ser Ser Arg Phe GTG CCA CCA CAG ACA AGC TCT GGG AAC AGA TTT ATG CCA CAG CAA AAT 89ro Pro Gln Thr Ser Ser Gly Asn Arg Phe Met Pro Gln Gln Asn CCA GTG CCT AGTCCA TAC GCC CCA CAA AGC CCT GCA GGA TAC ATG 938 Ser Pro Val Pro Ser Pro Tyr Ala Pro Gln Ser Pro Ala Gly Tyr Met TAT TCC CAT CCT TCA AGT TAC ACA ACA CAT CCA CAG ATG CAA CAA 986 Pro Tyr Ser His Pro Ser Ser Tyr Thr Thr His Pro Gln Met GlnGln 2TCG GTA TCA AGT CCC ATT GTT GCA GGT GGT TTG AGA AAC ATA CAT a Ser Val Ser Ser Pro Ile Val Ala Gly Gly Leu Arg Asn Ile His 22AAT AAA GTT TCT GGT CCG TTG TCT GGC AAT TCA GCT AAT CAT CAT p Asn Lys Val Ser GlyPro Leu Ser Gly Asn Ser Ala Asn His His 223CT GAT AAT CCT AGA CAT GGT TCA AGT GAG GAC TAC CTA CAC ATG GTG a Asp Asn Pro Arg His Gly Ser Ser Glu Asp Tyr Leu His Met Val 245GG CTA AGT AGT GAC GAT GGA GAT TCT TCA ACA ATGAGG AAT GCT s Arg Leu Ser Ser Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala 255 26CA TCT TTT CCC TTG AGA TCT CCA CAG CCA GTA TGC TCC CCT GCT GGA a Ser Phe Pro Leu Arg Ser Pro Gln Pro Val Cys Ser Pro Ala Gly 278AA GGA ACTCCT AAA GGC TCA AGA CCA CCT TTA ATC CTA CAA TCT r Glu Gly Thr Pro Lys Gly Ser Arg Pro Pro Leu Ile Leu Gln Ser 285 29AG TCT CTA CCT TGT TCA TCA CCT CGA GAT GTT CCA CCA GAT ATC TTG n
Ser Leu Pro Cys Ser Ser Pro Arg Asp Val Pro Pro Asp Ile Leu 33CTA GAT TCT CCA GAA AGA AAA CAA AAG AAG CAG AAG AAA ATG AAA TTA u Asp Ser Pro Glu Arg Lys Gln Lys Lys Gln Lys Lys Met Lys Leu 323AG GAT GAA AAA GAGCAG AGT GAG AAA GCG GCA ATG TAT GAT ATA y Lys Asp Glu Lys Glu Gln Ser Glu Lys Ala Ala Met Tyr Asp Ile 335 34TT AGT TCT CCA TCC AAG GAC TCT ACT AAA CTT ACA TTA AGA CTT TCT e Ser Ser Pro Ser Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser356TA AGG TCT TCA GAC ATG GAC CAG CAA GAG GAT ATG ATT TCT GGT g Val Arg Ser Ser Asp Met Asp Gln Gln Glu Asp Met Ile Ser Gly 365 37TG GAA AAT AGC AAT GTT TCA GAA AAT GAT ATT CCT TTT AAT GTG CAG l Glu Asn Ser Asn Val SerGlu Asn Asp Ile Pro Phe Asn Val Gln 389AC CCA GGA CAG ACT TCA AAA ACA CCC ATT ACT CCA CAA GAT ATA AAC r Pro Gly Gln Thr Ser Lys Thr Pro Ile Thr Pro Gln Asp Ile Asn 44CCA CTA AAT GCT GCT CAA TGT TTG TCG CAG CAA GAA CAAACA GCA g Pro Leu Asn Ala Ala Gln Cys Leu Ser Gln Gln Glu Gln Thr Ala 4425 TTC CTT CCA GCA AAT CAA GTG CCT GTT TTA CAA CAG AAC ACT TCA GTT e Leu Pro Ala Asn Gln Val Pro Val Leu Gln Gln Asn Thr Ser Val 434CA AAA CAA CCCCAG ACC AAT AGT CAC AAA ACC TTG GTG CAG CCT a Ala Lys Gln Pro Gln Thr Asn Ser His Lys Thr Leu Val Gln Pro 445 45GA ACA GGC ATA GAG GTC TCA GCA GAG CTG CCC AAG GAC AAG ACC TAAGA y Thr Gly Ile Glu Val Ser Ala Glu Leu Pro Lys Asp LysThr 467CAGGG AACTATGTAG TCACCCCGAG AGGCCCAGCT CTCTCCGTGA GCTCTGGGCC GGGTGGGG GTGGTTGTTG GTTCTGCGCG CACTGTTCCC CCTACATGAT GGGTCCATCC GTTGGCTT CTCTCACTCG CTTCCTCCTG TGGAGAAGCC TGTCCAGGTG TCACTGCCTC GGAAGCTG TCTCTGATTTCTCCAGTTGA ACAGTGAGAT TTGCCACACC TCACATGCAT 2TCTTGTC CCTGGAATTG TAACCATAGG TTTTCCTGTC TCCTGGAGGA CAAGGATGAG 2TTTCCAC TTGAGTCTCC CTGGTGGAGC CCAGCTCCTG ACATACCTGG TAAAAGTTCT 2GAGAAGA ACATGGAGGA GGAATGTGGA TAACAACCCT GGCTGCCTGTGTGTTCCAAG 2224 CTAGGAAGAT GTAATGTCCC CACAAACGGG GTAAATGGCT TGCCTGCGTC ACAGCTGTCT 2284 CAAGCCCAGG CCCTGGGCGC CAGCCCAAGC CCAAGGACTA GGTCCAGAGC CACACAGCGC 2344 CAGGCCACAT CCGCCTCACC TGGGACCCTT TGTGGGGTAC AGTCTCCGGC CCCACCCAGA 24CTGAAG GAGAGACCCCATGGCAAGGA CTCAGCCACC TGCAGTTTCA TAAGCCCCCA 2464 GTGGGTTCCT AGGCATGAAG ACCACCGGTT AGAGGCTGAA CTGGCAGGAA CCTGTCTCCA 2524 GCCCCTTCTC ACCCCAGCCG GGCCCTGCCT CAGAGGCAGC ACCCAGGACG TGGCCATGAC 2584 CCGTGGACTC CACTCAATCC CTCTTCTCCA GGAGCCATGC AAAGTGTCAGCCAGCCAGGC 2644 CCCTGGAAGG CAGTCATCAC CTCTTAAGGC ATTGTGGGTG TCGGTCCTGC AACTGCCAGG 27GCACAC GACCCGTGTC CGGTGTTCGA TAGCAGGGAG CCATGACCTG GCAACGATTC 2764 CACGCTCAAA GGGGCACCCG GGGGGCCCTG GGTCGGGGCG GATCAGCTTT CCCTGGGCAC 2824 ATCTGCCTCA TTCCAGATCTCCAGGGCTCA TGTCTGTGAC AGGGAGGGAA GGCTCTGCCC 2884 TGGCCTTCCG TCAGCTCTGC CAGTGCAGGC TGGGCAGCCT GGGCTTTAGA GCTGGCTTCT 2944 GCCCACACTT TCTCCGTGAA AGGAAAACAA CTATGAGTCT GCCAAACGCA TCTCAGATGC 3TTAAAAA ATTCTGGTCC CCGCTCTCTG TCCCATCATC CGCCTCGGGGACTTCCTCTC 3GTGGTTC TCACCCCATA CTCTGTCACT GCCACATTTT CACCTGGGCC TGGCCTTTGT 3CACCTGA AACTCCTGAA AATCTTGAAA TGGATTTCTA GGTCACTGGG GACTCCGGCA 3CATTCGG CTTCAGAATA AAGGGCGCCC GCGGTCCCCC AGCACCTCCC CAAGCCACAC 3244 CCCTAGCTTC CCTCCCTATCCCTGCAGCCT GAGGGTCCCT TCAGCCACCC TTAAGTCCCC 33GGGCTC CTGCCCCGCC CCTGGCTAGC AGCGCCTTCT CCACCGGGGC CCCCTCTGCT 3364 CACAGAGCCC CCTCACCTCC CTGGGGATGA GGGGCCAGGC CATGACCCTG AAAGTCTAGC 3424 CCTGGCCTTG ACCTCCCAGG AGCGCCCTCC CCGCCCTCTC CCGGCCCCGGCCCCGTCCTC 3484 TGCTGCTGGC CTCTGGGTCG TGCCCCGCAG ACTGAGCTGC GCTTGGGGGT CCTGGCGGCC 3544 TGGGCCGTCC CGCACCGAAC CCAGGCGGTC GGAGCCCGGC GGGAAGGCGC GAGGTCCTTC 36GGCTCC TCCGACGCCT GAGGGCGCTG CTTCCCCGCG GCCGCCCCGG GTTTCTGCGG 3664 AGCCGGGGCC TCCGCTCTCGGGTGACCCGG TGAGACCCCC GGGGAGGCCG CTGGGGAGGC 3724 GCGGGCTCTG CTCCCGGGTC CCAAACGCAC TGGCTGCCCC TCAGGAGGGA CGGCGACCTC 3784 CACCCACGGC GCTGGCGCCC GCACGGCCGC TCCTCCCGCT CCCGCAGCCT GGACGCCTCC 3844 CGAGGCCGCC CCGCCGGGCC CCACGCGCGG CCCCATCCGC AGGCCAGGACTGCCTTCCCG 39TGGCGG CCCCCAGCCT GGAGGAGCCG GCCCCAGACG CCCTCCCAGC CCTCCCCAGC 3964 CCACTCTGGC CCCGCAGCCC CCGCCTGGTC CGAGTGCGGG TCTCTGGCCC CGGCCTTTCC 4GGAAGGA AAGCAAAAAG CTT 4 amino acids amino acid single linear protein internal Asn Gly Asp Met Pro His Val Pro Ile Thr Thr Leu Ala Gly Ile Ser Leu Thr Asp Leu Leu Asn Gln Leu Pro Leu Pro Ser Pro Leu 2 Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn Ala Arg Ile Ala Glu 35 4u Val Asn Cys Leu Leu Ala Cys ArgAsp Asp Asn Leu Val Ser Gln 5 Leu Val His Ser Leu Asn Gln Val Ser Thr Asp His Ile Glu Leu Lys 65 7 Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp Ile Pro Val Leu Leu 85 9n Ala Val Leu Ala Arg Ser Pro Asn Val Phe Arg Glu Lys Ser Met Asn Arg Tyr Val Gln Ser Gly Met Met Met Ser Gln Tyr Lys Leu Gln Asn Ser Met His Ser Ser Pro Ala Ser Ser Asn Tyr Gln Gln Thr Ile Ser His Ser Pro Ser Ser Arg Phe Val Pro Pro Gln Thr Ser Ser GlyAsn Arg Phe Met Pro Gln Gln Asn Ser Pro Val Pro Ser Tyr Ala Pro Gln Ser Pro Ala Gly Tyr Met Pro Tyr Ser His Pro Ser Tyr Thr Thr His Pro Gln Met Gln Gln Ala Ser Val Ser Ser 2Ile Val Ala Gly Gly Leu Arg AsnIle His Asp Asn Lys Val Ser 222ro Leu Ser Gly Asn Ser Ala Asn His His Ala Asp Asn Pro Arg 225 234ly Ser Ser Glu Asp Tyr Leu His Met Val His Arg Leu Ser Ser 245 25sp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala Ala Ser PhePro Leu 267er Pro Gln Pro Val Cys Ser Pro Ala Gly Ser Glu Gly Thr Pro 275 28ys Gly Ser Arg Pro Pro Leu Ile Leu Gln Ser Gln Ser Leu Pro Cys 29Ser Pro Arg Asp Val Pro Pro Asp Ile Leu Leu Asp Ser Pro Glu 33Arg Lys Gln Lys Lys Gln Lys Lys Met Lys Leu Gly Lys Asp Glu Lys 325 33lu Gln Ser Glu Lys Ala Ala Met Tyr Asp Ile Ile Ser Ser Pro Ser 345sp Ser Thr Lys Leu Thr Leu Arg Leu Ser Arg Val Arg Ser Ser 355 36sp Met Asp Gln Gln GluAsp Met Ile Ser Gly Val Glu Asn Ser Asn 378er Glu Asn Asp Ile Pro Phe Asn Val Gln Tyr Pro Gly Gln Thr 385 39Lys Thr Pro Ile Thr Pro Gln Asp Ile Asn Arg Pro Leu Asn Ala 44Gln Cys Leu Ser Gln Gln Glu Gln Thr AlaPhe Leu Pro Ala Asn 423al Pro Val Leu Gln Gln Asn Thr Ser Val Ala Ala Lys Gln Pro 435 44ln Thr Asn Ser His Lys Thr Leu Val Gln Pro Gly Thr Gly Ile Glu 456er Ala Glu Leu Pro Lys Asp Lys Thr 465 47base pairsnucleic acid double linear Genomic DNA Coding Sequence 26...799 CTTTTTG AATTCGGCAC GAGAT GCT ACA CAG GCT ATA TTT GAA ATA CTG 52 Ala Thr Gln Ala Ile Phe Glu Ile Leu AAA TCC TGG TTG CCC CAG AAT TGT ACA CTG GTT GAT ATG AAG ATT Lys SerTrp Leu Pro Gln Asn Cys Thr Leu Val Asp Met Lys Ile A TTT GGT GTT GAT GTA ACC ACC AAA GAA ATT GTT CTT GCT GAT GTT Phe Gly Val Asp Val Thr Thr Lys Glu Ile Val Leu Ala Asp Val 3 ATT GAC AAT GAT TCC TGG AGA CTC TGG CCA TCA GGAGAT CGA AGC CAA Asp Asn Asp Ser Trp Arg Leu Trp Pro Ser Gly Asp Arg Ser Gln 45 5G AAA GAC AAA CAG TCT TAT CGG GAC CTC AAA GAA GTA ACT CCT GAA 244 Gln Lys Asp Lys Gln Ser Tyr Arg Asp Leu Lys Glu Val Thr Pro Glu 6 GGG CTC CAA ATG GTAAAG AAA AAC TTT GAG TGG GTT GCA GAG AGA GTA 292 Gly Leu Gln Met Val Lys Lys Asn Phe Glu Trp Val Ala Glu Arg Val 75 8G TTG CTT TTG AAA TCA GAA AGT CAG TGC AGG GTT GTA GTG TTG ATG 34eu Leu Leu Lys Ser Glu Ser Gln Cys Arg Val Val Val Leu Met9GC TCT ACT TCT GAT CTT GGT CAC TGT GAA AAA ATC AAG AAG GCC TGT 388 Gly Ser Thr Ser Asp Leu Gly His Cys Glu Lys Ile Lys Lys Ala Cys AAT TTT GGC ATT CCA TGT GAA CTT CGA GTA ACA TCT GCG CAT AAA 436 Gly Asn Phe Gly Ile Pro CysGlu Leu Arg Val Thr Ser Ala His Lys CCA GAT GAA ACT CTG AGG ATT AAA GCT GAG TAT GAA GGG GAT GGC 484 Gly Pro Asp Glu Thr Leu Arg Ile Lys Ala Glu Tyr Glu Gly Asp Gly CCT ACT GTA TTT GTG GCA GTG GCA GGC AGA AGT AAT GGT TTGGGA 532 Ile Pro Thr Val Phe Val Ala Val Ala Gly Arg Ser Asn Gly Leu Gly GTG ATG TCT GGG AAC ACT GCA TAT CCA GTT ATC AGC TGT CCT CCC 58al Met Ser Gly Asn Thr Ala Tyr Pro Val Ile Ser Cys Pro Pro CTC ACA CCA GAC TGGGGA GTT CAG GAT GTG TGG TCT TCT CTT CGA CTA 628 Leu Thr Pro Asp Trp Gly Val Gln Asp Val Trp Ser Ser Leu Arg Leu 2AGT GGT CTT GGC TGT TCA ACC GTA CTT TCT CCA GAA GGA TCA GCT 676 Pro Ser Gly Leu Gly Cys Ser Thr Val Leu Ser Pro Glu Gly SerAla 22TTT GCT GCT CAG ATA TTT GGG TTA AGC AAC CAT TTG GTA TGG AGC 724 Gln Phe Ala Ala Gln Ile Phe Gly Leu Ser Asn His Leu Val Trp Ser 223TG CGA GCA AGC ATT TTG AAC ACA TGG ATT TCC TTG AAG CAG GCT 772 Lys Leu Arg Ala Ser IleLeu Asn Thr Trp Ile Ser Leu Lys Gln Ala 235 24AC AAG AAA ATC AGA GAA TGT AAT TTA TAAGAAAGAA TGCCATTGAA TTTTTTA 826 Asp Lys Lys Ile Arg Glu Cys Asn Leu 25GGGAAAAAC TACAAATTTC TAATTTAGCT GAAGGAAAAT CAAGCAAGAT GAAAAGGTAA 886 TTTTAAATTAGAGAACACAA ATAAAATGTA TTAGTGAATA AATGGTGAGG GTAGGCCTAT 946 TCAGATGCAA GGCCAGCAAT GGGGCTCCCC ATTATCCCCA CCCCTTTGGT CCCAGTCCCC CTCTGCAA TGGGCACGCA TAGAGGAGAG ACAAAGGGTA TTAGACGCAA CATCATTGGC AGGGGAGT CCGAGAAGAG CTGCCATTGG CTGACAGGGCATTTTCAGGC TCTGTCATTG CAGGGAGC ACACCCCAGC CTGAAGAGTG ATGCCATTGG CCAGGGAGTG GTTTTGTCAT CCGTTGGC TGTGAAGTGG AAGGAAAAGA TCTGGGAATG AAGCCCTGTG GCCAGGAAGA GACAGGGC AGCAACTTCT GGGCCTCCAG GCCCTCTTCC CACCATAGCA ATGTGGGCAA CTGGTGTCAGGCCCCAGC CAGAAAAAGG AGCCCAAGCC AGAGGGCAAG TGACAAAGGA TACCATGT CCAATCTCCC ACACCCTGGG GCTGCCCTTC CCAATGTCTT TCTTGATAGC AGTTGGGC TGGGAGCAGC TCACTGCTCC TCTAGCCAGG AGGGTTTCTC AGCTCCTGGA CCGCAGCT TGATGTTGAA CTGCTGCAGG GTCTGCTCCAGCTGTTTCTG GTTCCCAGCA GTAGGCGG ACACAGCATT GTGGAAGAGC AGCAGCTGCT TGTGCATCAC CTTGATCTTG TTCTTCCA GGAACTTGAG CTTGATGGCC ACATCTCCCC GCAGCTTCTC ATACTTGTCC ATGGGCCT GGAAAGTGGC CTGGGCACTC TCAAGTCGAC CACGTGTCCC TGCATCCCGG GCCTAGACTCAGCTCCTC TAAGTCTGTT CGGTAGGCAT CATATTCCAG CCTGGCAGCC ATACTGTT TCACAGTCAT GAGCGTGTCT TCCATGGTCT TGGTGACCAA TGTGTTGATG AGAGACAA AGAAGTTCAC GGCTCCTAGC AGCGTTTCCC CATTCTTGCA TAGTAGTTTC TGTCTCTG CATTGTAGCC AAATTCCTCC TGAAGCTCTGGGGACTTCTG GCTGAGGTCA AAAGGCAT CACCCAGTGC ATGCTGGGTC TGCAGCAGGC TGTAGAGGTG GGCTGTCAGT 2CGGCCCA GCTGCAGGAC ACTCTCATAC TTGCGCTTCG TCTCACGCAG CAACTCAATC 2AGCTCTA GCTCCAGGAT TCCGGCGCCT CCACTCCGTC CCCCGCGGGT CTGCTCTGTG 2CATGGACGGCATTGTCC CAGATATAGC CGTTGGTACA AAGCGGGGAT CTGACGAGCT 22TCTACT TGTGTCACTA ACGGACCGTT TATCATGAGC AGCAACTCGG CTTCTGCAGC 2266 AAACGGAAAT GACAGCAAGA AGTTCAAAGG TGACAGCCGA AGTGCAGGCG TCCCCTCTAG 2326 AGTGATCCAC ATCCGGAAGC TCCCCATCGA CGTCACGGAGGGGGAAGTCA TCTCCCTGGG 2386 GCTGCCCTTT GGGAAGGTCA CCAACCTCCT GATGCTGAAG GGGAAAAACC AGGCCTTCAT 2446 CGAGATGAAC ACGGAGGAGG CTGCCAATAC CATGGTGAAC TACTACACCT CGGTGACCCC 25CTGCGC GGCCAGCCCA TCTACATCCA GTTCTCCAAC CACAAGGAGC TGAAGACCGA 2566 CAGCTCTCCCAACCAGGCGC GGGCCCAGGC GGCCCTGCAG GCGGTGAACT CGGTCCAGTC 2626 GGGGAACCTG GCCTTGGCTG CCTCGGCGGC GGCCGTGGAT GCAGGGATGG CGATGGCCGG 2686 GCAGAGCCCC GTGCTCAGGA TCATCGTGGA GAACCTCTTC TACCCTGTGA CCCTGGATGT 2746 GCTGCACCAG ATTTTCTCCA AGTTCGGCAC AGTGTTGAAGATCATCACCT TCACCAAGAA 28CAGTTC CAGGCCCTGC TGCAGTATGC GGACCCCGTG AGCGCCCAGC ACGCCAAGCT 2866 GTCGCTGGAC GGGCAGAACA TCTACAACGC CTGCTGCACG CTGCGCATCG ACTTTTCCAA 2926 GCTCACCAGC CTCAACGTCA AGTACAACAA TGACAAGAGC CGTGACTACC TCGTGCCGAA 2986 TTCTTTGGAT CC2998 258 amino acids amino acid single linear protein internal Thr Gln Ala Ile Phe Glu Ile Leu Glu Lys Ser Trp Leu Pro Gln Cys Thr Leu Val Asp Met Lys Ile Glu Phe Gly Val Asp Val Thr 2 Thr Lys Glu Ile Val Leu Ala Asp Val IleAsp Asn Asp Ser Trp Arg 35 4u Trp Pro Ser Gly Asp Arg Ser Gln Gln Lys Asp Lys Gln Ser Tyr 5 Arg Asp Leu Lys Glu Val Thr Pro Glu Gly Leu Gln Met Val Lys Lys 65 7 Asn Phe Glu Trp Val Ala Glu Arg Val Glu Leu Leu Leu Lys Ser Glu 85 9r Gln Cys Arg Val Val Val Leu Met Gly Ser Thr Ser Asp Leu Gly Cys Glu Lys Ile Lys Lys Ala Cys Gly Asn Phe Gly Ile Pro Cys Leu Arg Val Thr Ser Ala His Lys Gly Pro Asp Glu Thr Leu Arg Lys Ala Glu Tyr GluGly Asp Gly Ile Pro Thr Val Phe Val Ala Val Ala Gly Arg Ser Asn Gly Leu Gly Pro Val Met Ser Gly Asn Thr Tyr Pro Val Ile Ser Cys Pro Pro Leu Thr Pro Asp Trp Gly Val Asp Val Trp Ser Ser Leu Arg Leu Pro SerGly Leu Gly Cys Ser 2Val Leu Ser Pro Glu Gly Ser Ala Gln Phe Ala Ala Gln Ile Phe 222eu Ser Asn His Leu Val Trp Ser Lys Leu Arg Ala Ser Ile Leu 225 234hr Trp Ile Ser Leu Lys Gln Ala Asp Lys Lys Ile Arg Glu Cys245 25sn Leu ino acids amino acid single linear Gln Arg Phe Gly Thr Ser Gly His Ile Met Asn Leu Gln Ala Gln Lys Ala Gln Asn Lys Arg Lys Arg Cys Leu Phe Gly Gly Gln Glu 2 Pro Ala Pro Lys Glu Gln Pro Pro Pro LeuGln Pro Pro Gln Gln Ser 35 4e Arg Val Lys Glu Glu Gln Tyr Leu Gly His Glu Gly Pro Gly Gly 5 Ala Val Ser Thr Ser Gln Pro Val Glu Leu Pro Pro Pro Ser Ser Leu 65 7 Ala Leu Leu Asn Ser Val Val Tyr Gly Pro Glu Arg Thr Ser Ala Ala 85 9t Leu Ser Gln Gln Val Ala Ser Val Lys Trp Pro Asn Ser Val Met Pro Gly Arg Gly Pro Glu Arg Gly Gly Gly Gly Gly Val Ser Asp Ser Trp Gln Gln Gln Pro Gly Gln Pro Pro Pro His Ser Thr Trp Cys His Ser Leu SerLeu Tyr Ser Ala Thr Lys Gly Ser Pro His Pro Gly Val Gly Val Pro Thr Tyr Tyr Asn His Pro Glu Ala Leu Lys Glu Lys Ala Gly Gly Pro Gln Leu Asp Arg Tyr Val Arg Pro Met Pro Gln Lys Val Gln Leu Glu Val Gly ArgPro Gln Ala Pro Leu 2Ser Phe His Ala Ala Lys Lys Pro Pro Asn Gln Ser Leu Pro Leu 222ro Phe Gln Leu Ala Phe Gly His Gln Val Asn Arg Gln Val Phe 225 234ln Gly Pro Pro Pro Pro Asn Pro Val Ala Ala Phe Pro Pro Gln245
25ys Gln Gln Gln Gln Gln Gln Pro Gln Gln Gln Gln Gln Gln Gln Gln 267la Leu Pro Gln Met Pro Leu Phe Glu Asn Phe Tyr Ser Met Pro 275 28ln Gln Pro Ser Gln Gln Pro Gln Asp Phe Gly Leu Gln Pro Ala Gly 29LeuGly Gln Ser His Leu Ala His His Ser Met Ala Pro Tyr Pro 33Phe Pro Pro Asn Pro Asp Met Asn Pro Glu Leu Arg Lys Ala Leu Leu 325 33ln Asp Ser Ala Pro Gln Pro Ala Leu Pro Gln Val Gln Ile Pro Phe 345rg Arg Ser Arg Arg LeuSer Lys Glu Gly Ile Leu Pro Pro Ser 355 36la Leu Asp Gly Ala Gly Thr Gln Pro Gly Gln Glu Ala Thr Gly Asn 378he Leu His His Trp Pro Leu Gln Gln Pro Pro Pro Gly Ser Leu 385 39Gln Pro His Pro Glu Ala Leu Gly Phe Pro LeuGlu Leu Arg Glu 44Gln Leu Leu Pro Asp Gly Glu Arg Leu Ala Pro Asn Gly Arg Glu 423lu Ala Pro Ala Met Gly Ser Glu Glu Gly Met Arg Ala Val Ser 435 44hr Gly Asp Cys Gly Gln Val Leu Arg Gly Gly Val Ile Gln Ser Thr 456rg Arg Arg Arg Ala Ser Gln Glu Ala Asn Leu Leu Thr Leu Ala 465 478ys Ala Val Glu Leu Ala Ser Leu Gln Asn Ala Lys Asp Gly Ser 485 49ly Ser Glu Glu Lys Arg Lys Ser Val Leu Ala Ser Thr Thr Lys Cys 55Val Glu PheSer Glu Pro Ser Leu Ala Thr Lys Arg Ala Arg Glu 5525 Asp Ser Gly Met Val Pro Leu Ile Ile Pro Val Ser Val Pro Val Arg 534al Asp Pro Thr Glu Ala Ala Gln Ala Gly Gly Leu Asp Glu Asp 545 556ys Gly Leu Glu Gln Asn Pro AlaGlu His Lys Pro Ser Val Ile 565 57al Thr Arg Arg Arg Ser Thr Arg Ile Pro Gly Thr Asp Ala Gln Ala 589la Glu Asp Met Asn Val Lys Leu Glu Gly Glu Pro Ser Val Arg 595 6Lys Pro Lys Gln Arg Pro Arg Pro Glu Pro Leu Ile Ile Pro ThrLys 662ly Thr Phe Ile Ala Pro Pro Val Tyr Ser Asn Ile Thr Pro Tyr 625 634er His Leu Arg Ser Pro Val Arg Leu Ala Asp His Pro Ser Glu 645 65rg Ser Phe Glu Leu Pro Pro Tyr Thr Pro Pro Pro Ile Leu Ser Pro 667rg Glu Gly Ser Gly Leu Tyr Phe Asn Ala Ile Ile Ser Thr Ser 675 68hr Ile Pro Ala Pro Pro Pro Ile Thr Pro Lys Ser Ala His Arg Thr 69Leu Arg Thr Asn Ser Ala Glu Val Thr Pro Pro Val Leu Ser Val 77Met Gly Glu Ala Thr ProVal Ser Ile Glu Pro Arg Ile Asn Val Gly 725 73er Arg Phe Gln Ala Glu Ile Pro Leu Met Arg Asp Arg Ala Leu Ala 745la Asp Pro His Lys Ala Asp Leu Val Trp Gln Pro Trp Glu Asp 755 76eu Glu Ser Ser Arg Glu Lys Gln Arg Gln Val GluAsp Leu Leu Thr 778la Cys Ser Ser Ile Phe Pro Gly Ala Gly Thr Asn Gln Glu Leu 785 79Leu His Cys Leu His Glu Ser Arg Gly Asp Ile Leu Glu Thr Leu 88Lys Leu Leu Leu Lys Lys Pro Leu Arg Pro His Asn His Pro Leu 823hr Tyr His Tyr Thr Gly Ser Asp Gln Trp Lys Met Ala Glu Arg 835 84ys Leu Phe Asn Lys Gly Ile Ala Ile Tyr Lys Lys Asp Phe Phe Leu 856ln Lys Leu Ile Gln Thr Lys Thr Val Ala Gln Cys Val Glu Phe 865 878yr ThrTyr Lys Lys Gln Val Lys Ile Gly Arg Asn Gly Thr Leu 885 89hr Phe Gly Asp Val Asp Thr Ser Asp Glu Lys Ser Ala Gln Glu Glu 99Glu Val Asp Ile Lys Thr Ser Gln Lys Phe Pro Arg Val Pro Leu 9925 Pro Arg Arg Glu Ser Pro Ser Glu GluArg Leu Glu Pro Lys Arg Glu 934ys Glu Pro Arg Lys Glu Gly Glu Glu Glu Val Pro Glu Ile Gln 945 956ys Glu Glu Gln Glu Glu Gly Arg Glu Arg Ser Arg Arg Ala Ala 965 97la Val Lys Ala Thr Gln Thr Leu Gln Ala Asn Glu Ser AlaSer Asp 989eu Ile Leu Arg Ser His Glu Ser Asn Ala Pro Gly Ser Ala Gly 995 Gln Ala Ser Glu Lys Pro Arg Glu Gly Thr Gly Lys Ser Arg Arg Ala Leu Pro Phe Ser Glu Lys Lys Lys Lys Lys Gln Lys Ala 3849amino acids amino acid single linear Arg His Glu Val Ser Phe Leu Trp Asn Thr Glu Ala Ala Cys Pro Gln Thr Thr Thr Asp Thr Asp Gln Ala Cys Ser Ile Arg Asp Pro 2 Asn Ser Gly Phe Val Phe Asn Leu Asn Pro Leu Asn Ser Ser Gln Gly 354r Asn Val Ser Gly Ile Gly Lys Ile Phe Met Phe Asn Val Cys Gly 5 Thr Met Pro Val Cys Gly Thr Ile Leu Gly Lys Pro Ala Ser Gly Cys 65 7 Glu Ala Glu Thr Gln Thr Glu Glu Leu Lys Asn Trp Lys Pro Ala Arg 85 9o Val Gly Ile Glu LysSer Leu Gln Leu Ser Thr Glu Gly Phe Ile Leu Thr Tyr Lys Gly Pro Leu Ser Ala Lys Gly Thr Ala Asp Ala Ile Val Arg Phe Val Cys Asn Asp Asp Val Tyr Ser Gly Pro Leu Phe Leu His Gln Asp Ile Asp Ser Gly Gln GlyIle Arg Asn Thr Tyr Phe Glu Phe Glu Thr Ala Leu Ala Cys Val Pro Ser Pro Val Asp Gln Val Thr Asp Leu Ala Gly Asn Glu Tyr Asp Leu Thr Gly Leu Thr Val Arg Lys Pro Trp Thr Ala Val Asp Thr Ser Val Asp Gly 2Lys Arg Thr Phe Tyr Leu Ser Val Cys Asn Pro Leu Pro Tyr Ile 222ly Cys Gln Gly Ser Ala Val Gly Ser Cys Leu Val Ser Glu Gly 225 234er Trp Asn Leu Gly Val Val Gln Met Ser Pro Gln Ala Ala Ala 245 25sn Gly SerLeu Ser Ile Met Tyr Val Asn Gly Asp Lys Cys Gly Asn 267rg Phe Ser Thr Arg Ile Thr Phe Glu Cys Ala Gln Ile Ser Gly 275 28er Pro Ala Phe Gln Leu Gln Asp Gly Cys Glu Tyr Val Phe Ile Trp 29Thr Val Glu Ala Cys Pro Val ValArg Val Glu Gly Asp Asn Cys 33Glu Val Lys Asp Pro Arg His Gly Asn Leu Tyr Asp Leu Lys Pro Leu 325 33ly Leu Asn Asp Thr Ile Val Ser Ala Gly Glu Tyr Thr Tyr Tyr Phe 345al Cys Gly Lys Leu Ser Ser Asp Val Cys Pro Thr SerAsp Lys 355 36er Lys Val Val Ser Ser Cys Gln Glu Lys Arg Glu Pro Gln Gly Phe 378ys Val Ala Gly Leu Leu Thr Gln Lys Leu Thr Tyr Glu Asn Gly 385 39Leu Lys Met Asn Phe Thr Gly Gly Asp Thr Cys His Lys Val Tyr 44Arg Ser Thr Ala Ile Phe Phe Tyr Cys Asp Arg Gly Thr Gln Arg 423al Phe Leu Lys Glu Thr Ser Asp Cys Ser Tyr Leu Phe Glu Trp 435 44rg Thr Gln Tyr Ala Cys Pro Pro Phe Asp Leu Thr Glu Cys Ser Phe 456sp Gly Ala Gly AsnSer Phe Asp Leu Ser Ser Leu Ser Arg Tyr 465 478sp Asn Trp Glu Ala Ile Thr Gly Thr Gly Asp Pro Glu His Tyr 485 49eu Ile Asn Val Cys Lys Ser Leu Ala Pro Gln Ala Gly Thr Glu Pro 55Pro Pro Glu Ala Ala Ala Cys Leu Leu GlyGly Ser Lys Pro Val 5525 Asn Leu Gly Arg Val Arg Asp Gly Pro Gln Trp Arg Asp Gly Ile Ile 534eu Lys Tyr Val Asp Gly Asp Leu Cys Pro Asp Gly Ile Arg Lys 545 556er Thr Thr Ile Arg Phe Thr Cys Ser Glu Ser Gln Val Asn Ser565 57rg Pro Met Phe Ile Ser Ala Val Glu Asp Cys Glu Tyr Thr Phe Ala 589ro Thr Ala Thr Ala Cys Pro Met Lys Ser Asn Glu His Asp Asp 595 6Cys Gln Val Thr Asn Pro Ser Thr Gly His Leu Phe Asp Leu Ser Ser 662er GlyArg Ala Gly Phe Thr Ala Ala Tyr Ser Glu Lys Gly Leu 625 634yr Met Ser Ile Cys Gly Glu Asn Glu Asn Cys Pro Pro Gly Val 645 65ly Ala Cys Phe Gly Gln Thr Arg Ile Ser Val Gly Lys Ala Asn Lys 667eu Arg Tyr Val Asp Gln ValLeu Gln Leu Val Tyr Lys Asp Gly 675 68er Pro Cys Pro Ser Lys Ser Gly Leu Ser Tyr Lys Ser Val Ile Ser 69Val Cys Arg Pro Glu Ala Gly Pro Thr Asn Arg Pro Met Leu Ile 77Ser Leu Asp Lys Gln Thr Cys Thr Leu Phe Phe Ser TrpHis Thr Pro 725 73eu Ala Cys Glu Gln Ala Thr Glu Cys Ser Val Arg Asn Gly Ser Ser 745al Asp Leu Ser Pro Leu Ile His Arg Thr Gly Gly Tyr Glu Ala 755 76yr Asp Glu Ser Glu Asp Asp Ala Ser Asp Thr Asn Pro Asp Phe Tyr 778sn Ile Cys Gln Pro Leu Asn Pro Met His Gly Val Pro Cys Pro 785 79Gly Ala Ala Val Cys Lys Val Pro Ile Asp Gly Pro Pro Ile Asp 88Gly Arg Val Ala Gly Pro Pro Ile Leu Asn Pro Ile Ala Asn Glu 823yr Leu Asn PheGlu Ser Ser Thr Pro Cys Gln Glu Phe Ser Cys 835 84ys 852 amino acids amino acid single linear Ala Arg Leu Ser Arg Pro Glu Arg Pro Asp Leu Val Phe Glu Glu Asp Leu Pro Tyr Glu Glu Glu Ile Met Arg Asn Gln Phe Ser Val 2Lys Cys Trp Leu His Tyr Ile Glu Phe Lys Gln Gly Ala Pro Lys Pro 35 4g Leu Asn Gln Leu Tyr Glu Arg Ala Leu Lys Leu Leu Pro Cys Ser 5 Tyr Lys Leu Trp Tyr Arg Tyr Leu Lys Ala Arg Arg Ala Gln Val Lys 65 7 His Arg Cys Val Thr Asp Pro AlaTyr Glu Asp Val Asn Asn Cys His 85 9u Arg Ala Phe Val Phe Met His Lys Met Pro Arg Leu Trp Leu Asp Cys Gln Phe Leu Met Asp Gln Gly Arg Val Thr His Thr Arg Arg Phe Asp Arg Ala Leu Arg Ala Leu Pro Ile Thr Gln His SerArg Trp Pro Leu Tyr Leu Arg Phe Leu Arg Ser His Pro Leu Pro Glu Thr Ala Val Arg Gly Tyr Arg Arg Phe Leu Lys Leu Ser Pro Glu Ser Glu Glu Tyr Ile Glu Tyr Leu Lys Ser Ser Asp Arg Leu Asp Glu Ala Gln Arg Leu Ala Thr Val Val Asn Asp Glu Arg Phe Val Ser 2Ala Gly Lys Ser Asn Tyr Gln Leu Trp His Glu Leu Cys Asp Leu 222er Gln Asn Pro Asp Lys Val Gln Ser Leu Asn Val Asp Ala Ile 225 234rg Gly Gly Leu ThrArg Phe Thr Asp Gln Leu Gly Lys Leu Trp 245 25ys Ser Leu Ala Asp Tyr Tyr Ile Arg Ser Gly His Phe Glu Lys Ala 267sp Val Tyr Glu Glu Ala Ile Arg Thr Val Met Thr Val Arg Asp 275 28he Thr Gln Val Phe Asp Ser Tyr Ala Gln Phe GluGlu Ser Met Ile 29Ala Lys Met Glu Thr Ala Ser Glu Leu Gly Arg Glu Glu Glu Asp 33Asp Val Asp Leu Glu Leu Arg Leu Ala Arg Phe Glu Gln Leu Ile Ser 325 33rg Arg Pro Leu Leu Leu Asn Ser Val Leu Leu Arg Gln Asn Pro His 345al His Glu Trp His Lys Arg Val Ala Leu His Gln Gly Arg Pro 355 36rg Glu Ile Ile Asn Thr Tyr Thr Glu Ala Val Gln Thr Val Asp Pro 378ys Ala Thr Gly Lys Pro His Thr Leu Trp Val Ala Phe Ala Lys 385 39Tyr GluAsp Asn Gly Gln Leu Asp Asp Ala Arg Val Ile Leu Glu 44Ala Thr Lys Val Asn Phe Lys Gln Val Asp Asp Leu Ala Ser Val 423ys Gln Cys Gly Glu Leu Glu Leu Arg His Glu Asn Tyr Asp Glu 435 44la Leu Arg Leu Leu Arg Lys Ala ThrAla Leu Pro Ala Arg Arg Ala 456yr Phe Asp Gly Ser Glu Pro Val Gln Asn Arg Val Tyr Lys Ser 465 478ys Val Trp Ser Met Leu Ala Asp Leu Glu Glu Ser Leu Gly Thr 485 49he Gln Ser Thr Lys Ala Val Tyr Asp Arg Ile Leu Asp LeuArg Ile 55Thr Pro Gln Ile Val Ile Asn Tyr Ala Met Phe Leu Glu Glu His 5525 Lys Tyr Phe Glu Glu Ser Phe Lys Ala Tyr Glu Arg Gly Ile Ser Leu 534ys Trp Pro Asn Val Ser Asp Ile Trp Ser Thr Tyr Leu Thr Lys 545 556le Ala Arg Tyr Gly Gly Arg Lys Leu Glu Arg Ala Arg Asp Leu 565 57he Glu Gln Ala Leu Asp Gly Cys Pro Pro Lys Tyr Ala Lys Thr Leu 589eu Leu Tyr Ala Gln Leu Glu Glu Glu Trp Gly Leu Ala Arg His 595 6Ala Met Ala Val Tyr GluArg Ala Thr Arg Ala Val Glu Pro Ala Gln 662yr Asp Met Phe Asn Ile Tyr Ile Lys Arg Ala Ala Glu Ile Tyr 625 634al Thr His Thr Arg Gly Ile Tyr Gln Lys Ala Ile Glu Val Leu 645 65er Asp Glu His Ala Arg Glu Met Cys Leu ArgPhe Ala Asp Met Glu 667ys Leu Gly Glu Ile Asp Arg Ala Arg Ala Ile Tyr Ser Phe Cys 675 68er Gln Ile Cys Asp Pro Arg Thr Thr Gly Ala Phe Trp Gln Thr Trp 69Asp Phe Glu Val Arg His Gly Asn Glu Asp Thr Ile Lys Glu Met 77Leu Arg Ile Arg Arg Ser Val Gln Ala Thr Tyr Asn Thr Gln Val Asn 725 73he Met Ala Ser Gln Met Leu Lys Val Ser Gly Ser Ala Thr Gly Thr 745er Asp Leu Ala Pro Gly Gln Ser Gly Met Asp Asp Met Lys Leu 755 76eu Glu GlnArg Ala Glu Gln Leu Ala Ala Glu Ala Glu Arg Asp Gln 778eu Arg Ala Gln Ser Lys Ile Leu Phe Val Arg Ser Asp Ala Ser 785 79Glu Glu Leu Ala Glu Leu Ala Gln Gln
Val Asn Pro Glu Glu Ile 88Leu Gly Glu Asp Glu Asp Glu Asp Glu Met Asp Leu Glu Pro Asn 823al Arg Leu Glu Gln Gln Ser Val Pro Ala Ala Val Phe Gly Ser 835 84eu Lys Glu Asp 85mino acids amino acid singlelinear Phe Ser Ala Leu Lys Lys Leu Val Gly Ser Asp Gln Ala Pro Gly Asp Lys Asn Ile Pro Ala Gly Leu Gln Ser Met Asn Gln Ala Leu 2 Gln Arg Arg Phe Ala Lys Gly Val Gln Tyr Asn Met Lys Ile Val Ile 35 4g Gly Asp Arg Asn ThrGly Lys Thr Ala Leu Trp His Arg Leu Gln 5 Gly Arg Pro Phe Val Glu Glu Tyr Ile Pro Thr Gln Glu Ile Gln Val 65 7 Thr Ser Ile His Trp Ser Tyr Lys Thr Thr Asp Asp Ile Val Lys Val 85 9u Val Trp Asp Val Val Asp Lys Gly Lys Cys Lys Lys ArgGly Asp Leu Lys Met Glu Asn Asp Pro Gln Glu Xaa Glu Ser Glu Met Ala Asp Ala Glu Phe Leu Asp Val Tyr Lys Asn Cys Asn Gly Val Val Met Phe Asp Ile Thr Lys Gln Trp Thr Phe Asn Tyr Ile Leu Arg Glu Leu Pro Lys Val Pro Thr His Val Pro Val Cys Val Leu Gly Asn Arg Asp Met Gly Glu His Arg Val Ile Leu Pro Asp Asp Val Arg Phe Ile Asp Asn Leu Asp Arg Pro Pro Gly Ser Ser Tyr Phe Arg 2Ala Glu Ser Ser MetLys Asn Ser Phe Gly Leu Lys Tyr Leu His 222he Phe Asn Ile Pro Phe Leu Gln Leu Gln Arg Glu Thr Leu Leu 225 234ln Leu Glu Thr Asn Gln Leu Asp Met Asp Ala Thr Leu Glu Glu 245 25eu Ser Val Gln Gln Glu Thr Glu Asp Gln AsnTyr Gly Ile Phe Leu 267et Met Glu Ala Arg Ser Arg Gly His Ala Ser Pro Leu Ala Ala 275 28sn Gly Gln Ser Pro Ser Pro Gly Ser Gln Ser Pro Val Leu Pro Ala 29Ala Val Ser Thr Gly Ser Ser Ser Pro Gly Thr Pro Gln Pro Ala 33Pro Gln Leu Pro Leu Asn Ala Ala Pro Pro Ser Ser Val Pro Pro Val 325 33ro Pro Ser Glu Ala Leu Pro Pro Pro Ala Cys Pro Ser Ala Pro Ala 345rg Arg Ser Ile Ile Ser Arg Leu Phe Gly Thr Ser Pro Ala Thr 355 36lu Ala AlaPro Pro Pro Pro Glu Pro Val Pro Ala Ala Gln Gly Pro 378hr Val Gln Ser Val Glu Asp Phe Val Pro Asp Asp Arg Leu Asp 385 39Ser Phe Leu Glu Asp Thr Thr Pro Ala Arg Asp Glu Lys Lys Val 44Ala Lys Ala Ala Gln Gln AspSer Asp Ser Asp Gly Glu Ala Leu 423ly Asn Pro Met Val Ala Gly Phe Gln Asp Asp Val Asp Leu Glu 435 44sp Gln Pro Arg Gly Ser Pro Pro Leu Pro Ala Gly Pro Val Pro Ser 456sp Ile Thr Leu Ser Ser Glu Glu Glu Ala Glu Val AlaAla Pro 465 478ys Gly Pro Ala Pro Ala Pro Gln Gln Cys Ser Glu Pro Glu Thr 485 49ys Trp Ser Ser Ile Pro Ala Ser Lys Pro Arg Arg Gly Thr Ala Pro 55Arg Thr Ala Ala Pro Pro Trp Pro Gly Gly Val Ser Val Arg Thr 5525Gly Pro Glu Lys Arg Ser Ser Thr Arg Pro Pro Ala Glu Met Glu Pro 534ys Gly Glu Gln Ala Ser Ser Ser Glu Ser Asp Pro Glu Gly Pro 545 556la Ala Gln Met Leu Ser Phe Val Met Asp Asp Pro Asp Phe Glu 565 57er Glu Gly Ser AspThr Gln Arg Arg Ala Asp Asp Phe Pro Val Arg 589sp Pro Ser Asp Val Thr Asp Glu Asp Glu Gly Pro Ala Glu Pro 595 6Pro Pro Pro Pro Lys Leu Pro Leu Pro Ala Phe Arg Leu Lys Asn Asp 662sp Leu Phe Gly Leu Gly Leu Glu Glu AlaGly Pro Lys Glu Ser 625 634lu Glu Gly Lys Glu Gly Lys Thr Pro Ser Lys Glu Lys Lys Lys 645 65ys Thr Lys Ser Phe Ser Arg Val Leu Leu Glu Arg Pro Arg Ala His 667he Ser Thr Arg Val Gly Tyr Gln Val Ser Val Pro Asn Ser Pro675 68yr Ser Glu Ser Tyr 69BR>* * * * *