이 애플리케이션의 일부 콘텐츠는 현재 사용할 수 없습니다.
이 상황이 계속되면 다음 주소로 문의하십시오피드백 및 연락
1. (US20020018767) Anti-cancer cellular vaccine
유의사항: 이 문서는 자동 광학문자판독장치(OCR)로 처리된 텍스트입니다. 법률상의 용도로 사용하고자 하는 경우 PDF 버전을 사용하십시오

FIELD OF THE INVENTION

       The present invention relates generally to the field of recombinant DNA molecules. More specifically, the present invention relates to an expression vector comprising IL-12 and a co-stimulatory molecule and methods of using same in a cancer vaccine.

BACKGROUND OF THE INVENTION

       Although there is considerable evidence from scientific and clinical studies that the immune system is capable of destroying cancerous tissue, in most cases the immune system either fails to recognize the tumor or the response that is generated is too weak to be effective (Farzaneh et al., 1998, Immunol. Today 19:294). While early detection may cure tumors in many cases, once the disease becomes metastatic to distant organs, it is almost always fatal. Furthermore, the disappointing results observed with chemotherapy, radiotherapy and surgery, individually or in combination, has shifted the attention of many investigators to immunological or biological agents (Ockert et al., 1999. Immunol. Today 20:63). As such, increasing the capacity of the immune system to mediate tumor regression has been a major goal in tumor immunology. Progress towards this goal has recently been aided by the identification of immunogenic tumor antigens and by a better understanding of the mechanisms of T cell-mediated immune response and tumor escape (Boon et al., 1997, Immunol. Today 18:267; Chen, 1998, Immunol. Today 19:27).
       An understanding of the mechanisms by which some animals reject tumors whereas others display progressive tumor outgrowth is gradually evolving based on an appreciation of the underlying concepts of cellular and tumor immunology. Simply put, these are that tumor cells can be eliminated by the immune system and that cellular cytotoxicity plays a major role in antitumor immunity and the effector cells in many cases are either CD8+ CTL or CD4+ Th cells (Denfeld et al., Int J. Cancer 62:259; Greten and Jaffee, 1999, J. Clin. Oncol. 17:1047; Sampson et al., 1996, Proc. Natl. Acad. Sci. USA 93:10399). However, the induction and amplification of an effective T cell-mediated immune response in malignancies characterized by poor immungenicity is the most challenging aspects of tumor vaccine development (Sampson et al., 1996). A two signal model of lymphocyte activation postulates that for optimal activation, lymphocytes require both an antigen-specific signal delivered through TCR and an antigen-nonspecific costimulatory signal (van Seventer et al., 1991 Curr. Opin. Immunol. 3:294; Linsley et al., 1991, J. Exp. Med. 173:721). In this regard, tumor cells may effectively evade the immune system by several mechanisms which are not only confined to tumor cells, but may also be related to impaired function of the immune response in a tumor bearing host (Gretten and Jaffee, 1999). These include: defective expression of MHC complex on tumor cells, antigen processing defects, lack of T cell recognition by outgrowth of antigen negative clones of tumor cells, inadequate expression of costimulatory molecules on tumor cells, inadequate expression of adhesion molecules on tumor cells, inadequate expression of Fas receptor and/or FaL expression on tumor cells, immune-suppressive cytokine secretion into tumor microenvironment, and host defense failure due to impaired immune cell function (Boon et al., 1997). Therefore, in the majority of cases the immune system either fails to recognize the tumor or the response that is generated is too weak to be effective. Furthermore, the management of residual and metastatic disease is a central problem in the treatment of tumors. During a normal immune response, full activation of antigen-specific naive T cells requires at least two distinct signals from surface receptors to proliferate in response to antigens (Young et al., 1992, J. Clin. Invest. 90:229; Allison and Krummel, 1995, Science 270:932). One of the signals is supplied by T cell receptor (TCR) engagement with peptide (antigen)-loaded major histocompatibility complex (MHC) molecules on antigen-presenting cells (APC). The second signal, at present poorly understood, can be delivered by the interaction of various molecules on the surface of T cells and the APC, one of which is the interaction of CD28 and B7-1 (Linsley et al, 1991; Young et al, 1992; Bluestone, 1995, Immunity 2:555). The combination of these two signals leads to activation, clonal expansion and differentiation into effector cells of T lymphocytes (Guerder et al., 1995, J. Immunol. 155:5167; Webb and Feldmann, 1995, Blood 86:3479; Thompson, 1995, Cell 81:979). Effector T lymphocytes, unlike naive T cells, no longer require costimulatory signals to recognize and kill antigen-bearing targets. After the immune response, a fraction of the effector cells remain as memory cells that form the basis of a faster and stronger immune response upon subsequent presentation of the same antigen (Gray, 1993, Ann. Rev. Immunol. 11:49; Ahmed and Gray, 1996, Science 272:54). The absence of second signal results in T cell clonal anergy, thus preventing clonal expansion of T lymphocytes (Chen, 1998; van Gool et al., 1994, Res. Immunol. 146:183).
       Although many tumor cells express target antigens, they are generally incapable of stimulating an immune response (Boon et al., 1997; Boon and van der Bruggen, 1996, J. Exp. Med. 183:725). Cytotoxic T lymphocytes (CTL) have been recognized as a critical component of the immune response to tumors (Boon and van der Bruggen, 1996; Chen et al., 1994, J. Exp. Med. 179:523). CTL responses are sufficient to protect against tumors and can eliminate even established cancers in murine models (Mogi et al., 1998, Clin. Cancer Res. 4:713) and in humans (Gong et al., 2000, Proc. Natl. Acad. Sci. USA 97:2715). Inducing strong antigen-specific CTL responses is the goal of many current cancer vaccine strategies. The development of CTL-dependent anti-tumor immunization strategies depends on both the identification of tumor antigens recognized by CTLs and the development of methods for effective antigen delivery. CTL target tumors through recognition of a ligand consisting of a self MHC class I molecule and a peptide antigen generally derived from proteins synthesized within the tumor cell. However, for CTL induction and expansion to occur, the antigenic ligand must be presented to CTLs in the appropriate context of costimulation usually provided by professional APCs. Delivery of exogenous antigen to the endogenous MHC class I restricted processing pathway of professional APCs is a critical challenge in cancer vaccine design. Antigen delivery strategies currently under development include immunization with defined peptides, particulate proteins capable of accessing the class I pathway of professional APCs in vivo, heat shock proteins isolated from tumor cells, or adoptive transfer of antigen-loaded APCs. In addition, recent studies suggest that DNA vaccines encoding tumor antigens delivered by viral vectors or liposomes, or as naked DNA, can induce potent anti-tumor immunity.
       In addition to the challenge of antigen delivery, most current tumor immunization strategies depend on the identification and production of appropriate tumor antigens. To overcome this limitation, tumor cells themselves may be used as immunogens as described in ACV (autologous cell vaccine). Engineering tumor cells to provide APC function could potentially result in polyvalent immunization to multiple tumor-specific epitopes, while obviating the need to identify specific tumor antigens. Many tumor vaccine strategies, including cytokine-transduced tumor cells, commonly referred to as gene therapy (Asher et al., 1991, J. Immunol. 146:3227; Tahara et al., 1996, Ann. NY Acad. Sci. 795:275; Lotze et al., 1996, Ann. NY Acad. Sci. 795:440; Rakhmilevich et al., 1997, Hum. Gene Ther. 8:1301; Nawrocki and Mackiewicz, 1998, Cancer Treat Rev. 25:29), synthetic peptide vaccine (Rosenberg et al., 1998, Nature Med. 4:321), tumor-antigen(peptide)-pulsed dendritic cells (Flamand et al, 1994, Eur. J. Immunol. 24:605; Bianchi et al., 1996, J. Immunol. 157:1589; Ashley et al., 1997, J. Exp. Med. 186:1177; Yang et al., 1997, Cell. Immunol. 179:84; Thurner et al., 1999, J. Exp. Med. 190:1669), and DNA vaccine (Leclerc and Ronco, 1998, Immunol. Today 19:300; Akbari et al., 1999, J. Exp. Med. 189:169) are currently under pre-clinical and clinical investigation but to date have yielded only marginal immunological and clinical response.
       U.S. Pat. Nos. 5,635,188 and 5,993,829 teach a cancer vaccine comprising purified cell surface antigens shed by human cancer cell lines during culturing. The peptides are then used as a vaccine for immunization against cancer. That is, the antigens are used to sensitize the recipient's immune system to these antigens so that they are recognized in the event that tumors expressing these antigens develop.
       U.S. Pat. No. 5,571,515 teaches the use of purified IL-12 protein in combination with an antigen from a pathogenic organism in a vaccine wherein IL-12 acts as an adjuvant to increase the vaccinated host's immune response to the pathogen. IL-12, either purified peptide or “naked” DNA encoding IL-12 or an expression vector containing IL-12 is administered with the vaccine either simultaneously or separately. This patent also describes a cancer vaccine, comprising a tumor antigen coadministered with purified IL-12.
       U.S. Pat. No. 5,744,132 teaches methods for preparing formulations of IL-12 for use as a pharmaceutical. Specifically, the IL-12 protein is produced via an expression vector system and the IL-12 protein is recovered and lyophilized.
       U.S. Pat. No. 5,891,680 teaches bioactive fusion proteins comprising, in one example, the p35 and p40 subunits of IL-12 in either order joined together by an intervening peptide linker. This patent mentions the fusion proteins as “potentially useful for the enhancement of anti-tumor immunity” but does not describe how this would be done.
       U.S. Pat. No. 6,080,399 teaches a method wherein isolated antigen presenting cells are pulsed with a melanoma or similar peptide antigen. The isolated cells are then injected into patients as a vaccine in conjunction with IL-12 protein. The method teaches that subsequent injections with IL-12 are required for maximum efficiency of immune response induction.
       As can be seen, these prior art patents teach the use of vaccines including IL-12 protein. However, in these instances, a finite amount of IL-12 is being supplied, and this supply diminishes over time as the protein is degraded. As a consequence, subsequent doses or injections of IL-12 may be required, as discussed above.
       U.S. Pat. No. 5,922,685 describes DNA cancer vaccines which comprise p35 and p40 subunits of IL-12 under control of a single promoter (bicistronic transcript) as well as under the control of separate promoters. Furthermore, the inventors note that the non-bicistronic transcript vector (separate promoters for each subunit) was most effective. The inventors describe using the vaccine for noninvasive immunization, that is, to transfect epidermal cells and possibly mucosal surfaces.
       As discussed above, methods requiring administration of peptides or proteins have inherent limitations, due to turn-over and degradation. Furthermore, the prior art does not teach methods for modifying tumor cells or cancer cells such that the cells express IL-12 and a costimulatory molecule so that the patient's immune system more effectively recognizes the tumor cell or cancer cell and elicits an immune response to destroy the tumor or cancer cells.

SUMMARY OF THE INVENTION

       According to a first aspect of the invention, there is provided a method of eliciting an anti-tumor immune response in a patient comprising: isolating cancerous cells from a patient; transfecting said cancerous cells with an expression vector system comprising a DNA molecule encoding IL-12 and a costimulatory molecule operably linked to a promoter capable of directing expression of said DNA molecule in said cancerous cells; incubating said transfected cells under conditions whereby the IL-12 and the costimulatory molecules are expressed; and eliciting an anti-tumor immune response in the patient by injecting said transfected cells into the patient.
       According to a second aspect of the invention, there is provided a method of vaccinating an individual comprising: providing cancerous cells isolated from a donor; transfecting said cancerous cells with an expression vector system comprising a DNA molecule encoding IL-12 and a costimulatory molecule operably linked to a promoter capable of directing expression of said DNA molecule in said cancerous cells; incubating said transfected cells under conditions whereby the IL-12 and the costimulatory molecules are expressed; isolating naive T cells from the individual; exposing the T cells to the transfected cancerous cells, thereby activating the T cells; separating the active T cells from the transfected cancerous cells; and injecting the activated T cells into the patient.
       According to a third aspect of the invention, there is provided an expression system comprising a DNA molecule encoding IL-12 and a costimulatory molecule operably linked to a promoter.
       According to a fourth aspect of the invention, there is provided a cancerous cell transfected with the expression system described above.

BRIEF DESCRIPTION OF THE DRAWINGS

       FIG. 1 shows the overall procedure for generating DNA vector encoding IL-12 and B7-1.
       FIG. 2 shows IL-12 production of IL-12 gene-transfected COS cells using ELISA.
       FIG. 3 shows functional analysis of IL-12 produced by gene-transfected COS cells. Augmentation of PHA-stimulated lymphocyte proliferation was compared to standard recombinant human IL-12.
       FIG. 4 shows FACS analyses of COS cell transfected with B7-1 gene. Note that unmodified or mock-transfected COS cell is negative for B7-1.
       FIG. 5 shows costimulation of PHA-stimulated PBMC by B7-1 gene transfected COS cells.
       FIG. 6 shows the amino acid sequence of IL12.1 (SEQ ID No. 2).
       FIG. 7 shows the amino acid sequence of IL12.0 (SEQ ID No. 4).
       FIG. 8 shows the amino acid sequence of IL12.3 (SEQ ID No. 6).
       FIG. 9 shows the amino acid sequence of IL12.2 (SEQ ID No. 8).
       FIG. 10 shows the amino acid sequence of IL12.4 (SEQ ID No. 10).

DESCRIPTION OF THE PREFERRED EMBODIMENTS

       Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications mentioned hereunder are incorporated herein by reference.
       Definitions
       As used herein, “IL-12” refers to bioactive interleukin-12. As will be appreciated by one knowledgeable in the art, this includes IL-12 assembled from p35 and p40 subunits or bioactive fragments thereof, a recombinant IL-12 comprising a fusion of p35 and p40 (or bioactive fragments thereof). The subunits may be joined by a linker.
       As used herein, “costimulatory molecule” refers to molecules capable of amplifying an immune response, for example, B7-1, B7-2 and CD40L.
       Described herein is a novel expression vector that comprises DNA sequences encoding IL-12 and a costimulatory molecule as well as methods of using same.
       As will be appreciated by one knowledgeable in the art, any suitable control sequences (promoters, polyadenylation sites, ribosome binding sites etc) known in the art may be utilized in the expression vector.
       In one embodiment, the costimulatory molecule is B7-1, although, as will be appreciated by one knowledgeable in the art, other suitable costimulatory molecules, for example, B7-2 and CD40L, may also be used.
       The method of use of the vaccine comprises isolating tumor cells or cancer cells from either donors or the patient to be treated, as described below. The isolated cancer cells are transfected with the above-described expression vector and are grown under conditions such that IL-12 and the costimulatory molecule are expressed. As a result of this arrangement, the cancer cells are effectively converted to antigen presenting cells (APC). The APC cancer cells are then exposed to T cells isolated from the patient, either in vivo or ex vivo. That is, in one embodiment, the APC cancer cells are irradiated to prevent reproduction of the APC cancer cells prior to injecting the APC cancer cells into the patient. In another embodiment, T cells isolated from the patient are exposed to the APC cancer cells and are then isolated from the APC cancer cells before being injected into the patient. In both embodiments, the T cell response is activated which in turn elicits an immune response against tumors. Furthermore, once the tumor has been destroyed, memory cells remain, meaning that the patient is effectively immunized against the tumor, and a subsequent immune response will be faster and stronger.
       Thus, as discussed above, the instant invention relates to the development of a new method of cancer immunotherapy and its in vitro, ex vivo, and in vivo uses. More specifically, this invention relates to the development of DNA vector comprising IL-12 and a costimulatory molecule and the protocol suitable for the in vitro generation of genetically modified human cancer cells for cancer therapy. These cells share phenotypes of both antigen presenting cells and cancer cells and are suitable as a cellular vaccine for certain types of cancer.
       Since most tumor cells do not express costimulatory molecules, tumor-specific antigens are not presented to T cells efficiently (Denfeld et al., 1995; Gajewski et al., 1996, J Immunol. 156:2909). Indeed, this may represent one mechanism by which tumor cells elude recognition by the immune system. Therefore, costimulatory molecules have been expressed on the surface of tumor cells to enable them to present tumor-associated antigens, together with the costimulatory signal, directly to T cells thus obviating the need for helper T cells and APCs. In support of this model, at least in primary responses against tumors in vivo, some B7-1-expressing tumors have been found to elicit an effective response that is mediated by CD+ and CD+ T cells (Dranoff et al., 1993, Proc. Natl. Acad. Sci. USA 90:3539). In contrast, nonimmungenic tumors failed to lose tumorigenicity with the transduction of B7-1 alone (Chen et al., 1994). On the basis of these experiments, we can speculate that vaccination of tumor patients with B7 autologous tumor cells will only boost antitumor responses when the tumors are immunogenic, i.e., expresses sufficient amounts of tumor peptide-loaded MHC class-I or class-II molecules that can be recognized by the patients' T cells. An additional costimulatory signal seems to be required for inducing antitumor effector cells against poorly immunogenic tumors. In this regard, augmentation of antitumor immune responses might then be obtained by co-expressing B7 with a cytokine which up-regulate MHC molecules, attract and stimulate other immune effector cells. IL-12 has been selected for its pleiotropic effects on a variety of immune cell types and also because B7-1 and IL-12 co-operate in stimulating lymphocyte proliferation and activation in vitro (Trinchieri, 1998, Adv. Immunol. 70:83). This cytokine induces a strong local response against a number of tumors, in the context of using recombinant IL-12 paracrine secretion at the tumor site using engineered fibroblasts (Zitvogel et al., 1995, J. Immunol. 155:1393) or transformed tumor cells (Tahara et al, 1995, J. Immunol. 154:6466), as well as direct in vivo gene delivery (Rakhmilevich et al., 1997). IL-12 also elicits protective immunity against poorly immunogenic tumors and was effective in causing regression of pre-established tumors (Tahara, et al., 1995, J. Immunol. 154:6466; Cavallo et al., 1997, J. Natl. Cancer lnst. 89:1049). IL-12 stimulates the proliferation of activated T and NK cells, synergizes with IL-1 in the generation of lymphokine-activated killer cells (LAK) and enhances their lytic activity (Gately et al., 1991, J. Immunol. 147:874). Its ability to stimulate directly the production of IFN-γ both in vitro (Chan et al., 1991, J. Exp. Med. 173:869) and in vivo (Nastala et al., 1994, J. Immunol. 153:1697) and to induce primarily a Th1 response in vitro suggests its potential utility as an antitumor agent. Indeed, recent studies (Tahara et al., 1996; Brunda et al., 1996, Ann. NY Acad. Sci. 795:266) implementing systemic administration of the rIL-12 protein had profound effects on virtually every murine tumor model evaluated. Systemic administration of murine rIL-12 in several tumor models mediates profound T cell-mediated antitumor effects in vivo, leading to regression of established tumor masses, which is frequently associated with the generation of antitumor immunological memory. By the same token, however, prolonged systemic administration of rIL-12 has been related with systemic toxicity (Motzer et al., 1998, Clin. Cancer Res. 4:1183.). To address this problem, several studies on experimental tumors have evaluated the therapeutic potential of IL-12-based tumor cell vaccines and, with few exceptions, the vaccines were found to cure or significantly improve the survival of mice bearing a variety of tumors. The antitumor activity of systemic or local release of IL-12 is largely mediated by IFN-γ secreted at the tumor site by stimulated NK cells ad T cells, along with up-regulation of MHC expression on tumor cells, NOS induction, release of other cytokines, and inhibition of angiogenesis through the induction of the chemokine IP-10 by both tumor cells and infiltrating immune cells (Trinchieri, 1998).
       The heterodimeric structure of IL-12 substantially complicated the construction of vectors for IL-12 production, because the expression of bioactive IL-12 requires the expression of two separate genes and subsequent correct heterodimeric assembly of the subunits (Mattner et al., 1993, Eur. J. Immunol. 23:2202). In vivo, activity of IL-12 requires the expression of the p35 and p40 subunits, which are located on different chromosomes which are regulated independently (Wolf et al., 1991, J. Immunol. 146:3074). Furthermore, it has been shown that excessive p40 subunit specifically antagonizes the effects of the IL-12 heterodimer in vitro (Mattner et al, 1993). We therefore designed a single chain recombinant IL-12 molecules by genetic recombination. Specifically, in order to achieve balanced expression of both subunits of IL-12 (p35 and p40), we generated a single chain p70 molecule by joining both subunits with various size of flexible linker. The Gly-Gly-Gly-Gly-Ser repeats were chosen for their flexibility and also because it had been used previously in constructing single chain antibodies and PIXY-321 (Curtis et al., 1991, Proc. Natl. Acad. Sci. USA 88:5809), a fused form of GM-CSF and IL-3 already in clinical use. As shown in FIG. 1, IL-12 and B7-1 were inserted into a bicistronic vector which contains an internal ribosome entry site (IRES).
       In the embodiment shown in FIG. 1, the costimulatory molecule is B7-1, which enhances sensitization and activation of tumor-reactive CD+ T cells (Guerder et al., 1995); however, costimulatory molecules are not required for recognition and destruction of tumor cells by activated effector T cells (CTL). Moreover, it is clear that IL-12, even when present at pg/ml concentration, has profound effects on the generation of human CTLs in synergy with B7-1 stimuli (Komata et al., 1997, J. Immunother. 20:256). That is, direct activation of CTLs by tumor cells expressing B7-1 and IL-12 induces antitumor responses, bypassing the need for exogenous CD4+T cells or APCs or both, which leads to a faster and more effective antitumor response in situ. This may provide a means of eliciting effective CTL responses to tumors in patients. Thus, immunotherapy of the tumor aims to generate an effective systemic immune response capable of controlling the growth of metastatic tumors.
       Our vaccine, ACV (autologous cell vaccine), utilizes cell lines derived from tumors that are modified and administered in ways that substantially enhance the response by the immune system. The cells used in the manufacture of the vaccine are sterilized by irradiation prior to administration to ensure that they are unable to replicate. The costimulatory molecule signals the immune cells in the treated patients while IL-12 recruits immune cells to the tumor site(s), and kills tumor cells by preventing angiogenesis. Our vaccine therapy would supplement existing modalities such as chemotherapy, radiotherapy and surgery by using the immune system to contain or destroy residual tumor cells, thereby increasing the length of remission, or preventing recurrence of the cancer. The immune response will be selective for the tumor and will be capable of hunting down cancer cells throughout the body.
       This approach has several advantages over other above mentioned approaches that (a) it does not require the identification and purification of antigenic peptides, (b) it can be applied to almost every types of cancers, and (c) it may not induce GvHD since cancer cells are derived from self. Thus, this method will provide a new and safe therapeutic strategy for primary and metastatic cancer by activating patient's own immune system.
       The present invention provides methods and compositions for use of genetically modified cancer cells to activate T cells for immunotherapeutic responses against primary or metastatic cancer. The cancer cells obtained from human donors, after transfection or transduction with the IL-12 and B7-1 expression vector described above, are administered to a cancer patient to activate the relevant T cell responses in vivo. Alternatively, T cells from patients are exposed to genetically modified cancer cells in vitro to activate the relevant T cell responses in vitro. The activated T cells are then administered to a cancer patient. In either case, the genetically modified cancer cells are advantageously used to elicit an immunotherapeutic growth-inhibiting response against a primary or metastatic tumor.
       Examples of suitable vectors are discussed below and are shown in SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13 and 15. As will be apparent to one knowledgeable in the art, other suitable vectors may also be used.
       IL-12 fusions are shown in FIGS. 6-10 and in SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14 and 16 and are discussed below.

EXAMPLE I

       Construction of Vectors Encoding hB7-1 and Single Chain IL-12
       Human IL-12 gene and B7-1 genes are obtained from antigen presenting cells of various tissues, for example, the spleen, bone marrow and lymph nodes as well as the circulatory system including blood and lymph. It is of note that human peripheral blood is an easily accessible ready source of antigen presenting cells and is used as a source according to a preferred embodiment of the invention. Cord blood is another source of human antigen presenting cells.
       Because antigen presenting cells that express both IL-12 and B7-1 spontaneously exist in low numbers in any tissues in which they reside, including human peripheral blood, antigen presenting cells must be enriched or isolated for use. Any of a number of procedures, for example, repetitive density gradient separation, positive selection, negative selection or a combination thereof may be used to obtain enriched populations or isolated antigen presenting cells.
       Once the antigen presenting cells are obtained, they are cultured in appropriate culture medium to stimulate the expression of IL-12 and B7-1. Particularly advantageous for inducing the proper state of antigen presenting cells in in vitro culture is the presence of LPS for monocytes, Anti-Ig and IL-4 or CD40 ligand (CD154) and IL-4 for B cells, or GM-CSF, IL-4 and CD40L for dendritic cells Preferred cells and conditions are of monocytes and LPS at concentration of 100-1000 ng/ml.
       According to a preferred embodiment, both B7-1 and IL-12 genes have been cloned by standard recombinant DNA techniques. The cDNA encoding the p35 and p40 chains of human IL-12 and human B7-1 were generated from LPS-stimulated human peripheral blood monocytes by reverse transcriptase-PCR.
       Primers selected from the 5′ and 3′-end of the coding sequences of each gene were designed to introduces the appropriate restriction sites for further genetic recombinations, as shown in FIG. 1.
       After cloning of B7.1 and IL-12, each gene was subcloned into a vector.
       Specifically, B7-1 was excised at appropriate restriction sites and inserted into MCS A site on pIRES excised with corresponding restriction digests or into MCS B site excised with corresponding restriction enzyme generating pIRES-hB7-1 (A) and pIRES-hB7-1(B).
       The cDNA for the single chain IL-12 fusion protein was constructed by linkage of the p40 and p35 cDNAs with a synthetic flexible linker. We have utilized three different linkers that have 2, 3 and 4 repeats of Gly-Gly-Gly-Gly-Ser (shown in FIGS. 9, 8 and 10, respectively). These products (about 1.6 kb) have been cloned into pGEM-T easy vector generating pGEM-IL12.2, pGEM-IL12.3, and pGEM-IL12.4. PCR products from these three vectors were digested with appropriate sets of restriction enzyme and ligated into the MCS B on pIRES-hB7-1 (A) or into the MCS A on pIRES-hB7-1(B) generating 2 series (pIRES-hB7-1-IL12 and pIRES-IL12-hB7-1 series) of 6 different expression vectors (pIRES-hB7-1-IL12.2 (SEQ ID No. 7), pIRES-hB7-1-IL12.3 (SEQ ID No. 5), and pIRES-hB7-1-IL12.4 (SEQ ID No. 9); pIRES-IL12.2-hB7-1 (SEQ ID No. 15), pIRES-IL12.3-hB7-1 (SEQ ID No. 13), and pIRES-IL12.4-hB7-1 (SEQ ID No. 11)). In addition, constructs with a single linker (IL12.1, shown in FIG. 6; SEQ ID No. 2) and no linker (IL12.0, shown in FIG. 7; SEQ ID No. 4) were used to construct pIRES-hB7.1-IL12.1 (SEQ ID No. 1) and pIRES-hB7.1-IL12.0 (SEQ ID No. 3) respectively. Constructs were sequenced across all cloning junctions to determine the fidelity of recombination process.

EXAMPLE II

       Gene Transfer into Cancer Cells
       According to the present invention, the pIRES DNA vectors are introduced into a desired cancer cells by lipofectamin. After gene transfection, cells are analyzed to assess the expression of both IL-12 by ELISA and B7-1 by FACS.
       Alternatively, DNA vectors can be introduced into cancer cells by other means known in the art, for example, PEG, electroporation, DEAE-dextran method or calcium phosphate method.
       After 24-72 hr culture, supernatants were collected for IL-12 ELISA and bioassay and cells were collected for B7-1 expression and costimulation assay.

EXAMPLE III

       Applications or Methods of use of Genetically Modified Cancer Cell's to Stimulate T Cells Against Cancer in vitro and in vivo
       According to a preferred embodiment of the invention, cancer tissues are obtained from a patient to be treated to generate a cancer cell line. The cell lines are in turn used to activate autologous T cells of the patient, either in vitro or in vivo, for cancer immunotherapy and/or tumor growth inhibition.
       According to another embodiment, a tumor cell recovered from surgical specimens without further treatment can be used for gene transfer, as discussed herein.
       Using an approach wherein the patient's own cancer cells that are genetically modified provides the following advantages: (A) identification of cancer antigen is not required; (B) temporal expression of genes eliminates dangerous side effects; (C) antigen on cancer cells along with costimulatory factor, B7-1, is presented to T lymphocytes; (D) the use of B7-1 on and IL-12 from cancer cell surface eliminates the need to provide T cells with IL-2 or other cytokines either in the form of the cytokine itself or transfection of the cDNA into specific cells; (E) all procedures are carried out using the patient's own cells.

EXAMPLE IV

       Cell Lines and Reagents
       Cell lines are established from pathologically proven cancer tissues. Soild tumors were first finely minced with scissors and dissociated into small aggregates by pipetting. Appropriate amounts of fine neoplastic-tissue fragments were seeded into 25 cm2 flasks. Tumor cells were initial cultured in RPMI-1640 medium containing 10% heat-inactivated fetal calf serum. After establishment, passages were performed when heavy tumor cell growth is observed. If stromal cell growth is noted in initial cultures, differential trypsinization is used to obtain a pure tumor cell population. Established human cancer cells are maintained in culture in RPMI 1640. Mouse monoclonal anti-human CD80 and isotype-matched control antibodies were purchased from Becton-Dickinson, San Jose, Calif.

EXAMPLE V

       Cancer Patients
       Patients with a histologic confirmation of cancer are selected for this study which included a signed informed consent. 50 ml of heparinized peripheral blood were drawn every 2 weeks during the period of observation which continues. Details regarding clinical stage, hematologic status, and other relevant treatments are recorded.

EXAMPLE VI

       Isolation of Monocytes and Stimulation With LPS
       Peripheral blood was drawn from normal donors and was subjected to Ficoll-Hypaque (Pharmacia, Uppssala, Sweden) density gradient centrifugation. After washing twice with Hank's Balanced salt solution (HBSS, Life Technologies, Grand Island, N.Y.), monocytes were separated according to their plastic adherence (Steinbach et al., 1998, Res. Immunol. 149:627; Thurner et al., 1999, J. Immunol. Methods 223:1) for 30 min -1 hr incubation in 5% CO2, 37° C. incubator. Enriched monocytes were cultured in the presence of 100-1000 ng/ml of LPS (Sigma Chem. Co., St. Louis, Mo.). Alternatively, monocytes are purified from eripheral blood mononuclear cells by positive selection by CD14-MACS column (Miltenyi Biotec). After 16-24 hr of incubation, cells were treated with Trizol (Life Technologies) for RNA preparation and subsequent gene cloning.

EXAMPLE VII

       Construction of Vectors Encoding HB7-1 and Single Chain IL-12
       Both B7-1 and IL-12 genes were cloned by standard recombinant DNA techniques, as discussed herein. The cDNA encoding the p35 and p40 chains of human IL-12 and human B7-1 were generated from 1 μg/ml of LPS-stimulated human peripheral blood monocytes or CD40 ligand and IL4 stimulated human B cells by reverse transcriptase- PCR. Typically, peripheral blood monocytes were utilized for experimental convenience. First, mRNA was isolated by oligo-dT MACS column (Miltenyi Biotec) and subjected to first strand cDNA synthesis. Primers selected from the 5′ and 3′-end of the coding sequences of each gene were designed to introduces restriction sites. PCR was done with PCR-premix (Bioneer, Cheongwon, Chungbuk, Korea) containing PCR buffer, dNTP, Taq polymerase and MgCl2. After cloning of B7.1, each gene has been subcloned into PGEM vector generating pGEM-hp40, pGEM-hp35, and pGEM-hB7-1 and clones were verified by sequencing.
       Next, the PCR product of hB7-1 was either excised at appropriate restriction sites and inserted into MCS A site on pIRES or into MCS B site excised with restriction enzymes generating pIRES-hB7-1 (A) and pIRES-hB7-1(B).
       The CDNA for the single chain IL-12 fusion protein was constructed by linkage of the p40 and p35 cDNAs with synthetic linkers of 2, 3 and 4 repeats of Gly-Gly-Gly-Gly-Ser containing NcoI restriction sites. The p40 from pGEM-hp40 has been amplified with linker-containing primers producing p40L2, p40L3, and p40L4. The p35 was cloned downstream of the linker with restriction site. These products (about 1.6 kb) have been cloned into pGEM-T easy vector generating pGEM-IL12.2, pGEM-IL12.3, and pGEM-IL12.4. PCR products from these three vectors were digested with the appropriate restriction enzymes and ligated into the MCS B on pIRES-hB7-1 (A) or into restriction site of MCS A on pIRES-hB7-1(B) generating 2 series (pIRES-hB7-1-IL12 and pIRES-IL12-hB7-1 series) of 6 different expression vectors (pIRES-hB7-1-IL12.2, pIRES-hB7-1-IL12.3, and pIRES-hB7-1-IL12.4; pIRES-IL12.2-hB7-1, pIRES-IL12.3-hB7-1, and pIRES-IL12.4-hB7-1). Constructs were sequenced across all cloning junctions to determine the fidelity of recombination process.

EXAMPLE VII

       Transfection of Plasmid Encoding IL-12/B7-1 Genes and Analysis of Gene Expression
       COS-7 cells or human cancer cell lines in 6-well plates at 60-70% confluency were transfected with 5 μg of vector prepared by Qiagen plasmid kit (Qiagen Inc, Valencia, Calif.) using Lipofectin (Life Technologies) according to manufacturer's instructions. After 5 hr incubation at 37° C., the DNA/Lipofectin mixture was removed and replaced with fresh medium. After 48 hr, supernatants were collected for IL-12 ELISA and bioassay, and cells were collected for B7-1 expression. IL-12 in the culture supernatants of transfected cells was quantitated using human IL-12 p70 ELISA kit (Endogen, Woburn, Mass.). The function of IL-12 produced by gene transfected cells can be analyzed by PHA-blast assay (Gately et al., 1991) and/or IFN-γ induction assay of PHA-stimulated PBMC. B7-1 expression on gene-transfected cells can be analyzed by flow cytometer after staining with monoclonal anti-B7-1 antibody, while the function of B7-1 can be analyzed by proliferation of PBMC stimulated with sub-optimal dose of PHA in the presence of gene-transfected cells.

EXAMPLE IX

       Collection of Tissue and Preparation Cells
       Patients with a histologic confirmation of cancer were selected for this study which included a signed informed consent. 50 ml of heparinized peripheral blood was drawn. Details regarding clinical stage, hematologic status, and other relevant treatments were recorded. Cancer cell lines were established from patients. Solid tumors were finely minced with scissors and dissociated into small aggregates by pipetting. Appropriate amounts of fine tissue fragments were seeded into 25 cm2 flasks. In most cases, tumor cells were cultured in RPMI-1640 medium supplemented with 10% heat inactivated FCS. Initial passages were performed when heavy tumor cell growth was observed; subsequent passages were performed once or twice a week. Adherent cell cultures were passaged at sub-confluency after trypsin-EDTA (Life Technologies) treatment, and floating cells were passaged after dissociation, if necessary, of the cells by pipetting. If stromal cell or fibroblast growth was noted in initial cultures, differential trypsinization was used to obtain a pure tumor cell population. All cultures were maintained in humidified incubators at 37° C. in an atmosphere of 5% CO2 and 95% air. Phenotypes of established cell lines were analyzed with FACS (Becton Dickinson) after fluorochrome-labeled monoclonal antibody staining.

EXAMPLE X

       Construction of DNA Vectors Encoding hB7-1 and Single Chain IL-12
       Mononuclear cells were isolated by Ficoll-Paque (Pharmacia) density gradient centrifugation of heparinized blood obtained from healthy adult donors according to standard protocols. Then, cells (5×106/ml, 10 ml) resuspended in RPMI-1640 medium supplemented with 10% FCS, 10 mM of glutamine, and penicillin/streptomycin and subjected to 1 hr plastic adherence. Non-adherent mononuclear cells were removed by rinsing with warm PBS and remaining monocytes were stimulated with LPS(1 μg/ml, Sigma) for 20 hr at 37° C., 5% CO2 incubator. Both B7-1 and IL-12 genes have been cloned by standard recombinant DNA techniques. The cDNA encoding the p35 and p40 chains of human IL-12 and human B7-1 were generated from LPS-stimulated human peripheral blood monocytes by reverse transcriptase-PCR. Primers selected from the 5′ and 3′-end of the coding sequences of each gene were designed to introduces SalI, NcoI, NotI, NheI and MluI restriction sites. After cloning of B7.1, each gene has been subcloned into pGEM vector generating pGEM-hp40, pGEM-hp35, and pGEM-hB7-1 and clones were verified by sequencing.
       The sets of 5′ and 3′ primers for PCR were hp4F 5′-CTAGCTAGCGGCCCAGAG CMGATGTG-3′, hp4° F.b 5′-ACGCGTCGACGGCCCGAGCMGATGTG-3′, and hp40R1 5′-ACTGCAGGGCACAGATGC-3′ for p40, hp35F 5′-CATGCCATGGAG M ACCTCCCCGTGGC-3′, hp35R 5′-CCGACGCGTACCTCGCTTTTTAGGAAGCAT-3′, and hp35Rb 5′-ATMGAATGCGGCCGCACCTCGCTTTTTAGGAAGCAT-3′ for p35, and hB7-1 F 5′-ACGAGTCGACATGGGCCACACACGGA-3′, hB7-1 R 5′-GGCGGCC GTTTCAGCCCCTTGCTTTT-3′, hB7. 1 Fb 5′-CTAGCTAGCATGGGCCACACACGG A-3′, and hB7.1 Rb 5′-CCGACGCGTTTTCAGCCCCTTGCTTCT-3′ for B7.1
       Next, PCR product of hB7-1 has been either excised at NheI/MluI or at Sall/Notl and inserted into MCS A site on pIRES excised with NheI/MluI or into MCS B site excised with SalI/NotI generating pIRES-hB7-1 (A) and pIRES-hB7-1(B).
       The cDNA for the single chain IL-12 fusion protein was constructed by linkage of the p40 and p35 cDNAs with a synthetic linker containing Ncol restriction sites. Previous studies suggested that an accessible N-terminus of the p40 subunit is important for IL-12 bioactivity. When the p35 subunit came before the p40 subunit, there was greatly decreased IL-12 activity, in contrast, when the subunits were reversed, with p40 in front of p35, the single chain IL-12 had biological activity comparable to rIL-12.
       We have utilized three different linkers that have 2, 3 and 4 repeats of Gly-Gly-Gly-Gly-Ser. The p40 from pGEM-hp40 has been amplified with linker-containing primers producing p40L2, p40L3, and p40L4. The p35 was cloned down stream of the linker with Ncol restriction site. These products (about 1.6 kb) have been cloned into pGEM-T easy vector generating pGEM-IL12.2, pGEM-IL12.3, and pGEM-IL12.4. PCR products from these three vectors were digested with SalI/NotI or NheI/MluI and ligated into the MCS B on pIRES-hB7-1(A) or into NheI/MluI site of MCS A on pIRES-hB7-1(B) generating 2 series (pIRES-hB7-1-IL12 and pIRES-IL12-hB7-1 series series) of 6 different expression vectors (pIRES-hB7-1-IL12.2, pIRES-hB7-1-IL12.3, and pIRES-hB7-1-IL12.4; pIRES-IL12.2-hB7-1, pIRES-IL12.3-hB7-1, and pIRES-IL12.4-hB7-1). Constructs were sequenced across all cloning junctions to determine the fidelity of recombination process.
       In some cases, peripheral blood monocytes were subjected to MACS purification to further enrich monocyte fraction. Cells stained with CD14-microbeads (Milteyi Biotec) in the presence of FcR blocking reagent (human IgG) for 30 minutes at 4° C. After washing cells applied to MS+ column which was subjected to magnetic fields. Cells remaining in the column were collected and washed once before undergoing culturing as indicated above.

EXAMPLE XI

       IL-12 Production by Gene-Modified COS Cells
       IL-12 in the culture supernatants of transfected cells was quantitated using human IL-12 p70 ELISA kit (Endogen). After transfection, culture supernatants of COS cells were harvested, filtered with 0.22 μm syringe filter, and stored until analyzing the quantity and quality of gene expression.
       While cells transfected with pIRES-hB7-1-IL12 series(pIRES-hB7-1-IL12.2, pIRES-hB7-1-IL12.3, and pIRES-hB7-1-IL12.4) produced 0.6-1.5 ng/ml, culture supernatants from cells with pIRES-IL12-hB7-1 series (pIRES-IL12.2-hB7-1, pIRES-IL12.3-hB7-1, and pIRES-IL12.4-hB7-1) contained 21-32 ng/ml of IL-12 according to ELISA analysis.
       All three vectors with different length of flexible linkers produced similar quantities of IL-12 indicating that the linker length did not influence the level of gene expression or conformation of IL-12.

EXAMPLE XII

       Bioactivity Measurement of IL-12 Produced by Transfected COS Cells
       Assays for IL-12 induced proliferation of PBMC were performed as previously described (Gately et al., 1991, J. Immunol. 147:874.). PBMC were isolated from normal blood donors by Ficoll-Hypaque density centrifugation (Pharmacia, Piscataway, N.J.). These cells were then depleted of monocytes by plastic adherence, and non adherent cells were resuspended at 5×105 cells/ml in RPMI-1640 supplemented with 10% FCS, penicillin, streptomycin, L-glutamine containing 2 μg/ml of PHA(Sigma) and were cultured for 3 days. Cells were then washed and recultured with rIL-2 (20 iu/ml) for an additional 48 hr. The PHA-blasts were then washed with acidified RPMI-1640 (pH 6.4) and rest for 4 hr in RPMI-1640 supplemented with 0.5% human AB serum. The cell concentration was adjusted to 2×106 cells/ml in complete medium. 50 μl of serial 1:5 dilutions of IL-12 standard (Leinco Tech) and culture supernatants of transfected cells were made in complete medium over a range of 0.16 pg to 200 pg/mi. Next, 50 μl cell suspension (PHA-blasts) was mixed with these serial diutions in triplicate in a flat-bottomed 96-well tissue culture plate. Neutralizing anti-IL-2 antibody was included to block IL-2-induced proliferation. [3H]-Thymidine (1 μCi/well, NEN, Boston, Mass.) was added to the wells after 36 hr of culture at 37 C, 5%CO2. After an additional 16 hr of incubation, cultures were harvested onto glass filters, and radioactivity was assessed by liquid scintillation.
       The bioactivity of IL-12 produced by vector-transfected cell lines was also measured using an assay based on the ability of this cytokine to stimulate production of IFN-γ by lymphocytes. PBMC activated with PHA 5 ug/ml for 5 days were collected, washed and restimulated with serial dilutions of standard IL-12 (1-1000 pg/ml) and culture supernatants from transfected cell lines stimulated. Following a 20 hr incubation, culture supernatants of PBMC were harvested and assayed for IFN-γ by ELISA (Endogen). The assay for IL-12 was quantified by comparing the amount of IFN-γ produced in the test samples with that induced by the recombinant IL-12 standards.
       In accordance with ELISA data, the bioactivity of IL-12 in the culture supernatants of pIRES-hB7-1-IL12 series was significantly lower than that of pIRES-IL12-hB7-1 series. In PHA-blast assays, IL-12 produced by transfected COS cells efficiently induced the proliferation of T cells and NK cells comparable to the standard IL-12. Moreover, in IFN-γ induction assay, the specific activity of IL-12 produced by vector transfected cells at equal concentration was almost identical or even better to rIL-12 standard.

EXAMPLE XII

       B7-1 Expression on Gene-Modified Cells by Flow Cytometric Analysis
       Assessment of B7-1 on vector transfected COS-7 was done with the aid of FITC-conjugated anti-B7-1 (Pharmingen, San Diego, Calif.). As a control, we used an appropriate FITC-conjugated isotype-matched normal IgG1 (Pharmingen). Flow cytometric analysis of 10,000 viable cells was conducted on a FACS vantage (Becton Dickinson). Each experiments was repeated at least three times, and the results of a representative experiment are provided in the form of a histogram.
       While a significant proportion (20-30%) of cells transfected with pIRES-B7-1-IL12 series expressed B7-1, little (less than 5%) was shown on cells transfected with pIRES-IL12-hB7-1 series.

EXAMPLE XIII

       B7-1-induced Costimulation
       T lymphocytes were purified with T cell enrichment kit (Miltenyi Biotech). Allogneic T lymphocytes (1×106 cells/ml) were co-cultured with irradiated (3000 rad) vector-transfected ME-180 (1, 2, 4, 8×104 cells/ml) in 96 well flat-bottomed plates in the presence of 1 ug/ml of PHA. In some cases, murine CD28-Fc (Chemicon) was included to determine the direct effects of B7-1 expressed on vector-transfected ME-180. 48 hr later, each well received 1 μCi [3H]-thymidine, and after an additional 16 hr, cultures were harvested onto glass filters, and radioactivity was assessed by liquid scintillation.
       While a significant T cell costimulatory activity (50-100 fold higher cpm over background) was shown by cells transfected with pIRES-B7-1-IL12 series, only a minor (3 fold higher cpm over background) was detected on cells transfected with pIRES-IL12-hB7-1 series. Furthermore, there was a dose-dependent inhibition of costimulatory activity by chimeric CD28-Fc protein, suggesting the major costimulation was provided directly by B7-1 on vector-transfected ME-180.

EXAMPLE XIV

       Cytotoxicity Assay
       PBMC (1×107 cells) from normal donors were stimulated with ME-180 in a volume of 10 ml. After 6 days, effector cells were collected, adjusted to 2×106 cells/ml, and titrated in triplicate in flat-bottomed plates to give the indicated E:T ratios along with vector-transfected ME-180 cells. Supernatants were collected after 4 hr and cytotoxicity was measured by the kit purchased from Roche (Manheim, Germany). Percentage-specific lysis was calculated according to the manufacturer's instructions.
       While a significant cytotoxicity was shown by PBMC co-cultured with ME-180 cells transfected with pIRES-B7-1, only a minor cytotoxicity was detected on PBMC co-cultured with ME-180. The co-toxic effects of PBMC co-cultured with ME-180 cells transfected with pIRES-B7-1 were exerted not only to pIRES-B7-1-IL12 series and pIRES-IL12-B7-1 series but also to unmodified ME-180, suggesting the maturation of effector CTL by B7-1+ ME-180 during co-culture and implicating the immunotherapeutic potential of ACV.
       Once generated, IL-12 and B7-1 genetically-modified cancer cells can, if desired, be injected by any standard method into the patient. Preferably, the cells are injected back into the same patient from whom the source cancer cells were obtained. The injection site may be subcutaneous, intraperitoneal, intramuscular, intradermal, or intravenous. The number of cells injected into the patient in need of treatment is according to standard protocols, e.g., 1×104 to 1×106 are injected back into the individual. The number of cells used for treating any of the disorders described herein varies depending upon the manner of administration, the age and the body weight of the subject, and the condition of the subject to be treated, and ultimately will be decided by the attending physician.
       The methods and media compositions described herein are useful for the culture and production of cancer cells and gene-modified cancer cells in vitro and in vivo. Given the capacity of gene-modified cancer cells to elicit strong antigen-specific helper and cytotoxic T cell responses, the ability to generate large numbers of homogeneous preparations of cancer cells facilitates the manipulation of these cells for the development of a variety of therapies, including without limitation, ex vivo and in vivo human therapies.
       The invention therefore encompasses the use of gene-modified cells produced according to the methods of the invention for any number of human or veterinary therapeutics. For example, vector types other than above mentioned combination of genes can be produced as described herein can be used in ex vivo cell transplantation therapies for the treatment of a variety of human diseases, e.g., disorders of the immune system. Accordingly, such cells are useful for modulating autoimmunity and limiting a variety of autoimmune diseases.
       Gene-modified cancer cells also find use in the ex vivo expansion of T cells, e.g., CD4+ cells or CD8+ cells or both. Thus, such cells are useful for stimulating the proliferation and reconstitution of CD4+ cells or CD8+ cells or both in a human having an immune disorder. Reconstitution of the immune system, e.g., a patient's CD4+ cells, is useful in immunotherapy for preventing, suppressing, or inhibiting a broad range of immunological disorders, e.g., as found during HIV infection.
       The cancer cells described herein are also useful for vaccine development. For example, administration of antigens (as a form of cell lysate) to immuno-competent host further facilitate the use of these cells for active immunization in situ.
       In addition, gene-modified cancer cells are useful for the generation of antibodies (e.g., monoclonal antibodies) that recognize cancer cell-specific markers. Anti-cancer cell antibodies are produced according to standard hybridoma technology. Such antibodies are useful for the evaluation and diagnosis of a variety of immunological disorders.
       In some embodiments, the ACV as described herein at therapeutically effective concentrations or dosages may be combined with a pharmaceutically or pharmacologically acceptable carrier, excipient or diluent, either biodegradable or non-biodegradable. Exemplary examples of carriers include, but are by no means limited to, for example, poly(ethylene-vinyl acetate), copolymers of lactic acid and glycolic acid, poly(lactic acid), gelatin, collagen matrices, polysaccharides, poly(D,L lactide), poly(malic acid), poly(caprolactone), celluloses, albumin, starch, casein, dextran, polyesters, ethanol, mathacrylate, polyurethane, polyethylene, vinyl polymers, glycols, mixtures thereof and the like. Standard excipients include gelatin, casein, lecithin, gum acacia, cholesterol, tragacanth, stearic acid, benzalkonium chloride, calcium stearate, glyceryl monostearate, cetostearyl alcohol, cetomacrogol emulsifying wax, sorbitan esters, polyoxyethylene alkyl ethers, polyoxyethylene castor oil derivatives, polyoxyethylene sorbitan fatty acid esters, polyethylene glycols, polyoxyethylene stearates, colloidol silicon dioxide, phosphates, sodium dodecylsulfate, carboxymethylcellulose calcium, carboxymethylcellulose sodium, methylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropylmethycellulose phthalate, noncrystalline cellulose, magnesium aluminum silicate, triethanolamine, polyvinyl alcohol, polyvinylpyrrolidone, sugars and starches. See, for example, Remington: The Science and Practice of Pharmacy, 2000, Gennaro, AR ed., Eaton, Pa.: Mack Publishing Co.
       The invention provides kits for carrying out the methods of the invention. Accordingly, a variety of kits are provided. The kits may be used for any one or more of the following (and, accordingly, may contain instructions for any one or more of the following uses): treating some forms of cancer in an individual; preventing the spread or metastasis of some forms of cancer; preventing one or more symptoms of some forms of cancer; reducing severity of one or more symptoms associated with cancer; delaying development of cancer in an individual; or vaccinating an individual against some forms of cancer.
       The kits of the invention comprise one or more containers comprising the ACV and a suitable excipient as described herein and a set of instructions, generally written instructions although electronic storage media (e.g., magnetic diskette or optical disk) containing instructions are also acceptable, relating to the use and dosage of the ACV for the intended treatment. The instructions included with the kit generally include information as to dosage, dosing schedule, and route of administration for the intended treatment. The containers of the ACV may be unit doses, bulk packages (e.g., multi-dose packages) or sub-unit doses.
       The ACV may be packaged in any convenient, appropriate packaging.
       As will be appreciated by one knowledgeable in the art, the anti-cancer vaccine may be combined or used in combination with other treatments known in the art.
       While the preferred embodiments of the invention have been described above, it will be recognized and understood that various modifications may be made therein, and the appended claims are intended to cover all such modifications which may fall within the spirit and scope of the invention.
Number of Sequences: 16
Sequence ID: 1
Length of Sequence: 8578
Sequence Type: DNA
Scientific Name: Artificial Sequence
Name/Key: enhancer
Location: (1)..(659)
Other Information: CMV IE
 1 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctaga tgggccacac acggaggcag ggaacatcac catccaagtg tccatacctc   1140 

aatttctttc agctcttggt gctggctggt ctttctcact tctgttcagg tgttatccac   1200 

gtgaccaagg aagtgaaaga agtggcaacg ctgtcctgtg gtcacaatgt ttctgttgaa   1260 

gagctggcac aaactcgcat ctactggcaa aaggagaaga aaatggtgct gactatgatg   1320 

tctggggaca tgaatatatg gcccgagtac aagaaccgga ccatctttga tatcactaat   1380 

aacctctcca ttgtgatcct ggctctgcgc ccatctgacg agggcacata cgagtgtgtt   1440 

gttctgaagt atgaaaaaga cgctttcaag cgggaacacc tggctgaagt gacgttatca   1500 

gtcaaagctg acttccctac acctagtata tctgactttg aaattccaac ttctaatatt   1560 

agaaggataa tttgctcaac ctctggaggt tttccagagc ctcacctctc ctggttggaa   1620 

aatggagaag aattaaatgc catcaacaca acagtttccc aagatcctga aactgagctc   1680 

tatgctgtta gcagcaaact ggatttcaat atgacaacca accacagctt catgtgtctc   1740 

atcaagtatg gacatttaag agtgaatcag accttcaact ggaatacaac caagcaagag   1800 

cattttcctg ataacctgct cccatcctgg gccattacct taatctcagt aaatggaatt   1860 

tttgtgatat gctgcctgac ctactgcttt gccccaagat gcagagagag aaggaggaat   1920 

gagagattga gaagggaaag tgtacgccct gtataacagt gtccgcagaa gcaaggggct   1980 

gaaaacgcgt cgagcatgca tctagggcgg ccaattccgc ccctctccct cccccccccc   2040 

taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgtgatt   2100 

ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt   2160 

gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt   2220 

cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct   2280 

ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt   2340 

ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt   2400 

ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa   2460 

ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta   2520 

gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt tcctttgaaa   2580 

aacacgatga taagcttgcc acaacccggg atcctctaga gtcgacggcc cagagcaag    2639 

atg tgt cac cag cag ttg gtc atc tct tgg ttt tcc ctg gtt ttt ctg     2687 
Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

gca tct ccc ctc gtg gcc ata tgg gaa ctg aag aaa gat gtt tat gtc     2735 
Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

gta gaa ttg gat tgg tat ccg gat gcc cct gga gaa atg gtg gtc ctc     2783 
Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

acc tgt gac acc cct gaa gaa gat ggt atc acc tgg acc ttg gac cag     2831 
Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

agc agt gag gtc tta ggc tct ggc aaa acc ctg acc atc caa gtc aaa     2879 
Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

gag ttt gga gat gct ggc cag tac acc tgt cac aaa gga ggc gag gtt     2927 
Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

cta agc cat tcg ctc ctg ctg ctt cac aaa aag gaa gat gga att tgg     2975 
Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

tcc act gat att tta aag gac cag aaa gaa ccc aaa aat aag acc ttt     3023 
Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

cta aga tgc gag gcc aag aat tat tct gga cgt ttc acc tgc tgg tgg     3071 
Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

ctg acg aca atc agt act gat ttg aca ttc agt gtc aaa agc agc aga     3119 
Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

ggc tct tct gac ccc caa ggg gtg acg tgc gga gct gct aca ctc tct     3167 
Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

gca gag aga gtc aga ggg gac aac aag gag tat gag tac tca gtg gag     3215 
Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

tgc cag gag gac agt gcc tgc cca gct gct gag gag agt ctg ccc att     3263 
Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

gag gtc atg gtg gat gcc gtt cac aag ctc aag tat gaa aac tac acc     3311 
Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

agc agc ttc ttc atc agg gac atc atc aaa cct gac cca ccc aac aac     3359 
Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

ttg cag ctg aag cca tta aag aat tct cgg cag gtg gag gtc agc tgg     3407 
Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

gag tac cct gac acc tgg agt act cca cat tcc tac ttc tcc ctg aca     3455 
Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

ttc tgc gtt cag gtc cag ggc aag agc aag aga gaa aag aaa gat aga     3503 
Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

gtc ttc acc gac aag acc tca gcc acg gtc atc tgc cgc aaa aat gcc     3551 
Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

agc att agc gtg cgg gcc cag gac cgc tac tat agc tca tct tgg agc     3599 
Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

gaa tgg gca tct gtg ccc tgc agt cca tgg aga aac ctc ccc gtg gcc     3647 
Glu Trp Ala Ser Val Pro Cys Ser Pro Trp Arg Asn Leu Pro Val Ala 
                325                 330                 335 

act cca gac cca gga atg ttc cca tgc ctt cac cac tcc caa aac ctg     3695 
Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu 
            340                 345                 350 

ctg agg gcc gtc agc aac atg ctc cag aag gcc aga caa act cta gaa     3743 
Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu 
        355                 360                 365 

ttt tac cct tgc act tct gaa gag att gat cat gaa gat atc aca aaa     3791 
Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys 
    370                 375                 380 

gat aaa acc agc aca gtg gag gcc tgt tta cca ttg gaa tta acc aag     3839 
Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys 
385                 390                 395                 400 

aat gag agt tgc cta aat tcc aga gag acc tct ttc ata act aat ggg     3887 
Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly 
                405                 410                 415 

agt tgc ctg gcc tcc aga aag acc tct ttt atg atg gcc ctg tgc ctt     3935 
Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu 
            420                 425                 430 

agt agt att tat gaa gac ttg aag atg tac cag gtg gag ttc aag acc     3983 
Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr 
        435                 440                 445 

atg aat gca aag ctt ctg atg gat cct aag agg cag atc ttt cta gat     4031 
Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp 
    450                 455                 460 

caa aac atg ctg gca gtt att gat gag ctg atg cag gcc ctg aat ttc     4079 
Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe 
465                 470                 475                 480 

aac agt gag act gtg cca caa aaa tcc tcc ctt gaa gaa ccg gat ttt     4127 
Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe 
                485                 490                 495 

tat aaa act aaa atc aag ctc tgc ata ctt ctt cat gct ttc aga att     4175 
Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile 
            500                 505                 510 

cgg gca gtg act att gat aga gtg atg agc tat ctg aat gct tcc taa     4223 
Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
        515                 520                 525 

aaagcgaggt gcggccgctt ccctttagtg agggttaatg cttcgagcag acatgataag   4283 

atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat gctttatttg   4343 

tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata aacaagttaa   4403 

caacaacaat tgcattcatt ttatgtttca ggttcagggg gagatgtggg aggtttttta   4463 

aagcaagtaa aacctctaca aatgtggtaa aatccgataa ggatcgatcc gggctggcgt   4523 

aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa   4583 

tggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga   4643 

ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg   4703 

ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat   4763 

ttagagcttt acggcacctc gaccgcaaaa aacttgattt gggtgatggt tcacgtagtg   4823 

ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata   4883 

gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt   4943 

tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaatat   5003 

ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc gcctgatgcg gtattttctc   5063 

cttacgcatc tgtgcggtat ttcacaccgc atacgcggat ctgcgcagca ccatggcctg   5123 

aaataacctc tgaaagagga acttggttag gtaccttctg aggcggaaag aaccagctgt   5183 

ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc   5243 

aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc tccccagcag   5303 

gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg cccctaactc   5363 

cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa   5423 

ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt   5483 

gaggaggctt ttttggaggc ctaggctttt gcaaaaagct tgattcttct gacacaacag   5543 

tctcgaactt aaggctagag ccaccatgat tgaacaagat ggattgcacg caggttctcc   5603 

ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa tcggctgctc   5663 

tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg tcaagaccga   5723 

cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt ggctggccac   5783 

gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa gggactggct   5843 

gctattgggc gaagtgccgg ggcaggatct cctgtcatct caccttgctc ctgccgagaa   5903 

agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg ctacctgccc   5963 

attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg aagccggtct   6023 

tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg aactgttcgc   6083 

caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg   6143 

cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact gtggccggct   6203 

gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg ctgaagagct   6263 

tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc ccgattcgca   6323 

gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga gcgggactct ggggttcgaa   6383 

atgaccgacc aagcgacgcc caacctgcca tcacgatggc cgcaataaaa tatctttatt   6443 

ttcattacat ctgtgtgttg gttttttgtg tgaatcgata gcgataagga tccgcgtatg   6503 

gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc   6563 

aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc   6623 

tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc   6683 

gagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt   6743 

ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt   6803 

tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca   6863 

ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt   6923 

ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga   6983 

tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa   7043 

gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct   7103 

gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat   7163 

acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga   7223 

tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc   7283 

caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat   7343 

gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa   7403 

cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac   7463 

tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa   7523 

agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc   7583 

tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc   7643 

ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag   7703 

acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta   7763 

ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa   7823 

gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc   7883 

gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat   7943 

ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga   8003 

gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt   8063 

ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata   8123 

cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac   8183 

cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg   8243 

ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg   8303 

tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag   8363 

cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct   8423 

ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc   8483 

aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt   8543 

ttgctggcct tttgctcaca tggctcgaca gatct                              8578 

Sequence ID: 2
Length of Sequence: 527
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence Plasmid
 2 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Pro Trp Arg Asn Leu Pro Val Ala 
                325                 330                 335 

Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu 
            340                 345                 350 

Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu 
        355                 360                 365 

Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys 
    370                 375                 380 

Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys 
385                 390                 395                 400 

Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly 
                405                 410                 415 

Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu 
            420                 425                 430 

Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr 
        435                 440                 445 

Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp 
    450                 455                 460 

Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe 
465                 470                 475                 480 

Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe 
                485                 490                 495 

Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile 
            500                 505                 510 

Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
        515                 520                 525 

Sequence ID: 3
Length of Sequence: 8578
Sequence Type: DNA
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 3 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctaga tgggccacac acggaggcag ggaacatcac catccaagtg tccatacctc   1140 

aatttctttc agctcttggt gctggctggt ctttctcact tctgttcagg tgttatccac   1200 

gtgaccaagg aagtgaaaga agtggcaacg ctgtcctgtg gtcacaatgt ttctgttgaa   1260 

gagctggcac aaactcgcat ctactggcaa aaggagaaga aaatggtgct gactatgatg   1320 

tctggggaca tgaatatatg gcccgagtac aagaaccgga ccatctttga tatcactaat   1380 

aacctctcca ttgtgatcct ggctctgcgc ccatctgacg agggcacata cgagtgtgtt   1440 

gttctgaagt atgaaaaaga cgctttcaag cgggaacacc tggctgaagt gacgttatca   1500 

gtcaaagctg acttccctac acctagtata tctgactttg aaattccaac ttctaatatt   1560 

agaaggataa tttgctcaac ctctggaggt tttccagagc ctcacctctc ctggttggaa   1620 

aatggagaag aattaaatgc catcaacaca acagtttccc aagatcctga aactgagctc   1680 

tatgctgtta gcagcaaact ggatttcaat atgacaacca accacagctt catgtgtctc   1740 

atcaagtatg gacatttaag agtgaatcag accttcaact ggaatacaac caagcaagag   1800 

cattttcctg ataacctgct cccatcctgg gccattacct taatctcagt aaatggaatt   1860 

tttgtgatat gctgcctgac ctactgcttt gccccaagat gcagagagag aaggaggaat   1920 

gagagattga gaagggaaag tgtacgccct gtataacagt gtccgcagaa gcaaggggct   1980 

gaaaacgcgt cgagcatgca tctagggcgg ccaattccgc ccctctccct cccccccccc   2040 

taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgtgatt   2100 

ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt   2160 

gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt   2220 

cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct   2280 

ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt   2340 

ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt   2400 

ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa   2460 

ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta   2520 

gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt tcctttgaaa   2580 

aacacgatga taagcttgcc acaacccggg atcctctaga gtcgacggcc cagagcaag    2639 

atg tgt cac cag cag ttg gtc atc tct tgg ttt tcc ctg gtt ttt ctg     2687 
Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

gca tct ccc ctc gtg gcc ata tgg gaa ctg aag aaa gat gtt tat gtc     2735 
Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

gta gaa ttg gat tgg tat ccg gat gcc cct gga gaa atg gtg gtc ctc     2783 
Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

acc tgt gac acc cct gaa gaa gat ggt atc acc tgg acc ttg gac cag     2831 
Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

agc agt gag gtc tta ggc tct ggc aaa acc ctg acc atc caa gtc aaa     2879 
Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

gag ttt gga gat gct ggc cag tac acc tgt cac aaa gga ggc gag gtt     2927 
Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

cta agc cat tcg ctc ctg ctg ctt cac aaa aag gaa gat gga att tgg     2975 
Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

tcc act gat att tta aag gac cag aaa gaa ccc aaa aat aag acc ttt     3023 
Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

cta aga tgc gag gcc aag aat tat tct gga cgt ttc acc tgc tgg tgg     3071 
Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

ctg acg aca atc agt act gat ttg aca ttc agt gtc aaa agc agc aga     3119 
Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

ggc tct tct gac ccc caa ggg gtg acg tgc gga gct gct aca ctc tct     3167 
Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

gca gag aga gtc aga ggg gac aac aag gag tat gag tac tca gtg gag     3215 
Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

tgc cag gag gac agt gcc tgc cca gct gct gag gag agt ctg ccc att     3263 
Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

gag gtc atg gtg gat gcc gtt cac aag ctc aag tat gaa aac tac acc     3311 
Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

agc agc ttc ttc atc agg gac atc atc aaa cct gac cca ccc aac aac     3359 
Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

ttg cag ctg aag cca tta aag aat tct cgg cag gtg gag gtc agc tgg     3407 
Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

gag tac cct gac acc tgg agt act cca cat tcc tac ttc tcc ctg aca     3455 
Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

ttc tgc gtt cag gtc cag ggc aag agc aag aga gaa aag aaa gat aga     3503 
Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

gtc ttc acc gac aag acc tca gcc acg gtc atc tgc cgc aaa aat gcc     3551 
Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

agc att agc gtg cgg gcc cag gac cgc tac tat agc tca tct tgg agc     3599 
Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

gaa tgg gca tct gtg ccc tgc agt cca tgg aga aac ctc ccc gtg gcc     3647 
Glu Trp Ala Ser Val Pro Cys Ser Pro Trp Arg Asn Leu Pro Val Ala 
                325                 330                 335 

act cca gac cca gga atg ttc cca tgc ctt cac cac tcc caa aac ctg     3695 
Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu 
            340                 345                 350 

ctg agg gcc gtc agc aac atg ctc cag aag gcc aga caa act cta gaa     3743 
Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu 
        355                 360                 365 

ttt tac cct tgc act tct gaa gag att gat cat gaa gat atc aca aaa     3791 
Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys 
    370                 375                 380 

gat aaa acc agc aca gtg gag gcc tgt tta cca ttg gaa tta acc aag     3839 
Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys 
385                 390                 395                 400 

aat gag agt tgc cta aat tcc aga gag acc tct ttc ata act aat ggg     3887 
Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly 
                405                 410                 415 

agt tgc ctg gcc tcc aga aag acc tct ttt atg atg gcc ctg tgc ctt     3935 
Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu 
            420                 425                 430 

agt agt att tat gaa gac ttg aag atg tac cag gtg gag ttc aag acc     3983 
Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr 
        435                 440                 445 

atg aat gca aag ctt ctg atg gat cct aag agg cag atc ttt cta gat     4031 
Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp 
    450                 455                 460 

caa aac atg ctg gca gtt att gat gag ctg atg cag gcc ctg aat ttc     4079 
Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe 
465                 470                 475                 480 

aac agt gag act gtg cca caa aaa tcc tcc ctt gaa gaa ccg gat ttt     4127 
Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe 
                485                 490                 495 

tat aaa act aaa atc aag ctc tgc ata ctt ctt cat gct ttc aga att     4175 
Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile 
            500                 505                 510 

cgg gca gtg act att gat aga gtg atg agc tat ctg aat gct             4217 
Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala 
        515                 520                 525 

tcctaaaaag cgaggtgcgg ccgcttccct ttagtgaggg ttaatgcttc gagcagacat   4277 

gataagatac attgatgagt ttggacaaac cacaactaga atgcagtgaa aaaaatgctt   4337 

tatttgtgaa atttgtgatg ctattgcttt atttgtaacc attataagct gcaataaaca   4397 

agttaacaac aacaattgca ttcattttat gtttcaggtt cagggggaga tgtgggaggt   4457 

tttttaaagc aagtaaaacc tctacaaatg tggtaaaatc cgataaggat cgatccgggc   4517 

tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat   4577 

ggcgaatgga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca   4637 

gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct   4697 

ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt   4757 

tccgatttag agctttacgg cacctcgacc gcaaaaaact tgatttgggt gatggttcac   4817 

gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct   4877 

ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt   4937 

ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac   4997 

aaatatttaa cgcgaatttt aacaaaatat taacgtttac aatttcgcct gatgcggtat   5057 

tttctcctta cgcatctgtg cggtatttca caccgcatac gcggatctgc gcagcaccat   5117 

ggcctgaaat aacctctgaa agaggaactt ggttaggtac cttctgaggc ggaaagaacc   5177 

agctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc aggctcccca gcaggcagaa   5237 

gtatgcaaag catgcatctc aattagtcag caaccaggtg tggaaagtcc ccaggctccc   5297 

cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccata gtcccgcccc   5357 

taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg ccccatggct   5417 

gactaatttt ttttatttat gcagaggccg aggccgcctc ggcctctgag ctattccaga   5477 

agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagcttgat tcttctgaca   5537 

caacagtctc gaacttaagg ctagagccac catgattgaa caagatggat tgcacgcagg   5597 

ttctccggcc gcttgggtgg agaggctatt cggctatgac tgggcacaac agacaatcgg   5657 

ctgctctgat gccgccgtgt tccggctgtc agcgcagggg cgcccggttc tttttgtcaa   5717 

gaccgacctg tccggtgccc tgaatgaact gcaggacgag gcagcgcggc tatcgtggct   5777 

ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga   5837 

ctggctgcta ttgggcgaag tgccggggca ggatctcctg tcatctcacc ttgctcctgc   5897 

cgagaaagta tccatcatgg ctgatgcaat gcggcggctg catacgcttg atccggctac   5957 

ctgcccattc gaccaccaag cgaaacatcg catcgagcga gcacgtactc ggatggaagc   6017 

cggtcttgtc gatcaggatg atctggacga agagcatcag gggctcgcgc cagccgaact   6077 

gttcgccagg ctcaaggcgc gcatgcccga cggcgaggat ctcgtcgtga cccatggcga   6137 

tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt tctggattca tcgactgtgg   6197 

ccggctgggt gtggcggacc gctatcagga catagcgttg gctacccgtg atattgctga   6257 

agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga   6317 

ttcgcagcgc atcgccttct atcgccttct tgacgagttc ttctgagcgg gactctgggg   6377 

ttcgaaatga ccgaccaagc gacgcccaac ctgccatcac gatggccgca ataaaatatc   6437 

tttattttca ttacatctgt gtgttggttt tttgtgtgaa tcgatagcga taaggatccg   6497 

cgtatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca   6557 

cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag   6617 

acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa   6677 

acgcgcgaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat   6737 

aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg   6797 

tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat   6857 

gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat   6917 

tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt   6977 

aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag   7037 

cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa   7097 

agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg   7157 

ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct   7217 

tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac   7277 

tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca   7337 

caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat   7397 

accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact   7457 

attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc   7517 

ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga   7577 

taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg   7637 

taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg   7697 

aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca   7757 

agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta   7817 

ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca   7877 

ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg   7937 

cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga   7997 

tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa   8057 

tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc   8117 

tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg   8177 

tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac   8237 

ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct   8297 

acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc   8357 

ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg   8417 

gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg   8477 

ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct   8537 

ggccttttgc tggccttttg ctcacatggc tcgacagatc t                       8578 

Sequence ID: 4
Length of Sequence: 526
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 4 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Pro Trp Arg Asn Leu Pro Val Ala 
                325                 330                 335 

Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu 
            340                 345                 350 

Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu 
        355                 360                 365 

Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys 
    370                 375                 380 

Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys 
385                 390                 395                 400 

Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly 
                405                 410                 415 

Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu 
            420                 425                 430 

Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr 
        435                 440                 445 

Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp 
    450                 455                 460 

Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe 
465                 470                 475                 480 

Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe 
                485                 490                 495 

Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile 
            500                 505                 510 

Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala 
        515                 520                 525 

Sequence ID: 5
Length of Sequence: 8623
Sequence Type: DNA
Scientific Name: Artificial Sequence
Name/Key: misc_feature
Location: (1)..(750)
Other Information: CMV enhancer and promoter
 5 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctaga tgggccacac acggaggcag ggaacatcac catccaagtg tccatacctc   1140 

aatttctttc agctcttggt gctggctggt ctttctcact tctgttcagg tgttatccac   1200 

gtgaccaagg aagtgaaaga agtggcaacg ctgtcctgtg gtcacaatgt ttctgttgaa   1260 

gagctggcac aaactcgcat ctactggcaa aaggagaaga aaatggtgct gactatgatg   1320 

tctggggaca tgaatatatg gcccgagtac aagaaccgga ccatctttga tatcactaat   1380 

aacctctcca ttgtgatcct ggctctgcgc ccatctgacg agggcacata cgagtgtgtt   1440 

gttctgaagt atgaaaaaga cgctttcaag cgggaacacc tggctgaagt gacgttatca   1500 

gtcaaagctg acttccctac acctagtata tctgactttg aaattccaac ttctaatatt   1560 

agaaggataa tttgctcaac ctctggaggt tttccagagc ctcacctctc ctggttggaa   1620 

aatggagaag aattaaatgc catcaacaca acagtttccc aagatcctga aactgagctc   1680 

tatgctgtta gcagcaaact ggatttcaat atgacaacca accacagctt catgtgtctc   1740 

atcaagtatg gacatttaag agtgaatcag accttcaact ggaatacaac caagcaagag   1800 

cattttcctg ataacctgct cccatcctgg gccattacct taatctcagt aaatggaatt   1860 

tttgtgatat gctgcctgac ctactgcttt gccccaagat gcagagagag aaggaggaat   1920 

gagagattga gaagggaaag tgtacgccct gtataacagt gtccgcagaa gcaaggggct   1980 

gaaaacgcgt cgagcatgca tctagggcgg ccaattccgc ccctctccct cccccccccc   2040 

taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgtgatt   2100 

ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt   2160 

gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt   2220 

cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct   2280 

ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt   2340 

ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt   2400 

ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa   2460 

ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta   2520 

gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt tcctttgaaa   2580 

aacacgatga taagcttgcc acaacccggg atcctctaga gtcgacggcc cagagcaag    2639 

atg tgt cac cag cag ttg gtc atc tct tgg ttt tcc ctg gtt ttt ctg     2687 
Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

gca tct ccc ctc gtg gcc ata tgg gaa ctg aag aaa gat gtt tat gtc     2735 
Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

gta gaa ttg gat tgg tat ccg gat gcc cct gga gaa atg gtg gtc ctc     2783 
Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

acc tgt gac acc cct gaa gaa gat ggt atc acc tgg acc ttg gac cag     2831 
Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

agc agt gag gtc tta ggc tct ggc aaa acc ctg acc atc caa gtc aaa     2879 
Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

gag ttt gga gat gct ggc cag tac acc tgt cac aaa gga ggc gag gtt     2927 
Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

cta agc cat tcg ctc ctg ctg ctt cac aaa aag gaa gat gga att tgg     2975 
Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

tcc act gat att tta aag gac cag aaa gaa ccc aaa aat aag acc ttt     3023 
Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

cta aga tgc gag gcc aag aat tat tct gga cgt ttc acc tgc tgg tgg     3071 
Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

ctg acg aca atc agt act gat ttg aca ttc agt gtc aaa agc agc aga     3119 
Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

ggc tct tct gac ccc caa ggg gtg acg tgc gga gct gct aca ctc tct     3167 
Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

gca gag aga gtc aga ggg gac aac aag gag tat gag tac tca gtg gag     3215 
Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

tgc cag gag gac agt gcc tgc cca gct gct gag gag agt ctg ccc att     3263 
Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

gag gtc atg gtg gat gcc gtt cac aag ctc aag tat gaa aac tac acc     3311 
Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

agc agc ttc ttc atc agg gac atc atc aaa cct gac cca ccc aac aac     3359 
Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

ttg cag ctg aag cca tta aag aat tct cgg cag gtg gag gtc agc tgg     3407 
Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

gag tac cct gac acc tgg agt act cca cat tcc tac ttc tcc ctg aca     3455 
Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

ttc tgc gtt cag gtc cag ggc aag agc aag aga gaa aag aaa gat aga     3503 
Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

gtc ttc acc gac aag acc tca gcc acg gtc atc tgc cgc aaa aat gcc     3551 
Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

agc att agc gtg cgg gcc cag gac cgc tac tat agc tca tct tgg agc     3599 
Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

gaa tgg gca tct gtg ccc tgc agt ggt ggc ggt gga agc ggt ggc ggt     3647 
Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

gga agc ggt ggc ggt gga agc cca tgg aga aac ctc ccc gtg gcc act     3695 
Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr 
            340                 345                 350 

cca gac cca gga atg ttc cca tgc ctt cac cac tcc caa aac ctg ctg     3743 
Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu Leu 
        355                 360                 365 

agg gcc gtc agc aac atg ctc cag aag gcc aga caa act cta gaa ttt     3791 
Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe 
    370                 375                 380 

tac cct tgc act tct gaa gag att gat cat gaa gat atc aca aaa gat     3839 
Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp 
385                 390                 395                 400 

aaa acc agc aca gtg gag gcc tgt tta cca ttg gaa tta acc aag aat     3887 
Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn 
                405                 410                 415 

gag agt tgc cta aat tcc aga gag acc tct ttc ata act aat ggg agt     3935 
Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser 
            420                 425                 430 

tgc ctg gcc tcc aga aag acc tct ttt atg atg gcc ctg tgc ctt agt     3983 
Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser 
        435                 440                 445 

agt att tat gaa gac ttg aag atg tac cag gtg gag ttc aag acc atg     4031 
Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met 
    450                 455                 460 

aat gca aag ctt ctg atg gat cct aag agg cag atc ttt cta gat caa     4079 
Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln 
465                 470                 475                 480 

aac atg ctg gca gtt att gat gag ctg atg cag gcc ctg aat ttc aac     4127 
Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn 
                485                 490                 495 

agt gag act gtg cca caa aaa tcc tcc ctt gaa gaa ccg gat ttt tat     4175 
Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr 
            500                 505                 510 

aaa act aaa atc aag ctc tgc ata ctt ctt cat gct ttc aga att cgg     4223 
Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg 
        515                 520                 525 

gca gtg act att gat aga gtg atg agc tat ctg aat gct tcc taa         4268 
Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
    530                 535                 540 

aaagcgaggt gcggccgctt ccctttagtg agggttaatg cttcgagcag acatgataag   4328 

atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat gctttatttg   4388 

tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata aacaagttaa   4448 

caacaacaat tgcattcatt ttatgtttca ggttcagggg gagatgtggg aggtttttta   4508 

aagcaagtaa aacctctaca aatgtggtaa aatccgataa ggatcgatcc gggctggcgt   4568 

aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa   4628 

tggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga   4688 

ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg   4748 

ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat   4808 

ttagagcttt acggcacctc gaccgcaaaa aacttgattt gggtgatggt tcacgtagtg   4868 

ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata   4928 

gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt   4988 

tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaatat   5048 

ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc gcctgatgcg gtattttctc   5108 

cttacgcatc tgtgcggtat ttcacaccgc atacgcggat ctgcgcagca ccatggcctg   5168 

aaataacctc tgaaagagga acttggttag gtaccttctg aggcggaaag aaccagctgt   5228 

ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc   5288 

aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc tccccagcag   5348 

gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg cccctaactc   5408 

cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa   5468 

ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt   5528 

gaggaggctt ttttggaggc ctaggctttt gcaaaaagct tgattcttct gacacaacag   5588 

tctcgaactt aaggctagag ccaccatgat tgaacaagat ggattgcacg caggttctcc   5648 

ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa tcggctgctc   5708 

tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg tcaagaccga   5768 

cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt ggctggccac   5828 

gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa gggactggct   5888 

gctattgggc gaagtgccgg ggcaggatct cctgtcatct caccttgctc ctgccgagaa   5948 

agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg ctacctgccc   6008 

attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg aagccggtct   6068 

tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg aactgttcgc   6128 

caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg   6188 

cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact gtggccggct   6248 

gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg ctgaagagct   6308 

tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc ccgattcgca   6368 

gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga gcgggactct ggggttcgaa   6428 

atgaccgacc aagcgacgcc caacctgcca tcacgatggc cgcaataaaa tatctttatt   6488 

ttcattacat ctgtgtgttg gttttttgtg tgaatcgata gcgataagga tccgcgtatg   6548 

gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc   6608 

aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc   6668 

tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc   6728 

gagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt   6788 

ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt   6848 

tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca   6908 

ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt   6968 

ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga   7028 

tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa   7088 

gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct   7148 

gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat   7208 

acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga   7268 

tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc   7328 

caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat   7388 

gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa   7448 

cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac   7508 

tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa   7568 

agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc   7628 

tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc   7688 

ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag   7748 

acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta   7808 

ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa   7868 

gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc   7928 

gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat   7988 

ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga   8048 

gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt   8108 

ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata   8168 

cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac   8228 

cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg   8288 

ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg   8348 

tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag   8408 

cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct   8468 

ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc   8528 

aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt   8588 

ttgctggcct tttgctcaca tggctcgaca gatct                              8623 

Sequence ID: 6
Length of Sequence: 542
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 6 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr 
            340                 345                 350 

Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu Leu 
        355                 360                 365 

Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe 
    370                 375                 380 

Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp 
385                 390                 395                 400 

Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn 
                405                 410                 415 

Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser 
            420                 425                 430 

Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser 
        435                 440                 445 

Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met 
    450                 455                 460 

Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln 
465                 470                 475                 480 

Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn 
                485                 490                 495 

Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr 
            500                 505                 510 

Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg 
        515                 520                 525 

Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
    530                 535                 540 

Sequence ID: 7
Length of Sequence: 8608
Sequence Type: DNA
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 7 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctaga tgggccacac acggaggcag ggaacatcac catccaagtg tccatacctc   1140 

aatttctttc agctcttggt gctggctggt ctttctcact tctgttcagg tgttatccac   1200 

gtgaccaagg aagtgaaaga agtggcaacg ctgtcctgtg gtcacaatgt ttctgttgaa   1260 

gagctggcac aaactcgcat ctactggcaa aaggagaaga aaatggtgct gactatgatg   1320 

tctggggaca tgaatatatg gcccgagtac aagaaccgga ccatctttga tatcactaat   1380 

aacctctcca ttgtgatcct ggctctgcgc ccatctgacg agggcacata cgagtgtgtt   1440 

gttctgaagt atgaaaaaga cgctttcaag cgggaacacc tggctgaagt gacgttatca   1500 

gtcaaagctg acttccctac acctagtata tctgactttg aaattccaac ttctaatatt   1560 

agaaggataa tttgctcaac ctctggaggt tttccagagc ctcacctctc ctggttggaa   1620 

aatggagaag aattaaatgc catcaacaca acagtttccc aagatcctga aactgagctc   1680 

tatgctgtta gcagcaaact ggatttcaat atgacaacca accacagctt catgtgtctc   1740 

atcaagtatg gacatttaag agtgaatcag accttcaact ggaatacaac caagcaagag   1800 

cattttcctg ataacctgct cccatcctgg gccattacct taatctcagt aaatggaatt   1860 

tttgtgatat gctgcctgac ctactgcttt gccccaagat gcagagagag aaggaggaat   1920 

gagagattga gaagggaaag tgtacgccct gtataacagt gtccgcagaa gcaaggggct   1980 

gaaaacgcgt cgagcatgca tctagggcgg ccaattccgc ccctctccct cccccccccc   2040 

taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgtgatt   2100 

ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt   2160 

gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt   2220 

cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct   2280 

ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt   2340 

ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt   2400 

ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa   2460 

ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta   2520 

gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt tcctttgaaa   2580 

aacacgatga taagcttgcc acaacccggg atcctctaga gtcgacggcc cagagcaag    2639 

atg tgt cac cag cag ttg gtc atc tct tgg ttt tcc ctg gtt ttt ctg     2687 
Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

gca tct ccc ctc gtg gcc ata tgg gaa ctg aag aaa gat gtt tat gtc     2735 
Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

gta gaa ttg gat tgg tat ccg gat gcc cct gga gaa atg gtg gtc ctc     2783 
Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

acc tgt gac acc cct gaa gaa gat ggt atc acc tgg acc ttg gac cag     2831 
Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

agc agt gag gtc tta ggc tct ggc aaa acc ctg acc atc caa gtc aaa     2879 
Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

gag ttt gga gat gct ggc cag tac acc tgt cac aaa gga ggc gag gtt     2927 
Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

cta agc cat tcg ctc ctg ctg ctt cac aaa aag gaa gat gga att tgg     2975 
Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

tcc act gat att tta aag gac cag aaa gaa ccc aaa aat aag acc ttt     3023 
Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

cta aga tgc gag gcc aag aat tat tct gga cgt ttc acc tgc tgg tgg     3071 
Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

ctg acg aca atc agt act gat ttg aca ttc agt gtc aaa agc agc aga     3119 
Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

ggc tct tct gac ccc caa ggg gtg acg tgc gga gct gct aca ctc tct     3167 
Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

gca gag aga gtc aga ggg gac aac aag gag tat gag tac tca gtg gag     3215 
Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

tgc cag gag gac agt gcc tgc cca gct gct gag gag agt ctg ccc att     3263 
Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

gag gtc atg gtg gat gcc gtt cac aag ctc aag tat gaa aac tac acc     3311 
Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

agc agc ttc ttc atc agg gac atc atc aaa cct gac cca ccc aac aac     3359 
Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

ttg cag ctg aag cca tta aag aat tct cgg cag gtg gag gtc agc tgg     3407 
Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

gag tac cct gac acc tgg agt act cca cat tcc tac ttc tcc ctg aca     3455 
Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

ttc tgc gtt cag gtc cag ggc aag agc aag aga gaa aag aaa gat aga     3503 
Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

gtc ttc acc gac aag acc tca gcc acg gtc atc tgc cgc aaa aat gcc     3551 
Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

agc att agc gtg cgg gcc cag gac cgc tac tat agc tca tct tgg agc     3599 
Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

gaa tgg gca tct gtg ccc tgc agt ggt ggc ggt gga agc ggt ggc ggt     3647 
Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

gga agc cca tgg aga aac ctc ccc gtg gcc act cca gac cca gga atg     3695 
Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met 
            340                 345                 350 

ttc cca tgc ctt cac cac tcc caa aac ctg ctg agg gcc gtc agc aac     3743 
Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg Ala Val Ser Asn 
        355                 360                 365 

atg ctc cag aag gcc aga caa act cta gaa ttt tac cct tgc act tct     3791 
Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser 
    370                 375                 380 

gaa gag att gat cat gaa gat atc aca aaa gat aaa acc agc aca gtg     3839 
Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys Thr Ser Thr Val 
385                 390                 395                 400 

gag gcc tgt tta cca ttg gaa tta acc aag aat gag agt tgc cta aat     3887 
Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn 
                405                 410                 415 

tcc aga gag acc tct ttc ata act aat ggg agt tgc ctg gcc tcc aga     3935 
Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg 
            420                 425                 430 

aag acc tct ttt atg atg gcc ctg tgc ctt agt agt att tat gaa gac     3983 
Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp 
        435                 440                 445 

ttg aag atg tac cag gtg gag ttc aag acc atg aat gca aag ctt ctg     4031 
Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn Ala Lys Leu Leu 
    450                 455                 460 

atg gat cct aag agg cag atc ttt cta gat caa aac atg ctg gca gtt     4079 
Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn Met Leu Ala Val 
465                 470                 475                 480 

att gat gag ctg atg cag gcc ctg aat ttc aac agt gag act gtg cca     4127 
Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser Glu Thr Val Pro 
                485                 490                 495 

caa aaa tcc tcc ctt gaa gaa ccg gat ttt tat aaa act aaa atc aag     4175 
Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys 
            500                 505                 510 

ctc tgc ata ctt ctt cat gct ttc aga att cgg gca gtg act att gat     4223 
Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala Val Thr Ile Asp 
        515                 520                 525 

aga gtg atg agc tat ctg aat gct tcc taa aaagcgaggt gcggccgctt       4273 
Arg Val Met Ser Tyr Leu Asn Ala Ser 
    530                 535 

ccctttagtg agggttaatg cttcgagcag acatgataag atacattgat gagtttggac   4333 

aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg   4393 

ctttatttgt aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt   4453 

ttatgtttca ggttcagggg gagatgtggg aggtttttta aagcaagtaa aacctctaca   4513 

aatgtggtaa aatccgataa ggatcgatcc gggctggcgt aatagcgaag aggcccgcac   4573 

cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tggacgcgcc ctgtagcggc   4633 

gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc   4693 

ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc   4753 

cgtcaagctc taaatcgggg gctcccttta gggttccgat ttagagcttt acggcacctc   4813 

gaccgcaaaa aacttgattt gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg   4873 

gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact   4933 

ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt   4993 

tcggcctatt ggttaaaaaa tgagctgatt taacaaatat ttaacgcgaa ttttaacaaa   5053 

atattaacgt ttacaatttc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat   5113 

ttcacaccgc atacgcggat ctgcgcagca ccatggcctg aaataacctc tgaaagagga   5173 

acttggttag gtaccttctg aggcggaaag aaccagctgt ggaatgtgtg tcagttaggg   5233 

tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag   5293 

tcagcaacca ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg   5353 

catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact   5413 

ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag   5473 

gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc   5533 

ctaggctttt gcaaaaagct tgattcttct gacacaacag tctcgaactt aaggctagag   5593 

ccaccatgat tgaacaagat ggattgcacg caggttctcc ggccgcttgg gtggagaggc   5653 

tattcggcta tgactgggca caacagacaa tcggctgctc tgatgccgcc gtgttccggc   5713 

tgtcagcgca ggggcgcccg gttctttttg tcaagaccga cctgtccggt gccctgaatg   5773 

aactgcagga cgaggcagcg cggctatcgt ggctggccac gacgggcgtt ccttgcgcag   5833 

ctgtgctcga cgttgtcact gaagcgggaa gggactggct gctattgggc gaagtgccgg   5893 

ggcaggatct cctgtcatct caccttgctc ctgccgagaa agtatccatc atggctgatg   5953 

caatgcggcg gctgcatacg cttgatccgg ctacctgccc attcgaccac caagcgaaac   6013 

atcgcatcga gcgagcacgt actcggatgg aagccggtct tgtcgatcag gatgatctgg   6073 

acgaagagca tcaggggctc gcgccagccg aactgttcgc caggctcaag gcgcgcatgc   6133 

ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg cttgccgaat atcatggtgg   6193 

aaaatggccg cttttctgga ttcatcgact gtggccggct gggtgtggcg gaccgctatc   6253 

aggacatagc gttggctacc cgtgatattg ctgaagagct tggcggcgaa tgggctgacc   6313 

gcttcctcgt gctttacggt atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc   6373 

ttcttgacga gttcttctga gcgggactct ggggttcgaa atgaccgacc aagcgacgcc   6433 

caacctgcca tcacgatggc cgcaataaaa tatctttatt ttcattacat ctgtgtgttg   6493 

gttttttgtg tgaatcgata gcgataagga tccgcgtatg gtgcactctc agtacaatct   6553 

gctctgatgc cgcatagtta agccagcccc gacacccgcc aacacccgct gacgcgccct   6613 

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct   6673 

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga   6733 

tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca   6793 

cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata   6853 

tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga   6913 

gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc   6973 

ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg   7033 

cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc   7093 

ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat   7153 

cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact   7213 

tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat   7273 

tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga   7333 

tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc   7393 

ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga   7453 

tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag   7513 

cttcccggca acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc   7573 

gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt   7633 

ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct   7693 

acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg   7753 

cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg   7813 

atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca   7873 

tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga   7933 

tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa   7993 

aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga   8053 

aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt   8113 

taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt   8173 

taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat   8233 

agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct   8293 

tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca   8353 

cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag   8413 

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc   8473 

gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga   8533 

aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca   8593 

tggctcgaca gatct                                                    8608 

Sequence ID: 8
Length of Sequence: 537
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 8 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met 
            340                 345                 350 

Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg Ala Val Ser Asn 
        355                 360                 365 

Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser 
    370                 375                 380 

Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys Thr Ser Thr Val 
385                 390                 395                 400 

Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn 
                405                 410                 415 

Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg 
            420                 425                 430 

Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp 
        435                 440                 445 

Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn Ala Lys Leu Leu 
    450                 455                 460 

Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn Met Leu Ala Val 
465                 470                 475                 480 

Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser Glu Thr Val Pro 
                485                 490                 495 

Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys 
            500                 505                 510 

Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala Val Thr Ile Asp 
        515                 520                 525 

Arg Val Met Ser Tyr Leu Asn Ala Ser 
    530                 535 

Sequence ID: 9
Length of Sequence: 8638
Sequence Type: DNA
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 9 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctaga tgggccacac acggaggcag ggaacatcac catccaagtg tccatacctc   1140 

aatttctttc agctcttggt gctggctggt ctttctcact tctgttcagg tgttatccac   1200 

gtgaccaagg aagtgaaaga agtggcaacg ctgtcctgtg gtcacaatgt ttctgttgaa   1260 

gagctggcac aaactcgcat ctactggcaa aaggagaaga aaatggtgct gactatgatg   1320 

tctggggaca tgaatatatg gcccgagtac aagaaccgga ccatctttga tatcactaat   1380 

aacctctcca ttgtgatcct ggctctgcgc ccatctgacg agggcacata cgagtgtgtt   1440 

gttctgaagt atgaaaaaga cgctttcaag cgggaacacc tggctgaagt gacgttatca   1500 

gtcaaagctg acttccctac acctagtata tctgactttg aaattccaac ttctaatatt   1560 

agaaggataa tttgctcaac ctctggaggt tttccagagc ctcacctctc ctggttggaa   1620 

aatggagaag aattaaatgc catcaacaca acagtttccc aagatcctga aactgagctc   1680 

tatgctgtta gcagcaaact ggatttcaat atgacaacca accacagctt catgtgtctc   1740 

atcaagtatg gacatttaag agtgaatcag accttcaact ggaatacaac caagcaagag   1800 

cattttcctg ataacctgct cccatcctgg gccattacct taatctcagt aaatggaatt   1860 

tttgtgatat gctgcctgac ctactgcttt gccccaagat gcagagagag aaggaggaat   1920 

gagagattga gaagggaaag tgtacgccct gtataacagt gtccgcagaa gcaaggggct   1980 

gaaaacgcgt cgagcatgca tctagggcgg ccaattccgc ccctctccct cccccccccc   2040 

taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgtgatt   2100 

ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt   2160 

gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt   2220 

cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct   2280 

ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt   2340 

ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt   2400 

ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa   2460 

ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta   2520 

gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt tcctttgaaa   2580 

aacacgatga taagcttgcc acaacccggg atcctctaga gtcgacggcc cagagcaag    2639 

atg tgt cac cag cag ttg gtc atc tct tgg ttt tcc ctg gtt ttt ctg     2687 
Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

gca tct ccc ctc gtg gcc ata tgg gaa ctg aag aaa gat gtt tat gtc     2735 
Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

gta gaa ttg gat tgg tat ccg gat gcc cct gga gaa atg gtg gtc ctc     2783 
Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

acc tgt gac acc cct gaa gaa gat ggt atc acc tgg acc ttg gac cag     2831 
Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

agc agt gag gtc tta ggc tct ggc aaa acc ctg acc atc caa gtc aaa     2879 
Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

gag ttt gga gat gct ggc cag tac acc tgt cac aaa gga ggc gag gtt     2927 
Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

cta agc cat tcg ctc ctg ctg ctt cac aaa aag gaa gat gga att tgg     2975 
Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

tcc act gat att tta aag gac cag aaa gaa ccc aaa aat aag acc ttt     3023 
Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

cta aga tgc gag gcc aag aat tat tct gga cgt ttc acc tgc tgg tgg     3071 
Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

ctg acg aca atc agt act gat ttg aca ttc agt gtc aaa agc agc aga     3119 
Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

ggc tct tct gac ccc caa ggg gtg acg tgc gga gct gct aca ctc tct     3167 
Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

gca gag aga gtc aga ggg gac aac aag gag tat gag tac tca gtg gag     3215 
Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

tgc cag gag gac agt gcc tgc cca gct gct gag gag agt ctg ccc att     3263 
Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

gag gtc atg gtg gat gcc gtt cac aag ctc aag tat gaa aac tac acc     3311 
Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

agc agc ttc ttc atc agg gac atc atc aaa cct gac cca ccc aac aac     3359 
Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

ttg cag ctg aag cca tta aag aat tct cgg cag gtg gag gtc agc tgg     3407 
Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

gag tac cct gac acc tgg agt act cca cat tcc tac ttc tcc ctg aca     3455 
Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

ttc tgc gtt cag gtc cag ggc aag agc aag aga gaa aag aaa gat aga     3503 
Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

gtc ttc acc gac aag acc tca gcc acg gtc atc tgc cgc aaa aat gcc     3551 
Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

agc att agc gtg cgg gcc cag gac cgc tac tat agc tca tct tgg agc     3599 
Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

gaa tgg gca tct gtg ccc tgc agt ggt ggc ggt gga agc ggt ggc ggt     3647 
Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

gga agc ggt ggc ggt gga agc ggt ggc ggt gga agc cca tgg aga aac     3695 
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn 
            340                 345                 350 

ctc ccc gtg gcc act cca gac cca gga atg ttc cca tgc ctt cac cac     3743 
Leu Pro Val Ala Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His 
        355                 360                 365 

tcc caa aac ctg ctg agg gcc gtc agc aac atg ctc cag aag gcc aga     3791 
Ser Gln Asn Leu Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg 
    370                 375                 380 

caa act cta gaa ttt tac cct tgc act tct gaa gag att gat cat gaa     3839 
Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu 
385                 390                 395                 400 

gat atc aca aaa gat aaa acc agc aca gtg gag gcc tgt tta cca ttg     3887 
Asp Ile Thr Lys Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu 
                405                 410                 415 

gaa tta acc aag aat gag agt tgc cta aat tcc aga gag acc tct ttc     3935 
Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe 
            420                 425                 430 

ata act aat ggg agt tgc ctg gcc tcc aga aag acc tct ttt atg atg     3983 
Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met 
        435                 440                 445 

gcc ctg tgc ctt agt agt att tat gaa gac ttg aag atg tac cag gtg     4031 
Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val 
    450                 455                 460 

gag ttc aag acc atg aat gca aag ctt ctg atg gat cct aag agg cag     4079 
Glu Phe Lys Thr Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln 
465                 470                 475                 480 

atc ttt cta gat caa aac atg ctg gca gtt att gat gag ctg atg cag     4127 
Ile Phe Leu Asp Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln 
                485                 490                 495 

gcc ctg aat ttc aac agt gag act gtg cca caa aaa tcc tcc ctt gaa     4175 
Ala Leu Asn Phe Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu 
            500                 505                 510 

gaa ccg gat ttt tat aaa act aaa atc aag ctc tgc ata ctt ctt cat     4223 
Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His 
        515                 520                 525 

gct ttc aga att cgg gca gtg act att gat aga gtg atg agc tat ctg     4271 
Ala Phe Arg Ile Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu 
    530                 535                 540 

aat gct tcc taa aaagcgaggt gcggccgctt ccctttagtg agggttaatg         4323 
Asn Ala Ser 
545 

cttcgagcag acatgataag atacattgat gagtttggac aaaccacaac tagaatgcag   4383 

tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccattata   4443 

agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg   4503 

gagatgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtaa aatccgataa   4563 

ggatcgatcc gggctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt   4623 

tgcgcagcct gaatggcgaa tggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt   4683 

ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc   4743 

tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg   4803 

gctcccttta gggttccgat ttagagcttt acggcacctc gaccgcaaaa aacttgattt   4863 

gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt   4923 

ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat   4983 

ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa   5043 

tgagctgatt taacaaatat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc   5103 

gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atacgcggat   5163 

ctgcgcagca ccatggcctg aaataacctc tgaaagagga acttggttag gtaccttctg   5223 

aggcggaaag aaccagctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc   5283 

cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca ggtgtggaaa   5343 

gtccccaggc tccccagcag gcagaagtat gcaaagcatg catctcaatt agtcagcaac   5403 

catagtcccg cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc   5463 

tccgccccat ggctgactaa ttttttttat ttatgcagag gccgaggccg cctcggcctc   5523 

tgagctattc cagaagtagt gaggaggctt ttttggaggc ctaggctttt gcaaaaagct   5583 

tgattcttct gacacaacag tctcgaactt aaggctagag ccaccatgat tgaacaagat   5643 

ggattgcacg caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca   5703 

caacagacaa tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg   5763 

gttctttttg tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg   5823 

cggctatcgt ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact   5883 

gaagcgggaa gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatct   5943 

caccttgctc ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg   6003 

cttgatccgg ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt   6063 

actcggatgg aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc   6123 

gcgccagccg aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc   6183 

gtgacccatg gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga   6243 

ttcatcgact gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc   6303 

cgtgatattg ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt   6363 

atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga   6423 

gcgggactct ggggttcgaa atgaccgacc aagcgacgcc caacctgcca tcacgatggc   6483 

cgcaataaaa tatctttatt ttcattacat ctgtgtgttg gttttttgtg tgaatcgata   6543 

gcgataagga tccgcgtatg gtgcactctc agtacaatct gctctgatgc cgcatagtta   6603 

agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg   6663 

gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca   6723 

ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt   6783 

aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc   6843 

ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa   6903 

taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc   6963 

cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa   7023 

acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa   7083 

ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg   7143 

atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa   7203 

gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc   7263 

acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc   7323 

atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta   7383 

accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag   7443 

ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca   7503 

acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata   7563 

gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc   7623 

tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca   7683 

ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca   7743 

actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg   7803 

taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa   7863 

tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt   7923 

gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat   7983 

cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg   8043 

gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga   8103 

gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac   8163 

tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt   8223 

ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag   8283 

cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc   8343 

gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag   8403 

gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca   8463 

gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt   8523 

cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc   8583 

tttttacggt tcctggcctt ttgctggcct tttgctcaca tggctcgaca gatct        8638 

Sequence ID: 10
Length of Sequence: 547
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 10 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn 
            340                 345                 350 

Leu Pro Val Ala Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His 
        355                 360                 365 

Ser Gln Asn Leu Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg 
    370                 375                 380 

Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu 
385                 390                 395                 400 

Asp Ile Thr Lys Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu 
                405                 410                 415 

Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe 
            420                 425                 430 

Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met 
        435                 440                 445 

Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val 
    450                 455                 460 

Glu Phe Lys Thr Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln 
465                 470                 475                 480 

Ile Phe Leu Asp Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln 
                485                 490                 495 

Ala Leu Asn Phe Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu 
            500                 505                 510 

Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His 
        515                 520                 525 

Ala Phe Arg Ile Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu 
    530                 535                 540 

Asn Ala Ser 
545 

Sequence ID: 11
Length of Sequence: 8659
Sequence Type: DNA
Scientific Name: Artificial Sequence
Name/Key: misc_feature
Location: (1)..(750)
Other Information: CMV enhancer and promoter
 11 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctagg gcccagagca ag atg tgt cac cag cag ttg gtc atc tct tgg    1132 
                         Met Cys His Gln Gln Leu Val Ile Ser Trp 
                           1               5                  10 

ttt tcc ctg gtt ttt ctg gca tct ccc ctc gtg gcc ata tgg gaa ctg     1180 
Phe Ser Leu Val Phe Leu Ala Ser Pro Leu Val Ala Ile Trp Glu Leu 
                 15                  20                  25 

aag aaa gat gtt tat gtc gta gaa ttg gat tgg tat ccg gat gcc cct     1228 
Lys Lys Asp Val Tyr Val Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro 
             30                  35                  40 

gga gaa atg gtg gtc ctc acc tgt gac acc cct gaa gaa gat ggt atc     1276 
Gly Glu Met Val Val Leu Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile 
         45                  50                  55 

acc tgg acc ttg gac cag agc agt gag gtc tta ggc tct ggc aaa acc     1324 
Thr Trp Thr Leu Asp Gln Ser Ser Glu Val Leu Gly Ser Gly Lys Thr 
     60                  65                  70 

ctg acc atc caa gtc aaa gag ttt gga gat gct ggc cag tac acc tgt     1372 
Leu Thr Ile Gln Val Lys Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys 
 75                  80                  85                  90 

cac aaa gga ggc gag gtt cta agc cat tcg ctc ctg ctg ctt cac aaa     1420 
His Lys Gly Gly Glu Val Leu Ser His Ser Leu Leu Leu Leu His Lys 
                 95                 100                 105 

aag gaa gat gga att tgg tcc act gat att tta aag gac cag aaa gaa     1468 
Lys Glu Asp Gly Ile Trp Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu 
            110                 115                 120 

ccc aaa aat aag acc ttt cta aga tgc gag gcc aag aat tat tct gga     1516 
Pro Lys Asn Lys Thr Phe Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly 
        125                 130                 135 

cgt ttc acc tgc tgg tgg ctg acg aca atc agt act gat ttg aca ttc     1564 
Arg Phe Thr Cys Trp Trp Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe 
    140                 145                 150 

agt gtc aaa agc agc aga ggc tct tct gac ccc caa ggg gtg acg tgc     1612 
Ser Val Lys Ser Ser Arg Gly Ser Ser Asp Pro Gln Gly Val Thr Cys 
155                 160                 165                 170 

gga gct gct aca ctc tct gca gag aga gtc aga ggg gac aac aag gag     1660 
Gly Ala Ala Thr Leu Ser Ala Glu Arg Val Arg Gly Asp Asn Lys Glu 
                175                 180                 185 

tat gag tac tca gtg gag tgc cag gag gac agt gcc tgc cca gct gct     1708 
Tyr Glu Tyr Ser Val Glu Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala 
            190                 195                 200 

gag gag agt ctg ccc att gag gtc atg gtg gat gcc gtt cac aag ctc     1756 
Glu Glu Ser Leu Pro Ile Glu Val Met Val Asp Ala Val His Lys Leu 
        205                 210                 215 

aag tat gaa aac tac acc agc agc ttc ttc atc agg gac atc atc aaa     1804 
Lys Tyr Glu Asn Tyr Thr Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys 
    220                 225                 230 

cct gac cca ccc aac aac ttg cag ctg aag cca tta aag aat tct cgg     1852 
Pro Asp Pro Pro Asn Asn Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg 
235                 240                 245                 250 

cag gtg gag gtc agc tgg gag tac cct gac acc tgg agt act cca cat     1900 
Gln Val Glu Val Ser Trp Glu Tyr Pro Asp Thr Trp Ser Thr Pro His 
                255                 260                 265 

tcc tac ttc tcc ctg aca ttc tgc gtt cag gtc cag ggc aag agc aag     1948 
Ser Tyr Phe Ser Leu Thr Phe Cys Val Gln Val Gln Gly Lys Ser Lys 
            270                 275                 280 

aga gaa aag aaa gat aga gtc ttc acc gac aag acc tca gcc acg gtc     1996 
Arg Glu Lys Lys Asp Arg Val Phe Thr Asp Lys Thr Ser Ala Thr Val 
        285                 290                 295 

atc tgc cgc aaa aat gcc agc att agc gtg cgg gcc cag gac cgc tac     2044 
Ile Cys Arg Lys Asn Ala Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr 
    300                 305                 310 

tat agc tca tct tgg agc gaa tgg gca tct gtg ccc tgc agt ggt ggc     2092 
Tyr Ser Ser Ser Trp Ser Glu Trp Ala Ser Val Pro Cys Ser Gly Gly 
315                 320                 325                 330 

ggt gga agc ggt ggc ggt gga agc ggt ggc ggt gga agc ggt ggc ggt     2140 
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                335                 340                 345 

gga agc cca tgg aga aac ctc ccc gtg gcc act cca gac cca gga atg     2188 
Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met 
            350                 355                 360 

ttc cca tgc ctt cac cac tcc caa aac ctg ctg agg gcc gtc agc aac     2236 
Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg Ala Val Ser Asn 
        365                 370                 375 

atg ctc cag aag gcc aga caa act cta gaa ttt tac cct tgc act tct     2284 
Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser 
    380                 385                 390 

gaa gag att gat cat gaa gat atc aca aaa gat aaa acc agc aca gtg     2332 
Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys Thr Ser Thr Val 
395                 400                 405                 410 

gag gcc tgt tta cca ttg gaa tta acc aag aat gag agt tgc cta aat     2380 
Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn 
                415                 420                 425 

tcc aga gag acc tct ttc ata act aat ggg agt tgc ctg gcc tcc aga     2428 
Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg 
            430                 435                 440 

aag acc tct ttt atg atg gcc ctg tgc ctt agt agt att tat gaa gac     2476 
Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp 
        445                 450                 455 

ttg aag atg tac cag gtg gag ttc aag acc atg aat gca aag ctt ctg     2524 
Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn Ala Lys Leu Leu 
    460                 465                 470 

atg gat cct aag agg cag atc ttt cta gat caa aac atg ctg gca gtt     2572 
Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn Met Leu Ala Val 
475                 480                 485                 490 

att gat gag ctg atg cag gcc ctg aat ttc aac agt gag act gtg cca     2620 
Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser Glu Thr Val Pro 
                495                 500                 505 

caa aaa tcc tcc ctt gaa gaa ccg gat ttt tat aaa act aaa atc aag     2668 
Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys 
            510                 515                 520 

ctc tgc ata ctt ctt cat gct ttc aga att cgg gca gtg act att gat     2716 
Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala Val Thr Ile Asp 
        525                 530                 535 

aga gtg atg agc tat ctg aat gct tcc taa aaagcgaggt acgcgtcgag       2766 
Arg Val Met Ser Tyr Leu Asn Ala Ser 
    540                 545 

catgcatcta gggcggccaa ttccgcccct ctccctcccc cccccctaac gttactggcc   2826 

gaagccgctt ggaataaggc cggtgtgcgt ttgtctatat gtgattttcc accatattgc   2886 

cgtcttttgg caatgtgagg gcccggaaac ctggccctgt cttcttgacg agcattccta   2946 

ggggtctttc ccctctcgcc aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag   3006 

ttcctctgga agcttcttga agacaaacaa cgtctgtagc gaccctttgc aggcagcgga   3066 

accccccacc tggcgacagg tgcctctgcg gccaaaagcc acgtgtataa gatacacctg   3126 

caaaggcggc acaaccccag tgccacgttg tgagttggat agttgtggaa agagtcaaat   3186 

ggctctcctc aagcgtattc aacaaggggc tgaaggatgc ccagaaggta ccccattgta   3246 

tgggatctga tctggggcct cggtgcacat gctttacatg tgtttagtcg aggttaaaaa   3306 

aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgatgataag   3366 

cttgccacaa cccgggatcc tctagagtcg acatgggcca cacacggagg cagggaacat   3426 

caccatccaa gtgtccatac ctcaatttct ttcagctctt ggtgctggct ggtctttctc   3486 

acttctgttc aggtgttatc cacgtgacca aggaagtgaa agaagtggca acgctgtcct   3546 

gtggtcacaa tgtttctgtt gaagagctgg cacaaactcg catctactgg caaaaggaga   3606 

agaaaatggt gctgactatg atgtctgggg acatgaatat atggcccgag tacaagaacc   3666 

ggaccatctt tgatatcact aataacctct ccattgtgat cctggctctg cgcccatctg   3726 

acgagggcac atacgagtgt gttgttctga agtatgaaaa agacgctttc aagcgggaac   3786 

acctggctga agtgacgtta tcagtcaaag ctgacttccc tacacctagt atatctgact   3846 

ttgaaattcc aacttctaat attagaagga taatttgctc aacctctgga ggttttccag   3906 

agcctcacct ctcctggttg gaaaatggag aagaattaaa tgccatcaac acaacagttt   3966 

cccaagatcc tgaaactgag ctctatgctg ttagcagcaa actggatttc aatatgacaa   4026 

ccaaccacag cttcatgtgt ctcatcaagt atggacattt aagagtgaat cagaccttca   4086 

actggaatac aaccaagcaa gagcattttc ctgataacct gctcccatcc tgggccatta   4146 

ccttaatctc agtaaatgga atttttgtga tatgctgcct gacctactgc tttgccccaa   4206 

gatgcagaga gagaaggagg aatgagagat tgagaaggga aagtgtacgc cctgtataac   4266 

agtgtccgca gaagcaaggg gctgaaacgg ccgatcacta gtgaattcgc ggccgcttcc   4326 

ctttagtgag ggttaatgct tcgagcagac atgataagat acattgatga gtttggacaa   4386 

accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct   4446 

ttatttgtaa ccattataag ctgcaataaa caagttaaca acaacaattg cattcatttt   4506 

atgtttcagg ttcaggggga gatgtgggag gttttttaaa gcaagtaaaa cctctacaaa   4566 

tgtggtaaaa tccgataagg atcgatccgg gctggcgtaa tagcgaagag gcccgcaccg   4626 

atcgcccttc ccaacagttg cgcagcctga atggcgaatg gacgcgccct gtagcggcgc   4686 

attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct   4746 

agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg   4806 

tcaagctcta aatcgggggc tccctttagg gttccgattt agagctttac ggcacctcga   4866 

ccgcaaaaaa cttgatttgg gtgatggttc acgtagtggg ccatcgccct gatagacggt   4926 

ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg   4986 

aacaacactc aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc   5046 

ggcctattgg ttaaaaaatg agctgattta acaaatattt aacgcgaatt ttaacaaaat   5106 

attaacgttt acaatttcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt   5166 

cacaccgcat acgcggatct gcgcagcacc atggcctgaa ataacctctg aaagaggaac   5226 

ttggttaggt accttctgag gcggaaagaa ccagctgtgg aatgtgtgtc agttagggtg   5286 

tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc   5346 

agcaaccagg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca   5406 

tctcaattag tcagcaacca tagtcccgcc cctaactccg cccatcccgc ccctaactcc   5466 

gcccagttcc gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc   5526 

cgaggccgcc tcggcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct   5586 

aggcttttgc aaaaagcttg attcttctga cacaacagtc tcgaacttaa ggctagagcc   5646 

accatgattg aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta   5706 

ttcggctatg actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg   5766 

tcagcgcagg ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa   5826 

ctgcaggacg aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct   5886 

gtgctcgacg ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg   5946 

caggatctcc tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca   6006 

atgcggcggc tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat   6066 

cgcatcgagc gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac   6126 

gaagagcatc aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc   6186 

gacggcgagg atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa   6246 

aatggccgct tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag   6306 

gacatagcgt tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc   6366 

ttcctcgtgc tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt   6426 

cttgacgagt tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca   6486 

acctgccatc acgatggccg caataaaata tctttatttt cattacatct gtgtgttggt   6546 

tttttgtgtg aatcgatagc gataaggatc cgcgtatggt gcactctcag tacaatctgc   6606 

tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga   6666 

cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc   6726 

atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata   6786 

cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact   6846 

tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg   6906 

tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt   6966 

atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct   7026 

gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca   7086 

cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc   7146 

gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc   7206 

cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg   7266 

gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta   7326 

tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc   7386 

ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt   7446 

gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg   7506 

cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactttagct   7566 

tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc   7626 

tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct   7686 

cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac   7746 

acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc   7806 

tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat   7866 

ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg   7926 

accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc   7986 

aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa   8046 

ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag   8106 

gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta   8166 

ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta   8226 

ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag   8286 

ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg   8346 

gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg   8406 

cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag   8466 

cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc   8526 

cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa   8586 

aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg   8646 

gctcgacaga tct                                                      8659 

Sequence ID: 12
Length of Sequence: 547
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 12 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn 
            340                 345                 350 

Leu Pro Val Ala Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His 
        355                 360                 365 

Ser Gln Asn Leu Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg 
    370                 375                 380 

Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu 
385                 390                 395                 400 

Asp Ile Thr Lys Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu 
                405                 410                 415 

Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe 
            420                 425                 430 

Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met 
        435                 440                 445 

Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val 
    450                 455                 460 

Glu Phe Lys Thr Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln 
465                 470                 475                 480 

Ile Phe Leu Asp Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln 
                485                 490                 495 

Ala Leu Asn Phe Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu 
            500                 505                 510 

Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His 
        515                 520                 525 

Ala Phe Arg Ile Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu 
    530                 535                 540 

Asn Ala Ser 
545 

Sequence ID: 13
Length of Sequence: 8644
Sequence Type: DNA
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 13 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctagg gcccagagca ag atg tgt cac cag cag ttg gtc atc tct tgg    1132 
                         Met Cys His Gln Gln Leu Val Ile Ser Trp 
                           1               5                  10 

ttt tcc ctg gtt ttt ctg gca tct ccc ctc gtg gcc ata tgg gaa ctg     1180 
Phe Ser Leu Val Phe Leu Ala Ser Pro Leu Val Ala Ile Trp Glu Leu 
                 15                  20                  25 

aag aaa gat gtt tat gtc gta gaa ttg gat tgg tat ccg gat gcc cct     1228 
Lys Lys Asp Val Tyr Val Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro 
             30                  35                  40 

gga gaa atg gtg gtc ctc acc tgt gac acc cct gaa gaa gat ggt atc     1276 
Gly Glu Met Val Val Leu Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile 
         45                  50                  55 

acc tgg acc ttg gac cag agc agt gag gtc tta ggc tct ggc aaa acc     1324 
Thr Trp Thr Leu Asp Gln Ser Ser Glu Val Leu Gly Ser Gly Lys Thr 
     60                  65                  70 

ctg acc atc caa gtc aaa gag ttt gga gat gct ggc cag tac acc tgt     1372 
Leu Thr Ile Gln Val Lys Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys 
 75                  80                  85                  90 

cac aaa gga ggc gag gtt cta agc cat tcg ctc ctg ctg ctt cac aaa     1420 
His Lys Gly Gly Glu Val Leu Ser His Ser Leu Leu Leu Leu His Lys 
                 95                 100                 105 

aag gaa gat gga att tgg tcc act gat att tta aag gac cag aaa gaa     1468 
Lys Glu Asp Gly Ile Trp Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu 
            110                 115                 120 

ccc aaa aat aag acc ttt cta aga tgc gag gcc aag aat tat tct gga     1516 
Pro Lys Asn Lys Thr Phe Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly 
        125                 130                 135 

cgt ttc acc tgc tgg tgg ctg acg aca atc agt act gat ttg aca ttc     1564 
Arg Phe Thr Cys Trp Trp Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe 
    140                 145                 150 

agt gtc aaa agc agc aga ggc tct tct gac ccc caa ggg gtg acg tgc     1612 
Ser Val Lys Ser Ser Arg Gly Ser Ser Asp Pro Gln Gly Val Thr Cys 
155                 160                 165                 170 

gga gct gct aca ctc tct gca gag aga gtc aga ggg gac aac aag gag     1660 
Gly Ala Ala Thr Leu Ser Ala Glu Arg Val Arg Gly Asp Asn Lys Glu 
                175                 180                 185 

tat gag tac tca gtg gag tgc cag gag gac agt gcc tgc cca gct gct     1708 
Tyr Glu Tyr Ser Val Glu Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala 
            190                 195                 200 

gag gag agt ctg ccc att gag gtc atg gtg gat gcc gtt cac aag ctc     1756 
Glu Glu Ser Leu Pro Ile Glu Val Met Val Asp Ala Val His Lys Leu 
        205                 210                 215 

aag tat gaa aac tac acc agc agc ttc ttc atc agg gac atc atc aaa     1804 
Lys Tyr Glu Asn Tyr Thr Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys 
    220                 225                 230 

cct gac cca ccc aac aac ttg cag ctg aag cca tta aag aat tct cgg     1852 
Pro Asp Pro Pro Asn Asn Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg 
235                 240                 245                 250 

cag gtg gag gtc agc tgg gag tac cct gac acc tgg agt act cca cat     1900 
Gln Val Glu Val Ser Trp Glu Tyr Pro Asp Thr Trp Ser Thr Pro His 
                255                 260                 265 

tcc tac ttc tcc ctg aca ttc tgc gtt cag gtc cag ggc aag agc aag     1948 
Ser Tyr Phe Ser Leu Thr Phe Cys Val Gln Val Gln Gly Lys Ser Lys 
            270                 275                 280 

aga gaa aag aaa gat aga gtc ttc acc gac aag acc tca gcc acg gtc     1996 
Arg Glu Lys Lys Asp Arg Val Phe Thr Asp Lys Thr Ser Ala Thr Val 
        285                 290                 295 

atc tgc cgc aaa aat gcc agc att agc gtg cgg gcc cag gac cgc tac     2044 
Ile Cys Arg Lys Asn Ala Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr 
    300                 305                 310 

tat agc tca tct tgg agc gaa tgg gca tct gtg ccc tgc agt ggt ggc     2092 
Tyr Ser Ser Ser Trp Ser Glu Trp Ala Ser Val Pro Cys Ser Gly Gly 
315                 320                 325                 330 

ggt gga agc ggt ggc ggt gga agc ggt ggc ggt gga agc cca tgg aga     2140 
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg 
                335                 340                 345 

aac ctc ccc gtg gcc act cca gac cca gga atg ttc cca tgc ctt cac     2188 
Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His 
            350                 355                 360 

cac tcc caa aac ctg ctg agg gcc gtc agc aac atg ctc cag aag gcc     2236 
His Ser Gln Asn Leu Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala 
        365                 370                 375 

aga caa act cta gaa ttt tac cct tgc act tct gaa gag att gat cat     2284 
Arg Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His 
    380                 385                 390 

gaa gat atc aca aaa gat aaa acc agc aca gtg gag gcc tgt tta cca     2332 
Glu Asp Ile Thr Lys Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro 
395                 400                 405                 410 

ttg gaa tta acc aag aat gag agt tgc cta aat tcc aga gag acc tct     2380 
Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser 
                415                 420                 425 

ttc ata act aat ggg agt tgc ctg gcc tcc aga aag acc tct ttt atg     2428 
Phe Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met 
            430                 435                 440 

atg gcc ctg tgc ctt agt agt att tat gaa gac ttg aag atg tac cag     2476 
Met Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln 
        445                 450                 455 

gtg gag ttc aag acc atg aat gca aag ctt ctg atg gat cct aag agg     2524 
Val Glu Phe Lys Thr Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg 
    460                 465                 470 

cag atc ttt cta gat caa aac atg ctg gca gtt att gat gag ctg atg     2572 
Gln Ile Phe Leu Asp Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met 
475                 480                 485                 490 

cag gcc ctg aat ttc aac agt gag act gtg cca caa aaa tcc tcc ctt     2620 
Gln Ala Leu Asn Phe Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu 
                495                 500                 505 

gaa gaa ccg gat ttt tat aaa act aaa atc aag ctc tgc ata ctt ctt     2668 
Glu Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu 
            510                 515                 520 

cat gct ttc aga att cgg gca gtg act att gat aga gtg atg agc tat     2716 
His Ala Phe Arg Ile Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr 
        525                 530                 535 

ctg aat gct tcc taa aaagcgaggt acgcgtcgag catgcatcta gggcggccaa     2771 
Leu Asn Ala Ser 
    540 

ttccgcccct ctccctcccc cccccctaac gttactggcc gaagccgctt ggaataaggc   2831 

cggtgtgcgt ttgtctatat gtgattttcc accatattgc cgtcttttgg caatgtgagg   2891 

gcccggaaac ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc   2951 

aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga   3011 

agacaaacaa cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg   3071 

tgcctctgcg gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag   3131 

tgccacgttg tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc   3191 

aacaaggggc tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct   3251 

cggtgcacat gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac   3311 

cacggggacg tggttttcct ttgaaaaaca cgatgataag cttgccacaa cccgggatcc   3371 

tctagagtcg acatgggcca cacacggagg cagggaacat caccatccaa gtgtccatac   3431 

ctcaatttct ttcagctctt ggtgctggct ggtctttctc acttctgttc aggtgttatc   3491 

cacgtgacca aggaagtgaa agaagtggca acgctgtcct gtggtcacaa tgtttctgtt   3551 

gaagagctgg cacaaactcg catctactgg caaaaggaga agaaaatggt gctgactatg   3611 

atgtctgggg acatgaatat atggcccgag tacaagaacc ggaccatctt tgatatcact   3671 

aataacctct ccattgtgat cctggctctg cgcccatctg acgagggcac atacgagtgt   3731 

gttgttctga agtatgaaaa agacgctttc aagcgggaac acctggctga agtgacgtta   3791 

tcagtcaaag ctgacttccc tacacctagt atatctgact ttgaaattcc aacttctaat   3851 

attagaagga taatttgctc aacctctgga ggttttccag agcctcacct ctcctggttg   3911 

gaaaatggag aagaattaaa tgccatcaac acaacagttt cccaagatcc tgaaactgag   3971 

ctctatgctg ttagcagcaa actggatttc aatatgacaa ccaaccacag cttcatgtgt   4031 

ctcatcaagt atggacattt aagagtgaat cagaccttca actggaatac aaccaagcaa   4091 

gagcattttc ctgataacct gctcccatcc tgggccatta ccttaatctc agtaaatgga   4151 

atttttgtga tatgctgcct gacctactgc tttgccccaa gatgcagaga gagaaggagg   4211 

aatgagagat tgagaaggga aagtgtacgc cctgtataac agtgtccgca gaagcaaggg   4271 

gctgaaacgg ccgatcacta gtgaattcgc ggccgcttcc ctttagtgag ggttaatgct   4331 

tcgagcagac atgataagat acattgatga gtttggacaa accacaacta gaatgcagtg   4391 

aaaaaaatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa ccattataag   4451 

ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg ttcaggggga   4511 

gatgtgggag gttttttaaa gcaagtaaaa cctctacaaa tgtggtaaaa tccgataagg   4571 

atcgatccgg gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg   4631 

cgcagcctga atggcgaatg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg   4691 

tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt   4751 

tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc   4811 

tccctttagg gttccgattt agagctttac ggcacctcga ccgcaaaaaa cttgatttgg   4871 

gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg   4931 

agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct   4991 

cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg   5051 

agctgattta acaaatattt aacgcgaatt ttaacaaaat attaacgttt acaatttcgc   5111 

ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat acgcggatct   5171 

gcgcagcacc atggcctgaa ataacctctg aaagaggaac ttggttaggt accttctgag   5231 

gcggaaagaa ccagctgtgg aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc   5291 

cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccagg tgtggaaagt   5351 

ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca   5411 

tagtcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc   5471 

cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc tcggcctctg   5531 

agctattcca gaagtagtga ggaggctttt ttggaggcct aggcttttgc aaaaagcttg   5591 

attcttctga cacaacagtc tcgaacttaa ggctagagcc accatgattg aacaagatgg   5651 

attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca   5711 

acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt   5771 

tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg   5831 

gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga   5891 

agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca   5951 

ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct   6011 

tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac   6071 

tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc   6131 

gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt   6191 

gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct tttctggatt   6251 

catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg   6311 

tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat   6371 

cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc   6431 

gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc acgatggccg   6491 

caataaaata tctttatttt cattacatct gtgtgttggt tttttgtgtg aatcgatagc   6551 

gataaggatc cgcgtatggt gcactctcag tacaatctgc tctgatgccg catagttaag   6611 

ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc   6671 

atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc   6731 

gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa   6791 

tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg   6851 

aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata   6911 

accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg   6971 

tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac   7031 

gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact   7091 

ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat   7151 

gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga   7211 

gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac   7271 

agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat   7331 

gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac   7391 

cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct   7451 

gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac   7511 

gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga   7571 

ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg   7631 

gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact   7691 

ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac   7751 

tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta   7811 

actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt   7871 

taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga   7931 

gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc   7991 

tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt   8051 

ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc   8111 

gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc   8171 

tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg   8231 

cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg   8291 

gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga   8351 

actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc   8411 

ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg   8471 

gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg   8531 

atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt   8591 

tttacggttc ctggcctttt gctggccttt tgctcacatg gctcgacaga tct          8644 

Sequence ID: 14
Length of Sequence: 542
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 14 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr 
            340                 345                 350 

Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu Leu 
        355                 360                 365 

Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe 
    370                 375                 380 

Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp 
385                 390                 395                 400 

Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn 
                405                 410                 415 

Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser 
            420                 425                 430 

Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser 
        435                 440                 445 

Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met 
    450                 455                 460 

Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln 
465                 470                 475                 480 

Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn 
                485                 490                 495 

Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr 
            500                 505                 510 

Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg 
        515                 520                 525 

Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
    530                 535                 540 

Sequence ID: 15
Length of Sequence: 8629
Sequence Type: DNA
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 15 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta     60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc    120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg    180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc    240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat    300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc    360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga    420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg    480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac    540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt    600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg    660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata    720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac    780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt    840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa    900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact    960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac   1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact   1080 

ataggctagg gcccagagca ag atg tgt cac cag cag ttg gtc atc tct tgg    1132 
                         Met Cys His Gln Gln Leu Val Ile Ser Trp 
                           1               5                  10 

ttt tcc ctg gtt ttt ctg gca tct ccc ctc gtg gcc ata tgg gaa ctg     1180 
Phe Ser Leu Val Phe Leu Ala Ser Pro Leu Val Ala Ile Trp Glu Leu 
                 15                  20                  25 

aag aaa gat gtt tat gtc gta gaa ttg gat tgg tat ccg gat gcc cct     1228 
Lys Lys Asp Val Tyr Val Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro 
             30                  35                  40 

gga gaa atg gtg gtc ctc acc tgt gac acc cct gaa gaa gat ggt atc     1276 
Gly Glu Met Val Val Leu Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile 
         45                  50                  55 

acc tgg acc ttg gac cag agc agt gag gtc tta ggc tct ggc aaa acc     1324 
Thr Trp Thr Leu Asp Gln Ser Ser Glu Val Leu Gly Ser Gly Lys Thr 
     60                  65                  70 

ctg acc atc caa gtc aaa gag ttt gga gat gct ggc cag tac acc tgt     1372 
Leu Thr Ile Gln Val Lys Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys 
 75                  80                  85                  90 

cac aaa gga ggc gag gtt cta agc cat tcg ctc ctg ctg ctt cac aaa     1420 
His Lys Gly Gly Glu Val Leu Ser His Ser Leu Leu Leu Leu His Lys 
                 95                 100                 105 

aag gaa gat gga att tgg tcc act gat att tta aag gac cag aaa gaa     1468 
Lys Glu Asp Gly Ile Trp Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu 
            110                 115                 120 

ccc aaa aat aag acc ttt cta aga tgc gag gcc aag aat tat tct gga     1516 
Pro Lys Asn Lys Thr Phe Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly 
        125                 130                 135 

cgt ttc acc tgc tgg tgg ctg acg aca atc agt act gat ttg aca ttc     1564 
Arg Phe Thr Cys Trp Trp Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe 
    140                 145                 150 

agt gtc aaa agc agc aga ggc tct tct gac ccc caa ggg gtg acg tgc     1612 
Ser Val Lys Ser Ser Arg Gly Ser Ser Asp Pro Gln Gly Val Thr Cys 
155                 160                 165                 170 

gga gct gct aca ctc tct gca gag aga gtc aga ggg gac aac aag gag     1660 
Gly Ala Ala Thr Leu Ser Ala Glu Arg Val Arg Gly Asp Asn Lys Glu 
                175                 180                 185 

tat gag tac tca gtg gag tgc cag gag gac agt gcc tgc cca gct gct     1708 
Tyr Glu Tyr Ser Val Glu Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala 
            190                 195                 200 

gag gag agt ctg ccc att gag gtc atg gtg gat gcc gtt cac aag ctc     1756 
Glu Glu Ser Leu Pro Ile Glu Val Met Val Asp Ala Val His Lys Leu 
        205                 210                 215 

aag tat gaa aac tac acc agc agc ttc ttc atc agg gac atc atc aaa     1804 
Lys Tyr Glu Asn Tyr Thr Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys 
    220                 225                 230 

cct gac cca ccc aac aac ttg cag ctg aag cca tta aag aat tct cgg     1852 
Pro Asp Pro Pro Asn Asn Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg 
235                 240                 245                 250 

cag gtg gag gtc agc tgg gag tac cct gac acc tgg agt act cca cat     1900 
Gln Val Glu Val Ser Trp Glu Tyr Pro Asp Thr Trp Ser Thr Pro His 
                255                 260                 265 

tcc tac ttc tcc ctg aca ttc tgc gtt cag gtc cag ggc aag agc aag     1948 
Ser Tyr Phe Ser Leu Thr Phe Cys Val Gln Val Gln Gly Lys Ser Lys 
            270                 275                 280 

aga gaa aag aaa gat aga gtc ttc acc gac aag acc tca gcc acg gtc     1996 
Arg Glu Lys Lys Asp Arg Val Phe Thr Asp Lys Thr Ser Ala Thr Val 
        285                 290                 295 

atc tgc cgc aaa aat gcc agc att agc gtg cgg gcc cag gac cgc tac     2044 
Ile Cys Arg Lys Asn Ala Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr 
    300                 305                 310 

tat agc tca tct tgg agc gaa tgg gca tct gtg ccc tgc agt ggt ggc     2092 
Tyr Ser Ser Ser Trp Ser Glu Trp Ala Ser Val Pro Cys Ser Gly Gly 
315                 320                 325                 330 

ggt gga agc ggt ggc ggt gga agc cca tgg aga aac ctc ccc gtg gcc     2140 
Gly Gly Ser Gly Gly Gly Gly Ser Pro Trp Arg Asn Leu Pro Val Ala 
                335                 340                 345 

act cca gac cca gga atg ttc cca tgc ctt cac cac tcc caa aac ctg     2188 
Thr Pro Asp Pro Gly Met Phe Pro Cys Leu His His Ser Gln Asn Leu 
            350                 355                 360 

ctg agg gcc gtc agc aac atg ctc cag aag gcc aga caa act cta gaa     2236 
Leu Arg Ala Val Ser Asn Met Leu Gln Lys Ala Arg Gln Thr Leu Glu 
        365                 370                 375 

ttt tac cct tgc act tct gaa gag att gat cat gaa gat atc aca aaa     2284 
Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp His Glu Asp Ile Thr Lys 
    380                 385                 390 

gat aaa acc agc aca gtg gag gcc tgt tta cca ttg gaa tta acc aag     2332 
Asp Lys Thr Ser Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys 
395                 400                 405                 410 

aat gag agt tgc cta aat tcc aga gag acc tct ttc ata act aat ggg     2380 
Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly 
                415                 420                 425 

agt tgc ctg gcc tcc aga aag acc tct ttt atg atg gcc ctg tgc ctt     2428 
Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu 
            430                 435                 440 

agt agt att tat gaa gac ttg aag atg tac cag gtg gag ttc aag acc     2476 
Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr Gln Val Glu Phe Lys Thr 
        445                 450                 455 

atg aat gca aag ctt ctg atg gat cct aag agg cag atc ttt cta gat     2524 
Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gln Ile Phe Leu Asp 
    460                 465                 470 

caa aac atg ctg gca gtt att gat gag ctg atg cag gcc ctg aat ttc     2572 
Gln Asn Met Leu Ala Val Ile Asp Glu Leu Met Gln Ala Leu Asn Phe 
475                 480                 485                 490 

aac agt gag act gtg cca caa aaa tcc tcc ctt gaa gaa ccg gat ttt     2620 
Asn Ser Glu Thr Val Pro Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe 
                495                 500                 505 

tat aaa act aaa atc aag ctc tgc ata ctt ctt cat gct ttc aga att     2668 
Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu Leu His Ala Phe Arg Ile 
            510                 515                 520 

cgg gca gtg act att gat aga gtg atg agc tat ctg aat gct tcc taa     2716 
Arg Ala Val Thr Ile Asp Arg Val Met Ser Tyr Leu Asn Ala Ser 
        525                 530                 535 

aaagcgaggt acgcgtcgag catgcatcta gggcggccaa ttccgcccct ctccctcccc   2776 

cccccctaac gttactggcc gaagccgctt ggaataaggc cggtgtgcgt ttgtctatat   2836 

gtgattttcc accatattgc cgtcttttgg caatgtgagg gcccggaaac ctggccctgt   2896 

cttcttgacg agcattccta ggggtctttc ccctctcgcc aaaggaatgc aaggtctgtt   2956 

gaatgtcgtg aaggaagcag ttcctctgga agcttcttga agacaaacaa cgtctgtagc   3016 

gaccctttgc aggcagcgga accccccacc tggcgacagg tgcctctgcg gccaaaagcc   3076 

acgtgtataa gatacacctg caaaggcggc acaaccccag tgccacgttg tgagttggat   3136 

agttgtggaa agagtcaaat ggctctcctc aagcgtattc aacaaggggc tgaaggatgc   3196 

ccagaaggta ccccattgta tgggatctga tctggggcct cggtgcacat gctttacatg   3256 

tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac cacggggacg tggttttcct   3316 

ttgaaaaaca cgatgataag cttgccacaa cccgggatcc tctagagtcg acatgggcca   3376 

cacacggagg cagggaacat caccatccaa gtgtccatac ctcaatttct ttcagctctt   3436 

ggtgctggct ggtctttctc acttctgttc aggtgttatc cacgtgacca aggaagtgaa   3496 

agaagtggca acgctgtcct gtggtcacaa tgtttctgtt gaagagctgg cacaaactcg   3556 

catctactgg caaaaggaga agaaaatggt gctgactatg atgtctgggg acatgaatat   3616 

atggcccgag tacaagaacc ggaccatctt tgatatcact aataacctct ccattgtgat   3676 

cctggctctg cgcccatctg acgagggcac atacgagtgt gttgttctga agtatgaaaa   3736 

agacgctttc aagcgggaac acctggctga agtgacgtta tcagtcaaag ctgacttccc   3796 

tacacctagt atatctgact ttgaaattcc aacttctaat attagaagga taatttgctc   3856 

aacctctgga ggttttccag agcctcacct ctcctggttg gaaaatggag aagaattaaa   3916 

tgccatcaac acaacagttt cccaagatcc tgaaactgag ctctatgctg ttagcagcaa   3976 

actggatttc aatatgacaa ccaaccacag cttcatgtgt ctcatcaagt atggacattt   4036 

aagagtgaat cagaccttca actggaatac aaccaagcaa gagcattttc ctgataacct   4096 

gctcccatcc tgggccatta ccttaatctc agtaaatgga atttttgtga tatgctgcct   4156 

gacctactgc tttgccccaa gatgcagaga gagaaggagg aatgagagat tgagaaggga   4216 

aagtgtacgc cctgtataac agtgtccgca gaagcaaggg gctgaaacgg ccgatcacta   4276 

gtgaattcgc ggccgcttcc ctttagtgag ggttaatgct tcgagcagac atgataagat   4336 

acattgatga gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg   4396 

aaatttgtga tgctattgct ttatttgtaa ccattataag ctgcaataaa caagttaaca   4456 

acaacaattg cattcatttt atgtttcagg ttcaggggga gatgtgggag gttttttaaa   4516 

gcaagtaaaa cctctacaaa tgtggtaaaa tccgataagg atcgatccgg gctggcgtaa   4576 

tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg   4636 

gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc   4696 

gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc   4756 

acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt   4816 

agagctttac ggcacctcga ccgcaaaaaa cttgatttgg gtgatggttc acgtagtggg   4876 

ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt   4936 

ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta   4996 

taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaatattt   5056 

aacgcgaatt ttaacaaaat attaacgttt acaatttcgc ctgatgcggt attttctcct   5116 

tacgcatctg tgcggtattt cacaccgcat acgcggatct gcgcagcacc atggcctgaa   5176 

ataacctctg aaagaggaac ttggttaggt accttctgag gcggaaagaa ccagctgtgg   5236 

aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa   5296 

agcatgcatc tcaattagtc agcaaccagg tgtggaaagt ccccaggctc cccagcaggc   5356 

agaagtatgc aaagcatgca tctcaattag tcagcaacca tagtcccgcc cctaactccg   5416 

cccatcccgc ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt   5476 

ttttttattt atgcagaggc cgaggccgcc tcggcctctg agctattcca gaagtagtga   5536 

ggaggctttt ttggaggcct aggcttttgc aaaaagcttg attcttctga cacaacagtc   5596 

tcgaacttaa ggctagagcc accatgattg aacaagatgg attgcacgca ggttctccgg   5656 

ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc ggctgctctg   5716 

atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc aagaccgacc   5776 

tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg ctggccacga   5836 

cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg gactggctgc   5896 

tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct gccgagaaag   5956 

tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccggct acctgcccat   6016 

tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa gccggtcttg   6076 

tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa ctgttcgcca   6136 

ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccatggc gatgcctgct   6196 

tgccgaatat catggtggaa aatggccgct tttctggatt catcgactgt ggccggctgg   6256 

gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct gaagagcttg   6316 

gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc gattcgcagc   6376 

gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg ggttcgaaat   6436 

gaccgaccaa gcgacgccca acctgccatc acgatggccg caataaaata tctttatttt   6496 

cattacatct gtgtgttggt tttttgtgtg aatcgatagc gataaggatc cgcgtatggt   6556 

gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa   6616 

cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg   6676 

tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga   6736 

gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt   6796 

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt   6856 

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat   6916 

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt   6976 

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg   7036 

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga   7096 

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc   7156 

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac   7216 

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg   7276 

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca   7336 

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg   7396 

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg   7456 

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg   7516 

gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag   7576 

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg   7636 

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct   7696 

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac   7756 

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact   7816 

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga   7876 

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt   7936 

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct   7996 

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc   8056 

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc   8116 

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc   8176 

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg   8236 

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt   8296 

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg   8356 

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg   8416 

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt   8476 

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag   8536 

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt   8596 

gctggccttt tgctcacatg gctcgacaga tct                                8629 

Sequence ID: 16
Length of Sequence: 537
Sequence Type: PRT
Scientific Name: Artificial Sequence
Other Information: Description of Artificial Sequence plasmid
 16 

Met Cys His Gln Gln Leu Val Ile Ser Trp Phe Ser Leu Val Phe Leu 
  1               5                  10                  15 

Ala Ser Pro Leu Val Ala Ile Trp Glu Leu Lys Lys Asp Val Tyr Val 
             20                  25                  30 

Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
         35                  40                  45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly Ile Thr Trp Thr Leu Asp Gln 
     50                  55                  60 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr Ile Gln Val Lys 
 65                  70                  75                  80 

Glu Phe Gly Asp Ala Gly Gln Tyr Thr Cys His Lys Gly Gly Glu Val 
                 85                  90                  95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly Ile Trp 
            100                 105                 110 

Ser Thr Asp Ile Leu Lys Asp Gln Lys Glu Pro Lys Asn Lys Thr Phe 
        115                 120                 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
    130                 135                 140 

Leu Thr Thr Ile Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145                 150                 155                 160 

Gly Ser Ser Asp Pro Gln Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
                165                 170                 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
            180                 185                 190 

Cys Gln Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro Ile 
        195                 200                 205 

Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 
    210                 215                 220 

Ser Ser Phe Phe Ile Arg Asp Ile Ile Lys Pro Asp Pro Pro Asn Asn 
225                 230                 235                 240 

Leu Gln Leu Lys Pro Leu Lys Asn Ser Arg Gln Val Glu Val Ser Trp 
                245                 250                 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
            260                 265                 270 

Phe Cys Val Gln Val Gln Gly Lys Ser Lys Arg Glu Lys Lys Asp Arg 
        275                 280                 285 

Val Phe Thr Asp Lys Thr Ser Ala Thr Val Ile Cys Arg Lys Asn Ala 
    290                 295                 300 

Ser Ile Ser Val Arg Ala Gln Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
305                 310                 315                 320 

Glu Trp Ala Ser Val Pro Cys Ser Gly Gly Gly Gly Ser Gly Gly Gly 
                325                 330                 335 

Gly Ser Pro Trp Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met 
            340                 345                 350 

Phe Pro Cys Leu His His Ser Gln Asn Leu Leu Arg Ala Val Ser Asn 
        355                 360                 365 

Met Leu Gln Lys Ala Arg Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser 
    370                 375                 380 

Glu Glu Ile Asp His Glu Asp Ile Thr Lys Asp Lys Thr Ser Thr Val 
385                 390                 395                 400 

Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn 
                405                 410                 415 

Ser Arg Glu Thr Ser Phe Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg 
            420                 425                 430 

Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp 
        435                 440                 445 

Leu Lys Met Tyr Gln Val Glu Phe Lys Thr Met Asn Ala Lys Leu Leu 
    450                 455                 460 

Met Asp Pro Lys Arg Gln Ile Phe Leu Asp Gln Asn Met Leu Ala Val 
465                 470                 475                 480 

Ile Asp Glu Leu Met Gln Ala Leu Asn Phe Asn Ser Glu Thr Val Pro 
                485                 490                 495 

Gln Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys 
            500                 505                 510 

Leu Cys Ile Leu Leu His Ala Phe Arg Ile Arg Ala Val Thr Ile Asp 
        515                 520                 525 

Arg Val Met Ser Tyr Leu Asn Ala Ser 
    530                 535