(en)Disclosed is a family of P450 monooxygenases, each member of which regioselectively oxidizes avermectin to 4″-keto-avermectin. The P450 monooxgenases find use in methods and formulations for making emamectin from avermectin. Also disclosed are methods for purifying the P450 monooxygenases of the invention, binding agents that specifically bind to the P450 monooxygenases of the invention, and genetically engineered cells that express the P450 monooxygenases of the invention. Also disclosed are ferredoxins and ferredoxin reductases that are active with the P450 monooxygenases of the invention.
1.ApplicationNumber: US-14541502-A
1.PublishNumber: US-2003068788-A1
2.Date Publish: 20030410
3.Inventor: BUCKEL THOMAS GUNTHER
HAMMER PHILIP EUGENE
HILL DWIGHT STEVEN
LIGON JAMES MADISON
DURHAM ISTVAN MOLNAR
PACHLATKO JOHANNES PAUL
ZIRKLE ROSS ERIC
4.Inventor Harmonized: BUCKEL THOMAS GUNTHER(DE)
HAMMER PHILIP EUGENE(US)
HILL DWIGHT STEVEN(US)
LIGON JAMES MADISON(US)
DURHAM ISTVAN MOLNAR(US)
PACHLATKO JOHANNES PAUL(CH)
ZIRKLE ROSS ERIC(US)
5.Country: US
6.Claims:
(en)Disclosed is a family of P450 monooxygenases, each member of which regioselectively oxidizes avermectin to 4″-keto-avermectin. The P450 monooxgenases find use in methods and formulations for making emamectin from avermectin. Also disclosed are methods for purifying the P450 monooxygenases of the invention, binding agents that specifically bind to the P450 monooxygenases of the invention, and genetically engineered cells that express the P450 monooxygenases of the invention. Also disclosed are ferredoxins and ferredoxin reductases that are active with the P450 monooxygenases of the invention.
7.Description:
(en)CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Provisional Patent Application Serial No. 60/291,149 filed May 16, 2001, the entire contents of which are hereby incorporated by reference.
BACKGROUND OF THE INVENTION
[0002] 1. Field of the Invention
[0003] The invention relates to the field of agrochemicals, and in particular, to insecticides. More specifically, this invention relates to the derivatization of avermectin, particularly for the synthesis of emamectin.
[0004] 1. Summary of the Related Art
[0005] Emamectin is a potent insecticide and controls many pests such as thrips, leafminers, and worm pests including alfalfa caterpillar, beet armyworm, cabbage looper, corn earworm, cutworms, diamondback moth, tobacco budworm, tomato fruitworm, and tomato pinworm. Emamectin (4″-deoxy-4″-epi-N-methylamino avermectin B1a/B1b) is described in U.S. Pat. No. 4,874,749 and in Cvetovich, R. J. et al, J. Organic Chem. 59:7704-7708, 1994 (as MK-244).
[0006] U.S. Pat. No. 5,288,710 describes salts of emamectin that are especially valuable agrochemically. These salts of emamectin are valuable pesticides, especially for combating insects and representatives of the order Acarina. Some pests for which emamectin is useful in combating are listed in European Patent Application EP-A 736,252.
[0007] One drawback to the use of emamectin is the difficulty of its synthesis from avermectin. This is due to the first step of the process, which is the most costly and time-consuming step of producing emamectin, in which the 4″-carbinol group of avermectin must be oxidized to a ketone. The oxidation of the 4″-carbinol group is problematic due to the presence of two other hydroxyl groups on the molecule that must be chemically protected before oxidation and deprotected after oxidation. Thus, this first step, significantly increases the overall cost and time of producing emamectin from avermectin.
[0008] Because of the efficacy and potency of emamectin as an insecticide, there is a need to develop a cost and time effective method and/or reagent for regioselectively oxidizing the 4″-carbinol group of avermectin to produce 4″-keto-avermectin, which is a necessary intermediate for producing emamectin from avermectin.
BRIEF SUMMARY OF THE INVENTION
[0009] The invention provides a novel family of P450 monooxygenases, each member of which is able to regioselectively oxidize the 4″-carbinol group of unprotected avermectin, thereby resulting in a cheap, effective method to produce 4″-keto-avermectin, a necessary intermediate in the production of emamectin. The invention allows elimination of the costly, time-consuming steps of (1) chemically protecting the two other hydroxyl groups on the avermectin molecule prior to oxidation of the 4″-carbinol group that must be chemically protected before oxidation; and (2) chemically deprotecting these two other hydroxyl groups after oxidation. The invention thus provides reagents and methods for significantly reducing the overall cost of producing emamectin from avermectin.
[0010] Accordingly, in one aspect, the invention provides a purified nucleic acid molecule encoding a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4″ of a compound of formula (II), in free form or in salt form
[0011] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0012] or a single bond and a methylene bridge of the formula
[0013] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, in order to produce a compound of the formula (III), in free form or in salt form
[0014] in which R1, R2, R3, R4, R5, R6, R7, m, n, A, B, C, D, E and F have the meanings given for formula (II).
[0015] In another aspect, the invention provides a purified nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 66% identical to SEQ ID NO: 1. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 70% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO:9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31,SEQ ID NO: 33, or SEQ ID NO: 94. In some embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 80% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO:23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 90% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 95% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0016] In some embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0017] In particular embodiments, the nucleic acid molecule is isolated from a Streptomyces strain. In certain embodiments, the Streptomyces strain is selected from the group consisting of Streptomyces tubercidicus, Streptomyces lydicus, Streptomyces platensis, Streptomyces chattanoogensis, Streptomyces kasugaensis, Streptomyces rimosus, and Streptomyces albofaciens.
[0018] In some embodiments of this aspect, the nucleic acid molecule further comprises a nucleic acid sequence encoding a tag which is linked to the P450 monooxygenase via a covalent bond. In certain embodiments, the tag is selected from the group consisting of a His tag, a GST tag, an HA tag, a HSV tag, a Myc-tag, and VSV-G-Tag.
[0019] In another aspect, the invention provides a purified polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4″ of a compound of formula (II), in free form or in salt form
[0020] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0021] or a single bond and a methylene bridge of the formula
[0022] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, in order to produce a compound of the formula (III), in free form or in salt form
[0023] in which R1, R2, R3, R4, R5, R6, R7, m, n, A, B, C, D, E and F have the meanings given for formula (II).
[0024] In another aspect, the invention provides a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 50% identical to SEQ ID NO: 2. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 60% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 70% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 90% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 95% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95.
[0025] In some embodiments of this aspect of the invention, the P450 monooxygenase comprises or consists essentially of an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, and SEQ ID NO: 95.
[0026] In certain embodiments, the P450 monooxygenase further comprises a tag. In some embodiments, the tag is selected from the group consisting of a His tag, a GST tag, an HA tag, a HSV tag, a Myc-tag, and VSV-G-Tag.
[0027] In another aspect, the invention provides a binding agent that specifically binds to a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the binding agent is an antibody. In certain embodiments, the antibody is a polyclonal antibody or a monoclonal antibody.
[0028] In yet another aspect, the invention provides a family of P450 monooxygenase polypeptides, wherein each member of the family regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 50% identical to SEQ ID NO: 2. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 60% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 70% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 90% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 95% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments of this aspect of the invention, each member of the family comprises or consists essentially of an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, and SEQ ID NO: 95.
[0029] In still another aspect, the invention provides a cell genetically engineered to comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the nucleic acid molecule is positioned for expression in the cell. In certain embodiments, the cell further comprises a nucleic acid molecule encoding a ferredoxin protein. In some embodiments, the cell further comprises a nucleic acid molecule encoding a ferredoxin reductase protein.
[0030] In certain embodiments, the cell is a genetically engineered Streptomyces strain. In some embodiments, the cell is a genetically engineered Streptomyces lividans strain. In particular embodiments, the genetically engineered Streptomyces lividans strain has NRRL Designation No. B-30478. In particular embodiments, the cell is a genetically engineered Pseudomonas strain. In some embodiments, the cell is a genetically engineered Pseudomonas putida strain. In certain embodiments, the genetically engineered Pseudomonas putida strain has NRRL Designation No. B-30479. In some embodiments, the cell is a genetically engineered Escherichia coli strain.
[0031] In another aspect, the invention provides a purified nucleic acid molecule encoding a ferredoxin, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the nucleic acid molecule encoding a ferredoxin of the invention comprises or consists essentially of a nucleic acid sequence that is at least 81% identical to SEQ ID NO: 35 or SEQ ID NO: 37. In some embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 35 or SEQ ID NO: 37. In certain embodiments, the nucleic acid molecule encoding a ferredoxin of the invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO: 35 or SEQ ID NO: 37.
[0032] In yet another aspect, the invention provides a purified ferredoxin protein, wherein the ferredoxin protein is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the ferredoxin of the invention comprises or consists essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 36 or SEQ ID NO: 38. In some embodiments, the nucleic acid molecule comprises or consists essentially of an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 36 or SEQ ID NO: 38. In particular embodiments, the ferredoxin of the invention comprises or consists essentially of the amino acid sequence of SEQ ID NO: 36 or SEQ ID NO: 38.
[0033] In another aspect, the invention provides a purified nucleic acid molecule encoding a ferredoxin reductase, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the nucleic acid molecule encoding a ferredoxin reductase of the invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104.
[0034] In yet another aspect, the invention provides a purified ferredoxin reductase protein, wherein the ferredoxin reductase protein is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the ferredoxin reductase of the invention comprises or consists essentially of the amino acid sequence of SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, or SEQ ID NO: 105.
[0035] In another aspect, the invention provides a process for the preparation a compound of the formula (I) in free form or in salt form
[0036] in which R 1 -R 9 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2, or 3; and the bonds marked with A, B, C, D, E, and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0037] or a single bond and a methylene bridge of the formula
[0038] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, which process comprises
[0039] 1) bringing a compound of the formula (II), in free form or in salt form
[0040] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, into contact with a polypeptide according to the invention that regioselectively oxidizes the alcohol at position 4″in order to form a compound of the formula (III), in free form or in salt form
[0041] in which R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the meanings given for formula (I); and
[0042] 2) reacting the compound of the formula (III) with an amine of the formula HN(R 8 )R 9 , wherein R 8 and R 9 have the same meanings as given for formula (I), and which is known, in the presence of a reducing agent; and, in each case, if desired, converting a compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, into a different compound of formula (I) or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, separating a mixture of E/Z isomers obtainable in accordance with the process and isolating the desired isomer, and/or converting a free compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, into a salt or converting a salt, obtainable in accordance with the process or by another method, of a compound of formula (I) or of an E/Z isomer or tautomer thereof into the free compound of formula (I) or an E/Z isomer or tautomer thereof or into a different salt.
[0043] In some embodiments, the compound of formula (II) is further brought into contact with a polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin. In certain embodiments, the compound of formula (II) is further brought into contact with a polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin reductase. In some embodiments, the compound of formula (II) is further brought into contact with a reducing agent (e.g., NADH or NADPH).
[0044] In still a further embodiment, the invention provides a process for the preparation of a compound of the formula (III), in free form or in salt form
[0045] in which R 1 -R 7 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0046] or a single bond and a methylene bridge of the formula
[0047] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, which process comprises bringing a compound of the formula (II), in free form or in salt form
[0048] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the same meanings as given for formula (III) above, into contact with a polypeptide according to the invention that regioselectively oxidizes the alcohol at position 4″, and maintaining said contact for a time sufficient for the oxidation reaction to occur and isolating and purifying the compound of formula (III).
[0049] In yet another embodiment, the invention provides a process according to the invention for the preparation of a compound of the formula (I), in which n is 1; m is 1; A is a double bond; B is single bond or a double bond; C is a double bond; D is a single bond; E is a double bond; F is a double bond, or a single bond and a epoxy bridge, or a single bond and a methylene bridge; R 1 , R 2 and R 3 are H; R 4 is methyl; R 5 is C 1 -C 10 alkyl, C 3 -C 8 -cycloalkyl or C 2 -C 10 -alkenyl; R 6 is H; R 7 is OH; R 8 and R 9 are independently of each other H; C 1 -C 10 -alkyl or C 1 -C 10 -acyl, or together form —(CH 2 ) q —, where q is 4, 5 or 6.
[0050] In still another embodiment, the invention provides a process according to the invention for the preparation of a compound of the formula (I), in which n is 1; m is 1; A, B, C, E and F are double bonds; D is a single bond; R 1 , R 2 , and R 3 are H; R 4 is methyl; R 5 is s-butyl or isopropyl; R 6 is H; R 7 is OH; R 8 is methyl; and R 9 is H.
[0051] In still another embodiment, the invention provides a process according to the invention for the preparation of 4″-deoxy-4″-N-methylamino avermectin B 1a /B 1b benzoate salt.
[0052] In another aspect, the invention provides a method for making emamectin. The method comprises adding a P450 monooxygenase, that regioselectively oxidizes avermectin to 4″-keto-avermectin, to a reaction mixture comprising avermectin and incubating the reaction mixture under conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4″-keto-avermectin. In some embodiments, the reaction mixture further comprises a ferredoxin. In certain embodiments, the reaction mixture further comprises a ferredoxin reductase. In some embodiments, the reaction mixture further comprises a reducing agent (e.g., NADH or NADPH).
[0053] In still another aspect, the invention provides a formulation for making a compound of formula (I) comprising a polypeptide according to the invention exhibiting a P450 monooxygenase activity that is capable of regioselectively oxidising the alcohol at position 4″ in order to form a compound of formula (II). In some embodiments, the formulation further comprises a polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin (e.g., a ferredoxin from cell or strain from which the P450 monooxygenase was isolated or derived).
[0054] In still another aspect, the invention provides a formulation for making emamectin comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the formulation further comprises a ferredoxin (e.g., a ferredoxin from cell or strain from which the P450 monooxygenase was isolated or derived). In certain embodiments, the formulation further comprises a ferredoxin reductase (e.g., a ferredoxin reductase from cell or strain from which the P450 monooxygenase was isolated or derived). In some embodiments, the formulation further comprises a reducing agent (e.g., NADH or NADPH). dr
BRIEF DESCRIPTION OF THE DRAWINGS
[0055]FIGS. 1A and 1B are schematic representations of an HPLC chromatogram (FIG. 1A) and data (FIG. 1B) showing the conversion of avermectin B1a to 4″-hydroxy-avermectin B1a and 4″-keto-avermectin B1a (also called 4″-oxo-avermectin B1a) and a side product from the biocatalysis reaction by a non-limiting P450 monooxygenase of the invention, P450 Ema1 . The HPLC chromatogram using HPLC protocol I to resolve the products is shown in FIG. 1A, and the peaks are identified in FIG. 1B by their retention times. The Y-axis of FIG. 1A shows the milli-absorbance units (mAU) at 243 nm.
[0056]FIG. 2 is a representation of an HPLC chromatogram showing the increased biocatalysis activity (ie., the ability to regioselectively oxidize avermectin to 4″-keto-avermectin) by Streptomyces tubercidicus R-922 UV Mutant as compared to wild-type Streptomyces tubercidicus R-922. The Y-axis shows the milli-absorbance units (mAU) at 243 nm.
[0057]FIG. 3 is a schematic representation of the alignment of the deduced amino acid sequences of P450 gene-specific PCR fragments derived from Streptomyces tubercidicus strain R-922 and the P450 monooxygenase from S. thermotolerans that is involved in the synthesis of carbomycin (Stol-ORFA) (GenBank Accession No. D30759) by the program Pretty from the University of Wisconsin Package version 10.1 (Altschul et al., Nucl. Acids Res. 25:3389-3402). Only the portion of Stol-ORFA that is homologous to the PCR fragments is shown. The O 2 and heme binding sites at each end are indicated. The consensus sequence is shown on the bottom line and includes only those residues that are completely conserved in all sequences compared.
[0058]FIG. 4 is a schematic representation of the alignment of the deduced amino acid sequences of P450 gene-specific PCR fragments derived from Streptomyces strain I-1529 and the P450 monooxygenase from S. thermotolerans that is involved in the synthesis of carbomycin (Stol-ORFA) (GenBank Accession No. D30759) by Pretty. Only the portion of Stol-ORFA that is homologous to the PCR fragments is shown. The O 2 and heme binding sites at each end are indicated. The consensus sequence is shown on the bottom line and includes only those residues that are completely conserved in all sequences compared.
[0059]FIG. 5 is a schematic representation of the alignment of the deduced amino acid sequence of the 600 bp P450 gene fragment from Example VIII with the amino acid sequences of peptide fragments derived from purified P450 Ema1 enzyme from Example VII.
[0060]FIG. 6 is a schematic representation of the alignment of the deduced amino acid sequence of two non-limiting P450 monooxygenases of the invention, namely from Streptomyces strains R-922 and I-1529, that are involved in emamectin biosynthesis. These are compared to the amino acid sequence of a P450 monooxygenase from S. thermotolerans that is involved in the synthesis of carbomycin (Carb-P450) (GenBank Accession No. D30759). Conserved residues in all three P450's are shown on the bottom line of the figure as the “consensus” sequence.
[0061]FIG. 7 is a diagrammatic representation showing a map of plasmid pTBBKA. Recognition sites by the indicated restriction endonucleases are shown, along with the location of the site in the nucleotide sequence of the plasmid. Also shown are genes (e.g., kanamycin resistance “KanR”), and other functional aspects (e.g., Tip promoter) contained in the plasmid.
[0062]FIG. 8 is a diagrammatic representation showing a map of plasmid pTUA1A. Recognition sites by the indicated restriction endonucleases are shown, along with the location of the site in the nucleotide sequence of the plasmid. Also shown are genes (e.g., ampicillin resistance “AmpR”) and other functional aspects (e.g., Tip promoter) contained in the plasmid.
[0063]FIG. 9 is a representation of an HPLC chromatogram showing the oxidation of avermectin to 4″-keto-avermectin by S. lividans transformed with the pTBBKA-ema1, following induction of ema1 expression with 0, 0,5, or 5.0 μg/ml thiostrepton. The Y-axis shows the milli-absorbance units (mAU) at 243 nm.
[0064]FIG. 10 is a diagrammatic representation of a phylogenetic tree showing the relationships between the seventeen ema genes described herein based on the deduced amino acid sequences of their protein products.
[0065]FIG. 11 is a diagrammatic representation showing a map of plasmid pRK-ema1/fd233. This plasmid was derived by ligating a Bg1II fragment containing the ema1 and fd233 genes organized on a single transcriptional unit into the Bg1II site of the broad host-range plasmid pRK290. The ema1/fd233 genes are expressed by the tac promoter (Ptac), and they are followed by the tac terminator (Ttac). Restriction endonuclease recognition sites shown are Bg1II “B”; EcoRI “E”; PacI “Pc”; PmeI “Pm”; and Sa1I “S.”
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0066] The patent and scientific literature referred to herein establishes knowledge that is available to those with skill in the art. The issued patents, applications, and references, including GenBank database sequences, that are cited herein are hereby incorporated by reference to the same extent as if each was specifically and individually indicated to be incorporated by reference. Any conflict between any reference cited herein and the specific teachings of this specification shall be resolved in favor of the latter. Likewise, any conflict between an art-understood definition of a word or phrase and a definition of the word or phrase as specifically taught in this specification shall be resolved in favor of the latter.
[0067] More particularly, the family of polypeptides according to the invention may be used in a process for the preparation a compound of the formula (I), in free form or in salt form
[0068] in which R 1 -R 9 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0069] or a single bond and a methylene bridge of the formula
[0070] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, which process comprises
[0071] 1) bringing a compound of the formula (II), in free form or in salt form
[0072] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, into contact with a polypeptide according to the invention which exhibits an enzymatic activity of a P450 monooxygenases and is capable of regioselectively oxidizing the alcohol at position 4″ of formula (II) in order to produce a compound of the formula (III), in free form or in salt form
[0073] in which R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the meanings given for formula (I); and
[0074] 2) reacting the compound of the formula (III) with an amine of the formula HN(R 8 )R 9 , wherein R 8 and R 9 have the same meanings as given for formula (I), and which is known, in the presence of a reducing agent; and, in each case, if desired, converting a compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, into a different compound of formula (I) or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, separating a mixture of E/Z isomers obtainable in accordance with the process and isolating the desired isomer, and/or converting a free compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, into a salt or converting a salt, obtainable in accordance with the process or by another method, of a compound of formula (I) or of an E/Z isomer or tautomer thereof into the free compound of formula (I) or an E/Z isomer or tautomer thereof or into a different salt.
[0075] Methods of synthesis for the compounds of formula (I) are described in the literature. It has been found, however, that the processes known in the literature cause considerable problems during production basically on account of the low yields and the tedious procedures which have to be used. Accordingly, the known processes are not satisfactory in that respect, giving rise to the need to make available improved preparation processes for those compounds.
[0076] The compounds (I), (II) and (III) may be in the form of tautomers. Accordingly, hereinbefore and hereinafter, where appropriate the compounds (I), (II) and (III) are to be understood to include corresponding tautomers, even if the latter are not specifically mentioned in each case.
[0077] The compounds (I), (II), and (III) are capable of forming acid addition salts. Those salts are formed, for example, with strong inorganic acids, such as mineral acids, for example perchloric acid, sulfuric acid, nitric acid, nitrous acid, a phosphoric acid or a hydrohalic acid, with strong organic carboxylic acids, such as unsubstituted or substituted, for example halo-substituted, C 1 -C 4 alkanecarboxylic acids, for example acetic acid, saturated or unsaturated dicarboxylic acids, for example oxalic, malonic, succinic, maleic, fumaric or phthalic acid, hydroxycarboxylic acids, for example ascorbic, lactic, malic, tartaric or citric acid, or benzoic acid, or with organic sulfonic acids, such as unsubstituted or substituted, for example halo-substituted, C 1 -C 4 alkane- or aryl-sulfonic acids, for example methane- or p-toluene-sulfonic acid. Furthermore, compounds of formula (I), (II), and (III) having at least one acidic group are capable of forming salts with bases. Suitable salts with bases are, for example, metal salts, such as alkali metal or alkaline earth metal salts, for example sodium, potassium or magnesium salts, or salts with ammonia or an organic amine, such as morpholine, piperidine, pyrrolidine, a mono-, di- or tri-lower alkylamine, for example ethyl-, diethyl-, triethyl- or dimethyl-propyl-amine, or a mono-, di- or tri-hydroxy-lower alkylamine, for example mono-, di- or tri-ethanolamine. In addition, corresponding internal salts may also be formed. Particularly useful, within the scope of the invention, are agrochemically advantageous salts. In view of the close relationship between the compounds of formula (I), (II) and (III) in free form and in the form of their salts, any reference hereinbefore or hereinafter to the free compounds of formula (I), (II) and (III) or to their respective salts is to be understood as including also the corresponding salts or the free compounds of formula (I), (II) and (III), where appropriate and expedient. The same applies in the case of tautomers of compounds of formula (I), (II) and (III) and the salts thereof. The free form is generally useful in each case.
[0078] Useful, within the scope of this invention, is a process for the preparation of compounds of the formula (I), in which n is 1; m is 1; A is a double bond; B is single bond or a double bond; C is a double bond; D is a single bond; E is a double bond; F is a double bond, or a single bond and a epoxy bridge, or a single bond and a methylene bridge; R 1 , R 2 and R 3 are H; R 4 is methyl; R 5 is C 1 -C 10 -alkyl, C 3 -C 8 -cycloalkyl or C 2 -C 10 -alkenyl; R 6 is H; R 7 is OH; R 8 and R 9 are independently of each other H; C 1 -C 10 -alkyl or C 1 -C 10 -acyl, or together form —(CH 2 ) q —; and q is 4, 5 or 6.
[0079] Especially useful within the scope of this invention is a process for the preparation of a compound of the formula (I) in which n is 1; m is 1; A, B, C, E and F are double bonds; D is a single bond; R 1 , R 2 , and R 3 are H; R 4 is methyl; R 5 is s-butyl or isopropyl; R 6 is H; R 7 is OH; R8 is methyl; and R 9 is H.
[0080] Very especially useful is a process for the preparation of emamectin, more particularly the benzoate salt of emamectin. Emamectin is a mixture of 4″-deoxy-4″-N-methylamino avermectin B 1a /B 1b and is described in U.S. Pat. No. 4,4874,749 and as MK-244 in J. Organic Chem. 59:7704-7708, 1994. Salts of emamectin that are especially valuable agrochemically are described in U.S. Pat. No. 5,288,710. Each member of this family of peptides exhibiting an enzymatic activity of a P450 monooxygenases as described hereinbefore is able to oxidize unprotected avermectin regioselectively at position 4″, thus opening a new and more economical route for the production of emamectin.
[0081] The family members each catalyze the following reaction:
[0082] Accordingly, the invention provides a purified nucleic acid molecule encoding a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4″ of a compound of formula (II) such as avermectin in order to produce a compound of formula (III), but especially 4″-keto-avermectin.
[0083] In particular, the invention provides a purified nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. A “nucleic acid molecule” refers to single-stranded or double-stranded polynucleotides, such as deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or analogs of either DNA or RNA.
[0084] The invention also provides a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. As used herein, by “purified” is meant a nucleic acid molecule or polypeptide (e.g., an enzyme or antibody) that has been separated from components which naturally accompany it. For example, in the case of a nucleic acid molecule, the purified nucleic acid molecule is separated from nucleotide sequences, such as promoter or enhancer sequences, that flank the nucleic acid molecule as it naturally occurs in the chromosome. In the case of a protein, the purified protein is separated from components, such as other proteins or fragments of cell membrane, that accompany it in the cell. Of course, those of ordinary skill in molecular biology will understand that water, buffers, and other small molecules may additionally be present in a purified nucleic acid molecule or purified protein preparation. A purified nucleic acid molecule or purified polypeptide (e.g., enzyme) of the invention that is at least 95% by weight, or at least 98% by weight, or at least 99% by weight, or 100% by weight free of components which naturally accompany the nucleic acid molecule or polypeptide.
[0085] According to the invention, a purified nucleic acid molecule may be generated, for example, by excising the nucleic acid molecule from the chromosome. It may then be ligated into an expression plasmid. Other methods for generating a purified nucleic acid molecule encoding a P450 monooxygenase of the invention are available and include, without limitation, artificial synthesis of the nucleic acid molecule on a nucleic acid synthesizer.
[0086] Similarly, a purified P450 monooxygenase of the invention may be generated, for example, by recombinant expression of a nucleic acid molecule encoding the P450 monooxygenase in a cell in which the P450 monooxygenase does not naturally occur. Of course, other methods for obtaining a purified P450 monooxygenase of the invention include, without limitation, artificial synthesis of the P450 monooxygenase on a peptide synthesizer and isolation of the P450 monooxygenase from a cell in which it naturally occurs using, e.g., an antibody that specifically binds the P450 monooxygenase.
[0087] Biotransformations of secondary alcohols to ketones by Streptomyces bacteria are known to be catalyzed by dehydrogenases or oxidases. However, prior to the present discovery of the cytochrome P450 monooxygenase from Streptomyces tubercidicus strain R-922 responsible for the regioselective oxidation of avermectin to 4″-keto-avermectin, no experimental data of another cytochrome P450 monooxygenase from Streptomyces to oxidize a secondary alcohol to a ketone had been reported.
[0088] According to some embodiments of the invention, the nucleic acid molecule and/or the polypeptide encoded by the nucleic acid molecule are isolated from a Streptomyces strain. Thus, the nucleic acid molecule (or polypeptide encoded thereby) may be isolated from, without limitation, Streptomyces tubercidicus, Streptomyces lydicus, Streptomyces platensis, Streptomyces chattanoogensis, Streptomyces kasugaensis, Streptomyces rimosus, or Streptomyces albofaciens.
[0089] As described below, an entire family of P450 monooxygenases capable of regioselectively oxidizing avermectin to 4″-keto-avermectin has been discovered. All of these family members are related by at least 60% identity at the amino acid level. A useful nucleic acid molecule encoding a P450 monooxygenase of the invention comprises or consists essentially of a nucleic acid sequence that is at least 70% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. In certain embodiments, the nucleic acid molecule encoding a P450 monooxygenase of the invention comprises or consists essentially of a nucleic acid sequence that is at least 80% identical; or at least 85% identical; or at least 90% identical; or at least 95% identical; or at least 98% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO:I 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0090] Similarly, the invention provides a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin which, in some embodiments, comprises or consists essentially of an amino acid sequence that is at least 60% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, the purified P450 monooxygenase of the invention comprises or consists essentially of an amino acid sequence that is at least 70% identical; or at least 80% identical; or at least 90% identical; or at least 95% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95.
[0091] In some embodiments, the nucleic acid molecule encoding a P450 monooxygenase of the invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. Similarly, the P450 monooxygenase of the invention, in some embodiments, comprises or consists essentially of the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95.
[0092] One non-limiting source of a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin is the cell-free extract described in the examples below. Another method for purifying a P450 monooxygenase in accordance with the invention is to attach a tag to the protein, thereby facilitating its purification. Accordingly, the invention provides a P450 monooxygenase which regioselectively oxidizes avermectin to 4″-keto-avermectin, wherein the P450 monooxygenase is covalently bound to a tag. The invention further provides a nucleic acid molecule encoding such a tagged P450 monooxygenase.
[0093] As used herein, a “tag” is meant a peptide or other molecule covalently bound to a polypeptide of the invention, whereby a binding agent (e.g., a polypeptide or molecule) specifically binds the tag. In accordance with the invention, by “specifically binds” is meant that the binding agent (e.g., an antibody or Ni 2+ resin) recognizes and binds to a particular polypeptide or chemical but does not substantially recognize or bind to other molecules in the sample. In some embodiments, a binding agent that specifically binds a ligand forms an association with that ligand with an affinity of at least 10 6 M −1 , or at least 10 7 M −1 , or at least 10 8 M −1 , or at least 10 9 M −1 either in water, under physiological conditions, or under conditions which approximate physiological conditions with respect to ionic strength, e.g., 140 mM NaCl, 5 mM MgCl 2 . For example, a His tag is specifically bound by nickel (e.g., the Ni 2+ -charged column commercially available as His•Bind® Resin from Novagen Inc, Madison, Wis.). Likewise, a Myc tag is specifically bound by an antibody that specifically binds Myc.
[0094] As described below, a his tag is attached to the P450 monooxygenases of the invention by generating a nucleic acid molecule encoding the His-tagged polypeptide, and expressing the polypeptide in E. coli. These polypeptides, once expressed by E. coli, are readily purified by standard techniques (e.g., using one of the His•Bind® Kits commercially available from Novagen or using the TALON™ Resin (and manufacturer's instructions) commercially available from Clontech Laboratories, Inc., Palo Alto, Calif.).
[0095] Additional tags may be attached to any or all of the P450 monooxygenases of the invention to facilitate purification. These tags include, without limitation, the HA-Tag (amino acid sequence: YPYDVPDYA (SEQ ID NO: 39)), the Myc-tag (amino acid sequence: EQKLISEEDL (SEQ ID NO: 40)), the HSV tag (amino acid sequence: QPELAPEDPED (SEQ ID NO: 41)), and the VSV-G-Tag (amino acid sequence: YTDIEMNRLGK (SEQ ID NO: 42)). Covalent attachment (e.g., via a peptide bond) of these tags to a polypeptide of the invention allows purification of the tagged polypeptide using, respectively, an anti-HA antibody, an anti-Myc antibody, an anti-HSV antibody, or an anti-VSV-G antibody, all of which are commercially available (for example, from MBL International Corp., Watertown, Mass.; Novagen Inc.; Research Diagnostics Inc., Flanders, N.J.).
[0096] The tagged P450 monooxygenases of the invention may also be tagged by a covalent bond to a chemical, such as biotin, which is specifically bound by streptavidin, and thus may be purified on a streptavidin column. Similarly, the tagged P450 monooxygenases of the invention may be covalently bound (e.g., via a peptide bond) to the constant region of an antibody. Such a tagged P450 monooxygenase may be purified, for example, on protein A sepharose.
[0097] The tagged P450 monooxygenases of the invention may also be tagged to a GST (glutathione-S-transferase) or the constant region of an immunoglobulin. For example, a nucleic acid molecule of the invention (e.g., comprising SEQ ID NO: 1) can be cloned into one of the pGEX plasmids commercially available from Amersham Pharmacia Biotech, Inc. (Piscataway N.J.), and the plasmid expressed in E. coli. The resulting P450 monooxygenase encoded by the nucleic acid molecule is covalently bound to a GST (glutathione-S-transferase). These GST fusion proteins can be purified on a glutathione agarose column (commercially available from, e.g., Amersham Pharmacia Biotech), and thus purified. Many of the pGEX plasmids enable easy removal of the GST portion from the fusion protein. For example, the pGEX-2T plasmid contains a thrombin recognition site between the inserted nucleic acid molecule of interest and the GST-encoding nucleic acid sequence. Similarly, the pGES-3T plasmid contains a factor Xa site. By treating the fusion protein with the appropriate enzyme, and then separating the GST portion from the P450 monooxygenase of the invention using glutathione agarose (to which the GST specifically binds), the P450 monooxygenase of the invention can be purified.
[0098] Yet another method to obtain a purified P450 monooxygenase of the invention is to use a binding agent that specifically binds to the P450 monooxygenase. Accordingly, the invention provides a binding agent that specifically binds to a P450 monooxygenase of the invention. This binding agent of the invention may be a chemical compound (e.g., a protein), a metal ion, or a small molecule.
[0099] In particular embodiments, the binding agent is an antibody. The term “antibody” encompasses, without limitation, polyclonal antibody, monoclonal antibody, antibody fragments (e.g., Fab, Fv, or Fab′ fragments), single chain antibody, chimeric antibody, bi-specific antibody, antibody of any isotype (e.g., IgG, IgA, and IgE), and antibody from any specifies (e.g., rabbit, mouse, and human).
[0100] In one non-limiting example, the binding agent of the invention is a polyclonal antibody. In another non-limiting example, the binding agent of the invention is a monoclonal antibody. Methods for making both monoclonal and polyclonal antibodies are well known (see, e.g., Current Protocols in Immunology, ed. John E. Coligan, John Wiley & Sons, Inc. 1993; Current Protocols in Molecular Biology, eds. Ausubel et a., John Wiley & Sons, Inc. 2000).
[0101] The P450 monooxygenases described herein that regioselectively oxidize avermectin to 4″-keto-avermectin belong to a family of novel P450 monooxygenases. Accordingly, the invention also provides a family of P450 monooxygenase polypeptides, wherein each member of the family regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, each member of the family comprises or consists of an amino acid sequence that is at least 50% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In particular embodiments, each member of the family is encoded by a nucleic acid molecule comprising or consisting of a nucleic acid sequence that is at least 66% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0102] The present invention, which provides an entire family of P450 monooxygenases, each member of which is able to regioselectively oxidize avermectin to 4″-keto-avermectin, allowed for the generation of an improved P450 monooxygenase, which may not be naturally occurring, but which regioselectively oxidizes avermectin to 4″-keto-avermectin with efficiency and with reduced undesirable side product. For instance, one of the members of the P450 monooxygenase family of the invention, P450 Ema1 enzyme catalyzes a further oxidation that is not desirable, since the formation of 3″-O-demethyl-4″-keto-avermectin has been detected in the reaction by Streptomyces tubercidicus strain R-922 and by Streptomyces lividans containing the ema1 gene. The formation of 3″-O-demethyl-4″-keto-avermectin is brought about by the oxidation of the 3″-O-methyl group, whereby the hydrolytically labile 3″-O-hydroxymethyl group is formed which hydrolyzes to form formaldehyde and the 3″-hydroxyl group.
[0103] An HPLC chromatogram showing product and side product from the reaction is shown in FIGS. 1A and 1B.
[0104] By providing a family of P450 monooxygenases that regioselectively oxidize avermectin to 4″-keto-avermectin (see, e.g., Table 3 below), individual members of the family can be subjected to family gene shuffling efforts in order to produce new hybrid genes encoding optimized P450 monooxygenases of the invention. In one non-limiting example, a portion of the ema1 gene encoding the O 2 binding site of the P450 Ema1 protein can be swapped with the portion of the ema2 gene encoding the O 2 binding site of the P450 Ema2 protein. Such a chimeric ema1/2 protein is within definition of a P450 monooxygenase of the invention.
[0105] Site-directed mutagenesis or directed evolution technologies may also be employed to generate derivatives of the ema1 gene that encode enzymes with improved properties, including higher overall activity and/or reduced side product formation. One method for deriving such a mutant is to mutate the Streptomyces strain itself, in a manner similar to the UV mutation of Streptomyces tubercidicus strain R-922 described below.
[0106] Additional derivatives may be made by making conservative or non-conservative changes to the amino acid sequence of a P450 monooxygenase. Conservative and non-conservative amino acid substitutions are well known (see, e.g., Stryer, Biochemistry, 3 rd Ed., W. H. Freeman and Co., NY 1988). Similarly, truncations of a P450 monooxygenase of the invention may be generated by truncating the protein at its N-terminus (e.g., see the ema1A gene described below), at its C-terminus, or truncating (i.e., removing amino acid residues) from the middle of the protein.
[0107] Such a mutant, derivative, or truncated P450 monooxygenase is a P450 monooxygenase of the invention as long as the mutant, derivative, or truncated P450 monooxygenase is able to regioselectively oxidize avermectin to 4″-keto-avermectin.
[0108] In another aspect, the invention provides a cell genetically engineered to comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. By “genetically engineered” is meant that the nucleic acid molecule is heterologous to the cell into which it is introduced. Introduction of the heterologous nucleic acid molecule into the genetically engineered cell may be accomplished by any means, including, without limitation, transfection, transduction, and transformation.
[0109] In certain embodiments, the nucleic acid molecule is positioned for expression in the genetically engineered cell. By “positioned for expression” is meant that the heterologous nucleic acid molecule encoding the polypeptide is linked to a regulatory sequence in such a way as to permit expression of the nucleic acid molecule when introduced into a cell. By “regulatory sequence” is meant nucleic acid sequences, such as initiation signals, polyadenylation (polyA) signals, promoters, and enhancers, which control expression of protein coding sequences with which they are operably linked. By “expression” of a nucleic acid molecule encoding a protein or polypeptide fragment is meant expression of that nucleic acid molecule as protein and/or mRNA.
[0110] A genetically engineered cell of the invention may be a prokaryotic cell (e.g., E. coli ) or a eukaryotic cell (e.g., Saccharomyces cerevisiae or mammalian cell (e.g., HeLa)). According to some embodiments of the invention, the genetically engineered cell is a cell wherein the wild-type (i.e., not genetically engineered) cell does not naturally contain the inserted nucleic acid molecule and does not naturally express the protein encoded by the inserted nucleic acid molecule. Accordingly, the cell may be a genetically engineered Streptomyces strain, such as a Streptomyces lividans or a Streptomyces avermitilis strain. Alternatively, the cell may be a genetically engineered Pseudomonas strain, such as a Pseudomonas putida strain or a Pseudomonas fluorescens strain. In another alternative, the cell may be a genetically engineered Escherichia coli strain.
[0111] Note that in some types of cells genetically engineered to comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin, the actual genetically engineered cell, itself, may not be able to convert avermectin into 4″-keto-avermectin. Rather, the P450 monooxygenase heterologously expressed by such a genetically engineered cell may be purified from that cell, where the purified P450 monooxygenase of the invention is able to regioselectively oxidize avermectin to 4″-keto-avermectin. Thus, the genetically engineered cell of the invention need not, itself, be able to regioselectively convert avermectin to 4″-keto-avermection; rather, the genetically engineered cell of the invention need only comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin, regardless of whether the P450 monooxygenase is active inside that cell.
[0112] In addition, a cell (e.g., E. coli ) geneticially engineered to comprise a nucleic acid molecule encoding P450 monooxygenase of the invention may not be able to regioselectively oxidize avermectin to 4″-keto-avermection, although the P450 monooxygenase purified from the genetically engineered cell is able to regioselectively oxidize avermectin to 4″-keto-avermectin. However, if the same cell were genetically engineered to comprise a P450 monooxygenase of the invention, a ferredoxin of the invention, and/or a ferredoxin reductase of the invention, then the P450 monooxygenase together with the ferredoxin and the ferredoxin reductase, all purified from that cell, and in the presence of a reducing agent (e.g., NADH or NADPH), would be able to regioselectively oxidize avermectin to 4″-keto-avermectin. Furthermore,-the genetically engineered cell comprising a P450 monooxygenase of the invention, a ferredoxin of the invention, and/or a ferredoxin reductase of the invention, itself would be able to carry out this oxidation.
[0113] Moreover, in a non-limiting example where a cell (e.g., E. coli ) is genetically engineered to express P450 monooxygenase, a ferredoxin, and a ferredoxin reductase proteins of the invention, all three of these proteins, when purified from the genetically engineered E. coli, are active and together are able to regioselectively oxidize avermectin to 4″-keto-avermectin (e.g., in the presence of a reducing agent, such as NADH or NADPH), and so are useful in a method for making emamectin.
[0114] In accordance with the present invention, the following material has been deposited with the Agricultural Research Service, Patent Culture Collection (NRRL), 1815 North University Street, Peoria, Ill. 61604, under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure: (1) Streptomyces lividans ZX7 (ema1/fd233-TUA1A) NRRL Designation No. B-30478; and (2) Pseudomonas putida NRRL B-4067 containing plasmid pRK290-ema1/fd233, NRRL Designation No.B-30479.
[0115] In identifying the novel family of P450 monooxygenases that regioselectively oxidize avermectin to 4″-keto-avermectin, novel ferredoxins and novel ferredoxin reductases were also discovered in the same strains of bacteria in which the P450 monooxygenases were found. Accordingly, in a further aspect, the invention provides purified nucleic acid molecule encoding a ferredoxin, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a polypeptide that regioselectively oxidizes avermectin to 4″-keto-avermectin. Similarly, the invention provides a purified nucleic acid molecule encoding a ferredoxin reductase, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a polypeptide that regioselectively oxidizes avermectin to 4″-keto-avermectin. The invention also provides a purified ferredoxin protein, as well as a purified ferredoxin reductase protein, wherein the ferredoxin protein and the ferredoxin reductase protein are isolated from a Streptomyces strain comprising a polypeptide that regioselectively oxidizes avermectin to 4″-keto-avermectin.
[0116] A useful nucleic acid molecule encoding a ferredoxin of the invention comprises or consists essentially of a nucleic acid sequence that is at least 81% identical to SEQ ID NO: 35 or SEQ ID NO: 37. Alternatively, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 35 or SEQ ID NO: 37. The nucleic acid molecule encoding a ferredoxin of the invention may comprise or consist essentially of the nucleic acid sequence of SEQ ID NO: 35 or SEQ ID NO: 37.
[0117] The ferredoxin of the invention may comprise or consist essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 36 or SEQ ID NO: 38. In some embodiments, the nucleic acid molecule comprises or consists essentially an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 36 or SEQ ID NO: 38. The ferredoxin of the invention may comprise or consist essentially of the amino acid sequence of SEQ ID NO: 36 or SEQ ID NO: 38.
[0118] A useful nucleic acid molecule encoding a ferredoxin reductase of the invention comprises or consists essentially of a nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104. The nucleic acid molecule encoding a ferredoxin reductase of the invention may comprise or consist essentially of the nucleic acid sequence of SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104. The ferredoxin reductase of the invention may comprise or consist essentially of an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103,or SEQ ID NO: 105. The ferredoxin reductase of the invention may comprise or consist essentially of the amino acid sequence of SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, or SEQ ID NO: 105.
[0119] Methods for purifying ferredoxin and ferredoxin reductase proteins and nucleic acid molecules encoding such ferredoxin and ferredoxin reductase proteins are known in the art and are the same as those described above for purifying P450 monooxygenases of the invention and nucleic acid molecules encoding P450 monooxygenases of the invention.
[0120] In one non-limiting example to obtain a purified P450 monooxygenase of the invention with a purified ferredoxin, a S. lividans strain (or P. putida strain, or any other cell in which the P450 monooxygenase of the invention does not naturally occur) may be genetically engineered to contain a first nucleic acid molecule encoding a P450 monooxygenase of the invention and a second nucleic acid molecule encoding a ferredoxin protein, where both the first and second nucleic acid molecules are positioned for expression in the genetically engineered cell. The first and the second nucleic acid molecules can be on separate plasmids, or can be on the same plasmid. Thus, the same engineered cell or strain will produce both the P450 monooxygenase of the invention and the ferredoxin protein of the invention.
[0121] In a further non-limiting example to obtain a purified P450 monooxygenase of the invention with a purified ferredoxin and with a purified ferredoxin reductase of the invention, a S. lividans strain (or P. putida strain, or any other cell in which the P450 monooxygenase of the invention does not naturally occur) may be genetically engineered to contain a first nucleic acid molecule encoding a P450 monooxygenase of the invention and a second nucleic acid molecule encoding a ferredoxin protein of the invention and a third nucleic acid molecule encoding a ferredoxin reductase protein of the invention, where all the first and second and third nucleic acid molecules are positioned for expression in the genetically engineered cell. The first and the second and the third nucleic acid molecules can be on separate plasmids, or can be on the same plasmid. Thus, the same engineered cell or strain will produce all the P450 monooxygenase of the invention and the ferredoxin protein and the ferredoxin reductase proteins of the invention.
[0122] As described above for the P450 monooxygenases of the invention, the ferredoxin protein and/or the ferredoxin reductase protein may further comprise a tag. Moreover, the invention contemplates binding agents (e.g., antibodies) that specifically bind to the ferredoxin protein, and binding agents that specifically bind to the ferredoxin reductase proteins of the invention. Methods for generating tagged ferredoxin protein, tagged ferredoxin reductase protein, and binding agents (e.g., antibodies) that specifically bind to ferredoxin or ferredoxin reductase are the same as those as described above for generating tagged P450 monooxygenases of the invention and generating binding agents that specifically bind P450 monooxygenases of the invention.
[0123] The invention also provides a method for making emamectin. In this method, a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin is added to a reaction mixture containing avermectin. The reaction mixture is then incubated under conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4″-keto-avermectin. The reaction mixture may further comprise a ferredoxin, such as a ferredoxin of the present invention. In particular embodiments, the reaction mixture further comprises a ferredoxin reductase, such as a ferredoxin reductase of the present invention. The reaction mixture may further comprise a reducing agent, such as NADH or NADPH.
[0124] Additionally, the invention provides a method for making 4″-keto-avermectin. The method comprises adding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin to a reaction mixture comprising avermectin and incubating the reaction mixture under conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4″-keto-avermectin. In some embodiments, the reaction mixture further comprises a ferredoxin, such as a ferredoxin of the present invention. The reaction mixture may also further comprise a ferredoxin reductase, such as a ferredoxin reductase of the present invention. In particular embodiments, the reaction mixture further comprises a reducing agent, such as NADH or NADPH.
[0125] The invention also provides a formulation for making emamectin comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the formulation further comprises a ferredoxin, such as a ferredoxin of the present invention. In particular embodiments, the ferredoxin is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. The formulation may further comprise a ferredoxin reductase, such as a ferredoxin reductase of the present invention. In particular embodiments, the ferredoxin reductase is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. In some embodiments, the formulation further comprises a reducing agent, such as NADH or NADPH.
[0126] In addition, the invention provides a formulation for making 4″-keto-avermectin comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the formulation further comprises a ferredoxin, such as a ferredoxin of the present invention. In particular embodiments, the ferredoxin is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. In some embodiments, the formulation further comprises a ferredoxin reductase, such as a ferredoxin reductase of the present invention. In particular embodiments, the ferredoxin reductase is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. The formulation may further comprise a reducing agent, such as NADH or NADPH.
[0127] The following examples are intended to further illustrate certain preferred embodiments of the invention and are not limiting in nature.
EXAMPLE I
Optimized Growth Conditions for Streptomyces Tubercidicus Strain R-922
[0128] In one non-limiting example, the fermentation conditions needed to provide a steady supply of cells of Streptomyces tubercidicus strain R- 922 highly capable of regioselectively oxidizing avermectin to 4″-keto-avermectin were optimized.
[0129] First, the following solutions were made. For ISP-2 agar, 4 g of yeast extract (commercially available from Oxoid Ltd, Basingstoke, UK), 4 g of D(+)-glucose, 10 g of bacto malt extract (Difco No. 0186-17-7 (Difco products commercially available from, e.g., Voigt Global Distribution, Kansas City, Mo.)), and 20 g of agar (Difco No. 0140-01) were dissolved in one liter of demineralized water, and the pH is adjusted to 7.0. The solution was sterilized at 121° C. for 20 min., cooled down, and kept at 55° C. for the time needed for the immediate preparation of the agar plates.
[0130] For PHG medium, 10 g of peptone (Sigma 0521; commercially available from Sigma Chemical Co., St. Louis, Mo.), 10 g of yeast extract (commercially available from Difco), 10 g of D-(+)-glucose, 2 g of NaCl, 0.15 g of MgSO 4 ×7 H 2 O, 1.3 g of NaH 2 PO 4 ×H 2 O, and 4.4 g of K 2 HPO 4 were dissolved in 1 liter of demineralized water, and the pH was adjusted to 7.0.
[0131] Streptomyces tubercidicus strain R-922 was grown in a Petri dish on ISP-2 agar at 28° C. This culture was used to inoculate four 500 ml shaker flasks with baffle, each containing 100 ml PHG medium. These pre-cultures were grown on an orbital shaker with 120 rpm at 28° C. for 72 hours and then used to inoculate a 10-liter fermenter equipped with a mechanical stirrer and containing 8 liters PHG medium. This main culture was grown at 28° C. with stirring at 500 rpm and with aeration of 1.75 vvm (14 l/min.) and a pressure of 0.7 bar. At the end of the exponential growth, after about 20 hours, the cells were harvested by centrifugation. The yield of wet cells was 70-80 g/l culture.
EXAMPLE II
Whole Cell Biocatalysis Assay
[0132] As determined in accordance with the present invention, the following whole cell biocatalysis assay was employed to determine that the activity from Streptomyces cells capable of regioselectively oxidizing avermectin to 4″-keto-avermectin is catalyzed by a P450 monooxygenase.
[0133] Streptomyces tubercidicus strain R-922 was grown in PHG medium, and Streptomyces tubercidicus strain I-1529 was grown in M-17 or PHG medium. PHG medium contains 10 g/l Peptone (Sigma, 0.521), 10 g/l Yeast Extract (Difco, 0127-17-9), 10 g/l D-Glucose, 2 g/l NaCl, 0.15 g/l MgSO 4 ×7 H 2 O, 1.3 g/l NaH 2 PO 4 ×1 H 2 O, and 4.4 g/l K 2 HPO 4 at pH 7.0. M-17 medium contains 10 g/l glycerol, 20 g/l Dextrin white, 10 g/l Soytone (Difco 0437-17), 3 g/l Yeast Extract (Difco 0127-17-9), 2 g/l (NH 4 ) 2 SO 4 , and 2 g/l CaCO 3 at pH 7.0
[0134] To grow the cells, an ISP2 agar plate (not older than 1-2 weeks) was inoculated and incubated for 3-7 days until good growth was achieved. Next, an overgrown agar piece was transferred (with an inoculation loop) to a 250 ml Erlenmeyer flask with 1 baffle containing 50 ml PHG medium. This pre-culture is incubated at 28° C. and 120 rpm for 2-3 days. Next, 5 ml of the pre-culture were transferred to a 500 ml Erlenmeyer flask with 1 baffle containing 100 ml PHG medium. The main culture was incubated at 28° C. and 120 rpm for 2 days. Next, the culture was centrifuged for 10 min. at 8000 rpm in a Beckman Rotor JA-14. The cells were next washed once with 50 mM potassium phosphate buffer, pH 7.0.
[0135] To perform the whole cell biocatalysis assay, 500 mg wet cells were placed into a 25 ml Erlenmeyer flask, to which were added 10 ml of 50 mM potassium phosphate buffer, pH 7.0. The cells were stirred with a magnetic stir bar to distribute the cells. Next, 15 μl of a solution of avermectin B1a in isopropanol (30 mg/ml) were added, and the mixture shaken on an orbital shaker at 160 rpm and 28° C. Strain R-922 was reacted for 2 hours, and strain I-1529 was reacted for 30 hours.
[0136] To work up the cultures in the whole cell biocatalysis assay, 10 ml methyl-t-butyl-ether was added to an Erlenmeyer flask containing the resting cells and the entire cell mixture was transferred to a 30 ml-centrifuge tube, shaken vigorously, and then centrifuged at 16000 rpm for 10 min. The ether phase was pipetted into a 50 ml pear flask, and evaporated in vacuo by means of a rotary evaporator (≦0.1 mbar). The residue was re-dissolved in 1.2 ml acetonitrile and transferred to an HPLC-sample vial. Formation of 4″-keto-avermectin B1a could be observed by HPLC analysis using HPLC protocol I.
[0137] For HPLC protocol I, the following parameters were used:
Hardware Pump: L-6250 Merck-Hitachi Autosampler: AS-2000A Merck-Hitachi Interface D-6000 Merck-Hitachi Module: Channel 1- L-7450A UV-Diode Array Detector: Merck-Hitachi Colunm Oven: none Column: 70 mm × 4 mm Adsorbent: Kromasil 100 Å-3.5 μ-C18 Gradient Mode: Low Pressure Limit: 5-300 bar Column ambient (≈20° C.) Temperature Solvent A: acetonitrile Solvent B: water Flow: 1.5 ml/min Detection: 243 nm Pump Table: 0.0 min 75% A 25% B linear gradient 7.0 min 100% A 0% B 9.0 min 100% A 0% B jump 9.1 min 75% A 25% B 12.0 min 75% A 25% B Stop time: 12 min Sampling every 200 msec Period: Retention time time References table: 2.12 min 4″-hydroxy- avermectin B1a 3.27 min avermectin B1a 3.77 min 3″-O-demethyl- 4″-keto- avermectin B1a 4.83 min 4″-keto- avermectin B1a
EXAMPLE III
Biotransformation With Cell-Free Extract From Streptomyces Strain R-922
[0138] To prepare an active cell-free extract from Streptomyces tubercidicus strain R-922 capable of regioselective oxidation of avermectin to 4″-keto-avermectin, the following solutions were made, stored at 4° C., and kept on ice when used.
Solution Formula PP-buffer 50 mM K 2 HPO 4 /KH 2 PO 4 (pH 7.0) Disruption buffer 50 mM K 2 HPO 4 /KH 2 PO 4 (pH 7.0), 5 mM benzamidine, 2 mM dithiothreitol, and 0.5 mM Pefabloc (from Roche Diagnostics) Substrate 10 mg avermectin were dissolved in 1 ml isopropanol
[0139] Six grams of wet cells from Streptomyces strain R-922 were washed in PP-buffer and then resuspended in 35 ml disruption buffer and disrupted in a French press at 4° C. The resulting suspension was centrifuged for 1 hour at 35000×g. The supernatant of the cell free extract was collected. One μl substrate was added to 499 μl of cleared cell free extract and incubated at 30° C. for 1 hour. Then, 1 ml methyl-t-butyl ether was added to the reaction mixture and thoroughly mixed. The mixture was next centrifuged for 2 min. at 14000 rpm, and the methyl-t-butyl ether phase was transferred into a 10 ml flask and evaporated in vacuo by means of a rotary evaporator. The residue was dissolved in 200 μl acetonitrile and transferred into an HPLC-sample vial.
[0140] For HPLC, the HPLC protocol I was used.
[0141] When 1 μt substrate was added to 499 μl of cleared cell free extract and incubated at 30° C., no conversion of avermectin to 4″-keto-avermectin was observed by HPLC analysis using HPLC protocol I.
[0142] However, the possibility of addition of spinach ferredoxin and spinach ferredoxin 5 reductase and NADPH to the cell free extract to restore the biocatalytic activity was explored (see, generally, D. E. Cane and E. I. Graziani, J. Amer. Chem. Soc. 120:2682, 1998).
[0143] Accordingly, the following solutions were made:
Solution Formula Substrate 10 mg avermectin were dissolved in 1 ml isopropanol Ferredoxin 5 mg ferredoxin (from spinach), solution 1-3 mg/ml in Tris/ HCl-buffer (from Fluka) or 5 mg ferredoxin (from Clostridium pasteurianum ), solution of 1-3 mg/ml in Tris/HCl-buffer (from Fluka) or 5 mg ferredoxin (from Porphyra umbilicalis ), solution of 1-3 mg/ml in Tris/HCl-buffer (from Fluka) Ferredoxin 1 mg freeze-dried ferredoxin reductase (from spinach), Reductase solution of 3.9 U/mg in 1 ml H 2 O (from Sigma) NADPH 100 mM NADPH in H 2 O (from Roche Diagnostics)
[0144] Thus, to 475 μl of cleared cell free extract the following solutions were added: 10 μl ferredoxin, 10 μl ferredoxin reductase and 1 μl substrate. After the addition of substrate to the cells, the mixture was immediately and thoroughly mixed and aerated. Then, 5 μl of NADPH were added and the mixture incubated at 30° C. for 30 min. Then, 1 ml methyl-t-butyl ether was added to the reaction mixture and thoroughly mixed. The mixture was next centrifuged for 2 min. at 14000 rpm, and the methyl-t-butyl ether phase was transferred into a 10 ml flask and evaporated in vacuo by means of a rotary evaporator. The residue was dissolved in 200 μl acetonitrile and transferred into an HPLC-sample vial, and HPLC analysis performed using HPLC protocol I.
[0145] Formation of 4″-keto-avermectin was observable by HPLC analysis. Thus, addition of spinach ferredoxin and spinach ferredoxin reductase and NADPH to the cell free extract restored the biocatalytic activity.
[0146] Upon injection of a 30 μl sample, a peak appeared at 4.83 min., indicating the presence of 4″-keto-avermectin B la. A mass of 870 D was assigned to this peak by HPLC-mass spectrometry which corresponds to the molecular weight of 4″-keto-avermectin B1a.
[0147] Note that when analyzing product formation by HPLC and HPLC-mass spectrometry, in addition to the 4″-keto-avermectin, the corresponding ketohydrate 4″-hydroxy-avermectin was also found giving a peak at 2.12 min. This finding indicated that the P450 monooxygenase converts avermectin by hydroxylation to 4″-hydroxy-avermectin, from which 4″-keto-avermectin is formed by dehydration. Interestingly, when the spinach ferredoxin was replaced by ferredoxin from the bacterium Clostridium pasteurianum or from the red alga Porphyra umbilicalis, the biocatalytic conversion of avermectin to 4″-keto-avermectin still took place, indicating that the enzyme does not depend on a specific ferredoxin for receiving reduction equivalents.
EXAMPLE IV
Isolation of a Mutant Streptomyces Strain R-922 With Enhanced Activity
[0148] To obtain strains of Streptomyces strain R-922 that have an enhanced ability to regioselectively oxidize avermectin to 4″-keto-avermectin, UV mutants were generated. To do this, spores of Streptomyces strain R-922 were collected and stored in 15% glycerol at −20° C. This stock solution contained 2×10 9 spores.
[0149] The spore stock solution was next diluted and transferred to petri plates containing 10 ml of sterile water, and the suspension was exposed to UV light in a Stratalinker UV crosslinker 2400 (commercially available from Stratagene, La Jolla, Calif.). The Stratalinker UV crosslinker uses a 254-nm light source and the amount of energy used to irradiate a sample can be set in the “energy mode.”
[0150] Through experimentation, it was determined that an exposure of 8000 microjoules of UV irradiation (254 nm) was required to kill 99.9% of the spores. This level of UV exposure was used in the mutagenesis.
[0151] Surviving UV-mutagenized spores were plated, cultured, and transferred to minimal media. Approximately 0.3-0.4% of the viable spores were determined to be auxotrophic, indicating a good level of mutagenesis in the population.
[0152] The mutagenized clones were screened for activity in the whole cell biocatalysis assay described in Example II. As shown in FIG. 2, one mutant (“R-922 UV mutant”) showed a two to three fold increase in an ability to regioselectively oxidize avermectin to 4″-keto-avermectin as compared to wild-type strain R-922. Although the gene encoding the P450 monooxygenase responsible for the regioselectively oxidation activity, ema1, is not mutated in the R-922 UV mutant, this mutant nonetheless provides an excellent source for a cell-free extract containing ema1 protein.
EXAMPLE V
Isolation of the P450 Monooxygenase from Streptomyces Strain R-922
[0153] To enrich the P450 enzyme, 35 ml of active cell free extract were filtered through a 45 μm filter and fractionated by anion exchange chromatography. Anion exchange chromatography conditions were as follows:
FPLC instrument: Ä kta prime (from Pharmacia Biotech) FPLC-column: HiTrap ™Q (5 ml) stacked onto Resource ®Q (6 ml) (from Pharmacia Biotech) eluents buffer A: 25 mM Tris/HCl (pH 7.5) buffer B: 25 mM Tris/HCl (pH 7.5) containing 1 M KCl temperature eluent bottles and fractions in ice bath, flow 3 ml/min detection UV 280 nm Pump table: 0.0 min 100% A 0% B linear gradient to 2.0 min 90% A 10% B 5.0 min 90% A 10% B linear gradient to 30.0 min 50% A 50% B linear gradient to 40.0 min 0% A 100% B 50.0 min 0% A 100% B
[0154] Enzyme activity eluted with 35%-40% buffer B. The active fractions were pooled and concentrated by centrifugal filtration through Biomax™ filters with an exclusion limit of 5 kD (commercially available from Millipore Corp., Bedford, Mass.) at 5000 rpm and then rediluted in disruption buffer containing 20% glycerol to a volume of 5 ml containing 3-10 mg/ml protein. This enriched enzyme solution contained at least 25% of the original enzyme activity.
[0155] The enzyme was further purified by size exclusion chromatography. Size exclusion chromatography conditions were as follows:
FPLC instrument: Ä kta prime (from Pharmacia Biotech) FPLC-column: HiLoad 26/60 Superdex ® 200 prep grade (from Pharmacia Biotech) sample: 3-5 ml enriched enzyme solution from the anion chromatography step sample preparation: filtered through 45 μm filter eluent buffer: PP-buffer (pH 7.0) + 0.1 M KCl temperature: 4° C. flow: 2 ml/min detection: UV 280 nm
[0156] Enzyme activity eluted between 205-235 ml eluent buffer. The active fractions were pooled, concentrated by centrifugal filtration through Biomax™ filters with an exclusion limit of 5 kD (from Millipore) at 5000 rpm, and rediluted in disruption buffer containing 20% glycerol to form a solution of 0.5-1 ml containing 2-5 mg/ml protein. This enriched enzyme solution contained 10% of the original enzyme activity. This enzyme preparation, when checked for purity by SDS page, (see, generally, Laemmli, U. K., Nature 227:680-685, 1970 and Current Protocols in Molecular Biology, supra) and stained with Coomassie blue, showed one dominant protein band with a molecular weight of 45-50 kD, according to reference proteins of known molecular weight.
EXAMPLE VI
Attempted Isolation of P450 Monooxygenase Genes From Streptomyces Strains R-922 and I-1529
[0157] Based on results described above that suggested the enzyme from strain R-922 that is responsible for the regiospecific oxidation of avermectin to 4″-keto-avermectin is a P450 monooxygenase, a direct PCR-based approach to clone P450 monooxygenase genes from this strain was initiated (see, generally, Hyun et al., J. Microbiol. Biotechnol. 8(3):295-299, 1998). This approach is based on the fact that all P450 monooxygenase enzymes contain highly conserved oxygen-binding and heme-binding domains that are also conserved at the vii, nucleotide level. PCR primers were designed to prime to these conserved domains and to amplify the DNA fragment from P450 genes using R-922 or I-1529 genomic DNA as a template. The PCR primers used are shown in Table 1.
TABLE 1 SEQ Degen- ID eracy NOs +TL,1 O 2 -Binding Domain Primers (5′ to 3′)* I A G H E T T 43 ATC GCS GGS CAC GAG ACS AC 8 44 V A G H E T T 45 GTS GCS GGS CAC GAG ACS AC 16 46 L A G H E T T 47 CTS GCS GGS CAC GAG ACS AC 16 48 L L L I A G H E T 49 TS CTS CTS ATC GCS GGS CAC GAG AC & 32 50 Heme-Binding Domain Primers (3′ to 5′)* H Q C L G Q N L A 51 GTG GTC ACG GAS CCS TGC TTG GAS CG & 8 52 F G H G V H Q C 53 AAG CCS GTG CCS CAS GTG GTC ACG 8 54 F G F G V H Q C 55 AAG GCS AAG CCS CAS GTG GTC ACG 8 56 F G H G I H Q C 57 AAG CCS GTG CCS TAG GTG GTC ACG 4 58 F G H G V H F C 59 AAG CCS GTG CCS CAS GTG AAG ACG 8 60
[0158] PCR amplification using any of the primers specific to nucleotide sequences encoding the O 2 -binding domain with any of the primers specific to nucleotide sequences encoding the heme-binding domain and genomic DNA from Streptomyces strains R-922 or I-1529 resulted in the amplification of an approximately 350 bp DNA fragment. This is exactly the size that would be expected from this PCR amplification due to the approximately 350 bp separation in P450 genes of the gene segments encoding the O 2 -binding and heme-binding sites.
[0159] The 350 bp PCR fragments were cloned into the pCR2.1-TOPO TA cloning plasmid (commercially available Invitrogen, Carlsbad, Calif.) and transformed into E. coli strain TOP10 (Invitrogen, Carlsbad, Calif.). Approximately 150 individual clones from strains R-922 and I-1529 were sequenced to determine how many unique P450 gene fragments were represented. Analysis of the sequences revealed that they included 8 unique P450 gene fragments from strain R-922 and 7 unique fragments from 1-1529.
[0160] Blast analysis (Altschul et al., J. Mol. Biol. 215:403-410, 1990) demonstrated that all of the unique P450 gene fragments from both the R-922 and I-1529 strains were derived from P450 genes and encoded the region between the O 2 -binding and heme-binding domains (see FIG. 3 for strain R-922 and FIG. 4 for strain I-1529).
[0161] Next, in order to clone the full-length genes from which the PCR fragments were derived, the DNA fragments cloned by PCR were used as hybridization probes to gene libraries containing genomic DNA from strains R-922 and I-1529. To do this, genomic DNA from the R-922 and I-1529 strains was partially digested with Sau3A I, dephosphorylated with calf intestinal alkaline phosphatase (CIP) and ligated into the cosmid plasmid pPEH215, a modified version of SuperCos 1 (commercially available from Stratagene, La Jolla, Calif.). Ligation products were packaged using the Gigapack III XL packaging extract and transfected into E. coli XL1 Blue MR host cells. Twelve cosmids that strongly hybridized to the PCR-generated P450 gene fragments were identified from the R-922 library, from which three unique P-450 genes were subcloned and sequenced. The hybridizations were performed at high stringency conditions according to the protocol of Church and Gilbert (Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995, 1984). In brief, these high stringency conditions include Hybrid Buffer containing 500 mM Na-phosphate, 1 mM EDTA, 7% SDS, 1% BSA; Wash Buffer 1 containing 40 mM Na-phosphate, 1 mM EDTA, 5% SDS, 0.5% BSA; and Wash Buffer 2 containing 40 mM Na-phosphate, 1 mM EDTA, 1% SDS (Note that other high stringency hybridizations conditions are described, for example, in Current Protocols in Molecular Biology, supra.) Nineteen strongly hybridizing cosmids were identified from the I-1529 library, and from these, four unique P-450 genes were subcloned and sequenced.
[0162] In yet a further approach to isolate diverse P450 monooxygenase genes from strains R-922 and I-1529, a known P450 gene from another bacterium was used as a hybridization probe to identify cosmid clones containing homologous P450 genes from strains R-922 and I-1529. The epoF P450 gene from Sorangium cellulosum strain So ce90 that is involved in the synthesis of epothilones (Molnar et al., Chem Biol. 7(2):97-109, 2000) was used as a probe in this effort. Using the epoF P450 gene probe, one cosmid was identified from strain R-922 (clone LC), and three were identified from strain I-1529 (clones LA, LB, and EA). In each case, the homologous gene fragment was subcloned and sequenced, and found to code for P450 monooxygenase enzymes.
[0163] However, a comparison of the 17 peptide sequences identified in Example VII (below) failed to match any of these cloned genes. Two of the peptide sequences (namely, LVKDDPALLPR (SEQ ID NO: 70) and AVHELMR (SEQ ID NO: 76)) mapped to the region between the O 2 and heme binding domains, and so these should have identified any of the partial gene fragments derived by the PCR approach. Thus, the standard approaches based on the known PCR technique of Hyun et al., supra, and using known P450 genes as hybridization probes failed to identify the gene that encodes the specific P450 monooxygenase responsible for the regioselective oxidation of avermectin. Accordingly, it was determined that additional experimentation was required to isolate the gene encoding the P450 monooxygenase of the invention.
EXAMPLE VII
Partial Sequencing of the P450 Monooxygenase from Streptomyces Strain R-922
[0164] Partial amino acid sequencing of the P450 monooxygenase from Streptomyces strain R-922 was carried out by the Friedrich Miescher Institute, Basel Switzerland. The protein of the dominant band on the SDS page was tryptically digested and the formed peptides separated and sequenced by mass spectrometry and Edman degradation (see, generally, Zerbe-Burkhardt et al., J. Biol. Chem. 273:6508, 1998). The sequence of the following 17 peptides were found:
Sequence Sequence I.D. No. HPGEPNVMDPALITDPFTGYGALR (SEQ ID NO:61) FVNNPASPSLNYAPEDNPLTR (SEQ ID NO:62) LLTHYPDISLGIAPEHLER (SEQ ID NO:63) VYLLGSILNYDAPDHTR (SEQ ID NO:64) TWGADLISMDPDR (SEQ ID NO:65) EALTDDLLSELIR (SEQ ID NO:66) FMDDSPVWLVTR (SEQ ID NO:67) LMEMLGLPEHLR (SEQ ID NO:68) VEQIADALLAR (SEQ ID NO:69) LVKDDPALLPR (SEQ ID NO:70) DDPALLPR (SEQ ID NO:71) TPLPGNWR (SEQ ID NO:72) LNSLPVR (SEQ ID NO:73) ITDLRPR (SEQ ID NO:74) EQGPVVR (SEQ ID NO:75) AVHELMR (SEQ ID NO:76) AFTAR (SEQ ID NO:77) FEEVR (SEQ ID NO:78)
[0165] Alignment of these peptides to a selection of actinomycete P450 monooxygenase sequences indicated that all the peptides were fragments of a single P450 mono-oxygenase.
EXAMPLE VIII
Cloning the P450 Monooxygenase Gene from Strain R-922 that Encodes the Enzyme Responsible for the Oxidation of Avermectin to 4″-Keto-Avermectin
[0166] PCR primers were designed by reverse translation from the amino acid sequences of several of the peptides derived from the P450 enzyme of strain R-922 (see Example VII and Table 2 below). Each of five forward primers (2aF, 2bF, 3F, 1F, and 7F) was paired with one reverse primer (5R) in PCR reactions with R-922 genomic DNA as a template. In each reaction, a DNA fragment of the expected size was produced.
TABLE 2 Expected Primer sequence and the amino acid size Primer sequence to which they were designed* Degeneracy (bp)** SEQ ID NO: 2aF P G E D N V M 64 600 79 5′-CCS GGS GAR CCS AAY GTS ATG-3′ 80 2bF A L I T D P F 32 580 81 5′-GCS CTS ATY ACS GAC CCS TTC-3′ 82 3F F M D D S P V W 32 549 83 5′-TTC ATG GAC GAC WSS CCS GTS TGG-3′ 84 1F L N Y D A P D H 32 350 85 5′-CTS AAY TAY GAC GCS CCS GAC CAC-3′ 86 7F V E Q I A D A L 32 300 87 5′-GTS GAR CAG ATY GCS GAC GCS CTS-3′ 88 5R D L I S M D P D 64 — 89 3′-CTG GAS TAR WSS TAC CTG GGS CTG-5′ 90
[0167] The 580 and 600 bp PCR fragments generated by using primers (2bF and 5R) and (2aF and 5R), respectively, were cloned into the pCR-Blunt II -TOPO cloning plasmid (commercially available from Invitrogen, Carlsbad, Calif.) and transformed into E. coli strain TOP10 (Invitrogen, Carlsbad, Calif.). The inserted DNA fragments were then sequenced. Examination of the sequences revealed that the 600 and 580 bp fragments were identical in the 580 bp of sequence that they have in common. Also, there was a perfect match between the deduced amino acid sequence derived from the nucleotide sequence of the 600 bp and 580 bp fragments and the amino acid sequences of peptides isolated from the purified P450 Ema1 enzyme that aligned in this region of the isolated gene (see FIG. 5). This result strongly suggested that the gene fragments isolated in these clones are derived from the gene that encodes the P450 Ema1 enzyme that is responsible for the oxidation of avermectin to 4″-keto-avermectin.
[0168] The 600 bp PCR fragment produced using primers 2aF (SEQ ID NO: 80) and 5R(SEQ ID No: 90) was used as a hybridization probe to a cosmid library of genomic DNA isolated from strain R-922 (cosmid library described in Example VI). Two cosmids named pPEH249 and pPEH250 were identified that hybridized strongly with the probe. The portion of each cosmid encoding the P450 enzyme was sequenced and the sequences were found to be identical between the two cosmids. The complete coding sequence of the ema1 gene was identified (SEQ ID NO: 1). The amino acid sequence of all peptide fragments from P450 Ema1 matched perfectly with the deduced amino acid sequence from the ema1 gene (see FIG. 5). Comparison of the deduced amino acid sequence of the protein encoded by the ema1 gene using BLASTP (Altschul et al., supra) determined that the closest match in the databases is to a P450 monooxygenase from S. thermotolerans that has a role in the biosynthesis of carbomycin (Arisawa et al., Biosci. Biotech. Biochem. 59(4):582-588, 1995) and whose identity with ema1 is only 49% (Identities=202/409 (49%), Positives=271/409 (65%), Gaps=2/409 (0%)). In the Blast analysis, the following settings were employed:
BLASTP 2.0.10 Lambda K H 0.322 0.140 0.428 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 375001765 Number of Sequences: 1271323 Number of extensions: 16451653 Number of successful extensions: 46738 Number of sequences better than 10.0: 2211 Number of HSP's better than 10.0 without gapping: 628 Number of HSP's successfully gapped in prelim test: 1583 Number of HSP's that attempted gapping in prelim test: 43251 Number of HSP's gapped (non-prelim): 2577 length of query: 430 length of database: 409,691,007 effective HSP length: 55 effective length of query: 375 effective length of database: 339,768,242 effective search space: 127413090750 effective search space used: 127413090750
[0169] A similar comparison of the nucleotide sequences of these two genes demonstrated that they are 65% identical at the nucleotide level. These results demonstrate that P450 Ema1 is a new enzyme.
EXAMPLE IX
Heterologous Expression of the ema1 Gene in Strentomyces lividans Strain ZX7
[0170] The coding sequence of the ema1 gene was fused to the thiostrepton-inducible promoter (tipA) (Murakami et al., J. Bacteriol. 171:1459-1466, 1989). The tipA promoter was derived from plasmid pSIT151 (Herron and Evans, FEMS Microbiology Letters 171:215-221, 1999).
[0171] The fusion of the tipA promoter and the ema1 coding sequence was achieved by first amplifying the ema1 coding sequence with the following primers to introduce a PacI cloning site at the 5′ end and a PmeI compatible end on the 3′ end.
[0172] Forward Primer: The underlined sequence is a PacI recognition sequence; the sequence in bold-face type is the start of the coding sequence of ema1.
[0173] Reverse Primer: The underlined sequence is half of a PmeI recognition sequence; the bold-face type sequence is the reverse complement of the ema1 translation stop codon followed by the 3′ end of the ema1 coding sequence.
(SEQ ID NO:92) 5′-AAACTCACCCCAACCGCACCGGCAGCGAGTTC-3″
[0174] The PacI-digested PCR fragment containing the ema1 coding sequence was cloned into plasmid pTBBKA (see FIG. 7) that was restricted (i.e., digested) with PacI and PmeI, and the ligated plasmid transformed into E. coli. Four clones were sequenced. Three of the four contained the complete and correct ema1 coding sequence. The fourth ema1 gene clone contained a truncated version of the ema1 gene. The full-length ema1 gene encodes a protein that begins with the amino acid sequence MSELMNS (SEQ ID NO: 93). The truncated gene encodes a protein that lacks the first 4 amino acids and begins with the second methionine residue. This gene has been named ema1A. The nucleotide and amino acid sequence of ema1A are provided as SEQ ID NO: 33 and SEQ ID NO: 34, respectively. The ema1 and ema1A genes in these plasmids, pTBBKA-ema1 and pTBBKA-ema1A, are in the correct juxtaposition with the tipA promoter to cause expression of the genes from this promoter.
[0175] Plasmid pTBBKA contains a gene from the Streptomyces insertion element IS117 that encodes an integrase that catalyzes site-specific integration of the plasmid into the chromosome of Streptomyces species (Henderson et al., Mol. Microbiol. 3:1307-1318, 1989 and Lydiate et al., Mol. Gen. Genet. 203:79-88, 1986). Since plasmid pTBBKA has only an E. coli replication origin and contains a mobilization site, it can be transferred from E. coli to Streptomyces strains by conjugation where it will not replicate. However, it is able to integrate into the chromosome due to the IS 117 integrase and Streptomyces clones containing chromosomal integrations can be selected by resistance to kanamycin due to the plasmid-borne kanamycin resistance gene.
[0176] The ema1 coding sequence was also cloned into other plasmids that are either replicative in Streptomyces or, like pTBBKA, integrate into the chromosome upon introduction into a Streptomyces host. For example, ema1 was cloned into plasmid pEAA, which is similar to plasmid pTBBKA but the KpnI/PacI fragment containing the tipA promoter was replaced with the ermE gene promoter (Schmitt-John and Engels, Appl Microbiol Biotechnol. 36(4):493-498, 1992). In addition, pEAA does not contain the kanamycin resistance gene. The ema1 gene was cloned into pEAA as a PacI/PmeI fragment to create plasmid pEAA-ema1 in which the ema1 gene is expressed from the constitutive ermE promoter.
[0177] Plasmid pTUA1A is a Streptomyces- E.coli shuttle plasmid (see FIG. 8) that contains the tipA promoter. The ema1 gene was also cloned into the PacI/PmeI sites in plasmid pTUA1A to create plasmid pTUA-ema1.
[0178] The ema1 A gene fragment was also ligated as a PacI/PmeI fragment into plasmids pTUA1A, and pEAA in the same way as the ema1 gene fragment to create plasmids pTUA-ema1A, and pEAA-ema1 A, respectively.
[0179] The pTBBKA, pTUA1A, and pEAA-based plasmids containing the ema1 or ema1A genes were introduced into S. lividans ZX7 and in each case transformants were obtained and verified ( S. lividans l strains ZX 7::pTBBKA-ema1 or ema1A, ZX7 (pTUA-ema1 or -ema1 A), and ZX7::pEAA-ema1 or -ema1A, respectively).
[0180] Wild-type Streptomyces lividans strain ZX7 was tested and found to be incapable of the oxidation of avermectin to 4″-keto-avermectin. Transformed S. lividans strains ZX7::pTBBKA-ema1, ZX7::pTBBKA-ema1A, ZX7 (pTUA-ema1), ZX7 (pTUA-ema1A), ZX7::pEAA-ema1, and ZX7::pEAA-ema1A were each tested for the ability to oxidize avermectin to 4″-keto-avermectin using resting cells. To do this, the whole cell biocatalysis assay described above (including analysis method) was performed. Note that for the whole cell biocatalysis assay, transformed Streptomyces lividans, like strain R-922, was grown in PHG medium and, again like strain R-922, had a reaction time of 16 hours (i.e., during which time the 500 mg transformed Streptomyces lividans wet cells in 10 ml of 50 mM potassium phosphate buffer, pH 7.0, were shaken at 160 rpm at 28° C. in the presence of 15 μl of a solution of avermectin in isopropanol (30 mg/ml)).
[0181] In the presence of the inducer, thiostrepton (5 ug/ml), the ema1- or ema1A-containing strains ZX7::pTBBKA-ema1, ZX7::pTBBKA-ema1A, ZX7 (pTUA-ema1), ZX7 (pTUA-ema1A) were found to oxidize avermectin to 4″-keto-avermectin as evidenced by the appearance of the oxidized 4″-keto-avermectin compound (see Table 3).
TABLE 3 % Conversion of Avermectin Strain 2 hour 16 hour Streptomyces lividans ZX7 + Plasmid 1 None 0 0 pTBBKA-ema1A 0.5 ± 0.059 1.17 ± 0.112 pTBBKA-ema1 0.21 ± 0.0.356 0.65 ± 0.079 pTUA-ema1 20.96 ± 1.044 42.0 ± 2.5 pEAA-ema1 3.0 ± 0.232 24.1 ± 0.358 pTBBKA-ema2 4.79 ± 0.096 9.57 ± 0.423 pTUA-ema2 0.77 ± 0.138 2.05 ± 0.537 pEAA-ema2 0.0 1.73 ± 3.00 pTBBKA-ema1/fd233 8.89 ± 0.720 30.99 ± 0.880 pTUA-ema1/fd233 23.29 ± 0.854 61.2 ± 3.548 pEAA-ema1/fd233 8.26 ± 0.845 10.66 ± 0.858 pTUA-ema2/fd233 1.85 ± 0.861 6.40 ± 1.918 Pseudomonas putida S12 + Plasmid None 0 pRK-ema1 ND 2 18 pRK-ema1/fd233 ND 32
[0182] These results conclusively demonstrate that the P450 Ema1 enzyme encoded by the ema1 gene is responsible for the oxidation of avermectin to 4″-keto-avermectin in S. tubercidicus strain R-922. Furthermore, the data demonstrates that the ema1A gene that is 4 amino acids shorter on the N-terminus than the native ema1 gene also encodes an active P450 Ema1 enzyme.
[0183] As can be seen in FIG. 9, oxidation of avermectin to 4″-keto-avermectin by S. lividans strain ZX7::pTBBKA-ema1, as detected by HPLC analysis, is variable depending upon the amount of thiostrepton used to induce expression of ema1. Note that S. lividans strains ZX7::pEAA-ema1 and ZX7::pEAA-ema1A (see Table 3) demonstrated this oxidation activity in the absence of thiostrepton since in these strains the ema1 or ema1A genes are expressed from the ermE promoter that does not require induction.
EXAMPLE X
Isolation of an ema1-Homolosous Gene From Streptomyces tubercidicus Strain I-1529
[0184] Streptomyces tubercidicus strain I-1529 was also found to be active in biocatalysis of avermectin to form the 4″-keto-avermectin derivative. The cosmid library from strain I-1529, described in Example VI, was probed at the high stringency conditions of Church and Gilbert (Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995, 1984) with the 600 bp ema1 PCR fragment produced using primers 2aF (SEQ ID NO: 80) and 5R (SEQ ID NO: 90) described previously to identify clones containing the ema1 homolog from strain I-1529. Three strongly hybridizing cosmids were identified. The P450 gene regions in two of the cosmids, pPEH252 and pPEH253, were sequenced and found to be identical. Analysis of the DNA sequence revealed the presence of a gene with high homology to the ema1 gene of strain R-922. FIG. 6 shows a comparison of the deduced amino acid sequence of Ema2 (i.e., P450 Ema2 ), Ema1 (i.e., P450 Ema1 ), and a P450 monooxygenase from Streptomyces thermotolerans that is involved in the biosynthesis of carbomycin (Carb-450) (GenBank Accession No. D30759).
[0185] The gene from Streptomyces tubercidicus strain I-1529, named ema2, encodes an enzyme with 90% identity at the amino acid level and 90.6% identity at the nucleotide level to the P450 Ema1 enzyme. The nucleotide sequence of the ema2 gene and the deduced amino acid sequence of P450 Ema2 are provided in SEQ ID NO: 3 and SEQ ID NO: 4, respectively.
[0186] The ema2 coding sequence was cloned in the same manner as the ema1 and ema1A genes into plasmids pTBBKA, pTUA1A, and pEAA such that the coding sequence was functionally fused to the tipA or ermE promoter in these plasmids. The resulting plasmids, pTBBKA-ema2, pTUA-ema2, and pEAA-ema2 were transferred from E. coli to S. lividans ZX7 by conjugation to create strains ZX7::TBBKA-ema2 and ZX7 (pTUA-ema2), and ZX7::pEAA-ema2 containing the ema2 gene integrated into the chromosome or maintained on a plasmid.
[0187] Strains ZX7::TBBKA-ema2, ZX7 (pTUA-ema2), and ZX7::pEAA-ema2 were next tested for the ability to oxidize avermectin to 4″-keto-avermectin. The ema2 gene was also shown to provide biocatalysis activity, although at a lower level compared to the ema1 gene (see Table 3).
[0188] These results demonstrate that the ema2 gene from S. tubercidicus strain I-1529 also encodes a P450 enzyme (P450 Ema2 ) capable of oxidizing avermectin to 4″-keto-avermectin.
EXAMPLE XI
Characterization of ema1 Homologs From Other Biocatalysis Strains
[0189] Seventeen Streptomyces sp. strains, including strains R-922 and I-1529, were identified that are capable of catalyzing the regiospecific oxidation of the 4″-carbinol of avermectin to a ketone. Next, the isolation and characterization of the genes encoding the biocatalysis enzyme from all of these strains was accomplished.
[0190] To do this, genomic DNA was isolated from the strains and was evaluated by restriction with several restriction endonucleases and Southern hybridization with the ema1 gene. A specific restriction endonuclease was identified for each DNA that would generate a single DNA fragment of a defined size to which the ema1 gene hybridizes. For each strain, there was only one strongly hybridizing DNA fragment, thus suggesting that other P450 genes were not detected under the high stringency hybridization conditions used in these experiments. Each DNA was digested with the appropriate restriction endonuclease, and the DNA was subjected to agarose gel electrophoresis. DNA in a narrow size range that included the size of the ema1-hybridizing fragment was excised from the gel. The size-selected DNA was ligated into an appropriate cloning plasmid and this ligated plasmid was used to transform E. coli. The E. coli clones from each experiment were screened by colony hybridization with the ema1 gene fragment to identify clones containing the ema1-homologous DNA fragment.
[0191] The nucleotide sequence of the cloned DNA in each ema1-homologous clone was determined and examined for the presence of a gene encoding a P450 enzyme with homology to ema1. In this way, ema1-homologous genes were isolated from 14 of the 15 other active strains. The nucleotide and deduced amino acid sequences of these are referenced in Table 4 as SEQ ID NOS: 5-32 and 94-95. A diagram of the relationship of these enzymes in the form of a phylogenetic tree is shown in FIG. 10. This phylogenetic tree was generated using the commercially available GCG Wisconsin software program version 1.0 (Madison, Wis.).
TABLE 4 SEQ ID NO (nucleotide and amino acid, Strain Number Gene Classification respectively) R-0922 ema1 Strept. tubercidicus 1 and 2 I-1529 ema2 Strept. tubercidicus 3 and 4 1053 ema3 Streptomyces rimosus 5 and 6 R-0401 ema4 Streptomyces lydicus 7 and 8 I-1525 ema5 Streptomyces sp. 9 and 10 DSM-40241 ema6 Strept. chattanoogensis * 11 and 12 IHS-0435 ema7 Streptomyces sp. 13 and 14 C-00083 ema8 Streptomyces albofaciens 15 and 16 MAAG-7479 ema9 Streptomyces platensis 17 and 18 A/96-1208710 ema10 Strept. kasugaensis 19 and 20 R-2374 ema11 Streptomyces rimosus 21 and 22 MAAG-7027 ema12 Strept. tubercidicus 23 and 24 Tue-3077 ema13 Streptomyces platensis 25 and 26 I-1548 ema14 Streptomyces platensis 27 and 28 NRRL-2433 ema15 Strept. lydicus 29 and 30 MAAG-0114 ema16 Streptomyces lydicus 31 and 32 DSM-40261 ema17 Streptomyces tubercidicus 94 and 95
EXAMPLE XII
Construction of His-tagged ema1 and ema1 Homologs to Facilitate Enzyme Purification
[0192] In order to purify the P450 Ema1 enzyme and the P450 enzymes encoded by the ema1 homologs from other biocatalysis strains, each of the P450 genes was cloned into the E. coli expression plasmid pET-28b(+) (commercially available from Novagen, Madison, Wis.). The pET-28 plasmids are designed to facilitate His-tag fusions at either the N-, or C-terminus and to provide strong expression of the genes in E. coli from the T7 phage promoter. In many cases, the coding sequence of the ema genes begins with the sequence ATGT. These genes were amplified by PCR such that the primers on the 5′ end incorporated a PciI recognition site (5′ ATATGT 3′) at the 5′ terminus. The last four bases of the PciI site correspond to the ATGT at the beginning of the ema gene coding sequence.
[0193] PCR primers at the 3′ end of the genes were designed to remove the translation stop codon at the end of the ema gene coding sequence and to add an XhoI recognition site to the 3′ terminus. The resulting PCR fragments were restricted with PciI and XhoI to generate PciI ends at the 5′ termini and XhoI ends at the 3′ termini, thereby facilitating cloning of the fragments into pET-28b(+) previously restricted with NcoI and XhoI. Since PciI and NcoI ends are compatible, the fragments were cloned into pET-28b(+) in the proper orientation to the T7 promoter and ribosome binding site in the plasmid to provide expression of the genes.
[0194] At the 3′ end of each ema gene, the coding sequence was fused in frame at the XhoI site to the His-tag sequence followed by a translation stop codon. This results in the production of an Ema enzyme with six histidine residues added to the C-terminus to facilitate purification on nickel columns.
[0195] In the case of ema genes in which the ATG translation initiation codon is not followed by a T nucleotide, the ema genes were amplified by PCR using a different strategy for the 5′ end. The primers at the 5′ end were designed to incorporate a C immediately preceding the ATG translation initiation codon and the primers at the 3′ end were the same as described above. The PCR fragments that were amplified were restricted with XhoI to create an XhoI end at the 3′ -terminus and the 5′ end was left as a blunt end. These fragments were cloned into pET-28b(+) that had been restricted with NcoI, but the NcoI ends were made blunt-ended by treatment with mung bean exonuclease, and restricted with XhoI.
[0196] In this manner, the ema genes were cloned into pET-28b(+) to create a functional fusion with the T7 promoter and the His-tag at the C-terminus as described previously. All His-tagged ema genes were sequenced to ensure that no errors were introduced by PCR.
[0197] Large amounts of the His-tagged P450 Ema1 and P450 Ema2 enzymes were isolated and purified by standard protocols. E. coli strain BL21 DE3 (commercially available from Invitrogen; Carlsbad, Calif.) containing the T7 RNA polymerase gene under the control of the inducible tac promoter and the appropriate pET-28/ema plasmid was cultured and the cells were harvested and lysed. The lysates were applied to Ni-NTA columns (commercially available from Qiagen Inc., Valencia, Calif.) and the protein was purified according to the procedure recommended by the manufacturer.
[0198] Purified His-tagged P450 Ema1 and P450 Ema2 were highly active in in vitro activity assays as evidenced by a high rate of conversion of avermectin to 4″-keto-avermectin.
EXAMPLE XIII
Expression of ema1 in Pseudomonas
[0199] The ema1 gene constructs were next introduced into P. putida (wildtype P. putida commercially available from the American Type Culture Collection, Manassas, Va.; ATCC Nos. 700801 and 17453). The ema1 and ema1/fd233 gene fragments were cloned as PacI/PmeI fragments into the plasmid pUK21 (Viera and Messing, Gene 100:189-194, 1991). The fragments were cloned into a position located between the tac promoter (P tac ) and terminator (T tac ) on pUK21 in the proper orientation for expression from the tac promoter. The P tac-ema 1-T tac and P tac -ema1/fd233-T tac gene fragments were removed from pUK21 as Bg1II fragments and these were cloned into the broad host-range, transmissible plasmid, pRK290 (Ditta et al., Proc. Natl. Acad. Sci. USA 77:7347-7351, 1980) to create plasmids pRK-ema1 and pRK-ema1/fd233 (FIG. 11). These plasmids were introduced into P. putida strains ATCC 700801 and ATCC 17453 by conjugal transfer from E. coli hosts by standard methodology (see, e.g., Ditta et al., Proc. Natl. Acad. Sci. USA 77:7347-7351, 1980).
[0200] P. putida ATCC 700801 and ATCC 17453 containing plasmids pRK-ema1 or pRK-ema1/fd233 were tested for the ability to catalyze the oxidation of avermectin. The results shown in Table 3 demonstrate that these strains are able to catalyze this reaction.
EXAMPLE XIV
Identification of Genes Encoding Ferredoxins That Are Active With the P450 Ema1 Monooxygenase
[0201] P450 monooxygenases require two electrons for each hydroxylation reaction catalyzed (Mueller et al., “Twenty-five years of P450 cam research: Mechanistic Insights into Oxygenase Catalysis.” Cytochrome P 450, 2 nd Edition, P. R. Ortiz de Montellano (ed.), pp. 83-124; Plenum Press, NY 1995). These electrons are transferred to the P450 monooxygenase one at a time by a ferredoxin. The electrons are ultimately derived from NAD(P)H and are passed to the ferredoxin by a ferredoxin reductase. Specific P-450 monooxygenase enzymes have a higher activity when they interact with a specific ferredoxin. In many cases, the gene encoding a ferredoxin that interacts specifically with a given P450 monooxygenase is located adjacent to the gene encoding the P450 enzyme.
[0202] As described above, in addition to the ema1 gene, four P450 genes from strain R-922 and seven P450 genes from strain I-1529 (see Example VI) were isolated and sequenced. In some of these, there was sufficient sequence information about the DNA flanking the P-450 genes to look for the presence of associated ferredoxin genes. By this approach, two unique ferredoxin genes were identified from each of the two strains. Ferredoxin genes fd229 and fd230 were identified from strain R-922, and fd233 and fdEA were identified from strain I-1529. In addition, a ferredoxin reductase gene was found to reside adjacent to the fdEA gene from strain I-1529. In order to test the biological activity of each of these ferredoxins in combination with P450 Ema1 , each individual ferredoxin gene was amplified by PCR to produce a gene fragment that included a blunt 5′-end, the native ribosome-binding site and ferredoxin gene coding sequence, and a PmeI restriction site on the 3′-end. Each such ferredoxin gene fragment was cloned into the PmeI site located 3′ to the ema1 gene in plasmid pTUA-ema1. In this way, artificial operons consisting of the ema1 gene and one of the ferredoxin genes operably linked to a functional promoter were created.
[0203] In the case of the fdEA ferredoxin gene in which a ferredoxin reductase gene, freEA, was found to be located adjacent to the fdEA gene, a DNA fragment containing both the fdEA and freEA genes was generated by a similar PCR strategy. This gene fragment was also cloned in the PmeI site of plasmid pTUA-ema1 as described for the other ferredoxin genes.
[0204] Each ema1-ferredoxin gene combination was tested for biological activity by introduction of the individual ema1-ferredoxin gene plasmids into S. lividans strain ZX7. The biocatalysis activity derived from each plasmid in S. lividans was determined. Of the four different constructs, only the ferredoxin gene fd233 derived from strain I-1529 provided increased activity when compared to the expression of ema1 alone in the same plasmid and host background (see Table 3). The pTUA-ema1/fd233 plasmid in S. lividans gave approximately 1.5 to 3 fold higher activity compared to the pTUA-ema1 plasmid. The other three plasmids containing the other ferredoxin genes provided results essentially the same as the plasmid with only the ema1 gene. Likewise, the pTUA-ema1/fdEA/freEA plasmid did not yield results different from those of pTUA-ema1. The nucleotide and deduced amino acid sequences of the fd233 gene are shown in SEQ ID NOs: 35 and 36, respectively.
[0205] A BLAST analysis of the nucelotide and amino acid sequences of fd233 revealed that the closest matches were to ferredoxins from S. coelicolor (GenBank Accession AL445945) and S. lividans (GenBank Accession AF072709). At the nucleotide level, fd233 shares 80 and 79.8% identity with the ferredoxin genes from S. coelicolor and S. lividans, respectively. At the peptide level, fd233 shares 79.4 and 77.8% identity with the ferredoxins from S. coelicolor and S. lividans, respectively.
[0206] Since fd233 is derived from strain I-1529 and ema1 is from strain R-922, the proteins encoded by the two genes cannot interact with each other in nature. In an approach designed to identify a ferredoxin gene from strain R-922 that is homologous to the fd233 gene and that might encode a ferredoxin that interacts optimally with the P450 Ema1 , the fd233 gene was used as a hybridization probe to a gene library of DNA from strain R-922. A strongly hybridizing cosmid, pPEH232, was identified and the hybridizing DNA was cloned and sequenced. Comparison of the deduced amino acid sequences from fd233 and the ferredoxin gene on cosmid pPEH232,fd232, revealed that they differed in only a single amino acid.
[0207] In a similar manner, plasmid pTUA-ema1-fd232 was constructed and tested in S. lividans ZX7. This plasmid gave similar results as those obtained with plasmid pTUA-ema1-fd233 (see Table 3). The nucleotide and deduced amino acid sequences of fd232 are shown in SEQ ID NOs: 37 and 38, respectively.
[0208] The ema1-fd233 operon was also subcloned, as a PacI-PmeI fragment, into pTBBKA and pEAA that had been digested with the same restriction enzymes. S. lividans ZX7::pTBBKA-ema1-fd233, and S. lividans ZX7::pEAA-ema1-fd233 were tested in the avermectin conversion assay and found to have higher activities than the strains harboring the ema1 gene alone in the comparable plasmids (see Table 3).
EXAMPLE XV
Heterologous Expression of P450 Ema1 and P450 Ema2 in Other Cells
[0209] The expression constructs pRK-ema1 (Example XIII) and pRK-ema2 (created in a way analogous to that described in Example XIII for pRK-ema1) were mobilized by conjugation into three fluorescent soil Pseudomonas strains. Conjugation was performed according to standard methods (see, e.g., Ditta et al., Proc. Natl. Acad. Sci. USA 77:7347-7351, 1980). The strains were: P. fluorescens MOCG134, P. fluorescens Pf-5, and P. fluorescens CHAO. Standard resting cell assays for the conversion of avermectin to 4″-ketoavermectin were conducted for each of the transconjugants. For strains Pf-5 and CHAO, the levels of conversion were below the detection limit. Strain MOCG134 yielded 3% conversion for ema1 and 5% for ema2.
[0210] In addition, the constructs listed in the Table 5 were introduced into Streptomyces avermitilis MOS-0001 by protoplast-mediated transformation (see, e.g., Kieser, T.; Bibb, M. J.; Buttner, M. J.; Chater, K. F.; Hopwood, D. A. (eds.): Practical Streptomyces Genetics. The John Innes Foundation, Norwich (England), 2000); Stutzman-Engwall, K. et al. (1999) Streptomyces avermitilis gene directing the ratio of B 2 :B 1 avermectins, WO 99/41389).
TABLE 5 Construct % Conversion of avermectin, 16 hrs None 0 pTBBKA-ema1 10.90 +/− 3.48 pTUA-ema1 5.326 +/− 2.19 pEAA-ema1 6.74 +/− 0.08 pTBBKA-ema1A/fd233 28.50 +/− 0.20 pTUA-ema1A/fd233 23.97 +/− 5.95
[0211] Wild-type Str. avermitilis MOS-0001 was tested and found to be incapable of the oxidation of avermectin to 4″-ketoavermectin.
[0212] Transformed S. avermitilis strains MOS-0001::pTBBKA-ema1, MOS-0001 (pTUA-ema1), MOS-0001::pEAA-ema1, MOS-0001::pTBBKA-ema1A-fd233, and MOS-0001 (pTUA-ema1A-fd233) were each tested for their ability to oxidize avermectin to 4″-keto-avermectin using resting cells. To do this, the whole cell biocatalysis assay described above (including analysis method) was performed. Note that for the whole cell biocatalysis assay, transformed Streptomyces avermitilis, like strain R-922, was grown in PHG medium and, again like strain R-922, had a reaction time of 16 hours (i.e., during which time the 500 mg transformed Streptomyces avermitilis wet cells in 10 ml of 50 mM potassium phosphate buffer, pH 7.0, were shaken at 160 rpm at 28° C. in the presence of 15 μl of a solution of avermectin in isopropanol (30 mg/ml)).
[0213] As shown in Table 5, in the presence of the inducer, thiostrepton (5 μg/ml), the ema1- or ema1A-fd233-containing strains MOS-0001::pTBBKA-ema1, MOS-0001::pTBBKA-ema1A-fd233, MOS-0001 (pTUA-ema1), MOS-0001 (pTUA-ema1A-fd233) were found to oxidize avermectin to 4″-keto-avermectin as evidenced by the appearance of the oxidized 4″-keto-avermectin compound. Note that the S. avermitilis strain MOS-0001::pEAA-ema1 demonstrated this oxidation activity in the absence of thiostrepton since in this strain the ema1 gene is expressed from the ermE promoter that does not require induction.
[0214] Thus, expression of the ema1 P450 monooxygenase gene in various Streptomyces and Pseudomonas strains provided recombinant cells that were able to convert avermectin to 4″-ketoavermectin in resting cell assays.
[0215] Next, expression and activity of P450 Ema1 monooxygenase was tested in E. coli. To do this, the ema1 gene was cloned into the E. coli expression plasmid pET-28b(+) (commercially available from Novagen, Madison, Wis.) as described previously. E. coli strain BL21 DE3 (commercially available from Invitrogen; Carlsbad, Calif.) that contains the T7 RNA polymerase gene under control of the inducible tac promoter and the pET-28/ema1 plasmid was cultured in 50 ml LB medium containing 5 mg/l kanamycin in a 250-ml flask with one baffle, for 16 hours at 37° C., with shaking at 130 rpm. 0.5 ml of this culture was used to inoculate 500 ml LB medium with 5 mg/l kanamycin in a 2-liter flask with one baffle, and the culture was incubated for 4 hours at 37° C. followed by 4 hours and 30° C., with shaking at 130 rpm throughout. The cells were harvested by centrifugation, washed in 50 mM potassium phosphate buffer, and centrifuged again.
[0216] For the resting cell assays, 90 mg wet cells were weighed into deep-well plates in triplicate and resuspended in 0.5 ml 50 mM potassium phosphate buffer. For cell-free extracts, 4 grams wet cells in 8 ml disruption buffer were disrupted in French press.
[0217] For the resting cell assays, 5 μl of substrate (2.5 mg/ml in 2-propanol) was added to the cell suspension. The plate was sealed with air permeable foil, and the reaction was incubated on an orbital shaker at 1000 rpm at 28° C. for 22 hours. No conversion of avermectin to 4″-ketoavermectin was detected.
[0218] For the cell-free assays, 100 μl cell free extract, 1 μl substrate solution (20 mg/ml) in 2-propanol, 5 μl 100 mM NADPH, 10 μl ferredoxin, 10 μl ferredoxin reductase, and 374 μl potassium phosphate buffer pH 7.0 were added as described in Example III, and the assay was incubated at 30° C. with shaking at 600 rpm for 20 hours. 9.2% +/−0.3% of avermectin was converted to 4″-ketoavermectin.
[0219] Thus, expression of the ema1 gene in E. coli resulted in the production of the active Ema1 P450 monooxygenase enzyme which, when purified from the cells, was able to convert avermectin to 4″-ketoavermectin.
EXAMPLE XVI
Identification and Cloning of Genes Encoding Ferredoxin Reductases that Support Increased Activity of the P450 Ema1 Monooxygenase
[0220] The electron transport pathway that supports the activity of P450 monooxygenases also includes ferredoxin reductases. These proteins donate electrons to the ferredoxin and, as is the case with ferredoxins and P450 monooxygenases, specific ferredoxin reductases are known to be better electron donors for certain ferredoxins than others.
[0221] Accordingly, a number of ferredoxin reductase genes from Streptomyces strains were cloned and were evaluated for their impacts on the biocatalysis reaction. To do this, numerous bacterial ferredoxin reductase (Fre) protein sequences were retrieved from NCBI and aligned with the program Pretty from the GCG package. Two conserved regions, approximately 266 amino acid residues apart, were used to make degenerate oligonucleotides for PCR. The forward primer (CGSCCSCCSCTSWSSAAS (SEQ ID NO: 96; where “S” is C or G; and “W” is A or G)) and the reverse primer (SASSGCSTTSBCCCARTGYTC (SEQ ID NO: 97; where “S” is C or G; “B” is C, G, or T; “R” is A or G; and “Y” is C or T)) were used to amplify 800 bp products from the biocatalytically active Streptomyces strains R-922 and I-1529. These pools of products were cloned into TOPO TA cloning vectors (commercially available from Invitrogen Inc., Carlsbad, Calif.), and 20 clones each from R922 and I-1529 were sequenced according to standard methods (see, e.g., Current Protocols in Molecular Biology, eds. Ausubel et al., John Wiley & Sons, Inc. 2000). Sequencing revealed that 4 unique fre gene fragments were isolated from the strains: three from (fre3,fre12,fre14) and one from I-1529 (fre16). The fre3,fre12,fre14, and fre16 gene fragments were used as probes to identify full-length ferredoxin reductases from genomic clone banks of Streptomyces strains R922 and I-1529. By this approach, the complete coding sequence of each of the 4 different fre genes was cloned and sequenced. The nucleic acid and amino acid sequences are provided as follows:fre3 (SEQ ID NOs: 98 and 99);fre12 (SEQ ID NOs: 100 and 101);fre14 (SEQ ID NOs: 102 and 103); and fre16 (SEQ ID NOs: 104 and 105).
[0222] In order to assess the biological activity of each fre gene in relation to the activity of Ema1, each gene was inserted into the ema1/fd233 operon described above, 3′ to the fd233 gene. This resulted in the formation of artificial operons consisting of the ema1,fd233, and individual fre genes that were expressed from the same promoter. The ema1/fd233/fre operons were cloned into the Pseudomonas plasmid pRK290 and introduced into 3 different P. putida strains. These strains were then analysed for Ema1 biocatalysis activity using the whole cell assay and one of the genes, the fre gene fre16 from strain I-1529, was found to increase the activity of P450 Ema1 monooxygenase by approximately 2-fold. This effect was strain specific, as it was seen only in one of the P. putida strains, ATCC Desposit No. 17453, and not in the other two. In P. putida strain ATCC 17453, the presence of fre gene fre16 resulted in 44% conversion of avermectin to 4″-keto-avermectin, as compared to 23% without this gene. The other fre genes had no impact on the biocatalysis activity in any of the P. putida strains tested.
[0223] In a similar approach, each of the ema1/fd233/fre operons were cloned into the Streptomyces plasmids pTUA, pTBBKA, and pEAA, and introduced into S. lividans strain ZX7. In each case there was no impact in S. lividans by any of the fre genes on biocatalysis activity.
EQUIVALENTS
[0224] Those skilled in the art will recognize, or be able to ascertain, using no more than routine experimentation, numerous equivalents to the specific substances and procedures described herein. Such equivalents are considered to be within the scope of this invention.
1
105
1
1293
DNA
Streptomyces tubercidicus
1
atgtcggaat taatgaactc tccgttcgcc gcgcacgtcg ggaaacaccc gggcgagccg 60
aatgtgatgg accccgccct gatcaccgac ccgttcaccg gctacggcgc gctgcgtgag 120
cagggcccgg tcgtacgggg ccggttcatg gacgactcgc ccgtctggct ggtgacgcgg 180
ttcgaggagg tccgccaggt cctgcgcgac cagcggttcg tgaacaatcc ggcctcgccg 240
tccctgaact acgcgcccga ggacaacccg ctgacccggc tgatggagat gctgggcctc 300
cccgagcacc tccgcgtcta cctgctcgga tcgatcctca actacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgggcg ttcacggccc gcaagatcac cgacctgcgg 420
ccccgggtcg agcagatcgc cgacgcgctg ctggcccggc tgcccgagca cgccgaggac 480
ggcgtcgtcg acctcatcca gcacttcgcc taccccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgaa cgtggggcgc cgacctcatc 600
tcgatggatc cggaccggct cggcgcctcg ttcccggcga tgatcgagca catccatcag 660
atggtccggg aacggcgcga ggcgctcacc gacgacctgc tcagcgaact gatccgcacc 720
catgacgacg acggcgggcg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780
gtcctcgccg gccacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg tctggtcaag gacgatccgg ccctcctccc ccgtgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cacatgaccc agctgcgcta cgccaccgcc 960
gacgtcgacc tcgccggcac accgatccgc cagggcgatg ccgttcaact catcctggta 1020
tcggccaact tcgacccccg tcactacacc gaccccgacc gcctcgatct cacccggcac 1080
cccgcgggcc acgccgagaa ccatgtgggt ttcggccatg gagcgcacta ctgcctgggc 1140
gccacactcg ccaaacagga aggtgaagtc gccttcggca aactgctcac gcactacccg 1200
gacatatcgc tgggcatcgc cccggaacac ctggagcgga caccgctgcc gggcaactgg 1260
cggctgaact cgctgccggt gcggttgggg tga 1293
2
430
PRT
Streptomyces tubercidicus
2
Met Ser Glu Leu Met Asn Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Thr Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ser Pro
65 70 75 80
Ser Leu Asn Tyr Ala Pro Glu Asp Asn Pro Leu Thr Arg Leu Met Glu
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly
195 200 205
Ala Ser Phe Pro Ala Met Ile Glu His Ile His Gln Met Val Arg Glu
210 215 220
Arg Arg Glu Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Val Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Thr Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Pro Ile Arg Gln Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Thr His Tyr Pro
385 390 395 400
Asp Ile Ser Leu Gly Ile Ala Pro Glu His Leu Glu Arg Thr Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Val Arg Leu Gly
420 425 430
3
1293
DNA
Streptomyces tubercidicus
3
atgtcggcat tatccagctc tccgttcgct gcgcatgtcg ggaaacaccc gggtgagccg 60
aatgtgatgg agccggcgct gctcaccgac ccgttcgcgg gctacggcgc gctgcgtgag 120
caggccccgg tcgtacgggg ccggttcgtg gacgactcac cggtctggtt cgtgacgcgc 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggccgcgccg 240
cccctggccc catcggccga ggagaacccg ctgaccaggc tgatggacat gctgggcctc 300
cccgagcacc tccgcgtcta catgctcggg tcgattctca actacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgcgcg ttcacggcgc ggaagatcac cgatctgcga 420
ccgcgtgtcg agcagatcgc cgacgagctg ctggcccgcc tccccgagta cgccgaggac 480
ggcgtcgtcg acctcatcca gcatttcgcc tacccgctgc cgatcaccgt catctgcgag 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgga agtggggcgc cgacctcatc 600
tcgatggacc cggaccggct cggcgcaacg ttcccggcga tgatcgagca catccatgag 660
atggtccggg agcggcgcgc ggcgctcacc gatgatctgc tcagcgagct gatccgtacc 720
catgacgacg atggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780
gtcctcgccg gtcacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc ccgggccgtc 900
catgaactga tgcgctggtg cgggccggtg cagatgacgc agctgcgcta cgcggccgcc 960
gacgtcgacc tcgccggtac gcggatccac aagggcgacg ccgtacaact cctcctggtt 1020
gcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgatct gacgcgtcac 1080
cccgccggcc acgccgagaa ccatgtgggt ttcggccacg gtgcgcatta ctgcctgggt 1140
gccaccctcg ccaagcagga gggcgaagtc gcgttcggca agctgctcgc gcactacccg 1200
gagatgtccc tgggcatcga accggaacgt ctggagcgat tgccgctgcc tggcaactgg 1260
cggctgaatt ccctgccgtt gcggctgggg tga 1293
4
430
PRT
Streptomyces tubercidicus
4
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Leu Thr Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Ala Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Pro Leu Ala Pro Ser Ala Glu Glu Asn Pro Leu Thr Arg Leu Met Asp
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Met Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu Tyr Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly
195 200 205
Ala Thr Phe Pro Ala Met Ile Glu His Ile His Glu Met Val Arg Glu
210 215 220
Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ala Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Arg Ile His Lys Gly Asp Ala Val Gln
325 330 335
Leu Leu Leu Val Ala Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Glu Met Ser Leu Gly Ile Glu Pro Glu Arg Leu Glu Arg Leu Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Leu Arg Leu Gly
420 425 430
5
1413
DNA
Streptomyces rimosus
5
atgaccacat cgcccaccga gtcccgggcg gccaccccgc ccgactccac cgcctccccc 60
tcgaccgctt ccgccccggc caccacccct tcggccgccg cctctccgga caccaccgac 120
cgcaccacgc tcccctccta cgtcggcctc cacccgggcg agccgaacct gatggaaccg 180
gagctgctgg agaacccgta caccggctac ggcacgctgc gcgagcaggc cccgctcgtc 240
cgcgcccggt tcatcgacga ctcgcccatc tggctggtga cccgcttcga cgtggtgcgc 300
gaggtgatgc gtgaccagcg gttcgtcaac aacccgaccc tggtgcccgg catcggcgcg 360
gacaaggacc cgcgtgcccg gctgatcgag ctgttcggca tccccgagga cctggccccg 420
tacctcaccg acaacatcct caccagcgac ccgccggacc acacccggct gcgccgcctg 480
gtctcccgcg ccttcaccgc acgccgtatc caggacctgc ggccgcgcgt cgagcggatc 540
accgacgagc tgctggaacg gctgccggac catgccgagg acggcgtcgt cgacctcgtc 600
gagcacttcg cctacccgct gcccatcacg gtcatctgcg agctggtcgg catcgacgag 660
gaggatcggg cgctgtggcg gcggttcggc gccgacctcg cctcgctgaa ccccaagcgc 720
atcggcgcca ccatgccgga gatgatctcg cacatccacg agctgatcga cgaacggcgc 780
gcggccctgc gggacgacct gctcagcggg ctcatccggg cgcaggacga cgacggcggc 840
cggctgagcg acgtcgagat ggtcaccctg gtcctgaccc tggtactggc cggtcacgag 900
accaccgccc acctcatcag caacggcacc ctcgccctgc tcacccaccc cgaccagcgg 960
cggctgatcg acgaggaccc ggcgctgctg ccgcgcgcgg tccacgagct gatgcgctgg 1020
tgcgggccga tccaggccac ccagcttcgg tacgccctgg aggacaccga ggtggccgga 1080
gtccaggtcc gccagggcga ggccctgatg ttcagcctcg tcgcggccaa ccacgacccg 1140
cgccactaca ccgggccgga gcggctcgac ctgacgcggc agccggccgg ccgcgccgag 1200
gaccacgtcg gcttcggcca cggcatgcac tactgcctgg gtgcctcact cgcccggcag 1260
gaggccgagg tggcctacgg gaagctgctc acccgctacc cggacctggc gctcgccctc 1320
accccggaac agttggagga ccaggaacgc ctgcggcagc ccggcacctg gcgcctgcga 1380
cggctgccgc tgaggctgca cgcgcagagc tga 1413
6
470
PRT
Streptomyces rimosus
6
Met Thr Thr Ser Pro Thr Glu Ser Arg Ala Ala Thr Pro Pro Asp Ser
1 5 10 15
Thr Ala Ser Pro Ser Thr Ala Ser Ala Pro Ala Thr Thr Pro Ser Ala
20 25 30
Ala Ala Ser Pro Asp Thr Thr Asp Arg Thr Thr Leu Pro Ser Tyr Val
35 40 45
Gly Leu His Pro Gly Glu Pro Asn Leu Met Glu Pro Glu Leu Leu Glu
50 55 60
Asn Pro Tyr Thr Gly Tyr Gly Thr Leu Arg Glu Gln Ala Pro Leu Val
65 70 75 80
Arg Ala Arg Phe Ile Asp Asp Ser Pro Ile Trp Leu Val Thr Arg Phe
85 90 95
Asp Val Val Arg Glu Val Met Arg Asp Gln Arg Phe Val Asn Asn Pro
100 105 110
Thr Leu Val Pro Gly Ile Gly Ala Asp Lys Asp Pro Arg Ala Arg Leu
115 120 125
Ile Glu Leu Phe Gly Ile Pro Glu Asp Leu Ala Pro Tyr Leu Thr Asp
130 135 140
Asn Ile Leu Thr Ser Asp Pro Pro Asp His Thr Arg Leu Arg Arg Leu
145 150 155 160
Val Ser Arg Ala Phe Thr Ala Arg Arg Ile Gln Asp Leu Arg Pro Arg
165 170 175
Val Glu Arg Ile Thr Asp Glu Leu Leu Glu Arg Leu Pro Asp His Ala
180 185 190
Glu Asp Gly Val Val Asp Leu Val Glu His Phe Ala Tyr Pro Leu Pro
195 200 205
Ile Thr Val Ile Cys Glu Leu Val Gly Ile Asp Glu Glu Asp Arg Ala
210 215 220
Leu Trp Arg Arg Phe Gly Ala Asp Leu Ala Ser Leu Asn Pro Lys Arg
225 230 235 240
Ile Gly Ala Thr Met Pro Glu Met Ile Ser His Ile His Glu Leu Ile
245 250 255
Asp Glu Arg Arg Ala Ala Leu Arg Asp Asp Leu Leu Ser Gly Leu Ile
260 265 270
Arg Ala Gln Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val
275 280 285
Thr Leu Val Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His
290 295 300
Leu Ile Ser Asn Gly Thr Leu Ala Leu Leu Thr His Pro Asp Gln Arg
305 310 315 320
Arg Leu Ile Asp Glu Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu
325 330 335
Leu Met Arg Trp Cys Gly Pro Ile Gln Ala Thr Gln Leu Arg Tyr Ala
340 345 350
Leu Glu Asp Thr Glu Val Ala Gly Val Gln Val Arg Gln Gly Glu Ala
355 360 365
Leu Met Phe Ser Leu Val Ala Ala Asn His Asp Pro Arg His Tyr Thr
370 375 380
Gly Pro Glu Arg Leu Asp Leu Thr Arg Gln Pro Ala Gly Arg Ala Glu
385 390 395 400
Asp His Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Ser
405 410 415
Leu Ala Arg Gln Glu Ala Glu Val Ala Tyr Gly Lys Leu Leu Thr Arg
420 425 430
Tyr Pro Asp Leu Ala Leu Ala Leu Thr Pro Glu Gln Leu Glu Asp Gln
435 440 445
Glu Arg Leu Arg Gln Pro Gly Thr Trp Arg Leu Arg Arg Leu Pro Leu
450 455 460
Arg Leu His Ala Gln Ser
465 470
7
1293
DNA
Streptomyces lydicus
7
atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60
aacgtgatgg atccggcgct gatcggggat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240
gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgatctgc tcagcgagct gatccggacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgacccgcg ccactacacc gaccccgacc ggctggacct gacccggcac 1080
cctgccggcc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggaggcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaacg cgctgccgct gcgtctgcgc tga 1293
8
430
PRT
Streptomyces lydicus
8
Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala
65 70 75 80
Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Thr
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Thr Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Glu Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Arg
420 425 430
9
1299
DNA
Streptomyces
9
atgtcagcct tatccagctc tccgttcgcc gagcacatag ggaaacaccc gggcgagccg 60
aacgtgatgg aaccggctct gatcaacgat ccgttcggcg gctacggcgc gctgcgcgag 120
caggggccgg ttgtgcgtgg ccggttcatg gacgactcgc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggcgtcgccg 240
ctcctgggca gtcaggtcga ggagatgccg atggtcaagc tgctggagca gatgggcctc 300
cccgagcacc ttcgggtcta tctgctcgga tcgatcctca acagtgacgc ccccgatcac 360
acccggcttc gccgcctcgt ctcgcgggcc ttcaccgcac gtaagatcac cggtctgcgg 420
ccgcgcgtcg agcagatcgc cgacgagctg ctggcccggc tccccgagca cgccgaggac 480
ggcgtcgtcg acctcatcca gcacttcgcc tacccgctgc cgatcacggt catctgcgaa 540
ctggtcggca tacccgaagc cgatcgcccg caatggcgcg catggggcgc cgacctcgtg 600
tcactggagc cggacaagct cagcacgtcg ttcccggcga tgatcgacca cacccatgaa 660
ctgatccgcc aacggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgtgcc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt gttcgctctc 780
gtcttcgccg gtcacgagac caccgcccac ctcataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc gcgtgccgtc 900
catgagctga tgcgctggtg cgggccggtg cacatgaccc agttgcgtta cgcctccgag 960
gacatcgacc tcgccggtac gccgatccgg aagggcgacg ccgtccaact catcctggta 1020
tcggcgaact tcgacccccg ccactacagc gaccccgatc gcctcgacct gacccgtcac 1080
cccgcaggcc acgccgagaa ccacgtgggc ttcggccacg ggatgcacta ctgcttgggc 1140
gccgcgctcg ccaggcagga aggcgaagtg gcgttcggca aactgctcgc gcactacccg 1200
gacgtagcgc tgggcgtcga accggaagcc ctggagcggg tgccgatgcc cggcagttgg 1260
cggctgaatt ccttgccgct gcggttggcg aagcgctaa 1299
10
432
PRT
Streptomyces
10
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Glu His Ile Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Ile Asn Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ser Pro
65 70 75 80
Leu Leu Gly Ser Gln Val Glu Glu Met Pro Met Val Lys Leu Leu Glu
85 90 95
Gln Met Gly Leu Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Ser Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Gly Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Ala Trp Gly Ala Asp Leu Val Ser Leu Glu Pro Asp Lys Leu Ser
195 200 205
Thr Ser Phe Pro Ala Met Ile Asp His Thr His Glu Leu Ile Arg Gln
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Ala
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Phe Ala Leu Val Phe Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ser Glu
305 310 315 320
Asp Ile Asp Leu Ala Gly Thr Pro Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Ser Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Ala Leu Ala
370 375 380
Arg Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Asp Val Ala Leu Gly Val Glu Pro Glu Ala Leu Glu Arg Val Pro Met
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ser Leu Pro Leu Arg Leu Ala Lys Arg
420 425 430
11
1293
DNA
Streptomyces chattanoogenesis
11
atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60
aacgtgatgg atccggcgct gatcggtgat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240
gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgacctgc tcagcgagct gatccggacc 720
catgacgacg acggcagcag gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cactgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgacccgcg ccactacacc gaccccgacc gtctggacct gacccggcac 1080
cccgccggtc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggacgcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaacg cgctgccgct gcgtctgggc tga 1293
12
430
PRT
Streptomyces chattanoogenesis
12
Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala
65 70 75 80
Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Thr
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Thr Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Asp Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Gly
420 425 430
13
1290
DNA
Streptomyces
13
atgaccgaat tagcggactc ccccttcagc gagcacgtcg gcaaacaccc cggcgagccg 60
aacgtgatgg aaccggccct gctcaccgat ccgttcaccg gctacggcga actgcgcgaa 120
cagggcccgg tggtccgcgg ccggttcgcg gacgacaccc ccgtgtggtt catcacccgc 180
ttcgaggagg cccgcgaggt gctgcgcgac caccggttcg ccaatgcccc cgccttcgcg 240
gcgggaggtg gaagcggtga cacaccctcc aaccggctga tggaaatcat gggcctgccc 300
gagcactacc gggtgtacct cgccaacacc atcctcacca tggacgcccc cgaccacacc 360
cggatccggc gattggtctc ccgggcattc accgcccgta agatcaccga tctgcgaccc 420
cgggtggagg acatcgcgga cgatctgctg aggcggctgc ccgagcacgc cgaggacggc 480
gtcgtcgacc tcatcaagca ctacgcctat ccgctgccca taacggtcat ctgcgaactg 540
gtgggaattc cggaggaaga ccgactgcag tggcgggatt gggggtccgc gttcgtctcc 600
ctgcaaccgg atcggctcag caaagcgttc ccggcgatga tcgaacacat tcacgcgctg 660
atccgcgaac ggcgcgcggc gctcaccgac gatctgctca gcgaactgat ccgggtccat 720
gacgacgacg gcggccgact cagcgacgtc gaaatggtca cgatggtcct gaccctcgtt 780
ctcgccggtc atgagaccac cgcccatctc atcggcaacg gcactgccgc gcttctcacc 840
caccccgacc agctgcacct gctgaaatcc gatccggagc tgctcccacg cgccgtgcac 900
gagctgatgc gctggtgcgg accggtgcag atgacgcagt tgcggtacgc caccgaggac 960
gtcgaggtgg ccggggtgca ggtcaagcag ggcgaagcgg tgctggccat gctggtcgcg 1020
gcgaaccacg acccccgcca cttcgccgac cccgcccggc tcgacctcac ccgccagccg 1080
gcgggccggg ccgagaacca cgtcggtttc ggccacggca tgcactactg cctgggcgcc 1140
agcctggccc gccaggaggg cgaggtcgcc ttcgggaacc tgctcgcgca ctacccggac 1200
gtgtcgctgg cggtggaacc ggacgccctc cagcgggtcc cgctgccggg caactggcgg 1260
ctggccgcac tgccggtccg gctgcgctga 1290
14
429
PRT
Streptomyces
14
Met Thr Glu Leu Ala Asp Ser Pro Phe Ser Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Leu Thr Asp Pro Phe
20 25 30
Thr Gly Tyr Gly Glu Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Ala Asp Asp Thr Pro Val Trp Phe Ile Thr Arg Phe Glu Glu Ala
50 55 60
Arg Glu Val Leu Arg Asp His Arg Phe Ala Asn Ala Pro Ala Phe Ala
65 70 75 80
Ala Gly Gly Gly Ser Gly Asp Thr Pro Ser Asn Arg Leu Met Glu Ile
85 90 95
Met Gly Leu Pro Glu His Tyr Arg Val Tyr Leu Ala Asn Thr Ile Leu
100 105 110
Thr Met Asp Ala Pro Asp His Thr Arg Ile Arg Arg Leu Val Ser Arg
115 120 125
Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu Asp
130 135 140
Ile Ala Asp Asp Leu Leu Arg Arg Leu Pro Glu His Ala Glu Asp Gly
145 150 155 160
Val Val Asp Leu Ile Lys His Tyr Ala Tyr Pro Leu Pro Ile Thr Val
165 170 175
Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Leu Gln Trp Arg
180 185 190
Asp Trp Gly Ser Ala Phe Val Ser Leu Gln Pro Asp Arg Leu Ser Lys
195 200 205
Ala Phe Pro Ala Met Ile Glu His Ile His Ala Leu Ile Arg Glu Arg
210 215 220
Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Val His
225 230 235 240
Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met Val
245 250 255
Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile Gly
260 265 270
Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu His Leu Leu
275 280 285
Lys Ser Asp Pro Glu Leu Leu Pro Arg Ala Val His Glu Leu Met Arg
290 295 300
Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Thr Glu Asp
305 310 315 320
Val Glu Val Ala Gly Val Gln Val Lys Gln Gly Glu Ala Val Leu Ala
325 330 335
Met Leu Val Ala Ala Asn His Asp Pro Arg His Phe Ala Asp Pro Ala
340 345 350
Arg Leu Asp Leu Thr Arg Gln Pro Ala Gly Arg Ala Glu Asn His Val
355 360 365
Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Ser Leu Ala Arg
370 375 380
Gln Glu Gly Glu Val Ala Phe Gly Asn Leu Leu Ala His Tyr Pro Asp
385 390 395 400
Val Ser Leu Ala Val Glu Pro Asp Ala Leu Gln Arg Val Pro Leu Pro
405 410 415
Gly Asn Trp Arg Leu Ala Ala Leu Pro Val Arg Leu Arg
420 425
15
1428
DNA
Streptomyces albofaciens
15
atgaccacat cgcccaccga gtcccgggcg gccaccccgc ccgactccac cgcctccccc 60
tcgaccgctg ccgccccggc caccacccct tcggccgccg cctctccgga caccacctct 120
cccgccacca ccgaccgcac cacgctcccc tcctacgtcg gcctccaccc gggcgagccg 180
aacctgatgg aaccggagct gctggacaac ccgtacaccg gctacggcac gctgcgcgag 240
caggcgccgc tcgtccgcgc ccggttcatc gacgactcgc ccatctggct ggtgacccgc 300
ttcgacgtgg tgcgcgaggt gatgcgcgac cagcggttcg tcaacaaccc gaccctggtg 360
cccggcatcg gtgcggacca ggacccgcgc gcccggctga tcgagctgtt cggcatcccc 420
gaggacctgg ccccgtacct caccgacacc atcctcacca gcgacccgcc ggaccacacc 480
cggctgcgcc gcctggtctc ccgtgccttc accgcacgcc gtatccagga cctgcggccg 540
cgcgtcgagc ggatcaccga cgagctgctg gcgcggctgc cggaccatgc cgaggacggc 600
gtcgtcgacc tcgtcgagca cttcgcctac ccgctgccca tcacggtcat ctgcgaactg 660
gtcggcatcg acgaggagga ccgggcgctg tggcggcggt tcggcgccga cctcgcctcg 720
ctgaacccca agcgcatcgg cgccaccatg ccggagatga tcgcgcacat ccacgaggtg 780
atcgacgagc ggcgtgcgga cctgcgggac gacctgctca gcgggctcat ccgggcgcag 840
gacgacgacg gcggccggct gagcgacgtc gagatggtca cgctggtgct gaccctggtg 900
ctggccggtc acgagaccac cgcccacctc atcagcaacg gcaccctcgc cctgctcacc 960
caccccgacc agcggcggct gatcgacgag gacccggcgc tgctgccgcg cgcggtccac 1020
gagctgatgc gctggtgcgg gccgatccag gccacccagc tgcggtacgc catggaggac 1080
accgaggtgg ccggtgtcca ggtccgccag ggcgaggccc tgatgttcag cctcgtcgcg 1140
gccaaccacg acccgcgcca ctacaccggc ccggagcggc tcgacctgac gcggcagccg 1200
gccggccgcg ccgaggacca cgtcggcttc gggcacggga tgcactactg cctgggtgcc 1260
tcactggccc ggcaggaggc cgaggtggcg tacggcaagc tgctcacccg ctacccggac 1320
ctggcgctcg cgctcacccc ggaacagctg gaggaccagg aacgcctgcg gcagcccggc 1380
acctggcgcc tgcgacggct gccgctgagg ttgcacgcgg agagctga 1428
16
475
PRT
Streptomyces albofaciens
16
Met Thr Thr Ser Pro Thr Glu Ser Arg Ala Ala Thr Pro Pro Asp Ser
1 5 10 15
Thr Ala Ser Pro Ser Thr Ala Ala Ala Pro Ala Thr Thr Pro Ser Ala
20 25 30
Ala Ala Ser Pro Asp Thr Thr Ser Pro Ala Thr Thr Asp Arg Thr Thr
35 40 45
Leu Pro Ser Tyr Val Gly Leu His Pro Gly Glu Pro Asn Leu Met Glu
50 55 60
Pro Glu Leu Leu Asp Asn Pro Tyr Thr Gly Tyr Gly Thr Leu Arg Glu
65 70 75 80
Gln Ala Pro Leu Val Arg Ala Arg Phe Ile Asp Asp Ser Pro Ile Trp
85 90 95
Leu Val Thr Arg Phe Asp Val Val Arg Glu Val Met Arg Asp Gln Arg
100 105 110
Phe Val Asn Asn Pro Thr Leu Val Pro Gly Ile Gly Ala Asp Gln Asp
115 120 125
Pro Arg Ala Arg Leu Ile Glu Leu Phe Gly Ile Pro Glu Asp Leu Ala
130 135 140
Pro Tyr Leu Thr Asp Thr Ile Leu Thr Ser Asp Pro Pro Asp His Thr
145 150 155 160
Arg Leu Arg Arg Leu Val Ser Arg Ala Phe Thr Ala Arg Arg Ile Gln
165 170 175
Asp Leu Arg Pro Arg Val Glu Arg Ile Thr Asp Glu Leu Leu Ala Arg
180 185 190
Leu Pro Asp His Ala Glu Asp Gly Val Val Asp Leu Val Glu His Phe
195 200 205
Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys Glu Leu Val Gly Ile Asp
210 215 220
Glu Glu Asp Arg Ala Leu Trp Arg Arg Phe Gly Ala Asp Leu Ala Ser
225 230 235 240
Leu Asn Pro Lys Arg Ile Gly Ala Thr Met Pro Glu Met Ile Ala His
245 250 255
Ile His Glu Val Ile Asp Glu Arg Arg Ala Asp Leu Arg Asp Asp Leu
260 265 270
Leu Ser Gly Leu Ile Arg Ala Gln Asp Asp Asp Gly Gly Arg Leu Ser
275 280 285
Asp Val Glu Met Val Thr Leu Val Leu Thr Leu Val Leu Ala Gly His
290 295 300
Glu Thr Thr Ala His Leu Ile Ser Asn Gly Thr Leu Ala Leu Leu Thr
305 310 315 320
His Pro Asp Gln Arg Arg Leu Ile Asp Glu Asp Pro Ala Leu Leu Pro
325 330 335
Arg Ala Val His Glu Leu Met Arg Trp Cys Gly Pro Ile Gln Ala Thr
340 345 350
Gln Leu Arg Tyr Ala Met Glu Asp Thr Glu Val Ala Gly Val Gln Val
355 360 365
Arg Gln Gly Glu Ala Leu Met Phe Ser Leu Val Ala Ala Asn His Asp
370 375 380
Pro Arg His Tyr Thr Gly Pro Glu Arg Leu Asp Leu Thr Arg Gln Pro
385 390 395 400
Ala Gly Arg Ala Glu Asp His Val Gly Phe Gly His Gly Met His Tyr
405 410 415
Cys Leu Gly Ala Ser Leu Ala Arg Gln Glu Ala Glu Val Ala Tyr Gly
420 425 430
Lys Leu Leu Thr Arg Tyr Pro Asp Leu Ala Leu Ala Leu Thr Pro Glu
435 440 445
Gln Leu Glu Asp Gln Glu Arg Leu Arg Gln Pro Gly Thr Trp Arg Leu
450 455 460
Arg Arg Leu Pro Leu Arg Leu His Ala Glu Ser
465 470 475
17
1293
DNA
Streptomyces
17
atgtcggcat tacccacctc accgttcgct gcacacgtcg ggaaacaccc gggcgagccg 60
aatgtgatgg acccggcact gatcaccgac ccgttcaccg gctacggcgc gctgcgcgag 120
cagggcccgg tcgtccgcgg ccgcttcgtg gacgactcac ccgtctggct ggtgacgcga 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaaccc ggcggcgccc 240
tccctgggcc acgcggccga ggacaacccg ctcaccaggc tgatggacat gctgggcctc 300
cccgagcacc tccgccccta cctcctcgga tcgattctca attacgacgc ccccgaccac 360
acccggctgc gccgcctggt gtcgcgggcc ttcaccgccc gcaagatcac cgacctgcgg 420
ccgcgggtcg agcagatcgc cgacgccctg ctggcccggc tgcccgagca cgccgaggac 480
ggcgtcgtcg atctcatccg gcacttcgcc tacccgctgc cgatcaccgt catctgcgaa 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgga cgtggggcgc cgacctcgtc 600
tcgatggagc cggaccggct caccgcctcg ttcccgccga tgatcgagca catccaccgg 660
atggtccggg agcggcgcgg cgcgctcacc ggcgatctgc tcagcgagct gatccgtgcc 720
catgacgacg acggcggccg gctcagcgac gtcgagatgg tcaccttgat cctcacgctc 780
gtcctcgccg gtcacgagac caccgctcac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accaactgcg cctgctccag gacgacccgg ccctgctccc ccgtgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cagatgaccc agctgcgtta cgccgccgcc 960
gacgtcgacc tggccggcac cacgatccac cggggcgacg ccgtccaact catcctggtg 1020
tcggcgaact tcgacccccg ccactacacc gaccccgacc gcctcgatct gacccgccac 1080
cccgcgggac atgcggagaa ccatgtgggt ttcggccatg gggcgcacta ctgcctgggc 1140
gccacactcg ccaagcagga gggcgaagtc gccttcggca aactgctcgc gcactacccg 1200
gagatggcgt tgggcgtcgc accggagcgc ctggagcgga cgcccctgcc gggcaactgg 1260
cggctgaacg cgctgccggt gcggttgggg tga 1293
18
430
PRT
Streptomyces
18
Met Ser Ala Leu Pro Thr Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Thr Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Ser Leu Gly His Ala Ala Glu Asp Asn Pro Leu Thr Arg Leu Met Asp
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Pro Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Arg His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Met Glu Pro Asp Arg Leu Thr
195 200 205
Ala Ser Phe Pro Pro Met Ile Glu His Ile His Arg Met Val Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Gly Asp Leu Leu Ser Glu Leu Ile Arg Ala
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Leu
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Gln Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ala Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Thr Ile His Arg Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Glu Met Ala Leu Gly Val Ala Pro Glu Arg Leu Glu Arg Thr Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ala Leu Pro Val Arg Leu Gly
420 425 430
19
1293
DNA
Streptomyces kasugaensis
19
atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60
aacgtgatgg atccggcgct gatcggggat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240
gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgatctgc tcagcgagct gatccggacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgacccgcg ccactacacc gaccccgacc ggctggacct gacccggcac 1080
cctgccggcc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggaggcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaacg cgctgccgct gcgtctgcgc tga 1293
20
430
PRT
Streptomyces kasugaensis
20
Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala
65 70 75 80
Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Thr
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Thr Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Glu Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Arg
420 425 430
21
1428
DNA
Streptomyces
21
atgaccacat cgcccaccga gtcccgggcg gccaccccga ccggctccac cgcctccccc 60
tcgaccgctt ccgccccggc caccacccct tcggccgcca cctcttcgga caccacctat 120
cccgccacca ccgaccgcac cacgctcccc tcctacgtcg gcctccaccc gggcgagccg 180
aacctgatgg aaccggagct gctggacaac ccgtacaccg gctacggcac gctgcgcgag 240
caggccccgc tcgtccgtgc ccggttcatc gacgactcgc ccatctggct ggtgacccgc 300
ttcgacgtgg tgcgcgaggt gatgcgcgac cagcggttcg tcaacaaccc gaccctggtg 360
cccggcatcg gtgcggacaa ggacccgcgc gcccggctga tcgagctgtt cggcatcccc 420
gaggacctga ccccgtacct cgccgacacc atcctcacca gcgacccgcc ggaccacacc 480
cggctgcgcc gcctggtctc ccgtgccttc accgcgcgcc gcatccagga cctgcggccg 540
cgcgtcgagc agatcaccga cgcgctgctg gagcgactgc cggaccatgc cgaggacggc 600
gtcgtcgacc tcgtcgagca cttcgcctac ccgctgccca tcacggtcat ctgcgagctg 660
gtcggcatcg acgaggagga ccggacgctg tggcggcggt tcggcgccga cctcgcctca 720
ctgaacccca agcgcatcgg cgccaccatg ccggagatga tcgcgcacat ccacgaggtg 780
atcgacgagc ggcgcgcggc cctgcgggac gacctgctca gcgggctcat ccgggcgcag 840
gacgacgacg gcggccggct gagcgacgtc gagatggtca ccctggtcct gaccctggtg 900
ctggccggtc acgagaccac cgcccacctc atcagcaacg gcaccctcgc cctgctcacc 960
caccccgacc agcggcggct gatcgacgag gacccggcac tgctgccgcg cgcggtccac 1020
gagctgatgc gctggtgcgg gccgatccag gccacccagc tgcggtacgc catggaggac 1080
accgaggtcg ccggtgtcca ggtccgccag ggcgaggccc tgatgttcag cctcgtcgcg 1140
gccaaccacg acccgcgcca ctacaccggg ccggagcggc tcgacctgac gcggcagccg 1200
gccggccgcg ccgaggacca cgtcggcttc gggcacggga tgcactactg cctgggtgcc 1260
tcactcgccc ggcaggaggc cgaggtggcc tacgggaagc tgctcacccg ctacccggac 1320
ctggagctcg ctctcacacc ggaacagctg gaggaccagg aacgcctgcg gcagcccggc 1380
acctggcgcc tgcggcggct gccgctgaag ctgcacgcgc ggagctga 1428
22
475
PRT
Streptomyces
22
Met Thr Thr Ser Pro Thr Glu Ser Arg Ala Ala Thr Pro Thr Gly Ser
1 5 10 15
Thr Ala Ser Pro Ser Thr Ala Ser Ala Pro Ala Thr Thr Pro Ser Ala
20 25 30
Ala Thr Ser Ser Asp Thr Thr Tyr Pro Ala Thr Thr Asp Arg Thr Thr
35 40 45
Leu Pro Ser Tyr Val Gly Leu His Pro Gly Glu Pro Asn Leu Met Glu
50 55 60
Pro Glu Leu Leu Asp Asn Pro Tyr Thr Gly Tyr Gly Thr Leu Arg Glu
65 70 75 80
Gln Ala Pro Leu Val Arg Ala Arg Phe Ile Asp Asp Ser Pro Ile Trp
85 90 95
Leu Val Thr Arg Phe Asp Val Val Arg Glu Val Met Arg Asp Gln Arg
100 105 110
Phe Val Asn Asn Pro Thr Leu Val Pro Gly Ile Gly Ala Asp Lys Asp
115 120 125
Pro Arg Ala Arg Leu Ile Glu Leu Phe Gly Ile Pro Glu Asp Leu Thr
130 135 140
Pro Tyr Leu Ala Asp Thr Ile Leu Thr Ser Asp Pro Pro Asp His Thr
145 150 155 160
Arg Leu Arg Arg Leu Val Ser Arg Ala Phe Thr Ala Arg Arg Ile Gln
165 170 175
Asp Leu Arg Pro Arg Val Glu Gln Ile Thr Asp Ala Leu Leu Glu Arg
180 185 190
Leu Pro Asp His Ala Glu Asp Gly Val Val Asp Leu Val Glu His Phe
195 200 205
Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys Glu Leu Val Gly Ile Asp
210 215 220
Glu Glu Asp Arg Thr Leu Trp Arg Arg Phe Gly Ala Asp Leu Ala Ser
225 230 235 240
Leu Asn Pro Lys Arg Ile Gly Ala Thr Met Pro Glu Met Ile Ala His
245 250 255
Ile His Glu Val Ile Asp Glu Arg Arg Ala Ala Leu Arg Asp Asp Leu
260 265 270
Leu Ser Gly Leu Ile Arg Ala Gln Asp Asp Asp Gly Gly Arg Leu Ser
275 280 285
Asp Val Glu Met Val Thr Leu Val Leu Thr Leu Val Leu Ala Gly His
290 295 300
Glu Thr Thr Ala His Leu Ile Ser Asn Gly Thr Leu Ala Leu Leu Thr
305 310 315 320
His Pro Asp Gln Arg Arg Leu Ile Asp Glu Asp Pro Ala Leu Leu Pro
325 330 335
Arg Ala Val His Glu Leu Met Arg Trp Cys Gly Pro Ile Gln Ala Thr
340 345 350
Gln Leu Arg Tyr Ala Met Glu Asp Thr Glu Val Ala Gly Val Gln Val
355 360 365
Arg Gln Gly Glu Ala Leu Met Phe Ser Leu Val Ala Ala Asn His Asp
370 375 380
Pro Arg His Tyr Thr Gly Pro Glu Arg Leu Asp Leu Thr Arg Gln Pro
385 390 395 400
Ala Gly Arg Ala Glu Asp His Val Gly Phe Gly His Gly Met His Tyr
405 410 415
Cys Leu Gly Ala Ser Leu Ala Arg Gln Glu Ala Glu Val Ala Tyr Gly
420 425 430
Lys Leu Leu Thr Arg Tyr Pro Asp Leu Glu Leu Ala Leu Thr Pro Glu
435 440 445
Gln Leu Glu Asp Gln Glu Arg Leu Arg Gln Pro Gly Thr Trp Arg Leu
450 455 460
Arg Arg Leu Pro Leu Lys Leu His Ala Arg Ser
465 470 475
23
1293
DNA
Streptomyces tubercidicus
23
atgtcggcat tatccaactc cccgctcgcc gcacatgtcg ggaaacaccc tggcgagccg 60
aatgtgatgg acccggcgct gatcaccgac ccgttcggcg gctacggcgc actgcgcgag 120
caaggcccgg tcgtacgggg ccggttcatg gacgactcgc ccgtctggct ggtgacgcgc 180
ttcgaagagg tccgccaagt cctgcgcgat cagcggttcg tgaacaaccc ggccgcaccg 240
tccctgggac gctcgatcga cgaaagcccc gcggtcagac ttttggaaat gttggggttg 300
cccgaccatt tccggccgta tctgctcggg tcgatcctca actacgacgc acccgaccac 360
acccggctcc gccgactggt ctcgcgcgcc ttcacggcac gcaagatcac cgacctgcgg 420
ccgcgggtcg agcagatcac cgacgacctg ctgacccggc ttcccgagca cgccgaggac 480
ggtgtggtcg acctcatcca gcacttcgcc taccccctgc cgatcaccgt gatctgcgaa 540
ctggtcggca tcgccgaagc ggaccgcccg caatggcgga agtggggagc cgatctcgtc 600
tcgctggagc cggggcggct gagcaccgcg ttcccggcga tggtcgagca catccatgag 660
ctgatccgcg agcggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgcacc 720
catgacgacg acggcggccg gctcagcgac atcgagatgg tcaccatgat cctcacgatc 780
gtcctggccg gccacgagac caccgcccac ctcataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctactcaag gacgatccgg cgctgctgcc gcgcgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cacatgaccc agctgcggtt cgcgtccgag 960
gacgtcgagg tcgccgggac accgatccac aagggcgacg ccgtacaact catcctggta 1020
tcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgacct gacccgccac 1080
cccgccggcc acgccgagaa ccatgtgggc ttcggccacg gaatgcacta ctgcctgggt 1140
gccaccctcg ccaaacagga aggcgaagtc gccttctccc gcctcttcac gcactacccg 1200
gaactgtccc tgggcgtcgc ggcggaccag ctggcgcgga cacaggtacc cggcagctgg 1260
cggctggaca ccctgccgct gcgactgggg tga 1293
24
430
PRT
Streptomyces tubercidicus
24
Met Ser Ala Leu Ser Asn Ser Pro Leu Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Ser Leu Gly Arg Ser Ile Asp Glu Ser Pro Ala Val Arg Leu Leu Glu
85 90 95
Met Leu Gly Leu Pro Asp His Phe Arg Pro Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Thr Asp Asp Leu Leu Thr Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Ala Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Val Ser Leu Glu Pro Gly Arg Leu Ser
195 200 205
Thr Ala Phe Pro Ala Met Val Glu His Ile His Glu Leu Ile Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Ile Glu Met Val Thr Met
245 250 255
Ile Leu Thr Ile Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Phe Ala Ser Glu
305 310 315 320
Asp Val Glu Val Ala Gly Thr Pro Ile His Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Ser Arg Leu Phe Thr His Tyr Pro
385 390 395 400
Glu Leu Ser Leu Gly Val Ala Ala Asp Gln Leu Ala Arg Thr Gln Val
405 410 415
Pro Gly Ser Trp Arg Leu Asp Thr Leu Pro Leu Arg Leu Gly
420 425 430
25
1293
DNA
Streptomyces
25
atgtcggcat tatccagctc accgttcgcc gcgcatgtcg ggaaacaccc gggcgagccg 60
aatgtgatgg acccggcgct gatcgccgat ccgttcggtg gttatggcgc actgcgtgag 120
caagggccgg tcgtacgggg ccggttcatg gacgactcac ccgtctggct cgtgacgcgc 180
ttcgaggaag tccgccaagt cctgcgcgac cagcggttcc tgaacgatcc gacggccccc 240
tccctggggc gctcattcga cgacagcccc acggccaggc tgctggagat gatgggactg 300
cccgagcatt tccggccgta tctgctcggt tcgattctga acaacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgcgcc ttcacggcac gcaagatcac cgacctgcgg 420
ccgcgggtcg agcagatcgc cgacgagctg ctgacccggc ttcccgagta cgccgaggac 480
ggcgtggtcg acctcatcaa gcacttcgcc taccccctgc cgatcgccgt catctgcgaa 540
ctggtcggca tagccgaagc ggatcgtccg cagtggcgga agtggggtgc cgacctcgtc 600
tcgctgcagc cggaccggct cagcacctcg ttcccggcga tgatcgagca catccatgag 660
ctgatccgcg agcggcgcgg ggcgctcacg gacgatctgc tcagcgagct gatccgtgcc 720
catgacgacg acggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacggtg 780
gtgctcgccg gccacgagac caccgcgcac ctcataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg gctgctcagg gacgacccgg ctctgtttcc ccgtgccgtc 900
cacgagctgt tgcgctggtg cgggccggtc cacatgaccc agatgcggtt tgcgtccgag 960
gatgtcgaca tcgccgggac gaagatccgt aagggcgacg ccgtacaact gatcctggta 1020
tcggccaact tcgacccccg ccactacacc gaccccgaac gtctcgacct gacccgtcac 1080
cccgccggcc acgccgagaa ccatgtgggc ttcggccacg ggatgcacta ctgcctgggc 1140
gccaccctcg ccaaacagga gggcgaagtc gcgttcgaga agctcttcgc gcactacccg 1200
gaggtgtcgc tgggcgtcgc accggaacaa ctggaaagga caccactgcc cggcagctgg 1260
cggctcgatt ccctgccgct gcggttgcgg taa 1293
26
430
PRT
Streptomyces
26
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Ala Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Leu Asn Asp Pro Thr Ala Pro
65 70 75 80
Ser Leu Gly Arg Ser Phe Asp Asp Ser Pro Thr Ala Arg Leu Leu Glu
85 90 95
Met Met Gly Leu Pro Glu His Phe Arg Pro Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Thr Arg Leu Pro Glu Tyr Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Lys His Phe Ala Tyr Pro Leu Pro Ile Ala
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Ala Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Leu Ser
195 200 205
Thr Ser Phe Pro Ala Met Ile Glu His Ile His Glu Leu Ile Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Ala
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Arg Asp Asp Pro Ala Leu Phe Pro Arg Ala Val His Glu Leu Leu
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Met Arg Phe Ala Ser Glu
305 310 315 320
Asp Val Asp Ile Ala Gly Thr Lys Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Glu Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Glu Lys Leu Phe Ala His Tyr Pro
385 390 395 400
Glu Val Ser Leu Gly Val Ala Pro Glu Gln Leu Glu Arg Thr Pro Leu
405 410 415
Pro Gly Ser Trp Arg Leu Asp Ser Leu Pro Leu Arg Leu Arg
420 425 430
27
1293
DNA
Streptomyces
27
atgtcggcat tatccagctc tccgttcgct gcgcatgtcg ggaaacaccc gggtgagccg 60
aatgtgatgg agccggcgct gctcaccgac ccgttcgcgg gctacggcgc gctgcgtgag 120
caggccccgg tcgtacgggg ccggttcgtg gacgactcac cggtctggtt cgtgacgcgc 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggccgcgccg 240
cccctggccc catcggccga ggagaacccg ctgaccaggc tgatggacat gctgggcctc 300
cccgagcacc tccgcgtcta catgctcggg tcgattctca actacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgcgcg ttcacggcgc ggaagatcac cgatctgcga 420
ccgcgtgtcg agcagatcgc cgacgagctg ctggcccgcc tccccgagta cgccgaggac 480
ggcgtcgtcg acctcatcca gcatttcgcc tacccgctgc cgatcaccgt catctgcgag 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgga agtggggcgc cgacctcatc 600
tcgatggacc cggaccggct cggcgcaacg ttcccggcga tgatcgagca catccatgag 660
atggtccggg agcggcgcgc ggcgctcacc gatgatctgc tcagcgagct gatccgtacc 720
catgacgacg atggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780
gtcctcgccg gtcacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc ccgggccgtc 900
catgagctga tgcgctggtg cgggccggtg cagatgacgc agctgcgcta cgcggccgcc 960
gacgtcgacc tcgccggtac gcggatccac aagggcgacg ccgtacaact cctcctggtt 1020
gcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgatct gacgcgtcac 1080
cccgccggcc acgccgagaa ccatgtgggt ttcggccacg gtgcgcatta ctgcctgggt 1140
gccaccctcg ccaagcagga gggcgaagtc gcgttcggca agctgctcgc gcactacccg 1200
gagatgtccc tgggcatcga accggaacgt ctggagcgat tgccgctgcc tggcaactgg 1260
cggctgaatt ccctgccgtt gcggctgggg tga 1293
28
430
PRT
Streptomyces
28
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Leu Thr Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Ala Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Pro Leu Ala Pro Ser Ala Glu Glu Asn Pro Leu Thr Arg Leu Met Asp
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Met Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu Tyr Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly
195 200 205
Ala Thr Phe Pro Ala Met Ile Glu His Ile His Glu Met Val Arg Glu
210 215 220
Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ala Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Arg Ile His Lys Gly Asp Ala Val Gln
325 330 335
Leu Leu Leu Val Ala Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Glu Met Ser Leu Gly Ile Glu Pro Glu Arg Leu Glu Arg Leu Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Leu Arg Leu Gly
420 425 430
29
1293
DNA
Streptomyces lydicus
29
atgtcggcat tacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgaaccg 60
aacgtgatgg atccggcgct gatcggtgat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcgtg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgggac cagcggttcc ggaacaatcc ggtctcctcg 240
gcgccggacg cggaccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtcg cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgacctgc tcagcgagct gatccgaacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc aagggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgatccgcg ccactacacc gaacccgacc gtctggacct gacccggcac 1080
cccgccggcc acgccgagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggacgcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaatg cgctgccgct gcgtctgcgc tga 1293
30
430
PRT
Streptomyces lydicus
30
Met Ser Ala Leu Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Gln Arg Phe Arg Asn Asn Pro Val Ser Ser
65 70 75 80
Ala Pro Asp Ala Asp Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Ala
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Glu Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Asp Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Arg
420 425 430
31
1293
DNA
Streptomyces lydicus
31
atgtcggcat cgaccagctc tcccctcagc gcccacgtcg gcaagcaccc gggcgaaccc 60
catgtgatgg atccggcgct gatcagcgat ccgttcggcg gctacggtgc cctgcgcgag 120
cagggaccgg tcgtccgcgg acggttcttc gacgactcgc ccttgtggtt agtgacccgc 180
ttcgaggaag tccgccaggt cctgcgcgac cagcggttcg tgaacaaccc cgccgacccg 240
gcgctcggcg tcgcgccgga ggactccccg cagctgcgcg cgctggcgat gctgggcatc 300
cccgagcacc tgcacggcta tctgctcaac tcgatcctca actacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgcgcc ttcaccgccc gcaagatcac cgatcttcgg 420
ccgcgggtgg cgcagataac cgccgagctg ctggaccgac tcccggagca cgccgaggac 480
ggcgtggtcg acctgatcga gcacttcgcc tacccgctgc cgatcacggt gatctgcgaa 540
cttgtcggca tcgccgcgga ggaccggccc cagtggcgtt cctggggcgc cgacctggtc 600
tcggtggacc ccgaccggct cggccggacc ttcccggcga tgatcgacca catccacgcg 660
ctgatcggcc agcggcgggc cgcgctcacc gacgacctgc tcagcgagct gatccggacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccctggt cctcaccctc 780
gtgctggccg gccacgagac caccgcacac ctcatcggca acggcaccgc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cacgtcaccc agctgcggta cgccgccgag 960
gacgtcgacc tcgccggcac ccggatccgc aggggcgacg ccgtgcaggc cgtcctggtc 1020
tcggcgaacc acgacccgcg ccactacacc gaccccgaac gcctggacct gacccggcag 1080
cccgcgggcc gcgccgagaa ccacgtgggc ttcgggcacg gggcgcacta ctgcctgggc 1140
gccagcctcg ccaggcagga gggtgaggtc gccctgggcg ccctgttcga ccgctacccc 1200
gacctggcgc tggcggtggc gcccgaggag ctggagcgca ccccggtgcc cggtacctgg 1260
cggctgacgt cgctgccggt gcgcctgggc tga 1293
32
430
PRT
Streptomyces lydicus
32
Met Ser Ala Ser Thr Ser Ser Pro Leu Ser Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro His Val Met Asp Pro Ala Leu Ile Ser Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Phe Asp Asp Ser Pro Leu Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Asp Pro
65 70 75 80
Ala Leu Gly Val Ala Pro Glu Asp Ser Pro Gln Leu Arg Ala Leu Ala
85 90 95
Met Leu Gly Ile Pro Glu His Leu His Gly Tyr Leu Leu Asn Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Ala
130 135 140
Gln Ile Thr Ala Glu Leu Leu Asp Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Glu His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Ala Ala Glu Asp Arg Pro Gln Trp
180 185 190
Arg Ser Trp Gly Ala Asp Leu Val Ser Val Asp Pro Asp Arg Leu Gly
195 200 205
Arg Thr Phe Pro Ala Met Ile Asp His Ile His Ala Leu Ile Gly Gln
210 215 220
Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Leu
245 250 255
Val Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Val Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Arg Ile Arg Arg Gly Asp Ala Val Gln
325 330 335
Ala Val Leu Val Ser Ala Asn His Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Glu Arg Leu Asp Leu Thr Arg Gln Pro Ala Gly Arg Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Ser Leu Ala
370 375 380
Arg Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Phe Asp Arg Tyr Pro
385 390 395 400
Asp Leu Ala Leu Ala Val Ala Pro Glu Glu Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Thr Trp Arg Leu Thr Ser Leu Pro Val Arg Leu Gly
420 425 430
33
1281
DNA
Streptomyces tubercidicus
33
atgaactctc cgttcgccgc gcacgtcggg aaacacccgg gcgagccgaa tgtgatggac 60
cccgccctga tcaccgaccc gttcaccggc tacggcgcgc tgcgtgagca gggcccggtc 120
gtacggggcc ggttcatgga cgactcgccc gtctggctgg tgacgcggtt cgaggaggtc 180
cgccaggtcc tgcgcgacca gcggttcgtg aacaatccgg cctcgccgtc cctgaactac 240
gcgcccgagg acaacccgct gacccggctg atggagatgc tgggcctccc cgagcacctc 300
cgcgtctacc tgctcggatc gatcctcaac tacgacgccc ccgaccacac ccggctgcgc 360
cgtctggtgt cgcgggcgtt cacggcccgc aagatcaccg acctgcggcc ccgggtcgag 420
cagatcgccg acgcgctgct ggcccggctg cccgagcacg ccgaggacgg cgtcgtcgac 480
ctcatccagc acttcgccta ccccctgccg atcaccgtca tctgcgaact ggtcggcata 540
cccgaagcgg accgcccgca gtggcgaacg tggggcgccg acctcatctc gatggatccg 600
gaccggctcg gcgcctcgtt cccggcgatg atcgagcaca tccatcagat ggtccgggaa 660
cggcgcgagg cgctcaccga cgacctgctc agcgaactga tccgcaccca tgacgacgac 720
ggcgggcggc tcagcgacgt cgagatggtc accatgatcc tcacgctcgt cctcgccggc 780
cacgagacca ccgcccacct catcagcaac ggcacggcgg cgctgctcac ccaccccgac 840
cagctgcgtc tggtcaagga cgatccggcc ctcctccccc gtgccgtcca cgagctgatg 900
cgctggtgcg ggccggtgca catgacccag ctgcgctacg ccaccgccga cgtcgacctc 960
gccggcacac cgatccgcca gggcgatgcc gttcaactca tcctggtatc ggccaacttc 1020
gacccccgtc actacaccga ccccgaccgc ctcgatctca cccggcaccc cgcgggccac 1080
gccgagaacc atgtgggttt cggccatgga gcgcactact gcctgggcgc cacactcgcc 1140
aaacaggaag gtgaagtcgc cttcggcaaa ctgctcacgc actacccgga catatcgctg 1200
ggcatcgccc cggaacacct ggagcggaca ccgctgccgg gcaactggcg gctgaactcg 1260
ctgccggtgc ggttggggtg a 1281
34
426
PRT
Streptomyces tubercidicus
34
Met Asn Ser Pro Phe Ala Ala His Val Gly Lys His Pro Gly Glu Pro
1 5 10 15
Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe Thr Gly Tyr Gly
20 25 30
Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg Phe Met Asp Asp
35 40 45
Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val Arg Gln Val Leu
50 55 60
Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ser Pro Ser Leu Asn Tyr
65 70 75 80
Ala Pro Glu Asp Asn Pro Leu Thr Arg Leu Met Glu Met Leu Gly Leu
85 90 95
Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile Leu Asn Tyr Asp
100 105 110
Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser Arg Ala Phe Thr
115 120 125
Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu Gln Ile Ala Asp
130 135 140
Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp Gly Val Val Asp
145 150 155 160
Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys Glu
165 170 175
Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp Arg Thr Trp Gly
180 185 190
Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly Ala Ser Phe Pro
195 200 205
Ala Met Ile Glu His Ile His Gln Met Val Arg Glu Arg Arg Glu Ala
210 215 220
Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr His Asp Asp Asp
225 230 235 240
Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met Ile Leu Thr Leu
245 250 255
Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile Ser Asn Gly Thr
260 265 270
Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu Val Lys Asp Asp
275 280 285
Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met Arg Trp Cys Gly
290 295 300
Pro Val His Met Thr Gln Leu Arg Tyr Ala Thr Ala Asp Val Asp Leu
305 310 315 320
Ala Gly Thr Pro Ile Arg Gln Gly Asp Ala Val Gln Leu Ile Leu Val
325 330 335
Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro Asp Arg Leu Asp
340 345 350
Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His Val Gly Phe Gly
355 360 365
His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala Lys Gln Glu Gly
370 375 380
Glu Val Ala Phe Gly Lys Leu Leu Thr His Tyr Pro Asp Ile Ser Leu
385 390 395 400
Gly Ile Ala Pro Glu His Leu Glu Arg Thr Pro Leu Pro Gly Asn Trp
405 410 415
Arg Leu Asn Ser Leu Pro Val Arg Leu Gly
420 425
35
195
DNA
Streptomyces tubercidicus
35
atgcggatca cgatcgacac cgacatctgt atcggcgccg gccagtgcgc cctgaccgcg 60
ccgggagtgt tcacccagga cgacgacggc ttcagcgccc tgctgcccgg ccgcgaggac 120
ggtgcgggcg acccgctggt gcgggaggcc gcccgcgcct gcccggtgca ggccatcacg 180
gtcacggacg actga 195
36
64
PRT
Streptomyces tubercidicus
36
Met Arg Ile Thr Ile Asp Thr Asp Ile Cys Ile Gly Ala Gly Gln Cys
1 5 10 15
Ala Leu Thr Ala Pro Gly Val Phe Thr Gln Asp Asp Asp Gly Phe Ser
20 25 30
Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly Asp Pro Leu Val Arg
35 40 45
Glu Ala Ala Arg Ala Cys Pro Val Gln Ala Ile Thr Val Thr Asp Asp
50 55 60
37
195
DNA
Streptomyces tubercidicus
37
atgcggatca ccatcgacac cgacatctgc atcggcgccg gccagtgcgc cctgaccgcg 60
ccgggagtct tcacccagga cgacgacggt ttcagcgccc tgctgcccgg ccgcgaggac 120
ggcgcgggcg acccgctggt gcgcgaggcc gcccgcgcct gccccgtgca ggccatttcg 180
gtcacggacg actga 195
38
64
PRT
Streptomyces tubercidicus
38
Met Arg Ile Thr Ile Asp Thr Asp Ile Cys Ile Gly Ala Gly Gln Cys
1 5 10 15
Ala Leu Thr Ala Pro Gly Val Phe Thr Gln Asp Asp Asp Gly Phe Ser
20 25 30
Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly Asp Pro Leu Val Arg
35 40 45
Glu Ala Ala Arg Ala Cys Pro Val Gln Ala Ile Ser Val Thr Asp Asp
50 55 60
39
9
PRT
Artificial Sequence
Synthetic peptide.
39
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1 5
40
10
PRT
Artificial Sequence
Synthetic peptide.
40
Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
1 5 10
41
11
PRT
Artificial Sequence
Synthetic peptide.
41
Gln Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp
1 5 10
42
11
PRT
Artificial Sequence
Synthetic peptide.
42
Tyr Thr Asp Ile Glu Met Asn Arg Leu Gly Lys
1 5 10
43
7
PRT
Streptomyces tubercidicus
43
Ile Ala Gly His Glu Thr Thr
1 5
44
20
DNA
Streptomyces tubercidicus
misc_feature
(6)..(18)
Nucleotides 6, 9 and 18 are “s” wherein “s” = g
or c.
44
atcgcsggsc acgagacsac 20
45
7
PRT
Streptomyces tubercidicus
45
Val Ala Gly His Glu Thr Thr
1 5
46
20
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 9, and 18 are “s” wherein “s”
= g or c.
46
gtsgcsggsc acgagacsac 20
47
7
PRT
Streptomyces tubercidicus
47
Leu Ala Gly His Glu Thr Thr
1 5
48
20
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 9, and 18 are “s” wherein “s”
= g or c.
48
ctsgcsggsc acgagacsac 20
49
9
PRT
Streptomyces tubercidicus
49
Leu Leu Leu Ile Ala Gly His Glu Thr
1 5
50
25
DNA
Streptomyces tubercidicus
misc_feature
(2)..(17)
Nucleotides 2, 5, 8, 14, and 17 are “s” wherein
“s” = g or c.
50
tsctsctsat cgcsggscac gagac 25
51
9
PRT
Streptomyces tubercidicus
51
His Gln Cys Leu Gly Gln Asn Leu Ala
1 5
52
26
DNA
Streptomyces tubercidicus
misc_feature
(12)..(24)
Nucleotides 12, 15, and 24 are “s” wherein “s”
= g or c.
52
gtggtcacgg asccstgctt ggascg 26
53
8
PRT
Streptomyces tubercidicus
53
Phe Gly His Gly Val His Gln Cys
1 5
54
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(15)
Nucleotides 6, 12, and 15 are “s” wherein “s” =
g or c.
54
aagccsgtgc cscasgtggt cacg 24
55
8
PRT
Streptomyces tubercidicus
55
Phe Gly Phe Gly Val His Gln Cys
1 5
56
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(15)
Nucleotides 6, 12, and 15 are “s” wherein “s” =
g or c.
56
aaggcsaagc cscasgtggt cacg 24
57
8
PRT
Streptomyces tubercidicus
57
Phe Gly His Gly Ile His Gln Cys
1 5
58
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(12)
Nucleotides 6 and 12 are “s” wherein “s” = g or
c.
58
aagccsgtgc cstaggtggt cacg 24
59
8
PRT
Streptomyces tubercidicus
59
Phe Gly His Gly Val His Phe Cys
1 5
60
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(15)
Nucleotides 6, 12, and 15 are “s” wherein “s” =
g or c.
60
aagccsgtgc cscasgtgaa gacg 24
61
24
PRT
Streptomyces tubercidicus
61
His Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro
1 5 10 15
Phe Thr Gly Tyr Gly Ala Leu Arg
20
62
21
PRT
Streptomyces tubercidicus
62
Phe Val Asn Asn Pro Ala Ser Pro Ser Leu Asn Tyr Ala Pro Glu Asp
1 5 10 15
Asn Pro Leu Thr Arg
20
63
19
PRT
Streptomyces tubercidicus
63
Leu Leu Thr His Tyr Pro Asp Ile Ser Leu Gly Ile Ala Pro Glu His
1 5 10 15
Leu Glu Arg
64
17
PRT
Streptomyces tubercidicus
64
Val Tyr Leu Leu Gly Ser Ile Leu Asn Tyr Asp Ala Pro Asp His Thr
1 5 10 15
Arg
65
13
PRT
Streptomyces tubercidicus
65
Thr Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg
1 5 10
66
13
PRT
Streptomyces tubercidicus
66
Glu Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg
1 5 10
67
12
PRT
Streptomyces tubercidicus
67
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg
1 5 10
68
12
PRT
Streptomyces tubercidicus
68
Leu Met Glu Met Leu Gly Leu Pro Glu His Leu Arg
1 5 10
69
11
PRT
Streptomyces tubercidicus
69
Val Glu Gln Ile Ala Asp Ala Leu Leu Ala Arg
1 5 10
70
11
PRT
Streptomyces tubercidicus
70
Leu Val Lys Asp Asp Pro Ala Leu Leu Pro Arg
1 5 10
71
8
PRT
Streptomyces tubercidicus
71
Asp Asp Pro Ala Leu Leu Pro Arg
1 5
72
8
PRT
Streptomyces tubercidicus
72
Thr Pro Leu Pro Gly Asn Trp Arg
1 5
73
7
PRT
Streptomyces tubercidicus
73
Leu Asn Ser Leu Pro Val Arg
1 5
74
7
PRT
Streptomyces tubercidicus
74
Ile Thr Asp Leu Arg Pro Arg
1 5
75
7
PRT
Streptomyces tubercidicus
75
Glu Gln Gly Pro Val Val Arg
1 5
76
7
PRT
Streptomyces tubercidicus
76
Ala Val His Glu Leu Met Arg
1 5
77
5
PRT
Streptomyces tubercidicus
77
Ala Phe Thr Ala Arg
1 5
78
5
PRT
Streptomyces tubercidicus
78
Phe Glu Glu Val Arg
1 5
79
7
PRT
Streptomyces tubercidicus
79
Pro Gly Glu Asp Asn Val Met
1 5
80
21
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 12, and 18 are “s” wherein
“s” = c or g.
80
ccsggsgarc csaaygtsat g 21
81
7
PRT
Streptomyces tubercidicus
81
Ala Leu Ile Thr Asp Pro Phe
1 5
82
21
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 12, and 18 are “s”wherein
“s” = c or g.
82
gcsctsatya csgacccstt c 21
83
8
PRT
Streptomyces tubercidicus
83
Phe Met Asp Asp Ser Pro Val Trp
1 5
84
24
DNA
Streptomyces tubercidicus
misc_feature
(13)..(13)
Nucleotide 13 is “w” wherein “w” = a or t.
84
ttcatggacg acwssccsgt stgg 24
85
8
PRT
Streptomyces tubercidicus
85
Leu Asn Tyr Asp Ala Pro Asp His
1 5
86
24
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 15 and 18 are “s” wherein “s” =
c or g.
86
ctsaaytayg acgcsccsga ccac 24
87
8
PRT
Streptomyces tubercidicus
87
Val Glu Gln Ile Ala Asp Ala Leu
1 5
88
24
DNA
Streptomyces tubercidicus
misc_feature
(3)..(24)
Nucleotides 3, 15, 21, and 24 are “s” wherein
“s” = c or g.
88
gtsgarcaga tygcsgacgc scts 24
89
8
PRT
Streptomyces tubercidicus
89
Asp Leu Ile Ser Met Asp Pro Asp
1 5
90
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(21)
Nucleotides 6, 11, 12, and 21 are “s” wherein
“s” = c or g.
90
ctggastarw sstacctggg sctg 24
91
36
DNA
Streptomyces tubercidicus
91
agattaatta atgtcggaat taatgaactg tccgtt 36
92
32
DNA
Streptomyces tubercidicus
92
aaactcaccc caaccgcacc ggcagcgagt tc 32
93
7
PRT
Streptomyces tubercidicus
93
Met Ser Glu Leu Met Asn Ser
1 5
94
1293
DNA
Streptomyces
94
atgtcggcaa tatccagctc cccgttcgcc gcacacgtcg gaaagcatcc cggcgagccg 60
aatgtgatgg acccggcgct gatcaccgac ccgttcggcg gctacggcgc actgcgtgag 120
caaggccccg tcctaccggg ccggttcatg gacgactcac ccgtctggct cgtgacgcgc 180
ttcgaagagg tccgccaagt cctgcgcgat cagcggttcc tgaacaaccc ggccgcgtcg 240
tcaccggggc attcgatcga cgagagcccc acggccaggc tgctggacat gatggggatg 300
cccgaacatt tccggccgta tctgatgggg tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcacgcgcg ttcacggcac gcaagatcac cgatctgcgg 420
ccgcgggtcg agcagctcgc cgacgagctg ctggcccggc ttcccgagca cgccgaggac 480
ggtgtggtcg acctgatcaa gcacttcgcc tatcccctgc cgatcaccgt gatctgcgaa 540
ctggtcggca tcccggaagc ggaccgcccg caatggcgga agtggggcgc cgacctcgtt 600
tcgctgcagc cggagcggct cagcacctcg ttcccggcga tgatcgagca catccatgaa 660
ctgatccgcg agcggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgtacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcccac ctgataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctggtcaag gacgacccgg agctgcttcc gcgtgccgtc 900
cacgagctgc tgcgctggtg cgggccggtc cagatgaccc agctgcggta cgcctccgag 960
gatgtcgaga tcgccgggac gccgatccgt aagggcgacg ccgtacaact catcctggta 1020
tcggcgaact tcgacccccg ccactacacc gcccccgaac gcctcgacct gacccgccac 1080
cccgccggcc acgccgagaa ccatgtgggc ttcggccacg gaatgcacta ctgcctgggc 1140
gccaccctcg ccaaacagga gggcgaagtc gcgttcggca agctcttcac gcactacccg 1200
gagctgtcgc tggccgtcgc accggacgag ttggagcgaa cgccggtgcc cggcagctgg 1260
cggttggatt cgctgccggt gcggttgggg tga 1293
95
430
PRT
Streptomyces
95
Met Ser Ala Ile Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Leu Pro Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Leu Asn Asn Pro Ala Ala Ser
65 70 75 80
Ser Pro Gly His Ser Ile Asp Glu Ser Pro Thr Ala Arg Leu Leu Asp
85 90 95
Met Met Gly Met Pro Glu His Phe Arg Pro Tyr Leu Met Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Leu Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Lys His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Glu Arg Leu Ser
195 200 205
Thr Ser Phe Pro Ala Met Ile Glu His Ile His Glu Leu Ile Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Val Lys Asp Asp Pro Glu Leu Leu Pro Arg Ala Val His Glu Leu Leu
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ser Glu
305 310 315 320
Asp Val Glu Ile Ala Gly Thr Pro Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Ala Pro
340 345 350
Glu Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Phe Thr His Tyr Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Asp Glu Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asp Ser Leu Pro Val Arg Leu Gly
420 425 430
96
18
DNA
Streptomyces
96
cgsccsccsc tswssaas 18
97
21
DNA
Streptomyces
97
sassgcstts bcccartgyt c 21
98
1266
DNA
Streptomyces
98
gtggtcgacg cacaccagac gttcgtcatc gtcgggggtg gcctggccgg cgcaaaggcc 60
gcggagactc tccgcgcgga ggggttcacc ggccgggtga tcctcatctg tgacgagcgc 120
gaccacccgt acgagcgccc cccgctctcc aaggggttcc tgctcggcaa ggaagagcgc 180
gacagcgtgt tcgtccatga gcccgcctgg tacgcccagg cacagatcga actgcacctg 240
ggccagcccg ccgtccgcct cgaccccgag ggcaggaccg tccgcctcgg cgacggcacc 300
ctgatcgcct acgacaagct gctgctggcc accggcgccg aaccgcggcg cctggacatc 360
cccggcaccg gcctggccgg cgtgcaccac ctgcgccgcc tcgcccacgc cgaacggctg 420
cgcggcgtcc tggcctccct cggccgcgac aacggccatc tggtgatcgc cggagccggc 480
tggatcggcc tggaggtcgc cgccgcggcc cgctcctacg gcgccgaggt gaccgtcgtc 540
gaggccgccc cgacgccgct gcacggcatc ctggggcccg aactcggcgg tctgttcacc 600
gatctgcacc gcgagcacgg cgtccgcttc cacttcggcg cccgcttcac cgagatcgtc 660
ggagagggcg gcatggtgct cgccgtgcgc accgacgacg gcgaggaaca ccccgcccac 720
gatgtgctcg ccgcgatcgg cgccgccccg cgcaccgcgc tcgccgaaca ggccgggctg 780
gatctcgccg acccggagac cggcggcggg gtggccgtcg acgcggcgct gcgcacctcc 840
gacccgtaca tctacgccgc cggtgacgtc gccgccgccg accacccgct gctggacacc 900
cggctgcggg tcgaacactg ggccaacgcc ctcaacggcg gcccggccgc cgcccgcgcc 960
atgctcggcc aggacatcag ctacgaccgc atcccgtact tcttctccga ccagtacgac 1020
gtcggcatgg agtactccgg ctacgccccg cccggctcgt acgcccaggt cgtctgccgc 1080
ggcgacgtcg ccaagcggga gttcatcgcc ttctggctgg cggcggacgg ccggctgctc 1140
gcgggcatga acgtcaacgt ctgggacgtc gccgagtcca tccagcaact catccgctcc 1200
ggggcgccgt tggagcccgg cgcactggcc gatccgcagg ttccgctggc ggcactgctc 1260
ccgtag 1266
99
421
PRT
Streptomyces
99
Val Val Asp Ala His Gln Thr Phe Val Ile Val Gly Gly Gly Leu Ala
1 5 10 15
Gly Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Thr Gly Arg
20 25 30
Val Ile Leu Ile Cys Asp Glu Arg Asp His Pro Tyr Glu Arg Pro Pro
35 40 45
Leu Ser Lys Gly Phe Leu Leu Gly Lys Glu Glu Arg Asp Ser Val Phe
50 55 60
Val His Glu Pro Ala Trp Tyr Ala Gln Ala Gln Ile Glu Leu His Leu
65 70 75 80
Gly Gln Pro Ala Val Arg Leu Asp Pro Glu Gly Arg Thr Val Arg Leu
85 90 95
Gly Asp Gly Thr Leu Ile Ala Tyr Asp Lys Leu Leu Leu Ala Thr Gly
100 105 110
Ala Glu Pro Arg Arg Leu Asp Ile Pro Gly Thr Gly Leu Ala Gly Val
115 120 125
His His Leu Arg Arg Leu Ala His Ala Glu Arg Leu Arg Gly Val Leu
130 135 140
Ala Ser Leu Gly Arg Asp Asn Gly His Leu Val Ile Ala Gly Ala Gly
145 150 155 160
Trp Ile Gly Leu Glu Val Ala Ala Ala Ala Arg Ser Tyr Gly Ala Glu
165 170 175
Val Thr Val Val Glu Ala Ala Pro Thr Pro Leu His Gly Ile Leu Gly
180 185 190
Pro Glu Leu Gly Gly Leu Phe Thr Asp Leu His Arg Glu His Gly Val
195 200 205
Arg Phe His Phe Gly Ala Arg Phe Thr Glu Ile Val Gly Glu Gly Gly
210 215 220
Met Val Leu Ala Val Arg Thr Asp Asp Gly Glu Glu His Pro Ala His
225 230 235 240
Asp Val Leu Ala Ala Ile Gly Ala Ala Pro Arg Thr Ala Leu Ala Glu
245 250 255
Gln Ala Gly Leu Asp Leu Ala Asp Pro Glu Thr Gly Gly Gly Val Ala
260 265 270
Val Asp Ala Ala Leu Arg Thr Ser Asp Pro Tyr Ile Tyr Ala Ala Gly
275 280 285
Asp Val Ala Ala Ala Asp His Pro Leu Leu Asp Thr Arg Leu Arg Val
290 295 300
Glu His Trp Ala Asn Ala Leu Asn Gly Gly Pro Ala Ala Ala Arg Ala
305 310 315 320
Met Leu Gly Gln Asp Ile Ser Tyr Asp Arg Ile Pro Tyr Phe Phe Ser
325 330 335
Asp Gln Tyr Asp Val Gly Met Glu Tyr Ser Gly Tyr Ala Pro Pro Gly
340 345 350
Ser Tyr Ala Gln Val Val Cys Arg Gly Asp Val Ala Lys Arg Glu Phe
355 360 365
Ile Ala Phe Trp Leu Ala Ala Asp Gly Arg Leu Leu Ala Gly Met Asn
370 375 380
Val Asn Val Trp Asp Val Ala Glu Ser Ile Gln Gln Leu Ile Arg Ser
385 390 395 400
Gly Ala Pro Leu Glu Pro Gly Ala Leu Ala Asp Pro Gln Val Pro Leu
405 410 415
Ala Ala Leu Leu Pro
420
100
1314
DNA
Streptomyces
100
atgcccgctg cacgccgccg ccttcgacct ccgcaccgga gcggcgacct gcctgcccgc 60
ccgccgggcc gtgcgcaccc accccgtgac cgtccaggac ggcatgatct acgtccatca 120
cgccgcggag gagggcaccg ccgcatgaag tcggtcgctg tcatcggggc ctcgctggcg 180
ggcctgtacg ccgcgcggtc cctgcgttcc caggggttcg acggccgcct ggtgatcgtc 240
ggggacgagt gccacggccc ctacgaccgg cccccgctgt ccaaggactt cctcaccggc 300
gccaccgacc cgggccgact cgccctggcc gacgccgagg agatcgccga actcgacgcc 360
gaatggctgc tgggcacccg ggccaccggg ctcgacaccg gcggacgcac ggtgctgctc 420
gatggcggcc ggtccctgac caccgacggc gtggtcctcg ccaccggcgc cgccccgcgc 480
ctgctccccg gaccggtgcc cgccggggtc cacaccctgc gcaccctcga cgacgcccag 540
gcgctccgtg cggatctggc gccggcgccg gtccgggtcg tggtgatcgg cggcggcttc 600
atcggcgccg aggtcgcctc gtcctgcgcc gccctaggcc atgacgtcac cgtggtcgag 660
gccgcgccgc tccccctcgt cccccaactc ggccacgcca tggccgagat ctgcgccgcc 720
ctgcatgcgg accacggcgt cacgctgctc accggaaccg gtgtcgcccg gctgcgcagc 780
gagggcgacg gccggcgcgt caccggcgtc gagctgaccg acggccgcct gctccccgcc 840
gacgtggtcg tcgtcggcat cggggtacgc ccccgcaccg cctggctcac ggactccgga 900
ctgccgctcg acgacggtgt gctctgcgac gcgggctgtg tcaccccgct gcccgccgtc 960
gtggccgtcg gcgacgtcgc cagggtggac ggcgcccgtg ccgagcactg gaccagcgcc 1020
accgaacagg ccgccgtggc ggcgcggaac ctgctggccg gcagcaccgt cgcgacccac 1080
cggagcctgc cgtacttctg gtccgaccag tacggcgtcc gcatccagtt cgcgggccac 1140
cggctgccca ccgacacacc gcgcgtcctc gaaggctccc ccgacgaccg cagcttcctc 1200
gcctgttacg aacgggacgg acgcaccacc gcggtgctcg ccctcaaccg gccccgcccc 1260
ttcatgcggc tccgccgcga actcgcccgc accgccctgt cggccaccac ctga 1314
101
437
PRT
Streptomyces
101
Met Pro Ala Ala Arg Arg Arg Leu Arg Pro Pro His Arg Ser Gly Asp
1 5 10 15
Leu Pro Ala Arg Pro Pro Gly Arg Ala His Pro Pro Arg Asp Arg Pro
20 25 30
Gly Arg His Asp Leu Arg Pro Ser Arg Arg Gly Gly Gly His Arg Arg
35 40 45
Met Lys Ser Val Ala Val Ile Gly Ala Ser Leu Ala Gly Leu Tyr Ala
50 55 60
Ala Arg Ser Leu Arg Ser Gln Gly Phe Asp Gly Arg Leu Val Ile Val
65 70 75 80
Gly Asp Glu Cys His Gly Pro Tyr Asp Arg Pro Pro Leu Ser Lys Asp
85 90 95
Phe Leu Thr Gly Ala Thr Asp Pro Gly Arg Leu Ala Leu Ala Asp Ala
100 105 110
Glu Glu Ile Ala Glu Leu Asp Ala Glu Trp Leu Leu Gly Thr Arg Ala
115 120 125
Thr Gly Leu Asp Thr Gly Gly Arg Thr Val Leu Leu Asp Gly Gly Arg
130 135 140
Ser Leu Thr Thr Asp Gly Val Val Leu Ala Thr Gly Ala Ala Pro Arg
145 150 155 160
Leu Leu Pro Gly Pro Val Pro Ala Gly Val His Thr Leu Arg Thr Leu
165 170 175
Asp Asp Ala Gln Ala Leu Arg Ala Asp Leu Ala Pro Ala Pro Val Arg
180 185 190
Val Val Val Ile Gly Gly Gly Phe Ile Gly Ala Glu Val Ala Ser Ser
195 200 205
Cys Ala Ala Leu Gly His Asp Val Thr Val Val Glu Ala Ala Pro Leu
210 215 220
Pro Leu Val Pro Gln Leu Gly His Ala Met Ala Glu Ile Cys Ala Ala
225 230 235 240
Leu His Ala Asp His Gly Val Thr Leu Leu Thr Gly Thr Gly Val Ala
245 250 255
Arg Leu Arg Ser Glu Gly Asp Gly Arg Arg Val Thr Gly Val Glu Leu
260 265 270
Thr Asp Gly Arg Leu Leu Pro Ala Asp Val Val Val Val Gly Ile Gly
275 280 285
Val Arg Pro Arg Thr Ala Trp Leu Thr Asp Ser Gly Leu Pro Leu Asp
290 295 300
Asp Gly Val Leu Cys Asp Ala Gly Cys Val Thr Pro Leu Pro Ala Val
305 310 315 320
Val Ala Val Gly Asp Val Ala Arg Val Asp Gly Ala Arg Ala Glu His
325 330 335
Trp Thr Ser Ala Thr Glu Gln Ala Ala Val Ala Ala Arg Asn Leu Leu
340 345 350
Ala Gly Ser Thr Val Ala Thr His Arg Ser Leu Pro Tyr Phe Trp Ser
355 360 365
Asp Gln Tyr Gly Val Arg Ile Gln Phe Ala Gly His Arg Leu Pro Thr
370 375 380
Asp Thr Pro Arg Val Leu Glu Gly Ser Pro Asp Asp Arg Ser Phe Leu
385 390 395 400
Ala Cys Tyr Glu Arg Asp Gly Arg Thr Thr Ala Val Leu Ala Leu Asn
405 410 415
Arg Pro Arg Pro Phe Met Arg Leu Arg Arg Glu Leu Ala Arg Thr Ala
420 425 430
Leu Ser Ala Thr Thr
435
102
1233
DNA
Streptomyces
102
atggcccaga acacggcatt catcatcgcg ggagcggggc tggccggggc gaaggccgcg 60
gagacactgc gcgcggaggg cttcggcggc cccgtcctgc tgctgggcga cgagcgcgag 120
cgtccctacg agcggccgcc gctgtccaag ggctacctct tgggcacctc cgagcgggag 180
aaggcgtacg tccatccgcc ccagtggtac gccgagcacg acgtcgatct gcggctgggc 240
aacgccgtca ccgccctcga cccggccggc cacgaggtga ccctcgccga cggcagccgg 300
ctgggctacg ccaagctgct gctggccacc ggctccactc cgcgccggct gccggtgccc 360
ggcgccgacc tcgacggggt ccacacgctg cggtacctgg cggacagcga ccgcctcaag 420
gacctcttcc ggtccgcgtc ccggatcgtg gtgatcggcg gcggctggat cggcctggag 480
accacggccg ccgcgcgtgc ggcgggggtc gaggtgaccg tgctggagtc ggcgccgctg 540
cccctgctgg gggtgctggg ccgcgaggtc gcccaggtct tcgccgatct gcacaccgag 600
cacggtgtcg cgctgcgctg cgacacccag gtcacggaga tcaccggcac gaacggcgcg 660
gtcgacgggg tacggctggc cgacggcacc cggatcgcgg ccgacgcggt gatcgtcggc 720
gtcgggatca cccccaactc cgagacggcc gccgcggccg ggctcaaggt cgacaacggc 780
gtcgtcgtgg acgagcggct gtgctcctcc cacccggaca tctacgccgc cggcgacgtc 840
gccaacgcct accaccccct cctgggcaag cacctccgcg tcgagcactg ggccaacgcc 900
ctccaccagc cgaagaccgc ggcccgggcc atgctgggcg gggaggccgg ctacgaccgg 960
ctgccgtact tcttcaccga ccagtacgac ctgggcatgg agtacacggg gcatgtggag 1020
ccgggcgggt acgaccgcgt ggtgttccgc ggcgacaccg gtgcccgcga gttcatcgcc 1080
ttctggctct ccggcggccg ggtgctggcc gggatgaatg tgaacgtatg ggacgtcacc 1140
gacccgatcc gggccctggt ggcgagcggg cgggccgtgg accccgagcg gctcgccgac 1200
gcggacgtac cgctggcgga tctggtcccc tga 1233
103
410
PRT
Streptomyces
103
Met Ala Gln Asn Thr Ala Phe Ile Ile Ala Gly Ala Gly Leu Ala Gly
1 5 10 15
Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Gly Gly Pro Val
20 25 30
Leu Leu Leu Gly Asp Glu Arg Glu Arg Pro Tyr Glu Arg Pro Pro Leu
35 40 45
Ser Lys Gly Tyr Leu Leu Gly Thr Ser Glu Arg Glu Lys Ala Tyr Val
50 55 60
His Pro Pro Gln Trp Tyr Ala Glu His Asp Val Asp Leu Arg Leu Gly
65 70 75 80
Asn Ala Val Thr Ala Leu Asp Pro Ala Gly His Glu Val Thr Leu Ala
85 90 95
Asp Gly Ser Arg Leu Gly Tyr Ala Lys Leu Leu Leu Ala Thr Gly Ser
100 105 110
Thr Pro Arg Arg Leu Pro Val Pro Gly Ala Asp Leu Asp Gly Val His
115 120 125
Thr Leu Arg Tyr Leu Ala Asp Ser Asp Arg Leu Lys Asp Leu Phe Arg
130 135 140
Ser Ala Ser Arg Ile Val Val Ile Gly Gly Gly Trp Ile Gly Leu Glu
145 150 155 160
Thr Thr Ala Ala Ala Arg Ala Ala Gly Val Glu Val Thr Val Leu Glu
165 170 175
Ser Ala Pro Leu Pro Leu Leu Gly Val Leu Gly Arg Glu Val Ala Gln
180 185 190
Val Phe Ala Asp Leu His Thr Glu His Gly Val Ala Leu Arg Cys Asp
195 200 205
Thr Gln Val Thr Glu Ile Thr Gly Thr Asn Gly Ala Val Asp Gly Val
210 215 220
Arg Leu Ala Asp Gly Thr Arg Ile Ala Ala Asp Ala Val Ile Val Gly
225 230 235 240
Val Gly Ile Thr Pro Asn Ser Glu Thr Ala Ala Ala Ala Gly Leu Lys
245 250 255
Val Asp Asn Gly Val Val Val Asp Glu Arg Leu Cys Ser Ser His Pro
260 265 270
Asp Ile Tyr Ala Ala Gly Asp Val Ala Asn Ala Tyr His Pro Leu Leu
275 280 285
Gly Lys His Leu Arg Val Glu His Trp Ala Asn Ala Leu His Gln Pro
290 295 300
Lys Thr Ala Ala Arg Ala Met Leu Gly Gly Glu Ala Gly Tyr Asp Arg
305 310 315 320
Leu Pro Tyr Phe Phe Thr Asp Gln Tyr Asp Leu Gly Met Glu Tyr Thr
325 330 335
Gly His Val Glu Pro Gly Gly Tyr Asp Arg Val Val Phe Arg Gly Asp
340 345 350
Thr Gly Ala Arg Glu Phe Ile Ala Phe Trp Leu Ser Gly Gly Arg Val
355 360 365
Leu Ala Gly Met Asn Val Asn Val Trp Asp Val Thr Asp Pro Ile Arg
370 375 380
Ala Leu Val Ala Ser Gly Arg Ala Val Asp Pro Glu Arg Leu Ala Asp
385 390 395 400
Ala Asp Val Pro Leu Ala Asp Leu Val Pro
405 410
104
1266
DNA
Streptomyces
104
gtggtcgacg cacaccagac gttcgtcatc gtcgggggtg gcctggccgg cgcaaaggcc 60
gcggagactc tccgcgcgga agggttcacc ggccgggtga tcctcatctg tgacgagcgc 120
gaccacccgt acgagcgccc cccgctctcc aaggggttcc tgctcggcaa ggaagagcgc 180
gacagcgttt tcgtccacga acccgcctgg tacgcccagg cacagatcga actgcacctg 240
ggccagcccg ccgtccgcct cgaccccgag gcgaagaccg tccgcctcgg cgacggcacc 300
ctgatcgcct acgacaagct gctgctggcc accggcgccg agccgcgccg cctggacatc 360
cccggcaccg gcctggccgg cgtgcaccac ctgcgccgcc tcgcccacgc cgaacggctg 420
cgcggcgtcc tggcctccct cgggcgggac aacgggcatc tggtgatcgc cggcgccggc 480
tggatcggcc tggaggtcgc cgccgcggcc cgctcctacg gcgccgaggt caccgtcgtc 540
gaggccgccc cgacaccgct gcacggcatc ctggggcccg aactcggcgg cctgttcacc 600
gaactgcacc gcgcacacgg cgtgcgcttc cacttcggcg cccgtttcac cgagatcgtc 660
ggacaggacg gcatggtgct cgccgtgcgc accgacgacg gcgaggagca ccccgcccac 720
gacgtgctcg ccgcgatcgg cgccgccccg cgcaccgcac tcgccgaaca ggccggactc 780
gacctcgccg acccggaggc cggcggcggc gtggccgtcg acgcgacgct gcgcacctcc 840
gacccgtaca tctacgccgc cggcgacgtg gccgccgccg accaccccct cctggacacc 900
cggctgcgcg tcgaacactg ggccaacgcc ctcaacggcg gcccggccgc cgcgcgcgcc 960
atgctcggcc aggacatcag ctacgaccgc gtcccgtact tcttctccga ccagtacgac 1020
gtcggcatgg agtactccgg ctacgccccg cccggctcct acgcacaggt cgtctgccgc 1080
ggcgacgtcg ccaaacggga gttcatcgcg ttctggctcg gcgaggacgg acggctgctc 1140
gcggggatga acgtcaacgt ctgggacgtc gccgaaacca tccagcaact catccgcggc 1200
ggggtgcggt tggagcccgg cgagctggct gatccggagg ttccgctgac ctcactgctc 1260
ccgtag 1266
105
421
PRT
Streptomyces
105
Val Val Asp Ala His Gln Thr Phe Val Ile Val Gly Gly Gly Leu Ala
1 5 10 15
Gly Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Thr Gly Arg
20 25 30
Val Ile Leu Ile Cys Asp Glu Arg Asp His Pro Tyr Glu Arg Pro Pro
35 40 45
Leu Ser Lys Gly Phe Leu Leu Gly Lys Glu Glu Arg Asp Ser Val Phe
50 55 60
Val His Glu Pro Ala Trp Tyr Ala Gln Ala Gln Ile Glu Leu His Leu
65 70 75 80
Gly Gln Pro Ala Val Arg Leu Asp Pro Glu Ala Lys Thr Val Arg Leu
85 90 95
Gly Asp Gly Thr Leu Ile Ala Tyr Asp Lys Leu Leu Leu Ala Thr Gly
100 105 110
Ala Glu Pro Arg Arg Leu Asp Ile Pro Gly Thr Gly Leu Ala Gly Val
115 120 125
His His Leu Arg Arg Leu Ala His Ala Glu Arg Leu Arg Gly Val Leu
130 135 140
Ala Ser Leu Gly Arg Asp Asn Gly His Leu Val Ile Ala Gly Ala Gly
145 150 155 160
Trp Ile Gly Leu Glu Val Ala Ala Ala Ala Arg Ser Tyr Gly Ala Glu
165 170 175
Val Thr Val Val Glu Ala Ala Pro Thr Pro Leu His Gly Ile Leu Gly
180 185 190
Pro Glu Leu Gly Gly Leu Phe Thr Glu Leu His Arg Ala His Gly Val
195 200 205
Arg Phe His Phe Gly Ala Arg Phe Thr Glu Ile Val Gly Gln Asp Gly
210 215 220
Met Val Leu Ala Val Arg Thr Asp Asp Gly Glu Glu His Pro Ala His
225 230 235 240
Asp Val Leu Ala Ala Ile Gly Ala Ala Pro Arg Thr Ala Leu Ala Glu
245 250 255
Gln Ala Gly Leu Asp Leu Ala Asp Pro Glu Ala Gly Gly Gly Val Ala
260 265 270
Val Asp Ala Thr Leu Arg Thr Ser Asp Pro Tyr Ile Tyr Ala Ala Gly
275 280 285
Asp Val Ala Ala Ala Asp His Pro Leu Leu Asp Thr Arg Leu Arg Val
290 295 300
Glu His Trp Ala Asn Ala Leu Asn Gly Gly Pro Ala Ala Ala Arg Ala
305 310 315 320
Met Leu Gly Gln Asp Ile Ser Tyr Asp Arg Val Pro Tyr Phe Phe Ser
325 330 335
Asp Gln Tyr Asp Val Gly Met Glu Tyr Ser Gly Tyr Ala Pro Pro Gly
340 345 350
Ser Tyr Ala Gln Val Val Cys Arg Gly Asp Val Ala Lys Arg Glu Phe
355 360 365
Ile Ala Phe Trp Leu Gly Glu Asp Gly Arg Leu Leu Ala Gly Met Asn
370 375 380
Val Asn Val Trp Asp Val Ala Glu Thr Ile Gln Gln Leu Ile Arg Gly
385 390 395 400
Gly Val Arg Leu Glu Pro Gly Glu Leu Ala Asp Pro Glu Val Pro Leu
405 410 415
Thr Ser Leu Leu Pro
420
1.PublishNumber: US-2003068788-A1
2.Date Publish: 20030410
3.Inventor: BUCKEL THOMAS GUNTHER
HAMMER PHILIP EUGENE
HILL DWIGHT STEVEN
LIGON JAMES MADISON
DURHAM ISTVAN MOLNAR
PACHLATKO JOHANNES PAUL
ZIRKLE ROSS ERIC
4.Inventor Harmonized: BUCKEL THOMAS GUNTHER(DE)
HAMMER PHILIP EUGENE(US)
HILL DWIGHT STEVEN(US)
LIGON JAMES MADISON(US)
DURHAM ISTVAN MOLNAR(US)
PACHLATKO JOHANNES PAUL(CH)
ZIRKLE ROSS ERIC(US)
5.Country: US
6.Claims:
(en)Disclosed is a family of P450 monooxygenases, each member of which regioselectively oxidizes avermectin to 4″-keto-avermectin. The P450 monooxgenases find use in methods and formulations for making emamectin from avermectin. Also disclosed are methods for purifying the P450 monooxygenases of the invention, binding agents that specifically bind to the P450 monooxygenases of the invention, and genetically engineered cells that express the P450 monooxygenases of the invention. Also disclosed are ferredoxins and ferredoxin reductases that are active with the P450 monooxygenases of the invention.
7.Description:
(en)CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Provisional Patent Application Serial No. 60/291,149 filed May 16, 2001, the entire contents of which are hereby incorporated by reference.
BACKGROUND OF THE INVENTION
[0002] 1. Field of the Invention
[0003] The invention relates to the field of agrochemicals, and in particular, to insecticides. More specifically, this invention relates to the derivatization of avermectin, particularly for the synthesis of emamectin.
[0004] 1. Summary of the Related Art
[0005] Emamectin is a potent insecticide and controls many pests such as thrips, leafminers, and worm pests including alfalfa caterpillar, beet armyworm, cabbage looper, corn earworm, cutworms, diamondback moth, tobacco budworm, tomato fruitworm, and tomato pinworm. Emamectin (4″-deoxy-4″-epi-N-methylamino avermectin B1a/B1b) is described in U.S. Pat. No. 4,874,749 and in Cvetovich, R. J. et al, J. Organic Chem. 59:7704-7708, 1994 (as MK-244).
[0006] U.S. Pat. No. 5,288,710 describes salts of emamectin that are especially valuable agrochemically. These salts of emamectin are valuable pesticides, especially for combating insects and representatives of the order Acarina. Some pests for which emamectin is useful in combating are listed in European Patent Application EP-A 736,252.
[0007] One drawback to the use of emamectin is the difficulty of its synthesis from avermectin. This is due to the first step of the process, which is the most costly and time-consuming step of producing emamectin, in which the 4″-carbinol group of avermectin must be oxidized to a ketone. The oxidation of the 4″-carbinol group is problematic due to the presence of two other hydroxyl groups on the molecule that must be chemically protected before oxidation and deprotected after oxidation. Thus, this first step, significantly increases the overall cost and time of producing emamectin from avermectin.
[0008] Because of the efficacy and potency of emamectin as an insecticide, there is a need to develop a cost and time effective method and/or reagent for regioselectively oxidizing the 4″-carbinol group of avermectin to produce 4″-keto-avermectin, which is a necessary intermediate for producing emamectin from avermectin.
BRIEF SUMMARY OF THE INVENTION
[0009] The invention provides a novel family of P450 monooxygenases, each member of which is able to regioselectively oxidize the 4″-carbinol group of unprotected avermectin, thereby resulting in a cheap, effective method to produce 4″-keto-avermectin, a necessary intermediate in the production of emamectin. The invention allows elimination of the costly, time-consuming steps of (1) chemically protecting the two other hydroxyl groups on the avermectin molecule prior to oxidation of the 4″-carbinol group that must be chemically protected before oxidation; and (2) chemically deprotecting these two other hydroxyl groups after oxidation. The invention thus provides reagents and methods for significantly reducing the overall cost of producing emamectin from avermectin.
[0010] Accordingly, in one aspect, the invention provides a purified nucleic acid molecule encoding a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4″ of a compound of formula (II), in free form or in salt form
[0011] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0012] or a single bond and a methylene bridge of the formula
[0013] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, in order to produce a compound of the formula (III), in free form or in salt form
[0014] in which R1, R2, R3, R4, R5, R6, R7, m, n, A, B, C, D, E and F have the meanings given for formula (II).
[0015] In another aspect, the invention provides a purified nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 66% identical to SEQ ID NO: 1. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 70% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO:9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31,SEQ ID NO: 33, or SEQ ID NO: 94. In some embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 80% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO:23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 90% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. In certain embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 95% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0016] In some embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0017] In particular embodiments, the nucleic acid molecule is isolated from a Streptomyces strain. In certain embodiments, the Streptomyces strain is selected from the group consisting of Streptomyces tubercidicus, Streptomyces lydicus, Streptomyces platensis, Streptomyces chattanoogensis, Streptomyces kasugaensis, Streptomyces rimosus, and Streptomyces albofaciens.
[0018] In some embodiments of this aspect, the nucleic acid molecule further comprises a nucleic acid sequence encoding a tag which is linked to the P450 monooxygenase via a covalent bond. In certain embodiments, the tag is selected from the group consisting of a His tag, a GST tag, an HA tag, a HSV tag, a Myc-tag, and VSV-G-Tag.
[0019] In another aspect, the invention provides a purified polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4″ of a compound of formula (II), in free form or in salt form
[0020] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0021] or a single bond and a methylene bridge of the formula
[0022] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, in order to produce a compound of the formula (III), in free form or in salt form
[0023] in which R1, R2, R3, R4, R5, R6, R7, m, n, A, B, C, D, E and F have the meanings given for formula (II).
[0024] In another aspect, the invention provides a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 50% identical to SEQ ID NO: 2. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 60% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 70% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 90% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, the P450 monooxygenase comprises or consists essentially of an amino acid sequence that is at least 95% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95.
[0025] In some embodiments of this aspect of the invention, the P450 monooxygenase comprises or consists essentially of an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, and SEQ ID NO: 95.
[0026] In certain embodiments, the P450 monooxygenase further comprises a tag. In some embodiments, the tag is selected from the group consisting of a His tag, a GST tag, an HA tag, a HSV tag, a Myc-tag, and VSV-G-Tag.
[0027] In another aspect, the invention provides a binding agent that specifically binds to a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the binding agent is an antibody. In certain embodiments, the antibody is a polyclonal antibody or a monoclonal antibody.
[0028] In yet another aspect, the invention provides a family of P450 monooxygenase polypeptides, wherein each member of the family regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 50% identical to SEQ ID NO: 2. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 60% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 70% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 90% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, each member of the family comprises or consists essentially of an amino acid sequence that is at least 95% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In some embodiments of this aspect of the invention, each member of the family comprises or consists essentially of an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, and SEQ ID NO: 95.
[0029] In still another aspect, the invention provides a cell genetically engineered to comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the nucleic acid molecule is positioned for expression in the cell. In certain embodiments, the cell further comprises a nucleic acid molecule encoding a ferredoxin protein. In some embodiments, the cell further comprises a nucleic acid molecule encoding a ferredoxin reductase protein.
[0030] In certain embodiments, the cell is a genetically engineered Streptomyces strain. In some embodiments, the cell is a genetically engineered Streptomyces lividans strain. In particular embodiments, the genetically engineered Streptomyces lividans strain has NRRL Designation No. B-30478. In particular embodiments, the cell is a genetically engineered Pseudomonas strain. In some embodiments, the cell is a genetically engineered Pseudomonas putida strain. In certain embodiments, the genetically engineered Pseudomonas putida strain has NRRL Designation No. B-30479. In some embodiments, the cell is a genetically engineered Escherichia coli strain.
[0031] In another aspect, the invention provides a purified nucleic acid molecule encoding a ferredoxin, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the nucleic acid molecule encoding a ferredoxin of the invention comprises or consists essentially of a nucleic acid sequence that is at least 81% identical to SEQ ID NO: 35 or SEQ ID NO: 37. In some embodiments, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 35 or SEQ ID NO: 37. In certain embodiments, the nucleic acid molecule encoding a ferredoxin of the invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO: 35 or SEQ ID NO: 37.
[0032] In yet another aspect, the invention provides a purified ferredoxin protein, wherein the ferredoxin protein is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the ferredoxin of the invention comprises or consists essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 36 or SEQ ID NO: 38. In some embodiments, the nucleic acid molecule comprises or consists essentially of an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 36 or SEQ ID NO: 38. In particular embodiments, the ferredoxin of the invention comprises or consists essentially of the amino acid sequence of SEQ ID NO: 36 or SEQ ID NO: 38.
[0033] In another aspect, the invention provides a purified nucleic acid molecule encoding a ferredoxin reductase, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the nucleic acid molecule encoding a ferredoxin reductase of the invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104.
[0034] In yet another aspect, the invention provides a purified ferredoxin reductase protein, wherein the ferredoxin reductase protein is isolated from a Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In certain embodiments, the ferredoxin reductase of the invention comprises or consists essentially of the amino acid sequence of SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, or SEQ ID NO: 105.
[0035] In another aspect, the invention provides a process for the preparation a compound of the formula (I) in free form or in salt form
[0036] in which R 1 -R 9 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2, or 3; and the bonds marked with A, B, C, D, E, and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0037] or a single bond and a methylene bridge of the formula
[0038] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, which process comprises
[0039] 1) bringing a compound of the formula (II), in free form or in salt form
[0040] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, into contact with a polypeptide according to the invention that regioselectively oxidizes the alcohol at position 4″in order to form a compound of the formula (III), in free form or in salt form
[0041] in which R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the meanings given for formula (I); and
[0042] 2) reacting the compound of the formula (III) with an amine of the formula HN(R 8 )R 9 , wherein R 8 and R 9 have the same meanings as given for formula (I), and which is known, in the presence of a reducing agent; and, in each case, if desired, converting a compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, into a different compound of formula (I) or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, separating a mixture of E/Z isomers obtainable in accordance with the process and isolating the desired isomer, and/or converting a free compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, into a salt or converting a salt, obtainable in accordance with the process or by another method, of a compound of formula (I) or of an E/Z isomer or tautomer thereof into the free compound of formula (I) or an E/Z isomer or tautomer thereof or into a different salt.
[0043] In some embodiments, the compound of formula (II) is further brought into contact with a polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin. In certain embodiments, the compound of formula (II) is further brought into contact with a polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin reductase. In some embodiments, the compound of formula (II) is further brought into contact with a reducing agent (e.g., NADH or NADPH).
[0044] In still a further embodiment, the invention provides a process for the preparation of a compound of the formula (III), in free form or in salt form
[0045] in which R 1 -R 7 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0046] or a single bond and a methylene bridge of the formula
[0047] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, which process comprises bringing a compound of the formula (II), in free form or in salt form
[0048] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the same meanings as given for formula (III) above, into contact with a polypeptide according to the invention that regioselectively oxidizes the alcohol at position 4″, and maintaining said contact for a time sufficient for the oxidation reaction to occur and isolating and purifying the compound of formula (III).
[0049] In yet another embodiment, the invention provides a process according to the invention for the preparation of a compound of the formula (I), in which n is 1; m is 1; A is a double bond; B is single bond or a double bond; C is a double bond; D is a single bond; E is a double bond; F is a double bond, or a single bond and a epoxy bridge, or a single bond and a methylene bridge; R 1 , R 2 and R 3 are H; R 4 is methyl; R 5 is C 1 -C 10 alkyl, C 3 -C 8 -cycloalkyl or C 2 -C 10 -alkenyl; R 6 is H; R 7 is OH; R 8 and R 9 are independently of each other H; C 1 -C 10 -alkyl or C 1 -C 10 -acyl, or together form —(CH 2 ) q —, where q is 4, 5 or 6.
[0050] In still another embodiment, the invention provides a process according to the invention for the preparation of a compound of the formula (I), in which n is 1; m is 1; A, B, C, E and F are double bonds; D is a single bond; R 1 , R 2 , and R 3 are H; R 4 is methyl; R 5 is s-butyl or isopropyl; R 6 is H; R 7 is OH; R 8 is methyl; and R 9 is H.
[0051] In still another embodiment, the invention provides a process according to the invention for the preparation of 4″-deoxy-4″-N-methylamino avermectin B 1a /B 1b benzoate salt.
[0052] In another aspect, the invention provides a method for making emamectin. The method comprises adding a P450 monooxygenase, that regioselectively oxidizes avermectin to 4″-keto-avermectin, to a reaction mixture comprising avermectin and incubating the reaction mixture under conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4″-keto-avermectin. In some embodiments, the reaction mixture further comprises a ferredoxin. In certain embodiments, the reaction mixture further comprises a ferredoxin reductase. In some embodiments, the reaction mixture further comprises a reducing agent (e.g., NADH or NADPH).
[0053] In still another aspect, the invention provides a formulation for making a compound of formula (I) comprising a polypeptide according to the invention exhibiting a P450 monooxygenase activity that is capable of regioselectively oxidising the alcohol at position 4″ in order to form a compound of formula (II). In some embodiments, the formulation further comprises a polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin (e.g., a ferredoxin from cell or strain from which the P450 monooxygenase was isolated or derived).
[0054] In still another aspect, the invention provides a formulation for making emamectin comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the formulation further comprises a ferredoxin (e.g., a ferredoxin from cell or strain from which the P450 monooxygenase was isolated or derived). In certain embodiments, the formulation further comprises a ferredoxin reductase (e.g., a ferredoxin reductase from cell or strain from which the P450 monooxygenase was isolated or derived). In some embodiments, the formulation further comprises a reducing agent (e.g., NADH or NADPH). dr
BRIEF DESCRIPTION OF THE DRAWINGS
[0055]FIGS. 1A and 1B are schematic representations of an HPLC chromatogram (FIG. 1A) and data (FIG. 1B) showing the conversion of avermectin B1a to 4″-hydroxy-avermectin B1a and 4″-keto-avermectin B1a (also called 4″-oxo-avermectin B1a) and a side product from the biocatalysis reaction by a non-limiting P450 monooxygenase of the invention, P450 Ema1 . The HPLC chromatogram using HPLC protocol I to resolve the products is shown in FIG. 1A, and the peaks are identified in FIG. 1B by their retention times. The Y-axis of FIG. 1A shows the milli-absorbance units (mAU) at 243 nm.
[0056]FIG. 2 is a representation of an HPLC chromatogram showing the increased biocatalysis activity (ie., the ability to regioselectively oxidize avermectin to 4″-keto-avermectin) by Streptomyces tubercidicus R-922 UV Mutant as compared to wild-type Streptomyces tubercidicus R-922. The Y-axis shows the milli-absorbance units (mAU) at 243 nm.
[0057]FIG. 3 is a schematic representation of the alignment of the deduced amino acid sequences of P450 gene-specific PCR fragments derived from Streptomyces tubercidicus strain R-922 and the P450 monooxygenase from S. thermotolerans that is involved in the synthesis of carbomycin (Stol-ORFA) (GenBank Accession No. D30759) by the program Pretty from the University of Wisconsin Package version 10.1 (Altschul et al., Nucl. Acids Res. 25:3389-3402). Only the portion of Stol-ORFA that is homologous to the PCR fragments is shown. The O 2 and heme binding sites at each end are indicated. The consensus sequence is shown on the bottom line and includes only those residues that are completely conserved in all sequences compared.
[0058]FIG. 4 is a schematic representation of the alignment of the deduced amino acid sequences of P450 gene-specific PCR fragments derived from Streptomyces strain I-1529 and the P450 monooxygenase from S. thermotolerans that is involved in the synthesis of carbomycin (Stol-ORFA) (GenBank Accession No. D30759) by Pretty. Only the portion of Stol-ORFA that is homologous to the PCR fragments is shown. The O 2 and heme binding sites at each end are indicated. The consensus sequence is shown on the bottom line and includes only those residues that are completely conserved in all sequences compared.
[0059]FIG. 5 is a schematic representation of the alignment of the deduced amino acid sequence of the 600 bp P450 gene fragment from Example VIII with the amino acid sequences of peptide fragments derived from purified P450 Ema1 enzyme from Example VII.
[0060]FIG. 6 is a schematic representation of the alignment of the deduced amino acid sequence of two non-limiting P450 monooxygenases of the invention, namely from Streptomyces strains R-922 and I-1529, that are involved in emamectin biosynthesis. These are compared to the amino acid sequence of a P450 monooxygenase from S. thermotolerans that is involved in the synthesis of carbomycin (Carb-P450) (GenBank Accession No. D30759). Conserved residues in all three P450's are shown on the bottom line of the figure as the “consensus” sequence.
[0061]FIG. 7 is a diagrammatic representation showing a map of plasmid pTBBKA. Recognition sites by the indicated restriction endonucleases are shown, along with the location of the site in the nucleotide sequence of the plasmid. Also shown are genes (e.g., kanamycin resistance “KanR”), and other functional aspects (e.g., Tip promoter) contained in the plasmid.
[0062]FIG. 8 is a diagrammatic representation showing a map of plasmid pTUA1A. Recognition sites by the indicated restriction endonucleases are shown, along with the location of the site in the nucleotide sequence of the plasmid. Also shown are genes (e.g., ampicillin resistance “AmpR”) and other functional aspects (e.g., Tip promoter) contained in the plasmid.
[0063]FIG. 9 is a representation of an HPLC chromatogram showing the oxidation of avermectin to 4″-keto-avermectin by S. lividans transformed with the pTBBKA-ema1, following induction of ema1 expression with 0, 0,5, or 5.0 μg/ml thiostrepton. The Y-axis shows the milli-absorbance units (mAU) at 243 nm.
[0064]FIG. 10 is a diagrammatic representation of a phylogenetic tree showing the relationships between the seventeen ema genes described herein based on the deduced amino acid sequences of their protein products.
[0065]FIG. 11 is a diagrammatic representation showing a map of plasmid pRK-ema1/fd233. This plasmid was derived by ligating a Bg1II fragment containing the ema1 and fd233 genes organized on a single transcriptional unit into the Bg1II site of the broad host-range plasmid pRK290. The ema1/fd233 genes are expressed by the tac promoter (Ptac), and they are followed by the tac terminator (Ttac). Restriction endonuclease recognition sites shown are Bg1II “B”; EcoRI “E”; PacI “Pc”; PmeI “Pm”; and Sa1I “S.”
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0066] The patent and scientific literature referred to herein establishes knowledge that is available to those with skill in the art. The issued patents, applications, and references, including GenBank database sequences, that are cited herein are hereby incorporated by reference to the same extent as if each was specifically and individually indicated to be incorporated by reference. Any conflict between any reference cited herein and the specific teachings of this specification shall be resolved in favor of the latter. Likewise, any conflict between an art-understood definition of a word or phrase and a definition of the word or phrase as specifically taught in this specification shall be resolved in favor of the latter.
[0067] More particularly, the family of polypeptides according to the invention may be used in a process for the preparation a compound of the formula (I), in free form or in salt form
[0068] in which R 1 -R 9 represent, independently of each other hydrogen or a substituent; m is 0, 1 or 2; n is 0, 1, 2 or 3; and the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a epoxide bridge of the formula
[0069] or a single bond and a methylene bridge of the formula
[0070] including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer thereof, which process comprises
[0071] 1) bringing a compound of the formula (II), in free form or in salt form
[0072] wherein R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, into contact with a polypeptide according to the invention which exhibits an enzymatic activity of a P450 monooxygenases and is capable of regioselectively oxidizing the alcohol at position 4″ of formula (II) in order to produce a compound of the formula (III), in free form or in salt form
[0073] in which R 1 -R 7 represent, independently of each other hydrogen or a substituent; and m, n, A, B, C, D, E and F have the meanings given for formula (I); and
[0074] 2) reacting the compound of the formula (III) with an amine of the formula HN(R 8 )R 9 , wherein R 8 and R 9 have the same meanings as given for formula (I), and which is known, in the presence of a reducing agent; and, in each case, if desired, converting a compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, into a different compound of formula (I) or an E/Z isomer or tautomer thereof, in each case in free form or in salt form, separating a mixture of E/Z isomers obtainable in accordance with the process and isolating the desired isomer, and/or converting a free compound of formula (I) obtainable in accordance with the process or by another method, or an E/Z isomer or tautomer thereof, into a salt or converting a salt, obtainable in accordance with the process or by another method, of a compound of formula (I) or of an E/Z isomer or tautomer thereof into the free compound of formula (I) or an E/Z isomer or tautomer thereof or into a different salt.
[0075] Methods of synthesis for the compounds of formula (I) are described in the literature. It has been found, however, that the processes known in the literature cause considerable problems during production basically on account of the low yields and the tedious procedures which have to be used. Accordingly, the known processes are not satisfactory in that respect, giving rise to the need to make available improved preparation processes for those compounds.
[0076] The compounds (I), (II) and (III) may be in the form of tautomers. Accordingly, hereinbefore and hereinafter, where appropriate the compounds (I), (II) and (III) are to be understood to include corresponding tautomers, even if the latter are not specifically mentioned in each case.
[0077] The compounds (I), (II), and (III) are capable of forming acid addition salts. Those salts are formed, for example, with strong inorganic acids, such as mineral acids, for example perchloric acid, sulfuric acid, nitric acid, nitrous acid, a phosphoric acid or a hydrohalic acid, with strong organic carboxylic acids, such as unsubstituted or substituted, for example halo-substituted, C 1 -C 4 alkanecarboxylic acids, for example acetic acid, saturated or unsaturated dicarboxylic acids, for example oxalic, malonic, succinic, maleic, fumaric or phthalic acid, hydroxycarboxylic acids, for example ascorbic, lactic, malic, tartaric or citric acid, or benzoic acid, or with organic sulfonic acids, such as unsubstituted or substituted, for example halo-substituted, C 1 -C 4 alkane- or aryl-sulfonic acids, for example methane- or p-toluene-sulfonic acid. Furthermore, compounds of formula (I), (II), and (III) having at least one acidic group are capable of forming salts with bases. Suitable salts with bases are, for example, metal salts, such as alkali metal or alkaline earth metal salts, for example sodium, potassium or magnesium salts, or salts with ammonia or an organic amine, such as morpholine, piperidine, pyrrolidine, a mono-, di- or tri-lower alkylamine, for example ethyl-, diethyl-, triethyl- or dimethyl-propyl-amine, or a mono-, di- or tri-hydroxy-lower alkylamine, for example mono-, di- or tri-ethanolamine. In addition, corresponding internal salts may also be formed. Particularly useful, within the scope of the invention, are agrochemically advantageous salts. In view of the close relationship between the compounds of formula (I), (II) and (III) in free form and in the form of their salts, any reference hereinbefore or hereinafter to the free compounds of formula (I), (II) and (III) or to their respective salts is to be understood as including also the corresponding salts or the free compounds of formula (I), (II) and (III), where appropriate and expedient. The same applies in the case of tautomers of compounds of formula (I), (II) and (III) and the salts thereof. The free form is generally useful in each case.
[0078] Useful, within the scope of this invention, is a process for the preparation of compounds of the formula (I), in which n is 1; m is 1; A is a double bond; B is single bond or a double bond; C is a double bond; D is a single bond; E is a double bond; F is a double bond, or a single bond and a epoxy bridge, or a single bond and a methylene bridge; R 1 , R 2 and R 3 are H; R 4 is methyl; R 5 is C 1 -C 10 -alkyl, C 3 -C 8 -cycloalkyl or C 2 -C 10 -alkenyl; R 6 is H; R 7 is OH; R 8 and R 9 are independently of each other H; C 1 -C 10 -alkyl or C 1 -C 10 -acyl, or together form —(CH 2 ) q —; and q is 4, 5 or 6.
[0079] Especially useful within the scope of this invention is a process for the preparation of a compound of the formula (I) in which n is 1; m is 1; A, B, C, E and F are double bonds; D is a single bond; R 1 , R 2 , and R 3 are H; R 4 is methyl; R 5 is s-butyl or isopropyl; R 6 is H; R 7 is OH; R8 is methyl; and R 9 is H.
[0080] Very especially useful is a process for the preparation of emamectin, more particularly the benzoate salt of emamectin. Emamectin is a mixture of 4″-deoxy-4″-N-methylamino avermectin B 1a /B 1b and is described in U.S. Pat. No. 4,4874,749 and as MK-244 in J. Organic Chem. 59:7704-7708, 1994. Salts of emamectin that are especially valuable agrochemically are described in U.S. Pat. No. 5,288,710. Each member of this family of peptides exhibiting an enzymatic activity of a P450 monooxygenases as described hereinbefore is able to oxidize unprotected avermectin regioselectively at position 4″, thus opening a new and more economical route for the production of emamectin.
[0081] The family members each catalyze the following reaction:
[0082] Accordingly, the invention provides a purified nucleic acid molecule encoding a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4″ of a compound of formula (II) such as avermectin in order to produce a compound of formula (III), but especially 4″-keto-avermectin.
[0083] In particular, the invention provides a purified nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. A “nucleic acid molecule” refers to single-stranded or double-stranded polynucleotides, such as deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or analogs of either DNA or RNA.
[0084] The invention also provides a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. As used herein, by “purified” is meant a nucleic acid molecule or polypeptide (e.g., an enzyme or antibody) that has been separated from components which naturally accompany it. For example, in the case of a nucleic acid molecule, the purified nucleic acid molecule is separated from nucleotide sequences, such as promoter or enhancer sequences, that flank the nucleic acid molecule as it naturally occurs in the chromosome. In the case of a protein, the purified protein is separated from components, such as other proteins or fragments of cell membrane, that accompany it in the cell. Of course, those of ordinary skill in molecular biology will understand that water, buffers, and other small molecules may additionally be present in a purified nucleic acid molecule or purified protein preparation. A purified nucleic acid molecule or purified polypeptide (e.g., enzyme) of the invention that is at least 95% by weight, or at least 98% by weight, or at least 99% by weight, or 100% by weight free of components which naturally accompany the nucleic acid molecule or polypeptide.
[0085] According to the invention, a purified nucleic acid molecule may be generated, for example, by excising the nucleic acid molecule from the chromosome. It may then be ligated into an expression plasmid. Other methods for generating a purified nucleic acid molecule encoding a P450 monooxygenase of the invention are available and include, without limitation, artificial synthesis of the nucleic acid molecule on a nucleic acid synthesizer.
[0086] Similarly, a purified P450 monooxygenase of the invention may be generated, for example, by recombinant expression of a nucleic acid molecule encoding the P450 monooxygenase in a cell in which the P450 monooxygenase does not naturally occur. Of course, other methods for obtaining a purified P450 monooxygenase of the invention include, without limitation, artificial synthesis of the P450 monooxygenase on a peptide synthesizer and isolation of the P450 monooxygenase from a cell in which it naturally occurs using, e.g., an antibody that specifically binds the P450 monooxygenase.
[0087] Biotransformations of secondary alcohols to ketones by Streptomyces bacteria are known to be catalyzed by dehydrogenases or oxidases. However, prior to the present discovery of the cytochrome P450 monooxygenase from Streptomyces tubercidicus strain R-922 responsible for the regioselective oxidation of avermectin to 4″-keto-avermectin, no experimental data of another cytochrome P450 monooxygenase from Streptomyces to oxidize a secondary alcohol to a ketone had been reported.
[0088] According to some embodiments of the invention, the nucleic acid molecule and/or the polypeptide encoded by the nucleic acid molecule are isolated from a Streptomyces strain. Thus, the nucleic acid molecule (or polypeptide encoded thereby) may be isolated from, without limitation, Streptomyces tubercidicus, Streptomyces lydicus, Streptomyces platensis, Streptomyces chattanoogensis, Streptomyces kasugaensis, Streptomyces rimosus, or Streptomyces albofaciens.
[0089] As described below, an entire family of P450 monooxygenases capable of regioselectively oxidizing avermectin to 4″-keto-avermectin has been discovered. All of these family members are related by at least 60% identity at the amino acid level. A useful nucleic acid molecule encoding a P450 monooxygenase of the invention comprises or consists essentially of a nucleic acid sequence that is at least 70% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. In certain embodiments, the nucleic acid molecule encoding a P450 monooxygenase of the invention comprises or consists essentially of a nucleic acid sequence that is at least 80% identical; or at least 85% identical; or at least 90% identical; or at least 95% identical; or at least 98% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO:I 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0090] Similarly, the invention provides a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin which, in some embodiments, comprises or consists essentially of an amino acid sequence that is at least 60% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In certain embodiments, the purified P450 monooxygenase of the invention comprises or consists essentially of an amino acid sequence that is at least 70% identical; or at least 80% identical; or at least 90% identical; or at least 95% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95.
[0091] In some embodiments, the nucleic acid molecule encoding a P450 monooxygenase of the invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94. Similarly, the P450 monooxygenase of the invention, in some embodiments, comprises or consists essentially of the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95.
[0092] One non-limiting source of a purified P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin is the cell-free extract described in the examples below. Another method for purifying a P450 monooxygenase in accordance with the invention is to attach a tag to the protein, thereby facilitating its purification. Accordingly, the invention provides a P450 monooxygenase which regioselectively oxidizes avermectin to 4″-keto-avermectin, wherein the P450 monooxygenase is covalently bound to a tag. The invention further provides a nucleic acid molecule encoding such a tagged P450 monooxygenase.
[0093] As used herein, a “tag” is meant a peptide or other molecule covalently bound to a polypeptide of the invention, whereby a binding agent (e.g., a polypeptide or molecule) specifically binds the tag. In accordance with the invention, by “specifically binds” is meant that the binding agent (e.g., an antibody or Ni 2+ resin) recognizes and binds to a particular polypeptide or chemical but does not substantially recognize or bind to other molecules in the sample. In some embodiments, a binding agent that specifically binds a ligand forms an association with that ligand with an affinity of at least 10 6 M −1 , or at least 10 7 M −1 , or at least 10 8 M −1 , or at least 10 9 M −1 either in water, under physiological conditions, or under conditions which approximate physiological conditions with respect to ionic strength, e.g., 140 mM NaCl, 5 mM MgCl 2 . For example, a His tag is specifically bound by nickel (e.g., the Ni 2+ -charged column commercially available as His•Bind® Resin from Novagen Inc, Madison, Wis.). Likewise, a Myc tag is specifically bound by an antibody that specifically binds Myc.
[0094] As described below, a his tag is attached to the P450 monooxygenases of the invention by generating a nucleic acid molecule encoding the His-tagged polypeptide, and expressing the polypeptide in E. coli. These polypeptides, once expressed by E. coli, are readily purified by standard techniques (e.g., using one of the His•Bind® Kits commercially available from Novagen or using the TALON™ Resin (and manufacturer's instructions) commercially available from Clontech Laboratories, Inc., Palo Alto, Calif.).
[0095] Additional tags may be attached to any or all of the P450 monooxygenases of the invention to facilitate purification. These tags include, without limitation, the HA-Tag (amino acid sequence: YPYDVPDYA (SEQ ID NO: 39)), the Myc-tag (amino acid sequence: EQKLISEEDL (SEQ ID NO: 40)), the HSV tag (amino acid sequence: QPELAPEDPED (SEQ ID NO: 41)), and the VSV-G-Tag (amino acid sequence: YTDIEMNRLGK (SEQ ID NO: 42)). Covalent attachment (e.g., via a peptide bond) of these tags to a polypeptide of the invention allows purification of the tagged polypeptide using, respectively, an anti-HA antibody, an anti-Myc antibody, an anti-HSV antibody, or an anti-VSV-G antibody, all of which are commercially available (for example, from MBL International Corp., Watertown, Mass.; Novagen Inc.; Research Diagnostics Inc., Flanders, N.J.).
[0096] The tagged P450 monooxygenases of the invention may also be tagged by a covalent bond to a chemical, such as biotin, which is specifically bound by streptavidin, and thus may be purified on a streptavidin column. Similarly, the tagged P450 monooxygenases of the invention may be covalently bound (e.g., via a peptide bond) to the constant region of an antibody. Such a tagged P450 monooxygenase may be purified, for example, on protein A sepharose.
[0097] The tagged P450 monooxygenases of the invention may also be tagged to a GST (glutathione-S-transferase) or the constant region of an immunoglobulin. For example, a nucleic acid molecule of the invention (e.g., comprising SEQ ID NO: 1) can be cloned into one of the pGEX plasmids commercially available from Amersham Pharmacia Biotech, Inc. (Piscataway N.J.), and the plasmid expressed in E. coli. The resulting P450 monooxygenase encoded by the nucleic acid molecule is covalently bound to a GST (glutathione-S-transferase). These GST fusion proteins can be purified on a glutathione agarose column (commercially available from, e.g., Amersham Pharmacia Biotech), and thus purified. Many of the pGEX plasmids enable easy removal of the GST portion from the fusion protein. For example, the pGEX-2T plasmid contains a thrombin recognition site between the inserted nucleic acid molecule of interest and the GST-encoding nucleic acid sequence. Similarly, the pGES-3T plasmid contains a factor Xa site. By treating the fusion protein with the appropriate enzyme, and then separating the GST portion from the P450 monooxygenase of the invention using glutathione agarose (to which the GST specifically binds), the P450 monooxygenase of the invention can be purified.
[0098] Yet another method to obtain a purified P450 monooxygenase of the invention is to use a binding agent that specifically binds to the P450 monooxygenase. Accordingly, the invention provides a binding agent that specifically binds to a P450 monooxygenase of the invention. This binding agent of the invention may be a chemical compound (e.g., a protein), a metal ion, or a small molecule.
[0099] In particular embodiments, the binding agent is an antibody. The term “antibody” encompasses, without limitation, polyclonal antibody, monoclonal antibody, antibody fragments (e.g., Fab, Fv, or Fab′ fragments), single chain antibody, chimeric antibody, bi-specific antibody, antibody of any isotype (e.g., IgG, IgA, and IgE), and antibody from any specifies (e.g., rabbit, mouse, and human).
[0100] In one non-limiting example, the binding agent of the invention is a polyclonal antibody. In another non-limiting example, the binding agent of the invention is a monoclonal antibody. Methods for making both monoclonal and polyclonal antibodies are well known (see, e.g., Current Protocols in Immunology, ed. John E. Coligan, John Wiley & Sons, Inc. 1993; Current Protocols in Molecular Biology, eds. Ausubel et a., John Wiley & Sons, Inc. 2000).
[0101] The P450 monooxygenases described herein that regioselectively oxidize avermectin to 4″-keto-avermectin belong to a family of novel P450 monooxygenases. Accordingly, the invention also provides a family of P450 monooxygenase polypeptides, wherein each member of the family regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, each member of the family comprises or consists of an amino acid sequence that is at least 50% identical to SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, or SEQ ID NO: 95. In particular embodiments, each member of the family is encoded by a nucleic acid molecule comprising or consisting of a nucleic acid sequence that is at least 66% identical to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, or SEQ ID NO: 94.
[0102] The present invention, which provides an entire family of P450 monooxygenases, each member of which is able to regioselectively oxidize avermectin to 4″-keto-avermectin, allowed for the generation of an improved P450 monooxygenase, which may not be naturally occurring, but which regioselectively oxidizes avermectin to 4″-keto-avermectin with efficiency and with reduced undesirable side product. For instance, one of the members of the P450 monooxygenase family of the invention, P450 Ema1 enzyme catalyzes a further oxidation that is not desirable, since the formation of 3″-O-demethyl-4″-keto-avermectin has been detected in the reaction by Streptomyces tubercidicus strain R-922 and by Streptomyces lividans containing the ema1 gene. The formation of 3″-O-demethyl-4″-keto-avermectin is brought about by the oxidation of the 3″-O-methyl group, whereby the hydrolytically labile 3″-O-hydroxymethyl group is formed which hydrolyzes to form formaldehyde and the 3″-hydroxyl group.
[0103] An HPLC chromatogram showing product and side product from the reaction is shown in FIGS. 1A and 1B.
[0104] By providing a family of P450 monooxygenases that regioselectively oxidize avermectin to 4″-keto-avermectin (see, e.g., Table 3 below), individual members of the family can be subjected to family gene shuffling efforts in order to produce new hybrid genes encoding optimized P450 monooxygenases of the invention. In one non-limiting example, a portion of the ema1 gene encoding the O 2 binding site of the P450 Ema1 protein can be swapped with the portion of the ema2 gene encoding the O 2 binding site of the P450 Ema2 protein. Such a chimeric ema1/2 protein is within definition of a P450 monooxygenase of the invention.
[0105] Site-directed mutagenesis or directed evolution technologies may also be employed to generate derivatives of the ema1 gene that encode enzymes with improved properties, including higher overall activity and/or reduced side product formation. One method for deriving such a mutant is to mutate the Streptomyces strain itself, in a manner similar to the UV mutation of Streptomyces tubercidicus strain R-922 described below.
[0106] Additional derivatives may be made by making conservative or non-conservative changes to the amino acid sequence of a P450 monooxygenase. Conservative and non-conservative amino acid substitutions are well known (see, e.g., Stryer, Biochemistry, 3 rd Ed., W. H. Freeman and Co., NY 1988). Similarly, truncations of a P450 monooxygenase of the invention may be generated by truncating the protein at its N-terminus (e.g., see the ema1A gene described below), at its C-terminus, or truncating (i.e., removing amino acid residues) from the middle of the protein.
[0107] Such a mutant, derivative, or truncated P450 monooxygenase is a P450 monooxygenase of the invention as long as the mutant, derivative, or truncated P450 monooxygenase is able to regioselectively oxidize avermectin to 4″-keto-avermectin.
[0108] In another aspect, the invention provides a cell genetically engineered to comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. By “genetically engineered” is meant that the nucleic acid molecule is heterologous to the cell into which it is introduced. Introduction of the heterologous nucleic acid molecule into the genetically engineered cell may be accomplished by any means, including, without limitation, transfection, transduction, and transformation.
[0109] In certain embodiments, the nucleic acid molecule is positioned for expression in the genetically engineered cell. By “positioned for expression” is meant that the heterologous nucleic acid molecule encoding the polypeptide is linked to a regulatory sequence in such a way as to permit expression of the nucleic acid molecule when introduced into a cell. By “regulatory sequence” is meant nucleic acid sequences, such as initiation signals, polyadenylation (polyA) signals, promoters, and enhancers, which control expression of protein coding sequences with which they are operably linked. By “expression” of a nucleic acid molecule encoding a protein or polypeptide fragment is meant expression of that nucleic acid molecule as protein and/or mRNA.
[0110] A genetically engineered cell of the invention may be a prokaryotic cell (e.g., E. coli ) or a eukaryotic cell (e.g., Saccharomyces cerevisiae or mammalian cell (e.g., HeLa)). According to some embodiments of the invention, the genetically engineered cell is a cell wherein the wild-type (i.e., not genetically engineered) cell does not naturally contain the inserted nucleic acid molecule and does not naturally express the protein encoded by the inserted nucleic acid molecule. Accordingly, the cell may be a genetically engineered Streptomyces strain, such as a Streptomyces lividans or a Streptomyces avermitilis strain. Alternatively, the cell may be a genetically engineered Pseudomonas strain, such as a Pseudomonas putida strain or a Pseudomonas fluorescens strain. In another alternative, the cell may be a genetically engineered Escherichia coli strain.
[0111] Note that in some types of cells genetically engineered to comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin, the actual genetically engineered cell, itself, may not be able to convert avermectin into 4″-keto-avermectin. Rather, the P450 monooxygenase heterologously expressed by such a genetically engineered cell may be purified from that cell, where the purified P450 monooxygenase of the invention is able to regioselectively oxidize avermectin to 4″-keto-avermectin. Thus, the genetically engineered cell of the invention need not, itself, be able to regioselectively convert avermectin to 4″-keto-avermection; rather, the genetically engineered cell of the invention need only comprise a nucleic acid molecule encoding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin, regardless of whether the P450 monooxygenase is active inside that cell.
[0112] In addition, a cell (e.g., E. coli ) geneticially engineered to comprise a nucleic acid molecule encoding P450 monooxygenase of the invention may not be able to regioselectively oxidize avermectin to 4″-keto-avermection, although the P450 monooxygenase purified from the genetically engineered cell is able to regioselectively oxidize avermectin to 4″-keto-avermectin. However, if the same cell were genetically engineered to comprise a P450 monooxygenase of the invention, a ferredoxin of the invention, and/or a ferredoxin reductase of the invention, then the P450 monooxygenase together with the ferredoxin and the ferredoxin reductase, all purified from that cell, and in the presence of a reducing agent (e.g., NADH or NADPH), would be able to regioselectively oxidize avermectin to 4″-keto-avermectin. Furthermore,-the genetically engineered cell comprising a P450 monooxygenase of the invention, a ferredoxin of the invention, and/or a ferredoxin reductase of the invention, itself would be able to carry out this oxidation.
[0113] Moreover, in a non-limiting example where a cell (e.g., E. coli ) is genetically engineered to express P450 monooxygenase, a ferredoxin, and a ferredoxin reductase proteins of the invention, all three of these proteins, when purified from the genetically engineered E. coli, are active and together are able to regioselectively oxidize avermectin to 4″-keto-avermectin (e.g., in the presence of a reducing agent, such as NADH or NADPH), and so are useful in a method for making emamectin.
[0114] In accordance with the present invention, the following material has been deposited with the Agricultural Research Service, Patent Culture Collection (NRRL), 1815 North University Street, Peoria, Ill. 61604, under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure: (1) Streptomyces lividans ZX7 (ema1/fd233-TUA1A) NRRL Designation No. B-30478; and (2) Pseudomonas putida NRRL B-4067 containing plasmid pRK290-ema1/fd233, NRRL Designation No.B-30479.
[0115] In identifying the novel family of P450 monooxygenases that regioselectively oxidize avermectin to 4″-keto-avermectin, novel ferredoxins and novel ferredoxin reductases were also discovered in the same strains of bacteria in which the P450 monooxygenases were found. Accordingly, in a further aspect, the invention provides purified nucleic acid molecule encoding a ferredoxin, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a polypeptide that regioselectively oxidizes avermectin to 4″-keto-avermectin. Similarly, the invention provides a purified nucleic acid molecule encoding a ferredoxin reductase, wherein the nucleic acid molecule is isolated from a Streptomyces strain comprising a polypeptide that regioselectively oxidizes avermectin to 4″-keto-avermectin. The invention also provides a purified ferredoxin protein, as well as a purified ferredoxin reductase protein, wherein the ferredoxin protein and the ferredoxin reductase protein are isolated from a Streptomyces strain comprising a polypeptide that regioselectively oxidizes avermectin to 4″-keto-avermectin.
[0116] A useful nucleic acid molecule encoding a ferredoxin of the invention comprises or consists essentially of a nucleic acid sequence that is at least 81% identical to SEQ ID NO: 35 or SEQ ID NO: 37. Alternatively, the nucleic acid molecule comprises or consists essentially of a nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 35 or SEQ ID NO: 37. The nucleic acid molecule encoding a ferredoxin of the invention may comprise or consist essentially of the nucleic acid sequence of SEQ ID NO: 35 or SEQ ID NO: 37.
[0117] The ferredoxin of the invention may comprise or consist essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO: 36 or SEQ ID NO: 38. In some embodiments, the nucleic acid molecule comprises or consists essentially an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 36 or SEQ ID NO: 38. The ferredoxin of the invention may comprise or consist essentially of the amino acid sequence of SEQ ID NO: 36 or SEQ ID NO: 38.
[0118] A useful nucleic acid molecule encoding a ferredoxin reductase of the invention comprises or consists essentially of a nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104. The nucleic acid molecule encoding a ferredoxin reductase of the invention may comprise or consist essentially of the nucleic acid sequence of SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104. The ferredoxin reductase of the invention may comprise or consist essentially of an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103,or SEQ ID NO: 105. The ferredoxin reductase of the invention may comprise or consist essentially of the amino acid sequence of SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, or SEQ ID NO: 105.
[0119] Methods for purifying ferredoxin and ferredoxin reductase proteins and nucleic acid molecules encoding such ferredoxin and ferredoxin reductase proteins are known in the art and are the same as those described above for purifying P450 monooxygenases of the invention and nucleic acid molecules encoding P450 monooxygenases of the invention.
[0120] In one non-limiting example to obtain a purified P450 monooxygenase of the invention with a purified ferredoxin, a S. lividans strain (or P. putida strain, or any other cell in which the P450 monooxygenase of the invention does not naturally occur) may be genetically engineered to contain a first nucleic acid molecule encoding a P450 monooxygenase of the invention and a second nucleic acid molecule encoding a ferredoxin protein, where both the first and second nucleic acid molecules are positioned for expression in the genetically engineered cell. The first and the second nucleic acid molecules can be on separate plasmids, or can be on the same plasmid. Thus, the same engineered cell or strain will produce both the P450 monooxygenase of the invention and the ferredoxin protein of the invention.
[0121] In a further non-limiting example to obtain a purified P450 monooxygenase of the invention with a purified ferredoxin and with a purified ferredoxin reductase of the invention, a S. lividans strain (or P. putida strain, or any other cell in which the P450 monooxygenase of the invention does not naturally occur) may be genetically engineered to contain a first nucleic acid molecule encoding a P450 monooxygenase of the invention and a second nucleic acid molecule encoding a ferredoxin protein of the invention and a third nucleic acid molecule encoding a ferredoxin reductase protein of the invention, where all the first and second and third nucleic acid molecules are positioned for expression in the genetically engineered cell. The first and the second and the third nucleic acid molecules can be on separate plasmids, or can be on the same plasmid. Thus, the same engineered cell or strain will produce all the P450 monooxygenase of the invention and the ferredoxin protein and the ferredoxin reductase proteins of the invention.
[0122] As described above for the P450 monooxygenases of the invention, the ferredoxin protein and/or the ferredoxin reductase protein may further comprise a tag. Moreover, the invention contemplates binding agents (e.g., antibodies) that specifically bind to the ferredoxin protein, and binding agents that specifically bind to the ferredoxin reductase proteins of the invention. Methods for generating tagged ferredoxin protein, tagged ferredoxin reductase protein, and binding agents (e.g., antibodies) that specifically bind to ferredoxin or ferredoxin reductase are the same as those as described above for generating tagged P450 monooxygenases of the invention and generating binding agents that specifically bind P450 monooxygenases of the invention.
[0123] The invention also provides a method for making emamectin. In this method, a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin is added to a reaction mixture containing avermectin. The reaction mixture is then incubated under conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4″-keto-avermectin. The reaction mixture may further comprise a ferredoxin, such as a ferredoxin of the present invention. In particular embodiments, the reaction mixture further comprises a ferredoxin reductase, such as a ferredoxin reductase of the present invention. The reaction mixture may further comprise a reducing agent, such as NADH or NADPH.
[0124] Additionally, the invention provides a method for making 4″-keto-avermectin. The method comprises adding a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin to a reaction mixture comprising avermectin and incubating the reaction mixture under conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4″-keto-avermectin. In some embodiments, the reaction mixture further comprises a ferredoxin, such as a ferredoxin of the present invention. The reaction mixture may also further comprise a ferredoxin reductase, such as a ferredoxin reductase of the present invention. In particular embodiments, the reaction mixture further comprises a reducing agent, such as NADH or NADPH.
[0125] The invention also provides a formulation for making emamectin comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the formulation further comprises a ferredoxin, such as a ferredoxin of the present invention. In particular embodiments, the ferredoxin is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. The formulation may further comprise a ferredoxin reductase, such as a ferredoxin reductase of the present invention. In particular embodiments, the ferredoxin reductase is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. In some embodiments, the formulation further comprises a reducing agent, such as NADH or NADPH.
[0126] In addition, the invention provides a formulation for making 4″-keto-avermectin comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4″-keto-avermectin. In some embodiments, the formulation further comprises a ferredoxin, such as a ferredoxin of the present invention. In particular embodiments, the ferredoxin is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. In some embodiments, the formulation further comprises a ferredoxin reductase, such as a ferredoxin reductase of the present invention. In particular embodiments, the ferredoxin reductase is isolated from the same species of cell or strain from which the P450 monooxygenase was isolated or derived. The formulation may further comprise a reducing agent, such as NADH or NADPH.
[0127] The following examples are intended to further illustrate certain preferred embodiments of the invention and are not limiting in nature.
EXAMPLE I
Optimized Growth Conditions for Streptomyces Tubercidicus Strain R-922
[0128] In one non-limiting example, the fermentation conditions needed to provide a steady supply of cells of Streptomyces tubercidicus strain R- 922 highly capable of regioselectively oxidizing avermectin to 4″-keto-avermectin were optimized.
[0129] First, the following solutions were made. For ISP-2 agar, 4 g of yeast extract (commercially available from Oxoid Ltd, Basingstoke, UK), 4 g of D(+)-glucose, 10 g of bacto malt extract (Difco No. 0186-17-7 (Difco products commercially available from, e.g., Voigt Global Distribution, Kansas City, Mo.)), and 20 g of agar (Difco No. 0140-01) were dissolved in one liter of demineralized water, and the pH is adjusted to 7.0. The solution was sterilized at 121° C. for 20 min., cooled down, and kept at 55° C. for the time needed for the immediate preparation of the agar plates.
[0130] For PHG medium, 10 g of peptone (Sigma 0521; commercially available from Sigma Chemical Co., St. Louis, Mo.), 10 g of yeast extract (commercially available from Difco), 10 g of D-(+)-glucose, 2 g of NaCl, 0.15 g of MgSO 4 ×7 H 2 O, 1.3 g of NaH 2 PO 4 ×H 2 O, and 4.4 g of K 2 HPO 4 were dissolved in 1 liter of demineralized water, and the pH was adjusted to 7.0.
[0131] Streptomyces tubercidicus strain R-922 was grown in a Petri dish on ISP-2 agar at 28° C. This culture was used to inoculate four 500 ml shaker flasks with baffle, each containing 100 ml PHG medium. These pre-cultures were grown on an orbital shaker with 120 rpm at 28° C. for 72 hours and then used to inoculate a 10-liter fermenter equipped with a mechanical stirrer and containing 8 liters PHG medium. This main culture was grown at 28° C. with stirring at 500 rpm and with aeration of 1.75 vvm (14 l/min.) and a pressure of 0.7 bar. At the end of the exponential growth, after about 20 hours, the cells were harvested by centrifugation. The yield of wet cells was 70-80 g/l culture.
EXAMPLE II
Whole Cell Biocatalysis Assay
[0132] As determined in accordance with the present invention, the following whole cell biocatalysis assay was employed to determine that the activity from Streptomyces cells capable of regioselectively oxidizing avermectin to 4″-keto-avermectin is catalyzed by a P450 monooxygenase.
[0133] Streptomyces tubercidicus strain R-922 was grown in PHG medium, and Streptomyces tubercidicus strain I-1529 was grown in M-17 or PHG medium. PHG medium contains 10 g/l Peptone (Sigma, 0.521), 10 g/l Yeast Extract (Difco, 0127-17-9), 10 g/l D-Glucose, 2 g/l NaCl, 0.15 g/l MgSO 4 ×7 H 2 O, 1.3 g/l NaH 2 PO 4 ×1 H 2 O, and 4.4 g/l K 2 HPO 4 at pH 7.0. M-17 medium contains 10 g/l glycerol, 20 g/l Dextrin white, 10 g/l Soytone (Difco 0437-17), 3 g/l Yeast Extract (Difco 0127-17-9), 2 g/l (NH 4 ) 2 SO 4 , and 2 g/l CaCO 3 at pH 7.0
[0134] To grow the cells, an ISP2 agar plate (not older than 1-2 weeks) was inoculated and incubated for 3-7 days until good growth was achieved. Next, an overgrown agar piece was transferred (with an inoculation loop) to a 250 ml Erlenmeyer flask with 1 baffle containing 50 ml PHG medium. This pre-culture is incubated at 28° C. and 120 rpm for 2-3 days. Next, 5 ml of the pre-culture were transferred to a 500 ml Erlenmeyer flask with 1 baffle containing 100 ml PHG medium. The main culture was incubated at 28° C. and 120 rpm for 2 days. Next, the culture was centrifuged for 10 min. at 8000 rpm in a Beckman Rotor JA-14. The cells were next washed once with 50 mM potassium phosphate buffer, pH 7.0.
[0135] To perform the whole cell biocatalysis assay, 500 mg wet cells were placed into a 25 ml Erlenmeyer flask, to which were added 10 ml of 50 mM potassium phosphate buffer, pH 7.0. The cells were stirred with a magnetic stir bar to distribute the cells. Next, 15 μl of a solution of avermectin B1a in isopropanol (30 mg/ml) were added, and the mixture shaken on an orbital shaker at 160 rpm and 28° C. Strain R-922 was reacted for 2 hours, and strain I-1529 was reacted for 30 hours.
[0136] To work up the cultures in the whole cell biocatalysis assay, 10 ml methyl-t-butyl-ether was added to an Erlenmeyer flask containing the resting cells and the entire cell mixture was transferred to a 30 ml-centrifuge tube, shaken vigorously, and then centrifuged at 16000 rpm for 10 min. The ether phase was pipetted into a 50 ml pear flask, and evaporated in vacuo by means of a rotary evaporator (≦0.1 mbar). The residue was re-dissolved in 1.2 ml acetonitrile and transferred to an HPLC-sample vial. Formation of 4″-keto-avermectin B1a could be observed by HPLC analysis using HPLC protocol I.
[0137] For HPLC protocol I, the following parameters were used:
Hardware Pump: L-6250 Merck-Hitachi Autosampler: AS-2000A Merck-Hitachi Interface D-6000 Merck-Hitachi Module: Channel 1- L-7450A UV-Diode Array Detector: Merck-Hitachi Colunm Oven: none Column: 70 mm × 4 mm Adsorbent: Kromasil 100 Å-3.5 μ-C18 Gradient Mode: Low Pressure Limit: 5-300 bar Column ambient (≈20° C.) Temperature Solvent A: acetonitrile Solvent B: water Flow: 1.5 ml/min Detection: 243 nm Pump Table: 0.0 min 75% A 25% B linear gradient 7.0 min 100% A 0% B 9.0 min 100% A 0% B jump 9.1 min 75% A 25% B 12.0 min 75% A 25% B Stop time: 12 min Sampling every 200 msec Period: Retention time time References table: 2.12 min 4″-hydroxy- avermectin B1a 3.27 min avermectin B1a 3.77 min 3″-O-demethyl- 4″-keto- avermectin B1a 4.83 min 4″-keto- avermectin B1a
EXAMPLE III
Biotransformation With Cell-Free Extract From Streptomyces Strain R-922
[0138] To prepare an active cell-free extract from Streptomyces tubercidicus strain R-922 capable of regioselective oxidation of avermectin to 4″-keto-avermectin, the following solutions were made, stored at 4° C., and kept on ice when used.
Solution Formula PP-buffer 50 mM K 2 HPO 4 /KH 2 PO 4 (pH 7.0) Disruption buffer 50 mM K 2 HPO 4 /KH 2 PO 4 (pH 7.0), 5 mM benzamidine, 2 mM dithiothreitol, and 0.5 mM Pefabloc (from Roche Diagnostics) Substrate 10 mg avermectin were dissolved in 1 ml isopropanol
[0139] Six grams of wet cells from Streptomyces strain R-922 were washed in PP-buffer and then resuspended in 35 ml disruption buffer and disrupted in a French press at 4° C. The resulting suspension was centrifuged for 1 hour at 35000×g. The supernatant of the cell free extract was collected. One μl substrate was added to 499 μl of cleared cell free extract and incubated at 30° C. for 1 hour. Then, 1 ml methyl-t-butyl ether was added to the reaction mixture and thoroughly mixed. The mixture was next centrifuged for 2 min. at 14000 rpm, and the methyl-t-butyl ether phase was transferred into a 10 ml flask and evaporated in vacuo by means of a rotary evaporator. The residue was dissolved in 200 μl acetonitrile and transferred into an HPLC-sample vial.
[0140] For HPLC, the HPLC protocol I was used.
[0141] When 1 μt substrate was added to 499 μl of cleared cell free extract and incubated at 30° C., no conversion of avermectin to 4″-keto-avermectin was observed by HPLC analysis using HPLC protocol I.
[0142] However, the possibility of addition of spinach ferredoxin and spinach ferredoxin 5 reductase and NADPH to the cell free extract to restore the biocatalytic activity was explored (see, generally, D. E. Cane and E. I. Graziani, J. Amer. Chem. Soc. 120:2682, 1998).
[0143] Accordingly, the following solutions were made:
Solution Formula Substrate 10 mg avermectin were dissolved in 1 ml isopropanol Ferredoxin 5 mg ferredoxin (from spinach), solution 1-3 mg/ml in Tris/ HCl-buffer (from Fluka) or 5 mg ferredoxin (from Clostridium pasteurianum ), solution of 1-3 mg/ml in Tris/HCl-buffer (from Fluka) or 5 mg ferredoxin (from Porphyra umbilicalis ), solution of 1-3 mg/ml in Tris/HCl-buffer (from Fluka) Ferredoxin 1 mg freeze-dried ferredoxin reductase (from spinach), Reductase solution of 3.9 U/mg in 1 ml H 2 O (from Sigma) NADPH 100 mM NADPH in H 2 O (from Roche Diagnostics)
[0144] Thus, to 475 μl of cleared cell free extract the following solutions were added: 10 μl ferredoxin, 10 μl ferredoxin reductase and 1 μl substrate. After the addition of substrate to the cells, the mixture was immediately and thoroughly mixed and aerated. Then, 5 μl of NADPH were added and the mixture incubated at 30° C. for 30 min. Then, 1 ml methyl-t-butyl ether was added to the reaction mixture and thoroughly mixed. The mixture was next centrifuged for 2 min. at 14000 rpm, and the methyl-t-butyl ether phase was transferred into a 10 ml flask and evaporated in vacuo by means of a rotary evaporator. The residue was dissolved in 200 μl acetonitrile and transferred into an HPLC-sample vial, and HPLC analysis performed using HPLC protocol I.
[0145] Formation of 4″-keto-avermectin was observable by HPLC analysis. Thus, addition of spinach ferredoxin and spinach ferredoxin reductase and NADPH to the cell free extract restored the biocatalytic activity.
[0146] Upon injection of a 30 μl sample, a peak appeared at 4.83 min., indicating the presence of 4″-keto-avermectin B la. A mass of 870 D was assigned to this peak by HPLC-mass spectrometry which corresponds to the molecular weight of 4″-keto-avermectin B1a.
[0147] Note that when analyzing product formation by HPLC and HPLC-mass spectrometry, in addition to the 4″-keto-avermectin, the corresponding ketohydrate 4″-hydroxy-avermectin was also found giving a peak at 2.12 min. This finding indicated that the P450 monooxygenase converts avermectin by hydroxylation to 4″-hydroxy-avermectin, from which 4″-keto-avermectin is formed by dehydration. Interestingly, when the spinach ferredoxin was replaced by ferredoxin from the bacterium Clostridium pasteurianum or from the red alga Porphyra umbilicalis, the biocatalytic conversion of avermectin to 4″-keto-avermectin still took place, indicating that the enzyme does not depend on a specific ferredoxin for receiving reduction equivalents.
EXAMPLE IV
Isolation of a Mutant Streptomyces Strain R-922 With Enhanced Activity
[0148] To obtain strains of Streptomyces strain R-922 that have an enhanced ability to regioselectively oxidize avermectin to 4″-keto-avermectin, UV mutants were generated. To do this, spores of Streptomyces strain R-922 were collected and stored in 15% glycerol at −20° C. This stock solution contained 2×10 9 spores.
[0149] The spore stock solution was next diluted and transferred to petri plates containing 10 ml of sterile water, and the suspension was exposed to UV light in a Stratalinker UV crosslinker 2400 (commercially available from Stratagene, La Jolla, Calif.). The Stratalinker UV crosslinker uses a 254-nm light source and the amount of energy used to irradiate a sample can be set in the “energy mode.”
[0150] Through experimentation, it was determined that an exposure of 8000 microjoules of UV irradiation (254 nm) was required to kill 99.9% of the spores. This level of UV exposure was used in the mutagenesis.
[0151] Surviving UV-mutagenized spores were plated, cultured, and transferred to minimal media. Approximately 0.3-0.4% of the viable spores were determined to be auxotrophic, indicating a good level of mutagenesis in the population.
[0152] The mutagenized clones were screened for activity in the whole cell biocatalysis assay described in Example II. As shown in FIG. 2, one mutant (“R-922 UV mutant”) showed a two to three fold increase in an ability to regioselectively oxidize avermectin to 4″-keto-avermectin as compared to wild-type strain R-922. Although the gene encoding the P450 monooxygenase responsible for the regioselectively oxidation activity, ema1, is not mutated in the R-922 UV mutant, this mutant nonetheless provides an excellent source for a cell-free extract containing ema1 protein.
EXAMPLE V
Isolation of the P450 Monooxygenase from Streptomyces Strain R-922
[0153] To enrich the P450 enzyme, 35 ml of active cell free extract were filtered through a 45 μm filter and fractionated by anion exchange chromatography. Anion exchange chromatography conditions were as follows:
FPLC instrument: Ä kta prime (from Pharmacia Biotech) FPLC-column: HiTrap ™Q (5 ml) stacked onto Resource ®Q (6 ml) (from Pharmacia Biotech) eluents buffer A: 25 mM Tris/HCl (pH 7.5) buffer B: 25 mM Tris/HCl (pH 7.5) containing 1 M KCl temperature eluent bottles and fractions in ice bath, flow 3 ml/min detection UV 280 nm Pump table: 0.0 min 100% A 0% B linear gradient to 2.0 min 90% A 10% B 5.0 min 90% A 10% B linear gradient to 30.0 min 50% A 50% B linear gradient to 40.0 min 0% A 100% B 50.0 min 0% A 100% B
[0154] Enzyme activity eluted with 35%-40% buffer B. The active fractions were pooled and concentrated by centrifugal filtration through Biomax™ filters with an exclusion limit of 5 kD (commercially available from Millipore Corp., Bedford, Mass.) at 5000 rpm and then rediluted in disruption buffer containing 20% glycerol to a volume of 5 ml containing 3-10 mg/ml protein. This enriched enzyme solution contained at least 25% of the original enzyme activity.
[0155] The enzyme was further purified by size exclusion chromatography. Size exclusion chromatography conditions were as follows:
FPLC instrument: Ä kta prime (from Pharmacia Biotech) FPLC-column: HiLoad 26/60 Superdex ® 200 prep grade (from Pharmacia Biotech) sample: 3-5 ml enriched enzyme solution from the anion chromatography step sample preparation: filtered through 45 μm filter eluent buffer: PP-buffer (pH 7.0) + 0.1 M KCl temperature: 4° C. flow: 2 ml/min detection: UV 280 nm
[0156] Enzyme activity eluted between 205-235 ml eluent buffer. The active fractions were pooled, concentrated by centrifugal filtration through Biomax™ filters with an exclusion limit of 5 kD (from Millipore) at 5000 rpm, and rediluted in disruption buffer containing 20% glycerol to form a solution of 0.5-1 ml containing 2-5 mg/ml protein. This enriched enzyme solution contained 10% of the original enzyme activity. This enzyme preparation, when checked for purity by SDS page, (see, generally, Laemmli, U. K., Nature 227:680-685, 1970 and Current Protocols in Molecular Biology, supra) and stained with Coomassie blue, showed one dominant protein band with a molecular weight of 45-50 kD, according to reference proteins of known molecular weight.
EXAMPLE VI
Attempted Isolation of P450 Monooxygenase Genes From Streptomyces Strains R-922 and I-1529
[0157] Based on results described above that suggested the enzyme from strain R-922 that is responsible for the regiospecific oxidation of avermectin to 4″-keto-avermectin is a P450 monooxygenase, a direct PCR-based approach to clone P450 monooxygenase genes from this strain was initiated (see, generally, Hyun et al., J. Microbiol. Biotechnol. 8(3):295-299, 1998). This approach is based on the fact that all P450 monooxygenase enzymes contain highly conserved oxygen-binding and heme-binding domains that are also conserved at the vii, nucleotide level. PCR primers were designed to prime to these conserved domains and to amplify the DNA fragment from P450 genes using R-922 or I-1529 genomic DNA as a template. The PCR primers used are shown in Table 1.
TABLE 1 SEQ Degen- ID eracy NOs +TL,1 O 2 -Binding Domain Primers (5′ to 3′)* I A G H E T T 43 ATC GCS GGS CAC GAG ACS AC 8 44 V A G H E T T 45 GTS GCS GGS CAC GAG ACS AC 16 46 L A G H E T T 47 CTS GCS GGS CAC GAG ACS AC 16 48 L L L I A G H E T 49 TS CTS CTS ATC GCS GGS CAC GAG AC & 32 50 Heme-Binding Domain Primers (3′ to 5′)* H Q C L G Q N L A 51 GTG GTC ACG GAS CCS TGC TTG GAS CG & 8 52 F G H G V H Q C 53 AAG CCS GTG CCS CAS GTG GTC ACG 8 54 F G F G V H Q C 55 AAG GCS AAG CCS CAS GTG GTC ACG 8 56 F G H G I H Q C 57 AAG CCS GTG CCS TAG GTG GTC ACG 4 58 F G H G V H F C 59 AAG CCS GTG CCS CAS GTG AAG ACG 8 60
[0158] PCR amplification using any of the primers specific to nucleotide sequences encoding the O 2 -binding domain with any of the primers specific to nucleotide sequences encoding the heme-binding domain and genomic DNA from Streptomyces strains R-922 or I-1529 resulted in the amplification of an approximately 350 bp DNA fragment. This is exactly the size that would be expected from this PCR amplification due to the approximately 350 bp separation in P450 genes of the gene segments encoding the O 2 -binding and heme-binding sites.
[0159] The 350 bp PCR fragments were cloned into the pCR2.1-TOPO TA cloning plasmid (commercially available Invitrogen, Carlsbad, Calif.) and transformed into E. coli strain TOP10 (Invitrogen, Carlsbad, Calif.). Approximately 150 individual clones from strains R-922 and I-1529 were sequenced to determine how many unique P450 gene fragments were represented. Analysis of the sequences revealed that they included 8 unique P450 gene fragments from strain R-922 and 7 unique fragments from 1-1529.
[0160] Blast analysis (Altschul et al., J. Mol. Biol. 215:403-410, 1990) demonstrated that all of the unique P450 gene fragments from both the R-922 and I-1529 strains were derived from P450 genes and encoded the region between the O 2 -binding and heme-binding domains (see FIG. 3 for strain R-922 and FIG. 4 for strain I-1529).
[0161] Next, in order to clone the full-length genes from which the PCR fragments were derived, the DNA fragments cloned by PCR were used as hybridization probes to gene libraries containing genomic DNA from strains R-922 and I-1529. To do this, genomic DNA from the R-922 and I-1529 strains was partially digested with Sau3A I, dephosphorylated with calf intestinal alkaline phosphatase (CIP) and ligated into the cosmid plasmid pPEH215, a modified version of SuperCos 1 (commercially available from Stratagene, La Jolla, Calif.). Ligation products were packaged using the Gigapack III XL packaging extract and transfected into E. coli XL1 Blue MR host cells. Twelve cosmids that strongly hybridized to the PCR-generated P450 gene fragments were identified from the R-922 library, from which three unique P-450 genes were subcloned and sequenced. The hybridizations were performed at high stringency conditions according to the protocol of Church and Gilbert (Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995, 1984). In brief, these high stringency conditions include Hybrid Buffer containing 500 mM Na-phosphate, 1 mM EDTA, 7% SDS, 1% BSA; Wash Buffer 1 containing 40 mM Na-phosphate, 1 mM EDTA, 5% SDS, 0.5% BSA; and Wash Buffer 2 containing 40 mM Na-phosphate, 1 mM EDTA, 1% SDS (Note that other high stringency hybridizations conditions are described, for example, in Current Protocols in Molecular Biology, supra.) Nineteen strongly hybridizing cosmids were identified from the I-1529 library, and from these, four unique P-450 genes were subcloned and sequenced.
[0162] In yet a further approach to isolate diverse P450 monooxygenase genes from strains R-922 and I-1529, a known P450 gene from another bacterium was used as a hybridization probe to identify cosmid clones containing homologous P450 genes from strains R-922 and I-1529. The epoF P450 gene from Sorangium cellulosum strain So ce90 that is involved in the synthesis of epothilones (Molnar et al., Chem Biol. 7(2):97-109, 2000) was used as a probe in this effort. Using the epoF P450 gene probe, one cosmid was identified from strain R-922 (clone LC), and three were identified from strain I-1529 (clones LA, LB, and EA). In each case, the homologous gene fragment was subcloned and sequenced, and found to code for P450 monooxygenase enzymes.
[0163] However, a comparison of the 17 peptide sequences identified in Example VII (below) failed to match any of these cloned genes. Two of the peptide sequences (namely, LVKDDPALLPR (SEQ ID NO: 70) and AVHELMR (SEQ ID NO: 76)) mapped to the region between the O 2 and heme binding domains, and so these should have identified any of the partial gene fragments derived by the PCR approach. Thus, the standard approaches based on the known PCR technique of Hyun et al., supra, and using known P450 genes as hybridization probes failed to identify the gene that encodes the specific P450 monooxygenase responsible for the regioselective oxidation of avermectin. Accordingly, it was determined that additional experimentation was required to isolate the gene encoding the P450 monooxygenase of the invention.
EXAMPLE VII
Partial Sequencing of the P450 Monooxygenase from Streptomyces Strain R-922
[0164] Partial amino acid sequencing of the P450 monooxygenase from Streptomyces strain R-922 was carried out by the Friedrich Miescher Institute, Basel Switzerland. The protein of the dominant band on the SDS page was tryptically digested and the formed peptides separated and sequenced by mass spectrometry and Edman degradation (see, generally, Zerbe-Burkhardt et al., J. Biol. Chem. 273:6508, 1998). The sequence of the following 17 peptides were found:
Sequence Sequence I.D. No. HPGEPNVMDPALITDPFTGYGALR (SEQ ID NO:61) FVNNPASPSLNYAPEDNPLTR (SEQ ID NO:62) LLTHYPDISLGIAPEHLER (SEQ ID NO:63) VYLLGSILNYDAPDHTR (SEQ ID NO:64) TWGADLISMDPDR (SEQ ID NO:65) EALTDDLLSELIR (SEQ ID NO:66) FMDDSPVWLVTR (SEQ ID NO:67) LMEMLGLPEHLR (SEQ ID NO:68) VEQIADALLAR (SEQ ID NO:69) LVKDDPALLPR (SEQ ID NO:70) DDPALLPR (SEQ ID NO:71) TPLPGNWR (SEQ ID NO:72) LNSLPVR (SEQ ID NO:73) ITDLRPR (SEQ ID NO:74) EQGPVVR (SEQ ID NO:75) AVHELMR (SEQ ID NO:76) AFTAR (SEQ ID NO:77) FEEVR (SEQ ID NO:78)
[0165] Alignment of these peptides to a selection of actinomycete P450 monooxygenase sequences indicated that all the peptides were fragments of a single P450 mono-oxygenase.
EXAMPLE VIII
Cloning the P450 Monooxygenase Gene from Strain R-922 that Encodes the Enzyme Responsible for the Oxidation of Avermectin to 4″-Keto-Avermectin
[0166] PCR primers were designed by reverse translation from the amino acid sequences of several of the peptides derived from the P450 enzyme of strain R-922 (see Example VII and Table 2 below). Each of five forward primers (2aF, 2bF, 3F, 1F, and 7F) was paired with one reverse primer (5R) in PCR reactions with R-922 genomic DNA as a template. In each reaction, a DNA fragment of the expected size was produced.
TABLE 2 Expected Primer sequence and the amino acid size Primer sequence to which they were designed* Degeneracy (bp)** SEQ ID NO: 2aF P G E D N V M 64 600 79 5′-CCS GGS GAR CCS AAY GTS ATG-3′ 80 2bF A L I T D P F 32 580 81 5′-GCS CTS ATY ACS GAC CCS TTC-3′ 82 3F F M D D S P V W 32 549 83 5′-TTC ATG GAC GAC WSS CCS GTS TGG-3′ 84 1F L N Y D A P D H 32 350 85 5′-CTS AAY TAY GAC GCS CCS GAC CAC-3′ 86 7F V E Q I A D A L 32 300 87 5′-GTS GAR CAG ATY GCS GAC GCS CTS-3′ 88 5R D L I S M D P D 64 — 89 3′-CTG GAS TAR WSS TAC CTG GGS CTG-5′ 90
[0167] The 580 and 600 bp PCR fragments generated by using primers (2bF and 5R) and (2aF and 5R), respectively, were cloned into the pCR-Blunt II -TOPO cloning plasmid (commercially available from Invitrogen, Carlsbad, Calif.) and transformed into E. coli strain TOP10 (Invitrogen, Carlsbad, Calif.). The inserted DNA fragments were then sequenced. Examination of the sequences revealed that the 600 and 580 bp fragments were identical in the 580 bp of sequence that they have in common. Also, there was a perfect match between the deduced amino acid sequence derived from the nucleotide sequence of the 600 bp and 580 bp fragments and the amino acid sequences of peptides isolated from the purified P450 Ema1 enzyme that aligned in this region of the isolated gene (see FIG. 5). This result strongly suggested that the gene fragments isolated in these clones are derived from the gene that encodes the P450 Ema1 enzyme that is responsible for the oxidation of avermectin to 4″-keto-avermectin.
[0168] The 600 bp PCR fragment produced using primers 2aF (SEQ ID NO: 80) and 5R(SEQ ID No: 90) was used as a hybridization probe to a cosmid library of genomic DNA isolated from strain R-922 (cosmid library described in Example VI). Two cosmids named pPEH249 and pPEH250 were identified that hybridized strongly with the probe. The portion of each cosmid encoding the P450 enzyme was sequenced and the sequences were found to be identical between the two cosmids. The complete coding sequence of the ema1 gene was identified (SEQ ID NO: 1). The amino acid sequence of all peptide fragments from P450 Ema1 matched perfectly with the deduced amino acid sequence from the ema1 gene (see FIG. 5). Comparison of the deduced amino acid sequence of the protein encoded by the ema1 gene using BLASTP (Altschul et al., supra) determined that the closest match in the databases is to a P450 monooxygenase from S. thermotolerans that has a role in the biosynthesis of carbomycin (Arisawa et al., Biosci. Biotech. Biochem. 59(4):582-588, 1995) and whose identity with ema1 is only 49% (Identities=202/409 (49%), Positives=271/409 (65%), Gaps=2/409 (0%)). In the Blast analysis, the following settings were employed:
BLASTP 2.0.10 Lambda K H 0.322 0.140 0.428 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 375001765 Number of Sequences: 1271323 Number of extensions: 16451653 Number of successful extensions: 46738 Number of sequences better than 10.0: 2211 Number of HSP's better than 10.0 without gapping: 628 Number of HSP's successfully gapped in prelim test: 1583 Number of HSP's that attempted gapping in prelim test: 43251 Number of HSP's gapped (non-prelim): 2577 length of query: 430 length of database: 409,691,007 effective HSP length: 55 effective length of query: 375 effective length of database: 339,768,242 effective search space: 127413090750 effective search space used: 127413090750
[0169] A similar comparison of the nucleotide sequences of these two genes demonstrated that they are 65% identical at the nucleotide level. These results demonstrate that P450 Ema1 is a new enzyme.
EXAMPLE IX
Heterologous Expression of the ema1 Gene in Strentomyces lividans Strain ZX7
[0170] The coding sequence of the ema1 gene was fused to the thiostrepton-inducible promoter (tipA) (Murakami et al., J. Bacteriol. 171:1459-1466, 1989). The tipA promoter was derived from plasmid pSIT151 (Herron and Evans, FEMS Microbiology Letters 171:215-221, 1999).
[0171] The fusion of the tipA promoter and the ema1 coding sequence was achieved by first amplifying the ema1 coding sequence with the following primers to introduce a PacI cloning site at the 5′ end and a PmeI compatible end on the 3′ end.
[0172] Forward Primer: The underlined sequence is a PacI recognition sequence; the sequence in bold-face type is the start of the coding sequence of ema1.
[0173] Reverse Primer: The underlined sequence is half of a PmeI recognition sequence; the bold-face type sequence is the reverse complement of the ema1 translation stop codon followed by the 3′ end of the ema1 coding sequence.
(SEQ ID NO:92) 5′-AAACTCACCCCAACCGCACCGGCAGCGAGTTC-3″
[0174] The PacI-digested PCR fragment containing the ema1 coding sequence was cloned into plasmid pTBBKA (see FIG. 7) that was restricted (i.e., digested) with PacI and PmeI, and the ligated plasmid transformed into E. coli. Four clones were sequenced. Three of the four contained the complete and correct ema1 coding sequence. The fourth ema1 gene clone contained a truncated version of the ema1 gene. The full-length ema1 gene encodes a protein that begins with the amino acid sequence MSELMNS (SEQ ID NO: 93). The truncated gene encodes a protein that lacks the first 4 amino acids and begins with the second methionine residue. This gene has been named ema1A. The nucleotide and amino acid sequence of ema1A are provided as SEQ ID NO: 33 and SEQ ID NO: 34, respectively. The ema1 and ema1A genes in these plasmids, pTBBKA-ema1 and pTBBKA-ema1A, are in the correct juxtaposition with the tipA promoter to cause expression of the genes from this promoter.
[0175] Plasmid pTBBKA contains a gene from the Streptomyces insertion element IS117 that encodes an integrase that catalyzes site-specific integration of the plasmid into the chromosome of Streptomyces species (Henderson et al., Mol. Microbiol. 3:1307-1318, 1989 and Lydiate et al., Mol. Gen. Genet. 203:79-88, 1986). Since plasmid pTBBKA has only an E. coli replication origin and contains a mobilization site, it can be transferred from E. coli to Streptomyces strains by conjugation where it will not replicate. However, it is able to integrate into the chromosome due to the IS 117 integrase and Streptomyces clones containing chromosomal integrations can be selected by resistance to kanamycin due to the plasmid-borne kanamycin resistance gene.
[0176] The ema1 coding sequence was also cloned into other plasmids that are either replicative in Streptomyces or, like pTBBKA, integrate into the chromosome upon introduction into a Streptomyces host. For example, ema1 was cloned into plasmid pEAA, which is similar to plasmid pTBBKA but the KpnI/PacI fragment containing the tipA promoter was replaced with the ermE gene promoter (Schmitt-John and Engels, Appl Microbiol Biotechnol. 36(4):493-498, 1992). In addition, pEAA does not contain the kanamycin resistance gene. The ema1 gene was cloned into pEAA as a PacI/PmeI fragment to create plasmid pEAA-ema1 in which the ema1 gene is expressed from the constitutive ermE promoter.
[0177] Plasmid pTUA1A is a Streptomyces- E.coli shuttle plasmid (see FIG. 8) that contains the tipA promoter. The ema1 gene was also cloned into the PacI/PmeI sites in plasmid pTUA1A to create plasmid pTUA-ema1.
[0178] The ema1 A gene fragment was also ligated as a PacI/PmeI fragment into plasmids pTUA1A, and pEAA in the same way as the ema1 gene fragment to create plasmids pTUA-ema1A, and pEAA-ema1 A, respectively.
[0179] The pTBBKA, pTUA1A, and pEAA-based plasmids containing the ema1 or ema1A genes were introduced into S. lividans ZX7 and in each case transformants were obtained and verified ( S. lividans l strains ZX 7::pTBBKA-ema1 or ema1A, ZX7 (pTUA-ema1 or -ema1 A), and ZX7::pEAA-ema1 or -ema1A, respectively).
[0180] Wild-type Streptomyces lividans strain ZX7 was tested and found to be incapable of the oxidation of avermectin to 4″-keto-avermectin. Transformed S. lividans strains ZX7::pTBBKA-ema1, ZX7::pTBBKA-ema1A, ZX7 (pTUA-ema1), ZX7 (pTUA-ema1A), ZX7::pEAA-ema1, and ZX7::pEAA-ema1A were each tested for the ability to oxidize avermectin to 4″-keto-avermectin using resting cells. To do this, the whole cell biocatalysis assay described above (including analysis method) was performed. Note that for the whole cell biocatalysis assay, transformed Streptomyces lividans, like strain R-922, was grown in PHG medium and, again like strain R-922, had a reaction time of 16 hours (i.e., during which time the 500 mg transformed Streptomyces lividans wet cells in 10 ml of 50 mM potassium phosphate buffer, pH 7.0, were shaken at 160 rpm at 28° C. in the presence of 15 μl of a solution of avermectin in isopropanol (30 mg/ml)).
[0181] In the presence of the inducer, thiostrepton (5 ug/ml), the ema1- or ema1A-containing strains ZX7::pTBBKA-ema1, ZX7::pTBBKA-ema1A, ZX7 (pTUA-ema1), ZX7 (pTUA-ema1A) were found to oxidize avermectin to 4″-keto-avermectin as evidenced by the appearance of the oxidized 4″-keto-avermectin compound (see Table 3).
TABLE 3 % Conversion of Avermectin Strain 2 hour 16 hour Streptomyces lividans ZX7 + Plasmid 1 None 0 0 pTBBKA-ema1A 0.5 ± 0.059 1.17 ± 0.112 pTBBKA-ema1 0.21 ± 0.0.356 0.65 ± 0.079 pTUA-ema1 20.96 ± 1.044 42.0 ± 2.5 pEAA-ema1 3.0 ± 0.232 24.1 ± 0.358 pTBBKA-ema2 4.79 ± 0.096 9.57 ± 0.423 pTUA-ema2 0.77 ± 0.138 2.05 ± 0.537 pEAA-ema2 0.0 1.73 ± 3.00 pTBBKA-ema1/fd233 8.89 ± 0.720 30.99 ± 0.880 pTUA-ema1/fd233 23.29 ± 0.854 61.2 ± 3.548 pEAA-ema1/fd233 8.26 ± 0.845 10.66 ± 0.858 pTUA-ema2/fd233 1.85 ± 0.861 6.40 ± 1.918 Pseudomonas putida S12 + Plasmid None 0 pRK-ema1 ND 2 18 pRK-ema1/fd233 ND 32
[0182] These results conclusively demonstrate that the P450 Ema1 enzyme encoded by the ema1 gene is responsible for the oxidation of avermectin to 4″-keto-avermectin in S. tubercidicus strain R-922. Furthermore, the data demonstrates that the ema1A gene that is 4 amino acids shorter on the N-terminus than the native ema1 gene also encodes an active P450 Ema1 enzyme.
[0183] As can be seen in FIG. 9, oxidation of avermectin to 4″-keto-avermectin by S. lividans strain ZX7::pTBBKA-ema1, as detected by HPLC analysis, is variable depending upon the amount of thiostrepton used to induce expression of ema1. Note that S. lividans strains ZX7::pEAA-ema1 and ZX7::pEAA-ema1A (see Table 3) demonstrated this oxidation activity in the absence of thiostrepton since in these strains the ema1 or ema1A genes are expressed from the ermE promoter that does not require induction.
EXAMPLE X
Isolation of an ema1-Homolosous Gene From Streptomyces tubercidicus Strain I-1529
[0184] Streptomyces tubercidicus strain I-1529 was also found to be active in biocatalysis of avermectin to form the 4″-keto-avermectin derivative. The cosmid library from strain I-1529, described in Example VI, was probed at the high stringency conditions of Church and Gilbert (Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995, 1984) with the 600 bp ema1 PCR fragment produced using primers 2aF (SEQ ID NO: 80) and 5R (SEQ ID NO: 90) described previously to identify clones containing the ema1 homolog from strain I-1529. Three strongly hybridizing cosmids were identified. The P450 gene regions in two of the cosmids, pPEH252 and pPEH253, were sequenced and found to be identical. Analysis of the DNA sequence revealed the presence of a gene with high homology to the ema1 gene of strain R-922. FIG. 6 shows a comparison of the deduced amino acid sequence of Ema2 (i.e., P450 Ema2 ), Ema1 (i.e., P450 Ema1 ), and a P450 monooxygenase from Streptomyces thermotolerans that is involved in the biosynthesis of carbomycin (Carb-450) (GenBank Accession No. D30759).
[0185] The gene from Streptomyces tubercidicus strain I-1529, named ema2, encodes an enzyme with 90% identity at the amino acid level and 90.6% identity at the nucleotide level to the P450 Ema1 enzyme. The nucleotide sequence of the ema2 gene and the deduced amino acid sequence of P450 Ema2 are provided in SEQ ID NO: 3 and SEQ ID NO: 4, respectively.
[0186] The ema2 coding sequence was cloned in the same manner as the ema1 and ema1A genes into plasmids pTBBKA, pTUA1A, and pEAA such that the coding sequence was functionally fused to the tipA or ermE promoter in these plasmids. The resulting plasmids, pTBBKA-ema2, pTUA-ema2, and pEAA-ema2 were transferred from E. coli to S. lividans ZX7 by conjugation to create strains ZX7::TBBKA-ema2 and ZX7 (pTUA-ema2), and ZX7::pEAA-ema2 containing the ema2 gene integrated into the chromosome or maintained on a plasmid.
[0187] Strains ZX7::TBBKA-ema2, ZX7 (pTUA-ema2), and ZX7::pEAA-ema2 were next tested for the ability to oxidize avermectin to 4″-keto-avermectin. The ema2 gene was also shown to provide biocatalysis activity, although at a lower level compared to the ema1 gene (see Table 3).
[0188] These results demonstrate that the ema2 gene from S. tubercidicus strain I-1529 also encodes a P450 enzyme (P450 Ema2 ) capable of oxidizing avermectin to 4″-keto-avermectin.
EXAMPLE XI
Characterization of ema1 Homologs From Other Biocatalysis Strains
[0189] Seventeen Streptomyces sp. strains, including strains R-922 and I-1529, were identified that are capable of catalyzing the regiospecific oxidation of the 4″-carbinol of avermectin to a ketone. Next, the isolation and characterization of the genes encoding the biocatalysis enzyme from all of these strains was accomplished.
[0190] To do this, genomic DNA was isolated from the strains and was evaluated by restriction with several restriction endonucleases and Southern hybridization with the ema1 gene. A specific restriction endonuclease was identified for each DNA that would generate a single DNA fragment of a defined size to which the ema1 gene hybridizes. For each strain, there was only one strongly hybridizing DNA fragment, thus suggesting that other P450 genes were not detected under the high stringency hybridization conditions used in these experiments. Each DNA was digested with the appropriate restriction endonuclease, and the DNA was subjected to agarose gel electrophoresis. DNA in a narrow size range that included the size of the ema1-hybridizing fragment was excised from the gel. The size-selected DNA was ligated into an appropriate cloning plasmid and this ligated plasmid was used to transform E. coli. The E. coli clones from each experiment were screened by colony hybridization with the ema1 gene fragment to identify clones containing the ema1-homologous DNA fragment.
[0191] The nucleotide sequence of the cloned DNA in each ema1-homologous clone was determined and examined for the presence of a gene encoding a P450 enzyme with homology to ema1. In this way, ema1-homologous genes were isolated from 14 of the 15 other active strains. The nucleotide and deduced amino acid sequences of these are referenced in Table 4 as SEQ ID NOS: 5-32 and 94-95. A diagram of the relationship of these enzymes in the form of a phylogenetic tree is shown in FIG. 10. This phylogenetic tree was generated using the commercially available GCG Wisconsin software program version 1.0 (Madison, Wis.).
TABLE 4 SEQ ID NO (nucleotide and amino acid, Strain Number Gene Classification respectively) R-0922 ema1 Strept. tubercidicus 1 and 2 I-1529 ema2 Strept. tubercidicus 3 and 4 1053 ema3 Streptomyces rimosus 5 and 6 R-0401 ema4 Streptomyces lydicus 7 and 8 I-1525 ema5 Streptomyces sp. 9 and 10 DSM-40241 ema6 Strept. chattanoogensis * 11 and 12 IHS-0435 ema7 Streptomyces sp. 13 and 14 C-00083 ema8 Streptomyces albofaciens 15 and 16 MAAG-7479 ema9 Streptomyces platensis 17 and 18 A/96-1208710 ema10 Strept. kasugaensis 19 and 20 R-2374 ema11 Streptomyces rimosus 21 and 22 MAAG-7027 ema12 Strept. tubercidicus 23 and 24 Tue-3077 ema13 Streptomyces platensis 25 and 26 I-1548 ema14 Streptomyces platensis 27 and 28 NRRL-2433 ema15 Strept. lydicus 29 and 30 MAAG-0114 ema16 Streptomyces lydicus 31 and 32 DSM-40261 ema17 Streptomyces tubercidicus 94 and 95
EXAMPLE XII
Construction of His-tagged ema1 and ema1 Homologs to Facilitate Enzyme Purification
[0192] In order to purify the P450 Ema1 enzyme and the P450 enzymes encoded by the ema1 homologs from other biocatalysis strains, each of the P450 genes was cloned into the E. coli expression plasmid pET-28b(+) (commercially available from Novagen, Madison, Wis.). The pET-28 plasmids are designed to facilitate His-tag fusions at either the N-, or C-terminus and to provide strong expression of the genes in E. coli from the T7 phage promoter. In many cases, the coding sequence of the ema genes begins with the sequence ATGT. These genes were amplified by PCR such that the primers on the 5′ end incorporated a PciI recognition site (5′ ATATGT 3′) at the 5′ terminus. The last four bases of the PciI site correspond to the ATGT at the beginning of the ema gene coding sequence.
[0193] PCR primers at the 3′ end of the genes were designed to remove the translation stop codon at the end of the ema gene coding sequence and to add an XhoI recognition site to the 3′ terminus. The resulting PCR fragments were restricted with PciI and XhoI to generate PciI ends at the 5′ termini and XhoI ends at the 3′ termini, thereby facilitating cloning of the fragments into pET-28b(+) previously restricted with NcoI and XhoI. Since PciI and NcoI ends are compatible, the fragments were cloned into pET-28b(+) in the proper orientation to the T7 promoter and ribosome binding site in the plasmid to provide expression of the genes.
[0194] At the 3′ end of each ema gene, the coding sequence was fused in frame at the XhoI site to the His-tag sequence followed by a translation stop codon. This results in the production of an Ema enzyme with six histidine residues added to the C-terminus to facilitate purification on nickel columns.
[0195] In the case of ema genes in which the ATG translation initiation codon is not followed by a T nucleotide, the ema genes were amplified by PCR using a different strategy for the 5′ end. The primers at the 5′ end were designed to incorporate a C immediately preceding the ATG translation initiation codon and the primers at the 3′ end were the same as described above. The PCR fragments that were amplified were restricted with XhoI to create an XhoI end at the 3′ -terminus and the 5′ end was left as a blunt end. These fragments were cloned into pET-28b(+) that had been restricted with NcoI, but the NcoI ends were made blunt-ended by treatment with mung bean exonuclease, and restricted with XhoI.
[0196] In this manner, the ema genes were cloned into pET-28b(+) to create a functional fusion with the T7 promoter and the His-tag at the C-terminus as described previously. All His-tagged ema genes were sequenced to ensure that no errors were introduced by PCR.
[0197] Large amounts of the His-tagged P450 Ema1 and P450 Ema2 enzymes were isolated and purified by standard protocols. E. coli strain BL21 DE3 (commercially available from Invitrogen; Carlsbad, Calif.) containing the T7 RNA polymerase gene under the control of the inducible tac promoter and the appropriate pET-28/ema plasmid was cultured and the cells were harvested and lysed. The lysates were applied to Ni-NTA columns (commercially available from Qiagen Inc., Valencia, Calif.) and the protein was purified according to the procedure recommended by the manufacturer.
[0198] Purified His-tagged P450 Ema1 and P450 Ema2 were highly active in in vitro activity assays as evidenced by a high rate of conversion of avermectin to 4″-keto-avermectin.
EXAMPLE XIII
Expression of ema1 in Pseudomonas
[0199] The ema1 gene constructs were next introduced into P. putida (wildtype P. putida commercially available from the American Type Culture Collection, Manassas, Va.; ATCC Nos. 700801 and 17453). The ema1 and ema1/fd233 gene fragments were cloned as PacI/PmeI fragments into the plasmid pUK21 (Viera and Messing, Gene 100:189-194, 1991). The fragments were cloned into a position located between the tac promoter (P tac ) and terminator (T tac ) on pUK21 in the proper orientation for expression from the tac promoter. The P tac-ema 1-T tac and P tac -ema1/fd233-T tac gene fragments were removed from pUK21 as Bg1II fragments and these were cloned into the broad host-range, transmissible plasmid, pRK290 (Ditta et al., Proc. Natl. Acad. Sci. USA 77:7347-7351, 1980) to create plasmids pRK-ema1 and pRK-ema1/fd233 (FIG. 11). These plasmids were introduced into P. putida strains ATCC 700801 and ATCC 17453 by conjugal transfer from E. coli hosts by standard methodology (see, e.g., Ditta et al., Proc. Natl. Acad. Sci. USA 77:7347-7351, 1980).
[0200] P. putida ATCC 700801 and ATCC 17453 containing plasmids pRK-ema1 or pRK-ema1/fd233 were tested for the ability to catalyze the oxidation of avermectin. The results shown in Table 3 demonstrate that these strains are able to catalyze this reaction.
EXAMPLE XIV
Identification of Genes Encoding Ferredoxins That Are Active With the P450 Ema1 Monooxygenase
[0201] P450 monooxygenases require two electrons for each hydroxylation reaction catalyzed (Mueller et al., “Twenty-five years of P450 cam research: Mechanistic Insights into Oxygenase Catalysis.” Cytochrome P 450, 2 nd Edition, P. R. Ortiz de Montellano (ed.), pp. 83-124; Plenum Press, NY 1995). These electrons are transferred to the P450 monooxygenase one at a time by a ferredoxin. The electrons are ultimately derived from NAD(P)H and are passed to the ferredoxin by a ferredoxin reductase. Specific P-450 monooxygenase enzymes have a higher activity when they interact with a specific ferredoxin. In many cases, the gene encoding a ferredoxin that interacts specifically with a given P450 monooxygenase is located adjacent to the gene encoding the P450 enzyme.
[0202] As described above, in addition to the ema1 gene, four P450 genes from strain R-922 and seven P450 genes from strain I-1529 (see Example VI) were isolated and sequenced. In some of these, there was sufficient sequence information about the DNA flanking the P-450 genes to look for the presence of associated ferredoxin genes. By this approach, two unique ferredoxin genes were identified from each of the two strains. Ferredoxin genes fd229 and fd230 were identified from strain R-922, and fd233 and fdEA were identified from strain I-1529. In addition, a ferredoxin reductase gene was found to reside adjacent to the fdEA gene from strain I-1529. In order to test the biological activity of each of these ferredoxins in combination with P450 Ema1 , each individual ferredoxin gene was amplified by PCR to produce a gene fragment that included a blunt 5′-end, the native ribosome-binding site and ferredoxin gene coding sequence, and a PmeI restriction site on the 3′-end. Each such ferredoxin gene fragment was cloned into the PmeI site located 3′ to the ema1 gene in plasmid pTUA-ema1. In this way, artificial operons consisting of the ema1 gene and one of the ferredoxin genes operably linked to a functional promoter were created.
[0203] In the case of the fdEA ferredoxin gene in which a ferredoxin reductase gene, freEA, was found to be located adjacent to the fdEA gene, a DNA fragment containing both the fdEA and freEA genes was generated by a similar PCR strategy. This gene fragment was also cloned in the PmeI site of plasmid pTUA-ema1 as described for the other ferredoxin genes.
[0204] Each ema1-ferredoxin gene combination was tested for biological activity by introduction of the individual ema1-ferredoxin gene plasmids into S. lividans strain ZX7. The biocatalysis activity derived from each plasmid in S. lividans was determined. Of the four different constructs, only the ferredoxin gene fd233 derived from strain I-1529 provided increased activity when compared to the expression of ema1 alone in the same plasmid and host background (see Table 3). The pTUA-ema1/fd233 plasmid in S. lividans gave approximately 1.5 to 3 fold higher activity compared to the pTUA-ema1 plasmid. The other three plasmids containing the other ferredoxin genes provided results essentially the same as the plasmid with only the ema1 gene. Likewise, the pTUA-ema1/fdEA/freEA plasmid did not yield results different from those of pTUA-ema1. The nucleotide and deduced amino acid sequences of the fd233 gene are shown in SEQ ID NOs: 35 and 36, respectively.
[0205] A BLAST analysis of the nucelotide and amino acid sequences of fd233 revealed that the closest matches were to ferredoxins from S. coelicolor (GenBank Accession AL445945) and S. lividans (GenBank Accession AF072709). At the nucleotide level, fd233 shares 80 and 79.8% identity with the ferredoxin genes from S. coelicolor and S. lividans, respectively. At the peptide level, fd233 shares 79.4 and 77.8% identity with the ferredoxins from S. coelicolor and S. lividans, respectively.
[0206] Since fd233 is derived from strain I-1529 and ema1 is from strain R-922, the proteins encoded by the two genes cannot interact with each other in nature. In an approach designed to identify a ferredoxin gene from strain R-922 that is homologous to the fd233 gene and that might encode a ferredoxin that interacts optimally with the P450 Ema1 , the fd233 gene was used as a hybridization probe to a gene library of DNA from strain R-922. A strongly hybridizing cosmid, pPEH232, was identified and the hybridizing DNA was cloned and sequenced. Comparison of the deduced amino acid sequences from fd233 and the ferredoxin gene on cosmid pPEH232,fd232, revealed that they differed in only a single amino acid.
[0207] In a similar manner, plasmid pTUA-ema1-fd232 was constructed and tested in S. lividans ZX7. This plasmid gave similar results as those obtained with plasmid pTUA-ema1-fd233 (see Table 3). The nucleotide and deduced amino acid sequences of fd232 are shown in SEQ ID NOs: 37 and 38, respectively.
[0208] The ema1-fd233 operon was also subcloned, as a PacI-PmeI fragment, into pTBBKA and pEAA that had been digested with the same restriction enzymes. S. lividans ZX7::pTBBKA-ema1-fd233, and S. lividans ZX7::pEAA-ema1-fd233 were tested in the avermectin conversion assay and found to have higher activities than the strains harboring the ema1 gene alone in the comparable plasmids (see Table 3).
EXAMPLE XV
Heterologous Expression of P450 Ema1 and P450 Ema2 in Other Cells
[0209] The expression constructs pRK-ema1 (Example XIII) and pRK-ema2 (created in a way analogous to that described in Example XIII for pRK-ema1) were mobilized by conjugation into three fluorescent soil Pseudomonas strains. Conjugation was performed according to standard methods (see, e.g., Ditta et al., Proc. Natl. Acad. Sci. USA 77:7347-7351, 1980). The strains were: P. fluorescens MOCG134, P. fluorescens Pf-5, and P. fluorescens CHAO. Standard resting cell assays for the conversion of avermectin to 4″-ketoavermectin were conducted for each of the transconjugants. For strains Pf-5 and CHAO, the levels of conversion were below the detection limit. Strain MOCG134 yielded 3% conversion for ema1 and 5% for ema2.
[0210] In addition, the constructs listed in the Table 5 were introduced into Streptomyces avermitilis MOS-0001 by protoplast-mediated transformation (see, e.g., Kieser, T.; Bibb, M. J.; Buttner, M. J.; Chater, K. F.; Hopwood, D. A. (eds.): Practical Streptomyces Genetics. The John Innes Foundation, Norwich (England), 2000); Stutzman-Engwall, K. et al. (1999) Streptomyces avermitilis gene directing the ratio of B 2 :B 1 avermectins, WO 99/41389).
TABLE 5 Construct % Conversion of avermectin, 16 hrs None 0 pTBBKA-ema1 10.90 +/− 3.48 pTUA-ema1 5.326 +/− 2.19 pEAA-ema1 6.74 +/− 0.08 pTBBKA-ema1A/fd233 28.50 +/− 0.20 pTUA-ema1A/fd233 23.97 +/− 5.95
[0211] Wild-type Str. avermitilis MOS-0001 was tested and found to be incapable of the oxidation of avermectin to 4″-ketoavermectin.
[0212] Transformed S. avermitilis strains MOS-0001::pTBBKA-ema1, MOS-0001 (pTUA-ema1), MOS-0001::pEAA-ema1, MOS-0001::pTBBKA-ema1A-fd233, and MOS-0001 (pTUA-ema1A-fd233) were each tested for their ability to oxidize avermectin to 4″-keto-avermectin using resting cells. To do this, the whole cell biocatalysis assay described above (including analysis method) was performed. Note that for the whole cell biocatalysis assay, transformed Streptomyces avermitilis, like strain R-922, was grown in PHG medium and, again like strain R-922, had a reaction time of 16 hours (i.e., during which time the 500 mg transformed Streptomyces avermitilis wet cells in 10 ml of 50 mM potassium phosphate buffer, pH 7.0, were shaken at 160 rpm at 28° C. in the presence of 15 μl of a solution of avermectin in isopropanol (30 mg/ml)).
[0213] As shown in Table 5, in the presence of the inducer, thiostrepton (5 μg/ml), the ema1- or ema1A-fd233-containing strains MOS-0001::pTBBKA-ema1, MOS-0001::pTBBKA-ema1A-fd233, MOS-0001 (pTUA-ema1), MOS-0001 (pTUA-ema1A-fd233) were found to oxidize avermectin to 4″-keto-avermectin as evidenced by the appearance of the oxidized 4″-keto-avermectin compound. Note that the S. avermitilis strain MOS-0001::pEAA-ema1 demonstrated this oxidation activity in the absence of thiostrepton since in this strain the ema1 gene is expressed from the ermE promoter that does not require induction.
[0214] Thus, expression of the ema1 P450 monooxygenase gene in various Streptomyces and Pseudomonas strains provided recombinant cells that were able to convert avermectin to 4″-ketoavermectin in resting cell assays.
[0215] Next, expression and activity of P450 Ema1 monooxygenase was tested in E. coli. To do this, the ema1 gene was cloned into the E. coli expression plasmid pET-28b(+) (commercially available from Novagen, Madison, Wis.) as described previously. E. coli strain BL21 DE3 (commercially available from Invitrogen; Carlsbad, Calif.) that contains the T7 RNA polymerase gene under control of the inducible tac promoter and the pET-28/ema1 plasmid was cultured in 50 ml LB medium containing 5 mg/l kanamycin in a 250-ml flask with one baffle, for 16 hours at 37° C., with shaking at 130 rpm. 0.5 ml of this culture was used to inoculate 500 ml LB medium with 5 mg/l kanamycin in a 2-liter flask with one baffle, and the culture was incubated for 4 hours at 37° C. followed by 4 hours and 30° C., with shaking at 130 rpm throughout. The cells were harvested by centrifugation, washed in 50 mM potassium phosphate buffer, and centrifuged again.
[0216] For the resting cell assays, 90 mg wet cells were weighed into deep-well plates in triplicate and resuspended in 0.5 ml 50 mM potassium phosphate buffer. For cell-free extracts, 4 grams wet cells in 8 ml disruption buffer were disrupted in French press.
[0217] For the resting cell assays, 5 μl of substrate (2.5 mg/ml in 2-propanol) was added to the cell suspension. The plate was sealed with air permeable foil, and the reaction was incubated on an orbital shaker at 1000 rpm at 28° C. for 22 hours. No conversion of avermectin to 4″-ketoavermectin was detected.
[0218] For the cell-free assays, 100 μl cell free extract, 1 μl substrate solution (20 mg/ml) in 2-propanol, 5 μl 100 mM NADPH, 10 μl ferredoxin, 10 μl ferredoxin reductase, and 374 μl potassium phosphate buffer pH 7.0 were added as described in Example III, and the assay was incubated at 30° C. with shaking at 600 rpm for 20 hours. 9.2% +/−0.3% of avermectin was converted to 4″-ketoavermectin.
[0219] Thus, expression of the ema1 gene in E. coli resulted in the production of the active Ema1 P450 monooxygenase enzyme which, when purified from the cells, was able to convert avermectin to 4″-ketoavermectin.
EXAMPLE XVI
Identification and Cloning of Genes Encoding Ferredoxin Reductases that Support Increased Activity of the P450 Ema1 Monooxygenase
[0220] The electron transport pathway that supports the activity of P450 monooxygenases also includes ferredoxin reductases. These proteins donate electrons to the ferredoxin and, as is the case with ferredoxins and P450 monooxygenases, specific ferredoxin reductases are known to be better electron donors for certain ferredoxins than others.
[0221] Accordingly, a number of ferredoxin reductase genes from Streptomyces strains were cloned and were evaluated for their impacts on the biocatalysis reaction. To do this, numerous bacterial ferredoxin reductase (Fre) protein sequences were retrieved from NCBI and aligned with the program Pretty from the GCG package. Two conserved regions, approximately 266 amino acid residues apart, were used to make degenerate oligonucleotides for PCR. The forward primer (CGSCCSCCSCTSWSSAAS (SEQ ID NO: 96; where “S” is C or G; and “W” is A or G)) and the reverse primer (SASSGCSTTSBCCCARTGYTC (SEQ ID NO: 97; where “S” is C or G; “B” is C, G, or T; “R” is A or G; and “Y” is C or T)) were used to amplify 800 bp products from the biocatalytically active Streptomyces strains R-922 and I-1529. These pools of products were cloned into TOPO TA cloning vectors (commercially available from Invitrogen Inc., Carlsbad, Calif.), and 20 clones each from R922 and I-1529 were sequenced according to standard methods (see, e.g., Current Protocols in Molecular Biology, eds. Ausubel et al., John Wiley & Sons, Inc. 2000). Sequencing revealed that 4 unique fre gene fragments were isolated from the strains: three from (fre3,fre12,fre14) and one from I-1529 (fre16). The fre3,fre12,fre14, and fre16 gene fragments were used as probes to identify full-length ferredoxin reductases from genomic clone banks of Streptomyces strains R922 and I-1529. By this approach, the complete coding sequence of each of the 4 different fre genes was cloned and sequenced. The nucleic acid and amino acid sequences are provided as follows:fre3 (SEQ ID NOs: 98 and 99);fre12 (SEQ ID NOs: 100 and 101);fre14 (SEQ ID NOs: 102 and 103); and fre16 (SEQ ID NOs: 104 and 105).
[0222] In order to assess the biological activity of each fre gene in relation to the activity of Ema1, each gene was inserted into the ema1/fd233 operon described above, 3′ to the fd233 gene. This resulted in the formation of artificial operons consisting of the ema1,fd233, and individual fre genes that were expressed from the same promoter. The ema1/fd233/fre operons were cloned into the Pseudomonas plasmid pRK290 and introduced into 3 different P. putida strains. These strains were then analysed for Ema1 biocatalysis activity using the whole cell assay and one of the genes, the fre gene fre16 from strain I-1529, was found to increase the activity of P450 Ema1 monooxygenase by approximately 2-fold. This effect was strain specific, as it was seen only in one of the P. putida strains, ATCC Desposit No. 17453, and not in the other two. In P. putida strain ATCC 17453, the presence of fre gene fre16 resulted in 44% conversion of avermectin to 4″-keto-avermectin, as compared to 23% without this gene. The other fre genes had no impact on the biocatalysis activity in any of the P. putida strains tested.
[0223] In a similar approach, each of the ema1/fd233/fre operons were cloned into the Streptomyces plasmids pTUA, pTBBKA, and pEAA, and introduced into S. lividans strain ZX7. In each case there was no impact in S. lividans by any of the fre genes on biocatalysis activity.
EQUIVALENTS
[0224] Those skilled in the art will recognize, or be able to ascertain, using no more than routine experimentation, numerous equivalents to the specific substances and procedures described herein. Such equivalents are considered to be within the scope of this invention.
1
105
1
1293
DNA
Streptomyces tubercidicus
1
atgtcggaat taatgaactc tccgttcgcc gcgcacgtcg ggaaacaccc gggcgagccg 60
aatgtgatgg accccgccct gatcaccgac ccgttcaccg gctacggcgc gctgcgtgag 120
cagggcccgg tcgtacgggg ccggttcatg gacgactcgc ccgtctggct ggtgacgcgg 180
ttcgaggagg tccgccaggt cctgcgcgac cagcggttcg tgaacaatcc ggcctcgccg 240
tccctgaact acgcgcccga ggacaacccg ctgacccggc tgatggagat gctgggcctc 300
cccgagcacc tccgcgtcta cctgctcgga tcgatcctca actacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgggcg ttcacggccc gcaagatcac cgacctgcgg 420
ccccgggtcg agcagatcgc cgacgcgctg ctggcccggc tgcccgagca cgccgaggac 480
ggcgtcgtcg acctcatcca gcacttcgcc taccccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgaa cgtggggcgc cgacctcatc 600
tcgatggatc cggaccggct cggcgcctcg ttcccggcga tgatcgagca catccatcag 660
atggtccggg aacggcgcga ggcgctcacc gacgacctgc tcagcgaact gatccgcacc 720
catgacgacg acggcgggcg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780
gtcctcgccg gccacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg tctggtcaag gacgatccgg ccctcctccc ccgtgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cacatgaccc agctgcgcta cgccaccgcc 960
gacgtcgacc tcgccggcac accgatccgc cagggcgatg ccgttcaact catcctggta 1020
tcggccaact tcgacccccg tcactacacc gaccccgacc gcctcgatct cacccggcac 1080
cccgcgggcc acgccgagaa ccatgtgggt ttcggccatg gagcgcacta ctgcctgggc 1140
gccacactcg ccaaacagga aggtgaagtc gccttcggca aactgctcac gcactacccg 1200
gacatatcgc tgggcatcgc cccggaacac ctggagcgga caccgctgcc gggcaactgg 1260
cggctgaact cgctgccggt gcggttgggg tga 1293
2
430
PRT
Streptomyces tubercidicus
2
Met Ser Glu Leu Met Asn Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Thr Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ser Pro
65 70 75 80
Ser Leu Asn Tyr Ala Pro Glu Asp Asn Pro Leu Thr Arg Leu Met Glu
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly
195 200 205
Ala Ser Phe Pro Ala Met Ile Glu His Ile His Gln Met Val Arg Glu
210 215 220
Arg Arg Glu Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Val Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Thr Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Pro Ile Arg Gln Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Thr His Tyr Pro
385 390 395 400
Asp Ile Ser Leu Gly Ile Ala Pro Glu His Leu Glu Arg Thr Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Val Arg Leu Gly
420 425 430
3
1293
DNA
Streptomyces tubercidicus
3
atgtcggcat tatccagctc tccgttcgct gcgcatgtcg ggaaacaccc gggtgagccg 60
aatgtgatgg agccggcgct gctcaccgac ccgttcgcgg gctacggcgc gctgcgtgag 120
caggccccgg tcgtacgggg ccggttcgtg gacgactcac cggtctggtt cgtgacgcgc 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggccgcgccg 240
cccctggccc catcggccga ggagaacccg ctgaccaggc tgatggacat gctgggcctc 300
cccgagcacc tccgcgtcta catgctcggg tcgattctca actacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgcgcg ttcacggcgc ggaagatcac cgatctgcga 420
ccgcgtgtcg agcagatcgc cgacgagctg ctggcccgcc tccccgagta cgccgaggac 480
ggcgtcgtcg acctcatcca gcatttcgcc tacccgctgc cgatcaccgt catctgcgag 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgga agtggggcgc cgacctcatc 600
tcgatggacc cggaccggct cggcgcaacg ttcccggcga tgatcgagca catccatgag 660
atggtccggg agcggcgcgc ggcgctcacc gatgatctgc tcagcgagct gatccgtacc 720
catgacgacg atggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780
gtcctcgccg gtcacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc ccgggccgtc 900
catgaactga tgcgctggtg cgggccggtg cagatgacgc agctgcgcta cgcggccgcc 960
gacgtcgacc tcgccggtac gcggatccac aagggcgacg ccgtacaact cctcctggtt 1020
gcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgatct gacgcgtcac 1080
cccgccggcc acgccgagaa ccatgtgggt ttcggccacg gtgcgcatta ctgcctgggt 1140
gccaccctcg ccaagcagga gggcgaagtc gcgttcggca agctgctcgc gcactacccg 1200
gagatgtccc tgggcatcga accggaacgt ctggagcgat tgccgctgcc tggcaactgg 1260
cggctgaatt ccctgccgtt gcggctgggg tga 1293
4
430
PRT
Streptomyces tubercidicus
4
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Leu Thr Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Ala Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Pro Leu Ala Pro Ser Ala Glu Glu Asn Pro Leu Thr Arg Leu Met Asp
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Met Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu Tyr Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly
195 200 205
Ala Thr Phe Pro Ala Met Ile Glu His Ile His Glu Met Val Arg Glu
210 215 220
Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ala Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Arg Ile His Lys Gly Asp Ala Val Gln
325 330 335
Leu Leu Leu Val Ala Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Glu Met Ser Leu Gly Ile Glu Pro Glu Arg Leu Glu Arg Leu Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Leu Arg Leu Gly
420 425 430
5
1413
DNA
Streptomyces rimosus
5
atgaccacat cgcccaccga gtcccgggcg gccaccccgc ccgactccac cgcctccccc 60
tcgaccgctt ccgccccggc caccacccct tcggccgccg cctctccgga caccaccgac 120
cgcaccacgc tcccctccta cgtcggcctc cacccgggcg agccgaacct gatggaaccg 180
gagctgctgg agaacccgta caccggctac ggcacgctgc gcgagcaggc cccgctcgtc 240
cgcgcccggt tcatcgacga ctcgcccatc tggctggtga cccgcttcga cgtggtgcgc 300
gaggtgatgc gtgaccagcg gttcgtcaac aacccgaccc tggtgcccgg catcggcgcg 360
gacaaggacc cgcgtgcccg gctgatcgag ctgttcggca tccccgagga cctggccccg 420
tacctcaccg acaacatcct caccagcgac ccgccggacc acacccggct gcgccgcctg 480
gtctcccgcg ccttcaccgc acgccgtatc caggacctgc ggccgcgcgt cgagcggatc 540
accgacgagc tgctggaacg gctgccggac catgccgagg acggcgtcgt cgacctcgtc 600
gagcacttcg cctacccgct gcccatcacg gtcatctgcg agctggtcgg catcgacgag 660
gaggatcggg cgctgtggcg gcggttcggc gccgacctcg cctcgctgaa ccccaagcgc 720
atcggcgcca ccatgccgga gatgatctcg cacatccacg agctgatcga cgaacggcgc 780
gcggccctgc gggacgacct gctcagcggg ctcatccggg cgcaggacga cgacggcggc 840
cggctgagcg acgtcgagat ggtcaccctg gtcctgaccc tggtactggc cggtcacgag 900
accaccgccc acctcatcag caacggcacc ctcgccctgc tcacccaccc cgaccagcgg 960
cggctgatcg acgaggaccc ggcgctgctg ccgcgcgcgg tccacgagct gatgcgctgg 1020
tgcgggccga tccaggccac ccagcttcgg tacgccctgg aggacaccga ggtggccgga 1080
gtccaggtcc gccagggcga ggccctgatg ttcagcctcg tcgcggccaa ccacgacccg 1140
cgccactaca ccgggccgga gcggctcgac ctgacgcggc agccggccgg ccgcgccgag 1200
gaccacgtcg gcttcggcca cggcatgcac tactgcctgg gtgcctcact cgcccggcag 1260
gaggccgagg tggcctacgg gaagctgctc acccgctacc cggacctggc gctcgccctc 1320
accccggaac agttggagga ccaggaacgc ctgcggcagc ccggcacctg gcgcctgcga 1380
cggctgccgc tgaggctgca cgcgcagagc tga 1413
6
470
PRT
Streptomyces rimosus
6
Met Thr Thr Ser Pro Thr Glu Ser Arg Ala Ala Thr Pro Pro Asp Ser
1 5 10 15
Thr Ala Ser Pro Ser Thr Ala Ser Ala Pro Ala Thr Thr Pro Ser Ala
20 25 30
Ala Ala Ser Pro Asp Thr Thr Asp Arg Thr Thr Leu Pro Ser Tyr Val
35 40 45
Gly Leu His Pro Gly Glu Pro Asn Leu Met Glu Pro Glu Leu Leu Glu
50 55 60
Asn Pro Tyr Thr Gly Tyr Gly Thr Leu Arg Glu Gln Ala Pro Leu Val
65 70 75 80
Arg Ala Arg Phe Ile Asp Asp Ser Pro Ile Trp Leu Val Thr Arg Phe
85 90 95
Asp Val Val Arg Glu Val Met Arg Asp Gln Arg Phe Val Asn Asn Pro
100 105 110
Thr Leu Val Pro Gly Ile Gly Ala Asp Lys Asp Pro Arg Ala Arg Leu
115 120 125
Ile Glu Leu Phe Gly Ile Pro Glu Asp Leu Ala Pro Tyr Leu Thr Asp
130 135 140
Asn Ile Leu Thr Ser Asp Pro Pro Asp His Thr Arg Leu Arg Arg Leu
145 150 155 160
Val Ser Arg Ala Phe Thr Ala Arg Arg Ile Gln Asp Leu Arg Pro Arg
165 170 175
Val Glu Arg Ile Thr Asp Glu Leu Leu Glu Arg Leu Pro Asp His Ala
180 185 190
Glu Asp Gly Val Val Asp Leu Val Glu His Phe Ala Tyr Pro Leu Pro
195 200 205
Ile Thr Val Ile Cys Glu Leu Val Gly Ile Asp Glu Glu Asp Arg Ala
210 215 220
Leu Trp Arg Arg Phe Gly Ala Asp Leu Ala Ser Leu Asn Pro Lys Arg
225 230 235 240
Ile Gly Ala Thr Met Pro Glu Met Ile Ser His Ile His Glu Leu Ile
245 250 255
Asp Glu Arg Arg Ala Ala Leu Arg Asp Asp Leu Leu Ser Gly Leu Ile
260 265 270
Arg Ala Gln Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val
275 280 285
Thr Leu Val Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His
290 295 300
Leu Ile Ser Asn Gly Thr Leu Ala Leu Leu Thr His Pro Asp Gln Arg
305 310 315 320
Arg Leu Ile Asp Glu Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu
325 330 335
Leu Met Arg Trp Cys Gly Pro Ile Gln Ala Thr Gln Leu Arg Tyr Ala
340 345 350
Leu Glu Asp Thr Glu Val Ala Gly Val Gln Val Arg Gln Gly Glu Ala
355 360 365
Leu Met Phe Ser Leu Val Ala Ala Asn His Asp Pro Arg His Tyr Thr
370 375 380
Gly Pro Glu Arg Leu Asp Leu Thr Arg Gln Pro Ala Gly Arg Ala Glu
385 390 395 400
Asp His Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Ser
405 410 415
Leu Ala Arg Gln Glu Ala Glu Val Ala Tyr Gly Lys Leu Leu Thr Arg
420 425 430
Tyr Pro Asp Leu Ala Leu Ala Leu Thr Pro Glu Gln Leu Glu Asp Gln
435 440 445
Glu Arg Leu Arg Gln Pro Gly Thr Trp Arg Leu Arg Arg Leu Pro Leu
450 455 460
Arg Leu His Ala Gln Ser
465 470
7
1293
DNA
Streptomyces lydicus
7
atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60
aacgtgatgg atccggcgct gatcggggat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240
gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgatctgc tcagcgagct gatccggacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgacccgcg ccactacacc gaccccgacc ggctggacct gacccggcac 1080
cctgccggcc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggaggcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaacg cgctgccgct gcgtctgcgc tga 1293
8
430
PRT
Streptomyces lydicus
8
Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala
65 70 75 80
Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Thr
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Thr Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Glu Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Arg
420 425 430
9
1299
DNA
Streptomyces
9
atgtcagcct tatccagctc tccgttcgcc gagcacatag ggaaacaccc gggcgagccg 60
aacgtgatgg aaccggctct gatcaacgat ccgttcggcg gctacggcgc gctgcgcgag 120
caggggccgg ttgtgcgtgg ccggttcatg gacgactcgc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggcgtcgccg 240
ctcctgggca gtcaggtcga ggagatgccg atggtcaagc tgctggagca gatgggcctc 300
cccgagcacc ttcgggtcta tctgctcgga tcgatcctca acagtgacgc ccccgatcac 360
acccggcttc gccgcctcgt ctcgcgggcc ttcaccgcac gtaagatcac cggtctgcgg 420
ccgcgcgtcg agcagatcgc cgacgagctg ctggcccggc tccccgagca cgccgaggac 480
ggcgtcgtcg acctcatcca gcacttcgcc tacccgctgc cgatcacggt catctgcgaa 540
ctggtcggca tacccgaagc cgatcgcccg caatggcgcg catggggcgc cgacctcgtg 600
tcactggagc cggacaagct cagcacgtcg ttcccggcga tgatcgacca cacccatgaa 660
ctgatccgcc aacggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgtgcc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt gttcgctctc 780
gtcttcgccg gtcacgagac caccgcccac ctcataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc gcgtgccgtc 900
catgagctga tgcgctggtg cgggccggtg cacatgaccc agttgcgtta cgcctccgag 960
gacatcgacc tcgccggtac gccgatccgg aagggcgacg ccgtccaact catcctggta 1020
tcggcgaact tcgacccccg ccactacagc gaccccgatc gcctcgacct gacccgtcac 1080
cccgcaggcc acgccgagaa ccacgtgggc ttcggccacg ggatgcacta ctgcttgggc 1140
gccgcgctcg ccaggcagga aggcgaagtg gcgttcggca aactgctcgc gcactacccg 1200
gacgtagcgc tgggcgtcga accggaagcc ctggagcggg tgccgatgcc cggcagttgg 1260
cggctgaatt ccttgccgct gcggttggcg aagcgctaa 1299
10
432
PRT
Streptomyces
10
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Glu His Ile Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Ile Asn Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ser Pro
65 70 75 80
Leu Leu Gly Ser Gln Val Glu Glu Met Pro Met Val Lys Leu Leu Glu
85 90 95
Gln Met Gly Leu Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Ser Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Gly Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Ala Trp Gly Ala Asp Leu Val Ser Leu Glu Pro Asp Lys Leu Ser
195 200 205
Thr Ser Phe Pro Ala Met Ile Asp His Thr His Glu Leu Ile Arg Gln
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Ala
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Phe Ala Leu Val Phe Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ser Glu
305 310 315 320
Asp Ile Asp Leu Ala Gly Thr Pro Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Ser Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Ala Leu Ala
370 375 380
Arg Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Asp Val Ala Leu Gly Val Glu Pro Glu Ala Leu Glu Arg Val Pro Met
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ser Leu Pro Leu Arg Leu Ala Lys Arg
420 425 430
11
1293
DNA
Streptomyces chattanoogenesis
11
atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60
aacgtgatgg atccggcgct gatcggtgat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240
gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgacctgc tcagcgagct gatccggacc 720
catgacgacg acggcagcag gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cactgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgacccgcg ccactacacc gaccccgacc gtctggacct gacccggcac 1080
cccgccggtc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggacgcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaacg cgctgccgct gcgtctgggc tga 1293
12
430
PRT
Streptomyces chattanoogenesis
12
Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala
65 70 75 80
Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Thr
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Thr Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Asp Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Gly
420 425 430
13
1290
DNA
Streptomyces
13
atgaccgaat tagcggactc ccccttcagc gagcacgtcg gcaaacaccc cggcgagccg 60
aacgtgatgg aaccggccct gctcaccgat ccgttcaccg gctacggcga actgcgcgaa 120
cagggcccgg tggtccgcgg ccggttcgcg gacgacaccc ccgtgtggtt catcacccgc 180
ttcgaggagg cccgcgaggt gctgcgcgac caccggttcg ccaatgcccc cgccttcgcg 240
gcgggaggtg gaagcggtga cacaccctcc aaccggctga tggaaatcat gggcctgccc 300
gagcactacc gggtgtacct cgccaacacc atcctcacca tggacgcccc cgaccacacc 360
cggatccggc gattggtctc ccgggcattc accgcccgta agatcaccga tctgcgaccc 420
cgggtggagg acatcgcgga cgatctgctg aggcggctgc ccgagcacgc cgaggacggc 480
gtcgtcgacc tcatcaagca ctacgcctat ccgctgccca taacggtcat ctgcgaactg 540
gtgggaattc cggaggaaga ccgactgcag tggcgggatt gggggtccgc gttcgtctcc 600
ctgcaaccgg atcggctcag caaagcgttc ccggcgatga tcgaacacat tcacgcgctg 660
atccgcgaac ggcgcgcggc gctcaccgac gatctgctca gcgaactgat ccgggtccat 720
gacgacgacg gcggccgact cagcgacgtc gaaatggtca cgatggtcct gaccctcgtt 780
ctcgccggtc atgagaccac cgcccatctc atcggcaacg gcactgccgc gcttctcacc 840
caccccgacc agctgcacct gctgaaatcc gatccggagc tgctcccacg cgccgtgcac 900
gagctgatgc gctggtgcgg accggtgcag atgacgcagt tgcggtacgc caccgaggac 960
gtcgaggtgg ccggggtgca ggtcaagcag ggcgaagcgg tgctggccat gctggtcgcg 1020
gcgaaccacg acccccgcca cttcgccgac cccgcccggc tcgacctcac ccgccagccg 1080
gcgggccggg ccgagaacca cgtcggtttc ggccacggca tgcactactg cctgggcgcc 1140
agcctggccc gccaggaggg cgaggtcgcc ttcgggaacc tgctcgcgca ctacccggac 1200
gtgtcgctgg cggtggaacc ggacgccctc cagcgggtcc cgctgccggg caactggcgg 1260
ctggccgcac tgccggtccg gctgcgctga 1290
14
429
PRT
Streptomyces
14
Met Thr Glu Leu Ala Asp Ser Pro Phe Ser Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Leu Thr Asp Pro Phe
20 25 30
Thr Gly Tyr Gly Glu Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Ala Asp Asp Thr Pro Val Trp Phe Ile Thr Arg Phe Glu Glu Ala
50 55 60
Arg Glu Val Leu Arg Asp His Arg Phe Ala Asn Ala Pro Ala Phe Ala
65 70 75 80
Ala Gly Gly Gly Ser Gly Asp Thr Pro Ser Asn Arg Leu Met Glu Ile
85 90 95
Met Gly Leu Pro Glu His Tyr Arg Val Tyr Leu Ala Asn Thr Ile Leu
100 105 110
Thr Met Asp Ala Pro Asp His Thr Arg Ile Arg Arg Leu Val Ser Arg
115 120 125
Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu Asp
130 135 140
Ile Ala Asp Asp Leu Leu Arg Arg Leu Pro Glu His Ala Glu Asp Gly
145 150 155 160
Val Val Asp Leu Ile Lys His Tyr Ala Tyr Pro Leu Pro Ile Thr Val
165 170 175
Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Leu Gln Trp Arg
180 185 190
Asp Trp Gly Ser Ala Phe Val Ser Leu Gln Pro Asp Arg Leu Ser Lys
195 200 205
Ala Phe Pro Ala Met Ile Glu His Ile His Ala Leu Ile Arg Glu Arg
210 215 220
Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Val His
225 230 235 240
Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met Val
245 250 255
Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile Gly
260 265 270
Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu His Leu Leu
275 280 285
Lys Ser Asp Pro Glu Leu Leu Pro Arg Ala Val His Glu Leu Met Arg
290 295 300
Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Thr Glu Asp
305 310 315 320
Val Glu Val Ala Gly Val Gln Val Lys Gln Gly Glu Ala Val Leu Ala
325 330 335
Met Leu Val Ala Ala Asn His Asp Pro Arg His Phe Ala Asp Pro Ala
340 345 350
Arg Leu Asp Leu Thr Arg Gln Pro Ala Gly Arg Ala Glu Asn His Val
355 360 365
Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Ser Leu Ala Arg
370 375 380
Gln Glu Gly Glu Val Ala Phe Gly Asn Leu Leu Ala His Tyr Pro Asp
385 390 395 400
Val Ser Leu Ala Val Glu Pro Asp Ala Leu Gln Arg Val Pro Leu Pro
405 410 415
Gly Asn Trp Arg Leu Ala Ala Leu Pro Val Arg Leu Arg
420 425
15
1428
DNA
Streptomyces albofaciens
15
atgaccacat cgcccaccga gtcccgggcg gccaccccgc ccgactccac cgcctccccc 60
tcgaccgctg ccgccccggc caccacccct tcggccgccg cctctccgga caccacctct 120
cccgccacca ccgaccgcac cacgctcccc tcctacgtcg gcctccaccc gggcgagccg 180
aacctgatgg aaccggagct gctggacaac ccgtacaccg gctacggcac gctgcgcgag 240
caggcgccgc tcgtccgcgc ccggttcatc gacgactcgc ccatctggct ggtgacccgc 300
ttcgacgtgg tgcgcgaggt gatgcgcgac cagcggttcg tcaacaaccc gaccctggtg 360
cccggcatcg gtgcggacca ggacccgcgc gcccggctga tcgagctgtt cggcatcccc 420
gaggacctgg ccccgtacct caccgacacc atcctcacca gcgacccgcc ggaccacacc 480
cggctgcgcc gcctggtctc ccgtgccttc accgcacgcc gtatccagga cctgcggccg 540
cgcgtcgagc ggatcaccga cgagctgctg gcgcggctgc cggaccatgc cgaggacggc 600
gtcgtcgacc tcgtcgagca cttcgcctac ccgctgccca tcacggtcat ctgcgaactg 660
gtcggcatcg acgaggagga ccgggcgctg tggcggcggt tcggcgccga cctcgcctcg 720
ctgaacccca agcgcatcgg cgccaccatg ccggagatga tcgcgcacat ccacgaggtg 780
atcgacgagc ggcgtgcgga cctgcgggac gacctgctca gcgggctcat ccgggcgcag 840
gacgacgacg gcggccggct gagcgacgtc gagatggtca cgctggtgct gaccctggtg 900
ctggccggtc acgagaccac cgcccacctc atcagcaacg gcaccctcgc cctgctcacc 960
caccccgacc agcggcggct gatcgacgag gacccggcgc tgctgccgcg cgcggtccac 1020
gagctgatgc gctggtgcgg gccgatccag gccacccagc tgcggtacgc catggaggac 1080
accgaggtgg ccggtgtcca ggtccgccag ggcgaggccc tgatgttcag cctcgtcgcg 1140
gccaaccacg acccgcgcca ctacaccggc ccggagcggc tcgacctgac gcggcagccg 1200
gccggccgcg ccgaggacca cgtcggcttc gggcacggga tgcactactg cctgggtgcc 1260
tcactggccc ggcaggaggc cgaggtggcg tacggcaagc tgctcacccg ctacccggac 1320
ctggcgctcg cgctcacccc ggaacagctg gaggaccagg aacgcctgcg gcagcccggc 1380
acctggcgcc tgcgacggct gccgctgagg ttgcacgcgg agagctga 1428
16
475
PRT
Streptomyces albofaciens
16
Met Thr Thr Ser Pro Thr Glu Ser Arg Ala Ala Thr Pro Pro Asp Ser
1 5 10 15
Thr Ala Ser Pro Ser Thr Ala Ala Ala Pro Ala Thr Thr Pro Ser Ala
20 25 30
Ala Ala Ser Pro Asp Thr Thr Ser Pro Ala Thr Thr Asp Arg Thr Thr
35 40 45
Leu Pro Ser Tyr Val Gly Leu His Pro Gly Glu Pro Asn Leu Met Glu
50 55 60
Pro Glu Leu Leu Asp Asn Pro Tyr Thr Gly Tyr Gly Thr Leu Arg Glu
65 70 75 80
Gln Ala Pro Leu Val Arg Ala Arg Phe Ile Asp Asp Ser Pro Ile Trp
85 90 95
Leu Val Thr Arg Phe Asp Val Val Arg Glu Val Met Arg Asp Gln Arg
100 105 110
Phe Val Asn Asn Pro Thr Leu Val Pro Gly Ile Gly Ala Asp Gln Asp
115 120 125
Pro Arg Ala Arg Leu Ile Glu Leu Phe Gly Ile Pro Glu Asp Leu Ala
130 135 140
Pro Tyr Leu Thr Asp Thr Ile Leu Thr Ser Asp Pro Pro Asp His Thr
145 150 155 160
Arg Leu Arg Arg Leu Val Ser Arg Ala Phe Thr Ala Arg Arg Ile Gln
165 170 175
Asp Leu Arg Pro Arg Val Glu Arg Ile Thr Asp Glu Leu Leu Ala Arg
180 185 190
Leu Pro Asp His Ala Glu Asp Gly Val Val Asp Leu Val Glu His Phe
195 200 205
Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys Glu Leu Val Gly Ile Asp
210 215 220
Glu Glu Asp Arg Ala Leu Trp Arg Arg Phe Gly Ala Asp Leu Ala Ser
225 230 235 240
Leu Asn Pro Lys Arg Ile Gly Ala Thr Met Pro Glu Met Ile Ala His
245 250 255
Ile His Glu Val Ile Asp Glu Arg Arg Ala Asp Leu Arg Asp Asp Leu
260 265 270
Leu Ser Gly Leu Ile Arg Ala Gln Asp Asp Asp Gly Gly Arg Leu Ser
275 280 285
Asp Val Glu Met Val Thr Leu Val Leu Thr Leu Val Leu Ala Gly His
290 295 300
Glu Thr Thr Ala His Leu Ile Ser Asn Gly Thr Leu Ala Leu Leu Thr
305 310 315 320
His Pro Asp Gln Arg Arg Leu Ile Asp Glu Asp Pro Ala Leu Leu Pro
325 330 335
Arg Ala Val His Glu Leu Met Arg Trp Cys Gly Pro Ile Gln Ala Thr
340 345 350
Gln Leu Arg Tyr Ala Met Glu Asp Thr Glu Val Ala Gly Val Gln Val
355 360 365
Arg Gln Gly Glu Ala Leu Met Phe Ser Leu Val Ala Ala Asn His Asp
370 375 380
Pro Arg His Tyr Thr Gly Pro Glu Arg Leu Asp Leu Thr Arg Gln Pro
385 390 395 400
Ala Gly Arg Ala Glu Asp His Val Gly Phe Gly His Gly Met His Tyr
405 410 415
Cys Leu Gly Ala Ser Leu Ala Arg Gln Glu Ala Glu Val Ala Tyr Gly
420 425 430
Lys Leu Leu Thr Arg Tyr Pro Asp Leu Ala Leu Ala Leu Thr Pro Glu
435 440 445
Gln Leu Glu Asp Gln Glu Arg Leu Arg Gln Pro Gly Thr Trp Arg Leu
450 455 460
Arg Arg Leu Pro Leu Arg Leu His Ala Glu Ser
465 470 475
17
1293
DNA
Streptomyces
17
atgtcggcat tacccacctc accgttcgct gcacacgtcg ggaaacaccc gggcgagccg 60
aatgtgatgg acccggcact gatcaccgac ccgttcaccg gctacggcgc gctgcgcgag 120
cagggcccgg tcgtccgcgg ccgcttcgtg gacgactcac ccgtctggct ggtgacgcga 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaaccc ggcggcgccc 240
tccctgggcc acgcggccga ggacaacccg ctcaccaggc tgatggacat gctgggcctc 300
cccgagcacc tccgccccta cctcctcgga tcgattctca attacgacgc ccccgaccac 360
acccggctgc gccgcctggt gtcgcgggcc ttcaccgccc gcaagatcac cgacctgcgg 420
ccgcgggtcg agcagatcgc cgacgccctg ctggcccggc tgcccgagca cgccgaggac 480
ggcgtcgtcg atctcatccg gcacttcgcc tacccgctgc cgatcaccgt catctgcgaa 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgga cgtggggcgc cgacctcgtc 600
tcgatggagc cggaccggct caccgcctcg ttcccgccga tgatcgagca catccaccgg 660
atggtccggg agcggcgcgg cgcgctcacc ggcgatctgc tcagcgagct gatccgtgcc 720
catgacgacg acggcggccg gctcagcgac gtcgagatgg tcaccttgat cctcacgctc 780
gtcctcgccg gtcacgagac caccgctcac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accaactgcg cctgctccag gacgacccgg ccctgctccc ccgtgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cagatgaccc agctgcgtta cgccgccgcc 960
gacgtcgacc tggccggcac cacgatccac cggggcgacg ccgtccaact catcctggtg 1020
tcggcgaact tcgacccccg ccactacacc gaccccgacc gcctcgatct gacccgccac 1080
cccgcgggac atgcggagaa ccatgtgggt ttcggccatg gggcgcacta ctgcctgggc 1140
gccacactcg ccaagcagga gggcgaagtc gccttcggca aactgctcgc gcactacccg 1200
gagatggcgt tgggcgtcgc accggagcgc ctggagcgga cgcccctgcc gggcaactgg 1260
cggctgaacg cgctgccggt gcggttgggg tga 1293
18
430
PRT
Streptomyces
18
Met Ser Ala Leu Pro Thr Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Thr Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Ser Leu Gly His Ala Ala Glu Asp Asn Pro Leu Thr Arg Leu Met Asp
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Pro Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Arg His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Met Glu Pro Asp Arg Leu Thr
195 200 205
Ala Ser Phe Pro Pro Met Ile Glu His Ile His Arg Met Val Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Gly Asp Leu Leu Ser Glu Leu Ile Arg Ala
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Leu
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Gln Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ala Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Thr Ile His Arg Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Glu Met Ala Leu Gly Val Ala Pro Glu Arg Leu Glu Arg Thr Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ala Leu Pro Val Arg Leu Gly
420 425 430
19
1293
DNA
Streptomyces kasugaensis
19
atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60
aacgtgatgg atccggcgct gatcggggat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240
gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgatctgc tcagcgagct gatccggacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgacccgcg ccactacacc gaccccgacc ggctggacct gacccggcac 1080
cctgccggcc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggaggcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaacg cgctgccgct gcgtctgcgc tga 1293
20
430
PRT
Streptomyces kasugaensis
20
Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala
65 70 75 80
Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Thr
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Thr Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Glu Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Arg
420 425 430
21
1428
DNA
Streptomyces
21
atgaccacat cgcccaccga gtcccgggcg gccaccccga ccggctccac cgcctccccc 60
tcgaccgctt ccgccccggc caccacccct tcggccgcca cctcttcgga caccacctat 120
cccgccacca ccgaccgcac cacgctcccc tcctacgtcg gcctccaccc gggcgagccg 180
aacctgatgg aaccggagct gctggacaac ccgtacaccg gctacggcac gctgcgcgag 240
caggccccgc tcgtccgtgc ccggttcatc gacgactcgc ccatctggct ggtgacccgc 300
ttcgacgtgg tgcgcgaggt gatgcgcgac cagcggttcg tcaacaaccc gaccctggtg 360
cccggcatcg gtgcggacaa ggacccgcgc gcccggctga tcgagctgtt cggcatcccc 420
gaggacctga ccccgtacct cgccgacacc atcctcacca gcgacccgcc ggaccacacc 480
cggctgcgcc gcctggtctc ccgtgccttc accgcgcgcc gcatccagga cctgcggccg 540
cgcgtcgagc agatcaccga cgcgctgctg gagcgactgc cggaccatgc cgaggacggc 600
gtcgtcgacc tcgtcgagca cttcgcctac ccgctgccca tcacggtcat ctgcgagctg 660
gtcggcatcg acgaggagga ccggacgctg tggcggcggt tcggcgccga cctcgcctca 720
ctgaacccca agcgcatcgg cgccaccatg ccggagatga tcgcgcacat ccacgaggtg 780
atcgacgagc ggcgcgcggc cctgcgggac gacctgctca gcgggctcat ccgggcgcag 840
gacgacgacg gcggccggct gagcgacgtc gagatggtca ccctggtcct gaccctggtg 900
ctggccggtc acgagaccac cgcccacctc atcagcaacg gcaccctcgc cctgctcacc 960
caccccgacc agcggcggct gatcgacgag gacccggcac tgctgccgcg cgcggtccac 1020
gagctgatgc gctggtgcgg gccgatccag gccacccagc tgcggtacgc catggaggac 1080
accgaggtcg ccggtgtcca ggtccgccag ggcgaggccc tgatgttcag cctcgtcgcg 1140
gccaaccacg acccgcgcca ctacaccggg ccggagcggc tcgacctgac gcggcagccg 1200
gccggccgcg ccgaggacca cgtcggcttc gggcacggga tgcactactg cctgggtgcc 1260
tcactcgccc ggcaggaggc cgaggtggcc tacgggaagc tgctcacccg ctacccggac 1320
ctggagctcg ctctcacacc ggaacagctg gaggaccagg aacgcctgcg gcagcccggc 1380
acctggcgcc tgcggcggct gccgctgaag ctgcacgcgc ggagctga 1428
22
475
PRT
Streptomyces
22
Met Thr Thr Ser Pro Thr Glu Ser Arg Ala Ala Thr Pro Thr Gly Ser
1 5 10 15
Thr Ala Ser Pro Ser Thr Ala Ser Ala Pro Ala Thr Thr Pro Ser Ala
20 25 30
Ala Thr Ser Ser Asp Thr Thr Tyr Pro Ala Thr Thr Asp Arg Thr Thr
35 40 45
Leu Pro Ser Tyr Val Gly Leu His Pro Gly Glu Pro Asn Leu Met Glu
50 55 60
Pro Glu Leu Leu Asp Asn Pro Tyr Thr Gly Tyr Gly Thr Leu Arg Glu
65 70 75 80
Gln Ala Pro Leu Val Arg Ala Arg Phe Ile Asp Asp Ser Pro Ile Trp
85 90 95
Leu Val Thr Arg Phe Asp Val Val Arg Glu Val Met Arg Asp Gln Arg
100 105 110
Phe Val Asn Asn Pro Thr Leu Val Pro Gly Ile Gly Ala Asp Lys Asp
115 120 125
Pro Arg Ala Arg Leu Ile Glu Leu Phe Gly Ile Pro Glu Asp Leu Thr
130 135 140
Pro Tyr Leu Ala Asp Thr Ile Leu Thr Ser Asp Pro Pro Asp His Thr
145 150 155 160
Arg Leu Arg Arg Leu Val Ser Arg Ala Phe Thr Ala Arg Arg Ile Gln
165 170 175
Asp Leu Arg Pro Arg Val Glu Gln Ile Thr Asp Ala Leu Leu Glu Arg
180 185 190
Leu Pro Asp His Ala Glu Asp Gly Val Val Asp Leu Val Glu His Phe
195 200 205
Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys Glu Leu Val Gly Ile Asp
210 215 220
Glu Glu Asp Arg Thr Leu Trp Arg Arg Phe Gly Ala Asp Leu Ala Ser
225 230 235 240
Leu Asn Pro Lys Arg Ile Gly Ala Thr Met Pro Glu Met Ile Ala His
245 250 255
Ile His Glu Val Ile Asp Glu Arg Arg Ala Ala Leu Arg Asp Asp Leu
260 265 270
Leu Ser Gly Leu Ile Arg Ala Gln Asp Asp Asp Gly Gly Arg Leu Ser
275 280 285
Asp Val Glu Met Val Thr Leu Val Leu Thr Leu Val Leu Ala Gly His
290 295 300
Glu Thr Thr Ala His Leu Ile Ser Asn Gly Thr Leu Ala Leu Leu Thr
305 310 315 320
His Pro Asp Gln Arg Arg Leu Ile Asp Glu Asp Pro Ala Leu Leu Pro
325 330 335
Arg Ala Val His Glu Leu Met Arg Trp Cys Gly Pro Ile Gln Ala Thr
340 345 350
Gln Leu Arg Tyr Ala Met Glu Asp Thr Glu Val Ala Gly Val Gln Val
355 360 365
Arg Gln Gly Glu Ala Leu Met Phe Ser Leu Val Ala Ala Asn His Asp
370 375 380
Pro Arg His Tyr Thr Gly Pro Glu Arg Leu Asp Leu Thr Arg Gln Pro
385 390 395 400
Ala Gly Arg Ala Glu Asp His Val Gly Phe Gly His Gly Met His Tyr
405 410 415
Cys Leu Gly Ala Ser Leu Ala Arg Gln Glu Ala Glu Val Ala Tyr Gly
420 425 430
Lys Leu Leu Thr Arg Tyr Pro Asp Leu Glu Leu Ala Leu Thr Pro Glu
435 440 445
Gln Leu Glu Asp Gln Glu Arg Leu Arg Gln Pro Gly Thr Trp Arg Leu
450 455 460
Arg Arg Leu Pro Leu Lys Leu His Ala Arg Ser
465 470 475
23
1293
DNA
Streptomyces tubercidicus
23
atgtcggcat tatccaactc cccgctcgcc gcacatgtcg ggaaacaccc tggcgagccg 60
aatgtgatgg acccggcgct gatcaccgac ccgttcggcg gctacggcgc actgcgcgag 120
caaggcccgg tcgtacgggg ccggttcatg gacgactcgc ccgtctggct ggtgacgcgc 180
ttcgaagagg tccgccaagt cctgcgcgat cagcggttcg tgaacaaccc ggccgcaccg 240
tccctgggac gctcgatcga cgaaagcccc gcggtcagac ttttggaaat gttggggttg 300
cccgaccatt tccggccgta tctgctcggg tcgatcctca actacgacgc acccgaccac 360
acccggctcc gccgactggt ctcgcgcgcc ttcacggcac gcaagatcac cgacctgcgg 420
ccgcgggtcg agcagatcac cgacgacctg ctgacccggc ttcccgagca cgccgaggac 480
ggtgtggtcg acctcatcca gcacttcgcc taccccctgc cgatcaccgt gatctgcgaa 540
ctggtcggca tcgccgaagc ggaccgcccg caatggcgga agtggggagc cgatctcgtc 600
tcgctggagc cggggcggct gagcaccgcg ttcccggcga tggtcgagca catccatgag 660
ctgatccgcg agcggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgcacc 720
catgacgacg acggcggccg gctcagcgac atcgagatgg tcaccatgat cctcacgatc 780
gtcctggccg gccacgagac caccgcccac ctcataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctactcaag gacgatccgg cgctgctgcc gcgcgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cacatgaccc agctgcggtt cgcgtccgag 960
gacgtcgagg tcgccgggac accgatccac aagggcgacg ccgtacaact catcctggta 1020
tcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgacct gacccgccac 1080
cccgccggcc acgccgagaa ccatgtgggc ttcggccacg gaatgcacta ctgcctgggt 1140
gccaccctcg ccaaacagga aggcgaagtc gccttctccc gcctcttcac gcactacccg 1200
gaactgtccc tgggcgtcgc ggcggaccag ctggcgcgga cacaggtacc cggcagctgg 1260
cggctggaca ccctgccgct gcgactgggg tga 1293
24
430
PRT
Streptomyces tubercidicus
24
Met Ser Ala Leu Ser Asn Ser Pro Leu Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Ser Leu Gly Arg Ser Ile Asp Glu Ser Pro Ala Val Arg Leu Leu Glu
85 90 95
Met Leu Gly Leu Pro Asp His Phe Arg Pro Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Thr Asp Asp Leu Leu Thr Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Ala Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Val Ser Leu Glu Pro Gly Arg Leu Ser
195 200 205
Thr Ala Phe Pro Ala Met Val Glu His Ile His Glu Leu Ile Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Ile Glu Met Val Thr Met
245 250 255
Ile Leu Thr Ile Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Phe Ala Ser Glu
305 310 315 320
Asp Val Glu Val Ala Gly Thr Pro Ile His Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Ser Arg Leu Phe Thr His Tyr Pro
385 390 395 400
Glu Leu Ser Leu Gly Val Ala Ala Asp Gln Leu Ala Arg Thr Gln Val
405 410 415
Pro Gly Ser Trp Arg Leu Asp Thr Leu Pro Leu Arg Leu Gly
420 425 430
25
1293
DNA
Streptomyces
25
atgtcggcat tatccagctc accgttcgcc gcgcatgtcg ggaaacaccc gggcgagccg 60
aatgtgatgg acccggcgct gatcgccgat ccgttcggtg gttatggcgc actgcgtgag 120
caagggccgg tcgtacgggg ccggttcatg gacgactcac ccgtctggct cgtgacgcgc 180
ttcgaggaag tccgccaagt cctgcgcgac cagcggttcc tgaacgatcc gacggccccc 240
tccctggggc gctcattcga cgacagcccc acggccaggc tgctggagat gatgggactg 300
cccgagcatt tccggccgta tctgctcggt tcgattctga acaacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgcgcc ttcacggcac gcaagatcac cgacctgcgg 420
ccgcgggtcg agcagatcgc cgacgagctg ctgacccggc ttcccgagta cgccgaggac 480
ggcgtggtcg acctcatcaa gcacttcgcc taccccctgc cgatcgccgt catctgcgaa 540
ctggtcggca tagccgaagc ggatcgtccg cagtggcgga agtggggtgc cgacctcgtc 600
tcgctgcagc cggaccggct cagcacctcg ttcccggcga tgatcgagca catccatgag 660
ctgatccgcg agcggcgcgg ggcgctcacg gacgatctgc tcagcgagct gatccgtgcc 720
catgacgacg acggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacggtg 780
gtgctcgccg gccacgagac caccgcgcac ctcataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg gctgctcagg gacgacccgg ctctgtttcc ccgtgccgtc 900
cacgagctgt tgcgctggtg cgggccggtc cacatgaccc agatgcggtt tgcgtccgag 960
gatgtcgaca tcgccgggac gaagatccgt aagggcgacg ccgtacaact gatcctggta 1020
tcggccaact tcgacccccg ccactacacc gaccccgaac gtctcgacct gacccgtcac 1080
cccgccggcc acgccgagaa ccatgtgggc ttcggccacg ggatgcacta ctgcctgggc 1140
gccaccctcg ccaaacagga gggcgaagtc gcgttcgaga agctcttcgc gcactacccg 1200
gaggtgtcgc tgggcgtcgc accggaacaa ctggaaagga caccactgcc cggcagctgg 1260
cggctcgatt ccctgccgct gcggttgcgg taa 1293
26
430
PRT
Streptomyces
26
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Ala Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Leu Asn Asp Pro Thr Ala Pro
65 70 75 80
Ser Leu Gly Arg Ser Phe Asp Asp Ser Pro Thr Ala Arg Leu Leu Glu
85 90 95
Met Met Gly Leu Pro Glu His Phe Arg Pro Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Thr Arg Leu Pro Glu Tyr Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Lys His Phe Ala Tyr Pro Leu Pro Ile Ala
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Ala Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Leu Ser
195 200 205
Thr Ser Phe Pro Ala Met Ile Glu His Ile His Glu Leu Ile Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Ala
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Arg Asp Asp Pro Ala Leu Phe Pro Arg Ala Val His Glu Leu Leu
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Met Arg Phe Ala Ser Glu
305 310 315 320
Asp Val Asp Ile Ala Gly Thr Lys Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Glu Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Glu Lys Leu Phe Ala His Tyr Pro
385 390 395 400
Glu Val Ser Leu Gly Val Ala Pro Glu Gln Leu Glu Arg Thr Pro Leu
405 410 415
Pro Gly Ser Trp Arg Leu Asp Ser Leu Pro Leu Arg Leu Arg
420 425 430
27
1293
DNA
Streptomyces
27
atgtcggcat tatccagctc tccgttcgct gcgcatgtcg ggaaacaccc gggtgagccg 60
aatgtgatgg agccggcgct gctcaccgac ccgttcgcgg gctacggcgc gctgcgtgag 120
caggccccgg tcgtacgggg ccggttcgtg gacgactcac cggtctggtt cgtgacgcgc 180
ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggccgcgccg 240
cccctggccc catcggccga ggagaacccg ctgaccaggc tgatggacat gctgggcctc 300
cccgagcacc tccgcgtcta catgctcggg tcgattctca actacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcgcgcgcg ttcacggcgc ggaagatcac cgatctgcga 420
ccgcgtgtcg agcagatcgc cgacgagctg ctggcccgcc tccccgagta cgccgaggac 480
ggcgtcgtcg acctcatcca gcatttcgcc tacccgctgc cgatcaccgt catctgcgag 540
ctggtcggca tacccgaagc ggaccgcccg cagtggcgga agtggggcgc cgacctcatc 600
tcgatggacc cggaccggct cggcgcaacg ttcccggcga tgatcgagca catccatgag 660
atggtccggg agcggcgcgc ggcgctcacc gatgatctgc tcagcgagct gatccgtacc 720
catgacgacg atggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780
gtcctcgccg gtcacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc ccgggccgtc 900
catgagctga tgcgctggtg cgggccggtg cagatgacgc agctgcgcta cgcggccgcc 960
gacgtcgacc tcgccggtac gcggatccac aagggcgacg ccgtacaact cctcctggtt 1020
gcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgatct gacgcgtcac 1080
cccgccggcc acgccgagaa ccatgtgggt ttcggccacg gtgcgcatta ctgcctgggt 1140
gccaccctcg ccaagcagga gggcgaagtc gcgttcggca agctgctcgc gcactacccg 1200
gagatgtccc tgggcatcga accggaacgt ctggagcgat tgccgctgcc tggcaactgg 1260
cggctgaatt ccctgccgtt gcggctgggg tga 1293
28
430
PRT
Streptomyces
28
Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Glu Pro Ala Leu Leu Thr Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Ala Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ala Pro
65 70 75 80
Pro Leu Ala Pro Ser Ala Glu Glu Asn Pro Leu Thr Arg Leu Met Asp
85 90 95
Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Met Leu Gly Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu Tyr Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly
195 200 205
Ala Thr Phe Pro Ala Met Ile Glu His Ile His Glu Met Val Arg Glu
210 215 220
Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Ile Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ala Ala
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Arg Ile His Lys Gly Asp Ala Val Gln
325 330 335
Leu Leu Leu Val Ala Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro
385 390 395 400
Glu Met Ser Leu Gly Ile Glu Pro Glu Arg Leu Glu Arg Leu Pro Leu
405 410 415
Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Leu Arg Leu Gly
420 425 430
29
1293
DNA
Streptomyces lydicus
29
atgtcggcat tacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgaaccg 60
aacgtgatgg atccggcgct gatcggtgat ccgttcgccg gttacggcgc gctgcgcgag 120
cagggcccgg tcgtgcgggg gcggttcgtg gacgactccc ccgtgtggtt cgtgacccgc 180
ttcgaggagg tccgcgaggt cctgcgggac cagcggttcc ggaacaatcc ggtctcctcg 240
gcgccggacg cggaccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300
cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420
ccgcgcgtcg cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480
ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540
ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600
tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660
ctgatcgcgg cgcggcgccg ggcgctcacc gacgacctgc tcagcgagct gatccgaacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900
cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960
gacgtcgagc tggcgggcgt ccggatccgc aagggggacg ccgtccagct catcctggtg 1020
tcggcgaacc gcgatccgcg ccactacacc gaacccgacc gtctggacct gacccggcac 1080
cccgccggcc acgccgagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140
gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200
gagctgtcgc tggccgtcgc gccggacgcc ctggagcgca caccggtacc gggcagctgg 1260
cggctgaatg cgctgccgct gcgtctgcgc tga 1293
30
430
PRT
Streptomyces lydicus
30
Met Ser Ala Leu Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Gly Asp Pro Phe
20 25 30
Ala Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Val Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Glu Val Leu Arg Asp Gln Arg Phe Arg Asn Asn Pro Val Ser Ser
65 70 75 80
Ala Pro Asp Ala Asp Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp
85 90 95
Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Ala
130 135 140
Gln Ile Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Glu Asp Arg Pro Gln Trp
180 185 190
Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Asp Arg Met Ser
195 200 205
Arg Ser Phe Pro Ala Met Ile Asp His Ile His Glu Leu Ile Ala Ala
210 215 220
Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Met Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Glu Leu Ala Gly Val Arg Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Arg Asp Pro Arg His Tyr Thr Glu Pro
340 345 350
Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Leu Arg His Phe Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Asp Ala Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asn Ala Leu Pro Leu Arg Leu Arg
420 425 430
31
1293
DNA
Streptomyces lydicus
31
atgtcggcat cgaccagctc tcccctcagc gcccacgtcg gcaagcaccc gggcgaaccc 60
catgtgatgg atccggcgct gatcagcgat ccgttcggcg gctacggtgc cctgcgcgag 120
cagggaccgg tcgtccgcgg acggttcttc gacgactcgc ccttgtggtt agtgacccgc 180
ttcgaggaag tccgccaggt cctgcgcgac cagcggttcg tgaacaaccc cgccgacccg 240
gcgctcggcg tcgcgccgga ggactccccg cagctgcgcg cgctggcgat gctgggcatc 300
cccgagcacc tgcacggcta tctgctcaac tcgatcctca actacgacgc ccccgaccac 360
acccggctgc gccgcctggt ctcccgcgcc ttcaccgccc gcaagatcac cgatcttcgg 420
ccgcgggtgg cgcagataac cgccgagctg ctggaccgac tcccggagca cgccgaggac 480
ggcgtggtcg acctgatcga gcacttcgcc tacccgctgc cgatcacggt gatctgcgaa 540
cttgtcggca tcgccgcgga ggaccggccc cagtggcgtt cctggggcgc cgacctggtc 600
tcggtggacc ccgaccggct cggccggacc ttcccggcga tgatcgacca catccacgcg 660
ctgatcggcc agcggcgggc cgcgctcacc gacgacctgc tcagcgagct gatccggacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccctggt cctcaccctc 780
gtgctggccg gccacgagac caccgcacac ctcatcggca acggcaccgc ggccctgctc 840
acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgccgtc 900
cacgagctga tgcgctggtg cgggccggtg cacgtcaccc agctgcggta cgccgccgag 960
gacgtcgacc tcgccggcac ccggatccgc aggggcgacg ccgtgcaggc cgtcctggtc 1020
tcggcgaacc acgacccgcg ccactacacc gaccccgaac gcctggacct gacccggcag 1080
cccgcgggcc gcgccgagaa ccacgtgggc ttcgggcacg gggcgcacta ctgcctgggc 1140
gccagcctcg ccaggcagga gggtgaggtc gccctgggcg ccctgttcga ccgctacccc 1200
gacctggcgc tggcggtggc gcccgaggag ctggagcgca ccccggtgcc cggtacctgg 1260
cggctgacgt cgctgccggt gcgcctgggc tga 1293
32
430
PRT
Streptomyces lydicus
32
Met Ser Ala Ser Thr Ser Ser Pro Leu Ser Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro His Val Met Asp Pro Ala Leu Ile Ser Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg
35 40 45
Phe Phe Asp Asp Ser Pro Leu Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Asp Pro
65 70 75 80
Ala Leu Gly Val Ala Pro Glu Asp Ser Pro Gln Leu Arg Ala Leu Ala
85 90 95
Met Leu Gly Ile Pro Glu His Leu His Gly Tyr Leu Leu Asn Ser Ile
100 105 110
Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Ala
130 135 140
Gln Ile Thr Ala Glu Leu Leu Asp Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Glu His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Ala Ala Glu Asp Arg Pro Gln Trp
180 185 190
Arg Ser Trp Gly Ala Asp Leu Val Ser Val Asp Pro Asp Arg Leu Gly
195 200 205
Arg Thr Phe Pro Ala Met Ile Asp His Ile His Ala Leu Ile Gly Gln
210 215 220
Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Leu
245 250 255
Val Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met
290 295 300
Arg Trp Cys Gly Pro Val His Val Thr Gln Leu Arg Tyr Ala Ala Glu
305 310 315 320
Asp Val Asp Leu Ala Gly Thr Arg Ile Arg Arg Gly Asp Ala Val Gln
325 330 335
Ala Val Leu Val Ser Ala Asn His Asp Pro Arg His Tyr Thr Asp Pro
340 345 350
Glu Arg Leu Asp Leu Thr Arg Gln Pro Ala Gly Arg Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Ser Leu Ala
370 375 380
Arg Gln Glu Gly Glu Val Ala Leu Gly Ala Leu Phe Asp Arg Tyr Pro
385 390 395 400
Asp Leu Ala Leu Ala Val Ala Pro Glu Glu Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Thr Trp Arg Leu Thr Ser Leu Pro Val Arg Leu Gly
420 425 430
33
1281
DNA
Streptomyces tubercidicus
33
atgaactctc cgttcgccgc gcacgtcggg aaacacccgg gcgagccgaa tgtgatggac 60
cccgccctga tcaccgaccc gttcaccggc tacggcgcgc tgcgtgagca gggcccggtc 120
gtacggggcc ggttcatgga cgactcgccc gtctggctgg tgacgcggtt cgaggaggtc 180
cgccaggtcc tgcgcgacca gcggttcgtg aacaatccgg cctcgccgtc cctgaactac 240
gcgcccgagg acaacccgct gacccggctg atggagatgc tgggcctccc cgagcacctc 300
cgcgtctacc tgctcggatc gatcctcaac tacgacgccc ccgaccacac ccggctgcgc 360
cgtctggtgt cgcgggcgtt cacggcccgc aagatcaccg acctgcggcc ccgggtcgag 420
cagatcgccg acgcgctgct ggcccggctg cccgagcacg ccgaggacgg cgtcgtcgac 480
ctcatccagc acttcgccta ccccctgccg atcaccgtca tctgcgaact ggtcggcata 540
cccgaagcgg accgcccgca gtggcgaacg tggggcgccg acctcatctc gatggatccg 600
gaccggctcg gcgcctcgtt cccggcgatg atcgagcaca tccatcagat ggtccgggaa 660
cggcgcgagg cgctcaccga cgacctgctc agcgaactga tccgcaccca tgacgacgac 720
ggcgggcggc tcagcgacgt cgagatggtc accatgatcc tcacgctcgt cctcgccggc 780
cacgagacca ccgcccacct catcagcaac ggcacggcgg cgctgctcac ccaccccgac 840
cagctgcgtc tggtcaagga cgatccggcc ctcctccccc gtgccgtcca cgagctgatg 900
cgctggtgcg ggccggtgca catgacccag ctgcgctacg ccaccgccga cgtcgacctc 960
gccggcacac cgatccgcca gggcgatgcc gttcaactca tcctggtatc ggccaacttc 1020
gacccccgtc actacaccga ccccgaccgc ctcgatctca cccggcaccc cgcgggccac 1080
gccgagaacc atgtgggttt cggccatgga gcgcactact gcctgggcgc cacactcgcc 1140
aaacaggaag gtgaagtcgc cttcggcaaa ctgctcacgc actacccgga catatcgctg 1200
ggcatcgccc cggaacacct ggagcggaca ccgctgccgg gcaactggcg gctgaactcg 1260
ctgccggtgc ggttggggtg a 1281
34
426
PRT
Streptomyces tubercidicus
34
Met Asn Ser Pro Phe Ala Ala His Val Gly Lys His Pro Gly Glu Pro
1 5 10 15
Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe Thr Gly Tyr Gly
20 25 30
Ala Leu Arg Glu Gln Gly Pro Val Val Arg Gly Arg Phe Met Asp Asp
35 40 45
Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val Arg Gln Val Leu
50 55 60
Arg Asp Gln Arg Phe Val Asn Asn Pro Ala Ser Pro Ser Leu Asn Tyr
65 70 75 80
Ala Pro Glu Asp Asn Pro Leu Thr Arg Leu Met Glu Met Leu Gly Leu
85 90 95
Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser Ile Leu Asn Tyr Asp
100 105 110
Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser Arg Ala Phe Thr
115 120 125
Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu Gln Ile Ala Asp
130 135 140
Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp Gly Val Val Asp
145 150 155 160
Leu Ile Gln His Phe Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys Glu
165 170 175
Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp Arg Thr Trp Gly
180 185 190
Ala Asp Leu Ile Ser Met Asp Pro Asp Arg Leu Gly Ala Ser Phe Pro
195 200 205
Ala Met Ile Glu His Ile His Gln Met Val Arg Glu Arg Arg Glu Ala
210 215 220
Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr His Asp Asp Asp
225 230 235 240
Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met Ile Leu Thr Leu
245 250 255
Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile Ser Asn Gly Thr
260 265 270
Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu Val Lys Asp Asp
275 280 285
Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met Arg Trp Cys Gly
290 295 300
Pro Val His Met Thr Gln Leu Arg Tyr Ala Thr Ala Asp Val Asp Leu
305 310 315 320
Ala Gly Thr Pro Ile Arg Gln Gly Asp Ala Val Gln Leu Ile Leu Val
325 330 335
Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro Asp Arg Leu Asp
340 345 350
Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His Val Gly Phe Gly
355 360 365
His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala Lys Gln Glu Gly
370 375 380
Glu Val Ala Phe Gly Lys Leu Leu Thr His Tyr Pro Asp Ile Ser Leu
385 390 395 400
Gly Ile Ala Pro Glu His Leu Glu Arg Thr Pro Leu Pro Gly Asn Trp
405 410 415
Arg Leu Asn Ser Leu Pro Val Arg Leu Gly
420 425
35
195
DNA
Streptomyces tubercidicus
35
atgcggatca cgatcgacac cgacatctgt atcggcgccg gccagtgcgc cctgaccgcg 60
ccgggagtgt tcacccagga cgacgacggc ttcagcgccc tgctgcccgg ccgcgaggac 120
ggtgcgggcg acccgctggt gcgggaggcc gcccgcgcct gcccggtgca ggccatcacg 180
gtcacggacg actga 195
36
64
PRT
Streptomyces tubercidicus
36
Met Arg Ile Thr Ile Asp Thr Asp Ile Cys Ile Gly Ala Gly Gln Cys
1 5 10 15
Ala Leu Thr Ala Pro Gly Val Phe Thr Gln Asp Asp Asp Gly Phe Ser
20 25 30
Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly Asp Pro Leu Val Arg
35 40 45
Glu Ala Ala Arg Ala Cys Pro Val Gln Ala Ile Thr Val Thr Asp Asp
50 55 60
37
195
DNA
Streptomyces tubercidicus
37
atgcggatca ccatcgacac cgacatctgc atcggcgccg gccagtgcgc cctgaccgcg 60
ccgggagtct tcacccagga cgacgacggt ttcagcgccc tgctgcccgg ccgcgaggac 120
ggcgcgggcg acccgctggt gcgcgaggcc gcccgcgcct gccccgtgca ggccatttcg 180
gtcacggacg actga 195
38
64
PRT
Streptomyces tubercidicus
38
Met Arg Ile Thr Ile Asp Thr Asp Ile Cys Ile Gly Ala Gly Gln Cys
1 5 10 15
Ala Leu Thr Ala Pro Gly Val Phe Thr Gln Asp Asp Asp Gly Phe Ser
20 25 30
Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly Asp Pro Leu Val Arg
35 40 45
Glu Ala Ala Arg Ala Cys Pro Val Gln Ala Ile Ser Val Thr Asp Asp
50 55 60
39
9
PRT
Artificial Sequence
Synthetic peptide.
39
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1 5
40
10
PRT
Artificial Sequence
Synthetic peptide.
40
Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
1 5 10
41
11
PRT
Artificial Sequence
Synthetic peptide.
41
Gln Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp
1 5 10
42
11
PRT
Artificial Sequence
Synthetic peptide.
42
Tyr Thr Asp Ile Glu Met Asn Arg Leu Gly Lys
1 5 10
43
7
PRT
Streptomyces tubercidicus
43
Ile Ala Gly His Glu Thr Thr
1 5
44
20
DNA
Streptomyces tubercidicus
misc_feature
(6)..(18)
Nucleotides 6, 9 and 18 are “s” wherein “s” = g
or c.
44
atcgcsggsc acgagacsac 20
45
7
PRT
Streptomyces tubercidicus
45
Val Ala Gly His Glu Thr Thr
1 5
46
20
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 9, and 18 are “s” wherein “s”
= g or c.
46
gtsgcsggsc acgagacsac 20
47
7
PRT
Streptomyces tubercidicus
47
Leu Ala Gly His Glu Thr Thr
1 5
48
20
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 9, and 18 are “s” wherein “s”
= g or c.
48
ctsgcsggsc acgagacsac 20
49
9
PRT
Streptomyces tubercidicus
49
Leu Leu Leu Ile Ala Gly His Glu Thr
1 5
50
25
DNA
Streptomyces tubercidicus
misc_feature
(2)..(17)
Nucleotides 2, 5, 8, 14, and 17 are “s” wherein
“s” = g or c.
50
tsctsctsat cgcsggscac gagac 25
51
9
PRT
Streptomyces tubercidicus
51
His Gln Cys Leu Gly Gln Asn Leu Ala
1 5
52
26
DNA
Streptomyces tubercidicus
misc_feature
(12)..(24)
Nucleotides 12, 15, and 24 are “s” wherein “s”
= g or c.
52
gtggtcacgg asccstgctt ggascg 26
53
8
PRT
Streptomyces tubercidicus
53
Phe Gly His Gly Val His Gln Cys
1 5
54
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(15)
Nucleotides 6, 12, and 15 are “s” wherein “s” =
g or c.
54
aagccsgtgc cscasgtggt cacg 24
55
8
PRT
Streptomyces tubercidicus
55
Phe Gly Phe Gly Val His Gln Cys
1 5
56
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(15)
Nucleotides 6, 12, and 15 are “s” wherein “s” =
g or c.
56
aaggcsaagc cscasgtggt cacg 24
57
8
PRT
Streptomyces tubercidicus
57
Phe Gly His Gly Ile His Gln Cys
1 5
58
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(12)
Nucleotides 6 and 12 are “s” wherein “s” = g or
c.
58
aagccsgtgc cstaggtggt cacg 24
59
8
PRT
Streptomyces tubercidicus
59
Phe Gly His Gly Val His Phe Cys
1 5
60
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(15)
Nucleotides 6, 12, and 15 are “s” wherein “s” =
g or c.
60
aagccsgtgc cscasgtgaa gacg 24
61
24
PRT
Streptomyces tubercidicus
61
His Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro
1 5 10 15
Phe Thr Gly Tyr Gly Ala Leu Arg
20
62
21
PRT
Streptomyces tubercidicus
62
Phe Val Asn Asn Pro Ala Ser Pro Ser Leu Asn Tyr Ala Pro Glu Asp
1 5 10 15
Asn Pro Leu Thr Arg
20
63
19
PRT
Streptomyces tubercidicus
63
Leu Leu Thr His Tyr Pro Asp Ile Ser Leu Gly Ile Ala Pro Glu His
1 5 10 15
Leu Glu Arg
64
17
PRT
Streptomyces tubercidicus
64
Val Tyr Leu Leu Gly Ser Ile Leu Asn Tyr Asp Ala Pro Asp His Thr
1 5 10 15
Arg
65
13
PRT
Streptomyces tubercidicus
65
Thr Trp Gly Ala Asp Leu Ile Ser Met Asp Pro Asp Arg
1 5 10
66
13
PRT
Streptomyces tubercidicus
66
Glu Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg
1 5 10
67
12
PRT
Streptomyces tubercidicus
67
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg
1 5 10
68
12
PRT
Streptomyces tubercidicus
68
Leu Met Glu Met Leu Gly Leu Pro Glu His Leu Arg
1 5 10
69
11
PRT
Streptomyces tubercidicus
69
Val Glu Gln Ile Ala Asp Ala Leu Leu Ala Arg
1 5 10
70
11
PRT
Streptomyces tubercidicus
70
Leu Val Lys Asp Asp Pro Ala Leu Leu Pro Arg
1 5 10
71
8
PRT
Streptomyces tubercidicus
71
Asp Asp Pro Ala Leu Leu Pro Arg
1 5
72
8
PRT
Streptomyces tubercidicus
72
Thr Pro Leu Pro Gly Asn Trp Arg
1 5
73
7
PRT
Streptomyces tubercidicus
73
Leu Asn Ser Leu Pro Val Arg
1 5
74
7
PRT
Streptomyces tubercidicus
74
Ile Thr Asp Leu Arg Pro Arg
1 5
75
7
PRT
Streptomyces tubercidicus
75
Glu Gln Gly Pro Val Val Arg
1 5
76
7
PRT
Streptomyces tubercidicus
76
Ala Val His Glu Leu Met Arg
1 5
77
5
PRT
Streptomyces tubercidicus
77
Ala Phe Thr Ala Arg
1 5
78
5
PRT
Streptomyces tubercidicus
78
Phe Glu Glu Val Arg
1 5
79
7
PRT
Streptomyces tubercidicus
79
Pro Gly Glu Asp Asn Val Met
1 5
80
21
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 12, and 18 are “s” wherein
“s” = c or g.
80
ccsggsgarc csaaygtsat g 21
81
7
PRT
Streptomyces tubercidicus
81
Ala Leu Ile Thr Asp Pro Phe
1 5
82
21
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 6, 12, and 18 are “s”wherein
“s” = c or g.
82
gcsctsatya csgacccstt c 21
83
8
PRT
Streptomyces tubercidicus
83
Phe Met Asp Asp Ser Pro Val Trp
1 5
84
24
DNA
Streptomyces tubercidicus
misc_feature
(13)..(13)
Nucleotide 13 is “w” wherein “w” = a or t.
84
ttcatggacg acwssccsgt stgg 24
85
8
PRT
Streptomyces tubercidicus
85
Leu Asn Tyr Asp Ala Pro Asp His
1 5
86
24
DNA
Streptomyces tubercidicus
misc_feature
(3)..(18)
Nucleotides 3, 15 and 18 are “s” wherein “s” =
c or g.
86
ctsaaytayg acgcsccsga ccac 24
87
8
PRT
Streptomyces tubercidicus
87
Val Glu Gln Ile Ala Asp Ala Leu
1 5
88
24
DNA
Streptomyces tubercidicus
misc_feature
(3)..(24)
Nucleotides 3, 15, 21, and 24 are “s” wherein
“s” = c or g.
88
gtsgarcaga tygcsgacgc scts 24
89
8
PRT
Streptomyces tubercidicus
89
Asp Leu Ile Ser Met Asp Pro Asp
1 5
90
24
DNA
Streptomyces tubercidicus
misc_feature
(6)..(21)
Nucleotides 6, 11, 12, and 21 are “s” wherein
“s” = c or g.
90
ctggastarw sstacctggg sctg 24
91
36
DNA
Streptomyces tubercidicus
91
agattaatta atgtcggaat taatgaactg tccgtt 36
92
32
DNA
Streptomyces tubercidicus
92
aaactcaccc caaccgcacc ggcagcgagt tc 32
93
7
PRT
Streptomyces tubercidicus
93
Met Ser Glu Leu Met Asn Ser
1 5
94
1293
DNA
Streptomyces
94
atgtcggcaa tatccagctc cccgttcgcc gcacacgtcg gaaagcatcc cggcgagccg 60
aatgtgatgg acccggcgct gatcaccgac ccgttcggcg gctacggcgc actgcgtgag 120
caaggccccg tcctaccggg ccggttcatg gacgactcac ccgtctggct cgtgacgcgc 180
ttcgaagagg tccgccaagt cctgcgcgat cagcggttcc tgaacaaccc ggccgcgtcg 240
tcaccggggc attcgatcga cgagagcccc acggccaggc tgctggacat gatggggatg 300
cccgaacatt tccggccgta tctgatgggg tcgatcctca acaacgacgc ccccgaccac 360
acccggctgc gccgtctggt gtcacgcgcg ttcacggcac gcaagatcac cgatctgcgg 420
ccgcgggtcg agcagctcgc cgacgagctg ctggcccggc ttcccgagca cgccgaggac 480
ggtgtggtcg acctgatcaa gcacttcgcc tatcccctgc cgatcaccgt gatctgcgaa 540
ctggtcggca tcccggaagc ggaccgcccg caatggcgga agtggggcgc cgacctcgtt 600
tcgctgcagc cggagcggct cagcacctcg ttcccggcga tgatcgagca catccatgaa 660
ctgatccgcg agcggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgtacc 720
catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780
gtcctggccg gccacgagac caccgcccac ctgataggca acggcacggc ggcgctgctc 840
acccaccccg accagctgcg cctggtcaag gacgacccgg agctgcttcc gcgtgccgtc 900
cacgagctgc tgcgctggtg cgggccggtc cagatgaccc agctgcggta cgcctccgag 960
gatgtcgaga tcgccgggac gccgatccgt aagggcgacg ccgtacaact catcctggta 1020
tcggcgaact tcgacccccg ccactacacc gcccccgaac gcctcgacct gacccgccac 1080
cccgccggcc acgccgagaa ccatgtgggc ttcggccacg gaatgcacta ctgcctgggc 1140
gccaccctcg ccaaacagga gggcgaagtc gcgttcggca agctcttcac gcactacccg 1200
gagctgtcgc tggccgtcgc accggacgag ttggagcgaa cgccggtgcc cggcagctgg 1260
cggttggatt cgctgccggt gcggttgggg tga 1293
95
430
PRT
Streptomyces
95
Met Ser Ala Ile Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His
1 5 10 15
Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu Ile Thr Asp Pro Phe
20 25 30
Gly Gly Tyr Gly Ala Leu Arg Glu Gln Gly Pro Val Leu Pro Gly Arg
35 40 45
Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val
50 55 60
Arg Gln Val Leu Arg Asp Gln Arg Phe Leu Asn Asn Pro Ala Ala Ser
65 70 75 80
Ser Pro Gly His Ser Ile Asp Glu Ser Pro Thr Ala Arg Leu Leu Asp
85 90 95
Met Met Gly Met Pro Glu His Phe Arg Pro Tyr Leu Met Gly Ser Ile
100 105 110
Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser
115 120 125
Arg Ala Phe Thr Ala Arg Lys Ile Thr Asp Leu Arg Pro Arg Val Glu
130 135 140
Gln Leu Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp
145 150 155 160
Gly Val Val Asp Leu Ile Lys His Phe Ala Tyr Pro Leu Pro Ile Thr
165 170 175
Val Ile Cys Glu Leu Val Gly Ile Pro Glu Ala Asp Arg Pro Gln Trp
180 185 190
Arg Lys Trp Gly Ala Asp Leu Val Ser Leu Gln Pro Glu Arg Leu Ser
195 200 205
Thr Ser Phe Pro Ala Met Ile Glu His Ile His Glu Leu Ile Arg Glu
210 215 220
Arg Arg Gly Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu Ile Arg Thr
225 230 235 240
His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met
245 250 255
Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu Ile
260 265 270
Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gln Leu Arg Leu
275 280 285
Val Lys Asp Asp Pro Glu Leu Leu Pro Arg Ala Val His Glu Leu Leu
290 295 300
Arg Trp Cys Gly Pro Val Gln Met Thr Gln Leu Arg Tyr Ala Ser Glu
305 310 315 320
Asp Val Glu Ile Ala Gly Thr Pro Ile Arg Lys Gly Asp Ala Val Gln
325 330 335
Leu Ile Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Ala Pro
340 345 350
Glu Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His
355 360 365
Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala
370 375 380
Lys Gln Glu Gly Glu Val Ala Phe Gly Lys Leu Phe Thr His Tyr Pro
385 390 395 400
Glu Leu Ser Leu Ala Val Ala Pro Asp Glu Leu Glu Arg Thr Pro Val
405 410 415
Pro Gly Ser Trp Arg Leu Asp Ser Leu Pro Val Arg Leu Gly
420 425 430
96
18
DNA
Streptomyces
96
cgsccsccsc tswssaas 18
97
21
DNA
Streptomyces
97
sassgcstts bcccartgyt c 21
98
1266
DNA
Streptomyces
98
gtggtcgacg cacaccagac gttcgtcatc gtcgggggtg gcctggccgg cgcaaaggcc 60
gcggagactc tccgcgcgga ggggttcacc ggccgggtga tcctcatctg tgacgagcgc 120
gaccacccgt acgagcgccc cccgctctcc aaggggttcc tgctcggcaa ggaagagcgc 180
gacagcgtgt tcgtccatga gcccgcctgg tacgcccagg cacagatcga actgcacctg 240
ggccagcccg ccgtccgcct cgaccccgag ggcaggaccg tccgcctcgg cgacggcacc 300
ctgatcgcct acgacaagct gctgctggcc accggcgccg aaccgcggcg cctggacatc 360
cccggcaccg gcctggccgg cgtgcaccac ctgcgccgcc tcgcccacgc cgaacggctg 420
cgcggcgtcc tggcctccct cggccgcgac aacggccatc tggtgatcgc cggagccggc 480
tggatcggcc tggaggtcgc cgccgcggcc cgctcctacg gcgccgaggt gaccgtcgtc 540
gaggccgccc cgacgccgct gcacggcatc ctggggcccg aactcggcgg tctgttcacc 600
gatctgcacc gcgagcacgg cgtccgcttc cacttcggcg cccgcttcac cgagatcgtc 660
ggagagggcg gcatggtgct cgccgtgcgc accgacgacg gcgaggaaca ccccgcccac 720
gatgtgctcg ccgcgatcgg cgccgccccg cgcaccgcgc tcgccgaaca ggccgggctg 780
gatctcgccg acccggagac cggcggcggg gtggccgtcg acgcggcgct gcgcacctcc 840
gacccgtaca tctacgccgc cggtgacgtc gccgccgccg accacccgct gctggacacc 900
cggctgcggg tcgaacactg ggccaacgcc ctcaacggcg gcccggccgc cgcccgcgcc 960
atgctcggcc aggacatcag ctacgaccgc atcccgtact tcttctccga ccagtacgac 1020
gtcggcatgg agtactccgg ctacgccccg cccggctcgt acgcccaggt cgtctgccgc 1080
ggcgacgtcg ccaagcggga gttcatcgcc ttctggctgg cggcggacgg ccggctgctc 1140
gcgggcatga acgtcaacgt ctgggacgtc gccgagtcca tccagcaact catccgctcc 1200
ggggcgccgt tggagcccgg cgcactggcc gatccgcagg ttccgctggc ggcactgctc 1260
ccgtag 1266
99
421
PRT
Streptomyces
99
Val Val Asp Ala His Gln Thr Phe Val Ile Val Gly Gly Gly Leu Ala
1 5 10 15
Gly Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Thr Gly Arg
20 25 30
Val Ile Leu Ile Cys Asp Glu Arg Asp His Pro Tyr Glu Arg Pro Pro
35 40 45
Leu Ser Lys Gly Phe Leu Leu Gly Lys Glu Glu Arg Asp Ser Val Phe
50 55 60
Val His Glu Pro Ala Trp Tyr Ala Gln Ala Gln Ile Glu Leu His Leu
65 70 75 80
Gly Gln Pro Ala Val Arg Leu Asp Pro Glu Gly Arg Thr Val Arg Leu
85 90 95
Gly Asp Gly Thr Leu Ile Ala Tyr Asp Lys Leu Leu Leu Ala Thr Gly
100 105 110
Ala Glu Pro Arg Arg Leu Asp Ile Pro Gly Thr Gly Leu Ala Gly Val
115 120 125
His His Leu Arg Arg Leu Ala His Ala Glu Arg Leu Arg Gly Val Leu
130 135 140
Ala Ser Leu Gly Arg Asp Asn Gly His Leu Val Ile Ala Gly Ala Gly
145 150 155 160
Trp Ile Gly Leu Glu Val Ala Ala Ala Ala Arg Ser Tyr Gly Ala Glu
165 170 175
Val Thr Val Val Glu Ala Ala Pro Thr Pro Leu His Gly Ile Leu Gly
180 185 190
Pro Glu Leu Gly Gly Leu Phe Thr Asp Leu His Arg Glu His Gly Val
195 200 205
Arg Phe His Phe Gly Ala Arg Phe Thr Glu Ile Val Gly Glu Gly Gly
210 215 220
Met Val Leu Ala Val Arg Thr Asp Asp Gly Glu Glu His Pro Ala His
225 230 235 240
Asp Val Leu Ala Ala Ile Gly Ala Ala Pro Arg Thr Ala Leu Ala Glu
245 250 255
Gln Ala Gly Leu Asp Leu Ala Asp Pro Glu Thr Gly Gly Gly Val Ala
260 265 270
Val Asp Ala Ala Leu Arg Thr Ser Asp Pro Tyr Ile Tyr Ala Ala Gly
275 280 285
Asp Val Ala Ala Ala Asp His Pro Leu Leu Asp Thr Arg Leu Arg Val
290 295 300
Glu His Trp Ala Asn Ala Leu Asn Gly Gly Pro Ala Ala Ala Arg Ala
305 310 315 320
Met Leu Gly Gln Asp Ile Ser Tyr Asp Arg Ile Pro Tyr Phe Phe Ser
325 330 335
Asp Gln Tyr Asp Val Gly Met Glu Tyr Ser Gly Tyr Ala Pro Pro Gly
340 345 350
Ser Tyr Ala Gln Val Val Cys Arg Gly Asp Val Ala Lys Arg Glu Phe
355 360 365
Ile Ala Phe Trp Leu Ala Ala Asp Gly Arg Leu Leu Ala Gly Met Asn
370 375 380
Val Asn Val Trp Asp Val Ala Glu Ser Ile Gln Gln Leu Ile Arg Ser
385 390 395 400
Gly Ala Pro Leu Glu Pro Gly Ala Leu Ala Asp Pro Gln Val Pro Leu
405 410 415
Ala Ala Leu Leu Pro
420
100
1314
DNA
Streptomyces
100
atgcccgctg cacgccgccg ccttcgacct ccgcaccgga gcggcgacct gcctgcccgc 60
ccgccgggcc gtgcgcaccc accccgtgac cgtccaggac ggcatgatct acgtccatca 120
cgccgcggag gagggcaccg ccgcatgaag tcggtcgctg tcatcggggc ctcgctggcg 180
ggcctgtacg ccgcgcggtc cctgcgttcc caggggttcg acggccgcct ggtgatcgtc 240
ggggacgagt gccacggccc ctacgaccgg cccccgctgt ccaaggactt cctcaccggc 300
gccaccgacc cgggccgact cgccctggcc gacgccgagg agatcgccga actcgacgcc 360
gaatggctgc tgggcacccg ggccaccggg ctcgacaccg gcggacgcac ggtgctgctc 420
gatggcggcc ggtccctgac caccgacggc gtggtcctcg ccaccggcgc cgccccgcgc 480
ctgctccccg gaccggtgcc cgccggggtc cacaccctgc gcaccctcga cgacgcccag 540
gcgctccgtg cggatctggc gccggcgccg gtccgggtcg tggtgatcgg cggcggcttc 600
atcggcgccg aggtcgcctc gtcctgcgcc gccctaggcc atgacgtcac cgtggtcgag 660
gccgcgccgc tccccctcgt cccccaactc ggccacgcca tggccgagat ctgcgccgcc 720
ctgcatgcgg accacggcgt cacgctgctc accggaaccg gtgtcgcccg gctgcgcagc 780
gagggcgacg gccggcgcgt caccggcgtc gagctgaccg acggccgcct gctccccgcc 840
gacgtggtcg tcgtcggcat cggggtacgc ccccgcaccg cctggctcac ggactccgga 900
ctgccgctcg acgacggtgt gctctgcgac gcgggctgtg tcaccccgct gcccgccgtc 960
gtggccgtcg gcgacgtcgc cagggtggac ggcgcccgtg ccgagcactg gaccagcgcc 1020
accgaacagg ccgccgtggc ggcgcggaac ctgctggccg gcagcaccgt cgcgacccac 1080
cggagcctgc cgtacttctg gtccgaccag tacggcgtcc gcatccagtt cgcgggccac 1140
cggctgccca ccgacacacc gcgcgtcctc gaaggctccc ccgacgaccg cagcttcctc 1200
gcctgttacg aacgggacgg acgcaccacc gcggtgctcg ccctcaaccg gccccgcccc 1260
ttcatgcggc tccgccgcga actcgcccgc accgccctgt cggccaccac ctga 1314
101
437
PRT
Streptomyces
101
Met Pro Ala Ala Arg Arg Arg Leu Arg Pro Pro His Arg Ser Gly Asp
1 5 10 15
Leu Pro Ala Arg Pro Pro Gly Arg Ala His Pro Pro Arg Asp Arg Pro
20 25 30
Gly Arg His Asp Leu Arg Pro Ser Arg Arg Gly Gly Gly His Arg Arg
35 40 45
Met Lys Ser Val Ala Val Ile Gly Ala Ser Leu Ala Gly Leu Tyr Ala
50 55 60
Ala Arg Ser Leu Arg Ser Gln Gly Phe Asp Gly Arg Leu Val Ile Val
65 70 75 80
Gly Asp Glu Cys His Gly Pro Tyr Asp Arg Pro Pro Leu Ser Lys Asp
85 90 95
Phe Leu Thr Gly Ala Thr Asp Pro Gly Arg Leu Ala Leu Ala Asp Ala
100 105 110
Glu Glu Ile Ala Glu Leu Asp Ala Glu Trp Leu Leu Gly Thr Arg Ala
115 120 125
Thr Gly Leu Asp Thr Gly Gly Arg Thr Val Leu Leu Asp Gly Gly Arg
130 135 140
Ser Leu Thr Thr Asp Gly Val Val Leu Ala Thr Gly Ala Ala Pro Arg
145 150 155 160
Leu Leu Pro Gly Pro Val Pro Ala Gly Val His Thr Leu Arg Thr Leu
165 170 175
Asp Asp Ala Gln Ala Leu Arg Ala Asp Leu Ala Pro Ala Pro Val Arg
180 185 190
Val Val Val Ile Gly Gly Gly Phe Ile Gly Ala Glu Val Ala Ser Ser
195 200 205
Cys Ala Ala Leu Gly His Asp Val Thr Val Val Glu Ala Ala Pro Leu
210 215 220
Pro Leu Val Pro Gln Leu Gly His Ala Met Ala Glu Ile Cys Ala Ala
225 230 235 240
Leu His Ala Asp His Gly Val Thr Leu Leu Thr Gly Thr Gly Val Ala
245 250 255
Arg Leu Arg Ser Glu Gly Asp Gly Arg Arg Val Thr Gly Val Glu Leu
260 265 270
Thr Asp Gly Arg Leu Leu Pro Ala Asp Val Val Val Val Gly Ile Gly
275 280 285
Val Arg Pro Arg Thr Ala Trp Leu Thr Asp Ser Gly Leu Pro Leu Asp
290 295 300
Asp Gly Val Leu Cys Asp Ala Gly Cys Val Thr Pro Leu Pro Ala Val
305 310 315 320
Val Ala Val Gly Asp Val Ala Arg Val Asp Gly Ala Arg Ala Glu His
325 330 335
Trp Thr Ser Ala Thr Glu Gln Ala Ala Val Ala Ala Arg Asn Leu Leu
340 345 350
Ala Gly Ser Thr Val Ala Thr His Arg Ser Leu Pro Tyr Phe Trp Ser
355 360 365
Asp Gln Tyr Gly Val Arg Ile Gln Phe Ala Gly His Arg Leu Pro Thr
370 375 380
Asp Thr Pro Arg Val Leu Glu Gly Ser Pro Asp Asp Arg Ser Phe Leu
385 390 395 400
Ala Cys Tyr Glu Arg Asp Gly Arg Thr Thr Ala Val Leu Ala Leu Asn
405 410 415
Arg Pro Arg Pro Phe Met Arg Leu Arg Arg Glu Leu Ala Arg Thr Ala
420 425 430
Leu Ser Ala Thr Thr
435
102
1233
DNA
Streptomyces
102
atggcccaga acacggcatt catcatcgcg ggagcggggc tggccggggc gaaggccgcg 60
gagacactgc gcgcggaggg cttcggcggc cccgtcctgc tgctgggcga cgagcgcgag 120
cgtccctacg agcggccgcc gctgtccaag ggctacctct tgggcacctc cgagcgggag 180
aaggcgtacg tccatccgcc ccagtggtac gccgagcacg acgtcgatct gcggctgggc 240
aacgccgtca ccgccctcga cccggccggc cacgaggtga ccctcgccga cggcagccgg 300
ctgggctacg ccaagctgct gctggccacc ggctccactc cgcgccggct gccggtgccc 360
ggcgccgacc tcgacggggt ccacacgctg cggtacctgg cggacagcga ccgcctcaag 420
gacctcttcc ggtccgcgtc ccggatcgtg gtgatcggcg gcggctggat cggcctggag 480
accacggccg ccgcgcgtgc ggcgggggtc gaggtgaccg tgctggagtc ggcgccgctg 540
cccctgctgg gggtgctggg ccgcgaggtc gcccaggtct tcgccgatct gcacaccgag 600
cacggtgtcg cgctgcgctg cgacacccag gtcacggaga tcaccggcac gaacggcgcg 660
gtcgacgggg tacggctggc cgacggcacc cggatcgcgg ccgacgcggt gatcgtcggc 720
gtcgggatca cccccaactc cgagacggcc gccgcggccg ggctcaaggt cgacaacggc 780
gtcgtcgtgg acgagcggct gtgctcctcc cacccggaca tctacgccgc cggcgacgtc 840
gccaacgcct accaccccct cctgggcaag cacctccgcg tcgagcactg ggccaacgcc 900
ctccaccagc cgaagaccgc ggcccgggcc atgctgggcg gggaggccgg ctacgaccgg 960
ctgccgtact tcttcaccga ccagtacgac ctgggcatgg agtacacggg gcatgtggag 1020
ccgggcgggt acgaccgcgt ggtgttccgc ggcgacaccg gtgcccgcga gttcatcgcc 1080
ttctggctct ccggcggccg ggtgctggcc gggatgaatg tgaacgtatg ggacgtcacc 1140
gacccgatcc gggccctggt ggcgagcggg cgggccgtgg accccgagcg gctcgccgac 1200
gcggacgtac cgctggcgga tctggtcccc tga 1233
103
410
PRT
Streptomyces
103
Met Ala Gln Asn Thr Ala Phe Ile Ile Ala Gly Ala Gly Leu Ala Gly
1 5 10 15
Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Gly Gly Pro Val
20 25 30
Leu Leu Leu Gly Asp Glu Arg Glu Arg Pro Tyr Glu Arg Pro Pro Leu
35 40 45
Ser Lys Gly Tyr Leu Leu Gly Thr Ser Glu Arg Glu Lys Ala Tyr Val
50 55 60
His Pro Pro Gln Trp Tyr Ala Glu His Asp Val Asp Leu Arg Leu Gly
65 70 75 80
Asn Ala Val Thr Ala Leu Asp Pro Ala Gly His Glu Val Thr Leu Ala
85 90 95
Asp Gly Ser Arg Leu Gly Tyr Ala Lys Leu Leu Leu Ala Thr Gly Ser
100 105 110
Thr Pro Arg Arg Leu Pro Val Pro Gly Ala Asp Leu Asp Gly Val His
115 120 125
Thr Leu Arg Tyr Leu Ala Asp Ser Asp Arg Leu Lys Asp Leu Phe Arg
130 135 140
Ser Ala Ser Arg Ile Val Val Ile Gly Gly Gly Trp Ile Gly Leu Glu
145 150 155 160
Thr Thr Ala Ala Ala Arg Ala Ala Gly Val Glu Val Thr Val Leu Glu
165 170 175
Ser Ala Pro Leu Pro Leu Leu Gly Val Leu Gly Arg Glu Val Ala Gln
180 185 190
Val Phe Ala Asp Leu His Thr Glu His Gly Val Ala Leu Arg Cys Asp
195 200 205
Thr Gln Val Thr Glu Ile Thr Gly Thr Asn Gly Ala Val Asp Gly Val
210 215 220
Arg Leu Ala Asp Gly Thr Arg Ile Ala Ala Asp Ala Val Ile Val Gly
225 230 235 240
Val Gly Ile Thr Pro Asn Ser Glu Thr Ala Ala Ala Ala Gly Leu Lys
245 250 255
Val Asp Asn Gly Val Val Val Asp Glu Arg Leu Cys Ser Ser His Pro
260 265 270
Asp Ile Tyr Ala Ala Gly Asp Val Ala Asn Ala Tyr His Pro Leu Leu
275 280 285
Gly Lys His Leu Arg Val Glu His Trp Ala Asn Ala Leu His Gln Pro
290 295 300
Lys Thr Ala Ala Arg Ala Met Leu Gly Gly Glu Ala Gly Tyr Asp Arg
305 310 315 320
Leu Pro Tyr Phe Phe Thr Asp Gln Tyr Asp Leu Gly Met Glu Tyr Thr
325 330 335
Gly His Val Glu Pro Gly Gly Tyr Asp Arg Val Val Phe Arg Gly Asp
340 345 350
Thr Gly Ala Arg Glu Phe Ile Ala Phe Trp Leu Ser Gly Gly Arg Val
355 360 365
Leu Ala Gly Met Asn Val Asn Val Trp Asp Val Thr Asp Pro Ile Arg
370 375 380
Ala Leu Val Ala Ser Gly Arg Ala Val Asp Pro Glu Arg Leu Ala Asp
385 390 395 400
Ala Asp Val Pro Leu Ala Asp Leu Val Pro
405 410
104
1266
DNA
Streptomyces
104
gtggtcgacg cacaccagac gttcgtcatc gtcgggggtg gcctggccgg cgcaaaggcc 60
gcggagactc tccgcgcgga agggttcacc ggccgggtga tcctcatctg tgacgagcgc 120
gaccacccgt acgagcgccc cccgctctcc aaggggttcc tgctcggcaa ggaagagcgc 180
gacagcgttt tcgtccacga acccgcctgg tacgcccagg cacagatcga actgcacctg 240
ggccagcccg ccgtccgcct cgaccccgag gcgaagaccg tccgcctcgg cgacggcacc 300
ctgatcgcct acgacaagct gctgctggcc accggcgccg agccgcgccg cctggacatc 360
cccggcaccg gcctggccgg cgtgcaccac ctgcgccgcc tcgcccacgc cgaacggctg 420
cgcggcgtcc tggcctccct cgggcgggac aacgggcatc tggtgatcgc cggcgccggc 480
tggatcggcc tggaggtcgc cgccgcggcc cgctcctacg gcgccgaggt caccgtcgtc 540
gaggccgccc cgacaccgct gcacggcatc ctggggcccg aactcggcgg cctgttcacc 600
gaactgcacc gcgcacacgg cgtgcgcttc cacttcggcg cccgtttcac cgagatcgtc 660
ggacaggacg gcatggtgct cgccgtgcgc accgacgacg gcgaggagca ccccgcccac 720
gacgtgctcg ccgcgatcgg cgccgccccg cgcaccgcac tcgccgaaca ggccggactc 780
gacctcgccg acccggaggc cggcggcggc gtggccgtcg acgcgacgct gcgcacctcc 840
gacccgtaca tctacgccgc cggcgacgtg gccgccgccg accaccccct cctggacacc 900
cggctgcgcg tcgaacactg ggccaacgcc ctcaacggcg gcccggccgc cgcgcgcgcc 960
atgctcggcc aggacatcag ctacgaccgc gtcccgtact tcttctccga ccagtacgac 1020
gtcggcatgg agtactccgg ctacgccccg cccggctcct acgcacaggt cgtctgccgc 1080
ggcgacgtcg ccaaacggga gttcatcgcg ttctggctcg gcgaggacgg acggctgctc 1140
gcggggatga acgtcaacgt ctgggacgtc gccgaaacca tccagcaact catccgcggc 1200
ggggtgcggt tggagcccgg cgagctggct gatccggagg ttccgctgac ctcactgctc 1260
ccgtag 1266
105
421
PRT
Streptomyces
105
Val Val Asp Ala His Gln Thr Phe Val Ile Val Gly Gly Gly Leu Ala
1 5 10 15
Gly Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Thr Gly Arg
20 25 30
Val Ile Leu Ile Cys Asp Glu Arg Asp His Pro Tyr Glu Arg Pro Pro
35 40 45
Leu Ser Lys Gly Phe Leu Leu Gly Lys Glu Glu Arg Asp Ser Val Phe
50 55 60
Val His Glu Pro Ala Trp Tyr Ala Gln Ala Gln Ile Glu Leu His Leu
65 70 75 80
Gly Gln Pro Ala Val Arg Leu Asp Pro Glu Ala Lys Thr Val Arg Leu
85 90 95
Gly Asp Gly Thr Leu Ile Ala Tyr Asp Lys Leu Leu Leu Ala Thr Gly
100 105 110
Ala Glu Pro Arg Arg Leu Asp Ile Pro Gly Thr Gly Leu Ala Gly Val
115 120 125
His His Leu Arg Arg Leu Ala His Ala Glu Arg Leu Arg Gly Val Leu
130 135 140
Ala Ser Leu Gly Arg Asp Asn Gly His Leu Val Ile Ala Gly Ala Gly
145 150 155 160
Trp Ile Gly Leu Glu Val Ala Ala Ala Ala Arg Ser Tyr Gly Ala Glu
165 170 175
Val Thr Val Val Glu Ala Ala Pro Thr Pro Leu His Gly Ile Leu Gly
180 185 190
Pro Glu Leu Gly Gly Leu Phe Thr Glu Leu His Arg Ala His Gly Val
195 200 205
Arg Phe His Phe Gly Ala Arg Phe Thr Glu Ile Val Gly Gln Asp Gly
210 215 220
Met Val Leu Ala Val Arg Thr Asp Asp Gly Glu Glu His Pro Ala His
225 230 235 240
Asp Val Leu Ala Ala Ile Gly Ala Ala Pro Arg Thr Ala Leu Ala Glu
245 250 255
Gln Ala Gly Leu Asp Leu Ala Asp Pro Glu Ala Gly Gly Gly Val Ala
260 265 270
Val Asp Ala Thr Leu Arg Thr Ser Asp Pro Tyr Ile Tyr Ala Ala Gly
275 280 285
Asp Val Ala Ala Ala Asp His Pro Leu Leu Asp Thr Arg Leu Arg Val
290 295 300
Glu His Trp Ala Asn Ala Leu Asn Gly Gly Pro Ala Ala Ala Arg Ala
305 310 315 320
Met Leu Gly Gln Asp Ile Ser Tyr Asp Arg Val Pro Tyr Phe Phe Ser
325 330 335
Asp Gln Tyr Asp Val Gly Met Glu Tyr Ser Gly Tyr Ala Pro Pro Gly
340 345 350
Ser Tyr Ala Gln Val Val Cys Arg Gly Asp Val Ala Lys Arg Glu Phe
355 360 365
Ile Ala Phe Trp Leu Gly Glu Asp Gly Arg Leu Leu Ala Gly Met Asn
370 375 380
Val Asn Val Trp Asp Val Ala Glu Thr Ile Gln Gln Leu Ile Arg Gly
385 390 395 400
Gly Val Arg Leu Glu Pro Gly Glu Leu Ala Asp Pro Glu Val Pro Leu
405 410 415
Thr Ser Leu Leu Pro
420
You are contracting for Methods and compositions for making emamectin
Expert Methods and compositions for making emamectin
You are commenting for Methods and compositions for making emamectin