(en)The present invention relates to a new essential downstream component of the wingless signaling pathway. In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster daughter of legless (doll) gene, of its encoded proteins, as well as derivatives, fragments and analogues thereof. The invention includes vertebrate and invertebrate homologues of the Doll protein, comprising proteins that contain a stretch of amino acids with similarity to the Drosophila Doll gene. Methods for producing the Doll protein, derivatives and analogs, e.g. by recombinant means, and antibodies to Doll are provided by the present invention as well. The invention also relates to methods for performing high throughput screening assays for compounds modulating Doll function in the Wnt pathway.
1.ApplicationNumber: US-47238504-A
1.PublishNumber: US-2004209264-A1
2.Date Publish: 20041021
3.Inventor: KRAMPS THOMAS
BASLER KONRAD
4.Inventor Harmonized: KRAMPS THOMAS(CH)
BASLER KONRAD(CH)
5.Country: US
6.Claims:
(en)The present invention relates to a new essential downstream component of the wingless signaling pathway. In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster daughter of legless (doll) gene, of its encoded proteins, as well as derivatives, fragments and analogues thereof. The invention includes vertebrate and invertebrate homologues of the Doll protein, comprising proteins that contain a stretch of amino acids with similarity to the Drosophila Doll gene. Methods for producing the Doll protein, derivatives and analogs, e.g. by recombinant means, and antibodies to Doll are provided by the present invention as well. The invention also relates to methods for performing high throughput screening assays for compounds modulating Doll function in the Wnt pathway.
7.Description:
(en)[0001] The present invention relates to a new essential downstream component of the wingless signaling pathway. In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster daughter of legless (doll) gene, of its encoded proteins, as well as derivatives, fragments and analogues thereof. The invention includes vertebrate and invertebrate homologues of the Doll protein, comprising proteins that contain a stretch of amino acids with similarity to the Drosophila Doll gene. Methods for producing the Doll protein, derivatives and analogs, e.g. by recombinant means, and antibodies to Doll are provided by the present invention as well. The invention also relates to methods for performing high throughput screening assays for compounds modulating Doll function in the Wnt pathway.
BACKGROUND OF THE INVENTION
[0002] Wnt genes encode a large family of secreted, cystein rich proteins that play key roles as intercellular signaling molecules in a wide variety of biological processes (for an extensive review see (Wodarz and Nusse 1998). The first Wnt gene, mouse wnt-1, was discovered as a proto-oncogene activated by integration of mouse mammary tumor virus in mammary tumors (Nusse and Varmus 1982). Consequently, the involvement of the Wnt pathway in cancer has been largely studied. With the identification of the Drosophila polarity gene wingless (wg) as a wnt-1 homologue (Cabrera, Alonso et al. 1987; Perrimon and Mahowald 1987; Rijsewijk, Schuermann et al. 1987), it became clear that wnt genes are important developmental regulators. Thus, although at first glance dissimilar, biological processes like embryogenesis and carcinogenesis both rely on cell communication via identical signaling pathways. In a current model of the pathway, the secreted Wnt protein binds to Frizzle cell surface receptors and activates the cytoplasmic protein Dishevelled (Dsh). Dsh then transmits the signal to a complex of several proteins, including the protein kinase Shaggy/GSK3, the scaffold protein Axin and β-Catenin, the vertebrate homologue of Armadillo. In this complex β-Catenin is targeted for degradation after being phosphorylated by Sgg. After Wnt signaling and the resulting down-regulation of Sgg activity, β-Catenin (or its Drosophila homologue Armadillo) escape from degradation and accumulate into the cytoplasm. Free cytoplasmic β-Catenin translocates to the nucleus by a still obscure mechanism, and modulates gene transcription through binding the Tcf/Lef family of transcription factors (Grosschedl R 1999).
[0003] This set up, in which the key transducer is continuously held in check, is highly susceptible to mutations in its inhibitory components. The loss of any of the three elements of the β-Catenin destruction complex leads to an increase in β-Catenin levels, and hence to the constitutive activation of the pathway. While this may reduce cellular viability, as upon loss of GSK-3 function, it can also lead to cell fate changes, uncontrolled proliferation and tumorigenic behavior as in the cases of APC and Axin (Barker N 1999; Morin 1999; Potter 1999; Roose and Clevers 1999; Waltzer and Bienz 1999). Attempts to counter these harmful situations must aim at curbing the nuclear activities of β-Catenin, either by preventing the formation of the β-Catenin-TCF complex or by interfering with its transcriptional activator function.
[0004] Currently, there are no known therapeutic agents effectively inhibiting β-Catenin transcriptional activation. This is partly due to the fact that many of the essential components required for its full activation and nuclear translocation are still unknown. consequently, there is an urge to understand more about this pathway in order to be able to develop effective drugs against these highly malignant diseases.
[0005] In order to identify new components required for Wingless activation the inventors used a Drosophila genetic approach to screen for dominant suppressors of the rough eye phenotype caused by ectopic expression of Wingless, the Drosophila homologue of Wnt, during eye development. Three genes were identified: the β-catenin homologue armadillo (arm), the tcf/lef-1 homologue pangolin (pan) and legless (lgs), a completely new gene (U.S. Ser. No. 09/915,543). The lgs gene was subsequently cloned and its in vivo requirement for Wingless signal transduction in embryo and in developing tissues was confirmed. The presence of Lgs is required for a transcriptional active Arm/Pangolin complex and over-expression of Lgs strongly stimulates the transcriptional output of this bipartite transcription factor. The human genome contains at least two human Lgs homologues. One of them, Bcl9, has been previously implicated in B cell malignancies (Willis, Zalcberg et al. 1998). It was also genetically and biochemically demonstrated that digs and hLgs bind to Armadillo and β-Catenin and are functionally required for Wnt signal propagation in human cells. However, genetic experiments strongly suggested the presence of a second protein which binds to Lgs and is essential for the function of the active β-Catenin-Pangolin-Lgs complex.
[0006] The present invention describes the cloning and functional characterization of a novel Drosophila protein, named Daughter of Legless (Doll), which binds to Lgs and is required for Wnt signaling. In addition, the invention provides the sequences of the functional and structural human and mouse homologues as well as methods to screen for compounds inhibiting Doll function in the Wnt pathway.
DEFINITIONS
[0007] The term “Doll polypeptide”, “Doll protein” when used herein encompasses native invertebrate and vertebrate Doll and Doll variant sequences (which are further defined herein).
[0008] A “wild type sequence Doll” comprises a polypeptide having the same amino acid sequence as a Doll protein derived from nature. Such wild type sequence of Doll can be isolated from nature or produced by recombinant and/or synthetic means. The term “wild type sequence Doll” specifically encompasses naturally occurring truncated forms, naturally occurring variant forms (e.g., alternatively spliced forms) and naturally occurring allelic variants of Doll. In one embodiment of the invention, the wild type Doll sequence is a mature or full-length Doll sequence comprising amino acids 1 to 815 of dDoll (FIG. 1), or 1 to 419 of hDoll-1, or 1 to 406 of hDoll-2 (FIG. 2), or 1 to 417 of mDoll-1, or 1 to 407 of mDoll-2 (FIGS. 3). “Doll variant” means an active Doll, having at least about 50% amino acid sequence identity with the amino acid sequence of a wild type Doll protein of FIG. 1, 2 and 3 . The term “Doll variant” however, does also include functional homologues of Doll in the Wnt pathway.
[0009] “Percent (%) amino acid sequence identity” with respect to the Doll sequences identified herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the Doll sequence, after aligning the sequence and introducing gaps, if necessary, to achieve the maximum percentage sequence identity, and not considering any conservative amino acid substitution as part of the sequence identity. The % identity values used herein can be generated by WU-BLAST-2, which was obtained from (Tatusova TA 1999). WU-BLAST-2 uses several search parameters, most of which are set to the default values.
[0010] The term “positive”, in the context of sequence comparison performed as described above, includes residues in the sequence compared that are not identical but have similar properties (e.g. as a result of a conservative substitution). The % value of positive is determined by the fraction of residues scoring a positive value in the BLOSUM 62 matrix divided by the total number of residues in the longer sequence as defined above.
[0011] In a similar manner, “percent (%) nucleic acid sequence identity” with respect to the coding sequence of the Doll polypeptides identified herein is defined as the percentage of nucleotide residues in a candidate sequence that are identical with the nucleotide residues in any of the Doll coding sequences of this invention. The identity values used herein can be generated using BLAST module of WU-BLAST-2 set to the default parameters.
[0012] The term “epitope tag” refers to a chimeric polypeptide comprising a Doll polypeptide fused to a “tag polypeptide”. The tag polypeptide has enough residues to provide an epitope against which an antibody can be made, yet is short enough that it does not interfere with the activity of the Doll polypeptide to which it is fused.
[0013] Nucleic acids are “operably linked” when are placed in a functional relationship with another nucleic acid sequence.
[0014] The term “epistasis” means hierarchy in gene action. Epistasis experiments are performed to place components of a signaling pathway in the right order.
[0015] The term “rescue experiments” are designed to determine which gene is responsible for a specific mutant phenotype. Specifically, mutant embryos are injected with coding or genomic DNA, and the effect of the introduced DNA is determined on the basis of the capacity to revert the mutant phenotype.
[0016] “Active” or “activity” refers to forms of Doll polypeptides that retain the biological and/or immunological activity. A preferred activity includes for instance the ability to modulate the Wnt signaling pathway.
[0017] The term “antagonist” is used in a broad sense, and includes any molecule that partially or fully inhibits, blocks or neutralizes a biological activity of Doll polypeptides described herein. In a similar way, the term “agonist” is used in the broadest sense and includes any molecule that mimics or support a biological activity of an active Doll polypeptide.
[0018] “Treatment” refers to both therapeutic treatments and prophylactic or preventive measures, wherein the objective is to prevent or slow down the targeted pathologic condition or disorder. Those in need of treatment include those already with the disorder as well as those prone to have the disorder or those in whom the disorder is to be prevented.
SUMMARY OF THE INVENTION
[0019] The present invention relates to a novel family of proteins present in insects and vertebrate organisms, referred to hereinafter as “Daughter of Legless (Doll)” proteins. These proteins play an essential role in the Wnt signaling pathway, and thus in the formation and maintenance of spatial arrangements and proliferation of tissues during development, and in the formation and growth of many human tumors.
[0020] In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster doll gene, of proteins encoded by said nucleotide sequences, as well as fragments, derivatives and structural and functional analogs thereof.
[0021] In a preferred embodiment the invention relates to the nucleotide and protein sequences of the human and mouse doll homologues, hdoll-1, hdoll-2 and mdoll-1 and mdoll-2, respectively.
[0022] In one embodiment, the isolated nucleic acid comprises a sequence encoding a polypeptide having at least 50% amino acid sequence identity, preferably at least about 70% sequence identity, more preferably at least 90% sequence identity, even more preferably at least about 95% sequence identity, yet even more preferably at least about 98% sequence identity, and most preferably 100% identity to (a) a fragment or the entire protein sequence of the Doll polypeptide shown in FIG. 1, or (b) the complement of the nucleic acid molecule coding for (a).
[0023] In another preferred embodiment, the isolated nucleic acid encodes a polypeptide having at least 50% amino acid sequence identity, preferably about 70% sequence identity, more preferably at least 90% sequence identity, even more preferably about 95% sequence identity, yet even more preferably about 98% sequence identity, and most preferably 100% identity to (a) a polypeptide which is part or the entire human Doll polypeptides of FIG. 2 a/b or (b) the complement of the nucleic acid molecule coding for (a).
[0024] In another embodiment, the isolated nucleic acid encodes a polypeptide sequence having at least 50% amino acid sequence identity, preferably about 70% sequence identity, more preferably at least 90% sequence identity, even more preferably about 95% sequence identity, yet even more preferably about 98% sequence identity, and most preferably 100% identity to (a) a polypeptide encoding part of the entire mouse Doll protein of FIG. 3 a/b or (b) the complement of the nucleic acid molecule coding for (a).
[0025] In a further embodiment, the isolated nucleic acid comprises a sequence encoding a polypeptide with a low overall amino acid sequence identity but shows a sequence identity of at least 60%, preferably at least 70%, more preferably at least 80%, even more preferably at least 90% and most preferably 100% in the conserved domains DHD and PHD (FIG. 4).
[0026] In yet another embodiment of the present invention isolated nucleic acids encode polypeptides having a function resembling that of the doll genes.
[0027] In another embodiment, the invention relates to a fragment of the Drosophila or vertebrate doll nucleic acid sequences that is applied as hybridization probe. Such nucleic acid fragments are about 20 to about 100 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, most preferably from 20 to 50 nucleotides in length and are derived from the nucleotides sequences shown in FIG. 1, 2 and 3 .
[0028] The invention further provides eucaryotic and procaryotic expression vectors comprising a nucleic acid molecule encoding Drosophila or vertebrate doll or a fragment thereof as shown in FIGS. 1, 2 and 3 . The vector can comprise any of the molecules or fragments thereof described above.
[0029] The invention also includes host cells comprising such a vector. By way of example, the host cells can be mammalian cells, yeast cells, insect cells, plant cells or bacteria cells.
[0030] Methods of production, isolation and purification of the Doll proteins, derivatives and analogs, e.g. by recombinant means, are also provided (see Example VI, below). In a specific aspect, the invention concerns an isolated Doll peptide, comprising an amino acid sequence of at least 80%, preferably at least about 85% sequence identity, more preferably at least 90% sequence identity, even more preferably at least 95% sequence identity, yet most preferably 100% identity with the amino acid sequences of FIGS. 1, 2 and 3 .
[0031] In yet another embodiment the invention relates to chimeric proteins comprising a Doll polypeptide fused to a heterologous polypeptide or amino acid sequence. An example of such chimeric molecule comprises a Doll polypeptide fused to an epitope tagged sequence, glutathione-S-transferase protein or to a protein with an enzymatic activity, such as beta-galactosidase or alkaline phosphatase as described in Example VI below.
[0032] In a further aspect the invention concerns an isolated full length Doll polypeptide (prepared as described in Example VI), comprising the amino acid sequences of FIG. 1, 2 and 3 , or any Doll polypeptide or a fragment thereof described in this invention sufficient to provide a binding site for an anti-Doll anti-body.
[0033] In another embodiment the invention provides antibodies that specifically recognize Doll polypeptides. The antibodies can be a polyclonal or a monoclonal preparation or fragments thereof. Polyclonal antibodies are prepared by immunization of rabbits with purified Doll polypeptides prepared as described in Example VI.
[0034] The invention also relates to transgenic animals, e.g. Drosophila, mice, rats, chicken, frogs, pigs or sheep, having a transgene, e.g., animals that include and preferably express a heterologous form of the Doll genes described herein, or that misexpress an endogenous or transgenic doll gene. Such a transgenic animal can serve as a model for studying diseases with disrupted Wnt signaling pathway, for the production of Doll proteins, or for drug screening.
[0035] In yet another embodiment, the invention also features animals, e.g. Drosophila, mice, rats, chicken, frogs, pigs or sheep, having a mutation in the doll gene, e.g. deletions, point mutations, foreign DNA insertions or inversions. Such animals can serve to study diseases characterized by disrupted Wnt function or in drug screening.
[0036] In addition, the invention relates to the use of Doll proteins, homologues, derivatives and fragments thereof as well as nucleic acids, derivatives and fragments thereof in therapeutic and diagnostic methods and compounds. In particular, the invention provides methods and compounds for treatment of disorders of cell fate, differentiation or proliferation by administration of a therapeutic compound of the invention. Such therapeutic compounds include: Drosophila and vertebrate Doll protein homologues or fragments thereof, antibodies or antibody fragments thereto, doll antisense DNA or RNA, doll double stranded RNA, and any chemical or natural occurring compound interfering with Doll function, synthesis or degradation. In particular, the invention provides methods to screen for chemical compounds, organic products or peptides interfering with Doll function in the Wnt pathway. In a preferred embodiment the screening method will be a cellular reporter gene assay or a protein-protein interaction assay.
[0037] In another embodiment, a screening assay based on protein-protein interaction is used to screen for compounds specifically inhibiting Doll-Lgs or Doll-interaction partner X.
[0038] The invention also provides methods to screen for chemical compounds, organic products or peptides interfering with Doll function in the Wnt pathway.
[0039] Furthermore, the invention comprises the use of the DHD domain in screening assays such as an in vitro protein-protein interaction assay or a protein-protein interaction in a host cell. Said assays are applied for the identification of chemical compounds, organic products, polypeptides or peptides interfering with Doll function in the Wnt pathway.
[0040] In a preferred embodiment, a therapeutic product according to the invention is administered to treat a cancerous condition or to prevent progression from a pre-neoplastic or non-malignant condition to a neoplastic or malignant state.
[0041] In other specific embodiments, a therapeutic product of the invention is administered to treat a blood disease or to promote tissue regeneration and repair. Finally disorders of cell fate, especially hyperproliferative or hypoproliferative disorders, involving aberrant or undesirable expression, or localization, or activity of the Doll protein can be diagnosed by detecting such levels.
BRIEF DESCRIPTION OF THE DRAWINGS
[0042]FIG. 1 The Drosophila doll CDNA and protein sequence.
[0043]FIG. 2 The human doll-1 and doll-2 cDNA and protein sequence.
[0044]FIG. 3 The mouse doll-1 and doll-2 CDNA and protein sequences.
[0045]FIG. 4. Drosophila and human Doll proteins contain a PED finger motif with which they bind to the ED1 of Lgs/BCL9
[0046] (A) Top: Schematic representation of Doll, human DOLL-1 and human DOLL-2. The two domains that show high sequence similarities are highlighted in dark gray (DHD: Doll homology domain) and red (PHD: plant homology domain). The DHD appears to be unique, as the inventors failed to find a similar sequence in other Drosophila or human proteins. GenBank accession numbers for Doll, hDoll-1, hDoll-2 are AF457206, AF457207, AF457208, respectively. Bottom: Multiple alignment of Drosophila, human and mouse Doll protein sequences.
[0047] (B) Alignment of amino acid sequences of DHD and PHD in Doll and its human homologues.
[0048] Similarities are boxed, identities shaded in gray. The numbers to the left indicate the positions of DHD and PHD within their respective protein sequences. For the DHD alignment a gap of 22 aa has been introduced in the Drosophila DHD (represented as (X)5)
[0049] (C) Mapping of the Lgs/BCL9 interaction site in Doll. Schematic representation of the proteins tested in the yeast-two-hybrid assay for their interactions with Lgs and BCL9. Results are indicated to the right (“bdg”).
[0050] (D) Mapping of the Doll interaction site in digs and hLgs/BCL9. Schematic representation of the proteins tested in the yeast-two-hybrid assay for their interactions with Lgs. The two proteins shown at the bottom were tested by a pull-down assay for both dlgs (numbers without brackets) and hLgs/BCL9 (numbers in parenthesis) with the same result (“bdg”). The deletion removing HD1 comprises aa 318-345 for Lgs and aa 177-204 for hLgs/BCL9. Fusion proteins used were S-Tag-dDoll (aa 542-815) and GST-hDOLL-2 (aa 301-406).
[0051]FIG. 5. doll is a segment polarity gene required for Wg signalling
[0052] (A-C) Cuticle preparations of larvae derived from wild-type (A), wg mutant (B), and doll mutant embryos (C). The doll 130 /doll 130 embryo in (C) is derived from a homozygous doll 130 mutant germ line clone (see Experimental Procedures) and displays a wg-like phenotype.
[0053] (D,E) doll functions downstream of dAPC2. Two cuticle preparations are shown from larvae that developed in the absence of maternal and zygotic wild-type dAPC2 function (McCartney et al., 1999). The embryo in (E) additionally lacks the maternal and zygotic function of doll (see Examples). In contrast to dAPC single mutant animals, which have strongly reduced denticle belts, double mutants display a doll-like phenotype.
[0054] (F-I) Confocal images of third instar wing imaginal disc preparations stained with antibodies against Doll (F,G) and Ptc (E,I). Wild-type animals show normal expression of these genes (F,H). Discs derived from doll 130 mutant larvae are small, yet express Ptc (I), but fail to express Dll (H). Lack of Dll expression may be an indirect consequence of the earlier wing-tonotum transformation in doll 130 larvae. However, we also see a strong reduction of Dll expression in doll 130 mutant cells from mosaic animals (not shown).
[0055]FIG. 6. Lgs and the PHD finger of Doll serve to assemble Doll and Arm
[0056] Schematic representation of Lgs (yellow) and Doll (light green) constructs that were used in transgene assays to assess their ability to rescue lgs or doll mutant animals. 1: Full-length Lgs (pOP216, aa 1-1464). 2: C-terminally truncated Lgs (pTK131, aa 1-583). 3: HD1-Gal11-HD2 (pTK153, aa 268-395 (HD1), aa 369-500 (Gal11), aa 465-596 (HD2)). 4: HD1-(HA)3-HD2(pTK143, aa 268-395 (HD1), aa 465-596 (HD2)). 5: Full-length Doll (pTK56, aa 1-815). 6: Doll[DPHD]-HD2 (pTK135, aa 1-740 (Doll[DPHD]), aa 483-561 (HD2)). Transgenes 1 to 4 are able to rescue lgs 20F homozygotes. An example for an adult animal rescued by transgene 3 is shown on the right. Transgene 5 can rescue doll 130 homozygotes. Transgene 6 can rescue doll 130 as well as lgs 20F homozygotes (photographs on the right).
[0057]FIG. 7: Rescue of ddoll-/-flies by expression of a human doll transgene. The lethality caused by the doll 130 /EP(3)1076 genotype can be fully rescued by a tubulin 1 promoter-driven transgene that contains either the coding region of the Drosophila doll gene (not shown) or that of one of its two human homologues hdoll-1 and hdoll-2.
[0058]FIG. 8: Effects of human Doll 1 and 2 on Tcf transcription. 293 cells were transiently transfected with the pTOPFLASH or pFOPFLASH luciferase reporters and different effector plasmids as indicated. A constitutively active form of β-Catenin (ΔN-β-Catenin, 50 ng) or human Doll-1 or hDoll-2 (350 ng) activate the pTOPFLASH reporter. Cotransfection of human Doll with ΔN-β-catenin strongly enhance the response.
DETAILED DESCRIPTION OF THE INVENTION
[0059] The Wnt signaling cascade is essential for the development of both invertebrates and vertebrates, and has been implicated in tumorigenesis. The Drosophila wg genes are one of the best characterized within the Wnt-protein family, which includes more than hundred genes. In the Drosophila embryo, wg is required for formation of parasegment boundaries and for maintenance of engrailed (en) expression in adjacent cells. The epidermis of embryo defective in wg function shows only a rudimentary segmentation, which is reflected in an abnormal cuticle pattern. While the ventral cuticle of wild type larvae displays denticle belts alternating with naked regions, the cuticle of wg mutant larvae is completely covered with denticles. During imaginal disc development, wg controls dorso-ventral positional information. In the leg disc, wg patters the future leg by the induction of ventral fate (Struhl and Basler 1993). In animals with reduced wg activity, the ventral half of the leg develops into a mirror image of the dorsal side (Baker 1988). Accordingly, reduced wg activity leads to the transformation of wing to notal tissue, hence the name of the gene (Sharma and Chopra 1976). In the eye disc, wg suppresses ommatidial differentiation in favor of head cuticle development, and is involved in establishing the dorso-ventral axis across the eye field (Heberlein, Borod et al. 1998).
[0060] Additional genes have been implicated in the secretion, reception or interpretation of the Wg signaling. For instance, genetic studies in Drosophila revealed the involvement of frizzled (Dfz), Dishevelled (dsh), shaggy/zeste-white-3 (sgg/zw-3), armadillo (arm), adenomatous polyposis coli (E-apc), axin, and pangolin (pan) in Wg signaling. The genetic order of these transducers has been established in which Wg acts through Dsh to inhibit Sgg, thus relieving the repression of Arm by Sgg, resulting in the cytoplasmic accumulation of Arm and its translocation to the nucleus. In the nucleus Arm interacts with Pan to activate transcription of target genes. Vertebrate homologues have been identified for all these components (for an updated review see (Peifer and Polakis 2000), suggesting that novel identified members of the Drosophila signaling pathway may likely have vertebrate counterparts.
[0061] Mutations leading to nuclear accumulation of the mammalian homologue of Arm, β-Catenin, and consequently to constitutive activation of the Wnt pathway have been observed in many types of cancer, including colon cancer, breast cancer, melanoma, hepatocellular carcinoma, ovarian cancer, endometrial cancer, medulloblastoma pilomatricomas, and prostate cancer (Morin 1999; Polakis, Hart et al. 1999). It is now apparent that deregulation of β-Catenin signaling is an important event in the genesis of these malignancies. However, there are still no known therapeutic agents effectively inhibiting β-Catenin transcriptional activation. This is partly due to the fact that many of the essential components required for its full activation and nuclear translocation are still unknown.
[0062] In order to identify new components required for Wingless activation the inventors used a Drosophila genetic approach to screen for dominant suppressors of the rough eye phenotype caused by ectopic expression of Wingless (Wg), the Drosophila homologue of Wnt, during eye development. A new gene, legless (lgs, U.S. Ser. No. 09/915,543) was identified as a strong dominant suppressor of the rough eye phenotype. The gene was subsequently cloned and its in vivo requirement for Wg signal transduction in embryo and in developing tissues was confirmed. The human genome contains at least two human Lgs homologues, hLgs/Bcl9 and hLgs-1. dLgs and hLgs bind to Armadillo and β-Catenin and are functionally required for Wnt signal propagation in invertebrate and vertebrate cells (U.S. Ser. No. 09/915,543). In particular, the presence of Lgs is required for a transcriptional active Arm/Pangolin complex and over-expression of Lgs strongly stimulates the transcriptional output.
[0063] The inventors later made the interesting observation that a mutant form of Lgs protein from which the β-Catenin interacting domain was deleted exhibited a strong dominant-negative effect on Wg-dependent patterning processes when expressed from a transgene in wild-type larvae (data not shown). This strongly suggested that Lgs normally interacts not only with Arm, but also with at least one additional component. In an effort to identify such components yeast-two-hybrid screens for interacting proteins were carried out. In two independent screens in which either the entire protein or the N-terminal half of Lgs was used as a bait, a novel PHD finger protein, referred to as Daughter-of-Legless (Doll), was identified as a Lgs binding protein (FIG. 4). The 815 amino acid residue Doll protein carries a C-terminal domain of 60 amino acids (FIG. 4 a ), which shows extensive homologies to the PHD (plant homology domain) finger, also known as LAP (leukemia associated protein) domain (Aasland, Gibson et al. 1995). This domain comprises a cysteine rich Zn-binding motif, that has been associated with proteins involved in chromatin-mediated regulation of transcription. The PHD finger of Doll is necessary and sufficient to mediate the interaction to Lgs (FIG. 4 c - d ). The inventors also demonstrate herein that this interaction is essential for Doll function.
[0064] The region of Lgs responsible for Doll-binding was mapped to the HD1 sequence (FIG. 4 d ). Moreover, two human homologues of the Drosophila doll gene were identified and isolated (FIG. 4 a ). The protein products of both human genes, hDOLL-1 and hDOLL-2, as well as their mouse homologues possess a highly conserved PHD finger which interacts with the HD1 of hLgs/BCL9 (FIG. 4 d ). The only other domain in Drosophila Doll, hDoll-1/hDoll-2 and mDoll-1/mDoll-2 that shows significant sequence homology is a 50 amino acid stretch in the N-terminal region, which is referred herein to as ‘Doll homology domain’ (DHD, FIG. 4 a,b ).
[0065] The interaction with Doll appears to be relevant for the in vivo function of Lgs, since a mutant form of Lgs with a deletion of HD1 was unable to rescue lgs mutant animals. The physical association of Doll and Lgs suggested that Doll, like Lgs, may be required for Wg signaling in vivo. To explore this hypothesis a proprietary collection of suppressors of the sev-wg phenotype was searched for mutations that map to the tip of the right arm of chromosome 3, the position of the doll gene. One such suppressor, Sup 130 , mapped to this position, and intriguingly, it showed dominant lethality in combination with the lgs allele lgs 17E (U.S. Ser. No. 09/915,543) (Sup 130 /+lgs 17E /+transheterozygous animals do not survive). The doll coding region was sequenced using genomic DNA from homozygous Sup 130 mutant larvae and a 14 bp deletion starting at amino acid position 751 was identified. Hence this allele is referred to as doll 130 and encodes a truncated Doll protein lacking the C-terminal PHD finger.
[0066] The lethality caused by the homozygous dol 130 genotype can be fully rescued by a tubulin l promoter-driven transgene that contains either the coding region of the Drosophila doll gene or that of one of its two human homologues hDoll-1 and hDoll-2 (FIG. 7). Thus, the vertebrate homologues of doll were confirmed genetically to be true functional homologues of Doll, and hence the vertebrate homoiogues are part of this invention. To assay the possible role of Doll in Wg signal transduction during development, embryos homozygous for the doll 130 mutation that derived from female germ cells equally mutant for doll were generated. Doll mRNA is maternally contributed and strongly and ubiquitously expressed during all the developmental stages. Consequently, only embryos lacking both embryonal and maternal doll are characterized by a severe segment polarity phenotype (FIG. 5A-C), while weaker loss of function doll mutants display pupal lethality with a partial or complete loss of the antennae and the legs. Mutant individuals lacking only zygotic function survive until early pupal stages and exhibit imaginal discs that are abnormally small. The Hh target gene ptc was expressed at wild-type levels in these discs, however, no expression of the Wg target Dll could be detected (FIG. 5F-I). These discs appear to lack the presumptive wing blade field and possess two primordia for the notum (FIGS. 5G and I). The fact that similar phenotypes are caused by loss of function of wg, dsh, arm, or lgs confirms the essential role of doll in the Wg signaling pathway.
[0067] To address the role of Doll in β-Catenin-mediated transcription a TCF reporter gene (TOPFLASH, (Morin, Sparks et al. 1997)) was used in immortalized human embryo kidney cells (HEK 293 cells). Low levels of a stable mutant form of β-Catenin (ΔN-β-catenin; (van de Wetering, Cavallo et al. 1997)) were introduced into these cells to partially stimulate the pathway. The additional expression of hDoll-1 (FIG. 8) or hDoll-2 (not shown) lead to a large increase in luciferase activity (30-fold). These levels are significantly higher than the sum of those produced by either treatment alone (FIG. 8). This potentiation of β-Catenin activity by hDoll-1 and 2 appears to be mediated by the interaction of endogenous TCF protein with its DNA target sites, as it is only observed with TOPFLASH, which contains five optimal TCF binding sites, but not with the control reporter FOPFLASH, which contains five mutated sites (Morin, Sparks et al. 1997). Thus this experiment adds supportive evidence to the notion that Doll proteins transduce Wnt signals by activating TCF target genes in a β-Catenin-dependent manner.
[0068] In summary, the protein-protein interactions demonstrated between Drosophila Doll and Lgs and those between their human homologues human Doll and hLgs/Bcl9, respectively, in conjunction with the genetic and cell biological data show that Doll proteins are positive regulators of the Wg and Wnt signaling pathways, respectively.
EXAMPLES
Example I
Isolation of Doll cDNA
[0069] The cDNA for Daughter of Legless (Doll) was isolated in two independent yeast genetic screens of a Drosophila cDNA-library for proteins directly binding to Lgs. Other DNA libraries can be used as well, such a genomic and CDNA libraries from vertebrate and invertebrate organisms. Other methods than a yeast-two hybrid screen can be used as well. Such methods include, but are not limited to, direct amplification using gene specific primers and standard methods known by people skill in the art. To perform the yeast two-hybrid screening for protein binding to Lgs CDNA sequences encoding the first 732 amino acids (“LgsN”) and the full-length protein of 1464 amino acids (“LgsFL”) were subcloned into a yeast expression vector (pLexA, Clontech), fusing them to the LexA DNA-binding domain. Subsequently these constructs were transformed into the LEU2-reporter yeast strain EGY48 together with the lacZ-reporter plasmid pSH18-34 and an embryonic Drosohila melanogaster cDNA-library fused to an acidic transcriptional activation domain (“RFLY-1” library, PNAS 93, 3011-3015). In a first step triple-transformant colonies containing the LgsN- or LgsFL-LexA-fusion constructs, respectively, the pSH18-34 reporter and a RFLY-1 library plasmid were grown on minimal selective medium plates for two days, harvested, thoroughly mixed, and stored as uniform aliquots. Then cells from one of these aliquots were transferred into permissive Galactose/Raffinose minimal selective liquid medium, and incubated with shaking at 30° C. for a few hours, thereby inducing expression of the library cDNA-activation domain fusion from the GAL1-inducible promotor. Finally these “induced” cells were plated on Galactose/Raffinose minimal selective medium plates lacking the amino acid 1-leucine. On these plates cell growth was sustained only upon activation of a LEU2-selector gene through molecular interaction of the respective LexA-fusion and activation domain-fusion proteins. The LEU2-gene codes for an essential metabolic enzyme needed for the biosynthesis of leucine from other amino acid precursors. All clones growing under these restrictive conditions were isolated and analyzed for the activity of the lacZ-reporter gene, encoding the metabolic enzyme β-Galactosidase from the enterobacterium E.coli , by a standard X-Gal assay (e.g. Bartel and Fields (eds.), Oxford University Press 1997). From all candidate clones that passed these two selection steps, the cDNA-library plasmids were isolated again by standard techniques (e.g. Methods in Yeast Genetics, Cold Spring Harbour Laboratory Press, 1997) and retested for specific interaction with Lgs in the X-Gal assay, using an unrelated LexA-fusion protein as a negative control. By this procedure three independent cDNA-clones were identified, that strongly and specifically interacted only with Lgs and contained partially overlapping sequences: BK12b, BK14b and TK5.35h. By searching the Drosophila genome database using the blastn algorithm (http://www.ncbi.nlm.nih.gov:80/BLAST/) we mapped the three isolated cDNAs to the CG11518 locus, coding for a protein product of 815 amino acids in length. The cDNA-clones coded different parts of the Doll protein, with BK12b containing nucleotides (nt) 2223-2448, BK14b nt 2191-2448 and TK4.35h nt 749-2448 of the computationally predicted open reading frame (ORF). Further bioinformatical analysis (http://www.ebi.ac.uk/interpro/) revealed that the very C-terminal part of the protein sequence (ca. aa 745-805), present in all three of the Lgs-binding clones, was predicted to adapt a PHD-finger fold, which has been identified in other proteins involved in transcriptional regulation at different levels.
Example II
Identification of Human and Mouse Homologues of Drosophila Doll
[0070] After the identification of the Drosophila Doll amino acid sequence, publicly available databases were searched for similar protein sequences in other species, using the tblastn algorithm (http://www.ncbi.nlm.nih.gov:80/BLAST/). Two candidate sequences were found each in ESTs from Mus musculus and Homo sapiens, respectively, the putative protein products of which display high similarity to Doll in their C-terminal domains. These stretches of high similarity are predicted to adapt a PHD-finger fold as well (FIG. 4). Doll proteins do not display other known structural motifs in their N-terminal sequences but they display a second high homology domain, which was accordingly named DOLL Homology Domain (bHD) (FIG. 4). Doll proteins of both invertebrate and vertebrate origin have so far not been further described or experimentally studied, and have thus not previously been implicated in any specific biological process.
Example III
Isolation and Mapping of Drosophila Doll Alleles
[0071] EMS-treated males were crossed to females carrying a wg transgene (sev-wg) driven by two copies of the sevenless enhancer (Basler, Christen et al. 1991). 2×105 progeny were screened for suppressors of the rough eye phenotype. Third chromosomal suppressors were coarsely mapped by meiotic recombination using a panel of P[y+] insertions. One such suppressor, Sup 130 , showed intriguingly dominant lethality in combination with the lgs allele lgs 17E (U.S. Ser. No. 09/915,543) (Sup 130 /+lgs 17E /+transheterozygous animals do not survive), strongly suggesting a close genetic interaction. Fine mapping of the mutation using denaturing HPLC (WAWE system, Transgenomic Inc.) demonstrated that it localizes within the doll gene. The doll coding region was therefore sequenced using PCR fragments covering the doll coding region derived from genomic DNA from homozygous Sup 130 mutant larvae. The defect in Sup 130 was found to be a 14 bp deletion (nucleotides 2253 to 2266: 5′ CATGTGCCACAAGG 3′) within the doll open reading frame that induced a frame-shift subsequent to amino acid 751 and resulted in the formation of a premature stop codon. Hence this allele is referred to as doll 130 and encodes a truncated Doll protein lacking the C-terminal PHD finger.
[0072] Pole cell transplantation, chromosome squashes, and chromosome in situ hybridization experiments were carried out according to standard protocols (Ashburner 1989).
Example IV
Use of Doll as a Hybridization Probe
[0073] The following method describes the use of a non-repetitive nucleotide sequence of doll as a hybridization probe. The method can be applied to screen for doll homologues in other organisms as well. DNA comprising the sequence of doll (as shown in FIGS. 1 , 2 , 3 ) is employed as probe to screen for homologue DNAs (such as those included in CDNA or genomic libraries). Hybridization and washing of the filters containing either library DNAs is performed under standard high stringency conditions (Sambrook, Fritsch et al. 1989). Positive clones can be used to further screen larger cDNA library platings. Representative cDNA-clones are subsequently cloned into pBluescript (Stratagene) or similar cloning vectors and sequenced.
Example V
Use of Doll as a Hybridization Probe for in situ Hybridization
[0074] In situ hybridization of Drosophila doll mRNA can be performed in embryo as described in (Tautz and Pfeifle 1989). However, with small modifications it can also be used to detect any mRNA transcript in Drosophila larval imaginal discs or vertebrate tissue sections. Labeled RNA probes can be prepared from linearized doll CDNA (as showed in FIGS. 1 , 2 , 3 ), or a fragment thereof, using the DIG RNA labeling Kit (SP6/T7) (Boehringer Mannheim) following the manufacturer's recommendations.
Example VI
Expression of Doll in Drosophila melanogaster
[0075] Doll can be expressed in Drosophila in the whole organism, in a specific organ or in a specific cell type, during the whole life or only at a specific developmental stage, and at different levels. An overview of the standard methods used in Drosophila genetics can be found in (Brand and Perrimon 1993; Perrimon 1998; Perrimon 1998).
[0076] Generation of Doll Mutant Embryos
[0077] Mosaic germlines are generated with the help of site-specific recombination through the FLP recombinase (Xu and Rubin 1993).
[0078] Females of the genotype hsp70:flp, FRT82 doll 130 /FRT82 ubi-GFP are heat-shocked at 37° C. for 1 hr during the third instar larval stage to induce FLβ-directed recombination and later mated to doll 130 /TM6b[y+] males. Germline mosaics are induced. The source of recombinase is a first chromosome insertion of a fusion of the hsp 70 promoter (denoted by “hsp70”) to the FLP coding sequence. Somatic recombination at the FRT82 sites gives rise to adult female germ line that produces oocytes that upon fertilization lead to embryos which do not contain neither zygotic nor maternally contributed information for the production of functional dDoll protein. Those embryos can be identified by the absence of the yellow+phenotype provided by the TM6b[y+] paternal balancer chromosome. For analysis, cuticles are prepared by standard techniques from mutant embryos, and examined by dark field microscopy.
[0079] dApC2 doll doublemutant germ line clones were generated with an FRT82 dAPC2 DS doll 130 chromosome. The FRT82 ovo D1 chromosome (Chou and Perrimon 1996) was used to select for mutant germ cells. The FRT82 doll 133 chromosome was also used to create doll mutant clones in discs, in conjunction with an FRT82 arm-lacz chromosome.
[0080] Generation of Doll Mutant Embryos Expressing Constitutively Active Arm
[0081] In order to express constitutively active Arm (“ΔArm”),h females of the genotype described above are heat shocked at 37° C. for 1 hr during late pupal stages and mated to males of the genotype UAS:ΔArm hsp70-Gal4/UAS:ΔArm hsp70-Gal4; doll/TM6b[y+]. Due to the presence of the additional transgenes in these males off-spring that had arisen from a doll mutant oocytes and doll mutant sperm express upon heat treatment the constitutively active Arm protein, that transiently induced Wingless target genes.
Example VII
Rescue of Ddoll-/-flies with Hdoll-1 and Hdoll-2 cDNA Expression
[0082] In order to confirm the functional homology between Drosophila and human Doll-1 and human Doll-2, the human genes were introduced into Drosophila flies carrying two mutant doll alleles (ddoll-/-flies) Specifically, flies carrying e.g. a tub:hdoll transgene, and two mutant doll alleles, e.g. doll 130 and EP(3)1076 (publicly available) were generated. ddoll-/-mutant flies display larval or pupal lethality. In contrast, ddoll-/-mutant flies carrying at least one copy of the tub:hdoll-1 or tub:hdoll-2 transgenes survive to adulthood. This demonstrates that both, hDoll-1 and hDoll-2, can replace endogenous ddoll function in flies and thus validates functional homology between Drosophila and human Doll (FIG. 7).
Example VIII
Protein Production and Purification of Doll in E. coli
[0083] The following method describes recombinant expression of Doll proteins in bacterial cells. DNA encoding full-length or a truncated Doll form is fused e.g. downstream of an epitope tag or glutathione-S-transferase (GST) cDNA and a thrombin or enterokinase cleavage site contained within an inducible bacterial expression vector. Such epitope tags include poly-his, S-protein, thioredoxin and immunoglobin tags. A variety of plasmids can be employed, including commercially available plasmid such as pGEX-4T (Pharmacia) or pET-32a (Novagen).
[0084] Briefly, a bacterial expression plasmid containing the doll sequence, for instance fused to a GST-sequence, is transformed by conventional methods into protease deficient E.coli such as BL21 (Stratagene). A bacterial colony containing the plasmid is then expanded overnight in selection medium to reach saturation. The next morning, this culture is diluted 1:100 and bacterial are allowed to growth to an optical density (OD 600 ) of 0.6. Protein production is initiated by addition of an inducer of the promoter under which GST-Doll fusion protein is expressed. A variety of inducers can be employed depending on the expression vector used, including IPTG.
[0085] Expressed GST tagged Doll can then be purified, for instance, using affinity beads or affinity chromatography, such as glutathione beads (commercially available from Pharmacia). Extracts are prepared by lysing the Doll-expressing bacteria in sonication buffer (10 mM Tris HCl pH 8.0, 150 mM NaCl, 1 mM EDTA, 1.5% sarkosyl, 2% Triton-X-100, 1 mM DTT and protease inhibitors), followed by short sonication on ice (e.g. 3 times 20 seconds at middle power) and centrifugation. Cleared supernatants are then incubated under gentle rotation for example with glutathione beads for 1 hrs at 4° C. Next beads are washed several time in washing buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl 2 , 0.5% NP40), and finally stored in storage buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 10% glycerol, 0.5% NP40, and proteinase inhibitors).
[0086] Alternatively, a His-tagged, S-protein, thioredoxin or IgG tagged Doll can be purified using affinity chromatography.
[0087] The quality of the preparations can be checked e.g. by SDS-gel electrophoresis and silver staining or Western blot.
[0088] In case the epitope-tag has to be cleaved, several methods are available depending on the presence of a cleavage site between the epitope-tag and the Doll protein. For example, it is possible to produce a GST-Doll fusion protein containing a thrombin cleavage site right before the first Doll amino acid. Briefly, a GST-Doll preparation on glutathione-affinity beads is washed several times in cleavage buffer (50 mM Tris HCl pH 7.0, 150 mM NaCl, 1 mM EDTA, 1 mM DTT). Thrombin is then added and the samples are incubated for over 16 h at room temperature. Supernatants are then collected and analyzed for successful cleavage of Doll from the beads by polyacrylamide gel electrophoresis and silver staining or Western blot.
Example IX
Protein-protein Interactions Involving Doll
[0089] A GST-fusion protein in vitro binding assay can be performed to map binding domains and find additional interaction partners. For this purpose, proteins are in vitro translated using reticulocyte lysates (e.g. TNT-lysates, Promega Corporation) containing [3 5 S] methionine following the instructions provided by the manufacturer. Alternatively, cellular proteins can be labeled by incubation of culture cells with [ 35 S]methionine. Glutathione Stransferase (GST) fusion proteins, produced as illustrated in the Example VIII, are immobilized on glutathione-Sepharose and blocked in binding buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl 2 , 10% glycerol, 0.5% NP40, 0.05% BSA, and proteinase inhibitors) for 45 min. Two μg of immobilized GST proteins are then incubated for 1.5 hrs with 0.5-6 μl of in vitro translated proteins in binding buffer or with [ 35 S]methionine labeled cell extract. The beads are washed four times in washing buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl 2 , 0.5% NP40) and boiled in Laemmli SDS sample buffer. Proteins binding to Doll are detected by autoradiography. In case that a cell lysate were used to identify novel Doll binding partner, the protein bands on the gel can be isolated by methods known in the art, and the protein sequence can be determined e.g. by mass spectrophotometrical analysis.
[0090] A yeast two hybrid assay can additionally be performed to confirm the results of the in vitro binding assays described above or to screen cDNA library for new interaction partners (Fields and Sternglanz 1994). In this context, the desired cDNAs are subcloned into appropriate yeast expression vectors that link them either to a Lex DNA binding domain (e.g. pLexA, Clontech) or an acidic activation domain (e.g. pGJ4-5, Clontech). The appropriate pair of plasmids is then transformed together with a reporter plasmid (e.g. pSH18-34, Clontech) into an appropriate yeast strain (e.g. EGY48) by the lithium acetate-polyethylene glycol method and grown on selective media (Sambrook, Fritsch et al. 1989). Transformants are analyzed for reporter gene activity as described by the manufacturer of the vector-reporter plasmid used. To establish reproducibility the interactions is tested in both directions. Alternatively, this method is used to screen for novel Doll interaction partners. In this context, e.g. pLexA-Doll is transfected into yeast together with a cDNA library cloned into e.g. pGJ4-5 as described above. Positive clones can be isolated and the cDNA they contain can be sequenced by methods known by people skilled in the art.
Example X
Immunohistochemistry
[0091] Localization of the Doll proteins is performed on Drosophila embryo, imaginal discs, invertebrate and vertebrate adult tissue sections or tumor cell lines using the anti-Doll antibodies provided by this invention. For instance, if a tumor cell line is used, cells can be seeded into polylysine-coated 8 well chambers (Nalge-Nunc Internat.) and grown overnight at 37° C. As a positive control, 293 MEK cells (ATCC) cells might be transfected e.g. by a lipofection method (e.g. Lipofectamine, Gibco technologies) with a Doll expression plasmid, such as pcDNA3.1 (Invitrogen). Two days after transfection, cells are washed and fixed with 3.7% formaldehyde in PBS for 10 min, permeabilized in 0.5% Triton-X-100 for another 10 min, and blocked with a 1:1000 dilution of pre-immunoserum in 2% BSA-PBS for 1 h at RT. Cells are then incubated with a 1:1000 dilution of anti-Doll immunoserum for 2 hrs at RT, followed by washing in PBS and staining with anti-rabbit secondary-antibody. The washing step is repeated and preparations are blocked in a solution of 3% BSA in PBS/0.1% TritonX-100 for 1 hr. The slides are then washed three times for 5 min in PBS and incubated with a 1:200 dilution (v/v) of TRITC-conjugated swine anti-rabbit immunoglobulin (Dako, Inc.). The washing step is repeated before applying coverslips using Vectashield® mounting medium (Vector Laboratories, Inc.). Detection of other proteins such as β-Catenin, hLgs or Tcf can be performed in the same way using anti-β-Catenin (commercially available), anti-hLgs (U.S. Ser. No. 09/915,543) or anti-Tcf (commercially available) specific antibodies, respectively.
Example XI
Luciferase Reporter Gene Assays
[0092] The effect of Doll on Tcf transactivation activity can be performed in a cell culture system using a Tcf responsive luciferase reporter gene. Depending on the expression vector used, this protocol can be applied for mammalian as well as for Drosophila cell lines. For instance, HEK293 cells (ATCC) are a well suitable system. Hereby, Doll full length cDNA is cloned into a mammalian expression vector, such as pcDNA3 (Invitrogen), and transfected together with the TOPFLASH luciferase reporter plasmid (Upstate biotechnology, New York, USA) into 293 cells. A lipofection agent like the Lipofectamine transfection reagent (Life Technologies, Inc.) can be used for this purpose. A renilla luciferase reporter plasmid, e.g. pRL-SV40, (Promega Corporation, Madison USA), is co-transfected to normalize the transfection efficiency. Cell extracts are prepared 48 h after transfection and assayed for firefly and renilla luciferase activity as described by the manufacturer (Dual luciferase reporter assay system, Promega Corporation). All the luciferase values are normalized for renilla luciferase activity (see FIG. 8).
Example XII
Screening of Chemical Compounds, Organic Products or Peptides Interfering with Doll Function
[0093] A reporter gene assay is performed with a similar protocol as described in example XI, but scaled down to be performed as a high throughput screening. For this purpose colon cancer cell lines with mutated and/or constitutively active β-Catenin are stably transfected with the Topflash vector described in Example XI and Doll cDNA. The established monoclonal population, which gives the most reliable and constant reporter gene activity is selected for later assays. One day after plating, cells are treated with single compounds derived from a chemical or peptide library. One to 24 hours later reporter gene activity is measured. Compounds found to inhibit reporter gene activity are then further characterized for specific activity on the Doll-containing transcriptional complex. Alternatively, Wnt pathway activity can be measured by detecting mRNA or protein levels of a target gene, e.g. myc (He, Sparks et al. 1998).
Example XIII
Screening Assay Based on Protein-protein Interaction for Compounds Inhibiting Doll-Lgs or Doll Interaction Partner X
[0094] Doll and its interaction partner or fragments thereof are produced and purified e.g. from E.coli cultures (e.g. as described in example VIII). Proteins are tagged e.g. with 6 histidines, Sprotein, GST or thioredoxin. Small aliquots of the purified proteins are incubated in an appropriate binding buffer. At this point chemical compounds are added to the mixture and their capacity to disrupt the protein-protein interaction is monitored e.g. by any of the methods described below. Compounds inhibiting this interaction are subsequently tested for their specificity and in vivo toxicity. Well established methods to monitor protein-protein interactions are e.g.:
[0095] Time resolved fluorometry with lanthanide chelate labels (Hemmilä I. And Webb S. DDT 2: 373-381 (1997))
[0096] Scintillation proximity assay (SPA) (Amersham life Science)
[0097] Fluorescence polarisation
References
[0098] Aasland, R., T. J. Gibson, et al. (1995). “The PHD finger: implications for chromatin-mediated transcriptional regulation.” Trends Biochem Sci 20(2): 56-9.
[0099] Ashburner, M. (1989). Drosophila A laboratory handbook . Cold Sring Harbor, N.Y., Cold Spring Harbor Laboratory Press.
[0100] Baker, N. E. (1988). “Transcription of the segment-polarity gene wingless in the imaginal discs of Drosophila, and the phenotype of a pupal-lethal wg mutation.” Development 102(3): 489-97.
[0101] Barker N, H. G., Korinek V, Clevers H (1999). “Restricted high level expression of Tcf-4 protein in intestinal and mammary gland epithelium.” Am J Pathol 154: 29-35.
[0102] Basler, K., B. Christen, et al. (1991). “Ligand-independent activation of the sevenless receptor tyrosine kinase changes the fate of cells in the developing Drosophila eye.” Cell 64(6): 1069-81.
[0103] Brand, A. H. and N. Perrimon (1993). “Targeted gene expression as a means of altering cell fates and generating dominant phenotypes.” Development 118(2): 401-15.
[0104] Cabrera, C. V., M. C. Alonso, et al. (1987). “Phenocopies induced with antisense RNA identify the wingless gene.” Cell 50(4): 659-63.
[0105] Chou, T. B. and N. Perrimon (1996). “The autosomal FLP-DFS technique for generating germline mosaics in Drosophila melanogaster.” Genetics 144(4): 1673-9.
[0106] Fields, S. and R. Sternglanz (1994). “The two-hybrid system: an assay for protein-protein interactions.” Trends Genet 10(8): 286-92.
[0107] Grosschedl R, E. Q. (1999). “Regulation of LEF-1TCF transcription factors by Wnt and other signals.” Current Opinion in Cell Biology 11: 233-240.
[0108] He, T. C., A. B. Sparks, et al. (1998). “Identification of c-MYC as a target of the APC pathway [see comments].” Science 281(5382): 1509-12.
[0109] Heberlein, U., E. R. Borod, et al. (1998). “Dorsoventral patterning in the Drosophila retina by wingless.” Development 125(4): 567-77.
[0110] Morin, P. J. (1999). “beta-catenin signaling and cancer.” Bioessays 21(12): 1021-30.
[0111] Morin, P. J., A. B. Sparks, et al. (1997). “Activation of betacatenin-Tcf signaling in colon cancer by mutations in betacatenin or APC [see comments].” Science 275(5307): 1787-90.
[0112] Nusse, R. and H. E. Varmus (1982). “Many tumors induced by the mouse mammary tumor virus contain a provirus integrated in the same region of the host genome.” Cell 31(1): 99-109.
[0113] Peifer, M. and P. Polakis (2000). “Wnt signaling in oncogenesis and embryogenesis—a look outside the nucleus.” Science 287(5458): 1606-9.
[0114] Perrimon, N. (1998). “Creating mosaics in Drosophila.” Int j Dev Biol 42(3): 243-7.
[0115] Perrimon, N. (1998). “New advances in Drosophila provide opportunities to study gene functions.” Proc Natl Acad Sci USA 95(17): 9716-7.
[0116] Perrimon, N. and A. P. Mahowald (1987). “Multiple functions of segment polarity genes in Drosophila.” Dev Biol 119(2): 587-600.
[0117] Polakis, P., M. Hart, et al. (1999). “Defects in the regulation of beta-catenin in colorectal cancer.” Adv Exp Med Biol 470: 23-32.
[0118] Potter, J. D. (1999). “Colorectal cancer: molecules and populations.” Journal of the National Cancer Institute 91(11): 916-32.
[0119] Rijsewijk, F., M. Schuermann, et al. (1987). “The Drosophila homolog of the mouse mammary oncogene int-1 is identical to the segment polarity gene wingless.” Cell 50(4): 649-57.
[0120] Roose, J. and H. Clevers (1999). “TCF transcription factors: molecular switches in carcinogenesis.” Biochimica et Biophysica Acta 1424 (2-3): M23-37.
[0121] Sambrook, J., E. F. Fritsch, et al. (1989). Molecular cloning: a laboratory manual , Cold Spring Harbor Laboratory Press.
[0122] Sharma, R. P. and V. L. Chopra (1976). “Effect of the Wingless (wg1) mutation on wing and haltere development in Drosophila melanogaster.” Dev Biol 48(2): 461-5.
[0123] Struhl, G. and K. Basler (1993). “Organizing activity of wingless protein in Drosophila.” Cell 72(4): 527-40.
[0124] Tatusova TA, M. T. (1999). “Blast 2 sequences—a new tool for comparing protein and nucleotide sequences.” FEMS Microbiol Lett . 174: 247-250.
[0125] Tautz, D. and C. Pfeifle (1989). “A non-radioactive in situ hybridization method for the localization of specific RNAs in Drosophila embryos reveals translational control of the segmentation gene hunchback.” Chromosoma 98(2): 81-5.
[0126] van de Wetering, M., R. Cavallo, et al. (1997). “Armadillo coactivates transcription driven by the product of the Drosophila segment polarity gene dTCF.” Cell 88(6): 789-99.
[0127] Waltzer, L. and M. Bienz (1999). “The control of beta-catenin and TCF during embryonic development and cancer.” Cancer & Metastasis Reviews 18(2): 231-46.
[0128] Willis, T. G., I. R. Zalcberg, et al. (1998). “Molecular cloning of translocation t(1;14) (q21;q32) defines a novel gene (BCL9) at chromosome lq21 .” Blood 91(6): 1873-81.
[0129] Wodarz, A. and R. Nusse (1998). “Mechanisms of Wnt signaling in development.” Annual Review of Cell & Developmental Biology 14: 59-88.
[0130] Xu, T. and G. M. Rubin (1993). “Analysis of genetic mosaics in developing and adult Drosophila tissues.” Development 117(4): 1223-37.
1
10
1
2448
DNA
Drosophila melanogaster
1
atgacccaca atcttggtat ggcgccatat cgattgccgg gtccagcggg cggactctgt 60
ccgcccgatt ttaagccgcc gcctcccacg gacatcatct cggcgccgag caatccgaag 120
aagcggcgaa aaacctcaag tgccgccaac tccgctgcag cggtggctgc ggcggcggct 180
gcagcagctg ctgcgaattc catgcagcag cagcaggcgc cacccacacc gcaggatttg 240
ctgccccctc cgccaatggg aggcttcgga gacaccatta ttgcctcgaa tccattcgac 300
gacagtcccc aggtgtcggc gatgtccagc tcagcggccg cggcgatggc ggccatgaat 360
cagatgggcg gcggaccagg aggtggtcac tttggcggcg gtggaccggg tgggcacccg 420
cactgggaag accgcatggg catgggcggt ggacctcctc ccccgcctca catgcatccc 480
catatgcacc cgcatcatcc aggcggacct atgggtcacc cacatggccc acatccgcac 540
atgggtggtc cacctccaat gcgaggaatg agccccatgc acccccatca aatgggaccg 600
ggaccaggcg tcggactacc gccgcatatg aatcacggaa ggccaggggg acctggtggt 660
cctggaggac ccgtcccaat gggtagtccc atgggtggaa tagctggcat gggcggcatg 720
agcccaatgg gcggaatggg aggccccagc atatcacccc atcacatggg catgggtggt 780
ctgtcgccca tgggaggcgg tcccaacgga cccaatccgc gagccatgca gggttcaccg 840
atgggcggtc cggggcagaa ctcgccaatg aactcactgc ctatgggttc gccaatgggc 900
aatccaattg gcagcccgtt gggccctccc tcgggaccgg gccctgggaa tcccggcaat 960
accggcggac cacagcagca acaacaacaa cctccgcagc caccgatgaa caacgggcag 1020
atgggtcctc ctcctctgca cagtccgctc ggaaacggac caacgggtca tggcagtcac 1080
atgcctggag gaccaatccc aggaccaggt cctgggcctg gcggcctagt aggtcccggt 1140
ggcatctccc ccgcgcacgg caataacccg ggtggttctg ggaacaacat gctcggcggg 1200
aatcccggcg gcggcaacag caacaacaac ggaagcaata caagtaacgc cagcaacaac 1260
aatcaaaatc ctcacctctc gccagcagcc ggacgcctgg gagtgccgac gtcgatgcag 1320
tcgaatggac cttcggtatc atcggtagcc tcctcatcgg ttccctcgcc cgccacgccc 1380
acgctcacgc ccacatcgac ggccacgtcc atgtccacgt cagtgcctac atcctcgcca 1440
gcgccgcccg ccatgtcacc gcatcactcg ctaaacagcg ccggcccgag tccgggcatg 1500
cccaactcgg gacccagccc gctgcagtca ccagccggac ccaatggccc caataacaac 1560
aacagcaata acaacaacgg acccatgatg ggccagatga tcccgaacgc agttcctatg 1620
cagcaccagc agcacatggg cggcggccca cctggccacg ggcccggacc aatgcccgga 1680
atgggcatga accaaatgct gccaccgcag caaccctccc atcttggtcc cccgcatccg 1740
aatatgatga accacccgca tcatccgcac catcatcctg gcggaccacc gccgcacatg 1800
atgggtggac ccggaatgca cggcggtcct gctggaatgc ctcctcatat gggcggagga 1860
cctaatccgc acatgatggg cggtccgcac gggaacgcgg gtccgcacat gggccacggc 1920
cacatgggtg gagtaccagg tccaggaccc ggacccggcg gcatgaacgg acccccgcat 1980
ccgcacatgt ccccgcacca cggacatccg catcaccacc acaatccgat gggcggccca 2040
ggtccaaata tgttcggcgg tggtggagga ggtcccatgg gtcccggtgg accgatgggc 2100
aacatggggc ccatgggagg tggcccgatg ggcggcccta tgggcgtagg tcccaagccg 2160
atgacaatgg gcggcgggaa gatgtacccg ccgggacagc caatggtctt taatccgcag 2220
aacccgaatg cgccgcccat atatccttgt ggcatgtgcc acaaggaggt gaacgacaac 2280
gacgaagccg tgttctgtga atccggttgt aactttttct ttcacagaac ctgtgttggc 2340
ctgacagagg cggccttcca aatgctcaac aaggaggtgt ttgccgagtg gtgctgcgac 2400
aagtgcgtgt cttccaagca tattcccatg gtcaagttca agtgttga 2448
2
1366
DNA
Homo sapiens
2
ggatccccac atgcccgccg agaactctcc agctcccgct tacaaagttt cctcgcatgg 60
tggtgatagt ggactggatg ggttaggagg accaggtgta caactaggaa gcccagataa 120
gaaaaagcgc aaggcaaata cacagggacc ttctttccct ccattgtctg agtatgctcc 180
accaccgaat ccaaactctg accatctagt ggctgctaat ccatttgatg acaactataa 240
tactatttcc tataaaccac taccttcgtc aaatccatat cttggccctg gttatcctgg 300
ctttggaggc tatagtacat tcagaatgcc acctcacgtt cccccaagaa tgtcttcccc 360
atactgtggt ccttactcac tcaggaacca gccacaccca tttcctcaga atcctctggg 420
catgggtttt aatcgacctc atgcttttaa ctttgggcca catgataatt caagtttcgg 480
taatccatct tataataatg cactaagtca gaatgtcaac atgcctaatc aacattttag 540
acaaaatcct gctgaaaatt tcagtcaaat tcctccacag aatgctagcc aagtttctaa 600
ccccgatttg gcatctaatt ttgttcctgg aaataattca aattttactt ctccgttaga 660
atctaatcat tcttttattc ctcccccaaa cacttttggt caagcaaaag caccaccccc 720
aaaacaagac tttactcaag gagcaaccaa aaacactaat caaaattcct ctgctcatcc 780
acctcacttg aatatggatg acacagtgaa tcagagtaat attgaattaa aaaatgttaa 840
tcgaaacaat gcagtaaatc aggagaacag ccgttcaagt agcactgaag ccacaaacaa 900
taaccctgca aatgggacgc agaataagcc acgacaacca agaggtgcag cagatgcctg 960
caccacagaa aaaagcaata aatcctctct tcacccaaac cgtcatggcc attcgtcttc 1020
tgacccagtg tatccttgtg gaatttgtac aaacgaggtg aacgatgatc aggatgccat 1080
cttatgtgag gcctcttgtc agaaatggtt tcatcggatc tgtactggaa tgactgaaac 1140
agcttatggc ctcttaactg cagaagcatc tgcagtatgg ggctgtgata cctgtatggc 1200
tgacaaagat gtccagttaa tgcgtactag agaaactttt ggtccatctg cagtgggcag 1260
tgatgcttaa tcaaaggcat taactaaagt gggtttattt tcctgtgcat tgcagaagtt 1320
cattgacaca ggattttaat gttttacatt atttttttaa atgcat 1366
3
1449
DNA
Homo sapiens
3
cccgggtccc ccactccatg gccgcctcgg cgccgccccc accggacaag ctggagggag 60
gtggcggccc cgcaccgccc cctgcgccgc ccagcaccgg gaggaagcag ggcaaggccg 120
gtctgcaaat gaagagtcca gaaaagaagc gaaggaagtc aaatactcag ggccctgcat 180
actcacatct gacggagttt gcaccacccc caactcccat ggtggatcac ctggttgcat 240
ccaacccttt tgaagatgac ttcggagccc ccaaagtggg ggttgcagcc cctccattcc 300
ttggcagtcc tgtgcccttc ggaggcttcc gtgtgcaggg gggcatggcg ggccaggtac 360
ccccaggcta cagcactgga ggtggagggg gcccccagcc actccgtcga cagccacccc 420
ccttccctcc caatcctatg ggccctgctt tcaacatgcc cccccagggt cctggctacc 480
cacccccagg caacatgaac tttcccagcc aacccttcaa ccagcctctg ggtcaaaact 540
ttagtcctcc cagtgggcag atgatgccgg gcccagtggg gggatttggt cccatgatct 600
cacccaccat gggacagcct cccagagcag agctgggccc accttctctg tcccaacgat 660
ttgctcagcc aggggctcct tttggccctt ctcctctcca gagacctggt caggggctcc 720
ccagcctgcc gcctaacaca agtccctttc ctggtccgga ccctggcttt cctggccctg 780
gtggtgagga tggggggaag cccttgaatc cacctgcttc tactgctttt ccccaggagc 840
cccactcagg ctccccggct gctgctgtta atgggaacca gcccagtttc cccccgaaca 900
gcagtgggcg gggtgggggc actccagatg ccaacagctt ggcaccccct ggcaaggcag 960
gtgggggctc cgggccccag cctcccccag gcttggtgta cccatgtggt gcctgtcgga 1020
gtgaggtgaa cgatgaccag gatgccattc tgtgtgaggc ctcctgccag aaatggttcc 1080
accgtgagtg cacaggcatg actgagagcg cctatgggct gctgaccact gaagcttctg 1140
ccgtctgggc ctgcgatctc tgcctcaaga ccaaggagat ccagtctgtc tacatccgtg 1200
agggcatggg gcagctggtg gctgctaacg atgggtgacg ctggtgaagt ggcccaggga 1260
agtgcacatg tctctccctg ctcttccagg gtgatttttt tgatgtttgg ctcttggtcc 1320
ttgtttccac tggctttcca tccccatggg gcagaaacag tggctcctgg gagcagaaaa 1380
ggaattgagg tgggcaggca gaagagcctg gattgctcac tgttttggga aacttacatg 1440
ttgagatct 1449
4
1254
DNA
Mus musclus
4
atgtcagcgg aacaggacaa ggagcccatc gcgctgaaga gagttagagg tggtgacagt 60
ggactggatg ggttaggagg gcccaatata caactcggaa gcccagataa gaaaaaacgc 120
aaggccaaca cacagggatc ttcctttcct ccgctgtcag agtacgcgcc accaccgaat 180
ccaaactccg accatctagt ggctgccaat ccgttcgatg acagctacaa caccatttcc 240
tataaaccac tgccttcatc taatccatat cttggccctg ggtatcctgg ctttggaggc 300
tacagcacat tcagaatgcc accccacgtc cctccaagaa tgtcttctcc ctactgtggt 360
ccttactcac tcaggaatca gccccaccca tttcctcaga atccgttggg catgggcttt 420
aaccggcctc atgcttttaa ctttgggccg catgataatt cgaattttgg aaacccacct 480
tataataatg tactgactca ggacattaac atgcccggtc agcattttag acaaggctct 540
gctgaaaact tcagtcagat tcccccgcag aatgttggcc aagtgtctaa ccctgacctc 600
gcatctaatt ttgcccctgg gaataattca aattttacct ctccgttaga aacgaatcat 660
tcgtttattc cacccccaaa cgcgtttggc caagcaaaag ctccacttcc caaacaagac 720
ttcactcaag gggcaaccaa aaccccgaat cagaattcgt ccactcaccc acctcaccta 780
aatatggagg atccagtcaa tcagagtaac gtcgagttaa aaaatgtcaa cagaaacaac 840
gttgtccaag agaacagccg ttcgggcagc gcagaggcca ccaacaacca tgcgaatggg 900
acccagaaca agccccggca gcccaggggc gcagctgacc tgtgcacccc cgacaaaagc 960
cgcaagttct ccctgctccc cagccggcat ggccattcct cctctgaccc tgtgtacccg 1020
tgcgggattt gtacaaatga agtgaatgac gatcaggacg ccattctgtg tgaagcctct 1080
tgtcagaagt ggtttcatcg catctgcact ggaatgaccg aaacagccta cgggctcctg 1140
acagcggaag catccgcagt gtggggctgt gacacgtgca tggctgacaa ggatgtccag 1200
ctcatgcgca ctagagaggc ctttggtcca cctgccgtgg gcggcgatgc ctaa 1254
5
1221
DNA
Mus musclus
5
atggccgcct cggcgccgcc cccaccggac aagctggagg gaggcagcgg ccccgcaccg 60
ccccccgcgc cgcccagcac cgggaggaag cagggcaagg ccggtctgca aatgaagagc 120
cctgaaaaga agcgaagaaa gtccaatact cagggtcctg catattcaca tctgacggag 180
ttcgccccac ccccgacccc catggtggat catctggtcg cttctaaccc ttttgaggat 240
gatttcggag cccctaaagt ggggggcgca ggccctccgt tcctcggcag tccggtgccc 300
tttggaggct ttcgtgtaca ggggggcatg gcaggccagg tacccccaag ctacggcact 360
ggaggaggag ggggtcccca gccacttcgt cggcagcccc ctccttttcc ccccagccct 420
atgggtccag cttttaatat gccccctcag ggtccctggg gtaccccgcc ccctggcaac 480
atgaactttc ccagtcaacc cttcaaccag tctctgggcc aaaactttag cccacctggt 540
gggcaggtga tgccaggccc agtaggcgga tttggtccca tgatctcacc gaccatggga 600
cagcctccta gaggggagct gggtcctcct cctctccccc aacgctttac ccaaccagga 660
gcaccttatg gtccttctct tcagagacct ggtcagggac tcacccagct gccctccaac 720
acaagtccct tccctggtcc agaccctggt tttcctggac ctggcggtga ggatggtggg 780
aagcccttga acccaccggc tcccaccgcc tttccccagg aagcaccatt cgggctcccc 840
gctgctgctg tcaatgggaa tcagcccagt ttccccccta gcagcagtgg tcgaggtggg 900
ggcactccag atgccaacag tctggcaccc cccggcaagg cagggggagg ctcagggccc 960
cagcctcccc caggcctggt gtacccctgc ggtgcctgcc gtagtgaggt aaatgatgac 1020
caggatgcca ttctgtgtga ggcctcctgc cagaagtggt ttcaccgcga gtgcaccggc 1080
atgaccgaga gtgcctacgg cctgctgacc accgaggcct ctgccgtctg ggcctgtgat 1140
ctttgcctca agaccaagga gatccagtct gtctacatcc gagagggcat gggccagttg 1200
gtggctgcta acgatgggtg a 1221
6
815
PRT
Drosophila melanogaster
6
Met Thr His Asn Leu Gly Met Ala Pro Tyr Arg Leu Pro Gly Pro Ala
1 5 10 15
Gly Gly Leu Cys Pro Pro Asp Phe Lys Pro Pro Pro Pro Thr Asp Ile
20 25 30
Ile Ser Ala Pro Ser Asn Pro Lys Lys Arg Arg Lys Thr Ser Ser Ala
35 40 45
Ala Asn Ser Ala Ala Ala Val Ala Ala Ala Ala Ala Ala Ala Ala Ala
50 55 60
Ala Asn Ser Met Gln Gln Gln Gln Ala Pro Pro Thr Pro Gln Asp Leu
65 70 75 80
Leu Pro Pro Pro Pro Met Gly Gly Phe Gly Asp Thr Ile Ile Ala Ser
85 90 95
Asn Pro Phe Asp Asp Ser Pro Gln Val Ser Ala Met Ser Ser Ser Ala
100 105 110
Ala Ala Ala Met Ala Ala Met Asn Gln Met Gly Gly Gly Pro Gly Gly
115 120 125
Gly His Phe Gly Gly Gly Gly Pro Gly Gly His Pro His Trp Glu Asp
130 135 140
Arg Met Gly Met Gly Gly Gly Pro Pro Pro Pro Pro His Met His Pro
145 150 155 160
His Met His Pro His His Pro Gly Gly Pro Met Gly His Pro His Gly
165 170 175
Pro His Pro His Met Gly Gly Pro Pro Pro Met Arg Gly Met Ser Pro
180 185 190
Met His Pro His Gln Met Gly Pro Gly Pro Gly Val Gly Leu Pro Pro
195 200 205
His Met Asn His Gly Arg Pro Gly Gly Pro Gly Gly Pro Gly Gly Pro
210 215 220
Val Pro Met Gly Ser Pro Met Gly Gly Ile Ala Gly Met Gly Gly Met
225 230 235 240
Ser Pro Met Gly Gly Met Gly Gly Pro Ser Ile Ser Pro His His Met
245 250 255
Gly Met Gly Gly Leu Ser Pro Met Gly Gly Gly Pro Asn Gly Pro Asn
260 265 270
Pro Arg Ala Met Gln Gly Ser Pro Met Gly Gly Pro Gly Gln Asn Ser
275 280 285
Pro Met Asn Ser Leu Pro Met Gly Ser Pro Met Gly Asn Pro Ile Gly
290 295 300
Ser Pro Leu Gly Pro Pro Ser Gly Pro Gly Pro Gly Asn Pro Gly Asn
305 310 315 320
Thr Gly Gly Pro Gln Gln Gln Gln Gln Gln Pro Pro Gln Pro Pro Met
325 330 335
Asn Asn Gly Gln Met Gly Pro Pro Pro Leu His Ser Pro Leu Gly Asn
340 345 350
Gly Pro Thr Gly His Gly Ser His Met Pro Gly Gly Pro Ile Pro Gly
355 360 365
Pro Gly Pro Gly Pro Gly Gly Leu Val Gly Pro Gly Gly Ile Ser Pro
370 375 380
Ala His Gly Asn Asn Pro Gly Gly Ser Gly Asn Asn Met Leu Gly Gly
385 390 395 400
Asn Pro Gly Gly Gly Asn Ser Asn Asn Asn Gly Ser Asn Thr Ser Asn
405 410 415
Ala Ser Asn Asn Asn Gln Asn Pro His Leu Ser Pro Ala Ala Gly Arg
420 425 430
Leu Gly Val Pro Thr Ser Met Gln Ser Asn Gly Pro Ser Val Ser Ser
435 440 445
Val Ala Ser Ser Ser Val Pro Ser Pro Ala Thr Pro Thr Leu Thr Pro
450 455 460
Thr Ser Thr Ala Thr Ser Met Ser Thr Ser Val Pro Thr Ser Ser Pro
465 470 475 480
Ala Pro Pro Ala Met Ser Pro His His Ser Leu Asn Ser Ala Gly Pro
485 490 495
Ser Pro Gly Met Pro Asn Ser Gly Pro Ser Pro Leu Gln Ser Pro Ala
500 505 510
Gly Pro Asn Gly Pro Asn Asn Asn Asn Ser Asn Asn Asn Asn Gly Pro
515 520 525
Met Met Gly Gln Met Ile Pro Asn Ala Val Pro Met Gln His Gln Gln
530 535 540
His Met Gly Gly Gly Pro Pro Gly His Gly Pro Gly Pro Met Pro Gly
545 550 555 560
Met Gly Met Asn Gln Met Leu Pro Pro Gln Gln Pro Ser His Leu Gly
565 570 575
Pro Pro His Pro Asn Met Met Asn His Pro His His Pro His His His
580 585 590
Pro Gly Gly Pro Pro Pro His Met Met Gly Gly Pro Gly Met His Gly
595 600 605
Gly Pro Ala Gly Met Pro Pro His Met Gly Gly Gly Pro Asn Pro His
610 615 620
Met Met Gly Gly Pro His Gly Asn Ala Gly Pro His Met Gly His Gly
625 630 635 640
His Met Gly Gly Val Pro Gly Pro Gly Pro Gly Pro Gly Gly Met Asn
645 650 655
Gly Pro Pro His Pro His Met Ser Pro His His Gly His Pro His His
660 665 670
His His Asn Pro Met Gly Gly Pro Gly Pro Asn Met Phe Gly Gly Gly
675 680 685
Gly Gly Gly Pro Met Gly Pro Gly Gly Pro Met Gly Asn Met Gly Pro
690 695 700
Met Gly Gly Gly Pro Met Gly Gly Pro Met Gly Val Gly Pro Lys Pro
705 710 715 720
Met Thr Met Gly Gly Gly Lys Met Tyr Pro Pro Gly Gln Pro Met Val
725 730 735
Phe Asn Pro Gln Asn Pro Asn Ala Pro Pro Ile Tyr Pro Cys Gly Met
740 745 750
Cys His Lys Glu Val Asn Asp Asn Asp Glu Ala Val Phe Cys Glu Ser
755 760 765
Gly Cys Asn Phe Phe Phe His Arg Thr Cys Val Gly Leu Thr Glu Ala
770 775 780
Ala Phe Gln Met Leu Asn Lys Glu Val Phe Ala Glu Trp Cys Cys Asp
785 790 795 800
Lys Cys Val Ser Ser Lys His Ile Pro Met Val Lys Phe Lys Cys
805 810 815
7
419
PRT
Homo sapiens
7
Met Pro Ala Glu Asn Ser Pro Ala Pro Ala Tyr Lys Val Ser Ser His
1 5 10 15
Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Gly Val Gln Leu
20 25 30
Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Pro Ser
35 40 45
Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp
50 55 60
His Leu Val Ala Ala Asn Pro Phe Asp Asp Asn Tyr Asn Thr Ile Ser
65 70 75 80
Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro
85 90 95
Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro
100 105 110
Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125
His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140
Ala Phe Asn Phe Gly Pro His Asp Asn Ser Ser Phe Gly Asn Pro Ser
145 150 155 160
Tyr Asn Asn Ala Leu Ser Gln Asn Val Asn Met Pro Asn Gln His Phe
165 170 175
Arg Gln Asn Pro Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Ala
180 185 190
Ser Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Val Pro Gly Asn
195 200 205
Asn Ser Asn Phe Thr Ser Pro Leu Glu Ser Asn His Ser Phe Ile Pro
210 215 220
Pro Pro Asn Thr Phe Gly Gln Ala Lys Ala Pro Pro Pro Lys Gln Asp
225 230 235 240
Phe Thr Gln Gly Ala Thr Lys Asn Thr Asn Gln Asn Ser Ser Ala His
245 250 255
Pro Pro His Leu Asn Met Asp Asp Thr Val Asn Gln Ser Asn Ile Glu
260 265 270
Leu Lys Asn Val Asn Arg Asn Asn Ala Val Asn Gln Glu Asn Ser Arg
275 280 285
Ser Ser Ser Thr Glu Ala Thr Asn Asn Asn Pro Ala Asn Gly Thr Gln
290 295 300
Asn Lys Pro Arg Gln Pro Arg Gly Ala Ala Asp Ala Cys Thr Thr Glu
305 310 315 320
Lys Ser Asn Lys Ser Ser Leu His Pro Asn Arg His Gly His Ser Ser
325 330 335
Ser Asp Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp
340 345 350
Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His
355 360 365
Arg Ile Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala
370 375 380
Glu Ala Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp
385 390 395 400
Val Gln Leu Met Arg Thr Arg Glu Thr Phe Gly Pro Ser Ala Val Gly
405 410 415
Ser Asp Ala
8
406
PRT
Homo sapiens
8
Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Gly
1 5 10 15
Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30
Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45
Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60
Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp
65 70 75 80
Asp Phe Gly Ala Pro Lys Val Gly Val Ala Ala Pro Pro Phe Leu Gly
85 90 95
Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110
Gln Val Pro Pro Gly Tyr Ser Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125
Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Asn Pro Met Gly Pro Ala
130 135 140
Phe Asn Met Pro Pro Gln Gly Pro Gly Tyr Pro Pro Pro Gly Asn Met
145 150 155 160
Asn Phe Pro Ser Gln Pro Phe Asn Gln Pro Leu Gly Gln Asn Phe Ser
165 170 175
Pro Pro Ser Gly Gln Met Met Pro Gly Pro Val Gly Gly Phe Gly Pro
180 185 190
Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Ala Glu Leu Gly Pro
195 200 205
Pro Ser Leu Ser Gln Arg Phe Ala Gln Pro Gly Ala Pro Phe Gly Pro
210 215 220
Ser Pro Leu Gln Arg Pro Gly Gln Gly Leu Pro Ser Leu Pro Pro Asn
225 230 235 240
Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly
245 250 255
Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Ser Thr Ala Phe Pro
260 265 270
Gln Glu Pro His Ser Gly Ser Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285
Pro Ser Phe Pro Pro Asn Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp
290 295 300
Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro
305 310 315 320
Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335
Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350
Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu
355 360 365
Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys
370 375 380
Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400
Val Ala Ala Asn Asp Gly
405
9
417
PRT
Mus musclus
9
Met Ser Ala Glu Gln Asp Lys Glu Pro Ile Ala Leu Lys Arg Val Arg
1 5 10 15
Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Asn Ile Gln Leu
20 25 30
Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Ser Ser
35 40 45
Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp
50 55 60
His Leu Val Ala Ala Asn Pro Phe Asp Asp Ser Tyr Asn Thr Ile Ser
65 70 75 80
Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro
85 90 95
Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro
100 105 110
Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125
His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140
Ala Phe Asn Phe Gly Pro His Asp Asn Ser Asn Phe Gly Asn Pro Pro
145 150 155 160
Tyr Asn Asn Val Leu Thr Gln Asp Ile Asn Met Pro Gly Gln His Phe
165 170 175
Arg Gln Gly Ser Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Val
180 185 190
Gly Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Ala Pro Gly Asn
195 200 205
Asn Ser Asn Phe Thr Ser Pro Leu Glu Thr Asn His Ser Phe Ile Pro
210 215 220
Pro Pro Asn Ala Phe Gly Gln Ala Lys Ala Pro Leu Pro Lys Gln Asp
225 230 235 240
Phe Thr Gln Gly Ala Thr Lys Thr Pro Asn Gln Asn Ser Ser Thr His
245 250 255
Pro Pro His Leu Asn Met Glu Asp Pro Val Asn Gln Ser Asn Val Glu
260 265 270
Leu Lys Asn Val Asn Arg Asn Asn Val Val Gln Glu Asn Ser Arg Ser
275 280 285
Gly Ser Ala Glu Ala Thr Asn Asn His Ala Asn Gly Thr Gln Asn Lys
290 295 300
Pro Arg Gln Pro Arg Gly Ala Ala Asp Leu Cys Thr Pro Asp Lys Ser
305 310 315 320
Arg Lys Phe Ser Leu Leu Pro Ser Arg His Gly His Ser Ser Ser Asp
325 330 335
Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp Asp Gln
340 345 350
Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His Arg Ile
355 360 365
Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala Glu Ala
370 375 380
Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp Val Gln
385 390 395 400
Leu Met Arg Thr Arg Glu Ala Phe Gly Pro Pro Ala Val Gly Gly Asp
405 410 415
Ala
10
406
PRT
Mus musclus
10
Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Ser
1 5 10 15
Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30
Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45
Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60
Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp
65 70 75 80
Asp Phe Gly Ala Pro Lys Val Gly Gly Ala Gly Pro Pro Phe Leu Gly
85 90 95
Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110
Gln Val Pro Pro Ser Tyr Gly Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125
Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Ser Pro Met Gly Pro Ala
130 135 140
Phe Asn Met Pro Pro Gln Gly Pro Trp Gly Thr Pro Pro Pro Gly Asn
145 150 155 160
Met Asn Phe Pro Ser Gln Pro Phe Asn Gln Ser Leu Gly Gln Asn Phe
165 170 175
Ser Pro Pro Gly Gly Gln Val Met Pro Gly Pro Val Gly Gly Phe Gly
180 185 190
Pro Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Gly Glu Leu Gly
195 200 205
Pro Pro Pro Leu Pro Gln Arg Phe Thr Gln Pro Gly Ala Pro Tyr Gly
210 215 220
Pro Ser Leu Gln Arg Pro Gly Gln Gly Leu Thr Gln Leu Pro Ser Asn
225 230 235 240
Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly
245 250 255
Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Pro Thr Ala Phe Pro
260 265 270
Gln Glu Ala Pro Phe Gly Leu Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285
Pro Ser Phe Pro Pro Ser Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp
290 295 300
Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro
305 310 315 320
Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335
Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350
Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu
355 360 365
Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys
370 375 380
Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400
Val Ala Ala Asn Asp Gly
405
1.PublishNumber: US-2004209264-A1
2.Date Publish: 20041021
3.Inventor: KRAMPS THOMAS
BASLER KONRAD
4.Inventor Harmonized: KRAMPS THOMAS(CH)
BASLER KONRAD(CH)
5.Country: US
6.Claims:
(en)The present invention relates to a new essential downstream component of the wingless signaling pathway. In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster daughter of legless (doll) gene, of its encoded proteins, as well as derivatives, fragments and analogues thereof. The invention includes vertebrate and invertebrate homologues of the Doll protein, comprising proteins that contain a stretch of amino acids with similarity to the Drosophila Doll gene. Methods for producing the Doll protein, derivatives and analogs, e.g. by recombinant means, and antibodies to Doll are provided by the present invention as well. The invention also relates to methods for performing high throughput screening assays for compounds modulating Doll function in the Wnt pathway.
7.Description:
(en)[0001] The present invention relates to a new essential downstream component of the wingless signaling pathway. In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster daughter of legless (doll) gene, of its encoded proteins, as well as derivatives, fragments and analogues thereof. The invention includes vertebrate and invertebrate homologues of the Doll protein, comprising proteins that contain a stretch of amino acids with similarity to the Drosophila Doll gene. Methods for producing the Doll protein, derivatives and analogs, e.g. by recombinant means, and antibodies to Doll are provided by the present invention as well. The invention also relates to methods for performing high throughput screening assays for compounds modulating Doll function in the Wnt pathway.
BACKGROUND OF THE INVENTION
[0002] Wnt genes encode a large family of secreted, cystein rich proteins that play key roles as intercellular signaling molecules in a wide variety of biological processes (for an extensive review see (Wodarz and Nusse 1998). The first Wnt gene, mouse wnt-1, was discovered as a proto-oncogene activated by integration of mouse mammary tumor virus in mammary tumors (Nusse and Varmus 1982). Consequently, the involvement of the Wnt pathway in cancer has been largely studied. With the identification of the Drosophila polarity gene wingless (wg) as a wnt-1 homologue (Cabrera, Alonso et al. 1987; Perrimon and Mahowald 1987; Rijsewijk, Schuermann et al. 1987), it became clear that wnt genes are important developmental regulators. Thus, although at first glance dissimilar, biological processes like embryogenesis and carcinogenesis both rely on cell communication via identical signaling pathways. In a current model of the pathway, the secreted Wnt protein binds to Frizzle cell surface receptors and activates the cytoplasmic protein Dishevelled (Dsh). Dsh then transmits the signal to a complex of several proteins, including the protein kinase Shaggy/GSK3, the scaffold protein Axin and β-Catenin, the vertebrate homologue of Armadillo. In this complex β-Catenin is targeted for degradation after being phosphorylated by Sgg. After Wnt signaling and the resulting down-regulation of Sgg activity, β-Catenin (or its Drosophila homologue Armadillo) escape from degradation and accumulate into the cytoplasm. Free cytoplasmic β-Catenin translocates to the nucleus by a still obscure mechanism, and modulates gene transcription through binding the Tcf/Lef family of transcription factors (Grosschedl R 1999).
[0003] This set up, in which the key transducer is continuously held in check, is highly susceptible to mutations in its inhibitory components. The loss of any of the three elements of the β-Catenin destruction complex leads to an increase in β-Catenin levels, and hence to the constitutive activation of the pathway. While this may reduce cellular viability, as upon loss of GSK-3 function, it can also lead to cell fate changes, uncontrolled proliferation and tumorigenic behavior as in the cases of APC and Axin (Barker N 1999; Morin 1999; Potter 1999; Roose and Clevers 1999; Waltzer and Bienz 1999). Attempts to counter these harmful situations must aim at curbing the nuclear activities of β-Catenin, either by preventing the formation of the β-Catenin-TCF complex or by interfering with its transcriptional activator function.
[0004] Currently, there are no known therapeutic agents effectively inhibiting β-Catenin transcriptional activation. This is partly due to the fact that many of the essential components required for its full activation and nuclear translocation are still unknown. consequently, there is an urge to understand more about this pathway in order to be able to develop effective drugs against these highly malignant diseases.
[0005] In order to identify new components required for Wingless activation the inventors used a Drosophila genetic approach to screen for dominant suppressors of the rough eye phenotype caused by ectopic expression of Wingless, the Drosophila homologue of Wnt, during eye development. Three genes were identified: the β-catenin homologue armadillo (arm), the tcf/lef-1 homologue pangolin (pan) and legless (lgs), a completely new gene (U.S. Ser. No. 09/915,543). The lgs gene was subsequently cloned and its in vivo requirement for Wingless signal transduction in embryo and in developing tissues was confirmed. The presence of Lgs is required for a transcriptional active Arm/Pangolin complex and over-expression of Lgs strongly stimulates the transcriptional output of this bipartite transcription factor. The human genome contains at least two human Lgs homologues. One of them, Bcl9, has been previously implicated in B cell malignancies (Willis, Zalcberg et al. 1998). It was also genetically and biochemically demonstrated that digs and hLgs bind to Armadillo and β-Catenin and are functionally required for Wnt signal propagation in human cells. However, genetic experiments strongly suggested the presence of a second protein which binds to Lgs and is essential for the function of the active β-Catenin-Pangolin-Lgs complex.
[0006] The present invention describes the cloning and functional characterization of a novel Drosophila protein, named Daughter of Legless (Doll), which binds to Lgs and is required for Wnt signaling. In addition, the invention provides the sequences of the functional and structural human and mouse homologues as well as methods to screen for compounds inhibiting Doll function in the Wnt pathway.
DEFINITIONS
[0007] The term “Doll polypeptide”, “Doll protein” when used herein encompasses native invertebrate and vertebrate Doll and Doll variant sequences (which are further defined herein).
[0008] A “wild type sequence Doll” comprises a polypeptide having the same amino acid sequence as a Doll protein derived from nature. Such wild type sequence of Doll can be isolated from nature or produced by recombinant and/or synthetic means. The term “wild type sequence Doll” specifically encompasses naturally occurring truncated forms, naturally occurring variant forms (e.g., alternatively spliced forms) and naturally occurring allelic variants of Doll. In one embodiment of the invention, the wild type Doll sequence is a mature or full-length Doll sequence comprising amino acids 1 to 815 of dDoll (FIG. 1), or 1 to 419 of hDoll-1, or 1 to 406 of hDoll-2 (FIG. 2), or 1 to 417 of mDoll-1, or 1 to 407 of mDoll-2 (FIGS. 3). “Doll variant” means an active Doll, having at least about 50% amino acid sequence identity with the amino acid sequence of a wild type Doll protein of FIG. 1, 2 and 3 . The term “Doll variant” however, does also include functional homologues of Doll in the Wnt pathway.
[0009] “Percent (%) amino acid sequence identity” with respect to the Doll sequences identified herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the Doll sequence, after aligning the sequence and introducing gaps, if necessary, to achieve the maximum percentage sequence identity, and not considering any conservative amino acid substitution as part of the sequence identity. The % identity values used herein can be generated by WU-BLAST-2, which was obtained from (Tatusova TA 1999). WU-BLAST-2 uses several search parameters, most of which are set to the default values.
[0010] The term “positive”, in the context of sequence comparison performed as described above, includes residues in the sequence compared that are not identical but have similar properties (e.g. as a result of a conservative substitution). The % value of positive is determined by the fraction of residues scoring a positive value in the BLOSUM 62 matrix divided by the total number of residues in the longer sequence as defined above.
[0011] In a similar manner, “percent (%) nucleic acid sequence identity” with respect to the coding sequence of the Doll polypeptides identified herein is defined as the percentage of nucleotide residues in a candidate sequence that are identical with the nucleotide residues in any of the Doll coding sequences of this invention. The identity values used herein can be generated using BLAST module of WU-BLAST-2 set to the default parameters.
[0012] The term “epitope tag” refers to a chimeric polypeptide comprising a Doll polypeptide fused to a “tag polypeptide”. The tag polypeptide has enough residues to provide an epitope against which an antibody can be made, yet is short enough that it does not interfere with the activity of the Doll polypeptide to which it is fused.
[0013] Nucleic acids are “operably linked” when are placed in a functional relationship with another nucleic acid sequence.
[0014] The term “epistasis” means hierarchy in gene action. Epistasis experiments are performed to place components of a signaling pathway in the right order.
[0015] The term “rescue experiments” are designed to determine which gene is responsible for a specific mutant phenotype. Specifically, mutant embryos are injected with coding or genomic DNA, and the effect of the introduced DNA is determined on the basis of the capacity to revert the mutant phenotype.
[0016] “Active” or “activity” refers to forms of Doll polypeptides that retain the biological and/or immunological activity. A preferred activity includes for instance the ability to modulate the Wnt signaling pathway.
[0017] The term “antagonist” is used in a broad sense, and includes any molecule that partially or fully inhibits, blocks or neutralizes a biological activity of Doll polypeptides described herein. In a similar way, the term “agonist” is used in the broadest sense and includes any molecule that mimics or support a biological activity of an active Doll polypeptide.
[0018] “Treatment” refers to both therapeutic treatments and prophylactic or preventive measures, wherein the objective is to prevent or slow down the targeted pathologic condition or disorder. Those in need of treatment include those already with the disorder as well as those prone to have the disorder or those in whom the disorder is to be prevented.
SUMMARY OF THE INVENTION
[0019] The present invention relates to a novel family of proteins present in insects and vertebrate organisms, referred to hereinafter as “Daughter of Legless (Doll)” proteins. These proteins play an essential role in the Wnt signaling pathway, and thus in the formation and maintenance of spatial arrangements and proliferation of tissues during development, and in the formation and growth of many human tumors.
[0020] In particular, the invention relates to nucleotide sequences of the Drosophila melanogaster doll gene, of proteins encoded by said nucleotide sequences, as well as fragments, derivatives and structural and functional analogs thereof.
[0021] In a preferred embodiment the invention relates to the nucleotide and protein sequences of the human and mouse doll homologues, hdoll-1, hdoll-2 and mdoll-1 and mdoll-2, respectively.
[0022] In one embodiment, the isolated nucleic acid comprises a sequence encoding a polypeptide having at least 50% amino acid sequence identity, preferably at least about 70% sequence identity, more preferably at least 90% sequence identity, even more preferably at least about 95% sequence identity, yet even more preferably at least about 98% sequence identity, and most preferably 100% identity to (a) a fragment or the entire protein sequence of the Doll polypeptide shown in FIG. 1, or (b) the complement of the nucleic acid molecule coding for (a).
[0023] In another preferred embodiment, the isolated nucleic acid encodes a polypeptide having at least 50% amino acid sequence identity, preferably about 70% sequence identity, more preferably at least 90% sequence identity, even more preferably about 95% sequence identity, yet even more preferably about 98% sequence identity, and most preferably 100% identity to (a) a polypeptide which is part or the entire human Doll polypeptides of FIG. 2 a/b or (b) the complement of the nucleic acid molecule coding for (a).
[0024] In another embodiment, the isolated nucleic acid encodes a polypeptide sequence having at least 50% amino acid sequence identity, preferably about 70% sequence identity, more preferably at least 90% sequence identity, even more preferably about 95% sequence identity, yet even more preferably about 98% sequence identity, and most preferably 100% identity to (a) a polypeptide encoding part of the entire mouse Doll protein of FIG. 3 a/b or (b) the complement of the nucleic acid molecule coding for (a).
[0025] In a further embodiment, the isolated nucleic acid comprises a sequence encoding a polypeptide with a low overall amino acid sequence identity but shows a sequence identity of at least 60%, preferably at least 70%, more preferably at least 80%, even more preferably at least 90% and most preferably 100% in the conserved domains DHD and PHD (FIG. 4).
[0026] In yet another embodiment of the present invention isolated nucleic acids encode polypeptides having a function resembling that of the doll genes.
[0027] In another embodiment, the invention relates to a fragment of the Drosophila or vertebrate doll nucleic acid sequences that is applied as hybridization probe. Such nucleic acid fragments are about 20 to about 100 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, most preferably from 20 to 50 nucleotides in length and are derived from the nucleotides sequences shown in FIG. 1, 2 and 3 .
[0028] The invention further provides eucaryotic and procaryotic expression vectors comprising a nucleic acid molecule encoding Drosophila or vertebrate doll or a fragment thereof as shown in FIGS. 1, 2 and 3 . The vector can comprise any of the molecules or fragments thereof described above.
[0029] The invention also includes host cells comprising such a vector. By way of example, the host cells can be mammalian cells, yeast cells, insect cells, plant cells or bacteria cells.
[0030] Methods of production, isolation and purification of the Doll proteins, derivatives and analogs, e.g. by recombinant means, are also provided (see Example VI, below). In a specific aspect, the invention concerns an isolated Doll peptide, comprising an amino acid sequence of at least 80%, preferably at least about 85% sequence identity, more preferably at least 90% sequence identity, even more preferably at least 95% sequence identity, yet most preferably 100% identity with the amino acid sequences of FIGS. 1, 2 and 3 .
[0031] In yet another embodiment the invention relates to chimeric proteins comprising a Doll polypeptide fused to a heterologous polypeptide or amino acid sequence. An example of such chimeric molecule comprises a Doll polypeptide fused to an epitope tagged sequence, glutathione-S-transferase protein or to a protein with an enzymatic activity, such as beta-galactosidase or alkaline phosphatase as described in Example VI below.
[0032] In a further aspect the invention concerns an isolated full length Doll polypeptide (prepared as described in Example VI), comprising the amino acid sequences of FIG. 1, 2 and 3 , or any Doll polypeptide or a fragment thereof described in this invention sufficient to provide a binding site for an anti-Doll anti-body.
[0033] In another embodiment the invention provides antibodies that specifically recognize Doll polypeptides. The antibodies can be a polyclonal or a monoclonal preparation or fragments thereof. Polyclonal antibodies are prepared by immunization of rabbits with purified Doll polypeptides prepared as described in Example VI.
[0034] The invention also relates to transgenic animals, e.g. Drosophila, mice, rats, chicken, frogs, pigs or sheep, having a transgene, e.g., animals that include and preferably express a heterologous form of the Doll genes described herein, or that misexpress an endogenous or transgenic doll gene. Such a transgenic animal can serve as a model for studying diseases with disrupted Wnt signaling pathway, for the production of Doll proteins, or for drug screening.
[0035] In yet another embodiment, the invention also features animals, e.g. Drosophila, mice, rats, chicken, frogs, pigs or sheep, having a mutation in the doll gene, e.g. deletions, point mutations, foreign DNA insertions or inversions. Such animals can serve to study diseases characterized by disrupted Wnt function or in drug screening.
[0036] In addition, the invention relates to the use of Doll proteins, homologues, derivatives and fragments thereof as well as nucleic acids, derivatives and fragments thereof in therapeutic and diagnostic methods and compounds. In particular, the invention provides methods and compounds for treatment of disorders of cell fate, differentiation or proliferation by administration of a therapeutic compound of the invention. Such therapeutic compounds include: Drosophila and vertebrate Doll protein homologues or fragments thereof, antibodies or antibody fragments thereto, doll antisense DNA or RNA, doll double stranded RNA, and any chemical or natural occurring compound interfering with Doll function, synthesis or degradation. In particular, the invention provides methods to screen for chemical compounds, organic products or peptides interfering with Doll function in the Wnt pathway. In a preferred embodiment the screening method will be a cellular reporter gene assay or a protein-protein interaction assay.
[0037] In another embodiment, a screening assay based on protein-protein interaction is used to screen for compounds specifically inhibiting Doll-Lgs or Doll-interaction partner X.
[0038] The invention also provides methods to screen for chemical compounds, organic products or peptides interfering with Doll function in the Wnt pathway.
[0039] Furthermore, the invention comprises the use of the DHD domain in screening assays such as an in vitro protein-protein interaction assay or a protein-protein interaction in a host cell. Said assays are applied for the identification of chemical compounds, organic products, polypeptides or peptides interfering with Doll function in the Wnt pathway.
[0040] In a preferred embodiment, a therapeutic product according to the invention is administered to treat a cancerous condition or to prevent progression from a pre-neoplastic or non-malignant condition to a neoplastic or malignant state.
[0041] In other specific embodiments, a therapeutic product of the invention is administered to treat a blood disease or to promote tissue regeneration and repair. Finally disorders of cell fate, especially hyperproliferative or hypoproliferative disorders, involving aberrant or undesirable expression, or localization, or activity of the Doll protein can be diagnosed by detecting such levels.
BRIEF DESCRIPTION OF THE DRAWINGS
[0042]FIG. 1 The Drosophila doll CDNA and protein sequence.
[0043]FIG. 2 The human doll-1 and doll-2 cDNA and protein sequence.
[0044]FIG. 3 The mouse doll-1 and doll-2 CDNA and protein sequences.
[0045]FIG. 4. Drosophila and human Doll proteins contain a PED finger motif with which they bind to the ED1 of Lgs/BCL9
[0046] (A) Top: Schematic representation of Doll, human DOLL-1 and human DOLL-2. The two domains that show high sequence similarities are highlighted in dark gray (DHD: Doll homology domain) and red (PHD: plant homology domain). The DHD appears to be unique, as the inventors failed to find a similar sequence in other Drosophila or human proteins. GenBank accession numbers for Doll, hDoll-1, hDoll-2 are AF457206, AF457207, AF457208, respectively. Bottom: Multiple alignment of Drosophila, human and mouse Doll protein sequences.
[0047] (B) Alignment of amino acid sequences of DHD and PHD in Doll and its human homologues.
[0048] Similarities are boxed, identities shaded in gray. The numbers to the left indicate the positions of DHD and PHD within their respective protein sequences. For the DHD alignment a gap of 22 aa has been introduced in the Drosophila DHD (represented as (X)5)
[0049] (C) Mapping of the Lgs/BCL9 interaction site in Doll. Schematic representation of the proteins tested in the yeast-two-hybrid assay for their interactions with Lgs and BCL9. Results are indicated to the right (“bdg”).
[0050] (D) Mapping of the Doll interaction site in digs and hLgs/BCL9. Schematic representation of the proteins tested in the yeast-two-hybrid assay for their interactions with Lgs. The two proteins shown at the bottom were tested by a pull-down assay for both dlgs (numbers without brackets) and hLgs/BCL9 (numbers in parenthesis) with the same result (“bdg”). The deletion removing HD1 comprises aa 318-345 for Lgs and aa 177-204 for hLgs/BCL9. Fusion proteins used were S-Tag-dDoll (aa 542-815) and GST-hDOLL-2 (aa 301-406).
[0051]FIG. 5. doll is a segment polarity gene required for Wg signalling
[0052] (A-C) Cuticle preparations of larvae derived from wild-type (A), wg mutant (B), and doll mutant embryos (C). The doll 130 /doll 130 embryo in (C) is derived from a homozygous doll 130 mutant germ line clone (see Experimental Procedures) and displays a wg-like phenotype.
[0053] (D,E) doll functions downstream of dAPC2. Two cuticle preparations are shown from larvae that developed in the absence of maternal and zygotic wild-type dAPC2 function (McCartney et al., 1999). The embryo in (E) additionally lacks the maternal and zygotic function of doll (see Examples). In contrast to dAPC single mutant animals, which have strongly reduced denticle belts, double mutants display a doll-like phenotype.
[0054] (F-I) Confocal images of third instar wing imaginal disc preparations stained with antibodies against Doll (F,G) and Ptc (E,I). Wild-type animals show normal expression of these genes (F,H). Discs derived from doll 130 mutant larvae are small, yet express Ptc (I), but fail to express Dll (H). Lack of Dll expression may be an indirect consequence of the earlier wing-tonotum transformation in doll 130 larvae. However, we also see a strong reduction of Dll expression in doll 130 mutant cells from mosaic animals (not shown).
[0055]FIG. 6. Lgs and the PHD finger of Doll serve to assemble Doll and Arm
[0056] Schematic representation of Lgs (yellow) and Doll (light green) constructs that were used in transgene assays to assess their ability to rescue lgs or doll mutant animals. 1: Full-length Lgs (pOP216, aa 1-1464). 2: C-terminally truncated Lgs (pTK131, aa 1-583). 3: HD1-Gal11-HD2 (pTK153, aa 268-395 (HD1), aa 369-500 (Gal11), aa 465-596 (HD2)). 4: HD1-(HA)3-HD2(pTK143, aa 268-395 (HD1), aa 465-596 (HD2)). 5: Full-length Doll (pTK56, aa 1-815). 6: Doll[DPHD]-HD2 (pTK135, aa 1-740 (Doll[DPHD]), aa 483-561 (HD2)). Transgenes 1 to 4 are able to rescue lgs 20F homozygotes. An example for an adult animal rescued by transgene 3 is shown on the right. Transgene 5 can rescue doll 130 homozygotes. Transgene 6 can rescue doll 130 as well as lgs 20F homozygotes (photographs on the right).
[0057]FIG. 7: Rescue of ddoll-/-flies by expression of a human doll transgene. The lethality caused by the doll 130 /EP(3)1076 genotype can be fully rescued by a tubulin 1 promoter-driven transgene that contains either the coding region of the Drosophila doll gene (not shown) or that of one of its two human homologues hdoll-1 and hdoll-2.
[0058]FIG. 8: Effects of human Doll 1 and 2 on Tcf transcription. 293 cells were transiently transfected with the pTOPFLASH or pFOPFLASH luciferase reporters and different effector plasmids as indicated. A constitutively active form of β-Catenin (ΔN-β-Catenin, 50 ng) or human Doll-1 or hDoll-2 (350 ng) activate the pTOPFLASH reporter. Cotransfection of human Doll with ΔN-β-catenin strongly enhance the response.
DETAILED DESCRIPTION OF THE INVENTION
[0059] The Wnt signaling cascade is essential for the development of both invertebrates and vertebrates, and has been implicated in tumorigenesis. The Drosophila wg genes are one of the best characterized within the Wnt-protein family, which includes more than hundred genes. In the Drosophila embryo, wg is required for formation of parasegment boundaries and for maintenance of engrailed (en) expression in adjacent cells. The epidermis of embryo defective in wg function shows only a rudimentary segmentation, which is reflected in an abnormal cuticle pattern. While the ventral cuticle of wild type larvae displays denticle belts alternating with naked regions, the cuticle of wg mutant larvae is completely covered with denticles. During imaginal disc development, wg controls dorso-ventral positional information. In the leg disc, wg patters the future leg by the induction of ventral fate (Struhl and Basler 1993). In animals with reduced wg activity, the ventral half of the leg develops into a mirror image of the dorsal side (Baker 1988). Accordingly, reduced wg activity leads to the transformation of wing to notal tissue, hence the name of the gene (Sharma and Chopra 1976). In the eye disc, wg suppresses ommatidial differentiation in favor of head cuticle development, and is involved in establishing the dorso-ventral axis across the eye field (Heberlein, Borod et al. 1998).
[0060] Additional genes have been implicated in the secretion, reception or interpretation of the Wg signaling. For instance, genetic studies in Drosophila revealed the involvement of frizzled (Dfz), Dishevelled (dsh), shaggy/zeste-white-3 (sgg/zw-3), armadillo (arm), adenomatous polyposis coli (E-apc), axin, and pangolin (pan) in Wg signaling. The genetic order of these transducers has been established in which Wg acts through Dsh to inhibit Sgg, thus relieving the repression of Arm by Sgg, resulting in the cytoplasmic accumulation of Arm and its translocation to the nucleus. In the nucleus Arm interacts with Pan to activate transcription of target genes. Vertebrate homologues have been identified for all these components (for an updated review see (Peifer and Polakis 2000), suggesting that novel identified members of the Drosophila signaling pathway may likely have vertebrate counterparts.
[0061] Mutations leading to nuclear accumulation of the mammalian homologue of Arm, β-Catenin, and consequently to constitutive activation of the Wnt pathway have been observed in many types of cancer, including colon cancer, breast cancer, melanoma, hepatocellular carcinoma, ovarian cancer, endometrial cancer, medulloblastoma pilomatricomas, and prostate cancer (Morin 1999; Polakis, Hart et al. 1999). It is now apparent that deregulation of β-Catenin signaling is an important event in the genesis of these malignancies. However, there are still no known therapeutic agents effectively inhibiting β-Catenin transcriptional activation. This is partly due to the fact that many of the essential components required for its full activation and nuclear translocation are still unknown.
[0062] In order to identify new components required for Wingless activation the inventors used a Drosophila genetic approach to screen for dominant suppressors of the rough eye phenotype caused by ectopic expression of Wingless (Wg), the Drosophila homologue of Wnt, during eye development. A new gene, legless (lgs, U.S. Ser. No. 09/915,543) was identified as a strong dominant suppressor of the rough eye phenotype. The gene was subsequently cloned and its in vivo requirement for Wg signal transduction in embryo and in developing tissues was confirmed. The human genome contains at least two human Lgs homologues, hLgs/Bcl9 and hLgs-1. dLgs and hLgs bind to Armadillo and β-Catenin and are functionally required for Wnt signal propagation in invertebrate and vertebrate cells (U.S. Ser. No. 09/915,543). In particular, the presence of Lgs is required for a transcriptional active Arm/Pangolin complex and over-expression of Lgs strongly stimulates the transcriptional output.
[0063] The inventors later made the interesting observation that a mutant form of Lgs protein from which the β-Catenin interacting domain was deleted exhibited a strong dominant-negative effect on Wg-dependent patterning processes when expressed from a transgene in wild-type larvae (data not shown). This strongly suggested that Lgs normally interacts not only with Arm, but also with at least one additional component. In an effort to identify such components yeast-two-hybrid screens for interacting proteins were carried out. In two independent screens in which either the entire protein or the N-terminal half of Lgs was used as a bait, a novel PHD finger protein, referred to as Daughter-of-Legless (Doll), was identified as a Lgs binding protein (FIG. 4). The 815 amino acid residue Doll protein carries a C-terminal domain of 60 amino acids (FIG. 4 a ), which shows extensive homologies to the PHD (plant homology domain) finger, also known as LAP (leukemia associated protein) domain (Aasland, Gibson et al. 1995). This domain comprises a cysteine rich Zn-binding motif, that has been associated with proteins involved in chromatin-mediated regulation of transcription. The PHD finger of Doll is necessary and sufficient to mediate the interaction to Lgs (FIG. 4 c - d ). The inventors also demonstrate herein that this interaction is essential for Doll function.
[0064] The region of Lgs responsible for Doll-binding was mapped to the HD1 sequence (FIG. 4 d ). Moreover, two human homologues of the Drosophila doll gene were identified and isolated (FIG. 4 a ). The protein products of both human genes, hDOLL-1 and hDOLL-2, as well as their mouse homologues possess a highly conserved PHD finger which interacts with the HD1 of hLgs/BCL9 (FIG. 4 d ). The only other domain in Drosophila Doll, hDoll-1/hDoll-2 and mDoll-1/mDoll-2 that shows significant sequence homology is a 50 amino acid stretch in the N-terminal region, which is referred herein to as ‘Doll homology domain’ (DHD, FIG. 4 a,b ).
[0065] The interaction with Doll appears to be relevant for the in vivo function of Lgs, since a mutant form of Lgs with a deletion of HD1 was unable to rescue lgs mutant animals. The physical association of Doll and Lgs suggested that Doll, like Lgs, may be required for Wg signaling in vivo. To explore this hypothesis a proprietary collection of suppressors of the sev-wg phenotype was searched for mutations that map to the tip of the right arm of chromosome 3, the position of the doll gene. One such suppressor, Sup 130 , mapped to this position, and intriguingly, it showed dominant lethality in combination with the lgs allele lgs 17E (U.S. Ser. No. 09/915,543) (Sup 130 /+lgs 17E /+transheterozygous animals do not survive). The doll coding region was sequenced using genomic DNA from homozygous Sup 130 mutant larvae and a 14 bp deletion starting at amino acid position 751 was identified. Hence this allele is referred to as doll 130 and encodes a truncated Doll protein lacking the C-terminal PHD finger.
[0066] The lethality caused by the homozygous dol 130 genotype can be fully rescued by a tubulin l promoter-driven transgene that contains either the coding region of the Drosophila doll gene or that of one of its two human homologues hDoll-1 and hDoll-2 (FIG. 7). Thus, the vertebrate homologues of doll were confirmed genetically to be true functional homologues of Doll, and hence the vertebrate homoiogues are part of this invention. To assay the possible role of Doll in Wg signal transduction during development, embryos homozygous for the doll 130 mutation that derived from female germ cells equally mutant for doll were generated. Doll mRNA is maternally contributed and strongly and ubiquitously expressed during all the developmental stages. Consequently, only embryos lacking both embryonal and maternal doll are characterized by a severe segment polarity phenotype (FIG. 5A-C), while weaker loss of function doll mutants display pupal lethality with a partial or complete loss of the antennae and the legs. Mutant individuals lacking only zygotic function survive until early pupal stages and exhibit imaginal discs that are abnormally small. The Hh target gene ptc was expressed at wild-type levels in these discs, however, no expression of the Wg target Dll could be detected (FIG. 5F-I). These discs appear to lack the presumptive wing blade field and possess two primordia for the notum (FIGS. 5G and I). The fact that similar phenotypes are caused by loss of function of wg, dsh, arm, or lgs confirms the essential role of doll in the Wg signaling pathway.
[0067] To address the role of Doll in β-Catenin-mediated transcription a TCF reporter gene (TOPFLASH, (Morin, Sparks et al. 1997)) was used in immortalized human embryo kidney cells (HEK 293 cells). Low levels of a stable mutant form of β-Catenin (ΔN-β-catenin; (van de Wetering, Cavallo et al. 1997)) were introduced into these cells to partially stimulate the pathway. The additional expression of hDoll-1 (FIG. 8) or hDoll-2 (not shown) lead to a large increase in luciferase activity (30-fold). These levels are significantly higher than the sum of those produced by either treatment alone (FIG. 8). This potentiation of β-Catenin activity by hDoll-1 and 2 appears to be mediated by the interaction of endogenous TCF protein with its DNA target sites, as it is only observed with TOPFLASH, which contains five optimal TCF binding sites, but not with the control reporter FOPFLASH, which contains five mutated sites (Morin, Sparks et al. 1997). Thus this experiment adds supportive evidence to the notion that Doll proteins transduce Wnt signals by activating TCF target genes in a β-Catenin-dependent manner.
[0068] In summary, the protein-protein interactions demonstrated between Drosophila Doll and Lgs and those between their human homologues human Doll and hLgs/Bcl9, respectively, in conjunction with the genetic and cell biological data show that Doll proteins are positive regulators of the Wg and Wnt signaling pathways, respectively.
EXAMPLES
Example I
Isolation of Doll cDNA
[0069] The cDNA for Daughter of Legless (Doll) was isolated in two independent yeast genetic screens of a Drosophila cDNA-library for proteins directly binding to Lgs. Other DNA libraries can be used as well, such a genomic and CDNA libraries from vertebrate and invertebrate organisms. Other methods than a yeast-two hybrid screen can be used as well. Such methods include, but are not limited to, direct amplification using gene specific primers and standard methods known by people skill in the art. To perform the yeast two-hybrid screening for protein binding to Lgs CDNA sequences encoding the first 732 amino acids (“LgsN”) and the full-length protein of 1464 amino acids (“LgsFL”) were subcloned into a yeast expression vector (pLexA, Clontech), fusing them to the LexA DNA-binding domain. Subsequently these constructs were transformed into the LEU2-reporter yeast strain EGY48 together with the lacZ-reporter plasmid pSH18-34 and an embryonic Drosohila melanogaster cDNA-library fused to an acidic transcriptional activation domain (“RFLY-1” library, PNAS 93, 3011-3015). In a first step triple-transformant colonies containing the LgsN- or LgsFL-LexA-fusion constructs, respectively, the pSH18-34 reporter and a RFLY-1 library plasmid were grown on minimal selective medium plates for two days, harvested, thoroughly mixed, and stored as uniform aliquots. Then cells from one of these aliquots were transferred into permissive Galactose/Raffinose minimal selective liquid medium, and incubated with shaking at 30° C. for a few hours, thereby inducing expression of the library cDNA-activation domain fusion from the GAL1-inducible promotor. Finally these “induced” cells were plated on Galactose/Raffinose minimal selective medium plates lacking the amino acid 1-leucine. On these plates cell growth was sustained only upon activation of a LEU2-selector gene through molecular interaction of the respective LexA-fusion and activation domain-fusion proteins. The LEU2-gene codes for an essential metabolic enzyme needed for the biosynthesis of leucine from other amino acid precursors. All clones growing under these restrictive conditions were isolated and analyzed for the activity of the lacZ-reporter gene, encoding the metabolic enzyme β-Galactosidase from the enterobacterium E.coli , by a standard X-Gal assay (e.g. Bartel and Fields (eds.), Oxford University Press 1997). From all candidate clones that passed these two selection steps, the cDNA-library plasmids were isolated again by standard techniques (e.g. Methods in Yeast Genetics, Cold Spring Harbour Laboratory Press, 1997) and retested for specific interaction with Lgs in the X-Gal assay, using an unrelated LexA-fusion protein as a negative control. By this procedure three independent cDNA-clones were identified, that strongly and specifically interacted only with Lgs and contained partially overlapping sequences: BK12b, BK14b and TK5.35h. By searching the Drosophila genome database using the blastn algorithm (http://www.ncbi.nlm.nih.gov:80/BLAST/) we mapped the three isolated cDNAs to the CG11518 locus, coding for a protein product of 815 amino acids in length. The cDNA-clones coded different parts of the Doll protein, with BK12b containing nucleotides (nt) 2223-2448, BK14b nt 2191-2448 and TK4.35h nt 749-2448 of the computationally predicted open reading frame (ORF). Further bioinformatical analysis (http://www.ebi.ac.uk/interpro/) revealed that the very C-terminal part of the protein sequence (ca. aa 745-805), present in all three of the Lgs-binding clones, was predicted to adapt a PHD-finger fold, which has been identified in other proteins involved in transcriptional regulation at different levels.
Example II
Identification of Human and Mouse Homologues of Drosophila Doll
[0070] After the identification of the Drosophila Doll amino acid sequence, publicly available databases were searched for similar protein sequences in other species, using the tblastn algorithm (http://www.ncbi.nlm.nih.gov:80/BLAST/). Two candidate sequences were found each in ESTs from Mus musculus and Homo sapiens, respectively, the putative protein products of which display high similarity to Doll in their C-terminal domains. These stretches of high similarity are predicted to adapt a PHD-finger fold as well (FIG. 4). Doll proteins do not display other known structural motifs in their N-terminal sequences but they display a second high homology domain, which was accordingly named DOLL Homology Domain (bHD) (FIG. 4). Doll proteins of both invertebrate and vertebrate origin have so far not been further described or experimentally studied, and have thus not previously been implicated in any specific biological process.
Example III
Isolation and Mapping of Drosophila Doll Alleles
[0071] EMS-treated males were crossed to females carrying a wg transgene (sev-wg) driven by two copies of the sevenless enhancer (Basler, Christen et al. 1991). 2×105 progeny were screened for suppressors of the rough eye phenotype. Third chromosomal suppressors were coarsely mapped by meiotic recombination using a panel of P[y+] insertions. One such suppressor, Sup 130 , showed intriguingly dominant lethality in combination with the lgs allele lgs 17E (U.S. Ser. No. 09/915,543) (Sup 130 /+lgs 17E /+transheterozygous animals do not survive), strongly suggesting a close genetic interaction. Fine mapping of the mutation using denaturing HPLC (WAWE system, Transgenomic Inc.) demonstrated that it localizes within the doll gene. The doll coding region was therefore sequenced using PCR fragments covering the doll coding region derived from genomic DNA from homozygous Sup 130 mutant larvae. The defect in Sup 130 was found to be a 14 bp deletion (nucleotides 2253 to 2266: 5′ CATGTGCCACAAGG 3′) within the doll open reading frame that induced a frame-shift subsequent to amino acid 751 and resulted in the formation of a premature stop codon. Hence this allele is referred to as doll 130 and encodes a truncated Doll protein lacking the C-terminal PHD finger.
[0072] Pole cell transplantation, chromosome squashes, and chromosome in situ hybridization experiments were carried out according to standard protocols (Ashburner 1989).
Example IV
Use of Doll as a Hybridization Probe
[0073] The following method describes the use of a non-repetitive nucleotide sequence of doll as a hybridization probe. The method can be applied to screen for doll homologues in other organisms as well. DNA comprising the sequence of doll (as shown in FIGS. 1 , 2 , 3 ) is employed as probe to screen for homologue DNAs (such as those included in CDNA or genomic libraries). Hybridization and washing of the filters containing either library DNAs is performed under standard high stringency conditions (Sambrook, Fritsch et al. 1989). Positive clones can be used to further screen larger cDNA library platings. Representative cDNA-clones are subsequently cloned into pBluescript (Stratagene) or similar cloning vectors and sequenced.
Example V
Use of Doll as a Hybridization Probe for in situ Hybridization
[0074] In situ hybridization of Drosophila doll mRNA can be performed in embryo as described in (Tautz and Pfeifle 1989). However, with small modifications it can also be used to detect any mRNA transcript in Drosophila larval imaginal discs or vertebrate tissue sections. Labeled RNA probes can be prepared from linearized doll CDNA (as showed in FIGS. 1 , 2 , 3 ), or a fragment thereof, using the DIG RNA labeling Kit (SP6/T7) (Boehringer Mannheim) following the manufacturer's recommendations.
Example VI
Expression of Doll in Drosophila melanogaster
[0075] Doll can be expressed in Drosophila in the whole organism, in a specific organ or in a specific cell type, during the whole life or only at a specific developmental stage, and at different levels. An overview of the standard methods used in Drosophila genetics can be found in (Brand and Perrimon 1993; Perrimon 1998; Perrimon 1998).
[0076] Generation of Doll Mutant Embryos
[0077] Mosaic germlines are generated with the help of site-specific recombination through the FLP recombinase (Xu and Rubin 1993).
[0078] Females of the genotype hsp70:flp, FRT82 doll 130 /FRT82 ubi-GFP are heat-shocked at 37° C. for 1 hr during the third instar larval stage to induce FLβ-directed recombination and later mated to doll 130 /TM6b[y+] males. Germline mosaics are induced. The source of recombinase is a first chromosome insertion of a fusion of the hsp 70 promoter (denoted by “hsp70”) to the FLP coding sequence. Somatic recombination at the FRT82 sites gives rise to adult female germ line that produces oocytes that upon fertilization lead to embryos which do not contain neither zygotic nor maternally contributed information for the production of functional dDoll protein. Those embryos can be identified by the absence of the yellow+phenotype provided by the TM6b[y+] paternal balancer chromosome. For analysis, cuticles are prepared by standard techniques from mutant embryos, and examined by dark field microscopy.
[0079] dApC2 doll doublemutant germ line clones were generated with an FRT82 dAPC2 DS doll 130 chromosome. The FRT82 ovo D1 chromosome (Chou and Perrimon 1996) was used to select for mutant germ cells. The FRT82 doll 133 chromosome was also used to create doll mutant clones in discs, in conjunction with an FRT82 arm-lacz chromosome.
[0080] Generation of Doll Mutant Embryos Expressing Constitutively Active Arm
[0081] In order to express constitutively active Arm (“ΔArm”),h females of the genotype described above are heat shocked at 37° C. for 1 hr during late pupal stages and mated to males of the genotype UAS:ΔArm hsp70-Gal4/UAS:ΔArm hsp70-Gal4; doll/TM6b[y+]. Due to the presence of the additional transgenes in these males off-spring that had arisen from a doll mutant oocytes and doll mutant sperm express upon heat treatment the constitutively active Arm protein, that transiently induced Wingless target genes.
Example VII
Rescue of Ddoll-/-flies with Hdoll-1 and Hdoll-2 cDNA Expression
[0082] In order to confirm the functional homology between Drosophila and human Doll-1 and human Doll-2, the human genes were introduced into Drosophila flies carrying two mutant doll alleles (ddoll-/-flies) Specifically, flies carrying e.g. a tub:hdoll transgene, and two mutant doll alleles, e.g. doll 130 and EP(3)1076 (publicly available) were generated. ddoll-/-mutant flies display larval or pupal lethality. In contrast, ddoll-/-mutant flies carrying at least one copy of the tub:hdoll-1 or tub:hdoll-2 transgenes survive to adulthood. This demonstrates that both, hDoll-1 and hDoll-2, can replace endogenous ddoll function in flies and thus validates functional homology between Drosophila and human Doll (FIG. 7).
Example VIII
Protein Production and Purification of Doll in E. coli
[0083] The following method describes recombinant expression of Doll proteins in bacterial cells. DNA encoding full-length or a truncated Doll form is fused e.g. downstream of an epitope tag or glutathione-S-transferase (GST) cDNA and a thrombin or enterokinase cleavage site contained within an inducible bacterial expression vector. Such epitope tags include poly-his, S-protein, thioredoxin and immunoglobin tags. A variety of plasmids can be employed, including commercially available plasmid such as pGEX-4T (Pharmacia) or pET-32a (Novagen).
[0084] Briefly, a bacterial expression plasmid containing the doll sequence, for instance fused to a GST-sequence, is transformed by conventional methods into protease deficient E.coli such as BL21 (Stratagene). A bacterial colony containing the plasmid is then expanded overnight in selection medium to reach saturation. The next morning, this culture is diluted 1:100 and bacterial are allowed to growth to an optical density (OD 600 ) of 0.6. Protein production is initiated by addition of an inducer of the promoter under which GST-Doll fusion protein is expressed. A variety of inducers can be employed depending on the expression vector used, including IPTG.
[0085] Expressed GST tagged Doll can then be purified, for instance, using affinity beads or affinity chromatography, such as glutathione beads (commercially available from Pharmacia). Extracts are prepared by lysing the Doll-expressing bacteria in sonication buffer (10 mM Tris HCl pH 8.0, 150 mM NaCl, 1 mM EDTA, 1.5% sarkosyl, 2% Triton-X-100, 1 mM DTT and protease inhibitors), followed by short sonication on ice (e.g. 3 times 20 seconds at middle power) and centrifugation. Cleared supernatants are then incubated under gentle rotation for example with glutathione beads for 1 hrs at 4° C. Next beads are washed several time in washing buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl 2 , 0.5% NP40), and finally stored in storage buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 10% glycerol, 0.5% NP40, and proteinase inhibitors).
[0086] Alternatively, a His-tagged, S-protein, thioredoxin or IgG tagged Doll can be purified using affinity chromatography.
[0087] The quality of the preparations can be checked e.g. by SDS-gel electrophoresis and silver staining or Western blot.
[0088] In case the epitope-tag has to be cleaved, several methods are available depending on the presence of a cleavage site between the epitope-tag and the Doll protein. For example, it is possible to produce a GST-Doll fusion protein containing a thrombin cleavage site right before the first Doll amino acid. Briefly, a GST-Doll preparation on glutathione-affinity beads is washed several times in cleavage buffer (50 mM Tris HCl pH 7.0, 150 mM NaCl, 1 mM EDTA, 1 mM DTT). Thrombin is then added and the samples are incubated for over 16 h at room temperature. Supernatants are then collected and analyzed for successful cleavage of Doll from the beads by polyacrylamide gel electrophoresis and silver staining or Western blot.
Example IX
Protein-protein Interactions Involving Doll
[0089] A GST-fusion protein in vitro binding assay can be performed to map binding domains and find additional interaction partners. For this purpose, proteins are in vitro translated using reticulocyte lysates (e.g. TNT-lysates, Promega Corporation) containing [3 5 S] methionine following the instructions provided by the manufacturer. Alternatively, cellular proteins can be labeled by incubation of culture cells with [ 35 S]methionine. Glutathione Stransferase (GST) fusion proteins, produced as illustrated in the Example VIII, are immobilized on glutathione-Sepharose and blocked in binding buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl 2 , 10% glycerol, 0.5% NP40, 0.05% BSA, and proteinase inhibitors) for 45 min. Two μg of immobilized GST proteins are then incubated for 1.5 hrs with 0.5-6 μl of in vitro translated proteins in binding buffer or with [ 35 S]methionine labeled cell extract. The beads are washed four times in washing buffer (20 mM Tris pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl 2 , 0.5% NP40) and boiled in Laemmli SDS sample buffer. Proteins binding to Doll are detected by autoradiography. In case that a cell lysate were used to identify novel Doll binding partner, the protein bands on the gel can be isolated by methods known in the art, and the protein sequence can be determined e.g. by mass spectrophotometrical analysis.
[0090] A yeast two hybrid assay can additionally be performed to confirm the results of the in vitro binding assays described above or to screen cDNA library for new interaction partners (Fields and Sternglanz 1994). In this context, the desired cDNAs are subcloned into appropriate yeast expression vectors that link them either to a Lex DNA binding domain (e.g. pLexA, Clontech) or an acidic activation domain (e.g. pGJ4-5, Clontech). The appropriate pair of plasmids is then transformed together with a reporter plasmid (e.g. pSH18-34, Clontech) into an appropriate yeast strain (e.g. EGY48) by the lithium acetate-polyethylene glycol method and grown on selective media (Sambrook, Fritsch et al. 1989). Transformants are analyzed for reporter gene activity as described by the manufacturer of the vector-reporter plasmid used. To establish reproducibility the interactions is tested in both directions. Alternatively, this method is used to screen for novel Doll interaction partners. In this context, e.g. pLexA-Doll is transfected into yeast together with a cDNA library cloned into e.g. pGJ4-5 as described above. Positive clones can be isolated and the cDNA they contain can be sequenced by methods known by people skilled in the art.
Example X
Immunohistochemistry
[0091] Localization of the Doll proteins is performed on Drosophila embryo, imaginal discs, invertebrate and vertebrate adult tissue sections or tumor cell lines using the anti-Doll antibodies provided by this invention. For instance, if a tumor cell line is used, cells can be seeded into polylysine-coated 8 well chambers (Nalge-Nunc Internat.) and grown overnight at 37° C. As a positive control, 293 MEK cells (ATCC) cells might be transfected e.g. by a lipofection method (e.g. Lipofectamine, Gibco technologies) with a Doll expression plasmid, such as pcDNA3.1 (Invitrogen). Two days after transfection, cells are washed and fixed with 3.7% formaldehyde in PBS for 10 min, permeabilized in 0.5% Triton-X-100 for another 10 min, and blocked with a 1:1000 dilution of pre-immunoserum in 2% BSA-PBS for 1 h at RT. Cells are then incubated with a 1:1000 dilution of anti-Doll immunoserum for 2 hrs at RT, followed by washing in PBS and staining with anti-rabbit secondary-antibody. The washing step is repeated and preparations are blocked in a solution of 3% BSA in PBS/0.1% TritonX-100 for 1 hr. The slides are then washed three times for 5 min in PBS and incubated with a 1:200 dilution (v/v) of TRITC-conjugated swine anti-rabbit immunoglobulin (Dako, Inc.). The washing step is repeated before applying coverslips using Vectashield® mounting medium (Vector Laboratories, Inc.). Detection of other proteins such as β-Catenin, hLgs or Tcf can be performed in the same way using anti-β-Catenin (commercially available), anti-hLgs (U.S. Ser. No. 09/915,543) or anti-Tcf (commercially available) specific antibodies, respectively.
Example XI
Luciferase Reporter Gene Assays
[0092] The effect of Doll on Tcf transactivation activity can be performed in a cell culture system using a Tcf responsive luciferase reporter gene. Depending on the expression vector used, this protocol can be applied for mammalian as well as for Drosophila cell lines. For instance, HEK293 cells (ATCC) are a well suitable system. Hereby, Doll full length cDNA is cloned into a mammalian expression vector, such as pcDNA3 (Invitrogen), and transfected together with the TOPFLASH luciferase reporter plasmid (Upstate biotechnology, New York, USA) into 293 cells. A lipofection agent like the Lipofectamine transfection reagent (Life Technologies, Inc.) can be used for this purpose. A renilla luciferase reporter plasmid, e.g. pRL-SV40, (Promega Corporation, Madison USA), is co-transfected to normalize the transfection efficiency. Cell extracts are prepared 48 h after transfection and assayed for firefly and renilla luciferase activity as described by the manufacturer (Dual luciferase reporter assay system, Promega Corporation). All the luciferase values are normalized for renilla luciferase activity (see FIG. 8).
Example XII
Screening of Chemical Compounds, Organic Products or Peptides Interfering with Doll Function
[0093] A reporter gene assay is performed with a similar protocol as described in example XI, but scaled down to be performed as a high throughput screening. For this purpose colon cancer cell lines with mutated and/or constitutively active β-Catenin are stably transfected with the Topflash vector described in Example XI and Doll cDNA. The established monoclonal population, which gives the most reliable and constant reporter gene activity is selected for later assays. One day after plating, cells are treated with single compounds derived from a chemical or peptide library. One to 24 hours later reporter gene activity is measured. Compounds found to inhibit reporter gene activity are then further characterized for specific activity on the Doll-containing transcriptional complex. Alternatively, Wnt pathway activity can be measured by detecting mRNA or protein levels of a target gene, e.g. myc (He, Sparks et al. 1998).
Example XIII
Screening Assay Based on Protein-protein Interaction for Compounds Inhibiting Doll-Lgs or Doll Interaction Partner X
[0094] Doll and its interaction partner or fragments thereof are produced and purified e.g. from E.coli cultures (e.g. as described in example VIII). Proteins are tagged e.g. with 6 histidines, Sprotein, GST or thioredoxin. Small aliquots of the purified proteins are incubated in an appropriate binding buffer. At this point chemical compounds are added to the mixture and their capacity to disrupt the protein-protein interaction is monitored e.g. by any of the methods described below. Compounds inhibiting this interaction are subsequently tested for their specificity and in vivo toxicity. Well established methods to monitor protein-protein interactions are e.g.:
[0095] Time resolved fluorometry with lanthanide chelate labels (Hemmilä I. And Webb S. DDT 2: 373-381 (1997))
[0096] Scintillation proximity assay (SPA) (Amersham life Science)
[0097] Fluorescence polarisation
References
[0098] Aasland, R., T. J. Gibson, et al. (1995). “The PHD finger: implications for chromatin-mediated transcriptional regulation.” Trends Biochem Sci 20(2): 56-9.
[0099] Ashburner, M. (1989). Drosophila A laboratory handbook . Cold Sring Harbor, N.Y., Cold Spring Harbor Laboratory Press.
[0100] Baker, N. E. (1988). “Transcription of the segment-polarity gene wingless in the imaginal discs of Drosophila, and the phenotype of a pupal-lethal wg mutation.” Development 102(3): 489-97.
[0101] Barker N, H. G., Korinek V, Clevers H (1999). “Restricted high level expression of Tcf-4 protein in intestinal and mammary gland epithelium.” Am J Pathol 154: 29-35.
[0102] Basler, K., B. Christen, et al. (1991). “Ligand-independent activation of the sevenless receptor tyrosine kinase changes the fate of cells in the developing Drosophila eye.” Cell 64(6): 1069-81.
[0103] Brand, A. H. and N. Perrimon (1993). “Targeted gene expression as a means of altering cell fates and generating dominant phenotypes.” Development 118(2): 401-15.
[0104] Cabrera, C. V., M. C. Alonso, et al. (1987). “Phenocopies induced with antisense RNA identify the wingless gene.” Cell 50(4): 659-63.
[0105] Chou, T. B. and N. Perrimon (1996). “The autosomal FLP-DFS technique for generating germline mosaics in Drosophila melanogaster.” Genetics 144(4): 1673-9.
[0106] Fields, S. and R. Sternglanz (1994). “The two-hybrid system: an assay for protein-protein interactions.” Trends Genet 10(8): 286-92.
[0107] Grosschedl R, E. Q. (1999). “Regulation of LEF-1TCF transcription factors by Wnt and other signals.” Current Opinion in Cell Biology 11: 233-240.
[0108] He, T. C., A. B. Sparks, et al. (1998). “Identification of c-MYC as a target of the APC pathway [see comments].” Science 281(5382): 1509-12.
[0109] Heberlein, U., E. R. Borod, et al. (1998). “Dorsoventral patterning in the Drosophila retina by wingless.” Development 125(4): 567-77.
[0110] Morin, P. J. (1999). “beta-catenin signaling and cancer.” Bioessays 21(12): 1021-30.
[0111] Morin, P. J., A. B. Sparks, et al. (1997). “Activation of betacatenin-Tcf signaling in colon cancer by mutations in betacatenin or APC [see comments].” Science 275(5307): 1787-90.
[0112] Nusse, R. and H. E. Varmus (1982). “Many tumors induced by the mouse mammary tumor virus contain a provirus integrated in the same region of the host genome.” Cell 31(1): 99-109.
[0113] Peifer, M. and P. Polakis (2000). “Wnt signaling in oncogenesis and embryogenesis—a look outside the nucleus.” Science 287(5458): 1606-9.
[0114] Perrimon, N. (1998). “Creating mosaics in Drosophila.” Int j Dev Biol 42(3): 243-7.
[0115] Perrimon, N. (1998). “New advances in Drosophila provide opportunities to study gene functions.” Proc Natl Acad Sci USA 95(17): 9716-7.
[0116] Perrimon, N. and A. P. Mahowald (1987). “Multiple functions of segment polarity genes in Drosophila.” Dev Biol 119(2): 587-600.
[0117] Polakis, P., M. Hart, et al. (1999). “Defects in the regulation of beta-catenin in colorectal cancer.” Adv Exp Med Biol 470: 23-32.
[0118] Potter, J. D. (1999). “Colorectal cancer: molecules and populations.” Journal of the National Cancer Institute 91(11): 916-32.
[0119] Rijsewijk, F., M. Schuermann, et al. (1987). “The Drosophila homolog of the mouse mammary oncogene int-1 is identical to the segment polarity gene wingless.” Cell 50(4): 649-57.
[0120] Roose, J. and H. Clevers (1999). “TCF transcription factors: molecular switches in carcinogenesis.” Biochimica et Biophysica Acta 1424 (2-3): M23-37.
[0121] Sambrook, J., E. F. Fritsch, et al. (1989). Molecular cloning: a laboratory manual , Cold Spring Harbor Laboratory Press.
[0122] Sharma, R. P. and V. L. Chopra (1976). “Effect of the Wingless (wg1) mutation on wing and haltere development in Drosophila melanogaster.” Dev Biol 48(2): 461-5.
[0123] Struhl, G. and K. Basler (1993). “Organizing activity of wingless protein in Drosophila.” Cell 72(4): 527-40.
[0124] Tatusova TA, M. T. (1999). “Blast 2 sequences—a new tool for comparing protein and nucleotide sequences.” FEMS Microbiol Lett . 174: 247-250.
[0125] Tautz, D. and C. Pfeifle (1989). “A non-radioactive in situ hybridization method for the localization of specific RNAs in Drosophila embryos reveals translational control of the segmentation gene hunchback.” Chromosoma 98(2): 81-5.
[0126] van de Wetering, M., R. Cavallo, et al. (1997). “Armadillo coactivates transcription driven by the product of the Drosophila segment polarity gene dTCF.” Cell 88(6): 789-99.
[0127] Waltzer, L. and M. Bienz (1999). “The control of beta-catenin and TCF during embryonic development and cancer.” Cancer & Metastasis Reviews 18(2): 231-46.
[0128] Willis, T. G., I. R. Zalcberg, et al. (1998). “Molecular cloning of translocation t(1;14) (q21;q32) defines a novel gene (BCL9) at chromosome lq21 .” Blood 91(6): 1873-81.
[0129] Wodarz, A. and R. Nusse (1998). “Mechanisms of Wnt signaling in development.” Annual Review of Cell & Developmental Biology 14: 59-88.
[0130] Xu, T. and G. M. Rubin (1993). “Analysis of genetic mosaics in developing and adult Drosophila tissues.” Development 117(4): 1223-37.
1
10
1
2448
DNA
Drosophila melanogaster
1
atgacccaca atcttggtat ggcgccatat cgattgccgg gtccagcggg cggactctgt 60
ccgcccgatt ttaagccgcc gcctcccacg gacatcatct cggcgccgag caatccgaag 120
aagcggcgaa aaacctcaag tgccgccaac tccgctgcag cggtggctgc ggcggcggct 180
gcagcagctg ctgcgaattc catgcagcag cagcaggcgc cacccacacc gcaggatttg 240
ctgccccctc cgccaatggg aggcttcgga gacaccatta ttgcctcgaa tccattcgac 300
gacagtcccc aggtgtcggc gatgtccagc tcagcggccg cggcgatggc ggccatgaat 360
cagatgggcg gcggaccagg aggtggtcac tttggcggcg gtggaccggg tgggcacccg 420
cactgggaag accgcatggg catgggcggt ggacctcctc ccccgcctca catgcatccc 480
catatgcacc cgcatcatcc aggcggacct atgggtcacc cacatggccc acatccgcac 540
atgggtggtc cacctccaat gcgaggaatg agccccatgc acccccatca aatgggaccg 600
ggaccaggcg tcggactacc gccgcatatg aatcacggaa ggccaggggg acctggtggt 660
cctggaggac ccgtcccaat gggtagtccc atgggtggaa tagctggcat gggcggcatg 720
agcccaatgg gcggaatggg aggccccagc atatcacccc atcacatggg catgggtggt 780
ctgtcgccca tgggaggcgg tcccaacgga cccaatccgc gagccatgca gggttcaccg 840
atgggcggtc cggggcagaa ctcgccaatg aactcactgc ctatgggttc gccaatgggc 900
aatccaattg gcagcccgtt gggccctccc tcgggaccgg gccctgggaa tcccggcaat 960
accggcggac cacagcagca acaacaacaa cctccgcagc caccgatgaa caacgggcag 1020
atgggtcctc ctcctctgca cagtccgctc ggaaacggac caacgggtca tggcagtcac 1080
atgcctggag gaccaatccc aggaccaggt cctgggcctg gcggcctagt aggtcccggt 1140
ggcatctccc ccgcgcacgg caataacccg ggtggttctg ggaacaacat gctcggcggg 1200
aatcccggcg gcggcaacag caacaacaac ggaagcaata caagtaacgc cagcaacaac 1260
aatcaaaatc ctcacctctc gccagcagcc ggacgcctgg gagtgccgac gtcgatgcag 1320
tcgaatggac cttcggtatc atcggtagcc tcctcatcgg ttccctcgcc cgccacgccc 1380
acgctcacgc ccacatcgac ggccacgtcc atgtccacgt cagtgcctac atcctcgcca 1440
gcgccgcccg ccatgtcacc gcatcactcg ctaaacagcg ccggcccgag tccgggcatg 1500
cccaactcgg gacccagccc gctgcagtca ccagccggac ccaatggccc caataacaac 1560
aacagcaata acaacaacgg acccatgatg ggccagatga tcccgaacgc agttcctatg 1620
cagcaccagc agcacatggg cggcggccca cctggccacg ggcccggacc aatgcccgga 1680
atgggcatga accaaatgct gccaccgcag caaccctccc atcttggtcc cccgcatccg 1740
aatatgatga accacccgca tcatccgcac catcatcctg gcggaccacc gccgcacatg 1800
atgggtggac ccggaatgca cggcggtcct gctggaatgc ctcctcatat gggcggagga 1860
cctaatccgc acatgatggg cggtccgcac gggaacgcgg gtccgcacat gggccacggc 1920
cacatgggtg gagtaccagg tccaggaccc ggacccggcg gcatgaacgg acccccgcat 1980
ccgcacatgt ccccgcacca cggacatccg catcaccacc acaatccgat gggcggccca 2040
ggtccaaata tgttcggcgg tggtggagga ggtcccatgg gtcccggtgg accgatgggc 2100
aacatggggc ccatgggagg tggcccgatg ggcggcccta tgggcgtagg tcccaagccg 2160
atgacaatgg gcggcgggaa gatgtacccg ccgggacagc caatggtctt taatccgcag 2220
aacccgaatg cgccgcccat atatccttgt ggcatgtgcc acaaggaggt gaacgacaac 2280
gacgaagccg tgttctgtga atccggttgt aactttttct ttcacagaac ctgtgttggc 2340
ctgacagagg cggccttcca aatgctcaac aaggaggtgt ttgccgagtg gtgctgcgac 2400
aagtgcgtgt cttccaagca tattcccatg gtcaagttca agtgttga 2448
2
1366
DNA
Homo sapiens
2
ggatccccac atgcccgccg agaactctcc agctcccgct tacaaagttt cctcgcatgg 60
tggtgatagt ggactggatg ggttaggagg accaggtgta caactaggaa gcccagataa 120
gaaaaagcgc aaggcaaata cacagggacc ttctttccct ccattgtctg agtatgctcc 180
accaccgaat ccaaactctg accatctagt ggctgctaat ccatttgatg acaactataa 240
tactatttcc tataaaccac taccttcgtc aaatccatat cttggccctg gttatcctgg 300
ctttggaggc tatagtacat tcagaatgcc acctcacgtt cccccaagaa tgtcttcccc 360
atactgtggt ccttactcac tcaggaacca gccacaccca tttcctcaga atcctctggg 420
catgggtttt aatcgacctc atgcttttaa ctttgggcca catgataatt caagtttcgg 480
taatccatct tataataatg cactaagtca gaatgtcaac atgcctaatc aacattttag 540
acaaaatcct gctgaaaatt tcagtcaaat tcctccacag aatgctagcc aagtttctaa 600
ccccgatttg gcatctaatt ttgttcctgg aaataattca aattttactt ctccgttaga 660
atctaatcat tcttttattc ctcccccaaa cacttttggt caagcaaaag caccaccccc 720
aaaacaagac tttactcaag gagcaaccaa aaacactaat caaaattcct ctgctcatcc 780
acctcacttg aatatggatg acacagtgaa tcagagtaat attgaattaa aaaatgttaa 840
tcgaaacaat gcagtaaatc aggagaacag ccgttcaagt agcactgaag ccacaaacaa 900
taaccctgca aatgggacgc agaataagcc acgacaacca agaggtgcag cagatgcctg 960
caccacagaa aaaagcaata aatcctctct tcacccaaac cgtcatggcc attcgtcttc 1020
tgacccagtg tatccttgtg gaatttgtac aaacgaggtg aacgatgatc aggatgccat 1080
cttatgtgag gcctcttgtc agaaatggtt tcatcggatc tgtactggaa tgactgaaac 1140
agcttatggc ctcttaactg cagaagcatc tgcagtatgg ggctgtgata cctgtatggc 1200
tgacaaagat gtccagttaa tgcgtactag agaaactttt ggtccatctg cagtgggcag 1260
tgatgcttaa tcaaaggcat taactaaagt gggtttattt tcctgtgcat tgcagaagtt 1320
cattgacaca ggattttaat gttttacatt atttttttaa atgcat 1366
3
1449
DNA
Homo sapiens
3
cccgggtccc ccactccatg gccgcctcgg cgccgccccc accggacaag ctggagggag 60
gtggcggccc cgcaccgccc cctgcgccgc ccagcaccgg gaggaagcag ggcaaggccg 120
gtctgcaaat gaagagtcca gaaaagaagc gaaggaagtc aaatactcag ggccctgcat 180
actcacatct gacggagttt gcaccacccc caactcccat ggtggatcac ctggttgcat 240
ccaacccttt tgaagatgac ttcggagccc ccaaagtggg ggttgcagcc cctccattcc 300
ttggcagtcc tgtgcccttc ggaggcttcc gtgtgcaggg gggcatggcg ggccaggtac 360
ccccaggcta cagcactgga ggtggagggg gcccccagcc actccgtcga cagccacccc 420
ccttccctcc caatcctatg ggccctgctt tcaacatgcc cccccagggt cctggctacc 480
cacccccagg caacatgaac tttcccagcc aacccttcaa ccagcctctg ggtcaaaact 540
ttagtcctcc cagtgggcag atgatgccgg gcccagtggg gggatttggt cccatgatct 600
cacccaccat gggacagcct cccagagcag agctgggccc accttctctg tcccaacgat 660
ttgctcagcc aggggctcct tttggccctt ctcctctcca gagacctggt caggggctcc 720
ccagcctgcc gcctaacaca agtccctttc ctggtccgga ccctggcttt cctggccctg 780
gtggtgagga tggggggaag cccttgaatc cacctgcttc tactgctttt ccccaggagc 840
cccactcagg ctccccggct gctgctgtta atgggaacca gcccagtttc cccccgaaca 900
gcagtgggcg gggtgggggc actccagatg ccaacagctt ggcaccccct ggcaaggcag 960
gtgggggctc cgggccccag cctcccccag gcttggtgta cccatgtggt gcctgtcgga 1020
gtgaggtgaa cgatgaccag gatgccattc tgtgtgaggc ctcctgccag aaatggttcc 1080
accgtgagtg cacaggcatg actgagagcg cctatgggct gctgaccact gaagcttctg 1140
ccgtctgggc ctgcgatctc tgcctcaaga ccaaggagat ccagtctgtc tacatccgtg 1200
agggcatggg gcagctggtg gctgctaacg atgggtgacg ctggtgaagt ggcccaggga 1260
agtgcacatg tctctccctg ctcttccagg gtgatttttt tgatgtttgg ctcttggtcc 1320
ttgtttccac tggctttcca tccccatggg gcagaaacag tggctcctgg gagcagaaaa 1380
ggaattgagg tgggcaggca gaagagcctg gattgctcac tgttttggga aacttacatg 1440
ttgagatct 1449
4
1254
DNA
Mus musclus
4
atgtcagcgg aacaggacaa ggagcccatc gcgctgaaga gagttagagg tggtgacagt 60
ggactggatg ggttaggagg gcccaatata caactcggaa gcccagataa gaaaaaacgc 120
aaggccaaca cacagggatc ttcctttcct ccgctgtcag agtacgcgcc accaccgaat 180
ccaaactccg accatctagt ggctgccaat ccgttcgatg acagctacaa caccatttcc 240
tataaaccac tgccttcatc taatccatat cttggccctg ggtatcctgg ctttggaggc 300
tacagcacat tcagaatgcc accccacgtc cctccaagaa tgtcttctcc ctactgtggt 360
ccttactcac tcaggaatca gccccaccca tttcctcaga atccgttggg catgggcttt 420
aaccggcctc atgcttttaa ctttgggccg catgataatt cgaattttgg aaacccacct 480
tataataatg tactgactca ggacattaac atgcccggtc agcattttag acaaggctct 540
gctgaaaact tcagtcagat tcccccgcag aatgttggcc aagtgtctaa ccctgacctc 600
gcatctaatt ttgcccctgg gaataattca aattttacct ctccgttaga aacgaatcat 660
tcgtttattc cacccccaaa cgcgtttggc caagcaaaag ctccacttcc caaacaagac 720
ttcactcaag gggcaaccaa aaccccgaat cagaattcgt ccactcaccc acctcaccta 780
aatatggagg atccagtcaa tcagagtaac gtcgagttaa aaaatgtcaa cagaaacaac 840
gttgtccaag agaacagccg ttcgggcagc gcagaggcca ccaacaacca tgcgaatggg 900
acccagaaca agccccggca gcccaggggc gcagctgacc tgtgcacccc cgacaaaagc 960
cgcaagttct ccctgctccc cagccggcat ggccattcct cctctgaccc tgtgtacccg 1020
tgcgggattt gtacaaatga agtgaatgac gatcaggacg ccattctgtg tgaagcctct 1080
tgtcagaagt ggtttcatcg catctgcact ggaatgaccg aaacagccta cgggctcctg 1140
acagcggaag catccgcagt gtggggctgt gacacgtgca tggctgacaa ggatgtccag 1200
ctcatgcgca ctagagaggc ctttggtcca cctgccgtgg gcggcgatgc ctaa 1254
5
1221
DNA
Mus musclus
5
atggccgcct cggcgccgcc cccaccggac aagctggagg gaggcagcgg ccccgcaccg 60
ccccccgcgc cgcccagcac cgggaggaag cagggcaagg ccggtctgca aatgaagagc 120
cctgaaaaga agcgaagaaa gtccaatact cagggtcctg catattcaca tctgacggag 180
ttcgccccac ccccgacccc catggtggat catctggtcg cttctaaccc ttttgaggat 240
gatttcggag cccctaaagt ggggggcgca ggccctccgt tcctcggcag tccggtgccc 300
tttggaggct ttcgtgtaca ggggggcatg gcaggccagg tacccccaag ctacggcact 360
ggaggaggag ggggtcccca gccacttcgt cggcagcccc ctccttttcc ccccagccct 420
atgggtccag cttttaatat gccccctcag ggtccctggg gtaccccgcc ccctggcaac 480
atgaactttc ccagtcaacc cttcaaccag tctctgggcc aaaactttag cccacctggt 540
gggcaggtga tgccaggccc agtaggcgga tttggtccca tgatctcacc gaccatggga 600
cagcctccta gaggggagct gggtcctcct cctctccccc aacgctttac ccaaccagga 660
gcaccttatg gtccttctct tcagagacct ggtcagggac tcacccagct gccctccaac 720
acaagtccct tccctggtcc agaccctggt tttcctggac ctggcggtga ggatggtggg 780
aagcccttga acccaccggc tcccaccgcc tttccccagg aagcaccatt cgggctcccc 840
gctgctgctg tcaatgggaa tcagcccagt ttccccccta gcagcagtgg tcgaggtggg 900
ggcactccag atgccaacag tctggcaccc cccggcaagg cagggggagg ctcagggccc 960
cagcctcccc caggcctggt gtacccctgc ggtgcctgcc gtagtgaggt aaatgatgac 1020
caggatgcca ttctgtgtga ggcctcctgc cagaagtggt ttcaccgcga gtgcaccggc 1080
atgaccgaga gtgcctacgg cctgctgacc accgaggcct ctgccgtctg ggcctgtgat 1140
ctttgcctca agaccaagga gatccagtct gtctacatcc gagagggcat gggccagttg 1200
gtggctgcta acgatgggtg a 1221
6
815
PRT
Drosophila melanogaster
6
Met Thr His Asn Leu Gly Met Ala Pro Tyr Arg Leu Pro Gly Pro Ala
1 5 10 15
Gly Gly Leu Cys Pro Pro Asp Phe Lys Pro Pro Pro Pro Thr Asp Ile
20 25 30
Ile Ser Ala Pro Ser Asn Pro Lys Lys Arg Arg Lys Thr Ser Ser Ala
35 40 45
Ala Asn Ser Ala Ala Ala Val Ala Ala Ala Ala Ala Ala Ala Ala Ala
50 55 60
Ala Asn Ser Met Gln Gln Gln Gln Ala Pro Pro Thr Pro Gln Asp Leu
65 70 75 80
Leu Pro Pro Pro Pro Met Gly Gly Phe Gly Asp Thr Ile Ile Ala Ser
85 90 95
Asn Pro Phe Asp Asp Ser Pro Gln Val Ser Ala Met Ser Ser Ser Ala
100 105 110
Ala Ala Ala Met Ala Ala Met Asn Gln Met Gly Gly Gly Pro Gly Gly
115 120 125
Gly His Phe Gly Gly Gly Gly Pro Gly Gly His Pro His Trp Glu Asp
130 135 140
Arg Met Gly Met Gly Gly Gly Pro Pro Pro Pro Pro His Met His Pro
145 150 155 160
His Met His Pro His His Pro Gly Gly Pro Met Gly His Pro His Gly
165 170 175
Pro His Pro His Met Gly Gly Pro Pro Pro Met Arg Gly Met Ser Pro
180 185 190
Met His Pro His Gln Met Gly Pro Gly Pro Gly Val Gly Leu Pro Pro
195 200 205
His Met Asn His Gly Arg Pro Gly Gly Pro Gly Gly Pro Gly Gly Pro
210 215 220
Val Pro Met Gly Ser Pro Met Gly Gly Ile Ala Gly Met Gly Gly Met
225 230 235 240
Ser Pro Met Gly Gly Met Gly Gly Pro Ser Ile Ser Pro His His Met
245 250 255
Gly Met Gly Gly Leu Ser Pro Met Gly Gly Gly Pro Asn Gly Pro Asn
260 265 270
Pro Arg Ala Met Gln Gly Ser Pro Met Gly Gly Pro Gly Gln Asn Ser
275 280 285
Pro Met Asn Ser Leu Pro Met Gly Ser Pro Met Gly Asn Pro Ile Gly
290 295 300
Ser Pro Leu Gly Pro Pro Ser Gly Pro Gly Pro Gly Asn Pro Gly Asn
305 310 315 320
Thr Gly Gly Pro Gln Gln Gln Gln Gln Gln Pro Pro Gln Pro Pro Met
325 330 335
Asn Asn Gly Gln Met Gly Pro Pro Pro Leu His Ser Pro Leu Gly Asn
340 345 350
Gly Pro Thr Gly His Gly Ser His Met Pro Gly Gly Pro Ile Pro Gly
355 360 365
Pro Gly Pro Gly Pro Gly Gly Leu Val Gly Pro Gly Gly Ile Ser Pro
370 375 380
Ala His Gly Asn Asn Pro Gly Gly Ser Gly Asn Asn Met Leu Gly Gly
385 390 395 400
Asn Pro Gly Gly Gly Asn Ser Asn Asn Asn Gly Ser Asn Thr Ser Asn
405 410 415
Ala Ser Asn Asn Asn Gln Asn Pro His Leu Ser Pro Ala Ala Gly Arg
420 425 430
Leu Gly Val Pro Thr Ser Met Gln Ser Asn Gly Pro Ser Val Ser Ser
435 440 445
Val Ala Ser Ser Ser Val Pro Ser Pro Ala Thr Pro Thr Leu Thr Pro
450 455 460
Thr Ser Thr Ala Thr Ser Met Ser Thr Ser Val Pro Thr Ser Ser Pro
465 470 475 480
Ala Pro Pro Ala Met Ser Pro His His Ser Leu Asn Ser Ala Gly Pro
485 490 495
Ser Pro Gly Met Pro Asn Ser Gly Pro Ser Pro Leu Gln Ser Pro Ala
500 505 510
Gly Pro Asn Gly Pro Asn Asn Asn Asn Ser Asn Asn Asn Asn Gly Pro
515 520 525
Met Met Gly Gln Met Ile Pro Asn Ala Val Pro Met Gln His Gln Gln
530 535 540
His Met Gly Gly Gly Pro Pro Gly His Gly Pro Gly Pro Met Pro Gly
545 550 555 560
Met Gly Met Asn Gln Met Leu Pro Pro Gln Gln Pro Ser His Leu Gly
565 570 575
Pro Pro His Pro Asn Met Met Asn His Pro His His Pro His His His
580 585 590
Pro Gly Gly Pro Pro Pro His Met Met Gly Gly Pro Gly Met His Gly
595 600 605
Gly Pro Ala Gly Met Pro Pro His Met Gly Gly Gly Pro Asn Pro His
610 615 620
Met Met Gly Gly Pro His Gly Asn Ala Gly Pro His Met Gly His Gly
625 630 635 640
His Met Gly Gly Val Pro Gly Pro Gly Pro Gly Pro Gly Gly Met Asn
645 650 655
Gly Pro Pro His Pro His Met Ser Pro His His Gly His Pro His His
660 665 670
His His Asn Pro Met Gly Gly Pro Gly Pro Asn Met Phe Gly Gly Gly
675 680 685
Gly Gly Gly Pro Met Gly Pro Gly Gly Pro Met Gly Asn Met Gly Pro
690 695 700
Met Gly Gly Gly Pro Met Gly Gly Pro Met Gly Val Gly Pro Lys Pro
705 710 715 720
Met Thr Met Gly Gly Gly Lys Met Tyr Pro Pro Gly Gln Pro Met Val
725 730 735
Phe Asn Pro Gln Asn Pro Asn Ala Pro Pro Ile Tyr Pro Cys Gly Met
740 745 750
Cys His Lys Glu Val Asn Asp Asn Asp Glu Ala Val Phe Cys Glu Ser
755 760 765
Gly Cys Asn Phe Phe Phe His Arg Thr Cys Val Gly Leu Thr Glu Ala
770 775 780
Ala Phe Gln Met Leu Asn Lys Glu Val Phe Ala Glu Trp Cys Cys Asp
785 790 795 800
Lys Cys Val Ser Ser Lys His Ile Pro Met Val Lys Phe Lys Cys
805 810 815
7
419
PRT
Homo sapiens
7
Met Pro Ala Glu Asn Ser Pro Ala Pro Ala Tyr Lys Val Ser Ser His
1 5 10 15
Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Gly Val Gln Leu
20 25 30
Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Pro Ser
35 40 45
Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp
50 55 60
His Leu Val Ala Ala Asn Pro Phe Asp Asp Asn Tyr Asn Thr Ile Ser
65 70 75 80
Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro
85 90 95
Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro
100 105 110
Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125
His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140
Ala Phe Asn Phe Gly Pro His Asp Asn Ser Ser Phe Gly Asn Pro Ser
145 150 155 160
Tyr Asn Asn Ala Leu Ser Gln Asn Val Asn Met Pro Asn Gln His Phe
165 170 175
Arg Gln Asn Pro Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Ala
180 185 190
Ser Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Val Pro Gly Asn
195 200 205
Asn Ser Asn Phe Thr Ser Pro Leu Glu Ser Asn His Ser Phe Ile Pro
210 215 220
Pro Pro Asn Thr Phe Gly Gln Ala Lys Ala Pro Pro Pro Lys Gln Asp
225 230 235 240
Phe Thr Gln Gly Ala Thr Lys Asn Thr Asn Gln Asn Ser Ser Ala His
245 250 255
Pro Pro His Leu Asn Met Asp Asp Thr Val Asn Gln Ser Asn Ile Glu
260 265 270
Leu Lys Asn Val Asn Arg Asn Asn Ala Val Asn Gln Glu Asn Ser Arg
275 280 285
Ser Ser Ser Thr Glu Ala Thr Asn Asn Asn Pro Ala Asn Gly Thr Gln
290 295 300
Asn Lys Pro Arg Gln Pro Arg Gly Ala Ala Asp Ala Cys Thr Thr Glu
305 310 315 320
Lys Ser Asn Lys Ser Ser Leu His Pro Asn Arg His Gly His Ser Ser
325 330 335
Ser Asp Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp
340 345 350
Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His
355 360 365
Arg Ile Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala
370 375 380
Glu Ala Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp
385 390 395 400
Val Gln Leu Met Arg Thr Arg Glu Thr Phe Gly Pro Ser Ala Val Gly
405 410 415
Ser Asp Ala
8
406
PRT
Homo sapiens
8
Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Gly
1 5 10 15
Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30
Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45
Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60
Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp
65 70 75 80
Asp Phe Gly Ala Pro Lys Val Gly Val Ala Ala Pro Pro Phe Leu Gly
85 90 95
Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110
Gln Val Pro Pro Gly Tyr Ser Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125
Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Asn Pro Met Gly Pro Ala
130 135 140
Phe Asn Met Pro Pro Gln Gly Pro Gly Tyr Pro Pro Pro Gly Asn Met
145 150 155 160
Asn Phe Pro Ser Gln Pro Phe Asn Gln Pro Leu Gly Gln Asn Phe Ser
165 170 175
Pro Pro Ser Gly Gln Met Met Pro Gly Pro Val Gly Gly Phe Gly Pro
180 185 190
Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Ala Glu Leu Gly Pro
195 200 205
Pro Ser Leu Ser Gln Arg Phe Ala Gln Pro Gly Ala Pro Phe Gly Pro
210 215 220
Ser Pro Leu Gln Arg Pro Gly Gln Gly Leu Pro Ser Leu Pro Pro Asn
225 230 235 240
Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly
245 250 255
Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Ser Thr Ala Phe Pro
260 265 270
Gln Glu Pro His Ser Gly Ser Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285
Pro Ser Phe Pro Pro Asn Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp
290 295 300
Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro
305 310 315 320
Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335
Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350
Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu
355 360 365
Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys
370 375 380
Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400
Val Ala Ala Asn Asp Gly
405
9
417
PRT
Mus musclus
9
Met Ser Ala Glu Gln Asp Lys Glu Pro Ile Ala Leu Lys Arg Val Arg
1 5 10 15
Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Asn Ile Gln Leu
20 25 30
Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Ser Ser
35 40 45
Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp
50 55 60
His Leu Val Ala Ala Asn Pro Phe Asp Asp Ser Tyr Asn Thr Ile Ser
65 70 75 80
Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro
85 90 95
Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro
100 105 110
Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125
His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140
Ala Phe Asn Phe Gly Pro His Asp Asn Ser Asn Phe Gly Asn Pro Pro
145 150 155 160
Tyr Asn Asn Val Leu Thr Gln Asp Ile Asn Met Pro Gly Gln His Phe
165 170 175
Arg Gln Gly Ser Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Val
180 185 190
Gly Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Ala Pro Gly Asn
195 200 205
Asn Ser Asn Phe Thr Ser Pro Leu Glu Thr Asn His Ser Phe Ile Pro
210 215 220
Pro Pro Asn Ala Phe Gly Gln Ala Lys Ala Pro Leu Pro Lys Gln Asp
225 230 235 240
Phe Thr Gln Gly Ala Thr Lys Thr Pro Asn Gln Asn Ser Ser Thr His
245 250 255
Pro Pro His Leu Asn Met Glu Asp Pro Val Asn Gln Ser Asn Val Glu
260 265 270
Leu Lys Asn Val Asn Arg Asn Asn Val Val Gln Glu Asn Ser Arg Ser
275 280 285
Gly Ser Ala Glu Ala Thr Asn Asn His Ala Asn Gly Thr Gln Asn Lys
290 295 300
Pro Arg Gln Pro Arg Gly Ala Ala Asp Leu Cys Thr Pro Asp Lys Ser
305 310 315 320
Arg Lys Phe Ser Leu Leu Pro Ser Arg His Gly His Ser Ser Ser Asp
325 330 335
Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp Asp Gln
340 345 350
Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His Arg Ile
355 360 365
Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala Glu Ala
370 375 380
Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp Val Gln
385 390 395 400
Leu Met Arg Thr Arg Glu Ala Phe Gly Pro Pro Ala Val Gly Gly Asp
405 410 415
Ala
10
406
PRT
Mus musclus
10
Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Ser
1 5 10 15
Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30
Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45
Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60
Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp
65 70 75 80
Asp Phe Gly Ala Pro Lys Val Gly Gly Ala Gly Pro Pro Phe Leu Gly
85 90 95
Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110
Gln Val Pro Pro Ser Tyr Gly Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125
Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Ser Pro Met Gly Pro Ala
130 135 140
Phe Asn Met Pro Pro Gln Gly Pro Trp Gly Thr Pro Pro Pro Gly Asn
145 150 155 160
Met Asn Phe Pro Ser Gln Pro Phe Asn Gln Ser Leu Gly Gln Asn Phe
165 170 175
Ser Pro Pro Gly Gly Gln Val Met Pro Gly Pro Val Gly Gly Phe Gly
180 185 190
Pro Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Gly Glu Leu Gly
195 200 205
Pro Pro Pro Leu Pro Gln Arg Phe Thr Gln Pro Gly Ala Pro Tyr Gly
210 215 220
Pro Ser Leu Gln Arg Pro Gly Gln Gly Leu Thr Gln Leu Pro Ser Asn
225 230 235 240
Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly
245 250 255
Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Pro Thr Ala Phe Pro
260 265 270
Gln Glu Ala Pro Phe Gly Leu Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285
Pro Ser Phe Pro Pro Ser Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp
290 295 300
Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro
305 310 315 320
Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335
Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350
Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu
355 360 365
Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys
370 375 380
Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400
Val Ala Ala Asn Asp Gly
405
You are contracting for New essential downstream component of the wingless signalling pathway
Expert New essential downstream component of the wingless signalling pathway
You are commenting for New essential downstream component of the wingless signalling pathway