Abstract
Abstract
We have conducted a screen to identify developmentally regulated enhancers that drive tissue-specific Gal4 expression in zebrafish. We obtained 63 stable transgenic lines with expression patterns in embryonic or adult zebrafish. The use of a newly identified minimal promoter from the medaka edar locus resulted in a relatively unbiased set of expression patterns representing many tissue types derived from all germ layers. Subsequent detailed characterization of selected lines showed strong and reproducible Gal4-driven GFP expression in diverse tissues, including neurons from the central and peripheral nervous systems, pigment cells, erythrocytes, and peridermal cells. By screening adults for GFP expression, we also isolated lines expressed in tissues of the adult zebrafish, including scales, fin rays, and joints. The new and efficient minimal promoter and large number of transactivating driver-lines we identified will provide the zebrafish community with a useful resource for further enhancer trap screening, as well as precise investigation of tissue-specific processes in vivo.
Introduction
In exchange for a reporter protein, the addition of the Gal4 transcriptional activator from yeast strengthened the use of such enhancer screens by further allowing tissue-specific manipulation of embryos in vivo. 8 When Gal4 is expressed in the tissue of interest, it facilitates the transcription of genes directly downstream of the Upstream Activating Sequence (UAS). In this way, it is possible to achieve spatial and temporal control over the expression of effector constructs fused to a UAS cassette. By simply crossing a stable transgenic line expressing the Gal4 driver to a line expressing the UAS:effector transgene, the effector construct will be activated only in the cell type and at the time when the Gal4 protein is present. Not only do these lines allow for more detailed observation of developmental processes in vivo, but the driver lines in particular are a powerful tool for real-time manipulation of development through the targeted expression of effector constructs such as endogenous genes, 9 subcellular or physiological markers, 10 ablation or synaptic antagonist toxins,11,12 photoswitchable fluorophores, 13 or optico-genetic molecules that can be used for controlling neuronal activity. 14
Although broad-based enhancer screens have been widely applied to many nonvertebrate model systems,15–19 they have only been moderately used in mouse due to the difficulties of large-scale transgenesis and embryonic expression screening in mammals.20–22 However, the use of enhancer detection in zebrafish has expanded dramatically in recent years, due to the fish's transparency, external development, and amenability to high-throughput screening.23–25 These advantages have long been employed for systematic genetic screens,26–28 but advances in retroviral29,30 and transposon 31 based transgenesis techniques have enabled the efficient and large-scale detection of enhancer regions in the zebrafish genome,23,25,32 and the identification of transactivating Gal4-driver lines for targeted transgene expression in a tissue-specific manner.11,33–36
The initial enhancer detection screens in fish were performed to optimize transposon-based transgenesis techniques with simple reporter transgenes in medaka 37 and zebrafish.25,38 In spite of the successful and continuing use of transposon-based methodologies, the first large-scale enhancer screen using a fluorescent reporter was carried out with a murine leukemia retrovirus (MLV) that was engineered for zebrafish transgenesis. 23 With this method, about 95 distinct transgenic YFP-expressing lines were obtained with a diverse set of tissue-specific and ubiquitous expression patterns. 23 However, because of the specialized training, equipment, and permitting that is required to work with MLV, it is not used as commonly as transposon-based methods.
Transposons have been an essential tool of Drosophila genetics for decades, but the use of transposons in zebrafish has been limited, until recently, by the lack of transposable elements in the zebrafish genome. 39 The distinct advantages of transposon-based systems for enhancer detection and routine transgenesis such as their simplicity and efficiency, resulted in the development of diverse transposon vectors (e.g., Sleeping Beauty,40,41 Tc3, 42 AC-DS, 43 and Tol2 44 ) that allowed for high frequency chromosomal integration in germ cells. The Tol2 system in particular, which is derived from an endogenous medaka transposable element, has dramatically increased the efficiency and ease of both making transgenic lines and mapping their integration sites.
Here we present the results of our Tol2 transposon-based Gal4-driver line screen with a minimal promoter derived from the medaka edar locus, which demonstrates tissue-specific expression in lines encompassing a diverse set of cell types in embryonic, larval, and adult zebrafish. Our lines will serve as an important supplement to the growing driver line collection in the zebrafish community and will strengthen the position of the zebrafish as an excellent model for studying fundamental developmental processes through in vivo imaging and tissue-specific genetic manipulations. Additionally, the use of this promoter with refined Gal4 UAS constructs will aid future enhancer screens in the zebrafish.
Materials and Methods
Cloning of the enhancer detection construct
The promoter used to generate the TDL driver lines was derived from the immediate upstream region of the first exon of the medaka edar gene. An analysis of conserved elements in the edar locus of the zebrafish compared with medaka showed three elements sharing long stretches of identity. Whereas the medaka elements were all clustered in the 5′ upstream region, in zebrafish they were found duplicated and scattered within the first intron as well. To test the function of these elements, we isolated them from genomic DNA in three parts—one containing all elements (1140 bp), the two proximal elements (787 bp), and the most proximal element (627 bp) to the edar transcript. We cloned these putative promoters in front of Gal4 cassette in cis to 14xUAS driving GFP. 54 These constructs were injected with Tol2 mRNA into single-celled embryos to facilitate transgenesis. Analysis of F1 transgenic progeny showed specific expression patterns in a broad range of tissue types. This suggests that the promoter constructs contained little specific regulation, however had sufficient information to promote expression with the specificity gained from the particular insertion site. Of the three different promoter constructs, we found no obvious difference in the types of expression or frequency of insertion/transgenesis in any of the transgenic progeny screened. The smallest proximal element showed the most robust expression and all further screening of transgenic lines used this minimal promoter element. The sequence of this proximal element is as follows:
5′TGACCTGAACCGGAACATCAGTGGGGGGACCCTTCGGTGTGGAGTTTGCATGTTCTCCCTGTGCAGGTATGGGTTCTCTCTGGGAACTCTGGCTTCCTCCCACCGTCCAAAAACATGCTTCATAGGTCAATTGGCAACTCTAAATTGTCCATAGATGTGGGTGTGAGAGTGAGTGGATGTGTGATATTTACATACACACATACATATTTGTTTTTGCAAATATTTTATTTAAAAAACATACATAGAACTCCACTAACCATTTCCAGCAGTTTGCATGCTTTGGCTCCTCCACTACATGTTATTGCATGCCTGCAAACACTAGATGGCGTGGTCGAGTCAGGCTCTGCAAACTTTTTTTTTTTTTTTTGACAGAGTTTACCTGGGACGCTTTAATCAGTTTGTCTTCCAGGGCGGAGAAAACTTCAAGGAGGAGCCTGCAGACACACCGACATGCGTCCAAACGGAGGCTGAGACACAACTTAGATTGTGTGAAGTTGAACCTTTGAGACTTGCCGGTCCGTTTGGGGATGCTACAGGAACGCCGGGTCAGAGACTGAGCGGTGACCATA3′.
Screening of F1 progeny
Mating crosses of albino mutant zebrafish were set up according to standard procedures 55 and single-cell embryos were injected with 30 ng/μL of Tol2 RNA and 30 ng/μL of plasmid DNA from the enhancer trap construct in volumes of 1 nL per embryo. Capped Tol2 RNA was generated with a mMessage mMachine RNA kit (Applied Biosystems, Darmstadt Germany) according to manufacturer protocols. All injected larvae that survived up to 7 days post fertilization (dpf) were raised, and F1 progeny were screened for GFP expression every day from 24 hpf up until 7 dpf, and then once again at 30 dpf. All larvae showing GFP expression were raised and then re-identified in the F2 generation.
Mapping of insertion sites
The insertion sites were determined by thermal asymmetric interlaced (TAIL) PCR as described in Parinov et al. (2004) 25 with the following modifications: 100 ng genomic DNA extracted from 20, 5 dpf larvae or caudal fin clips of adults showing GFP expression were used as template for primary PCR reactions; PCR products of secondary or tertiary PCR reactions were gel-purified (Wizard SV Gel and PCR Clean-Up System, Promega), subcloned into pGEMT-easy vector (Promega), and sequenced using the standard primers M13uni and M13rev. All sequences were analyzed for the presence of either Tol2-5′ or -3′ flanking sequences, and the cloned zebrafish genomic sequences were BLASTed to the zebrafish genome to identify the insertion site as seen in Table 2 (Ensemble release Zv9 http://www.ensembl.org/Danio_rerio/). For some lines we found two genomic positions, however it is not clear which is responsible for the GFP expression pattern.
Cloning of the mitochondrial-localized construct, MLS-DsRed
The MLS-DsRed construct, 55 which contains a mitochondrial targeting sequence (MLS) to target DsRed to the mitochondrial matrix, was digested with EcoRV/XhoI and inserted into vector pT2K, which contains Tol2 transposase recognition elements. 32 This construct was designated pT2K-MLSDsRed. For targeted gene expression using the GAL4-UAS system, a fragment containing nine consecutive Gal4 upstream activator sequences was amplified by PCR and cloned upstream of the MLS-DsRed at SacII and SpeI sites. In cells with GAL4 expression, the mitochondrial network is marked by DsRed and visualized with the Texas Red filter set.
Imaging
Confocal imaging of live larvae was done after mounting them in 1% low-melt agarose in glass-bottom dishes on a Zeiss LSM-510 confocal microscope. Larvae from fixed fish (i.e., TDL244, 3, 22, and 74) were stained with the zn-12 (HNK1) antibody 57 according to standard techniques, 56 and imaged on glass slides with a LSM-510 confocal.
Results
Identification of an efficient minimal promoter and the generation of 63 Gal4 driver lines
In order to identify a new minimal promoter for enhancer detection, we cloned out two viral promoters (i.e., pTAL 45 and Sv4046,47) that are commonly fused to cis-regulatory regions to drive reporter proteins in cell culture 48 and in vivo. 49 Given the low recovery rate of transgenic lines from these constructs (data not shown), we decided to focus our attention on endogenous promoters from teleost fish. Our previous work with the ectodysplasin receptor (edar) gene in medaka and zebrafish prompted us to evaluate the regulation of its expression. 50 We observed that a 627 bp minimal promoter from this locus was not capable of independently driving expression of a reporter transgene without additional enhancer elements (data not shown, Materials and Methods). However, random integration of this construct into the genome resulted in a large number of strong and distinct expression patterns with very little nonspecific background. Thus, this minimal promoter provided an enhancer detection construct that could be used efficiently to generate tissue-specific Gal4-driver lines. We named all transgenic lines coming from this screen TDL, for Tuebingen Driver Lines.
We injected this construct along with Tol2 RNA into single-cell stage albino embryos and raised a total of 380 adult zebrafish for GFP screening of F1 progeny at 1–7 days post fertilization (dpf) and again at 30 dpf. We screened in the albino background in order to reduce the obstructing effect of melanin in larval and adult stages. We obtained a total of 63 driver lines representing all germ layers and a diverse set of tissue types (Table 1). The success rate of about 17% enhancer detection from stable integrated lines corresponds with previous Tol2 transposon-based enhancer trap screens that range from 12% to 36% transmission.25,33,36 Many founders had multiple insertions, as demonstrated by the presence of embryos with different expression patterns in the F1 clutch and subsequent segregation of the lines upon outcrossing. There was also considerable variability in the efficiency of germ line transmission in individual F0 fish, as could be seen in the number of individual F1 progeny within a clutch exhibiting a specific expression pattern. In the case of TDL296, only a single individual F1 fish was observed with the given expression pattern, but upon raising and outcrossing that individual, it was possible to maintain three stable transgenic lines from independent insertions (Table 2). In other cases, the majority of the larvae in a particular clutch exhibited a specific expression pattern, and several individuals were raised to maintain the line. However, in most cases, only between 5 and 10 F1 fish per clutch expressed the transgene in a similar way and the other larvae either showed no expression, or in rare cases the segregation of another expression pattern (e.g., TDL302 and TDL45). Besides notochord expression, which we observed in eight driver lines, the hatching gland represented the most frequent expression found in larvae of our screen.
GFP expression was screened from 1–7 days post fertilization (dpf) and in 30 dpf fish of progeny from founder fish injected with the TDL plasmid. The tissues in which GFP expression was observed, as well as the number of lines (and their identification number) with that expression pattern are shown.
Selected lines with specific expression and minimally variegating patterns were chosen for genomic mapping of insertion sites using TAIL PCR. Some mappings resulted in a lack of clear TAIL PCR product (*). The stage in hpf (hours post fertilization) indicates the time point when the GFP expression pattern is fully developed, although GFP expression can start earlier.
Although we were able to identify a large number of driver lines with consistent and specific expression pattern, variegation and generational silencing were detected in many instances. Previous studies have noted general background expression with other minimal promoters such as the keratin 8 promoter, 25 or the heat shock-70kD promoter, 35 which could cause problems for downstream applications such as the ectopic expression of a second transgene in a tissue-specific manner. However, we have observed very little to no basal expression using the edar minimal promoter construct. In contrast, the variegation that we did observe was more often spurious GFP expression in a few individual muscle fibers or notochord cells, and the absence of GFP in a number of cells in which the transgene was normally expressed. Because of these two types of variegation (i.e., spurious expression, and patchy expression), only a subset of the most robust and specific lines were kept for subsequent detailed analysis (Table 1).
Tissue-specific expression patterns of selected driver lines
In order to more accurately describe the expression patterns of some of the transgenic lines identified in our enhancer detection screen, we performed confocal microscopy on selected lines at early larval stages and more detailed fluorescence microscopy of adult tissues (Fig. 1). Lower resolution images of a subset of the lines that were identified can be seen in Supplementary Figure S1 (Supplementary Data are available online at www.liebertpub.com/zeb). The TDL driver lines shown in Figure 1 demonstrate the typical lack of low level background expression present in most of the lines generated from the driver construct used in this study. In addition, the specificity of some lines for particular cell types can be readily observed, for example, in the case of TDL6, which labels primary motor neurons (PMN) from 24 hpf onwards (Fig. 1A). Although GFP fluorescence appears to be both strong and specific in TDL6 in all three PMN at 24 hpf (Fig. 1A′, 1A″), variegation occurs in the form of absent expression in some PMN throughout the embryo. In these cases, only two of the three primary motor neurons are labeled (Fig. 1A′), where the middle primary can be seen extending dorsally and the caudal primary migrating ventrally through the choice point past the horizontal myoseptum. In another example, neither the Middle Primary nor the Caudal Primary is labeled, but the Rostral Primary, which is the third PMN, specifically expresses GFP (Fig. 1A″). In spite of this early variegation, by 5 dpf most of the larval PMNs express GFP (Fig. 1A).

GFP expression of selected Tuebingen driver lines (TDL).
Confocal maximum intensity projections from TDL42 show GFP in multiple cell types of the myogenic lineage, from myoblasts to differentiated muscle fibers (Fig. 1B). Line TDL358 was identified as being expressed in Schwann cells of the posterior lateral line system and in iridophores in 5 dpf larvae (Fig. 1C). This line shows similar expression in both larval (Fig. 1C) and 30 dpf fish (Fig. 1C′, 1C″). Iridescent iridophores, which are derivatives of the neural crest, 51 are reflective pigment cells present on the eye and, in line TDL358, they express GFP around the eye and operculum at 30 dpf (Fig. 1C, 1C′). Iridophores are also present in the zebrafish integumentary stripes (Fig. 1C″), which strongly express GFP in this line (Fig. 1C″). Although this is the only line marking larval pigment cells, two other driver lines (i.e., TDL 290, and TDL318) also labeled Schwann cells of the lateral line system, as well as other tissues of the central nervous system (Table 1).
Expression in CNS tissues was observed in the largest class of lines we identified (Table 1), and some examples of these lines may be seen in Figure 1D–1G (i.e., TDL354, TDL, TDL235, TDL234, and TDL318, respectively). By immunohistochemistry and confocal microscopy, we further characterized two of the driver lines that showed specific expression in cells of the CNS (Fig. 1D, 1E). At 28 hpf, line TDL354 shows expression in primary neurons of the forebrain belonging to the dorso-rostral cluster (DRC) and in a subset of neurons in the nucleus of the medial longitudinal fasciculus (Fig. 1D). A lateral view of the ventral midbrain of a 28 hpf TDL235 embryo co-stained with the neuronal marker zn-12 shows that neurons belonging to the medial longitudinal fasciculus are GFP positive (Fig. 1E). At 3 dpf, TDL234 shows strong midbrain expression (Fig. 1F), and 28 hpf embryos from line TDL318 express GFP in the dorsal midbrain in the region of the forming tectum (Fig. 1G). This line additionally shows expression in dorso-rostral cluster cells, Schwann cells of the lateral line, and a population of cranial neural crest cells migrating anteriorly before 24 hpf (data not shown).
A novel aspect of our screen was that it was not limited to larval expression patterns, but because it was done in the albino background, we were able to identify a GFP signal at much later time points (Table 1). All F1 progeny were screened at 30 dpf to find driver lines with continued expression beyond larval stages or displayed new tissue-specific expression in adults. Four additional lines were identified with specific, albeit sometimes patchy expression (Table 1; Figure 1H–1J). One line (i.e., TDL244-3) had expression in the veination of the fin (Fig. 1H). This line had shown expression in the vasculature during larval stages, and this persisted in the fins of the adult. Line TDL3 showed expression in the forming joint/intraray space of the lepidotrichia of the fins (data not shown). However, there was apparent silencing of this line over several generations and so it was not kept. Line TDL22 showed expression in mosaic patches around adult scales (Fig. 1I). Finally, line TDL74, in addition to showing some early neural crest expression, labeled scleroblasts of the fins at 30 dpf (Fig. 1J).
Mapping of insertion sites identifies 23 enhancer loci
To identify the genomic integration site of selected driver lines, we re-confirmed in F2 fish the expression pattern that had been originally identified in F1 larvae. We outcrossed F1 carriers of the driver lines to albino fish and collected at least 5 individuals from each driver line for mapping by Thermal Asymmetric Interlaced PCR (TAIL-PCR) 52 (see Methods). We were not able to obtain a PCR product for 12 of the lines, but we could successfully map the genomic insertion site in 17 (Table 2). Interestingly, eight of the lines mapped to intronic sites, five to intergenic sites, and three to transcribed regions (i.e., two exonic and one in the 5′UTR) (Table 2). In one case (i.e., TDL318), two insertions appeared to segregate with the transgene expression, and it was not clear if the intronic or intergenic insertion were responsible for the expression pattern observed (Table 2). In one other case (i.e., TDL283), two insertions were identified but both were determined to be intronic (Table 2). Only in two lines (i.e., TDL201 and TDL13) were mutant phenotypes observed and this was from an apparent intronic insertion in the gnb1l locus in the case of TDL201, and an intergenic insertion in the promoter region of kans12 for TDL13 (Table 2).
TDL275 can transactivate effector constructs
In order to test the ability of the TDL lines to transactivate genes under control of UAS sequences in different genomic loci, we crossed the transgenic line TDL275, which expresses GFP in progenitors and differentiated cells of the skeletal muscle lineage (Table 1), to a UAS line driving mitochondrial-localized DsRed in response to Gal4. In larvae containing both insertions, GFP and DsRed colocalize in the same cells (Fig. 2). This demonstrates that the lines we generated are able to drive UAS effector constructs in trans and could be used to activate other reporter or effector genes under control of UAS sequences.

Tuebingen driver line TDL275 transactivates the effector line UAS::MLS:DsRed.
Discussion
Enhancer detection through large scale transgenesis screening provides a means to address two fundamental objectives of developmental biology: understanding the cis-regulatory control of metazoan development, and the generation of tools for the tissue-specific manipulation of ontogeny in vivo. By directly demonstrating which genomic regions are competent to drive transgene expression, random genomic integration of a reporter construct complements other primarily computational methods of predicting cis-regulatory modules. It may be difficult if not impossible in some cases to identify the actual cis-regulatory elements responsible for a particular expression pattern, since cis-regulation can occur at very large distances. However, the knowledge that a particular genomic region is able to drive specific expression is achieved without any a priori assumptions of sequence content or conservation.
Taking advantage of the efficiency of these transgenesis methods and the power of the Gal4 transactivation system, several labs have recently conducted screens to identify tissue specific driver lines in zebrafish.11,33–36,53 But because accurate transactivation requires tight temporal-spatial control, it has been important to optimize the minimal promoter that is used in these transgenic constructs. The choice of the minimal promoter may dramatically affect the amount of nonspecific background that is observed, the degree of variegation of the lines, and the tissue bias of the entire screen. It has been observed in one study, for instance, that a c-fos minimal promoter showed the lowest amount of background expression (as compared to two different length heat shock promoters), whereas the E1b promoter had a strong bias towards cranial ganglia expression. 33 Likewise, in another study, greater specificity was achieved with a thymidine kinase and GATA2 promoter, than with a minimal heat shock promoter. 36
While the objective of identifying cis-regulatory regions of the genome may be accomplished with a simple reporter construct, our intention here was to generate transactivating lines that may be used to drive transgene expression in larval or adult zebrafish tissues. By designing an efficient minimal promoter from the edar locus of medaka and screening the F1 progeny of 380 TDL-injected zebrafish, we identified 63 new transgenic lines displaying GFP expression in multiple tissue types (Table 1). The over-representation of some tissue types (e.g., hatching gland and notochord) in our screen did not preclude the identification of a diverse set of driver lines (Table 1). In fact, the distribution of expression patterns across various organ systems and cell types would suggest that targeted screens using this minimal promoter might be effective for many different tissue of interest to the zebrafish community.
It is worth noting that, in spite of not having screened for recessive phenotypes, we identified two apparent insertional mutants in our screen (Table 2). The apparent inefficiency of recovering mutant phenotypes (i.e., 2 out of 63 confirmed insertions) is likely the result of the screening strategy we employed. That is, having screened for consistent GFP expression in the F1 generation, we only raised and in-crossed F1 fish that had reproducible expression patterns. In all likelihood, most genomic integrations of our construct did not result in observable transgene expression and so would have been discarded. Presumably, some of those insertions would have occurred in open reading frames. However, because of our screen design, it is not possible to evaluate the efficiency of insertional mutagenesis from our construct at this time. Likewise, it will be interesting to further investigate the mechanism whereby this particular promoter minimizes background noise and allows for the identification of a broad range of tissue-specific expression patterns in the zebrafish embryo and adult.
Variegation of transgenic lines may occur in many forms, including generational silencing, spurious ectopic expression, and cell-type specific mosaicism. To varying degrees, these phenomena were all observed in some of our lines (Table 2). However, these are commonly observed properties of transgenic lines using the Gal4 system 53 and thus may be general features of the Gal4 and/or Tol2 systems. Thus, the variegation we noted is not likely to be an inherent property of the minimal promoter we used in this study. The fact that variegation can cause problems for some downstream applications has encouraged several groups to improve both the transgenesis constructs used in zebrafish and the Gal4 system itself.34,36,53 As transgenesis and enhancer detection systems are continually improved, it might be advantageous for future screens to choose the minimal promoter, transposon, and Gal4-variant combinations that are best suited for the intended application.
We demonstrate here that the edar promoter from medaka can supplement the growing set of commonly used minimal promoters in the zebrafish community, due to its efficiency, low bias, and lack of background expression. This promoter construct and the new Gal4-driver lines that we describe will be a useful addition to the tool-box of other groups either wishing to conduct their own enhancer detection screens, or requiring driver lines in some of the tissues where we have found expression. All lines that were generated here and are included in Table 2 have been stored permanently as frozen sperm and are available upon request.
Footnotes
Acknowledgments
We would like to thank Katrin Henke, Christian Soellner, Alessandro Mongera, and Vanya Krneta-Stankic for some of the images of the driver lines presented here. This work would also not have been possible without the technical assistance of Christian Weiler and Horst Geiger, as well as the support of Christiane Nuesslein-Volhard.
Disclosure Statement
No competing financial interests exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
