Culture purification and DNA extraction procedures suitable for next-generation sequencing of euglenids
In the present study, five different DNA extraction procedures were examined to determine their effectiveness for extracting DNA suitable for NGS applications. This included two silica-membrane spin column kits, phenol:chloroform, and two CTAB-based methods. Spectrophotometric and fluorimetric measurements as well as standard gel electrophoresis were used as criteria for evaluating the quantity and quality of the isolated DNA prior to the sequencing. Herein, the method of establishing and maintaining axenic Euglena cultures is also presented. The modified CTAB-based method proved to be highly efficient. In terms of DNA quantity and purity (according to the absorbance ratios), the chosen method resulted in DNA of high molecular weight and quality, which fulfills the library construction requirements. Genomic DNA of Euglena hiemalis (CCAP 1224/35) and E. longa (CCAP 1204-17a) isolated using the suggested protocol had been successfully sequenced on the Illumina HiSeq platform. A modified, rapid CTAB-based method of total DNA isolation from Euglena has been described. In terms of the DNA quantity and quality, the protocol devised involving the washing step with DMSO:acetonitrile proved superior to the commonly used, commercially manufactured kits and isolation with phenol:chloroform. The method is also less labor-intensive and time-consuming than the traditional CTAB-based protocol.
KeywordsCTAB Culture purification DNA isolation Euglena High throughput sequencing NGS library
Euglenids (Euglenida) are unicellular, free-living phytoflagellates, widespread in various aquatic environments (Zakryś et al. 2017). Their plastids, enclosed by three membranes, are a secondary acquisition from the Pyramimonas-related green alga. Thus, the euglenids are an interesting case for studying the evolution of organelles (Turmel et al. 2009; Hrdá et al. 2012; Zakryś et al. 2017). Despite these organisms (especially Euglena gracilis) being utilized as an object of intensive laboratory studies on photosynthetic capacity, phototaxis, and metabolic and gene expression pathways and having even been proposed as an attractive feedstock for biodiesel and biomass production, knowledge about the organization of their genetic material remains very limited (Milanowski et al. 2014; Yoshida et al. 2016; Ebenezer et al. 2017; Li et al. 2017; Tomiyama et al. 2017). With the advent of the next-generation sequencing (NGS) platforms, investigation of the genomes of many algae species has become more affordable than ever before (Saint-Marcoux et al. 2015; Tan et al. 2015; Gawryluk et al. 2016; Yurchenko et al. 2016). However, the efforts aimed at sequencing Euglena nuclear genomes have remained challenging and, to date, an incomplete task (Milanowski et al. 2016; Ebenezer et al. 2017). It is known that the nuclear genomes of euglenids are large, highly repetitive and complex, therefore difficult to study. Only the draft genome assembly of E. gracilis has been published so far (Ebenezer et al. 2017). The preliminary data indicate that the genome of this organism ranges around from 1.4 up to 2 Gbp (giga base pairs) (Ebenezer et al. 2017). Therefore, obtaining sufficient data to provide adequate coverage enabling genome assembly requires extraction of particularly high-quality DNA sample. The DNA material suitable for NGS analysis should be characterized by high molecular weight with an A260/280 ratio between 1.8 and 2.0 (Healey et al. 2014; Lucena-Aguilar et al. 2016). The sample should be free from co-precipitating contaminant substances, such as proteins, polysaccharides, waxes, and photosynthetic pigments, which tether or obstruct NGS DNA library preparation (Healey et al. 2014).
The difficulty of DNA isolation from eukaryotic microalgae is a well-known and often reported issue (Eland et al. 2012; Tear et al. 2013; Maneeruttanarungroj and Incharoensakdi 2016). Many lineages have developed, apart from the cell wall, unique surface structures, such as frustules, loricae, or mucilage sheaths, frequently enriched by some highly resistant compounds (Barsanti et al. 2001; Popper et al. 2014). The cells of euglenids are covered with a complex structure called the pellicle. It is organized in a series of overlapping, proteinaceous strips enveloped by the plasma membrane and supported on the microtubule corset (Leander et al. 2007; Zakryś et al. 2017). The pellicle strips run along the entire length of the cell in an imbricated manner, which allows them to slide against each other. This arrangement enables dynamic changes in cell shape, called metaboly or euglenoid movement (Zakryś et al. 2017). Furthermore, euglenids may also secrete protective, mucilaginous material. It accumulates under culture conditions as clumps of mucus composed of mostly glycosylated polypeptides and insoluble gelatinous polysaccharides (Cogburn and Schiff 1984). The abovementioned features render euglenids recalcitrant to cell disruption.
Most of the DNA extraction methods currently applied for DNA isolation from euglenids were developed and optimized for different organisms—i.e. plants, yeast, and mammals. These protocols tend to obtain relatively small amounts of DNA, which is often highly diluted, contaminated, and prone to tearing. The use of physical cell disruption methods, including liquid homogenization, sonication, or grinding in liquid nitrogen, may result in obtaining highly fragmented DNA. Although such DNA samples may still be used as a template for PCR, followed by standard sequencing, the method is inapplicable for whole genome study purposes. Furthermore, cell strains acquired from the collections of cultures or isolated from environmental samples are usually contaminated with bacteria and fungi. Mechanical separation of the cells, such as micromanipulation and/or equilibrium centrifugation, followed by subsequent passages in liquid media is often not enough to establish axenic Euglena cultures (Gilbert 1970; Jones et al. 1973). Moreover, supplementing the liquid growth media with antibiotics (especially those newly developed) is usually harmful for euglenids. These agents have a stronger effect on Euglena cells at lower concentrations than on bacteria or fungi. Many antibiotics, routinely used in purification of algal and protozoan cultures, such as streptomycin, kanamycin, and neomycin, permanently bleach photosynthetic euglenids (Droop 1967; Jones et al. 1973; Tucci et al. 2010). Their chloroplasts become aberrant and are gradually diluted out of dividing cells, which eventually leads to a hereditary loss of plastids. Such changed cells have a limited life span and exhibit other atypical traits, i.e., improper divisions (Ebringer 1964). Thus, they cannot be regarded as a reflection of the natural state and used for de novo next-generation sequencing. It is particularly important to strive for elimination of contaminants prior to the stage of sample preparation. Foreign DNA admixtures can lead to wrong or confusing results in the assembly of the desired genome, particularly when reference data is absent (Langdon 2014; Merchant et al. 2014; Strong et al. 2014; Gruber 2015).
Herein, we present the comparison of five commonly used DNA isolation protocols and culture purification method developed while working on the de novo genome sequencing of the two Euglena species: photoautotrophic Euglena hiemalis (CCAP 1224/35) and secondarily heterotrophic E. longa (CCAP 1204-17a). The lack of a rapid and efficient method for pure, high-quality genomic DNA extraction from Euglena species has led us towards attempting to optimize a method for the isolation of highly concentrated DNA, suitable for next-generation sequencing purposes.
Cell strains and growth conditions
Cultures of Euglena gracilis Z (SAG 1224-5/25), Euglena hiemalis (CCAP 1224/35), and Euglena longa (CCAP 1204-17a) were cultivated statically in the Cramer-Myers medium (Cramer and Myers 1952), supplemented with ethanol (0.8% v/v) and aqueous soil extract (1% v/v). Cells were grown at 18 °C under white light exposure (16:8-h light/dark cycle, ca. 27 μmol photons m−2 s−1).
Disc diffusion antibiotic sensitivity testing
In order to determine the antibiotic susceptibility of contaminant organisms, the agar diffusion test was performed (Bauer et al. 1966). The initial, non-axenic cultures of euglenids were plated on Tryptone Soya Yeast Extract Agar (TSYEA; BTL) supplemented with amphotericin B (1% v/w; Sigma). Then, the antibiotic-impregnated paper discs (Oxoid) were placed on the plates and left to incubate. Our previous experience has shown that antibiotics affecting DNA or protein synthesis, particularly those recently developed, are more lethal for euglenids at lower concentrations than for their bacterial and/or fungal contaminants. Therefore, they were not taken into account in this study. Various agents inhibiting bacterial cell wall synthesis, such as ampicillin (25 μg), cefotaxime (30 μg), fosfomycin (50 μg), gentamicin (30 μg), penicillin (25 μg), rifampicin (30 μg), trimethoprim (2.5 μg), and vancomycin (30 μg), were used in the antibiotic screening, as the least harmful for the cells of euglenids. Cefotaxime and vancomycin—the compounds generating the largest zones of inhibition—were chosen for the final purification procedure.
The initial cultures of euglenids were mechanically pre-purified by centrifugation (2500×g, 30 s, RT) and washing with distilled water (each time the supernatant was discarded). The procedure was repeated as long as the amount of bacteria observed under the microscope was visibly decreasing. Such prepared cultures were diluted and streaked on solid Cramer-Myers medium supplemented with mineral medium (5% v/v) (Starr 1964), aqueous soil extract (5% v/v), and amphotericin B (1% v/w; Sigma), and agarised with TSYEA (1% v/w; BTL). In order to obtain zones with decreasing concentrations of antibiotics, only two discs, one for each of the selected compounds (cefotaxime and vancomycin, respectively), were placed on the opposite sides of the plate (supplementary Figure S1, supplementary material online). Grown Euglena colonies (visible under the microscope as bacteria-free and alive) were subsequently restreaked on the same medium for further purification. To increase survivability of the Euglena cells, the antibiotic discs were placed on the plates every second passage. The procedure was repeated until the axenic algal cultures were obtained. Afterwards, they were transferred to the liquid medium and constantly monitored for bacterial and fungal presence.
Genomic DNA extraction protocols
Five DNA extraction methods were evaluated in this study. The DNA was isolated from all three species (E. gracilis, E. longa, E. hiemalis) in pentaplicates with each extraction protocol. The initial experimental steps remained the same in all cases. A total volume of 10 mL of liquid cultures in the logarithmic growth phase was centrifuged (5000×g, 5 min, RT) and rinsed with nuclease-free water three times to completely remove the residues of the growth medium. Washed Euglena cells were then resuspended, aliquoted (1 mL), and centrifuged. Then, each of the cell pellets (± 50 mg) was processed in accordance with the chosen method’s requirements. Finally, the DNA was eluted or resuspended in 100 μL of nuclease-free water (GE Healthcare).
Extraction with commercial silica-membrane column kits
Two commercially manufactured kits, designed for quick purification of genomic DNA—DNeasy Blood & Tissue (Qiagen) and DNeasy Plant (Qiagen)—were tested. In the case of the DNeasy Blood & Tissue kit, the spin column protocol designed for purification of total DNA from animal blood/cultured cells was applied, whereas in the case of DNeasy Plant kit, the TissueRuptor protocol with liquid nitrogen was used. All steps were performed strictly as described in the instructions provided by the manufacturer. In both cases, on-column RNAse A (100 mg mL−1, 4 μL, 2 min, RT; Qiagen) digestion was carried out.
Extraction with phenol:chloroform
Genomic DNA was isolated with standard ethanol precipitation following phenol:chloroform:isoamyl alcohol (24:25:1 v/v; AppliChem) treatment, according to the previously described, albeit slightly modified protocol (Psifidi et al. 2010; Green and Sambrook 2017). Specifically, 1 mL of lysis buffer containing 10 mM Tris HCl (pH = 7.5), 1 mM EDTA, 50 mM NaCl, 0.2% SDS (v/w), and 1 mg of proteinase K (Qiagen) was used to digest Euglena cells for 1.5 h at 56 °C. The lysate extraction was performed twice with 1 mL of phenol:chloroform:isoamyl alcohol (24:25:1 v/v; AppliChem). Following the first extraction step, the aqueous phase was treated with RNAse A (100 mg mL−1, 4 μL; Qiagen) and incubated at 37 °C for 30 min with periodic, gentle mixing. Afterwards, the extraction was repeated. Then, 2.5 volume of absolute ethanol (AppliChem) and 0.1 volume of 3 M sodium acetate (pH = 5.2) were added and the DNA was precipitated at − 20 °C for 1.5 h. The sample was spun at 12,000×g, 10 min, RT, and the DNA pellet was washed twice with 70% ethanol. Finally, the pellet was air-dried and resuspended in nuclease-free water.
Extraction with traditional CTAB method
The cetyltrimethylammonium bromide (CTAB; AppliChem) DNA isolation was performed strictly as described elsewhere (Allen et al. 2006). The volumes of utilized reagents were downscaled according to the amount of the initial Euglena biomass.
Extraction with modified CTAB method
The CTAB-based, rapid DNA extraction protocol (Healey et al. 2014) was slightly modified. Prior to the proper isolation, the cell pellets were washed with ice-cold DMSO:acetonitrile (1:1 v/v; AppliChem). This step was introduced to perforate the pellicle (and to facilitate the penetration of the lysis buffer into the cells), as well as to remove the photosynthetic pigments, polysaccharides, and wax esters which hinder nucleic acids isolation and purification (Rosenberg 1967; Mederic et al. 1987; Barsanti et al. 2001; del Campo et al. 2010; Healey et al. 2014). Also, the RNAse A (100 mg mL−1, 4 μL; Qiagen) treatment was extended to 30 min in comparison to the guidelines provided by Healey et al. (2014).
DNA integrity assessment
The integrity of the DNA samples obtained using each tested extraction method was examined through standard gel electrophoresis (Psifidi et al. 2015). In detail, 5 μL of each DNA extract was analyzed in a 1.5% agarose gel stained with 0.5% Midori Green (Nippon), run in 1× TAE buffer. DNA bands were visualized using the ChemiDoc UV transilluminator (Bio-Rad).
DNA purity and yield
For each of the applied extraction procedures, the concentration and purity of the recovered DNA were assessed spectrophotometrically with NanoPhotometer NP80 (Implen). An Abs260/280 ratio was used to evaluate protein contamination while an Abs260/230 ratio was used to determine organic solvents contamination. Afterwards, for each species/isolation method, one sample (with the best parameters) was selected based on the absorbance values and subjected to fluorimetric measurements. Concentration of the DNA in those samples was further examined using the High Sensitivity DNA Assay implemented by Qubit 3.0 fluorometer (Thermo Scientific). Each time, 1.5 μL (absorbance) or 1 μL (fluorescence) of DNA sample (or nuclease-free water as a blank solution) was used during the sample assessment. Each measurement was performed twice, and obtained values were averaged. Total DNA yield was calculated based on DNA concentration derived from the NanoPhotometer and Qubit results calculated together with the total volume of the DNA extract.
Application of the isolated DNA in high throughput sequencing
Based on the average values of the above parameters, out of five extraction methods tested, the most effective and robust one was selected. In order to evaluate its application in NGS library construction and sequencing, single isolates of E. hiemalis and E. longa, which exhibited optimal parameters of concentration and quality, were chosen for further manipulations. The DNA prepared for sequencing was stored in − 20 °C no longer than a few days, avoiding its exposure to temperature amplitudes.
Preparation of a pair-end reads (PE150) library was carried out externally using NEBNext DNA Library Prep Master Mix Set for Illumina (NEB) and sequenced commercially on an HiSeq4000 instrument (Genomed, Warsaw, Poland). The quality of the DNA library was assessed using a 2100 Bioanalyzer (Agilent). Additionally, the quality of raw sequencing reads after trimming (removal of library adaptors) was analyzed using the FastQC software (Andrews 2010).
Results and discussion
DNA concentration and quality
In terms of the DNA concentration, as evidenced by spectrophotometry (Fig. 1b; supplementary Table S2, supplementary material online), the isolation with DNeasy Plant kit turned out to be the least efficient (with the mean DNA concentration of 15.78 ± 0.89 ng μL−1 in all samples), while isolation with CTAB-based methods proved to be the most efficient. Traditional protocol yielded the mean DNA concentration of 301.56 ± 156.27 ng μL−1 in all samples and the modified rapid protocol resulted in a mean DNA concentration of 498.61 ± 184.63 ng μL−1 in all samples, respectively.
The spectrophotometric method does not allow to determine whether the material is degraded or not and whether the sample is contaminated (Psifidi et al. 2015). Meanwhile, the fluorimeter measures the concentration of an intact (undenatured and not fragmented), double-stranded DNA. Discordance between these two methods (Fig. 2) was to be expected, since similar findings have been reported (O’Neill et al. 2011; Nakayama et al. 2016; Hussing et al. 2018). Results obtained herein further support discrepancy between spectrophotometry and fluorimetry. However, contradictory evidence has been provided as well (Haque et al. 2003; Foley et al. 2011). Notwithstanding this, it is assumed that the optimal workflow for quality assessment is to control the presence of potential contaminants with a spectrophotometer (NanoPhotometer) and subsequently to quantify double-stranded DNA with a fluorometer (Qubit) (Simbolo et al. 2013). Applying both simultaneously certainly provides more accurate information about the examined sample than any method separately. Therefore, we recommend this approach, especially in the case of de novo genome sequencing.
According to the above results, two samples obtained with the modified CTAB-based protocol, one of E. hiemalis and one of E. longa, respectively, were chosen and further subjected for library preparation followed by high-throughput sequencing. The reason behind the decision to exclusively use the DNA isolated with the modified CTAB-based method for high-throughput sequencing, without including isolates deriving from other methods, was mainly based on the satisfactory parameters of the DNA integrity and purity, but also the substantially higher yield, compared to, e.g., DNeasy Blood & Tissue method. As mentioned previously, the nuclear genomes of euglenids are unexpectedly large and complex, for single-celled organisms. Therefore, it was necessary to obtain a highly concentrated DNA template to ensure sufficient coverage of reads for assembled genome sequences.
High-throughput sequencing quality
The genomic DNA samples were transferred to the external company (Genomed, Warsaw, Poland) for library preparation and sequencing using the Illumina HiSeq 4000 platform. Samples successfully passed standard quality control measures, which require high molecular weight of genomic DNA, purity of polysaccharide as well as RNA and protein contamination, and an A260/280 ratio ranging between 1.8 and 2.2.
For enzymatic reactions, such as PCR, the demands regarding DNA material condition are less strict those for NGS standard libraries preparations. High-throughput sequencing requires DNA not only in large amounts but also in very good quality, meaning lack of organic/inorganic contamination. Column methods can be ineffective in such applications, since usually each step involves washing or filtering. This supplementary caution is certainly welcome; however, it may decrease the final DNA extraction yield (Healey et al. 2014). In the case of E. hiemalis and E. longa, it proves to be a serious problem, and without a more effective protocol of DNA isolation, it would eventually be impossible to obtain a fine sequencing library.
It has been previously discussed elsewhere that the DNA of both high concentration and quality is more stable and degrades more slowly, even when stored for a longer period of time (Psifidi et al. 2015). Eventually, this supports extra future NGS runs (when more data is needed), since it is better to carry out sequencing using the isolates from the same extraction course to eliminate potential bias and save time for additional laborious laboratory exercise.
The method of isolation is crucial for particular experimental design and the modified CTAB-based protocol presented proved to be efficient for whole genome NGS library construction and then sequencing. Isolation has been effective in both phototrophic and non-phototrophic species; therefore, we believe it can be universally applied. This is particularly important due to certain differences in the cell wall composition across the euglenids—especially the spatial structure of the pellicle and the amount or distribution of mucus, as well as the presence or absence of photosynthetic dyes and secondary metabolites.
In this study, a comparison between five various procedures of DNA extraction for whole genome sequencing of euglenids was conducted. So far, no such analyses for Euglena species have been carried out. In the era of growing interest in the genomics of algae, the results presented herein are of high practical significance. An efficient, fast, and reliable procedure grounded on a modified CTAB-based isolation protocol was proposed as the most suitable method of extracting euglenids NGS-suitable genomic DNA.
Furthermore, an efficient Euglena culture purification method was proposed herein. The use of discs impregnated with cefotaxime and vancomycin, respectively, together with amphotericin B, allowed the axenic strains to be established. The described experimental approach was aimed at maintaining the euglenids cells in the best possible condition with simultaneous elimination of bacterial and fungal contaminants. The culture purification step performed prior to the sequencing was crucial for the de novo genome sequencing and assembly. Subsequent passages did not show the presence of any other organisms in the cultures purified with the developed method. Therefore, the methodology described is effective and can be successfully applied for other euglenids.
We would like to thank Dr. Anna Karnkowska for supporting NGS data analysis. The study was carried out at the Biological and Chemical Research Centre, University of Warsaw, established within the project co-financed by European Union from the European Regional Development Fund under the Operational Programme Innovative Economy, 2007–2013.
NG was responsible for the concept and design of experiments. The experiments were performed by NG, HW, PH (DNA isolation), and BZ (establishing of axenic cultures). NG and MP analyzed the data and wrote the paper. RM supervised all experiments, analyses, and writing of the manuscript. All authors have read and approved the final manuscript.
This work was funded by grant 2015/19/B/NZ8/00166 from the National Science Centre, Poland.
- Andrews S (2010) FastQC A Quality Control tool for High Throughput Sequence Data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/. Accessed 26 January 2018
- Cramer M, Myers J (1952) Growth and photosynthetic characteristics of Euglena gracilis. Arch Microbiol 17:384–402Google Scholar
- Ebenezer TE, Carrington M, Lebert M, Kelly S, Field MC (2017) Euglena gracilis genome and transcriptome: organelles, nuclear genome assembly strategies and initial features. In: Schwartzbach S, Shigeoka S (eds) Euglena: biochemistry, cell and molecular biology. Springer, Cham, pp 125–140CrossRefGoogle Scholar
- del Campo EM, del Hoyo A, Casano LM, Martínez-Alberola F, Barreno E (2010) A rapid and cost-efficient DMSO-based method for isolating DNA from cultured lichen photobionts. Taxon 59:588–591Google Scholar
- O’Neill M, McPartlin J, Arthure K, Riedel S, McMillan ND (2011) Comparison of the TLDA with the Nanodrop and the reference Qubit System. J Phys 307:1–6Google Scholar
- Turmel M, Gagnon MC, O’Kelly CJ, Otis C, Lemieux C (2009) The chloroplast genomes of the green algae Pyramimonas, Monomastix, and Pycnococcus shed new light on the evolutionary history of prasinophytes and the origin of the secondary chloroplasts of euglenids. Mol Biol Evol 26:631–648CrossRefPubMedGoogle Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.