Abstract
In recent years, evolutionary biologists have developed an increasing interest in the use of barcoding strategies to study eco-evolutionary dynamics of lineages within evolving populations and communities. Although barcoded populations can deliver unprecedented insight into evolutionary change, barcoding microbes presents specific technical challenges. Here, strategies are described for barcoding populations of the model bacterium Pseudomonas fluorescens SBW25, including the design and cloning of barcoded regions, preparation of libraries for amplicon sequencing, and quantification of resulting barcoded lineages. In so doing, we hope to aid the design and implementation of barcoding methodologies in a broad range of model and non-model organisms.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Barcoding individual microbial cells with chromosomally integrated short, random sequences is a powerful approach for obtaining a high-resolution understanding of the eco-evolutionary dynamics that shape microbial populations (Blundell and Levy 2014; Levy et al. 2015; Cvijović et al. 2018; Ba et al. 2019; Kinsler et al. 2022; Johnson et al. 2023). In brief, the technique involves introducing a large number of random sequences into a single chromosomal region of a population of cells such that all lineages are uniquely identifiable. A single PCR reaction allows amplification of the barcoded region, while DNA sequencing of the product reveals the frequency of each unique code. Changes in barcode frequencies allow inference of mutational events and fitness effects. For example, a cell that acquires a beneficial mutation will cause the frequency of the linked barcode to increase within the population. The rate of increase can be used to quantitate contributions to fitness.
When changes are measured over the course of hundreds of generations, detailed understanding of eco-evolutionary dynamics can emerge (Levy et al. 2015; Jasinska et al. 2020; Acar et al. 2020; Boyer et al. 2021; Aggeli et al. 2021), including insight into the connection between genotype and phenotype of even complex traits (Venkataram et al. 2016; Kinsler et al. 2020; Jagdish and Nguyen Ba 2022; Nguyen Ba et al. 2022; Aggeli et al. 2022; Ascensao et al. 2022). Recent developments allowing the regeneration of barcode diversity extend the opportunity to observe evolutionary change over the course of thousands of generations (Ba et al. 2019).
While application of barcoding initially focussed on adaptive evolution in single populations of model microbes, particularly yeast (Saccharomyces cerevisiae) (Blundell and Levy 2014; Cvijović et al. 2018) and Escherichia coli (Jasinska et al. 2020), strategies have since been devised for studying the evolution of cancer using barcoded lung cancer cells in mice (Rogers et al. 2018) and quantification of eco-evolutionary dynamics in microbial communities (Venkataram et al. 2022). The potential for extending strategies and applications is considerable, ranging from tracing object provenance (Nguyen Ba et al. 2022), enhancing experimental power through mutant-multiplexing (Jackson et al. 2020), extending understanding of microbiome transmission (Vasquez et al. 2021), to the study of demographic changes during colonisation of new environments (Brettner et al. 2022). However, one impediment to progress is the effort required to develop the initial barcoded library, especially in non-model systems.
Here, we outline a step-by-step guide for barcoding microbial cells, with a particular focus on the bacterium Pseudomonas fluorescens SBW25 (hereafter SBW25). In addition to detailed protocols covering library preparation, amplicon-sequencing methods and strategies for assessment of data quality, critical steps are emphasised and guidance of a general nature given in order to facilitate the development of barcoded libraries in non-model microbes.
Library Creation and Validation
Design and Construction of Barcoded Integrative Plasmids
The primary aim is to introduce barcoded sequences into the chromosome of an otherwise isogenic population in order to enable lineage tracking in later experiments. Note that barcoding for lineage tracking is distinct from ‘Tn-seq’ (van Opijnen et al. 2009), which introduces transposon insertion mutations along with a barcode sequence. The standard strategy behind barcoding for lineage tracking involves the creation of a vector with a region of extreme sequence variation, which is eventually introduced into the target bacterium. The backbone vector should have minimal fitness consequences and should integrate just once at a single genomic site. In addition, the ‘barcode’ region should be short enough to be sequenced with short-read next-generation sequencing platforms. The downstream library of barcoded mutants should also be highly diverse (i.e. there should be many barcodes, each at a similar frequency).
For our libraries, derivatives of mini-Tn7 vectors (Choi and Schweizer 2006) were employed. This transposon-based system ensures chromosomal integration of random (barcode) sequences in a single, known chromosomal location (Waddell and Craig 1989). The strategy is similar to that used to barcode E. coli (Jasinska et al. 2020). The advantage of the Tn7 system is the highly conserved position of the attTn7 site (downstream of the glmS gene) into which Tn7 elements integrate (without causing the inactivation of an open-reading frame). In addition, the attTn7 region is conserved across many bacterial phyla, including Proteobacteria, Fermicutes, and Bacteroidetes (Parks and Peters 2007; Mitra et al. 2010; Wiles et al. 2018), and integration of Tn7-vector systems has been demonstrated across many model organisms (Schlechter et al. 2018; Wiles et al. 2018), including SBW25 (Liu et al. 2014; Lind et al. 2015; Farr et al. 2017). In bacteria lacking an attTn7 site, such a site can be engineered into the genome, provided that the bacterium of interest is capable of transformation and undergoing allelic replacement (Figueroa-Cuilan et al. 2016). If introduction of attTn7 is not an option, other methods are available for introducing barcodes, but they are less ideal for final applications. These options include (1) the transformation of cells with barcoded plasmids (Vasquez et al. 2021) and (2) the use of Tn5-based systems. The former requires selection for maintenance of the plasmid during subsequent evolution experiments, and this can be problematic from the perspective of cost, and variation in copy number complicates estimates of barcode lineage frequency. The Tn5-based option involves use of transposons that integrate effectively at random with each insertion standing to disrupt targeted genes. This is clearly undesirable.
In developing an integration vector for SBW25, all efforts were made to keep the vector as small as possible and the metabolic cost of the vector as minimal as possible to prevent selection for lineages with spontaneous deletions of the barcoded region in subsequent experiments. The final vector is ~ 3000 bp and contains a minimal set of expressed genes (Fig. 1, Panel A). Limiting the size of the vector has the additional benefit of increasing the rate of transformation (although it should be noted that a co-transformation with a larger helper vector is required for the integration of mini-Tn7-vectors). Our vector was constructed using Gibson assembly, although any assembly method can be used, including standard restriction enzyme digestion and ligation strategies.
Panel B summarises the molecular process required to barcode SBW25. This process was repeated 25 times until a mutant library of sufficient diversity was reached. Following the construction of the backbone vector, plasmids with 25N random sequences were used to PCR amplify the vector, the product was ligated into a circular vector and amplified by transformation into E. coli Top10 cells (transformants were selected on agar plates supplemented with tetracycline), and a pool of plasmids were extracted from the resultant colonies and transformed into SBW25 with a pUX-BF13 helper vector (which expresses transposition machinery but does not replicate in SBW25). Transformants were grown on selective agar plates, and the resulting colonies were harvested and pooled into a single library.
Panel C illustrates the 25 bp of a random barcode and an example of 5 bp of the conserved areas left and right of the barcode coming from the TN7 plasmid. In addition, panel C shows the position downstream of glmS where the barcoded vector integrates into the SBW25 genome and final orientation of the vector.
The vector has three key elements: (1) an origin of replication (pBR322 origin) that allows propagation in an E. coli host (the pBR322 origin does not allow replication in non-E. coli hosts); (2) a selectable marker (in our case, a gene-encoding tetracycline resistance: tetA) (3) paired Tn7 integrative elements (L and R elements) between which the selectable marker and later the barcodes are situated. Elements 1 and 2 may require exchange with suitable alternative elements to enable the use of this vector in other model species. Transcriptional terminators were included on either side of the selectable marker to limit excessive expression of the antibiotic resistance gene, the downstream barcode (unintended transcription of which may cause deleterious peptides), or the neighbour genes at the integration site.
Integration of random sequences (the barcodes) requires several rounds of cloning before the final barcoded library is delivered to the desired target species. Transformation also needs to be performed multiple times to achieve high diversity of the barcoded library. During this process—detailed in supplementary files, visually summarised in (Fig. 1, Panel B), and recounted in the following paragraphs—care should be taken to ensure the frequency of each barcoded genotype is as even as possible. Unevenness implies one or several barcodes are at a high frequency, which increases the likelihood that spontaneous mutations will occur in the lineage of those genotypes, reducing the resolution at which beneficial mutants can be quantified. To this end, repeated restriction digestion was used to limit ancestral vector sequences and the number of transformants was carefully monitored to prevent an overrepresentation of barcodes from any single transformation.
To integrate random sequences into the previously described vector, the backbone was amplified with primers (see Table S3) containing randomly generated 25 bp sequences (the “barcodes”) using a fast-cloning approach (Li et al. 2011). The PCR product was digested with DpnI, which specifically targets multiple methylated restriction sites found in the non-amplified backbone plasmid. The resulting products were transformed into electrocompetent TOP10 cells to ligate the linear PCR product into a circular plasmid and amplify ligated plasmids. Transformed colonies from one plate were counted to estimate the total number of transformants (with an average of ~ 23,000 CFU per transformation). The remaining colonies were harvested, resuspended in LB and plasmids extracted over several miniprep columns. The protocol was repeated 25 times to improve barcode diversity. An estimated 580,000 transformant colonies were harvested for plasmid extraction from all 25 transformations. The plasmids underwent an enzymatic digestion step to limit the occurrence of the ancestral non-barcoded plasmid to < 0.001% in the final barcoded population (note that without the final digestion, the ancestral barcode-free sequence was measured at about 1% of the final population of transformed SBW25).
Clonal populations of SBW25 were prepared for transformation with the barcoded vector using pre-established methods (Choi and Schweizer 2006). The barcoded plasmid and the pUX-BF13 helper plasmid (Bao et al. 1991) were transformed into SBW25 by electroporation. Cells from each transformation were plated on selective plates with a dilution series to estimate the total number of transformants. Importantly, the total number of harvested SBW25 transformants was always less (~ 25%) than the number of E coli transformants from which plasmids were extracted. Limiting the number of SBW25 transformants reduces the potential for transformation with the same barcoded vector, thus, increasing diversity of the final library. The estimated number of unique barcodes was ~ 100,000. To further increase library diversity, the library-making process was repeated 25 times drawing upon plasmids from independent E. coli transformations. Because plates with more colonies have smaller colonies, measures of OD were made of resuspended cells with adjustments to dilutions to ensure equal contributions of each transformation to the final library of cells. Each of the 25 transformations was mixed in a final volume of ~ 50 mL, which was then vortexed thoroughly and frozen for later experiments.
Following library construction, simple quality control checks are necessary. The most important check involves the determination of the fraction of clones that do not contain barcodes. Such unwanted cells—those containing no Tn7 and those with Tn7 but no barcode—can appear in the final library and arise as, for example, a consequence of imperfect antibiotic counter-selection or incomplete restriction digestion of the original plasmid. This fraction, if large, would mean that numerous mutational events are unassociated with a unique sequence and, thus, go undetected. This check was made by sequencing across the Tn7 insertion site in 96 independent clones. The resultant data showed that all 96 contained unique barcodes [see (Theodosiou et al. 2023)].
Library Preparation
Identification and quantification of barcoded lineages in populations require amplicon sequencing. Our approach involved a two-step strategy for amplification: (1) amplification via PCR of the barcoded region (using primers specific to the barcoded region) and (2) amplification with Illumina-specific primers (this product was then sequenced on Illumina platforms, see Supplementary material: Amplicon-library preparation protocol). Table S4 and S5 list primers used for the rounds of PCR. The first pair of primers features sequencing primer-binding sites, 8N randomers to detect amplification biases (these are optional), a smaller heterospacer (see below), and the barcode vector-specific sequence for annealing of the primers. The heterospacer consists of either none, one, or two extra nucleotides, with all three primers mixed to reduce the saturation of the flow cell with signals from the same sequenced nucleotide. The second primer pair consists of the P5 or P7 flow cell binding regions, eight nucleotide indices to allow identification of the reads, and finally, a region complementary to the first set of primers to allow amplification of the initial PCR product. Together, this results in a ~ 310 bp product, with the barcoded region able to be sequenced from either end using 150 bp paired-end reactions.
At the point of PCR, it is necessary to be aware of problems arising from the amplification of random barcode sequences—a problem that becomes more acute with each amplification cycle. It was observed during the design of the library-generation protocol that more than 18 total PCR cycles of the barcoded region resulted in multiple large aberrant amplicons (see Figure S1). This led to the design of a limited cycle approach—with the added benefit of reducing potential PCR jackpots—which involved extracting large quantities of DNA from the experimental samples (~ 1 μg). This means that many cells in the population are sampled and ultimately provide greater resolution on the frequency of each barcode. After testing several commercial kits, a chloroform-based DNA extraction method was employed because this maximised extraction of DNA from cell pellets (see Supplementary material: Salt DNA extraction protocol).
The DNA extracted from colonies was used as a template for the first round of PCR of 13 cycles using a high-fidelity DNA polymerase. This first reaction was performed in quadruplicate per biological sample. The four PCR products were then purified into a single concentrated sample, which was used as a template for the second round of PCR (of only 5 cycles). The final amplicon was then purified of remaining primers and primer dimers with a magnet-based fragment size selection kit. The purified product was then quality controlled with electrophoresis (see Figure S2) and amplicon sequenced using an Illumina MiSeq DNA Flex Library Prep Kit.
Read Processing and Library Assessment
Following construction of a barcode library, the size and diversity of the library need assessment. This section describes a pipeline for extracting barcodes from sequencing data and then suggests an experimental protocol for testing the reproducibility and quality of the barcoded library.
When acquiring sequence data, it is advisable to check the quality of the amplicon sequences by filtering low-quality reads to avoid erroneous results in downstream analysis. If paired-end data are acquired, it is suggested to merge the reads, retaining the highest quality reads. The user should check that barcode sequences have similar length (± 2 bp) to the anticipated length (following the barcode design see Library Creation section), and only barcodes close to the expected values should be used. Moreover, it is best practice to map amplicon sequences to a reference sequence, ensuring that the amplicon sequences are in the same direction (sequences might be reversed or be in the reverse complement state after merging). Finally, barcode sequences should be retrieved by extracting sequence data that map within the coordinates of the reference barcode sequence. Alternatively, barcodes can be extracted based on the conserved areas left and right of the barcode (Fig. 1, Panel C) as in the bartender pipeline (Zhao et al. 2018). Using this approach, all sequences must have the same direction as the reference.
To assess the quality and reproducibility of the barcoded SBW25 library, three independent overnight cultures were initiated from the frozen stock (inoculated with ~ 1% V/V). Genomic DNA was extracted from all three populations, resulting in three amplicon libraries for sequencing.
After paired-end sequencing with MiSeq, we obtained two fastq files for each sample, each containing approximately 1.5 million sequences. The length of each sequence was around 150 bp, which included the barcoded area (~ 25 bp) and a conserved area left and right of the barcode (~ 120 bp). To ensure read quality, we used fastQC version 0.11.8. We then merged the paired-end reads using PEAR version 0.9.8 (default options), which keeps the read with the highest overall PHRED score (Zhang et al. 2014). To ensure that all sequences were in the same direction and orientation, we mapped all sequences to a reference barcode sequence using Minimap2 version 2.24 (Li 2018) with default options, requiring a minimum mapping quality of 30. Finally, we extracted the sequence reads that mapped to the reference barcode using a custom pipeline (see Data availability section).
Sequence mapping revealed 179.374 unique barcoded sequences from replicate 1178.993 from replicate 2 and 159.870 from replicate 3 (see Table S6). More than 98% of all barcodes in all replicates had the expected length of 25 ± 2 bp (Fig. 2) and could be used for downstream analysis. A pairwise comparison for all replicates was made to find common barcodes (Fig. 3), and frequencies were compared using Pearson correlations (Fig. 4). Barcode frequencies between replicates were highly similar, leading to an R = 1 in all cases, indicating high reproducibility. Across replicates, no specific barcodes were overrepresented, and barcodes were present at low frequencies, making our barcoded library suitable for experimental evolution (Figure S3). Around 56% of the unique barcodes were found in all three replicates, although this can increase significantly if the barcodes are clustered based on their sequence similarity (the clustering of barcodes is not considered further in this article). Differences in the number of unique barcodes in each replicate can be attributed to two possible factors: (1) library preparation and the sequencing process are prone to error, leading to changes in the nucleotide sequence and, thus, to different unique barcode sequences; (2) Sampling effects: at each stage of culturing, sampling and amplicon sequencing, multiple bottlenecks exist that may alter final barcode diversity.
Next Stages
Having prepared and evaluated libraries, attention turns to experiments where barcoding can be applied. A standard experiment is to set up replicates of genetically homogeneous barcoded populations in different ecological conditions (e.g. with and without predators, antibiotics, salinity, or fluctuating environments), sample populations daily, amplify the barcoded region from the population, and sequence the amplicons. The resulting sequences allow changes in the frequencies of lineages to be tracked. Aided by mathematical models, it is possible to infer the number of beneficial mutations, their fitness effects, and how these affect the population dynamics of lineages. It is of interest to quantify competition among lineages and the contribution of ecology and evolution to lineage population dynamics.
Beyond the analysis of adaptive evolution in single populations, barcoded organisms can be used to explore a range of questions in ecology and evolution. For example, introducing a barcoded population to a community of microbial species allows analysis of the eco-evolutionary dynamics of the focal type embedded in a community [referred to as an ESENCE approach (Lenski 2017)]. To do so, a population of the model or nonmodel organism can be barcoded and then introduced into experimental cultures composed of complex microbial communities. The dynamics of that focal strain can then be observed over time. Using this procedure, the effects of manipulation of the community (such as specific composition, diversity, resource use, or adaptation to resources etc.) can be assessed.
A second line of inquiry concerns ecological factors shaping diversification in the face of dispersal and recolonisation events. Populations of barcoded cells can be subjected to different regimes of dispersal or disturbance over short time scales.
Once experimental data have been acquired, further challenges remain, such as clustering of sequencing data and inference of eco-evolutionary dynamics. The current methodology (Levy et al. 2015; Ba et al. 2019; Kinsler et al. 2020; Vasquez et al. 2021; Avecilla et al. 2022; Johnson et al. 2023; Li et al. 2023) for analysing barcoding data has been tailored to specific types of experiments that assume absence of ecological processes, such as frequency-dependent selection. There is a considerable need to build pipelines for analysis that encompasses more complex and realistic eco-evolutionary scenarios. The challenges presented require extensive development and description and will be the subject of a forthcoming paper.
Conclusions
In this article, we have provided a step-by-step guide for barcoding microbial cells: from library preparation to amplicon sequencing, including methods to assess sequence data quality. As an example, a protocol to barcode Pseudomonas fluorescens SBW25 has been provided, and procedures have been emphasised, which enable maximum diversity of the resulting library. Barcoding SBW25, or any other strain, constitutes a powerful and versatile system for tracking the eco-evolutionary dynamics of populations—in a multitude of contexts—through time. We hope that the protocols described, along with the accompanying discussion, serve to promote the construction of barcoded libraries in a diverse array of organisms and the development of new experimental possibilities.
Data Availability
All data is deposited in the following zenodo repository https://doi.org/10.5281/zenodo.7703993. Code and a tutorial for analysing the data are deposited in the following publicly available GitLab repository https://gitlab.gwdg.de/loukas.theodosiou/sbw25-barcoding.
References
Acar A, Nichol D, Fernandez-Mateos J et al (2020) Exploiting evolutionary steering to induce collateral drug sensitivity in cancer. Nat Commun 11:1923. https://doi.org/10.1038/s41467-020-15596-z
Aggeli D, Li Y, Sherlock G (2021) Changes in the distribution of fitness effects and adaptive mutational spectra following a single first step towards adaptation. Nat Commun 12:5193. https://doi.org/10.1038/s41467-021-25440-7
Aggeli D, Marad DA, Liu X et al (2022) Overdominant and partially dominant mutations drive clonal adaptation in diploid Saccharomyces cerevisiae. Genetics. https://doi.org/10.1093/genetics/iyac061
Ascensao JA, Wetmore KM, Good BH et al (2022) Quantifying the local adaptive landscape of a nascent bacterial community. Nat Commun. https://doi.org/10.1101/2022.02.03.475969
Avecilla G, Chuong JN, Li F et al (2022) Neural networks enable efficient and accurate simulation-based inference of evolutionary parameters from adaptation dynamics. PLoS Biol 20:e3001633. https://doi.org/10.1371/journal.pbio.3001633
Ba ANN, Cvijović I, Echenique JIR et al (2019) High-resolution lineage tracking reveals traveling wave of adaptation in laboratory yeast. Nature 575:494–499. https://doi.org/10.1038/s41586-019-1749-3
Bao Y, Lies DP, Fu H, Roberts GP (1991) An improved Tn7-based system for the single-copy insertion of cloned genes into chromosomes of gram-negative bacteria. Gene 109:167–168. https://doi.org/10.1016/0378-1119(91)90604-A
Blundell JR, Levy SF (2014) Beyond genome sequencing: lineage tracking with barcodes to study the dynamics of evolution, infection, and cancer. Genomics 104:417–430. https://doi.org/10.1016/j.ygeno.2014.09.005
Boyer S, Hérissant L, Sherlock G (2021) Adaptation is influenced by the complexity of environmental change during evolution in a dynamic environment. PLoS Genet 17:e1009314. https://doi.org/10.1371/journal.pgen.1009314
Brettner L, Ho WC, Schmidlin K et al (2022) Challenges and potential solutions for studying the genetic and phenotypic architecture of adaptation in microbes. Curr Opin Genet Dev 75:101951. https://doi.org/10.1016/j.gde.2022.101951
Choi K-H, Schweizer HP (2006) mini-Tn7 insertion in bacteria with single attTn7 sites: example Pseudomonas aeruginosa. Nat Protoc 1:153–161. https://doi.org/10.1038/nprot.2006.24
Cvijović I, Nguyen Ba AN, Desai MM (2018) Experimental studies of evolutionary dynamics in microbes. Trends Genet 34:693–703. https://doi.org/10.1016/j.tig.2018.06.004
Farr AD, Remigi P, Rainey PB (2017) Adaptive evolution by spontaneous domain fusion and protein relocalization. Nat Ecol Evol 1:1562–1568. https://doi.org/10.1038/s41559-017-0283-7
Figueroa-Cuilan W, Daniel JJ, Howell M et al (2016) Mini-Tn7 insertion in an artificial attTn7 site enables depletion of the essential master regulator CtrA in the phytopathogen Agrobacterium tumefaciens. Appl Environ Microbiol 82:5015–5025. https://doi.org/10.1128/AEM.01392-16
Jackson CA, Castro DM, Saldi G-A et al (2020) Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments. Elife 9:e51254. https://doi.org/10.7554/eLife.51254
Jagdish T, Nguyen Ba AN (2022) Microbial experimental evolution in a massively multiplexed and high-throughput era. Curr Opin Genet Dev. https://doi.org/10.1016/j.gde.2022.101943
Jasinska W, Manhart M, Lerner J et al (2020) Chromosomal barcoding of E. coli populations reveals lineage diversity dynamics at high resolution. Nat Ecol Evol 4:437–452. https://doi.org/10.1038/s41559-020-1103-z
Johnson MS, Venkataram S, Kryazhimskiy S (2023) Best practices in designing, sequencing, and identifying random DNA barcodes. J Mol Evol. https://doi.org/10.1007/s00239-022-10083-z
Kinsler G, Geiler-Samerotte K, Petrov DA (2020) Fitness variation across subtle environmental perturbations reveals local modularity and global pleiotropy of adaptation. Elife 9:e61271. https://doi.org/10.7554/eLife.61271
Kinsler G, Schmidlin K, Newell D et al (2022) Extreme sensitivity of fitness to environmental conditions; lessons from #1BigBatch. bioRxiv. https://doi.org/10.1101/2022.08.25.505320
Lenski RE (2017) Experimental evolution and the dynamics of adaptation and genome evolution in microbial populations. ISME J 11:2181–2194. https://doi.org/10.1038/ismej.2017.69
Levy SF, Blundell JR, Venkataram S et al (2015) Quantitative evolutionary dynamics using high-resolution lineage tracking. Nature 519:181–186. https://doi.org/10.1038/nature14279
Li F, Salit ML, Levy SF (2018) Unbiased fitness estimation of pooled barcode or amplicon sequencing studies. Cell Syst 7(5):521–525.e4. https://doi.org/10.1016/j.cels.2018.09.004
Li C, Wen A, Shen B et al (2011) FastCloning: a highly simplified, purification-free, sequence- and ligation-independent PCR cloning method. BMC Biotechnol 11:92. https://doi.org/10.1186/1472-6750-11-92
Li F, Tarkington J, Sherlock G (2023) Fit-Seq2.0: an improved software for high-throughput fitness measurements using pooled competition assays. J Mol Evol. https://doi.org/10.1007/s00239-023-10098-0
Lind PA, Farr AD, Rainey PB (2015) Experimental evolution reveals hidden diversity in evolutionary pathways. Elife 4:e07074. https://doi.org/10.7554/eLife.07074
Liu Y, Rainey PB, Zhang XX (2014) Mini-Tn7 vectors for studying post-transcriptional gene expression in Pseudomonas. J Microbiol Methods 107:182–185. https://doi.org/10.1016/j.mimet.2014.10.015
Mitra R, McKenzie GJ, Yi L et al (2010) Characterization of the TnsD-attTn7 complex that promotes site-specific insertion of Tn7. Mob DNA 1:18. https://doi.org/10.1186/1759-8753-1-18
Nguyen Ba AN, Lawrence KR, Rego-Costa A et al (2022) Barcoded bulk QTL mapping reveals highly polygenic and epistatic architecture of complex traits in yeast. Elife 11:e73983. https://doi.org/10.7554/eLife.73983
Parks AR, Peters JE (2007) Transposon Tn7 is widespread in diverse bacteria and forms genomic islands. J Bacteriol 189:2170–2173. https://doi.org/10.1128/JB.01536-06
Rogers ZN, McFarland CD, Winters IP et al (2018) Mapping the in vivo fitness landscape of lung adenocarcinoma tumor suppression in mice. Nat Genet 50:483–486. https://doi.org/10.1038/s41588-018-0083-2
Schlechter RO, Jun H, Bernach M et al (2018) Chromatic bacteria–A broad host-range plasmid and chromosomal insertion toolbox for fluorescent protein expression in bacteria. Front Microbiol. https://doi.org/10.3389/fmicb.2018.03052
Theodosiou L, Farr AD, Rainey PB (2023) Barcoding populations of Pseudomonas fluorescens SBW25. Data repository, zenodo. https://doi.org/10.5281/zenodo.7703993
van Opijnen T, Bodi KL, Camilli A (2009) Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat Methods 6:767–772. https://doi.org/10.1038/nmeth.1377
Vasquez KS, Willis L, Cira NJ et al (2021) Quantifying rapid bacterial evolution and transmission within the mouse intestine. Cell Host Microbe 29:1454-1468.e4. https://doi.org/10.1016/j.chom.2021.08.003
Venkataram S, Dunn B, Li Y et al (2016) Development of a comprehensive genotype-to-fitness map of adaptation-driving mutations in yeast. Cell 166:1585-1596.e22. https://doi.org/10.1016/j.cell.2016.08.002
Venkataram S, Kuo HY, Hom EFY, Kryazhimskiy S (2022) Mutualism-enhancing mutations dominate early adaptation in a microbial community. Biorxiv. https://doi.org/10.1101/2021.07.07.451547
Waddell CS, Craig NL (1989) Tn7 transposition: recognition of the attTn7 target sequence. Proc Natl Acad Sci 86:3958–3962. https://doi.org/10.1073/pnas.86.11.3958
Wiles TJ, Wall ES, Schlomann BH et al (2018) Modernized tools for streamlined genetic manipulation and comparative study of wild and diverse proteobacterial lineages. Mbio 9:e01877-e1918. https://doi.org/10.1128/mBio.01877-18
Zhang J, Kobert K, Flouri T, Stamatakis A (2014) PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics 30(5):614–620. https://doi.org/10.1093/bioinformatics/btt593
Zhao L, Liu Z, Levy SF, Wu S (2018) Bartender: a fast and accurate clustering algorithm to count barcode reads. Bioinformatics 34:739–747. https://doi.org/10.1093/bioinformatics/btx655
Acknowledgements
We thank Withe Derner for help constructing the barcoded cellular library, David Rogers for advice on the vector construction and amplification of the barcoded region, Ellen McConnell for her advice in improving the Salt DNA extraction protocol, Sven Kuenzel for his help sequencing the samples, and Kaumudi Prabhakara for reading the manuscript and providing valuable insights. We acknowledge the Max Planck Society for generous core support and funding from the Deutsche Forschungsgemeinschaft (DFG) Collaborative Research Center 1182 ‘Origin and Function of Metaorganisms’ (grant no. SFB1182).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Handling editor: Kerry Geiler-Samerotte.
Supplementary Information
Below is the link to the electronic supplementary material, which includes supplementary figures and tables.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Theodosiou, L., Farr, A.D. & Rainey, P.B. Barcoding Populations of Pseudomonas fluorescens SBW25. J Mol Evol 91, 254–262 (2023). https://doi.org/10.1007/s00239-023-10103-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-023-10103-6