Abstract
Cis-regulatory sequences play a crucial role in regulating gene expression and are evolutionary hot spots that drive phenotypic divergence among organisms. Sequencing some cis-regulatory regions of interest in many different species is common in comparative genetic studies. For nonmodel organisms lacking genomic data, genome walking is often the preferred method for this type of application. However, applying genome walking will be laborious and time-consuming when the number of cis-regulatory regions and species to be analyzed is large. In this study, we propose a novel method called amplicon genome fishing (AGF), which can isolate and sequence cis-regulatory regions of interest for any organism. The main idea of the AGF method is to use fragments amplified from the target cis-regulatory regions as enrichment baits to capture and sequence the whole target cis-regulatory regions from genomic library pools. Unlike genome walking, the AGF method is based on hybridization capture and high-throughput sequencing, which makes this method rapid and efficient for projects where some cis-regulatory regions have to be sequenced for many species. We used human amplicons as capture baits and successfully sequenced five target enhancer regions of Homo sapiens, Mus musculus, Gallus gallus, and Xenopus tropicalis, proving the feasibility and repeatability of AGF. To show the utility of the AGF method in real studies, we used it to sequence the ZRS enhancer, a cis-regulatory region associated with the limb loss of snakes, for twenty-three vertebrate species (includes many limbless species never sequenced before). The newly obtained ZRS sequences provide new perspectives into the relationship between the ZRS enhancer's evolution and limb loss in major tetrapod lineages.
Similar content being viewed by others
Data accessibility
The Illumina read data generated in this study were deposited in NCBI SRA (accession PRJNA683667 and PRJNA683686). The newly obtained enhancer sequences were deposited in FigShare https://figshare.com/s/fe8c63036f371dc5a0f7.
References
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS et al (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19(5):455–477. https://doi.org/10.1089/cmb.2012.0021
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30(15):2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Brandley MC, Huelsenbeck JP, Wiens JJ (2008) Rates and patterns in the evolution of snake-like body form in squamate reptiles: evidence for repeated re-evolution of lost digits and long-term persistence of intermediate body forms. Evolution 62(8):2042–2064. https://doi.org/10.1111/j.1558-5646.2008.00430.x
Cretekos CJ, Wang Y, Green ED, Martin JF, Rasweiler JJ IV, Behringer RR (2008) Regulatory divergence modifies limb length between mammals. Genes Dev 22(2):141–151. https://doi.org/10.1101/gad.1620408
Dai SM, Chen HH, Chang C, Riggs AD, Flanagan SD (2000) Ligation-mediated PCR for quantitative in vivo footprinting. Nat Biotechnol 18(10 SUPPL.):1108–1111. https://doi.org/10.1038/80323
Glenn TC, Faircloth BC (2016) Capturing Darwin’s dream. Mol Ecol Resour 16(5):1051–1058. https://doi.org/10.1111/1755-0998.12574
Hardison RC (2000) Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet 16(9):369–372. https://doi.org/10.1016/S0168-9525(00)02081-3
Hardison RC, Taylor J (2012) Genomic approaches towards finding cis-regulatory modules in animals. Nat Rev Genet 13(7):469–483. https://doi.org/10.1038/nrg3242
Kalendar R, Shustov AV, Seppänen MM, Schulman AH, Stoddard FL (2019) Palindromic sequence-targeted (PST) PCR: a rapid and efficient method for high-throughput gene characterization and genome walking. Sci Rep 9(1):1–11. https://doi.org/10.1038/s41598-019-54168-0
Kvon EZ, Kamneva OK, Melo US, Barozzi I, Osterwalder M, Mannion BJ et al (2016) Progressive loss of function in a limb enhancer during snake evolution. Cell 167(3):633-642.e11. https://doi.org/10.1016/j.cell.2016.09.028
Leal F, Cohn MJ (2016) Loss and re-emergence of legs in snakes by modular evolution of Sonic hedgehog and HOXD enhancers. Curr Biol 26(21):2966–2973. https://doi.org/10.1016/j.cub.2016.09.020
Lemmon AR, Emme SA, Lemmon EM (2012) Anchored hybrid enrichment for massively high-throughput phylogenomics. Syst Biol 61(5):727–744. https://doi.org/10.1093/sysbio/sys049
Lettice LA, Heaney SJH, Purdie LA, Li L, de Beer P, Oostra BA et al (2003) A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. Hum Mol Genet 12(14):1725–1735. https://doi.org/10.1093/hmg/ddg180
Lettice LA, Hill AE, Devenney PS, Hill RE (2008) Point mutations in a distant sonic hedgehog cis-regulator generate a variable regulatory output responsible for preaxial polydactyly. Hum Mol Genet 17(7):978–985. https://doi.org/10.1093/hmg/ddm370
Lettice LA, Williamson I, Wiltshire JH, Peluso S, Devenney PS, Hill AE et al (2012) Opposing functions of the ETS factor family define Shh spatial expression in limb buds and underlie polydactyly. Dev Cell 22(2):459–467. https://doi.org/10.1016/j.devcel.2011.12.010
Lettice LA, Devenney P, De Angelis C, Hill RE (2017) The conserved Sonic Hedgehog Limb Enhancer consists of discrete functional elements that regulate precise spatial expression. Cell Rep 20(6):1396–1408. https://doi.org/10.1016/j.celrep.2017.07.037
Liu YG, Whittier RF (1995) Thermal asymmetric interlaced PCR: automatable amplification and sequencing of insert end fragments from P1 and YAC clones for chromosome walking. Genomics 25(3):674–681. https://doi.org/10.1016/0888-7543(95)80010-J
Liu K, Warnow TJ, Holder MT, Nelesen SM, Yu J, Stamatakis AP, Linder CR (2012) SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees. Syst Biol 61(1):90–106. https://doi.org/10.1093/sysbio/syr095
Loh EY, Elliott JF, Cwirla S, Lanier LL, Davis MM (1989) Polymerase chain reaction with single-sided specificity: analysis of T cell receptor δ chain. Science 243(4888):217–220. https://doi.org/10.1126/science.2463672
Mamanova L, Coffey AJ, Scott CE, Kozarewa I, Turner EH, Kumar A et al (2010) Target-enrichment strategies for next-generation sequencing. Nat Methods 7(2):111–118. https://doi.org/10.1038/nmeth.1419
Ochman H, Gerber AS, Hartl DL (1988) Genetic applications of an inverse polymerase chain reaction. Genetics 120(3):621–623
Parker JD, Rabinovitch PS, Burmer GC (1991) Targeted gene walking polymerase chain reaction. Nucleic Acids Res 19(11):3055–3060. https://doi.org/10.1093/nar/19.11.3055
Peñalba JV, Smith LL, Tonione MA, Sass C, Hykin SM, Skipwith PL et al (2014) Sequence capture using PCR-generated probes: a cost-effective method of targeted high-throughput sequencing for nonmodel organisms. Mol Ecol Resour 14(5):1000–1010. https://doi.org/10.1111/1755-0998.12249
Prescott SL, Srinivasan R, Marchetto MC, Grishina I, Narvaiza I, Selleri L et al (2015) Enhancer divergence and cis-regulatory evolution in the human and chimp neural crest. Cell 163(1):68–83. https://doi.org/10.1016/j.cell.2015.08.036
Reddy PS, Mahanty S, Kaul T, Nair S, Sopory SK, Reddy MK (2008) A high-throughput genome-walking method and its use for cloning unknown flanking sequences. Anal Biochem 381(2):248–253. https://doi.org/10.1016/j.ab.2008.07.012
Roscito JG, Sameith K, Parra G, Langer BE, Petzold A, Moebius C et al (2018) Phenotype loss is associated with widespread divergence of the gene regulatory landscape in evolution. Nat Commun. https://doi.org/10.1038/s41467-018-07122-z
Sagai T, Hosoya M, Mizushina Y, Tamura M, Shiroishi T (2005) Elimination of a long-range cis-regulatory module causes complete loss of limb-specific Shh expression and truncation of the mouse limb. Development 132(4):797–803. https://doi.org/10.1242/dev.01613
Schoenfelder S, Fraser P (2019) Long-range enhancer–promoter contacts in gene expression control. Nat Rev Genet. https://doi.org/10.1038/s41576-019-0128-0
Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K et al (2005) Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res 15(8):1034–1050. https://doi.org/10.1101/gr.3715005
Tan G, Gao Y, Shi M, Zhang X, He S, Chen Z, An C (2005) SiteFinding-PCR: a simple and efficient PCR method for chromosome walking. Nucleic Acids Res 33(13):1–7. https://doi.org/10.1093/nar/gni124
Thirulogachandar V, Pandey P, Vaishnavi CS, Reddy MK (2011) An affinity-based genome walking method to find transgene integration loci in transgenic genome. Anal Biochem 416(2):196–201. https://doi.org/10.1016/j.ab.2011.05.021
Visel A, Minovitsky S, Dubchak I, Pennacchio LA (2007) VISTA enhancer browser—a database of tissue-specific human enhancers. Nucleic Acids Res 35(SUPPL. 1):88–92. https://doi.org/10.1093/nar/gkl822
Wittkopp PJ, Kalay G (2012) Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence. Nat Rev Genet 13(1):59–69. https://doi.org/10.1038/nrg3095
Woltering JM, Irisarri I, Ericsson R, Joss JMP, Sordino P, Meyer A (2020) Sarcopterygian fin ontogeny elucidates the origin of hands with digits. Sci Adv 6(34):eabc3510. https://doi.org/10.1126/sciadv.abc3510
Wray GA (2007) The evolutionary significance of cis-regulatory mutations. Nat Rev Genet 8(3):206–216. https://doi.org/10.1038/nrg2063
Yuanxin Y, Chengcai A, Li L, Jiayu G, Guihong T, Zhangliang C (2003) T-linker-specific ligation PCR (T-linker PCR): an advanced PCR technique for chromosome walking or for isolation of tagged DNA ends. Nucleic Acids Res 31(12):e68–e68. https://doi.org/10.1093/nar/gng068
Zhang Y, Deng S, Liang D, Zhang P (2019) Sequence capture across large phylogenetic scales by using pooled PCR-generated baits: a case study of Lepidoptera. Mol Ecol Res. https://doi.org/10.1111/1755-0998.13026
Acknowledgements
This work was supported by Guangdong Basic and Applied Basic Research Foundation, China (No. 2019A1515010729) to D. Liang, and the National Natural Science Foundation of China (Grants No. 32071611) to P. Zhang. Some tissue samples used in this study are from the Museum of Vertebrate Zoology, Berkeley. We want to thank those researchers who collected and kindly shared these invaluable specimens.
Author information
Authors and Affiliations
Contributions
DL designed and supervised the whole study. HG performed the experiments. HG and MX did data analysis. HG, PZ, and DL wrote the paper.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Communicated by Stefan Hohmann.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Gu, H., Zhang, P., Xu, M. et al. Amplicon genome fishing (AGF): a rapid and efficient method for sequencing target cis-regulatory regions in nonmodel organisms. Mol Genet Genomics 296, 527–539 (2021). https://doi.org/10.1007/s00438-021-01775-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-021-01775-0