Abstract
Purpose of Review
Here we highlight the significant contribution that genomics-based approaches have had on the field of Cryptosporidium research and the insights these approaches have generated into Cryptosporidium biology and transmission.
Recent Findings
There are advances in genomics, genetic manipulation, gene expression, and single-cell technologies. New and better genome sequences have revealed variable sub-telomeric gene families and genes under selection. RNA expression data now include single-cell and post-infection time points. These data have provided insights into the Cryptosporidium life cycle and host–pathogen interactions. Antisense and ncRNA transcripts are abundant. The critical role of the dsRNA virus is becoming apparent.
Summary
The community’s ability to identify genomic targets in the abundant, yet still lacking, collection of genomic data, combined with their increased ability to assess function via gene knock-out, is revolutionizing the field. Advances in the detection of virulence genes, surveillance, population genomics, recombination studies, and epigenetics are upon us.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Cryptosporidiosis is a neglected disease, caused by apicomplexan parasites in the genus Cryptosporidium. It has devastating impacts on the most vulnerable, especially infants and the immunosuppressed [1]. As the scope and significance of this infectious disease has become apparent [2, 3••], the research community has responded. Over the last decade, there have been significant advances in genetics, genomics, the ability to culture the parasite and perform high-throughput screening, host–pathogen interactions, surveillance, and therapeutics. The advances in these areas have yielded large quantities of associated genomic data (Table 1) that are fueling advances in our understanding of Cryptosporidium, its life cycle [4, 5], its evolution [6], transmission [7••], and host–pathogen interactions [8, 9, 10••]. These data will also facilitate the design of better surveillance tools for local, regional, and hopefully, global use.
Genomic data, however, are not without their challenges. Historically, Cryptosporidium genome sequences have been very hard to generate due to a lack of pure parasite material. This challenge has recently been overcome for genome sequence generation, but not for transcriptomics of post-infection Cryptosporidium life cycle stages. Another challenge arises from the fact that most available data were created with short-read sequencing approaches. While Illumina sequencing is highly accurate, it cannot yield complete genome assemblies and poses significant challenges for the analysis of gene families and repetitive sequences. Currently, cloning is also impossible for Cryptosporidium. Thus, nearly all genomic and transcriptomic data have been generated using populations of parasites rather than purified isogenic clones. This fact creates considerable challenges for genome assembly, data analysis, and interpretation. Here we highlight key advances, remaining challenges and future prospects.
Technological Advances Have Facilitated Cryptosporidium Genomics
Whole Genome Sequencing
With the advent of high accuracy second-generation sequencing (Illumina short reads) and the large fragment sequencing capabilities of third-generation sequencing (Pacific Biosciences and Oxford Nanopore Technologies, ONT), Cryptosporidium genome sequences are being generated at an increased rate [11]. Currently, there are 74 Cryptosporidium genome sequence assemblies located in the NCBI GenBank and more than half have been submitted since 2018. Whole-genome sequences are needed to facilitate the research community’s ability to design and interpret their experiments. As additional genome sequences become available for new species and strains, a framework for a more holistic genomic comparative analysis is being constructed. The power of comparative insights is significant [12, 13••, 14••, 15]. For example, the addition of a small number of genome sequences shed considerable insight into the diversity and evolution of a species, C. parvum, which revealed the existence of an anthroponotic subclade that was likely shaped via introgression of DNA from other Cryptosporidium species and subtypes [6].
Single-Oocyst Sequencing
Genome sequence generation for Cryptosporidium has historically been quite difficult due to the large number of oocysts required for DNA preparation. Oocysts, which contain four haploid sporozoites, have ~ 40 fg of genomic DNA. Most clinical samples do not contain a sufficient number of oocysts to reach the minimum DNA requirements for sequencing library preparation. Thus, important isolates have historically been propagated in immunosuppressed mice or gnotobiotic pigs. This process is difficult, expensive, and time-consuming. However, with the advent of single-oocyst sequencing, the possibility of generating genomic sequences from a single oocyst is a reality [16]. The protocol involves oocyst sorting, lysis, genome amplification with multiple displacement amplification (MDA), and sequencing with short-read Illumina [16]. This technique has recently been modified to utilize long-read ONT sequencing [17••, 18]. Given the obligately sexual nature of Cryptosporidium and the existence of four related haploid sporozoites within an oocyst, single-oocyst sequencing is also a promising technique for studies of diversity within a single infection [16]. Single-oocyst sequencing also creates an avenue for studying diversity and recombination events within a single oocyst [17••].
Hybrid Capture from Fecal DNA Samples
Hybrid capture, i.e., the selective enrichment of particular DNA sequences via hybridization to long, single-stranded RNA probes representing the target genome sequence of interest [19], is an ideal approach for isolating Cryptosporidium genomic DNA from fecal DNA samples. Most clinical Cryptosporidium fecal DNA samples contain abundant microbial, food, and human DNA content. Recently, hybrid capture has proven tractable for fecal DNA samples with a Cryptosporidium qPCR Ct score of < 20 and for much higher values if a double enrichment is performed (Bayona et al., in prep). The use of hybridization capture has made fecal DNA samples accessible for genome sequence generation. This development will permit samples from numerous studies, sitting in freezers to be analyzed. Importantly, the hybrid capture baits can be customized to have a wider sequence divergence range to facilitate detection of less common human-infecting species. Smaller subsets of probes can be tailored for specific regions of the genome to provide a multi-locus approach to quickly screen large numbers of samples and facilitate outbreak investigations.
Genomic and Transcriptomic Data Are Abundant yet Incomplete
As we can see in Table 1, there are over 1000 genomic data sets for C. parvum and C. hominis; yet, there are only nine assembled and annotated genome sequences. Outside of these two prominent human-infecting species, the situation is bleak. A few dozen genome sequence data sets exist for all other species, and more than half of all named species have no genomic sequence data and are missing from Table 1.
Most existing Cryptosporidium genome sequences also present several challenges for the community. Most were generated using only short-read technologies that produce assemblies that contain gaps and compressed sequence regions (Fig. 1). A complete telomere-to-telomere, T2T, chromosomal assembly would contain eight chromosomes. Although the karyotype is unknown for most species, current assemblies contain dozens to hundreds of contigs, few telomere sequences, and many unassembled reads. Genome sequence assembly gaps most often arise in genome regions that contain repetitive sequences making short reads difficult to place. Long stretches of repetitive sequence also generate gaps, as does the merger of recent gene duplications that reside in multiple locations in genome (Fig. 1). Diversity within the population of parasites being sequenced can also create gaps because some parasites may possess structural variants like indels and inversions or differences in gene family or repeat copy numbers. When genome sequences contain gaps, it is difficult to know if genes are actually missing, thus posing significant challenges for comparative genomics.
The inclusion of long-read sequencing approaches and hybrid genome assemblies utilizing both long- and short-read approaches is the answer. Long reads, which can reach 100 + kb in length, can cover large genomic regions permitting an exact determination of repeat or gene copy numbers (Fig. 1) and provide proof of genome rearrangements. Depending on population numbers, contigs for differing genotypes within a population can be obtained, i.e., evidence for parasites that have two vs three copies of a particular gene in the same isolate.
When looking at RNA sequence data, the landscape is barren (Table 1), and most existing data sets are a combination of host and parasite transcripts since purification of post-infection parasites remains tenuous at best [20]. There are 145 RNA data sets for C. parvum, four for C. hominis, and one for C. baileyi. This paucity of data has significant consequences for the community. It means that the few genome sequences that have annotation have had to rely on orthology and de novo gene prediction alone. This means that species-specific genes are very difficult to discover, ncRNA genes will be missed, and untranslated regions (UTRs) will be unannotated making it difficult to know where promoters are located since transcription initiation sites are unknown and studies of post-transcriptional regulation, which often involve sequences in the 5′ and 3′ UTRs, are impossible.
Comparative genomics of Cryptosporidium species yields informative insights
Many Human-Infecting Species Are Closely Related
Genome sequences for the species most often observed in humans revealed that the genome sequences are highly similar and highly syntenic [6, 21, 22]. Several recent papers have also demonstrated the complex population genomic structure of C. parvum and C. hominis in natural infections [6, 7••, 15, 23] and highlighted the role that recombination and introgression have played during evolution [6, 14••, 15]. These and other works also show the impact that recombination can have with respect to the generation of novelty with proven impacts on transmission [7••]. Comparative genomics has also revealed the strikingly close relationship between the genome sequences of C. cuniculus, which infects rabbits and humans, and C. hominis [6]. Interestingly, an even closer genomic relationship was observed between the genome sequences of C. parvum and C. tyzzeri, yet C. tyzzeri only infects mice [24].
Recently, a genetic cross was reported not only within C. parvum but also between C. parvum and C. tyzzeri [17••]. In addition to being a major genetics breakthrough for Cryptosporidium research, this finding, combined with the recombination and introgression observations above, raises questions regarding the definition of what constitutes a species in Cryptosporidium. For many reasons, we are not advocating changes, only the recognition of just how similar some subclades and species are to each other at the genomic sequence level and how little we know about their host range [25]. C. parvum shares 96.8% identity with C. hominis, 97.2% identity with C. tyzzeri, 97% identity with C. cuniculus, and 91.3% identity with C. meleagridis (% identity is average nucleotide identity). These species also share almost complete synteny (gene order and orientation) and appear to differ by only a few sub-telomeric genes, if any [24, 26••] with the exception of C. meleagridis that appears, on the basis of long reads, to have a few dozen small intra- and inter-chromosomal rearrangements relative to the other species [27]. Thus, the genetic basis of host preference and pathogenicity may extend from gene content differences to also include single nucleotide variants, small indels, and possible differences in gene regulation. It is worth noting that significant differences in gene content between these species are found with short read analyses, including one in which some of us have participated [22, 28], highlighting the impact of technology and assembly quality on downstream analyses. Notably, synteny with species outside of this group, for which we have genome sequences, no longer extends for the full length of the chromosome and instead is broken down into smaller units of recognizable synteny [11, 29].
Subtelomeric Chromosomal Regions Contain Gene Families and Appear to be Highly Dynamic
In general, the subtelomeric regions of eukaryotic chromosomes are more dynamic in terms of gene copy numbers and levels of observed variation than the rest of the chromosome, and this is particularly the case in pathogenic organisms [30, 31]. Genes that encode proteins involved in host–pathogen interactions and environmental responses are often, but not uniquely, located in sub-telomeric locations [30, 31]. As a result, these regions of the genome are notoriously difficult to assemble. They also represent some of the fastest evolving regions of the genome and, thus, are interesting from the perspectives of host–pathogen biology, evolution, and diagnostics/surveillance.
The first T2T genome sequences for C. parvum revealed surprises regarding higher than expected (based on previous short-read assemblies) gene copy number for a number of genes located in subtelomeric regions, e.g., MEDLE genes, tryptophan synthase beta, and rRNA genes among others [24, 26••]. They also revealed that three different chromosomes shared a total of four highly similar subtelomeric chromosome ends, indicating that replication had occurred between chromosomes [24, 26••]. Better assembly and identification of genes in subtelomeric regions are likely to be crucial for our understanding of important aspects of Cryptosporidium biology. For example, MEDLE proteins, most of which are encoded in subtelomeric regions, are important secreted pathogenesis determinants [9, 32, 33] that appear to be differentially present across a number of species [29].
Population Genomic Studies Provide Insights into Variation, Evolution, and Transmission
Short-read sequencing technology permitted the generation of nearly 700 genomic data sets for C. parvum and nearly 400 for C. hominis. These highly accurate reads have been used to detect variants that exist among and between the different populations of Cryptosporidium parasites that have been sequenced [6, 7••, 13••, 14••, 15, 34, 35•]. The results have been illuminating. They have revealed a discordance in some cases between gp60 single locus typing and genome ancestry, mixed infections with the same or different species, recombination events within species and hybridization between species, discovery of novel subclades, and in general demonstrated the role that admixture has had on shaping population structure [13••, 15].
These studies have also revealed how little we know about the global population structure of Cryptosporidium species and the forces driving their evolution in differing environments and outbreak scenarios [6, 7••, 13••, 14••, 15, 34, 35•]. These studies also reveal the critical role that the reference genome has in the determination of differences in gene content and polymorphisms. Figure 1 highlights the theoretical outcome of determining single-nucleotide variants (SNVs) in two different scenarios, uncompressed gap free and compressed gapped genome sequences. Thus, a degree of caution is warranted for the interpretation of variant calling until the community has more complete reference genome sequences. The community would greatly benefit from a more diverse set of reference genome sequences and methods for capturing novel genomic content that may not be present in any given reference genome sequence.
Transcriptomics in Cryptosporidium
Annotation, Antisense, and ncRNA Transcripts
Utilization of small RNA-seq and PacBio long-read Iso-seq and ONT Direct RNAseq has significantly advanced our understanding of the Cryptosporidium transcriptome. These technologies have enabled the identification of untranslated regions (UTRs), as well as a variety of long and short non-coding RNAs (ncRNAs), including anti-sense transcripts of unknown function [26••, 36, 37]. Furthermore, single-molecule long-read RNAseq has been instrumental in demonstrating that approximately 10% of C. parvum genes have polycistronic transcripts, offering new insights into gene expression biology and regulation in this important pathogen (Xiao et al., in prep).
Differential Gene Expression
There is a significant amount of RNA-seq data for C. parvum (Table 1). Studies involving oxidative and heat stress on C. parvum oocysts have identified genes responsive to environmental cues [38]. Transcriptomic analyses of C. parvum-infected HCT-8 cells have revealed gene regulation patterns during early stages of infection [39•]. Comparative RNA-seq studies of both C. parvum and C. hominis have provided novel data sets on host-parasite RNA interactions during infection [40]. Research on AP-2 transcription factor deletion and its impact on sex differentiation and oocyst shedding, RNA m6A-immunoprecipitation in infected host cells, and differential gene expression in distinct Cryptosporidium species using enteroids have all contributed significantly to our understanding of life cycle progression and interactions with the host [41••, 42, 43••]. Additionally, RNA-seq in drug screening assays has suggested potential inhibition of translation during parasite sexual differentiation [44].
RNA expression data sets are desperately needed for additional species, especially C. hominis (Table 1). These data will permit comparative transcriptomic analyses and provide additional insight into gene regulatory differences that may exist between species. Additionally, RNA data can also be used to greatly improve the annotation of reference genome sequences for the community by providing evidence for UTRs and alternative isoforms, if present.
Epigenetics
Recent studies have shown that C. parvum has enzymatically functional histone methyltransferases, indicating developmentally dependent histone modifications. These findings also suggest that C. parvum infection can alter the epigenetic landscapes of host cells [45••]. Additionally, ATAC-seq in C. parvum sporozoites has provided the first glimpse into the parasite’s accessible chromatin landscape, prior to invasion, enabling new research into the regulation of parasite gene expression (Xiao et al., in prep). The community will benefit greatly from additional research into this exciting layer of gene regulation in Cryptosporidium especially during it developmental life cycle.
Major Advances That Are Generating Abundant, Informative Genomic and Transcriptomic Data
Transgenics
CRISPR/Cas9 mediated genetic modifications work in Cryptosporidium [46, 47]. Genetically modified C. tyzerri and C. parvum are being used to study host protective immunity and Cryptosporidium biology [47,48,49]. Dihydrofolate reductase-thymidylate synthase (DHFR-TS) and inosine monophosphate dehydrogenase (IMPDH) have been knocked out to study nucleotide synthesis [50]. C. parvum parasites were still viable after gene knockouts, suggesting alternative pathways for sequestering nucleotides from the host. This knowledge will help in drug development as some drugs are designed to target nucleotide synthesis [50], and these pathways have been successfully targeted in other apicomplexans.
Several selectable genetic markers are now available [17••, 46, 51••], and this development has permitted genetic crosses [17••, 51••] in immunocompromised mice. This advance is of considerable significance. The model, combined with genomic sequences, opens up reverse genetic research into Cryptosporidium biology, development, and the important question of host specificity.
As cloning of individual parasites is not yet possible, many phenotypic effects like changes in gene regulation or recombinant progeny are assessed via analyses of genome and transcriptome sequencing [17••], thus generating many new, important, data sets.
RNA Host–Pathogen Interactions
Recent advances looking at host-Cryptosporidium interactions have revealed that a wide variety of novel RNA forms are involved. For example, host circular RNA ciRS-7 is upregulated during C. parvum infection in HCT-8 cells. This upregulation influences the NF-κB signaling pathway by sponging miR-1270, which in turn significantly impacts the propagation of C. parvum [52]. A recent scRNA study of C. parvum-infected intestinal epithelial cells has led to development of a model to explain the role of IFN-gamma in the control of C. parvum infection in intestinal epithelial cells [53].
On the parasite side, recent research has shown that some C. parvum long noncoding RNAs can localize to the host cell [10••] and manipulate host cell gene expression by suppressing expression of CDH3 and LOXL4 [8]. One group has also explored RNA-based therapy, in which a single-stranded antisense RNA designed to parasite protein coding genes can silence parasite genes when loaded with argonaute protein [54].
The Cryptosporidium RNA Virus and Host–Pathogen Interactions
Previous studies have discovered two unique extrachromosomal linear double-stranded RNAs in C. parvum, encoding an RNA-dependent RNA polymerase and a protein kinase [55] The Cryptosporidium dsRNA virus has been detected in isolates of C. parvum, C. hominis, C. meleagridis, and C. felis [56]. Recent research has shown that the dsRNA virus can hijack a host long noncoding RNA, U90926 [10••]. Additional research has also shown that the Cryptosporidium dsRNA virus can trigger the host type I IFN antiviral pathway to dampen the hosts antiparasitic response, thus facilitating parasite success [57••]. All of these studies have generated numerous, exciting, host–pathogen gene expression data sets.
Applications to Control and Prevention
The Influence of Genomics on Cryptosporidium Surveillance
The natural environments that serve as reservoirs for Cryptosporidium are waterbodies and hosts. Several detection techniques have been developed and improved significantly, from conventional microscopy to immunological assays, flow cytometry, and nucleic acid–based methods [58]. Nucleic acid–based detection methods are highly efficient in detecting mixed infections and mixed populations with low abundance subpopulations present [59]. With nucleic acid detection–based approaches such as PCR, qPCR, and DNA sequencing, tiny amounts of Cryptosporidium DNA can be detected, and new bioinformatics tools make it easier to type Sanger sequences of Cryptosporidium species based on gp60 and SSU rRNA [60]. Though DNA sequencing as a detection method is relatively expensive, the ability to multiplex samples makes it more affordable, and its high accuracy makes it a better choice than microscopy [61, 62]. Accurately detecting and identifying Cryptosporidium sp. in an outbreak, based on genomic data, facilitate inquiries into the source of the outbreak and, on a broader scale, the epidemiology of the disease [63].
Clinical samples are essential in the study of cryptosporidiosis. Unfortunately, clinical isolates do not contain enough oocysts for traditional sequencing as discussed above. However, multiple displacement amplification (MDA), a type of whole genome amplification (WGA) technique, can be used to amplify the amount of DNA. MDA introduces very few errors, amplification is random, and the entire genome can be amplified for sequencing [64, 65]. WGA provides a means to obtain sufficient DNA from samples for use in analyses as well as long-read sequencing [66]. Studies of parasite diversity within and between samples can inform upon the source of the infection and the diversity of Cryptosporidium circulating within the population.
Genetic Markers, Diagnostics, and Surveillance
Phenotypic and antibody diagnostic identification are unable to distinguish Cryptosporidium species and subtypes; therefore, the genetic markers used such as 18SrRNA and gp60 were developed as useful single genetic markers. Due to the sexual nature of Cryptosporidium, and the fact that it has eight independently segregating chromosomes, multi-locus typing is a more ideal approach to typing [67]. Although there is no consensus on the specific markers to use, there is agreement that a multi-locus approach is needed [68] to better inform on Cryptosporidium epidemiology. Also, multi-locus genotyping is better because more data are available for identification and it adds the possibility of species subtype identification [69].
The increasing availability of full genome sequence data from increasing numbers of isolates should make the determination of appropriate loci for typing easier, but challenges remain. First, the community still lacks genomic sequence data from isolates circulating in many important regions of the world with a high incidence of Cryptosporidium infection. Just compare the burden reported in Gilbert et al. [3••] with the source of available genomes sequences in Fan et al. [11]. This situation is beginning to change, and sequences from isolates in other countries are emerging [15]. Hopefully, the genomic advances described above will facilitate this process and unlock the potential of existing samples and lead to the strategic collection of others. Additional genome sequences from new geographic locations and environments will allow the community to survey the extent of the genomic diversity that exists globally and design markers to account for it. One can also imagine the need for specialized markers to very quickly evolving regions of the genome that can be utilized in outbreak scenarios to detect variants as they arise.
Second, in order to appropriately assess genomic variation and rapidly evolving genomic regions, complete, T2T reference genomes for the species most commonly infecting humans should be established and adopted.
Finally, markers for routine surveillance are also needed. RT-PCR tests for the Cryptosporidium dsRNA virus are very sensitive due to viral abundance [70]. The method has been used to successfully identify Cryptosporidium infection in calves, lambs, goats, and environmental water samples across the world [71,72,73]. However, we do not yet know the full extent to which the dsRNA virus is present in different species [56].
Conclusions
Advances in genomics and transcriptomics are impacting all arenas of Cryptosporidium research [74••] from evolution to the life cycle, to host–pathogen interactions and surveillance. The Cryptosporidium research community has come far, very quickly with many new technologies, approaches, and data sets. Much of this new data is available for use and mining in the NCBI GenBank [75] and CryptoDB.org [76•].
The Cryptosporidium community is also struggling a bit with the difficult challenges posed by this important pathogen and the state of genomics technology. The lack of transcriptomic data for species other than C. parvum and the lack of genome sequences for more than half of the named species are real challenges. Complete genome sequences are still too hard to generate and even harder to consistently annotate, especially in the absence of RNA data. This reality impacts their utility and application to important needs like global surveillance and determination of complete gene repertoires. Complete, annotated reference genome sequences greatly facilitate experimental design, e.g., gene knock-outs, and pathway analyses. Likewise, analyses of data that require a reference genome sequence for interpretation, e.g., transcriptome and proteome data analyses and comparative genomics and evolution studies, will also benefit.
The community is also struggling with appropriate geographic representation of genomic data sets from many of the countries most affected by this pathogen. This lack of representation impacts the development of more representative, multi-locus diagnostics and impacts our knowledge base for epidemiological studies and outbreak investigations. Given how difficult Cryptosporidium is to work with, genomics advances have come far, but more is needed.
Data Availability
No datasets were generated or analysed during the current study.
References
Papers of particular interest, published recently, have been highlighted as: • Of importance •• Of major importance
Checkley W, White AC Jr, Jaganath D, Arrowood MJ, Chalmers RM, Chen XM, et al. A review of the global burden, novel diagnostics, therapeutics, and vaccine targets for Cryptosporidium. Lancet Infect Dis. 2015;15(1):85–94. https://doi.org/10.1016/S1473-3099(14)70772-8.
Kotloff KL, Nataro JP, Blackwelder WC, Nasrin D, Farag TH, Panchalingam S, et al. Burden and aetiology of diarrhoeal disease in infants and young children in developing countries (the Global Enteric Multicenter Study, GEMS): a prospective, case-control study. Lancet. 2013;382(9888):209–22. https://doi.org/10.1016/S0140-6736(13)60844-2.
Gilbert IH, Vinayak S, Striepen B, Manjunatha UH, Khalil IA, Van Voorhis WC, et al. Safe and effective treatments are needed for cryptosporidiosis, a truly neglected tropical disease. BMJ Glob Health. 2023;8(8). https://doi.org/10.1136/bmjgh-2023-012540. This work is the most recent collection of statistics, deaths, and DALYs on the global burden of Cryptosporidium and places it in the context of other neglected tropical tropical diseases. It also addresses the need for safe and effective therapeutic treatments.
English ED, Guerin A, Tandel J, Striepen B. Live imaging of the Cryptosporidium parvum life cycle reveals direct development of male and female gametes from type I meronts. PLoS Biol. 2022;20(4):e3001604. https://doi.org/10.1371/journal.pbio.3001604.
Tandel J, English ED, Sateriale A, Gullicksrud JA, Beiting DP, Sullivan MC, et al. Life cycle progression and sexual development of the apicomplexan parasite Cryptosporidium parvum. Nat Microbiol. 2019;4(12):2226–36. https://doi.org/10.1038/s41564-019-0539-x.
Nader JL, Mathers TC, Ward BJ, Pachebat JA, Swain MT, Robinson G, et al. Evolutionary genomics of anthroponosis in Cryptosporidium. Nat Microbiol. 2019;4(5):826–36. https://doi.org/10.1038/s41564-019-0377-x.
Huang W, Guo Y, Lysen C, Wang Y, Tang K, Seabolt MH, et al. Multiple introductions and recombination events underlie the emergence of a hyper-transmissible Cryptosporidium hominis subtype in the USA. Cell Host Microbe. 2023;31(1):112-23 e4. https://doi.org/10.1016/j.chom.2022.11.013. This work demonstrates how multiple different recobmination events involving C. hominis isolates from within the USA and other countries generated the hyper-transmissiable C. hominis IfA12G1R5 subtype that is increasing in incidence in the USA.
Ming Z, Gong AY, Wang Y, Zhang XT, Li M, Li Y, et al. Trans-suppression of host CDH3 and LOXL4 genes during Cryptosporidium parvum infection involves nuclear delivery of parasite Cdg7_FLc_1000 RNA. Int J Parasitol. 2018;48(6):423–31. https://doi.org/10.1016/j.ijpara.2017.10.008.
Dumaine JE, Sateriale A, Gibson AR, Reddy AG, Gullicksrud JA, Hunter EN, et al. The enteric pathogen Cryptosporidium parvum exports proteins into the cytosol of the infected host cell. Elife. 2021;10. https://doi.org/10.7554/eLife.70451.
Graham ML, Li M, Gong AY, Deng S, Jin K, Wang S, et al. Cryptosporidium parvum hijacks a host’s long noncoding RNA U90926 to evade intestinal epithelial cell-autonomous antiparasitic defense. Front Immunol. 2023;14:1205468. https://doi.org/10.3389/fimmu.2023.1205468. This work demonstrates the role that the Cryptosporidium virus plays in affecting regulation of a host lncRNA to epigenetically alter the host cell anti-parastitic response.
Fan Y, Feng Y, Xiao L. Comparative genomics: how has it advanced our knowledge of cryptosporidiosis epidemiology? Parasitol Res. 2019;118(12):3195–204. https://doi.org/10.1007/s00436-019-06537-x.
Abrahamsen MS, Templeton TJ, Enomoto S, Abrahante JE, Zhu G, Lancto CA, et al. Complete genome sequence of the apicomplexan Cryptosporidium parvum. Science. 2004;304(5669):441–5. https://doi.org/10.1126/science.1094786.
Corsi GI, Tichkule S, Sannella AR, Vatta P, Asnicar F, Segata N, et al. Recent genetic exchanges and admixture shape the genome and population structure of the zoonotic pathogen Cryptosporidium parvum. Mol Ecol. 2023;32(10):2633–45. https://doi.org/10.1111/mec.16556. This work identifies and traces the gene flow that resulted from several recombination events in C. parvum between ruminant and human isolates. They also age the recomination events and show that 50% have occurred in the last ~200 years.
Tichkule S, Caccio SM, Robinson G, Chalmers RM, Mueller I, Emery-Corbin SJ, et al. Global population genomics of two subspecies of Cryptosporidium hominis during 500 years of evolution. Mol Biol Evol. 2022;39(4). https://doi.org/10.1093/molbev/msac056. This work demonstrates the evolution of two proposed subspecies of C. hominis that differ by environment and transmission, i.e., between low-income and high-income countries. These subtypes differ in putative resistance genes, effective population sizes, and there is a bias in the direciton of gene flow between them.
Tichkule S, Jex AR, van Oosterhout C, Sannella AR, Krumkamp R, Aldrich C, et al. Comparative genomics revealed adaptive admixture in Cryptosporidium hominis in Africa. Microb Genom. 2021;7(1). https://doi.org/10.1099/mgen.0.000493.
Troell K, Hallstrom B, Divne AM, Alsmark C, Arrighi R, Huss M, et al. Cryptosporidium as a testbed for single cell genome characterization of unicellular eukaryotes. BMC Genomics. 2016;17:471. https://doi.org/10.1186/s12864-016-2815-y.
Shaw S, Cohn IS, Baptista RP, Xia G, Melillo B, Agyabeng-Dadzie F, et al. Genetic crosses within and between species of Cryptosporidium. Proc Natl Acad Sci U S A. 2024;121(1):e2313210120. https://doi.org/10.1073/pnas.2313210120. This work highlights the development of a new selectable marker for genetic studies in Cryptosporidium. They use this and an existing marker to perfrom genetic crosses between C. parvum parasites and between C. parvum and C. tyzzeri. This work also reports on the successful use of genome apmplification and long-read sequencing of single oocysts.
Agyabeng-Dadzie F, Beaudry M, Deyanov A, Slanis H, Duong MQ, Turner R, et al. Evaluating the benefits and limits of multiple displacement amplification with whole-genome Oxford Nanopore Sequencing. bioRxiv. 2024. https://doi.org/10.1101/2024.02.09.579537.
Gaudin M, Desnues C. Hybrid capture-based next generation sequencing and its application to human infectious diseases. Front Microbiol. 2018;9:2924. https://doi.org/10.3389/fmicb.2018.02924.
Kissinger JC, Hermetz KE, Woods KM, Upton SJ. Enrichment of Cryptosporidium parvum from in vitro culture as measured by total RNA and subsequent sequence analysis. Mol Biochem Parasitol. 2018;220:5–9. https://doi.org/10.1016/j.molbiopara.2017.12.004.
Xu P, Widmer G, Wang Y, Ozaki LS, Alves JM, Serrano MG, et al. The genome of Cryptosporidium hominis. Nature. 2004;431(7012):1107–12. https://doi.org/10.1038/nature02977.
Ifeonu OO, Chibucos MC, Orvis J, Su Q, Elwin K, Guo F, et al. Annotated draft genome sequences of three species of Cryptosporidium: Cryptosporidium meleagridis isolate UKMEL1, C. baileyi isolate TAMU-09Q1 and C. hominis isolates TU502_2012 and UKH1. Pathog Dis. 2016;74(7). https://doi.org/10.1093/femspd/ftw080.
Gilchrist CA, Cotton JA, Burkey C, Arju T, Gilmartin A, Lin Y, et al. Genetic diversity of Cryptosporidium hominis in a Bangladeshi community as revealed by whole-genome sequencing. J Infect Dis. 2018;218(2):259–64. https://doi.org/10.1093/infdis/jiy121.
Baptista RP, Li Y, Sateriale A, Sanders MJ, Brooks KL, Tracey A, et al. Long-read assembly and comparative evidence-based reanalysis of Cryptosporidium genome sequences reveal expanded transporter repertoire and duplication of entire chromosome ends including subtelomeric regions. Genome Res. 2022;32(1):203–13. https://doi.org/10.1101/gr.275325.121.
Widmer G, Koster PC, Carmena D. Cryptosporidium hominis infections in non-human animal species: revisiting the concept of host specificity. Int J Parasitol. 2020;50(4):253–62. https://doi.org/10.1016/j.ijpara.2020.01.005.
Baptista RP, Xiao R, Li Y, Glenn TC, Kissinger JC. New T2T assembly of Cryptosporidium parvum IOWA annotated with reference genome gene identifiers. bioRxiv. 2023. https://doi.org/10.1101/2023.06.13.544219. This pre-print reports on the first telomer to telomere genomic assembly for C. parvum that contains all 16 telomeres. This genome sequence is annotated using gene IDs from the current reference IOWA genome assembly when possible and extensive annotation of non-coding RNAs.
Penumarthi LR, Baptista RP, Beaudry MS, Glenn TC, Kissinger JC. A new chromosome-level genome assembly and annotation of Cryptosporidium meleagridis bioRxiv. 2024. https://doi.org/10.1101/2024.02.16.580748
Arias-Agudelo LM, Garcia-Montoya G, Cabarcas F, Galvan-Diaz AL, Alzate JF. Comparative genomic analysis of the principal Cryptosporidium species that infect humans. PeerJ. 2020;8:e10478. https://doi.org/10.7717/peerj.10478.
Xu Z, Li N, Guo Y, Feng Y, Xiao L. Comparative genomic analysis of three intestinal species reveals reductions in secreted pathogenesis determinants in bovine-specific and non-pathogenic Cryptosporidium species. Microb Genom. 2020;6(6). https://doi.org/10.1099/mgen.0.000379.
Otto TD, Bohme U, Sanders M, Reid A, Bruske EI, Duffy CW, et al. Long read assemblies of geographically dispersed Plasmodium falciparum isolates reveal highly structured subtelomeres. Wellcome Open Res. 2018;3:52. https://doi.org/10.12688/wellcomeopenres.14571.1.
Dunn MJ, Shazib SUA, Simonton E, Slot JC, Anderson MZ. Architectural groups of a subtelomeric gene family evolve along distinct paths in Candida albicans. G3 (Bethesda). 2022;12(12). https://doi.org/10.1093/g3journal/jkac283.
Fei J, Wu H, Su J, Jin C, Li N, Guo Y, et al. Characterization of MEDLE-1, a protein in early development of Cryptosporidium parvum. Parasit Vectors. 2018;11(1):312. https://doi.org/10.1186/s13071-018-2889-2.
Xu Z, Guo Y, Roellig DM, Feng Y, Xiao L. Comparative analysis reveals conservation in genome organization among intestinal Cryptosporidium species and sequence divergence in potential secreted pathogenesis determinants among major human-infecting species. BMC Genomics. 2019;20(1):406. https://doi.org/10.1186/s12864-019-5788-9.
Baptista RP, Cooper GW, Kissinger JC. Challenges for Cryptosporidium population studies. Genes (Basel). 2021;12(6). https://doi.org/10.3390/genes12060894.
Wang T, Guo Y, Roellig DM, Li N, Santin M, Lombard J, et al. Sympatric recombination in zoonotic Cryptosporidium leads to emergence of populations with modified host preference. Mol Biol Evol. 2022;39(7). https://doi.org/10.1093/molbev/msac150. This comparative genomic analysis reveals how past and ongoing recombination events, especially between parasite strains isolated from humans and animals, have affected the population structure of C. parvum and affected host preference.
Li Y, Baptista RP, Mei X, Kissinger JC. Small and intermediate size structural RNAs in the unicellular parasite Cryptosporidium parvum as revealed by sRNA-seq and comparative genomics. Microb Genom. 2022;8(5). https://doi.org/10.1099/mgen.0.000821.
Li Y, Baptista RP, Sateriale A, Striepen B, Kissinger JC. Analysis of long non-coding RNA in Cryptosporidium parvum reveals significant stage-specific antisense transcription. Front Cell Infect Microbiol. 2020;10:608298. https://doi.org/10.3389/fcimb.2020.608298.
Temesgen TT, Tysnes KR, Robertson LJ. Use of oxidative stress responses to determine the efficacy of inactivation treatments on Cryptosporidium oocysts. Microorganisms. 2021;9(7). https://doi.org/10.3390/microorganisms9071463.
Sun L, Li J, Xie F, Wu S, Shao T, Li X, et al. Whole transcriptome analysis of HCT-8 cells infected by Cryptosporidium parvum. Parasit Vectors. 2022;15(1):441. https://doi.org/10.1186/s13071-022-05565-4. This work examines host gene expression 3 and 12 h post-infection and highlights a number of coding and non-coding gene expression differences.
Greigert V, Saraav I, Son J, Zhu Y, Dayao D, Antia A, et al. Cryptosporidium infection of human small intestinal epithelial cells induces type III interferon and impairs infectivity of Rotavirus. Gut Microbes. 2024;16(1):2297897. https://doi.org/10.1080/19490976.2023.2297897.
Tandel J, Walzer KA, Byerly JH, Pinkston B, Beiting DP, Striepen B. Genetic ablation of a female-specific apetala 2 transcription factor blocks oocyst shedding in Cryptosporidium parvum. mBio. 2023;14(2):e0326122. https://doi.org/10.1128/mbio.03261-22. This work identifies male- and female-specific AP2 transcription factors and develops conditional gene knock-out strategies to study the female AP2. Transcriptomics reavealed a role for the female AP2 in crystalloid protein expression and transmission.
Xia Z, Xu J, Lu E, He W, Deng S, Gong AY, et al. m6A mRNA methylation regulates epithelial innate antimicrobial defense against cryptosporidial infection. Front Immunol. 2021;12:705232. https://doi.org/10.3389/fimmu.2021.705232.
Deng M, Hou T, Mao X, Zhang J, Yang F, Wei Y, et al. Cultivation of host-adapted Cryptosporidium parvum and Cryptosporidium hominis using enteroids for cryopreservation of isolates and transcriptomic studies of infection. bioRxiv: Cold Spring Harbor Laboratory; 2023. https://doi.org/10.1101/2023.12.06.570384. This preprint reports on a new murine enteriod system that works for particular isolates of both of the major human-infecting Cryptosporidium species in which the life cycle can be completed and parasites can be cryopreserved. The system has been assessed with transciptomic analyses.
Hasan M, Mattice E, Teixeira JE, Jumani RS, Stebbins EE, Klopfer C, et al. Cryptosporidium life cycle small molecule probing implicates translational repression and an apetala 2 transcription factor in sexual differentiation. bioRxiv. Cold Spring Harbor Laboratory; 2023.
Sawant M, Benamrouz-Vanneste S, Meloni D, Gantois N, Even G, Guyot K, et al. Putative SET-domain methyltransferases in Cryptosporidium parvum and histone methylation during infection. Virulence. 2022;13(1):1632–50. https://doi.org/10.1080/21505594.2022.2123363.
Vinayak S, Pawlowic MC, Sateriale A, Brooks CF, Studstill CJ, Bar-Peled Y, et al. Genetic modification of the diarrhoeal pathogen Cryptosporidium parvum. Nature. 2015;523(7561):477–80. https://doi.org/10.1038/nature14651.
Sateriale A, Pawlowic M, Vinayak S, Brooks C, Striepen B. Genetic manipulation of Cryptosporidium parvum with CRISPR/Cas9. Methods Mol Biol. 2020;2052:219–28. https://doi.org/10.1007/978-1-4939-9748-0_13.
Sateriale A, Slapeta J, Baptista R, Engiles JB, Gullicksrud JA, Herbert GT, et al. A genetically tractable, natural mouse model of cryptosporidiosis offers insights into host protective immunity. Cell Host Microbe. 2019;26(1):135-46 e5. https://doi.org/10.1016/j.chom.2019.05.006.
Vinayak S. Recent advances in genetic manipulation of Cryptosporidium. Curr Opin Microbiol. 2020;58:146–52. https://doi.org/10.1016/j.mib.2020.09.010.
Pawlowic MC, Somepalli M, Sateriale A, Herbert GT, Gibson AR, Cuny GD, et al. Genetic ablation of purine salvage in Cryptosporidium parvum reveals nucleotide uptake from the host cell. Proc Natl Acad Sci U S A. 2019;116(42):21160–5. https://doi.org/10.1073/pnas.1908239116.
Hanna JC, Corpas-Lopez V, Seizova S, Colon BL, Bacchetti R, Hall GMJ, et al. Mode of action studies confirm on-target engagement of lysyl-tRNA synthetase inhibitor and lead to new selection marker for Cryptosporidium. Front Cell Infect Microbiol. 2023;13:1236814. https://doi.org/10.3389/fcimb.2023.1236814. This work described the development of a new selectable genetic marker for Cryptosporidium genetics studies and reports on its use in a genetic cross in C. parvum.
Yin YL, Liu TL, Yao Q, Wang YX, Wu XM, Wang XT, et al. Circular RNA ciRS-7 affects the propagation of Cryptosporidium parvum in HCT-8 cells by sponging miR-1270 to activate the NF-kappaB signaling pathway. Parasit Vectors. 2021;14(1):238. https://doi.org/10.1186/s13071-021-04739-w.
Pardy RD, Walzer KA, Wallbank BA, Byerly JH, O’Dea KM, Cohn IS, et al. Analysis of intestinal epithelial cell responses to Cryptosporidium highlights the temporal effects of IFN-gamma on parasite restriction. bioRxiv. 2023. https://doi.org/10.1101/2023.11.14.567008.
Castellanos-Gonzalez A, Sadiqova A, Ortega-Mendez J, White AC Jr. RNA-based therapy for Cryptosporidium parvum infection: proof-of-concept studies. Infect Immun. 2022;90(7):e0019622. https://doi.org/10.1128/iai.00196-22.
Khramtsov NV, Woods KM, Nesterenko MV, Dykstra CC, Upton SJ. Virus-like, double-stranded RNAs in the parasitic protozoan Cryptosporidium parvum. Mol Microbiol. 1997;26(2):289–300. https://doi.org/10.1046/j.1365-2958.1997.5721933.x.
Leoni F, Gallimore CI, Green J, McLauchlin J. Characterisation of small double stranded RNA molecule in Cryptosporidium hominis, Cryptosporidium felis and Cryptosporidium meleagridis. Parasitol Int. 2006;55(4):299–306. https://doi.org/10.1016/j.parint.2006.06.006.
Deng S, He W, Gong AY, Li M, Wang Y, Xia Z, et al. Cryptosporidium uses CSpV1 to activate host type I interferon and attenuate antiparasitic defenses. Nat Commun. 2023;14(1):1456. https://doi.org/10.1038/s41467-023-37129-0. This work highlights the role of the type1 IFN receptor in C. parvum Tinfection and demonstrates a role for the Cryptosporidium virus in altering host cell gene expression and triggering a type 1 IFN response in infected cells. This response allows the parasite to evade the epithelial antiparasitic response.
Jex AR, Smith HV, Monis PT, Campbell BE, Gasser RB. Cryptosporidium–biotechnological advances in the detection, diagnosis and analysis of genetic variation. Biotechnol Adv. 2008;26(4):304–17. https://doi.org/10.1016/j.biotechadv.2008.02.003.
Luka G, Samiei E, Tasnim N, Dalili A, Najjaran H, Hoorfar M. Comprehensive review of conventional and state-of-the-art detection methods of Cryptosporidium. J Hazard Mater. 2022;421:126714. https://doi.org/10.1016/j.jhazmat.2021.126714.
Yanta CA, Bessonov K, Robinson G, Troell K, Guy RA. CryptoGenotyper: a new bioinformatics tool for rapid Cryptosporidium identification. Food Waterborne Parasitol. 2021;23:e00115. https://doi.org/10.1016/j.fawpar.2021.e00115.
Ryan U, Paparini A, Oskam C. New technologies for detection of enteric parasites. Trends Parasitol. 2017;33(7):532–46. https://doi.org/10.1016/j.pt.2017.03.005.
Xiao L, Feng Y. Molecular epidemiologic tools for waterborne pathogens Cryptosporidium spp. and Giardia duodenalis. Food Waterborne Parasitol. 2017;8–9:14–32. https://doi.org/10.1016/j.fawpar.2017.09.002.
Alderisio KA, Mergen K, Moessner H, Madison-Antenucci S. Identification and evaluation of Cryptosporidium species from New York City cases of cryptosporidiosis (2015 to 2018): a watershed perspective. Microbiol Spectr. 2023;11(1):e0392122. https://doi.org/10.1128/spectrum.03921-22.
Dean FB, Hosono S, Fang L, Wu X, Faruqi AF, Bray-Ward P, et al. Comprehensive human genome amplification using multiple displacement amplification. Proc Natl Acad Sci U S A. 2002;99(8):5261–6. https://doi.org/10.1073/pnas.082089499.
Hou Y, Wu K, Shi X, Li F, Song L, Wu H, et al. Comparison of variations detection between whole-genome amplification methods used in single-cell resequencing. Gigascience. 2015;4:37. https://doi.org/10.1186/s13742-015-0068-3.
Guo Y, Li N, Lysen C, Frace M, Tang K, Sammons S, et al. Isolation and enrichment of Cryptosporidium DNA and verification of DNA purity for whole-genome sequencing. J Clin Microbiol. 2015;53(2):641–7. https://doi.org/10.1128/JCM.02962-14.
Widmer G, Lee Y. Comparison of single- and multilocus genetic diversity in the protozoan parasites Cryptosporidium parvum and C. hominis. Appl Environ Microbiol. 2010;76(19):6639–44. https://doi.org/10.1128/AEM.01268-10.
Chalmers RM, Perez-Cordon G, Caccio SM, Klotz C, Robertson LJ. participants of the Cryptosporidium genotyping w. Cryptosporidium genotyping in Europe: the current status and processes for a harmonised multi-locus genotyping scheme. Exp Parasitol. 2018;191:25–30. https://doi.org/10.1016/j.exppara.2018.06.004.
Uran-Velasquez J, Alzate JF, Farfan-Garcia AE, Gomez-Duarte OG, Martinez-Rosado LL, Dominguez-Hernandez DD, et al. Multilocus sequence typing helps understand the genetic diversity of Cryptosporidium hominis and Cryptosporidium parvum isolated from Colombian patients. PLoS ONE. 2022;17(7):e0270995. https://doi.org/10.1371/journal.pone.0270995.
de Souza MS, O’Brien C, Santin M, Jenkins M. A highly sensitive method for detecting Cryptosporidium parvum oocysts recovered from source and finished water using RT-PCR directed to Cryspovirus RNA. J Microbiol Methods. 2019;156:77–80. https://doi.org/10.1016/j.mimet.2018.11.022.
Adjou KT, Chevillot A, Lucas P, Blanchard Y, Louifi H, Arab R, et al. First identification of Cryptosporidium parvum virus 1 (CSpV1) in various subtypes of Cryptosporidium parvum from diarrheic calves, lambs and goat kids from France. Vet Res. 2023;54(1):66. https://doi.org/10.1186/s13567-023-01196-4.
Berber E, Simsek E, Canakoglu N, Sursal N, Gencay GA. Newly identified Cryptosporidium parvum virus-1 from newborn calf diarrhoea in Turkey. Transbound Emerg Dis. 2021;68(4):2571–80. https://doi.org/10.1111/tbed.13929.
Chae JB, Shin SU, Kim S, Jo YM, Roh H, Chae H, et al. The first identification of Cryptosporidium parvum virus-1 (CSpV1) in Hanwoo (Bos taurus coreanae) calves in Korea. Vet Sci. 2023;10(11). https://doi.org/10.3390/vetsci10110633.
Dabrowska J, Sroka J, Cencek T. Investigating Cryptosporidium spp. using genomic, proteomic and transcriptomic techniques: current progress and future directions. Int J Mol Sci. 2023;24(16). https://doi.org/10.3390/ijms241612867. This review is a detailed history with timelines of omics research in Cryptosporidium.
Sayers EW, Cavanaugh M, Clark K, Pruitt KD, Sherry ST, Yankie L, et al. GenBank 2024 update. Nucleic Acids Res. 2024;52(D1):D134–7. https://doi.org/10.1093/nar/gkad903.
Warrenfeltz S, Kissinger JC, EuPath DBT. Accessing Cryptosporidium omic and isolate data via CryptoDB.org. Methods Mol Biol. 2020;2052:139–92. https://doi.org/10.1007/978-1-4939-9748-0_10. This work describes a database where existing Cryptosporidium omic data can be searched, visualized, and downloaded.
Acknowledgements
The authors would like to thank R.P. Baptista for useful discussions and L.R. Penumarthi assistance with Table 1.
Funding
This work was funded in part by National Institute for Allergy and Infectious Diseases, NIAID, R01 AI148667 to Travis Glenn and J.C.K, and an NIH T32GM142623 to F.A-D.
Ethics declarations
Human and Animal Rights and Informed Consent
This article does not contain any studies with human or animal subjects performed by any of the authors.
Competing interests
JCK owns PacBio stock.
Disclaimer
The funders played no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Topical Collection on Giardia/Crypto
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Agyabeng-Dadzie, F., Xiao, R. & Kissinger, J.C. Cryptosporidium Genomics — Current Understanding, Advances, and Applications. Curr Trop Med Rep 11, 92–103 (2024). https://doi.org/10.1007/s40475-024-00318-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40475-024-00318-y