The updated genome of the Hungarian population of Aedes koreicus

Nagy, Nikoletta Andrea; Tóth, Gábor Endre; Kurucz, Kornélia; Kemenesi, Gábor; Laczkó, Levente

doi:10.1038/s41598-024-58096-6

The updated genome of the Hungarian population of Aedes koreicus

Article
Open access
Published: 30 March 2024

Volume 14, article number 7545, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

The updated genome of the Hungarian population of Aedes koreicus

Download PDF

Nikoletta Andrea Nagy^1,2,3,
Gábor Endre Tóth^4,5,
Kornélia Kurucz^4,6,
Gábor Kemenesi^4,6 &
…
Levente Laczkó^7,8

779 Accesses
7 Altmetric
Explore all metrics

Abstract

Vector-borne diseases pose a potential risk to human and animal welfare, and understanding their spread requires genomic resources. The mosquito Aedes koreicus is an emerging vector that has been introduced into Europe more than 15 years ago but only a low quality, fragmented genome was available. In this study, we carried out additional sequencing and assembled and characterized the genome of the species to provide a background for understanding its evolution and biology. The updated genome was 1.1 Gbp long and consisted of 6099 contigs with an N50 value of 329,610 bp and a BUSCO score of 84%. We identified 22,580 genes that could be functionally annotated and paid particular attention to the identification of potential insecticide resistance genes. The assessment of the orthology of the genes indicates a high turnover at the terminal branches of the species tree of mosquitoes with complete genomes, which could contribute to the adaptation and evolutionary success of the species. These results could form the basis for numerous downstream analyzes to develop targets for the control of mosquito populations.

Improved reference genome of Aedes aegypti informs arbovirus vector control

Article Open access 14 November 2018

Mapping the virome in wild-caught Aedes aegypti from Cairns and Bangkok

Article Open access 16 March 2018

De novo genome assembly of the invasive mosquito species Aedes japonicus and Aedes koreicus

Article Open access 20 November 2023

Introduction

Every year, vector-borne diseases are responsible for more than 700,000 deaths and account for more than 17% of all infectious diseases¹. Vector-borne diseases (VBDs) pose a significant threat to global health, putting more than 80% of the world’s population at potential risk. Among these diseases, mosquito-borne diseases (MBDs) are the most important and significant contributor to this burden². For many mosquito-borne diseases, an increase in incidence and geographical spread can be observed. A prime example is dengue fever, whose global incidence has increased 30-fold in the last five decades and which occurs in previously unaffected countries^3,4,5. This phenomenon leads to the emergence of diseases in previously unaffected regions and their re-emergence in areas where they were previously eradicated. This process is largely driven by anthropogenic effects (globalization, deforestation, overpopulation, land use, etc.) and has a strong impact on MBDs^6,7,8.

The introduction of mosquitoes from Asia has attracted attention in Europe⁹. In recent decades, numerous invasive mosquito species of the genus Aedes (albopictus, japonicus) have been introduced and have become successfully established. This has led to considerable distress to the population since they can act as potential vectors for exotic and native pathogens¹⁰. The first detection of Aedes koreicus, a potential vector of arboviruses¹¹, outside its native range was in 2008 in an industrial area in Maasmechelen, Belgium, where this mosquito species is now firmly established and can overwinter¹². Despite its continuous presence, Ae. koreicus has managed to colonize the surrounding areas to a limited extent¹³. In 2011, this species was found in north-eastern Italy, more precisely in the province of Belluno in the Veneto region, and rapidly expanded its range over the following ten years. It infested neighboring provinces and spread to more distant regions in northern Italy^14,15,16,17. A revision of Aedes japonicus specimens collected in Slovenia confirmed the introduction of Ae. koreicus in 2013¹⁸. In 2015, the species appeared in southern Germany¹⁹. At the same time, Ae. koreicus was found in southwestern Hungary, where it established an overwintering but localized population^20,21. More recently, Ae. koreicus has been detected in western Austria²², on the southern coast of the Crimean peninsula²³ and in the Republic of Kazakhstan²⁴. The literature indicates that, Ae. koreicus is a novel vector on the European continent²⁵. In the field of mosquito invasion genomics, a fundamental and widely pursued goal is to elucidate the origins of invasive populations. Given their high propensity for invasion and associated disease risks, Aedes mosquitoes in particular have received much attention^26,27.

The application of whole genome sequencing in mosquitoes has provided invaluable insights into fundamental biological processes at the molecular level and improved our understanding of their intricate mechanisms. Furthermore, this approach holds great promise for use in mosquito control strategies and the prevention of mosquito-borne disease transmission^28,29. In the context of mosquito control, insecticide resistance is a major global challenge and an example of an extreme manifestation of adaptive evolution driven by human activities. Resistance in mosquitoes is often detected by bioassays or targeted sequencing methods³⁰. An important application of population genomics throughout the development of the field has been understanding the evolution of insecticide resistance.

Invasive Aedes species can impact local ecosystems³¹. Genomic resources also help assess the impact of these invasive species. The increasing introduction of invasive species can pose a major challenge to local ecosystem functions³². Using genomic approaches, we can provide markers to monitor population genomes and invasion processes to assess whether invasiveness can be predicted from genome sequences^32,33 to mitigate the impact of invasive species on ecosystems while reducing the likelihood of the spread of vector-borne diseases.

Given the spread of Ae. koreicus described above, accurate genomic characterization of the species could be important for numerous applications. In this study, we describe the improved genome sequence of Aedes koreicus (NCBI Assembly: GCA_024533555.1), focusing on the Hungarian population, that we assembled using the publicly available data (Sequence Read Archive: SRR14975285, SRR14975286) of Kurucz et al. (2022)²⁵ supplemented with newly generated third-generation sequencing data. In addition to improving the assembly, we annotated the genome with a particular focus on genes that may be involved in insecticide resistance.

Results and discussion

In this study, we used a hybrid genome assembly approach and combined Illumina short-read with Oxford Nanopore long-read sequencing data to reconstruct the high-quality draft genome of Aedes koreicus. Of the 87.31Gbp raw short-read sequencing data, 59.71 Gbp (489,687,726 reads) passed quality filtering with an average read length of 132 bp. A total of 8,196,976 reads with a base count of 37.90 Gbp were retained in the quality filtering of the long-read sequencing libraries. The read N50 values of the individual libraries ranged from 5923 to 7017 (mean = 6415 bp), with all libraries having an average read quality of 14.4. The three long-read libraries had a total throughput of 7.15 to 18.5 Gbp (mean = 12.6 Gbp) and yielded a total of 37.9 Gbp of sequencing data.

Based on the 21-mer frequency of the short reads, GenomeScope2 estimated the genome size to be 884.23 Mbp with a unique k-mer frequency of 54.8% and a relatively high heteroziygosity rate (1.86%), regardless of the k-mer coverage threshold (Fig. 1). In contrast, CovEst estimated the genome size to be 1.49 Gbp. This 1.6-fold difference is most likely due to the different approaches of the two tools. GenomeScope2 accounts for variation in coverage by fitting negative binomials to the k-mer coverage histogram, but does not fit binomials for regions that occur more than twice. CovEst assumes uniform coverage and estimates genome size by dividing the total amount of sequencing data by the observed coverage. The contrast in estimated genome size is most likely due to pooling of individuals to obtain enough DNA for genome sequencing and/or sequencing of a highly repetitive genome (as confirmed by masking of repeats in the assembly). Given the assembly size (see below), which was filtered multiple times to remove duplicated contigs, the true genome size of the species should be between the two estimates and given the BUSCO score, closer to the result of CovEst.

The mitochondrial genome appeared to be circular and 15,851 bps long, with a structure characteristic of the Culicidae family (Fig. 2A). The order and orientation of the genes of Ae. koreicus, Aedes japonicus, Aedes aegypti and Aedes albopictus matched perfectly. The result of skmer clustered Ae. koreicus together with Ae. japonicus as the sister clade of Ae. aegypti and Ae. albopictus (Fig. 2B). Before nuclear genome assembly, we excluded 0.26% of the short reads and 0.18% of the long reads for being mitochondrial; thus, 488,392,324 short reads and 8,182,409 long reads were used for nuclear genome assembly.

The final assembly of the nuclear genome consisted of 6099 contigs with a total length of 1.10 Gbp, excluding all contigs flagged as contaminants (0.25% of the assembly). Compared to the previous version of the species’ genome (GCA_024533555.1), the number of contigs was reduced by one tenth, the N50 value increased from 18,623 bp to 329,610 bp and the L50 value decreased from 12,967 to 896 (Table 1 and Supplementary Table 1). The GC content of the new assembly (39.67%) remained unchanged compared to the previous version of the genome (39.7%). The ratio of complete BUSCOs increased by 10.4% and the ratio of missing BUSCOs decreased by 4.5% (Table 1). At the same time, we were able to increase the proportion of single-copy BUSCOs by 0.2%, suggesting several newly identified BUSCOs are duplicated in the genome (Fig. 3). Usually, such a high proportion of duplicates indicates duplicated contigs in the assembly, but given the multiple approaches we used to exclude false duplicates (i.e., pseudohaploid and redundans with multiple thresholds), these duplicates might actually exist in the species’ genome. However, the high proportion of duplicated BUSCO genes is not exceptional within the genus, e.g. 39.6% in the representative genome assembly of Ae. albopictus (Fig. 3), and gene duplications can be frequently observed in Aedes (e.g. Waterhouse et al. 2008³⁴). The other recently published analysis of the complete genome³⁵ reported an N50 value of 190,716 bp with an assembly size of 1.24 Gbp, consisting of 21,315 scaffolds and a BUSCO score of 91% (Duplicate = 8%, Missing = 2%). The assembly presented here shows better contiguity, but a somewhat lower BUSCO score. The relatively high duplication ratio again indicates that numerous valid gene duplicates can be found in the genome of the species. The differences between the two assemblies could be due to the different heterogeneity of the pooled DNA isolates, the different assembly approach including the filtering of duplicate contigs and the not yet well characterized variability of the species.

Table 1 Comparison of the contiguity and completeness of the publicly available and the newly assembled genome of Aedes koreicus.

Full size table

We soft-masked 60.62% of the genome as repetitive prior to gene prediction (1,256,253 identified repetitive regions with an average length of 530.85 bp), which is close to our initial estimate of repeat ratio based on k-mer frequencies (Fig. 1) and approximately 10% lower than reported for the species genome³⁵. The ab initio gene prediction identified 28,154 potential protein-coding genes, whereas the homology-based method identified 43,226 genes. Merging these two sets of putative CDSes resulted in 47,796 unique amino acid sequences, of which 22,580 could be functionally annotated (47%). In addition, we identified 86 rRNA and 791 tRNA sequences in the final assembly. This gene count is comparable to the number of protein-coding genes in the publicly available genomes of Culicidae (mean number of genes in the proteomes used for phylogenomic reconstruction: 23,183). 42.3% of the functionally annotated genes were responsible for biological processes (BP), 26.8% contributed to cellular components (CC) and 30.8% were assigned to the GO term molecular function (MF) (Fig. 4A). In the BP category, DNA biosynthetic process, proteolysis, DNA metabolic process, phosphorylation and transmembrane transport were the most frequently occurring functions. Most genes with the GO term CC received the free text annotation membrane, nucleus, cytoplasm, plasma membrane and extracellular regions. The most frequently occurring molecular functions were nucleic acid binding, metal ion binding, ATP binding, zinc ion binding and RNA binding. We did not detect any strikingly overrepresented features in the frequency of the 50 most abundant gene functions (Fig. 4B). At the same time, we identified 218 genes involved in odorant binding, which not only highlights the importance of odorants in the feeding of Ae. koreicus, but also provides potential targets for the control of this vector species. These targets should be investigated with a larger sample size to validate their structure and function and potentially suggest specific repellent molecules (see: Yan et al. (2022)³⁶ and Tiwari and Sowdhamini (2023)³⁷). Out of 27 potential insecticide resistance genes^35,38, we were able to identify the homolog of aael012918, ace1, cyp6bb2, ABCA3, cuticle protein, cuticle protein CP14.6, cyp9j26, cyp9j28, cyp9j32, ketohexokinase, modifier of mdg4, muscle calcium channel subunit alpha-1, nav, potassium voltage-gated channel protein Shaker, rdl, sodium leak channel non-selective protein and uncharacterized LOC5575776 (Supplementary Table 2). The cuticular protein was found in three copies in the genome of Ae. aegypti and in two copies in the genome of Ae. koreicus with free-text annotation “Cuticular protein 73” and “Pupal cuticle protein 78e”. Ketohexokinase could be annotated after ab initio gene prediction as “Phospholipid/glycerol acyltransferase domain-containing protein”. LOC5575776 appeared to be chitin synthase that was also described by Catapano et al. (2023)³⁵ as a potential resistance gene. The cytochrome cyp9j26 was identified as a potential coding sequence by structural annotation, but no function could be identified using PANNZER (Supplementary Table 2). The remaining putative resistance genes, including gstd4, gstd6, multidrug resistance-associated protein 1 and multiple cytochromes, could not be identified, suggesting either low sequence similarity of the target sequences and/or that other mechanisms are involved in insecticide resistance in Ae. koreicus. In addition, in the functional annotation returned by PANNZER, we identified eight genes with the free text annotation “Deltamethrin resistance-associated NYD-OP7” and one with “Deltamethrin resistance protein prag01 domain-containing protein”, already described in Culex pipiens³⁹. The genes labelled glutathione S-transferase (number of copies = 11) involved in insecticide resistance in several mosquito species⁴⁰ all appeared to be putative, with the exception of delta-glutathione S-transferase (GST). Of the six copies of genes coding voltage-gated sodium channels, four appeared to be fragmented, and of the 11 copies of acetylcholinesterase, five were fragmented as indicated ‘(Fragment)’ in the gene annotation. Multidrug resistance-associated protein 1, which could not be identified by annotation transfer, could be found in three copies. We also discovered two copies of multidrug resistance-associated protein 7 and three copies of multidrug resistance-associated protein 9. Similarly to the odorant binding system, these potential target sequences should be investigated with a larger sample size and under experimental conditions to identify other genes involved in the insecticide resistance mechanisms of the species, e.g. with RNASeq.

OrthoFinder clustered all genes into 20,021 orthogroups, of which 4725 were contained in all species and 61 were single-copy orthologs. Of the 17,848 species-specific orthogroups, 403 (1134 genes) were specific to Ae. koreicus. The rooted species tree (Fig. 5) identified D. melanogaster as an outgroup and separated all Anopheles species from the genera Aedes, Wyeomyia, Sabethes, Toxorhynchites, Culex and Uranotaenia. Although the structure of the species tree was consistent with the phylogenetic reconstruction of Zadra et al. (2021)⁴¹ and Catapano et al. (2023)³⁵, multiple branches received low support values, in particular the separation of Culex sp. and the separation of Aedes and its sister group consisting of Wyeomyia, Sabethes and Toxorhynchites. The structure within Aedes resembled the neighbor-joining phylogram reconstructed using pairwise mitochondrial (with Ae. japonicus missing from the phylogenetic reconstruction of nuclear genes due to the lack of genomic resources) distances, grouping Ae. albopictus and Ae. aegypti together and placing Ae. koreicus as sister to these species. The low support values indicate gene tree incongruence, which may result from ancient hybridization events that retain the correct topology of the phylogenetic tree but decrease phylogenetic support values⁴². As multiple phylogenetic reconstructions^35,41 support a very similar phylogenetic hypothesis, a possible explanation for the low support could be that ancient hybridization played an important role in the speciation of the species group. At the same time, gene duplications were much more frequent in terminal branches (453–22,920, mean = 8852.64) than in internal branches (110–4281, mean = 1149) (Fig. 5), suggesting high gene turnover⁴³, which can play a role in the adaptive strategy and evolutionary success of mosquito species. Genome size appeared to be much less variable in Anopheles than in the rest of the samples, and the genome size of Ae. albopictus was by far the largest, followed by Ae. aegypti and Ae. koreicus. The larger (Aedes) genomes contained a higher number of genes, resulting in a similar CDS density within the accessions of the genus (Fig. 5).

A single genome can hardly represent the entire variability of a species^44,45. Multiple genome assemblies of the same species can be important to understand the unique aspects of the species’ biology⁴⁶, including genome plasticity, identification of marker genes, and application of comparative genomic methods. In this particular case, the development of control strategies against this invasive species could benefit from multiple genomic resources that provide the opportunity to account for the variability of multiple genomes. Our results are consistent and comparable with another study conducted in parallel by Catapano et al. (2023)³⁵. High-quality assemblies from different populations improve future genomic work on the global invasive populations of a species and facilitate the study of structural variation that may exist between different populations⁴⁷. Furthermore, the question of whether invasiveness can be predicted by knowledge of genome variability³² can only be answered if we have multiple genomic resources at our disposal. Aedes koreicus is considered a vector on the rise²⁵ with a continuous spread across Europe¹³. Therefore, future studies on this species should be conducted with international collaboration and shared resources to investigate the biology of this species and provide greater benefit to the scientific community.

In our study, we updated the first version of the Aedes koreicus genome assembly, aiming to create a well-characterized genome of the Hungarian populations that can be used as a resource for future studies on the diversity of genome structure and content of the species. Such genomic resources help to assess the impact and potential threat of invasive mosquito species and help designing specific control strategies³⁵. The functional annotation presented here corroborates the presence of the majority of potential resistance genes in the genome reported previously³⁵ and presents potential targets for the control of Ae. koreicus that could be evaluated experimentally.

Methods

Sample collection and genome sequencing

We collected mosquito larvae from stagnant waters, in the framework of a regular monitoring program run in urban and suburban areas of the city of Pécs (Hungary). Larvae were reared to adult stage in the laboratory and adult specimens were identified at the species level under a Nikon SMZ800N stereomicroscope (Minato, Japan) using morphological identification keys^48,49,50. Four adult male Aedes koreicus specimens were captured on 23.06.2022 and pooled together. Nucleic acid was extracted from this pool using the DNeasy Blood & Tissue Kit (Qiagen, Germany) by following the manufacturer’s recommendations. We prepared two sequencing libraries using the Oxford Nanopore Sequencing Kit SQK-LSK110 (Oxford Nanopore, UK) and NEBNext FFPE Repair and Ultra II End Prep (New England Biolabs, USA) according to the protocol provided by the Nanopore Community. We used AMPure XP (Beckman Coulter, USA) magnetic beads for all purification steps and quantified the libraries using a Qubit Fluorometer v4 (Invitrogen, USA). Either the Small Fragment Buffer (SFB) or the Large Fragment Buffer (LFB) was used for size selection (the only difference between the libraries). Sequencing was performed with an Oxford Nanopore MinION MK1C (Oxford Nanopore, UK) sequencer using Flow Cells R9.4.1 (Oxford Nanopore, UK).

In addition to the sequencing libraries described above, we used the raw short (SRR14975286) and long (SRR14975285) sequencing datasets of the published genome²⁵ to reconstruct the mitochondrial and nuclear genome of Ae. koreicus. The quality of Illumina short reads was checked using FastQC 0.11.9, and then adapters and low-quality bases were trimmed using fastp 0.20.1⁵¹. The parameters of fastp were set to trim sequences at both the 3’ and 5’ ends with a mean quality score of less than 15 using the default sliding window size and discard reads shorter than 90 bp (–cut_front 15 –cut_tail 15 –length_required 90). In addition, we turned on adapter detection for paired-end reads and enabled the polyX trimming at the 3’ ends (–detect_adapter_for_pe –trim_poly_x). To decrease the error rate, we corrected the sequencing errors using the k-mer frequency spectrum with Bloocoo 1.0.6⁵².

To achieve the highest possible read accuracy, we re-basecalled the raw reads used in Kurucz et al. (2022)²⁵ with the same version of Guppy 6.5.7 (Oxford Nanopore Technologies, Oxford, UK) as for the newly generated data using the super-high accuracy basecall model. The quality of the long reads was checked using the R script MinIONQC 1.4.2⁵³. We filtered and evaluated the quality of the long read sequences with NanoFilt 2.8.0⁵⁴ and NanoPlot 1.40.0⁵⁴.

We analyzed the 21-mer frequency spectrum in the filtered short-read dataset using KMC 3.1.1⁵⁵ with the following parameters: minimum occurrence 1 (-ci1) and maximum frequency 10,000 (-cs10000). We used Genomescope 2.0⁵⁶ to analyze the resulting k-mer histogram and estimate the genome size, k-mer coverage, heterozygosity and error rate of the sequencing data with different upper bounds of k-mer coverage (1000, 10,000, 100,000). In addition, we used CovEst 0.5.6⁵⁷ assuming a repeat-rich genome (-m repeat) with the same k-mer histogram as input to confirm the estimated genome size.

Mitochondrial genome assembly

Mitochondrial sequences are usually overrepresented in sequencing experiments^58,59 and nuclear mitochondrial DNA segments (NUMT) are potentially present in the nuclear genome. Therefore, we first assembled the mitochondrial genome and used this assembly to exclude mitochondrial reads from the dataset to reduce the number of misassemblies in the nuclear genome and increase its contiguity⁶⁰. We used the publicly available mitochondrial genome (GenBank accession number: NC_046946.1) as a reference for mapping short and long reads with BWA 0.7.17-r1188⁶¹ and Minimap2 2.17-r941⁶², respectively. In the case of short reads, reads with both ends were extracted with samtools 1.15.1⁶³.

We performed mitochondrial de novo assembly using two software: GetOrganelle 1.7.6.1⁶⁴ for short reads and Flye 2.9-b1768⁶⁵ for long reads. The maximum number of extension rounds of GetOrganelle was set to 30 and the organelle type was set to animal mitochondrion (-R 30 -F animal_mt). In the case of Flye, we used an estimated genome size of 16,000 (assessed by the length of publicly available mitochondria of the genus Aedes) and set the coverage to 300 (-g 16 k –asm-coverage 300) to randomly resample the dataset and decrease the computational time required for the analysis. The two mitochondrial sequences were merged using quickmerge 0.3⁶⁶. Sequence polishing consisted of three steps: we ran Racon 1.4.10⁶⁷ and then medaka 1.7.2⁶⁸ with the r941_min_sup_g507 model to create a more accurate consensus sequence of long reads; then we ran Pilon 1.23⁶⁹ to correct SNPs and short indel variations using the alignment of the short read sequences.

We aligned both sequencing datasets to the polished mitochondrial genome and visualized the alignments using Integrative Genomics Viewer 2.16.0⁷⁰ to ensure that there were no spurious segmental duplications in the assembly. Corrections to the consensus sequence were made manually in AliView 1.28⁷¹. We performed the functional annotation of the mitochondrion on the MITOS2 web server (http://mitos2.bioinf.uni-leipzig.de/index.py last accessed: June 9, 2023;⁷²) and then visualized the genome with Proksee⁷³. We used Clinker 0.0.27⁷⁴ to assess whether there are structural variations in the mitochondrion using Aedes japonicus (OR668894.1), Aedes albopictus (NC_006817.1) and Aedes aegypti (NC_035159.1) as reference taxa. The same four assemblies were used as input for skmer 3.3.0⁷⁵ and the Jukes-Cantor-transformed genetic distances were visualized as a neighbor-joining phylogram using the pegas 1.2⁷⁶ R 4.2.2⁷⁷ package.

Nuclear genome assembly

For the assembly of the nuclear genome, we first excluded reads that could be mapped to the mitochondrial genome. We excluded long reads with an alignment block length of at least 95% of their length and flagged all short reads with both ends mapped to the mitochondrial assembly as mitochondrial. Alignments were created using minimap2 in the same way as described above for the initial identification of mitochondrial reads. For the primary assembly, we used two different approaches: long reads were assembled using nextDenovo 2.5.0⁷⁸, and the hybrid assembly with long and short reads was performed using MaSuRCA 4.0.5⁷⁹. Both primary genome assemblies were polished following the same steps as for the mitochondrial sequence (see above). We checked the contiguity and completeness of the genomes with QUAST 5.0.2⁸⁰ and BUSCO 5.2.2⁸¹ using the BUSCO gene set of the Diptera lineage from the Ortholog Database v10 (https://www.orthodb.org/).

Before merging the assemblies, we polished the assemblies again and then ran quickmerge 0.3 with the MaSuRCA assembly as hybrid and the nextDenovo assembly as self-assembly. Genome assembly of pooled samples may accumulate a high ratio of duplications; therefore, we removed false duplications in the polished sequences using create_pseudohaploid.sh (https://github.com/schatzlab/pseudohaploid/tree/master). Since we still found a relatively high ratio of duplications according to the results of BUSCO, we also ran redundans 0.13c⁸² with the assembly already curated with pseudohaploid as input. We ran redundans with different identity values (–identity 0.6, 0.65, 0.7, 0.75, 0.8, 0.85, 0.9, 0.95, 1.0) and overlap (–overlap 0.8, 0.85, 0.9, 0.95, 1.0) without scaffolding and gapclosing (–noscaffolding –nogapclosing) and chose the best parameters based on BUSCO results. We polished the reduced assembly again using the same approach as above, then identified possible contaminants with Bertax 0.1⁸³ and excluded all sequences that were not classified as Arthropoda. The contiguity and gene completeness of the decontaminated assembly were checked again using QUAST and BUSCO.

Gene prediction, functional annotation and phylogenetic reconstruction

We masked all repeat sequences, including tandem repeats and transposable elements in the genome with Red 2.0⁸⁴ before gene prediction. We predicted transfer RNA (tRNA) and ribosomal RNA (rRNA) sequences using ARAGORN 1.2.38⁸⁵ and Barrnap 0.9 (https://github.com/tseemann/barrnap), respectively. We predicted the sequence, location and structure of protein-coding genes in the soft-masked genome by combining ab initio and homology-based methods as implemented in the BRAKER 3.0.2 pipeline⁸⁶. We carried out ab initio prediction with Augustus 3.5.0⁸⁷. For homology-based gene prediction, we used arthropoda_odb11 to generate hints with ProtHint 2.6.0⁸⁸, which were then used by GeneMark-EP 4.71_lic⁸⁸ to generate the training gene set for Augustus. We performed homology-based prediction in two iterations and clustered coding sequences (CDS) to have the same protein product using CD-HIT 4.7⁸⁹ with the following parameters: -c 1 -G 0 -aL 1.0 -aS 1.0 and then used the PANNZER2 [⁹⁰ web server (http://ekhidna2.biocenter.helsinki.fi/sanspanz/; last accessed June 16, 2023) to functionally annotate the predicted genes, restricting the GO classes to arthropods.

To identify potential resistance genes, we transferred the annotations of the publicly available genome of Aedes albopictus (GCF_006496715.2) and checked if the target gene is present in the genome annotation of the de novo assembled genome of Aedes koreicus. We ran liftoff 1.6.3⁹¹ with the default settings, using the whole genome sequence and genome annotation of Ae. albopictus as the reference and the updated Ae. koreicus assembly as the target. Then, we searched for potential insecticide resistace genes (Supplementary Table 2) of Ae. koreicus in the transferred annotations. We used the targets reported by Djiappi-Tchamen et al. (2023)³⁸, which were found in Ae. albopictus and Ae. aegypti, and the targets reported by Catapano et al. (2023)³⁵, which are specific to Ae. koreicus. To verify the presence of genes, we used bedtools intersect 2.31.0⁹² to check whether the functional annotation of genomic regions with positive hits returned by PANNZER matched the function identified by annotation transfer.

We then searched for the orthologs of the functionally annotated genes of Aedes koreicus in other 23 species of the family Culicidae and used Drosophila melanogaster as an outgroup. We identified orthogroups and performed phylogenomic reconstruction of the species with OrthoFinder 2.5.5⁹³ using the default settings. We used all accessions of Culicidae with available genome annotation in the NCBI genome database as of June 16, 2023 (Table 2).

Table 2 Species used for ortholog finding and phylogenetic analysis.

Full size table

Data availability

We deposited all newly generated data described in this study in the NCBI database under BioProject PRJNA728830. The raw data belonging to BioSample SAMN19104546 can be found in the Sequence Read Archive (SRA) database under accessions SRR27118085 and SRR27118086, whereas the updated genome assembly can be found in the Assembly database under accession GCA_024533555.2. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession JAHHFK000000000. The version described in this paper is version JAHHFK020000000. The analysis schema is available as a supplementary file to this paper (Supplementary File 1). The structural and functional annotations of the assembly as well as the genome version presented in this study are made public in the Zenodo data repository under https://doi.org/10.5281/zenodo.10278300.

References

Vector-borne diseases https://www.who.int/news-room/fact-sheets/detail/vector-borne-diseases
Franklinos, L. H. V., Jones, K. E., Redding, D. W. & Abubakar, I. The effect of global change on mosquito-borne disease. Lancet. Infect. Dis. 19, e302–e312 (2019).
Article PubMed Google Scholar
Bhatt, S. et al. The global distribution and burden of dengue. Nature 496, 504–507 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Gubler, D. J. Dengue, urbanization and globalization: The unholy trinity of the 21st century. Trop. Med. Health 39, S3–S11 (2011).
Article Google Scholar
Stanaway, J. D. et al. The global burden of dengue: An analysis from the Global Burden of Disease Study 2013. Lancet. Infect. Dis. 16, 712–723 (2016).
Article PubMed PubMed Central Google Scholar
Whitmee, S. et al. Safeguarding human health in the Anthropocene epoch: Report of The Rockefeller Foundation on planetary health. Lancet 386, 1973–2028 (2015).
Article PubMed Google Scholar
Gottdenker, N. L., Streicker, D. G., Faust, C. L. & Carroll, C. R. Anthropogenic land use change and infectious diseases: A review of the evidence. EcoHealth 11, 619–632 (2014).
Article PubMed Google Scholar
Franklinos, L. H. V., Jones, K. E., Redding, D. W. & Abubakar, I. The effect of global change on mosquito-borne disease. Lancet Infect. Dis. 19, e302–e312 (2019).
Article PubMed Google Scholar
Schaffner, F. et al. Development of guidelines for the surveillance of invasive mosquitoes in Europe. Parasit Vectors 6, 209 (2013).
Article PubMed PubMed Central Google Scholar
Martinet, J.-P., Ferté, H., Failloux, A.-B., Schaffner, F. & Depaquit, J. Mosquitoes of North-Western Europe as potential vectors of arboviruses: A review. Viruses 11, 1059 (2019).
Article PubMed PubMed Central Google Scholar
Montarsi, F. et al. First report of the blood-feeding pattern in Aedes koreicus, a new invasive species in Europe. Sci. Rep. 12, 15751 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Versteirt, V. et al. Bionomics of the established exotic mosquito species Aedes koreicus in Belgium, Europe. J. Med. Entomol. 49, 1226–1232 (2012).
Article CAS PubMed Google Scholar
Deblauwe, I. et al. From a long-distance threat to the invasion front: A review of the invasive Aedes mosquito species in Belgium between 2007 and 2020. Parasit Vectors 15, 206 (2022).
Article PubMed PubMed Central Google Scholar
Montarsi, F. et al. Current distribution of the invasive mosquito species, Aedes koreicus [Hulecoeteomyia koreica] in northern Italy. Parasit Vectors 8, 614 (2015).
Article PubMed PubMed Central Google Scholar
Ballardini, M. et al. First report of the invasive mosquito Aedes koreicus (Diptera: Culicidae) and of its establishment in Liguria, northwest Italy. Parasit Vectors 12, 334 (2019).
Article PubMed PubMed Central Google Scholar
Negri, A. et al. Evidence for the spread of the alien species Aedes koreicus in the Lombardy region, Italy. Parasit Vectors 14, 534 (2021).
Article PubMed PubMed Central Google Scholar
Gradoni, F. et al. Geographical data on the occurrence and spreading of invasive Aedes mosquito species in Northeast Italy. Data Brief 36, 107047 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kalan, K., Šušnjar, J., Ivović, V. & Buzan, E. First record of Aedes koreicus (Diptera, Culicidae) in Slovenia. Parasitol. Res 116, 2355–2358 (2017).
Article PubMed Google Scholar
Werner, D., Zielke, D. E. & Kampen, H. First record of Aedes koreicus (Diptera: Culicidae) in Germany. Parasitol. Res. 115, 1331–1334 (2016).
Article PubMed Google Scholar
Kurucz, K. et al. Emergence of Aedes koreicus (Diptera: Culicidae) in an urban area, Hungary, 2016. Parasitol. Res. 115, 4687–4689 (2016).
Article PubMed Google Scholar
Kurucz, K., Manica, M., Delucchi, L., Kemenesi, G. & Marini, G. Dynamics and distribution of the invasive mosquito Aedes koreicus in a temperate European City. Int. J. Environ. Res. Public Health 17, 2728 (2020).
Article PubMed PubMed Central Google Scholar
Fuehrer, H.-P. et al. Monitoring of alien mosquitoes in Western Austria (Tyrol, Austria, 2018). PLoS Neglect. Trop. Dis. 14, e0008433 (2020).
Article Google Scholar
Ganushkina, L., Lukashev, A., Patraman, I., Razumeyko, V. & Shaikevich, E. Detection of the invasive mosquito species Aedes (Stegomyia) aegypti and Aedes (Hulecoeteomyia) koreicus on the Southern Coast of the Crimean Peninsula. J. Arthropod-Borne Dis. 14, 270–276 (2020).
PubMed PubMed Central Google Scholar
Andreeva, Y. V. et al. First record of the invasive mosquito species Aedes koreicus (Diptera, Culicidae) in the Republic of Kazakhstan. Parasite 28, 52 (2021).
Article PubMed PubMed Central Google Scholar
Kurucz, K. et al. Aedes koreicus, a vector on the rise: Pan-European genetic patterns, mitochondrial and draft genome sequencing. PLOS ONE 17, e0269880 (2022).
Article CAS PubMed PubMed Central Google Scholar
Sherpa, S. et al. Unravelling the invasion history of the Asian tiger mosquito in Europe. Mol. Ecol. 28, 2360–2377 (2019).
Article PubMed Google Scholar
Kotsakiozi, P. et al. Population genomics of the Asian tiger mosquito, Aedes Albopictus : Insights into the recent worldwide invasion. Ecol. Evol. 7, 10143–10157 (2017).
Article PubMed PubMed Central Google Scholar
Land, K. M. The mosquito genome: Perspectives and possibilities. Trends Parasitol. 19, 103–105 (2003).
Article CAS PubMed Google Scholar
Schmidt, T. L., Endersby-Harshman, N. M. & Hoffmann, A. A. Improving mosquito control strategies with population genomics. Trends Parasitol. 37, 907–921 (2021).
Article CAS PubMed Google Scholar
Richards, S. L., Byrd, B. D., Reiskind, M. H. & White, A. V. Assessing insecticide resistance in adult mosquitoes: Perspectives on current methods. Environ. Health Insights 14, 117863022095279 (2020).
Article Google Scholar
Juliano, S. A. & Philip Lounibos, L. Ecology of invasive mosquitoes: Effects on resident species and on human health: Invasive mosquitoes. Ecol. Lett. 8, 558–574 (2005).
Article PubMed PubMed Central Google Scholar
Blaxter, M. et al. Why sequence all eukaryotes?. Proc. Natl. Acad. Sci. 119, e2115636118 (2022).
Article PubMed PubMed Central Google Scholar
Huang, C. et al. InvasionDB: A genome and gene database of invasive alien species. J. Integrat. Agric. 20, 191–200 (2021).
Article CAS Google Scholar
Waterhouse, R. M., Wyder, S. & Zdobnov, E. M. The Aedes Aegypti genome: A comparative perspective. Insect Mol. Biol. 17, 1–8 (2008).
Article CAS PubMed Google Scholar
Catapano, P. L. et al. De novo genome assembly of the invasive mosquito species Aedes japonicus and Aedes koreicus. Parasit. Vectors 16, 427 (2023).
Article CAS PubMed PubMed Central Google Scholar
Yan, R. et al. Molecular and functional characterization of a conserved odorant receptor from Aedes albopictus. Parasit. Vectors 15, 43 (2022).
Article CAS PubMed PubMed Central Google Scholar
Tiwari, V. & Sowdhamini, R. Structure modelling of odorant receptor from Aedes aegypti and identification of potential repellent molecules. Comput. Struct. Biotechnol. J. 21, 2204–2214 (2023).
Article CAS PubMed PubMed Central Google Scholar
Djiappi-Tchamen, B. et al. Analyses of insecticide resistance Genes in Aedes Aegypti and Aedes Albopictus mosquito populations from cameroon. Genes 12, 828 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. et al. Prag01, a novel deltamethrin-resistance-associated gene from Culex pipiens pallens. Parasitol. Res. 108, 417–423 (2011).
Article PubMed Google Scholar
Ibrahim, S. S. et al. Molecular drivers of insecticide resistance in the Sahelo-Sudanian populations of a major malaria vector Anopheles Coluzzii. BMC Biol. 21, 125 (2023).
Article CAS PubMed PubMed Central Google Scholar
Zadra, N., Rizzoli, A. & Rota-Stabelli, O. Chronological incongruences between mitochondrial and nuclear phylogenies of Aedes mosquitoes. Life 11, 181 (2021).
Article ADS PubMed PubMed Central Google Scholar
Leaché, A. D., Harris, R. B., Rannala, B. & Yang, Z. The influence of gene flow on species tree estimation: A simulation study. Syst. Biol. 63, 17–30 (2014).
Article PubMed Google Scholar
Montañés, J. C., Huertas, M., Messeguer, X. & Albà, M. M. Evolutionary trajectories of new duplicated and putative de novo genes. Mol. Biol. Evol. 40, msad098 (2023).
Article PubMed PubMed Central Google Scholar
Sun, C. et al. RPAN: Rice pan-genome browser for ∼3000 rice genomes. Nucleic Acids Research 45, 597–605 (2017).
Article CAS PubMed Google Scholar
Khan, A. W. et al. Super-pangenome by integrating the wild side of a species for accelerated crop improvement. Trends Plant Sci. 25, 148–158 (2020).
Article CAS PubMed PubMed Central Google Scholar
Matthews, B. J. et al. Improved reference genome of Aedes aegypti informs arbovirus vector control. Nature 563, 501–507 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Stuart, K. C. et al. Transcript- and annotation-guided genome assembly of the European starling. Mol. Ecol. Resourc. 22, 3141–3160 (2022).
Article Google Scholar
Pfitzner, W. P., Lehner, A., Hoffmann, D., Czajka, C. & Becker, N. First record and morphological characterization of an established population of Aedes (Hulecoeteomyia) koreicus (Diptera: Culicidae) in Germany. Parasites & Vectors 11, 662 (2018).
Article CAS Google Scholar
Nielsen, L. T. A revision of the adult and larval mosquitoes of Japan (including the Ryukyu Archipelago and the Ogasawara Islands) and Korea (Diptera: Culicidae). Mosquito News 40, 311 (1980).
Google Scholar
Marrama Rakotoarivony, L. & Schaffner, F. ECDC guidelines for the surveillance of invasive mosquitoes in Europe. Euro Surveill. 17, 20265 (2012).
CAS PubMed Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central Google Scholar
Benoit, G., Lavenier, D., Lemaitre, C. & Rizk, G. Bloocoo, a memory efficient read corrector. In European conference on computational biology (ECCB) (2014).
Lanfear, R., Schalamun, M., Kainer, D., Wang, W. & Schwessinger, B. MinIONQC: Fast and simple quality control for MinION sequencing data. Bioinformatics 35, 523–525 (2019).
Article CAS PubMed Google Scholar
De Coster, W., D’Hert, S., Schultz, D. T., Cruts, M. & Van Broeckhoven, C. NanoPack: Visualizing and processing long-read sequencing data. Bioinformatics 34, 2666–2669 (2018).
Article PubMed PubMed Central Google Scholar
Kokot, M., Długosz, M. & Deorowicz, S. KMC 3: Counting and manipulating k-mer statistics. Bioinformatics 33, 2759–2761 (2017).
Article CAS PubMed Google Scholar
Ranallo-Benavidez, T. R., Jaron, K. S. & Schatz, M. C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 11, 1432 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hozza, M., Vinař, T. & Brejová, B. How big is that genome? Estimating genome size and coverage from k-mer abundance spectra. In String Processing and Information Retrieval Vol. 9309 (eds Iliopoulos, C. et al.) 199–209 (Springer, 2015).
Chapter Google Scholar
Bendich, A. J. Why do chloroplasts and mitochondria contain so many copies of their genome?. BioEssays 6, 279–282 (1987).
Article CAS PubMed Google Scholar
Ekblom, R., Smeds, L. & Ellegren, H. Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria. BMC Genomics 15, 467 (2014).
Article PubMed PubMed Central Google Scholar
Ekblom, R. & Wolf, J. B. W. A field guide to whole-genome sequencing, assembly and annotation. Evol. Appl. 7, 1026–1042 (2014).
Article PubMed PubMed Central Google Scholar
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997 [q-bio] (2013)
Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Jin, J.-J. et al. GetOrganelle: A fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21, 241 (2020).
Article PubMed PubMed Central Google Scholar
Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546 (2019).
Article CAS PubMed Google Scholar
Solares, E. A. et al. Rapid low-cost assembly of the Drosophila melanogaster reference genome using low-coverage, long-read sequencing. G3 8, 3143–3154 (2018).
Article CAS PubMed PubMed Central Google Scholar
Vaser, R., Sovic, I., Nagarajan, N. & Sikic, M. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res. https://doi.org/10.1101/gr.214270.116 (2017).
Article PubMed PubMed Central Google Scholar
Medaka. (2023) https://github.com/nanoporetech/medaka
Walker, B. J. et al. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
Article ADS PubMed PubMed Central Google Scholar
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
Article CAS PubMed PubMed Central Google Scholar
Larsson, A. AliView: A fast and lightweight alignment viewer and editor for large datasets. Bioinformatics 30, 3276–3278 (2014).
Article CAS PubMed PubMed Central Google Scholar
Donath, A. et al. Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes. Nucleic Acids Res. 47, 10543–10552 (2019).
Article CAS PubMed PubMed Central Google Scholar
Grant, J. R. et al. Proksee: In-depth characterization and visualization of bacterial genomes. Nucleic Acids Res. 51, W484–W492 (2023).
Article CAS PubMed PubMed Central Google Scholar
Gilchrist, C. L. M. & Chooi, Y.-H. Clinker & clustermap.js: Automatic generation of gene cluster comparison figures. Bioinformatics 37, 2473–2475 (2021).
Article CAS PubMed Google Scholar
Sarmashghi, S., Bohmann, K., Gilbert, M. T. P., Bafna, V. & Mirarab, S. Skmer: Assembly-free and alignment-free sample identification using genome skims. Genome Biol. 20, 34 (2019).
Article PubMed PubMed Central Google Scholar
Paradis, E. Pegas: An R package for population genetics with an integrated-modular approach. Bioinformatics 26, 419–420 (2010).
Article CAS PubMed Google Scholar
R Foundation for Statistical Computing. R: A language and environment for statistical computing. Vienna, Austria.
Hu, J. et al. An Efficient Error Correction and Accurate Assembly Tool for Noisy Long Reads. (2023) https://doi.org/10.1101/2023.03.09.531669
Zimin, A. V. et al. The MaSuRCA genome assembler. Bioinformatics 29, 2669–2677 (2013).
Article CAS PubMed PubMed Central Google Scholar
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Article CAS PubMed PubMed Central Google Scholar
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A. & Zdobnov, E. M. BUSCO update: Novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38, 4647–4654 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pryszcz, L. P. & Gabaldón, T. Redundans: An assembly pipeline for highly heterozygous genomes. Nucleic Acids Res. 44, e113–e113 (2016).
Article PubMed PubMed Central Google Scholar
Mock, F., Kretschmer, F., Kriese, A., Böcker, S. & Marz, M. Taxonomic classification of DNA sequences beyond sequence similarity using deep neural networks. Proc. Natl. Acad. Sci. 119, e2122636119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Girgis, H. Z. Red: An intelligent, rapid, accurate tool for detecting repeats de-novo on the genomic scale. BMC Bioinform. 16, 227 (2015).
Article Google Scholar
Laslett, D. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32, 11–16 (2004).
Article CAS PubMed PubMed Central Google Scholar
Brůna, T., Hoff, K. J., Lomsadze, A., Stanke, M. & Borodovsky, M. BRAKER2: Automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genomics Bioinform. 3, Iqaa108 (2021).
Article Google Scholar
Stanke, M., Diekhans, M., Baertsch, R. & Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 24, 637–644 (2008).
Article CAS PubMed Google Scholar
Brůna, T., Lomsadze, A. & Borodovsky, M. GeneMark-EP+: Eukaryotic gene prediction with self-training in the space of genes and proteins. NAR Genomics Bioinform. 2, Iqaa026 (2020).
Article Google Scholar
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: Accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Article CAS PubMed PubMed Central Google Scholar
Törönen, P. & Holm, L. PANNZER A practical tool for protein function prediction. Protein Sci. 31, 118–128 (2022).
Article PubMed Google Scholar
Shumate, A. & Salzberg, S. L. Liftoff: Accurate mapping of gene annotations. Bioinformatics 37, 1639–1643 (2021).
Article CAS PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

On behalf of the “Harmadik generációs szekvenálási adatok bioinformatikai elemzése” (Bioinformatic analysis of third generation sequencing data) projects’ team we are grateful for the possibility to use ELKH Cloud (see Héder et al. 202290; https://science-cloud.hu/), which helped us achieve the results published in this paper. N.A.N. was supported by the National Research, Development and Innovation Office (OTKA PD142602). The field sampling of mosquitoes and the laboratory work for the initial processing of genome data were supported by the National Research, Development and Innovation Office, Grant Numbers: FK-138563 and RRF-2.3.1-21-2022-00010 “National Laboratory of Virology”.

Funding

Open access funding provided by University of Debrecen.

Author information

Authors and Affiliations

Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary
Nikoletta Andrea Nagy
HUN-REN-UD Behavioural Ecology Research Group, University of Debrecen, Debrecen, Hungary
Nikoletta Andrea Nagy
Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
Nikoletta Andrea Nagy
National Laboratory of Virology, Szentágothai Research Centre, University of Pécs, Pecs, Hungary
Gábor Endre Tóth, Kornélia Kurucz & Gábor Kemenesi
Bernhard Nocht Institute for Tropical Medicine, WHO Collaborating Centre for Arbovirus and Hemorrhagic Fever Reference and Research, Hamburg, Germany
Gábor Endre Tóth
Institute of Biology, Faculty of Sciences, University of Pécs, Pecs, Hungary
Kornélia Kurucz & Gábor Kemenesi
HUN-REN-UD Conservation Biology Research Group, University of Debrecen, Debrecen, Hungary
Levente Laczkó
One Health Institute, University of Debrecen, Debrecen, Hungary
Levente Laczkó

Authors

Nikoletta Andrea Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Endre Tóth
View author publications
You can also search for this author in PubMed Google Scholar
Kornélia Kurucz
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Kemenesi
View author publications
You can also search for this author in PubMed Google Scholar
Levente Laczkó
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.E.T., K.K., and G.K. conceived the idea and carried out sampling, G.E.T. carried out laboratory work and initial processing of raw data, L.L. carried out the genome assembly and descriptive statistics, N.A.N. and L.L. performed downstream analyses and wrote the first draft of the manuscript, all authors participated in writing and editing the manuscript.

Corresponding author

Correspondence to Nikoletta Andrea Nagy.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nagy, N.A., Tóth, G.E., Kurucz, K. et al. The updated genome of the Hungarian population of Aedes koreicus. Sci Rep 14, 7545 (2024). https://doi.org/10.1038/s41598-024-58096-6

Download citation

Received: 11 December 2023
Accepted: 25 March 2024
Published: 30 March 2024
DOI: https://doi.org/10.1038/s41598-024-58096-6
Springer Nature Limited

The updated genome of the Hungarian population of Aedes koreicus

Abstract

Similar content being viewed by others

Improved reference genome of Aedes aegypti informs arbovirus vector control

Mapping the virome in wild-caught Aedes aegypti from Cairns and Bangkok

De novo genome assembly of the invasive mosquito species Aedes japonicus and Aedes koreicus

Introduction

Results and discussion

Methods

Sample collection and genome sequencing

Mitochondrial genome assembly

Nuclear genome assembly

Gene prediction, functional annotation and phylogenetic reconstruction

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The updated genome of the Hungarian population of Aedes koreicus

Abstract

Similar content being viewed by others

Improved reference genome of Aedes aegypti informs arbovirus vector control

Mapping the virome in wild-caught Aedes aegypti from Cairns and Bangkok

De novo genome assembly of the invasive mosquito species Aedes japonicus and Aedes koreicus

Introduction

Results and discussion

Methods

Sample collection and genome sequencing

Mitochondrial genome assembly

Nuclear genome assembly

Gene prediction, functional annotation and phylogenetic reconstruction

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation