Genome Survey Sequencing Analysis of the Camellia Weevil, Curculio chinensis

ZHANG Li, WU Jiaxi, ZENG Luoqi, LIU Hui, TANG Xiaoyu

Chinese Agricultural Science Bulletin. 2024, 40(17): 135-141. DOI: 10.11924/j.issn.1000-6850.casb2023-0607

Section: Plant Protection

Genome Survey Analysis of Curculio chinensis (Coleoptera: Curculionidae)

Abstract

The camellia weevil, Curculio chinensis Chevrolat, is an obligate fruit-boring pest of oil-tea camellia, a woody oil crop endemic to China. It is listed among the dangerous forestry pests in China, occurs across the country's oil-tea camellia production areas, and is causing increasingly severe damage. To support further study of its genetic basis and host adaptation, and to identify a suitable genome sequencing strategy for weevils (Coleoptera: Curculionidae), survey sequencing was carried out first, to be followed by deep genome sequencing and large-scale population sequencing. Using second-generation high-throughput sequencing (MGISEQ-2000 paired-end platform), the study determined the genome size of C. chinensis and estimated its heterozygosity, repeat content, and GC content. The genome size was estimated at about 1356.82 Mb, with a sequencing depth of 50×. K-mer analysis showed a heterozygous peak in the C. chinensis genome, but the heterozygosity rate was low, at 1.20%; repetitive sequences made up about 77% of the genome. These results are significant for understanding the adaptive evolution of the camellia weevil and provide a basis for choosing strategies for subsequent complete genome assembly and multi-population genome sequencing.
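The k-mer analysis mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which software or k value was used, so everything below is an assumption for illustration: the function names (`count_kmers`, `kmer_histogram`, `estimate_genome_size`), the `min_depth` error cutoff, and the simple estimator itself, which divides the total number of k-mers by the depth of the main peak of the k-mer frequency histogram. Dedicated tools fit statistical models to the histogram and also report heterozygosity and repeat content, which this sketch does not attempt.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-length substring across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def kmer_histogram(counts):
    """Map each multiplicity to the number of distinct k-mers that have it."""
    return Counter(counts.values())


def estimate_genome_size(histogram, min_depth=2):
    """Return (estimated genome size, main peak depth).

    Illustrative estimator: total k-mers divided by the depth at which the
    histogram peaks. Multiplicities below min_depth (a hypothetical cutoff)
    are treated as sequencing errors and ignored.
    """
    usable = {d: n for d, n in histogram.items() if d >= min_depth}
    if not usable:
        raise ValueError("no k-mers at or above min_depth")
    # Depth with the largest number of distinct k-mers = main (homozygous) peak.
    peak_depth = max(usable, key=usable.get)
    total_kmers = sum(d * n for d, n in usable.items())
    return total_kmers / peak_depth, peak_depth


if __name__ == "__main__":
    # Toy histogram: error k-mers at depth 1, a small heterozygous peak at 25,
    # the main peak at 50, and repeats at 100.
    toy = Counter({1: 1000, 25: 50, 50: 1000, 100: 100})
    size, peak = estimate_genome_size(toy)
    print(f"peak depth = {peak}, estimated size = {size:.0f} bp")
```

In this toy histogram the small peak at half the main depth plays the role of the heterozygous peak described in the abstract, and the k-mers at twice the main depth stand in for repeated sequence.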

Key words

Curculio chinensis / Coleoptera / survey sequencing / heterozygosity rate / GC content

Cite this article

ZHANG Li, WU Jiaxi, ZENG Luoqi, LIU Hui, TANG Xiaoyu. Genome Survey Analysis of Curculio chinensis (Coleoptera: Curculionidae). Chinese Agricultural Science Bulletin. 2024, 40(17): 135-141. https://doi.org/10.11924/j.issn.1000-6850.casb2023-0607

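Funding

Science and Technology Project of the Jiangxi Provincial Department of Education, "Population Genetic Structure and Local Adaptive Evolution of the Camellia Weevil" (GJJ201823)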