The Gypsy moth, Lymantria dispar (Linnaeus, 1758) (Lepidoptera: Erebidae), is distributed throughout most of the northern hemisphere (Bogdanowicz et al., 1997;2000). This species damages a wide variety of 400 species of plants (Lowe et al., 2000;Pogue and Schaefer, 2007). L. dispar is composed of three subspecies based on morphological characteristics, geographic location, and flight capable females: Lymantria dispar dispar, Lymantria dispar asiatica, Lymantria dispar japonica (Pogue and Schaefer, 2007). Particularly, Lymantria species (L. d. asiatica, L. d. japonica, L. albescens, L. umbrosa, L. postalba), called AGMs, are considered a serious threat to North America because females can fly (USDA, 2016). So far, 11 gypsy moth complete mitochondrial genomes, including 3 subspecies, have been reported, but none have been reported in the Republic of Korea. Here, we analyzed the complete mitochondrial genome of L. dispar in this study.
Materials and Methods
Eggs were collected in Chungju-si, Republic of Korea (36° 58'43.9"N 127°59'43.1"E) and its DNA was extracted from one egg. Leftover DNA and the specimen were deposited at the Gyeongsang National University, Jinju, Republic of Korea. Using the extracted DNA, three long overlapping fragments (LFs: COI-ND5, ND5-lrRNA, and ND1-COI) were amplified using four sets of primers designed using data regarding the previously published species of L. dispar published in earlier studies (Djoumad et al., 2017). Using the LFs as templates, 25 overlapping short fragments (SF) were amplified using the afore mentioned primers.
Phylogenetic analysis was performed using 23 available mitogenomes from the superfamily Noctuoidea, including the one obtained in the current study (Fig. 1). The nucleotide sequence of each PCG was aligned based on the codons using RevTrans ver. 2.0 (Wernersson and Pedersen, 2003). Each of the two rRNA genes was individually aligned using MAFFT ver. 7 (Katoh and Standley, 2013) and well-aligned blocks were selected using GBlocks 0.91b software (Castresana, 2000) with gap positions allowed. Subsequently, 13 PCGs and 2 rRNAs were concatenated in an alignment (13,568 bp including gaps).
An optimal partitioning scheme (five partitions) and substitution model (GTR + Gamma + I) were determined using PartitionFinder 2 with the Greedy algorithm (Lanfear et al., 2014;2016). The five partition schemes for PCG+rRNA data matrix were obtained. These partition schemes and substitution models were applied for each phylogenetic analysis.
Bayesian inference (BI) methods were applied using Mr- Bayes ver. 3.2.7 (Ronquist et al., 2012), which was incorporated into the CIPRES Portal ver. 3.1 (Miller et al., 2010). For the BI analysis, two independent runs of four incrementally heated Markov and Monte Carlo chains (one cold chain and three hot chains) were simultaneously run for 10 million generations, with tree sampling conducted at every 100 generations. The first 25% of the sampled trees were discarded as burn-in. Partitioned analyses were conducted with each partition unlinked in each parameter (statefreq, revmat, shape, pinvar, and tratio). An average split frequency of less than 0.01 was used to represent the convergence of the two simultaneous runs. The confidence values for BI tree were determined with the Bayesian posterior probabilities (BPP). Conogethes pinicolalis (Jeong et al., 2021), Chilo suppressalis (Park et al., 2019), and Ostrinia kasmirica (Luo et al., 2021) that belong to the family Crambidae of the superfamily Pyraloidea were used as outgroups. FigTree ver. 1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) was used to visualize phylogenetic trees.
Results
The L. dispar mitogenome was found to be 15,548 bp in length, with typical gene sets - 2 rRNAs, 22 tRNAs, and 13 PCGs - and a major non-coding A + T-rich region of 393 bp length (GenBank accession number ON469817). The previous studies showed that the shortest mitogenome was 15,569 bp (FJ617240; Unpublished) and the longest 15,698 bp (Djoumad et al., 2017). The largest size variation was detected in the A + T-rich region (393 bp in this study, the shortest 368 bp in Djoumad et al. (2017), and longest 461 bp in Djoumad et al. (2017)) (Table 1).
A phylogenetic analysis was performed using 23 mitogenome sequences, including the mitogenome of Lymantria dispar analyzed in this study, using concatenated 13 PCGs and 2 rRNAs. These were composed of 5 families (Erebidae, Euteliidae, Noctuidae, Nolidae and Notodontidae) in Noctuoidea (Fig. 1). Monophyly of each family was supported with the highest nodal supports, presenting the familial relationships of ((Erebidae + ((Euteliidae + Noctuidae) + Nolidae)) + Notodontidae) in Noctuoidea (Fig. 1). Furthermore, the monophyly of each family were also supported generally with the highest nodal supports (BPP = 1.0; Fig. 1). The 11 Lymantria dispar mitogenomes formed a strong group with the higher nodal supports from 0.81 to 1 (Fig. 1). L. dispar analyzed in this study was confirmed to L. d. asiatica by supporting the monophyly with the highest node value of L. d. asiatica (KY923067) (Djoumad et al., 2017).