Next Article in Journal
An Assessment of the Genetic Parameters of Boars’ Reproductive Traits
Next Article in Special Issue
Chloroplast Genome Comparison and Phylogenetic Analysis of the Commercial Variety Actinidia chinensis ‘Hongyang’
Previous Article in Journal
Regulation of Epithelial-Mesenchymal Transitions by Alternative Splicing: Potential New Area for Cancer Therapeutics
Previous Article in Special Issue
Comparative Plastid Genome and Phylogenomic Analyses of Potamogeton Species
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complete Chloroplast Genomes of Saussurea katochaete, Saussurea superba, and Saussurea stella: Genome Structures and Comparative and Phylogenetic Analyses

1
Qinghai Academy of Animal and Veterinary Science, Xining 810016, China
2
State Key Laboratory of Plateau Ecology and Agriculture, Qinghai Academy of Animal and Veterinary Sciences, Qinghai University, Xining 810016, China
*
Authors to whom correspondence should be addressed.
Genes 2023, 14(11), 2002; https://doi.org/10.3390/genes14112002
Submission received: 19 September 2023 / Revised: 21 October 2023 / Accepted: 25 October 2023 / Published: 26 October 2023
(This article belongs to the Special Issue Plant Plastid Genome)

Abstract

:
Saussurea plants are widely distributed in Asia and Europe; however, their complex phylogenetic relationships have led to many difficulties in phylogenetic studies and interspecific identification. In this study, we assembled, annotated, and analyzed the chloroplast genomes of three Saussurea plants: Saussurea katochaete, Saussurea superba, and Saussurea stella. The results showed that the full-length sequences of the three Saussurea plants were 152,561 bp, 151,452 bp, and 152,293 bp, respectively, which represent the typical quadripartite structure, and the genomes were relatively conserved. The gene annotation results showed that the chloroplast genomes of S. katochaete, S. superba, and S. stella were annotated with 128, 124, and 127 unique genes, respectively, which included 83, 80, and 83 protein-coding genes (PCGs), respectively, 37, 36, and 36 tRNA genes, respectively, and 8 rRNA genes. Moreover, 46, 45, and 43 SSR loci, respectively, and nine highly variable regions (rpl32-trnL-UAG, rpl32, ndhF-rpl32, ycf1, trnC-GCA-petN, trnC-GCA, rpcL, psbE-petL, and rpl16-trnG-UUG) were identified and could be used as potential molecular markers for population identification and phylogenetic study of Saussurea plants. Phylogenetic analyses strongly support the sisterhood of S. katochaete with S. superba and S. stella, and are all clustered with S. depsagensis, S. inversa, S. medusa, and S. gossipihora, of which S. gossipiphora is most closely related. Additionally, the phylogenetic results indicate a high frequency of differentiation among different species of Saussurea plants, and many different species or genera are morphologically very different from each other, which may be related to certain genetic material in the chloroplasts. This study provides an important reference for the identification of Saussurea plants and studies their evolution and phylogenetics.

1. Introduction

Saussurea originated in the early-middle Miocene within the Hengduan Mountains [1], and is an annual, biennial, or perennial herb of the Trib. Cynarea of the Compositae. There are more than 400 species worldwide, mainly in Asia and Europe, with about 264 species (66%) found in China, mainly in areas at altitudes of 400–5000 m [2]. Due to the wide distribution of the genus, the variety of species, and the unclear morphological differentiation between species, the identification of species is difficult [3,4].
More than 10 species of this genus have been used to treat bruises and sprains, altitude sickness, and food poisoning. In the last two decades, it has been found that the chemical constituents of the Saussurea plant mainly include compounds such as steroids, phenylpropanoids, flavonoids, sesquiterpenoids, and triterpenoids, which are known to possess biological activities such as antitumor, antibacterial, anti-inflammatory, and cardiotonic activities [5,6]. Because of its high medicinal value and because wild resources are not yet available for large-scale application, artificial cultivation is one of the future development trends. However, Saussurea plants are distributed in harsh environments, coupled with their controversial population identification [7] making accurate species identification one of the prerequisites for achieving cultivation.
Previous studies have shown that the plant chloroplast genome, which together with the central nuclear genome and the mitochondrial genome constitutes a complete plant cell, is an effective way to achieve species identification [8,9]. In addition, chloroplasts, like the mitochondria, are double-membrane-bound cytoplasmic organelles whose origin can be traced back at least one billion years to the prokaryotic organisms of a photosynthetic cyanobacterium [9,10,11]. Over a long period of evolution, chloroplasts have become indispensable organelles to allow green plants to carry out life activities such as photosynthesis and carbon fixation [12]. They are involved in the synthesis of fatty acids and amino acids in the plant body [13], and they possess a set of structurally complete genomes [14], which usually does not exceed 160 kb, including a large single-copy region, a small single-copy region, and two inverted repeat regions, [15]. Compared with the nuclear and mitochondrial genomes, they have the advantages of easy discernment of their age characteristics, easy access to sequences, a more conserved gene composition and structure, rich genetic information, and high species discrimination [9,16]. These characteristics are of great significance for the identification of Saussurea plant species, which have complex germplasm relationships and a rich diversity.
S. katochaete and S. stella are both perennial stemless rosette-like herbs. In China, they are mainly distributed in hillside meadows, valley marshes, river meadows, alpine meadows, and hillside scrub meadows in Gansu, Qinghai, Sichuan, Yunnan, and other provinces at 2230–4700 m and 2000–5400 m, respectively. S. superba is a perennial herb that is mainly found in the sandy river valleys of the Gansu and Qinghai Provinces in China. Previous studies on Saussurea plants have focused on resource investigations [17], karyotyping [7,18], and chemical composition analyses [5], but chloroplast genomic studies on S. katochaete, S. superba, and S. stella have not been reported. Therefore, in this study, we assembled, annotated, and analyzed the chloroplast genomes of S. katochaete, S. superba, and S. stella using second- and third-generation high-throughput sequencing technologies and revealed their phylogenetic relationships. The results are expected to provide some references for the identification of populations, the identification of germplasm resources, and phylogenetic studies in the field of Saussurea plants.

2. Results

2.1. Characterization of the Saussurea Chloroplast Genome

The full-length sequences of the chloroplast genomes of the three Saussurea plants were 152,586 bp (S. katochaete), 152,490 bp (S. superba), and 152,442 bp (S. stella), respectively. Additionally, they were double-stranded loops with a typical quadripartite structure, as in most herbaceous plants. Two inverted repeat regions (IRs) separate the large single copy (LSC) and small single copy (SSC) regions (Figure 1, Figure 2 and Figure 3). In the IR regions of S. katochaete, S. superba, and S. stella, the IRa lengths were found to be 25,193 bp, 25,193 bp, and 25,201 bp, and the IRb lengths were 25,193 bp, 25,193 bp, and 25,201 bp, respectively. The lengths of the LSC regions were 83,551 bp, 83,460 bp, and 83,457 bp. The SSC region lengths were 18,649 bp, 18,644 bp, and 18,583 bp, respectively (Table 1). The GC contents of the three Saussurea plants were similar (37.68–37.69%), and the GC contents of the IR region (43.12–44.28%) were higher than those of the LSC region (35.80–35.94%) and the SSC region (31.39–31.40%) (Table 1). This might be related to the higher GC-rich rRNA and tRNA gene contents [14,19]. The AT content (62.31–62.32%) was greater than the GC content (37.68–37.69%), indicating the same characteristics as other plant chloroplast genomes, i.e., the AT bias was obvious [20].

2.2. Gene Annotation and Categorization Analysis

The results of the three Saussurea plants showed that the chloroplast genome of S. katochaete contains 128 genes, of which eighty-three are PCGs, eight are rRNA genes, and thirty-seven are tRNA genes (Table 2). The chloroplast genome of S. superba contains 124 genes, of which eighty are PCGs, eight are rRNA genes, and thirty-six are tRNA genes. The chloroplast genome of S. stella contains 127 genes, of which eighty-three are PCGs, eight are rRNA genes, and thirty-six are tRNA genes. The genes can be classified into four main categories based on their functions. The first category is the class of genes related to photosynthesis, including photosystem I, photosystem II, the cytochrome b/f complex, ATP synthase, and NADH dehydrogenase. The second category is the category of genes associated with one’s own inheritance, including the ribosomal protein class (SSU), ribosomal protein (LSU), RNA polymerase, RubisCO large subunit, transfer RNAs, and ribosomal RNAs. The third category contains genes associated with other syntheses, including protease, maturase, and translational initiation factors. The fourth category contains gene types with unknown functions. In addition, a total of 22 intron-containing genes were identified in the three Saussurea plants (Table 3, Table 4 and Table 5, Figure 4). Among these intron-containing genes, 18 genes (atpF, clpP, ndhA, ndhB, petB, petD, rpl16, rpl2, rpl2, rpoC1, rps16, trnA-UGC, trnA-UGC, trnI-GAU, trnI-GAU, trnK-UUU, trnL-UAA, and trnS-CGA) contain one intron, including thirteen PCGs and five tRNA genes, and two genes (clpP and ycf3) contain two introns. Two rps12 genes are trans-splicing genes.

2.3. Long Repeat Sequences and SSR Analyses

Long repetitive sequences comprise three types: forward (F), palindromic (P), and tandem (T) repeats. They may function to promote chloroplast genome rearrangements and can increase the population’s genetic diversity. A total of 90 unique long repetitive sequences were detected in the chloroplast genomes of three plants of the genus Saussurea (Table 6), of which 25 pairs of long repetitive sequences were found in S. katochaete, including one forward and twenty-four palindromic repetitions. Thirty-six and 29 pairs of long repetitive sequences were found in S. superba, and S. stella had 36 and 29 pairs of long repeat sequences, respectively, all of which were palindromic repeats (Table 6). The types of long repetitive sequences found in the chloroplast genomes of the three plant species of the genus Saussurea are almost all palindromic repeats with no tandem repeats, except for S. katochaete, which contains one forward repeat. Of these repeats, the shortest is only 30 bp in length, while the longest is as large as 4344 bp. The results indicate that the number, distribution, and length of the long repetitive sequences in the chloroplast genome of Saussurea plants are heterogeneous.
SSRs (Simple Sequence Repeats), i.e., Simple Repeat Sequences refer to a segment of DNA in the genome consisting of basic units of one to six nucleotides that are repeated many times. They are widely distributed among different locations of the genome. In addition, they are widely used as molecular markers in species identification and phylogenetic studies [19]. The annotation results showed that a total of 908 SSRs were identified, with the numbers of SSRs for S. katochaete, S. superba, and S. stella being 305, 303, and 300, respectively (Table 7). The average number of SSRs was about 303, with the highest number of SSRs being found in S. katochaete and the lowest number of SSRs being found in S. stella. Among them, 46, 45, and 43 SSR loci were identified in three Saussurea plants (Table 8, Table 9 and Table 10). Most of the SSR loci were located in the LSC region (37, 37, and 34 loci) of the Saussurea chloroplast genome, whereas relatively few SSR loci were located in the SSC region (4, 4, and 4 loci) and the IR region (5, 5, and 5 loci). In addition, the six types of SSRs identified were mononucleotide (36.12%), dinucleotide (50%), trinucleotide (5.07%), tetranucleotide (6.72%), pentanucleotide (1.32%), and hexanucleotide (0.77%). Dinucleotide and mononucleotide were the most predominant types of SSRs. The five major repetitive sequence types were A/T, AA/TT, AAA/TTT, AAAA/TTTT, and AAAAA/TTTTT.

2.4. Codon Usage Bias

The PCGs in the chloroplast genomes of S. katochaete, S. superba, and S. stella contained 21,408, 18,341, and 18,986 codons, respectively (Table 11). Of the amino acids encoded, the most encoded amino acid was leucine (Leu), which accounted for 10.49%, 10.33%, and 10.23% of all encoded amino acids, respectively, in the three plant types. The amino acid that was least encoded was cysteine (Cys), which accounted for only 1.07%, 1.03%, and 1.07% of all encoded amino acids, respectively, in the three plant types. This result is similar to that of a previous study on herbaceous plants [21]. Based on the RSCU values, the number of preferred codons in the chloroplast genomes of all three Saussurea plants was found to be 30 (RSCU > 1), and the number of non-preferred codons was found to be 32 (RSCU < 1). Among these, the 29 preferred codons, except for the UUG, which ended in G/C, ended in A/U. This indicates that the codons ending in the base of A/U are preferred codons, and codons ending in G/C are non-preferred codons [22], a result that may be related to the adaptive evolution of the plant chloroplast genomes [23]. This is in contrast to the non-preferred codons (RSCU < 1), which mostly end in G/C, suggesting that these types of codons occur less frequently in the Saussurea chloroplast genome. In addition, methionine (Met) and tryptophan (Trp) were encoded by only one codon, and both RSCUs were 1. The number of codons encoded by the rest of the codons ranged from two to six, and the results were in agreement with the findings of Chong et al. and Shi et al. [19,24]. The three termination codons detected were UAA, UAG, and UGA, with UAA being the most frequent, i.e., the termination codons were biased towards UAA.

2.5. IR Expansion and Contraction

In this study, we comparatively analyzed the boundaries of the IR region of the chloroplast genomes of S. katochaete, S. superba, and S. stella with those of seven closely related species: Saussurea phaeantha, Saussurea sutchuenensis, Saussurea apus, Saussurea depsangensis, Saussurea gossipiphora, Saussurea bullockii, and Saussurea leucophylla (Figure 5). The results showed that the lengths of the chloroplast genomes of the 10 Saussurea plants range from 152,270 (S. gossipiphora) to 152,586 (S. katochaete) bp. The length of the IR region was found to be 25,185 (S. bullockii) to 25,202 (S. gossipiphora) bp. The length of the LSC region was found to be 83,344 (S. gossipiphora) to 83,551 (S. katochaete) bp, and the length of the SSC region was found to be 18,522 (S. gossipiphora) to 18,690 (S. leucophylla) bp. The chloroplast genes of the 10 species differed in length by 316 bp. The LSC region differed by 207 bp, the SSC region differed by 168 bp, and the IR region differed by 77 bp. The IR showed a high degree of conservatism with the LSC and SSC boundaries, and genes spanning or close to the boundaries of the IR and SC regions mainly included rps19, rpl22, rpl2, ycf1, ndhF, and trnH. Among them, ndhF, which is involved in photosynthesis, is located across the JSB boundary, 2–15 bp away from the JSB boundary. The ycf1 gene crosses the JSA boundary and extends from 5222 (S. depsangensis) to 5300 (S. leucopphylla) bp towards the Ira region and from 18,522 (S. gossipiphora) to 18,690 (S. leucophylla) bp from the IRa region towards the SSC region. The rps19 gene extends 6 bp towards the IRb region. The JLA boundary is located between rpl2 and trnH.

2.6. Comparative Genome Analyses of Saussurea Species

To understand the degree of sequence similarity between the chloroplast genomes of S. katochaete, S. superba, and S. stella and the other seven close relatives to Saussurea, the S. phaeantha chloroplast genome was used as the reference sequence for a full sequence comparison and analysis. The results show that the chloroplast genome sequences of the 10 species exhibit high levels of similarity or highly conserved genes, and no obvious gene rearrangements were found (Supplementary Figure S1, Figure 6). In comparison, the sequences of the LSC and SSC regions were found to be more variable than those of the IR regions (Figure 7), and the sequences of non-coding regions were more variable than those of the coding regions. Among these, several loci, such as TRN-GCA-petN, rpl32-trnL-UAG, and ycf1, showed lower levels of similarity.

2.7. Analysis of the Nucleotide Diversity

A total of 1112 variable (polymorphic) sites were identified in 151,485 nucleotide sites, including 694 single variable sites (SVS) and 418 parsimony information sites (PIS). Two different types were observed under SVS: 690 for two variable sites (SV2V) and four for three variable sites (SV3V). Similarly, there were two types of PIS: four loci with two variants (PIS2V) and seven loci with three variants (PIS3V). In addition, to quantify the level of nucleotide polymorphisms, the chloroplast genomes of 10 Saussurea plant species were compared and analyzed using DNAsp software(Version: v.5.10.01). The results showed that the Pi values of nucleotide polymorphisms in the chloroplast genomes of the 10 Saussurea species varied from 0 to 0.00911, with a mean value of 0.00200. At least nine highly variable regions are included in the Saussurea chloroplast genomes, i.e., rpl32-trnL-UAG, rpl32, ndhF-rpl32, ycf1, trnC-GCA-petN, trnC-GCA, rpcL, psbE-petL, and rpl16-trnG-UUG. These genes or gene spacer regions have high variability (Pi > 0.007). The highest values of nucleotide variation (Pi) found were 0.00911, 0.00828, 0.00806, 0.00786, 0.00753, 0.00719, 0.00719, 0.00714, and 0.00711 (Figure 8). These can be used as a potential DNA barcode for Saussurea species.

2.8. Phylogenetic Analysis

The phylogenetic trees constructed based on both the ML and BI methods are based on agreement (Figure 9). Twenty-nine species of the Trib. Cynareae clustered in the same large branch, except for the exotic taxa. The two Carlininae plants, Atractylodes lancea and Carlina acaulis, located at the base of the evolutionary tree, clustered into one branch. Secondly, Carduus crispus, Cirsium japonicum, and Cynara cardunculus of the Carduinae clustered into one branch. Centaurea cyrdunculus of Centaureinae clustered into one branch, while Carthamus tinctorius of Centaureinae and Arctium lappa of Carduinae, as well as Dolomiaea calophylla of Carduinae, grouped together in a single unit, distinguishing S. albifolia of Carduinae from 19 other plants of the same genus. In the present study, S. stella clustered with S. depsagensis, S. inversa, S. medusa, and S. gossipihora, while S. katochaete and S. superba clustered together in a single unit that was closest in kinship to S. gossipiphora. It can be seen that S. katochaete and S. superba are closely related, and both are relatively distantly related to S. stella.

3. Discussion

3.1. Chloroplast Genomic Characteristics and Sequence Variation in the Three Saussurea Species

In this study, the complete chloroplast genomes of three species of SaussureaS. katochaete, S. superba, and S. stella. They were found to be similar to those of other herbaceous plants with typical double-stranded circular tetramer structures [16,25] and sizes of 152,561 bp, 151,452 bp, and 152,293 bp, respectively. Changes in the size of plant chloroplast genomes are mainly affected by the expansion and contraction of IR regions, variation in spacer regions, and gene loss. At the same time, these expansion and contraction events cause sequence disruptions of genes located at the edges of IR regions, ultimately resulting in the formation of pseudogenes. These pseudogenes do not participate in the protein-coding process, and their products are non-essential in the organism; hence, they are also known as non-functional genes [24,26,27]. However, some other studies have found that the nucleotide sequences of the pseudogenes are well-preserved, and therefore, they may not be non-functional genes, as traditionally perceived [28]. In this study, 124–128 genes were identified in the chloroplast genomes of three Saussurea plants. Their protein-coding genes, tRNA genes, rRNA genes, and intron genes showed extremely similar characteristics in terms of number, which explains, to some extent, why they have similar epimorphologies. Among them, two shared pseudogenes, rps19 and ycf1, were also identified in the gene sequence. These are commonly found in the chloroplast genomes of many angiosperms and herbaceous plants, such as Lycium (Solanaceae) [29], Allium chrysanthum (Amaryllidaceae J.St.-Hil.) [30], and C. cardunculus (Asteraceae) [31], among others. In recent years, it has been found that the ycf1 gene may be involved in the process of photosynthesis and in responses to environmental changes in plants [32]. It is a key DNA barcode in plants [23], but expansion and contraction of the IR region may also lead to its fragmentation [26]. In addition, the contraction and expansion of the IR region result in the loss of the ycf2 gene in S. superba and S. stella, suggesting that genetic differentiation may occur in the region of loss in Saussurea species and may also inhibit chloroplast genome enlargement in the Saussurea species to a certain extent [24,33].
A total of 22 intron-containing genes were identified in three plants of the genus Saussurea. These included 18 genes with one intron, two genes with two introns, and two trans-shear genes. These intron-containing genes can play important roles in regulating gene expression and improving agronomic traits in plants [34].
The determination of long repeated sequences and Simple Sequence Repeats (SSRs) is closely related to the genetic diversity of plants and the identification of molecular markers for germplasm resources [35]. Long repetitive sequences are usually found in the intergenic spacer (IGS) and intronic regions of plant chloroplast genomes [36]. In this study, the repetitive sequences observed in the chloroplast genomes of the three Saussurea plant species showed a very heterogeneous phenomenon in terms of the number, type, and length. With palindromic (P) repeats occurring at a very high frequency relative to forward (F) repeats, tandem (T) repeats were not detected in any of the three Saussurea plants, suggesting that there may be a certain degree of difference in the mutation frequency among the three species [35]. These long repetitive sequences play important roles in gene recombination and sequence structure variation [23] and can serve as potential indicators of differential identification between Saussurea species. SSRs are a class of short repetitive sequences, or Simple Sequence Repeats, found in the chloroplast genome. They are also known as microsatellites. They are prevalent in eukaryotes and prokaryotes and are closely related to gene expression and regulation [37]. These DNA sequences are widely involved in a wide range of life processes in plant cells, and because of their high level of polymorphism, they are often used in the fields of genetic diversity, species identification, and the development of molecular markers [21,23]. The three Saussurea plants included in this study, S. katochaete, S. superba, and S. stella, were identified as having 46, 45, and 43 SSR loci, respectively (Table 9, Table 10 and Table 11). Six types of SSRs were included, i.e., mononucleotide, dinucleotide, trinucleotide, tetranucleotide, pentanucleotide, and hexanucleotide, which are mostly located in the non-coding region. Mononucleotide (A/T) is the predominant type of repeat, a similar result to studies on the predominance of A/T in the base composition of SSRs in other herbaceous plants, such as Thalictrum cirrhosum (Ranunculaceae) [38] and Themeda japonica (Gramineae) [39]. This A/T-dominant repeat type is thought to be a widespread phenomenon in the chloroplast genomes of higher plants, and the fact that preferred codons predominantly end in A/U causes this bias. Previous studies have suggested that this may be related to natural selection and genetic mutation [40]. In addition, we identified nine highly variable genes or intergenic regions (rpl32-trnL-UAG, rpl32, ndhF-rpl32, ycf1, trnC-GCA-petN, trnC-GCA, rpcL, psbE-petL, and rpl16-trnG-UUG), and it is clear that the sequence variability is significantly higher in the LSC region and the SSC region than in the IRs. This may be caused by certain duplicated genes within the IR region preventing mutations from occurring [40]. Some of these rare SSR loci as well as highly variable regions can be used as potential molecular markers in the fields of intraspecific genetic variation, population identification, species evolution, and phylogenetic studies in plants, e.g., rps8, rpl16, PsbE-petL, and ndhF-rpl32 have been demonstrated to be useful in Phoenix dancong (Camellia sinensis) [41], Oryza sativa (Gramineae) [42], and some Lauraceae plants (Machilus yunnanensis, Machilus balansae) [43] in terms of population identification and phylogenetic analyses, demonstrating high levels of resolution.
Codon preferences in plant chloroplast genomes can reveal phylogenetic relationships across species or within the same species, and mutation and natural selection are closely linked to codon preferences in genes [44]. For example, the codon preferences of some Euphorbiaceae [45] plants are mainly influenced by natural selection, and those of Oncidium Gower Ramsey [46], an ornamental flower, are mainly influenced by mutations. It has also been suggested that the factors affecting codon preferences in the chloroplast genome are complex and diverse and are not solely influenced by a single natural selection or mutation [47]. In this study, codons ending in A/U were dominant in the chloroplast genomes of the three Saussurea plants, and the RSCU values were close to each other, with the number of preferred codons being 30 (RSCU > 1) and the number of non-preferred codons being 32 (RSCU < 1). This suggests that the chloroplast genomes of Saussurea plants are relatively conserved, and they have formed a unique codon use system that distinguishes them from other species during the evolutionary and developmental processes. In addition, the three Saussurea species showed a higher preference for the codon UUA, which encodes an amino acid type of Leu, than for other codons with RSCUs as high as 1.95, 2.1, and 2.05, respectively. This suggests that the trnL-UAA gene may have exerted an important influence on the evolutionary development of Saussurea species.

3.2. Phylogenetic Analyses in Saussurea Species

Although the chloroplast genome dataset is still considered to have many shortcomings, the gene sequences are still important for revealing phylogenetic relationships between or within species because they include many important information sites [24,33,48]. The phylogenetic tree of Saussurea plants, like those of other reported species [24], has similar topological structures, suggesting that many closely related species within the genus may have originated from a common ancestor. The flanking genes of the JLB, JSB, JSA, and JLA borders of the three species in this study, S. katochaete, S. superba, and S. stella, were the same, being the rpl22, rps19, rpl2, ndhF, ycf1, and psbA genes. With the exception of minor differences in the amplification lengths of the genes at the JSB boundary, the amplification lengths were consistent across the remaining boundaries, suggesting close phylogenetic relationships among these three Saussurea species. After they diverged from a common ancestor, S. stella, S. depsagensis, S. inversa, S. medusa, and S. gossipihora clustered together in a separate monophyletic group. Additionally, S. katochaete and S. superba evolved into a sisterhood. Obviously, S. katochaete and S. superba are more closely related compared to S. stella.
Saussurea was first established in 1810, and it is distributed in Asia and Europe. In China, it is mainly distributed in the high-altitude areas of southwest and northwest China. Due to the wide distribution of the genus, the great variety of species, and the unclear morphological differentiation among species, coupled with the fact that the classification of the genus is mainly based on phenotypic characteristics, such as the subtending leaf of capitulate, etc., its taxonomic study and interspecific identification have been controversial [4,49]. Since the introduction of Mendel’s laws of inheritance in 1865, researchers have generally believed that the evolutionary process of plants can be fully explained by applying Mendel’s laws of inheritance. However, studies on some higher plants proved in 1909 that certain variable traits in plant evolution may be closely related to chloroplasts rather than being caused by Mendelian modes of inheritance [50]. It was not until the 1960s that Sager and Ishida’s studies further confirmed that there is indeed a unique class of genetic material in the plant chloroplast genome that is capable of influencing certain traits in plants [9]. Phylogenetic analyses of Saussurea plants in the present study revealed a high frequency of Saussurea species differentiation among different species and great morphological differences among many different species, which may also be related to certain genetic materials in the chloroplasts.

4. Materials and Methods

4.1. Plant Samples, DNA Extraction, and Sequencing

Leaves from S. katochaete, S. superba, and S. stella used for this study were collected from Yushu City, Yushu Tibetan Autonomous Prefecture, Qinghai Province, China (N: 96°29′35″, E: 33°18′58″, H: 4210 m), Menyuan County, Haibei Tibetan Autonomous Prefecture, Qinghai Province, China (N: 37°22′35″, E: 101°37′21″, H: 3276 m), and Zaduo County, Yushu Tibetan Autonomous Prefecture, Qinghai Province, China (N: 95°49′23″, E: 32°47′38″, H: 4601 m). DNA samples were extracted by the classical CTAB method, and the DNA quality was detected by 1% agarose gel electrophoresis. DNA was precisely quantified by Qubit. The constructed libraries were sequenced using Illumina NovaSeq, an Illumina high-throughput sequencing platform.

4.2. Genome Assembly and Annotation

Genome splicing was performed using Flye (version: v.2.9; parameters: meta-plasmids) software, and the splicing results were compared with a close reference genome using Blastn (version: 2.12.0+; parameters: evaluate 1 × 10−5) based on a comparison to determine the candidate sequence assembly results. The genomic linkages of chloroplasts were determined based on the depth of sequence sequencing, the read comparison situation, and the comparison situation with closely related species. The connected sequences, if containing gaps (containing N sequences), were further hole-patched using Gapcloser (Version: 1.12) to obtain the final splicing results.
The functional annotation of the chloroplast genome included coding gene predictions and non-coding RNA annotations (rRNA and tRNA annotations). Gene annotations were performed using the chloroplast-specific annotation software CPGAVAS2 (http://47.96.249.172:16019/analyzer/annotate accessed on 2 August 2023).

4.3. Genome Structure Analyses

SSRs (Simple Sequence Repeats) are segments of DNA in the genome consisting of basic units of one to six nucleotides repeated many times that are widely distributed in different locations in the genome, and the flanking sequences are usually single sequences that are strongly conserved. SSRs were performed on chloroplasts using MISA (version: v2.1; default parameters; corresponding to the minimum number of replicates for each replicate unit (unit size): 1-8 2-8 2-8 2-8 2-4 3-4 4-4 4-3 5-3 6-3, respectively), as described in (http://pgrc.ipk-gatersleben.de/misa/misa.html accessed on 16 September 2023).
We used vmatch (http://www.vmatch.de/ accessed on 7 August 2023; parameter: minimal repeat size 30 bp) to find scattered long repeat sequence fragments in the chloroplast genome. Codon usage bias, or relative synonymous codon usage (RSCU), is an assessment of the preference for the use of synonymous codons. The value is equal to the ratio of the actual observed value of synonymous codons to the average expected value of synonymous codon usage. If there is no preference for codon usage, the RSCU value is 1; if the codon is used more frequently than other synonymous codons, its RSCU value is greater than 1; if the opposite is true, the RSCU value is less than 1. The frequency of relative synonymous codon usage was statistically estimated for codons greater than 300 in length and with “ATG”, “TTG”, “CTG”, “ATT”, “ATC”, “GTG”, and “ATA” as start codons. With “ATT”, “ATC”, “GTG”, and “ATA” as start codons, the use of “TGA”, “TAG”, and “TAA” as stop codons was analyzed for codon usage bias using CodonW (Version: 1.4.4). Sequences were analyzed for codon preference using CodonW (Version: 1.4.4).

4.4. Genome Comparison and Nucleotide Variation Analysis

The LSC, SSC, and IR region boundaries of the Saussurea chloroplast gene regions and adjacent genes were visualized and analyzed using the IRscope online analysis tool (Amiryousefi et al., 2018). The whole sequence of the Saussurea chloroplast genome was compared and visualized using the mVISTA online tool (https://genome.lbl.gov/vista/index.shtml accessed on 8 August 2023) in shuffle-LAGAN mode using S. phaeantha as a reference sequence. The corresponding genome sequences were compared using MAFFT software (Version: v7.487). A chloroplast genome polymorphism analysis (nucleotide diversity, Pi) was performed using DNAsp software (Version: v.5.10.01) with a window length of 800 bp and a glide step of 200 bp.

4.5. Phylogenetic Analyses

To clarify the phylogenetic positions of S. katochaete, S. superba, and S. stella in Saussurea, 20 Saussurea plants and nine other Compositae, Trib. Cynareae plants were studied, and the Compositae, Trb. Heliantheae, Helianthus, and Helianthus annuus were used as an outgroup (Table 12). They were analyzed by a multiple sequence comparison using MAFFT software (Version: v7.487), and then IQ-TREE software (Version: v.1.6.8.) was used to construct an ML tree (maximum likelihood method) using the TVM+F+I+G4 model and a BI tree (Bayesian method) using MrBayes (Version: v. 3.2.6.) using the GTR+I+G model with 2,000,000 generations and a sampling frequency of 100. The visualization was performed using ITOL software (http://itol.embl.de/ accessed on 17 August 2023).

5. Conclusions

In this study, we assembled, annotated, and analyzed the whole chloroplast genomes of three species of Saussurea and explored their genomic features and phylogenetic relationships with other closely related species. The results show that the three Saussurea plants, similar to other herbaceous plants, have relatively conserved genome structures, abundant SSR loci, and some highly variable genes or gene intergenic regions. This provides an important basis for the population identification and phylogenetic study of Saussurea plants. The degree of sequence variation in the LSC and SSC regions of the genome was significantly higher than that in the IR region. The trnL-UAA gene may have had an important influence on the evolutionary development of species of Saussurea. In addition, phylogenetic trees constructed based on ML and BI indicated that S. katochaete and S. superba are more closely related compared to S. stella. Additionally, the great variation among different species of Saussurea might be related to certain genetic materials in the chloroplast genome. The results of this study are important in terms of population identification, phylogeny, genetic variation, species evolution, and the identification of germplasm resources in Saussurea plants.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14112002/s1, Figure S1: Comparison of the genomes of Saussurea species using Geneious Prime. Annotations of the CDs, rRNA, and tRNA genes are shown in yellow, red, and with purple-red arrowheads, respectively.

Author Contributions

Manuscript writing and data analysis, H.H.; revision of manuscripts, T.W., X.P. and C.T.; sample collection, Z.C.; guidance and supervision, X.L. and Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Department of Science and Technology of Qinghai Province (2021-SF-A4) and the Chinese Academy of Sciences (LHZX-2022-01).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The assembled chloroplast genome sequences of S. katochaete, S. superba, and S. stella have been uploaded to and deposited in GenBank under accession numbers OQ596208, OQ596209, and OQ596210, respectively. The first author and corresponding author can provide all relevant raw data if required.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Xu, L.; Herrando-Moraira, S.; Susanna, A.; Galbany-Casals, M.; Chen, Y. Phylogeny, origin and dispersal of Saussurea (Asteraceae) based on chloroplast genome data. Mol. Phylogenetics Evol. 2019, 141, 106613. [Google Scholar] [CrossRef] [PubMed]
  2. Yang, Y. The Study of Variation in Resource Allocation and Reproductive Strategies of Two Species of Saussurea DC. with Elevation in Qinghai-Tibet Plateau. Ph.D. Thesis, Northwest Normal University, Lanzhou, China, 2018. [Google Scholar]
  3. Wang, Y.; Ni, Z.; Dong, M.; Cong, B.; Shi, Q.; Gu, Y.; Kiyota, H. ChemInform Abstract: Secondary Metabolites of Plants from the Genus Saussurea: Chemistry and Biological Activity. ChemInform 2010, 7, 2623–2659. [Google Scholar] [CrossRef]
  4. Qiao, Y.; Song, Y. Clustering Analysis of Karyotype Resemblance-near Coefficient for 15 Species of Saussurea DC. Chin. J. Grassl. 2011, 33, 38–43. [Google Scholar]
  5. Wang, H.; Qin, G. Chemical composition and biological activities of Saussurea plants. Drugs Clin. 2005, 2, 47–52. [Google Scholar]
  6. Zhang, W. The Chemical Constituents of Saussurea Graminea. Ph.D. Thesis, University of Electronic Science and Technology of China, Chengdu, China, 2019. [Google Scholar]
  7. Li, S.; He, T.; Zhang, D.; Ding, W. Karyotypes of Six Species of Genus Saussurea DC in the Hengduan Mountains, Southwestern China. Acta Bot. Boreali-Occident. Sin. 2013, 33, 2194–2202. [Google Scholar]
  8. Wang, X.; Li, H.; Zhao, W.; Li, Y.; Qin, X.; Zhang, F. Characterization and phylogenetic analysis of complete chloroplast genome of Polygala tenuifolia. Chin. Tradit. Herb. Drugs 2023, 54, 3655–3665. [Google Scholar]
  9. Ravi, V.; Khurana, J.; Tyagi, A.; Khurana, P. An update on chloroplast genomes. Plant Syst. Evol. 2008, 271, 101–122. [Google Scholar] [CrossRef]
  10. Kostianovsky, M. Evolutionary origin of eukaryotic cells. Ultrastruct. Pathol. 2000, 24, 59–66. [Google Scholar] [CrossRef]
  11. Zhang, Y.; Tian, L.; Lu, C. Chloroplast Gene Expression: Recent Advances and Perspectives. Plant Commun. 2023, 4, 100611. [Google Scholar] [CrossRef]
  12. Wicke, S.; Schneeweiss, G.; Depamphilis, C.; Müller, K.; Quandt, D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol. Biol. 2011, 76, 273–297. [Google Scholar] [CrossRef]
  13. Xia, E.; Tong, W.; Wu, Q.; Wei, S.; Zhao, J.; Zhang, Z.; Wei, C.; Wan, X. Tea plant genomics: Achievements, challenges and perspectives. Hortic. Res. 2020, 7, 7–25. [Google Scholar] [CrossRef] [PubMed]
  14. Gao, N.; Ni, L.; Zhao, Z.; Ga, W. SNP double peak methods through chloroplast genome profiling for identifying Gentiana crassicaulis and two closely related species. Acta Pharm. Sin. 2022, 57, 2520–2527. [Google Scholar]
  15. Lu, Z.; Tian, W.; Yang, C. Chloroplast Genome Characteristics and Codon Usage Bias Analysis of Panax Linn. Mol. Plant Breed. 2023, 1–24. Available online: http://kns.cnki.net/kcms/detail/46.1068.S.20220720.1016.002.html (accessed on 16 September 2023).
  16. Li, Z.; Qiu, Q.; Lang, J.; Wu, Y.; Du, H.; Zhou, N. Sequence analysis of complete chloroplast genome of Dendrobium heterocarpum Lindl. and Dendrobium Trigonopus Rchb. F. Chin. Tradit. Herb. Drugs 2022, 53, 5159–5169. [Google Scholar]
  17. Cao, J.; Wang, Y.; Xi, S.; Jin, J.; Qi, R.; Yang, Y. Investigation on resources of medicinal plants Saussurea DC. in Gansu Province. J. Tradit. Chin. Vet. Med. 2016, 35, 73–75. [Google Scholar]
  18. Shang, B.; Meng, K.; Wang, Y. Karyotype Analysis of Five Saussurea Species. Acta Bot. Boreali-Occident. Sin. 2014, 34, 2220–2226. [Google Scholar]
  19. Chong, X.; Li, Y.; Yan, M.; Wang, Y.; Li, M.; Zhou, Y.; Chen, H.; Lu, X.; Zhang, F. Comparative chloroplast genome analysis of 10 Ilex species and the development of species-specific identification markers. Ind. Crops Prod. 2022, 187, 115408. [Google Scholar] [CrossRef]
  20. Fu, G.; Liu, J.; Li, J. Characterization of chloroplast genome structure and phyletic evolution of Elsholtzia densa. Chin. Tradit. Herb. Drugs 2022, 53, 1844–1853. [Google Scholar]
  21. Lu, Z.; Yu, F.; Xiao, Y.; Shi, Y.; Bi, X. Complete Chloroplast Genomes and Characteristics Analysis of Iris typhifolia. Acta Agrestia Sin. 2023, 31, 1656–1664. [Google Scholar]
  22. Deng, G.; Wu, T.; Gao, R.; Wang, M.; Liu, X.; Xiang, L. Characteristics and adaptive evolution analysis of the chloroplast genome of Gentiana rhodantha. Acta Pharm. Sin. 2022, 57, 3240–3253. [Google Scholar]
  23. Ma, D.; Ding, Q.; Zhao, Z.; Han, X.; Zheng, H. Chloroplast genome analysis of three Acanthus species reveal the adaptation of mangrove to intertidal habitats. Gene 2023, 873, 147479. [Google Scholar] [CrossRef] [PubMed]
  24. Shi, W.; Song, W.; Liu, J.; Shi, C.; Wang, S. Comparative chloroplast genome analysis of Citrus (Rutaceae) species: Insights into genomic characterization, phylogenetic relationships, and discrimination of subgenera. Sci. Hortic. 2023, 313, 111909. [Google Scholar] [CrossRef]
  25. Hao, X.; Zhao, S.; Liu, J.; Shi, R.; Wang, X.; Li, Y.; Wang, X.; Mi, F. Characteristics and Phylogenetic Analysis of Chloroplast Genome of Medicago varia. Acta Agrestia Sin. 2023, 31, 1665–1672. [Google Scholar]
  26. Carvalho, L.; Nunes, R.; Sobreiro, M.; Dias, R.; Corvalán, L.; Braga-Ferreira, R.; Targueta, C.; Telles, M. The complete chloroplast genome sequence of Eugenia klotzschiana O. Berg unveils the evolutionary dynamics in plastomes of Myrteae DC. tribe (Myrtaceae). Gene 2023, 876, 147488. [Google Scholar] [CrossRef] [PubMed]
  27. Tangphatsornruang, S.; Sangsrakru, D.; Chanprasert, J.; Uthaipaisanwong, P.; Yoocha, T.; Jomchai, N.; Tragoonrung, S. The chloroplast genome sequence of mungbean (Vigna radiata) determined by high-throughput pyrosequencing: Structural organization and phylogenetic relationships. DNA Res. 2010, 17, 11–22. [Google Scholar] [CrossRef]
  28. Poliseno, L.; Salmena, L.; Zhang, J.; Carver, B.; Haveman, W.; Pandolfi, P. A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature 2010, 465, 1033–1038. [Google Scholar] [CrossRef]
  29. Cui, Y.; Zhou, J.; Chen, X.; Xu, Z.; Wang, Y.; Sun, W.; Song, J.; Yao, H. Complete chloroplast genome and comparative analysis of three Lycium (Solanaceae) species with medicinal and edible properties. Gene Rep. 2019, 17, 100464. [Google Scholar] [CrossRef]
  30. Wang, H.; Zhang, H.; Fan, H.; Wang, X. Analysis of Chloroplast Genome Structure and Characteristics of Allium chrysanthum. Mol. Plant Breed. 2023, 1–17. Available online: http://kns.cnki.net/kcms/detail/46.1068.S.20230606.1334.010.html (accessed on 23 July 2023).
  31. Liang, F.; Wen, X.; Gao, H.; Zhang, Y. Analysis of Chloroplast Genomes Features of Asteraceae Species. Genom. Appl. Biol. 2018, 37, 5437–5447. [Google Scholar]
  32. Zheng, X.; Wang, J.; Feng, L.; Liu, S.; Pang, H.; Qi, L.; Li, J.; Sun, Y.; Qiao, W.; Zhang, L.; et al. Inferring the evolutionary mechanism of the chloroplast genome size by comparing whole-chloroplast genome sequences in seed plants. Sci. Rep. 2017, 7, 1555. [Google Scholar]
  33. Zhang, J.; Liao, M.; Cheng, Y.; Feng, Y.; Ju, W.; Deng, H.; Li, X.; Andelka Plenković-Moraj, A. Comparative Chloroplast Genomics of Seven Endangered Cypripedium Species and Phylogenetic Relationships of Orchidaceae. Front. Plant Sci. 2022, 13, 911702. [Google Scholar] [CrossRef]
  34. Wu, L.; Fan, P.; Zhou, J.; Li, Y.; Xu, Z.; Lin, Y.; Wang, Y.; Song, J.; Yao, H. Gene Losses and Homology of the Chloroplast Genomes of Taxillus and Phacellaria Species. Genes 2023, 14, 943. [Google Scholar] [CrossRef]
  35. Zhang, W.; Liu, H.; Wang, L.; Zhang, S. Chloroplast Genome Comparison and Phylogenetic Analysis of Four Medicinal Plants of Evodia. Guid. J. Tradit. Chin. Med. Pharm. 2023, 29, 44–50+65. [Google Scholar]
  36. Yang, C.; Liu, X.; Cui, Y.; Nie, L.; Lin, Y.; Wei, X.; Wang, Y.; Yao, H. Molecular structure and phylogenetic analyses of the complete chloroplast genomes of three original species of Pyrrosiae Folium. Chin. J. Nat. Med. 2020, 18, 573–581. [Google Scholar] [CrossRef] [PubMed]
  37. Wang, L.; Zhang, S.; Fang, J.; Jin, X.; Mamut, R.; Li, P. The Chloroplast Genome of the Lichen Photobiont Trebouxiophyceae sp. DW1 and Its Phylogenetic Implications. Genes 2022, 13, 1840. [Google Scholar] [CrossRef] [PubMed]
  38. Hou, Z.; Lou, X.; Li, A.; Huang, C. Analyses on characteristics of chloroplast genome and phylogeny of Thalictrum cirrhosum. J. Plant Resour. Environ. 2023, 32, 24–32. [Google Scholar]
  39. Zhao, P.; Zhao, W.; Hou, Z.; Li, X.; Liu, B.; Du, Y.; Xie, R.; Cao, Z.; Wang, K. Comparison of chloroplast genome characteristics and phylogenetic analysis between Themeda japonica and two species of Themeda. Chin. Tradit. Herb. Drugs 2023, 54, 3261–3272. [Google Scholar]
  40. Zhang, H.; Huang, T.; Zhou, Q.; Sheng, Q.; Zhu, Z. Complete Chloroplast Genomes and Phylogenetic Relationships of Bougainvillea spectabilis and Bougainvillea glabra (Nyctaginaceae). Int. J. Mol. Sci. 2023, 24, 13044. [Google Scholar] [CrossRef]
  41. Liu, Y.; Lin, L.; Yang, D.; Zou, X.; Zhang, Z.; Liu, M.; Lin, M.; Zheng, Y. Comparative phylogenetic analysis of oolong tea (Phoenix Dancong tea) using complete chloroplast genome sequences. Heliyon 2022, 8, e12557. [Google Scholar] [CrossRef]
  42. Song, Y.; Chen, Y.; Lv, J.; Xu, J.; Zhu, S.; Li, M.; Chen, N. Development of Chloroplast Genomic Resources for Oryza Species Discrimination. Front. Plant Sci. 2017, 8, 1854. [Google Scholar] [CrossRef]
  43. Song, Y.; Dong, W.; Liu, B.; Xu, C.; Yao, X.; Gao, J.; Corlett, R. Comparative Analysis of Complete Chloroplast Genome Sequences of two tropical trees Machilus yunnanensis and Machilus balansae in the family Lauraceae. Front. Plant Sci. 2015, 6, 662. [Google Scholar] [CrossRef] [PubMed]
  44. Zhao, Z.; Ma, Q.; Lou, Y.; Zhang, J.; Hu, X.; He, J.; Li, X. Comparative analysis of the chloroplast genomes of a novel albino tea “Huabai 1” and the traditional albino tea “Huangjinya”. S. Afr. J. Bot. 2022, 149, 572–581. [Google Scholar] [CrossRef]
  45. Wang, Z.; Xu, B.; Li, B.; Zhou, Q.; Wang, G.; Jiang, X.; Wang, C.; Xu, Z. Comparative analysis of codon usage patterns in chloroplast genomes of six Euphorbiaceae species. PeerJ 2020, 8, e8251. [Google Scholar] [CrossRef] [PubMed]
  46. Xu, C.; Cai, X.; Chen, Q.; Zhou, H.; Cai, Y.; Ben, A. Factors Affecting Synonymous Codon Usage Bias in Chloroplast Genome of Oncidium Gower Ramsey. Evol. Bioinform. 2011, 7, 271–278. [Google Scholar] [CrossRef] [PubMed]
  47. Shi, S.; Liu, Y.; Xia, R.; Qin, L. Comprehensive Analysis of Codon Usage in Quercus Chloroplast Genome and Focus on psbA Gene. Genes 2022, 13, 2156. [Google Scholar] [CrossRef]
  48. Rogalski, M.; Vieira, L.; Fraga, H.; Guerra, M. Plastid genomics in horticultural species: Importance and applications for plant population genetics, evolution, and biotechnology. Front. Plant Sci. 2015, 6, 586. [Google Scholar] [CrossRef]
  49. Wang, Y.; Shi, H.; Gao, H.; Wang, J.; Du, G. Pollen characteristics of twenty-eight species of Saussurea in alpine meadow of the eastern fringe of the Qinghai-tibetan Plateau. Guihaia 2008, 28, 4–14. [Google Scholar]
  50. Sugiura, M. The Chloroplast Genome; Springer: Dordrecht, The Netherlands, 1992. [Google Scholar]
Figure 1. Chloroplast genome map of S. katochaete. The gray arrows indicate the direction of gene transcription. Genes in the inner circle are transcribed in a clockwise direction, while those in the outer circle are transcribed in a counter-clockwise direction.
Figure 1. Chloroplast genome map of S. katochaete. The gray arrows indicate the direction of gene transcription. Genes in the inner circle are transcribed in a clockwise direction, while those in the outer circle are transcribed in a counter-clockwise direction.
Genes 14 02002 g001
Figure 2. Chloroplast genome map of S. superba. The gray arrows indicate the direction of gene transcription. Genes in the inner circle are transcribed in a clockwise direction, while those in the outer circle are transcribed in a counter-clockwise direction.
Figure 2. Chloroplast genome map of S. superba. The gray arrows indicate the direction of gene transcription. Genes in the inner circle are transcribed in a clockwise direction, while those in the outer circle are transcribed in a counter-clockwise direction.
Genes 14 02002 g002
Figure 3. Chloroplast genome map of S. stella. The gray arrows indicate the direction of gene transcription. Genes in the inner circle are transcribed in a clockwise direction, while those in the outer circle are transcribed in a counter-clockwise direction.
Figure 3. Chloroplast genome map of S. stella. The gray arrows indicate the direction of gene transcription. Genes in the inner circle are transcribed in a clockwise direction, while those in the outer circle are transcribed in a counter-clockwise direction.
Genes 14 02002 g003
Figure 4. Schematic representation of cis-splicing and trans-splicing in S. katochaete (A), S. superba (B), and S. stella (C).
Figure 4. Schematic representation of cis-splicing and trans-splicing in S. katochaete (A), S. superba (B), and S. stella (C).
Genes 14 02002 g004
Figure 5. Comparison of the boundaries of the LSC, SSC, and IR regions of the chloroplast genomes of 10 species of Saussurea. Genes around the border are shown above and below the main line. JLB, JSB, JSA, and JLA represent the junctions of LSC/IRb, IRb/SSC, SSC/IRa, and IRa/LSC, respectively.
Figure 5. Comparison of the boundaries of the LSC, SSC, and IR regions of the chloroplast genomes of 10 species of Saussurea. Genes around the border are shown above and below the main line. JLB, JSB, JSA, and JLA represent the junctions of LSC/IRb, IRb/SSC, SSC/IRa, and IRa/LSC, respectively.
Genes 14 02002 g005
Figure 6. Comparative genomic analyses of 10 species of Saussurea. Gray arrows indicate the gene orientation, pink indicates non-coding sequences, purple indicates exons, light blue indicates rRNAs, and white bumps indicate different regions of chloroplast genes. The x-axis delineates the aligned nucleotide positions from the S. phaeantha chloroplast genome (MT554930.1), and the y-axis displays pairwise identity percentages, spanning from 50% to 100%.
Figure 6. Comparative genomic analyses of 10 species of Saussurea. Gray arrows indicate the gene orientation, pink indicates non-coding sequences, purple indicates exons, light blue indicates rRNAs, and white bumps indicate different regions of chloroplast genes. The x-axis delineates the aligned nucleotide positions from the S. phaeantha chloroplast genome (MT554930.1), and the y-axis displays pairwise identity percentages, spanning from 50% to 100%.
Genes 14 02002 g006
Figure 7. The BLAST Atlas results of chloroplast genomes of 10 Saussurea species. The CDS regions in the reference genome were BLASTed against the CDS regions in the query genomes, and the top hits were rendered in a genome map using GView. The purple and black wave charts represent the GC content and skew. The innermost slot on the map (orange) shows the CDS regions on the reference genome. The reference was S. katochaete. From the outside to the inside, they represent S. phaeantha, S. sutchuenensis, S. bullockii, S. leucophylla, S. gossipiphora, S. depsangensis, S. apus, S. stella and S. superba.
Figure 7. The BLAST Atlas results of chloroplast genomes of 10 Saussurea species. The CDS regions in the reference genome were BLASTed against the CDS regions in the query genomes, and the top hits were rendered in a genome map using GView. The purple and black wave charts represent the GC content and skew. The innermost slot on the map (orange) shows the CDS regions on the reference genome. The reference was S. katochaete. From the outside to the inside, they represent S. phaeantha, S. sutchuenensis, S. bullockii, S. leucophylla, S. gossipiphora, S. depsangensis, S. apus, S. stella and S. superba.
Genes 14 02002 g007
Figure 8. Analysis of the nucleotide diversity in the chloroplast genomes of 10 species of Saussurea. The X-axis and Y-axis show the positions of the midpoint of a window and the pi values, respectively.
Figure 8. Analysis of the nucleotide diversity in the chloroplast genomes of 10 species of Saussurea. The X-axis and Y-axis show the positions of the midpoint of a window and the pi values, respectively.
Genes 14 02002 g008
Figure 9. Phylogenetic analysis of the Saussurea. The red pentagrams represent the three species included in this study, and the green pentagrams represent the seven closely related species. The ML tree was reconstructed by IQ-TREE v.1.6.8. The numbers next to the branches represent ML bootstrap support values. The BI tree was reconstructed by MrBayes v. 3.2.6. The numbers next to the branches represent BI probability support values.
Figure 9. Phylogenetic analysis of the Saussurea. The red pentagrams represent the three species included in this study, and the green pentagrams represent the seven closely related species. The ML tree was reconstructed by IQ-TREE v.1.6.8. The numbers next to the branches represent ML bootstrap support values. The BI tree was reconstructed by MrBayes v. 3.2.6. The numbers next to the branches represent BI probability support values.
Genes 14 02002 g009
Table 1. Chloroplast genome base composition of three Saussurea species.
Table 1. Chloroplast genome base composition of three Saussurea species.
SpeciesRegionLengthsA (%)T (%)C (%)G (%)AT (%)GC (%)
S. katochaeteTotal152,56131.0531.2718.5519.1362.3237.68
LSC83,55931.9432.2617.5718.2464.2035.80
IRA25,18128.2628.6220.7122.4156.8743.13
SSC18,64934.0734.5414.8416.5568.6131.39
IRB25,17228.6328.2522.4120.7156.8843.12
S. superbaTotal151,85631.0431.2718.5619.1362.3137.69
LSC87,75531.7932.2717.6718.2764.0635.94
IRA20,29628.0427.6921.0223.2655.7244.28
SSC18,64334.0834.5214.8416.5668.631.40
IRB20,28727.6928.0323.2521.0355.7244.28
S. stellaTotal152,36831.0531.2718.5619.1262.3237.68
LSC83,43031.9432.2617.5718.2264.235.80
IRA25,17428.2528.6220.722.4256.8843.12
SSC18,58134.0934.5214.8616.5368.6131.39
IRB25,18328.6328.2422.4320.7056.8743.13
IRA and IRB represent two inverted repeat regions; LSC represents the large single copy region; and SSC represents the small single copy region.
Table 2. Chloroplast genome annotation and classification analysis of S. katochaete, S. superba, and S. stella.
Table 2. Chloroplast genome annotation and classification analysis of S. katochaete, S. superba, and S. stella.
Gene FunctionGene CategoryGene Name
S. katochaeteS.superbaS. stella
PhotosynthesisPhotosystem IpsaC, psaB, psaI, psaA, psaJpsaA, psaJ, psaC, psaI, psaBpsaC, psaJ, psaI, psaA, psaB
Photosystem IIpsbE, psbD, psbT, psbZ, psbI, psbC, psbH, psbJ, psbA, psbN, psbM, psbB, psbK, psbFpsbM, psbZ, psbH, psbT, psbA, psbN, psbJ, psbK, psbC, psbD, psbI, psbE, psbB, psbFpsbM, psbH, psbN, psbB, psbZ, psbD, psbF, psbK, psbA, psbC, psbE, psbJ, psbI, psbT
Cytochrome b/f complexpetB *, petD *, petL, petN, petA, petGpetD *, petB *, petL, petG, petNpetN, petB *, petD *, petA, petG, petL
ATP synthaseatpA, atpE, atpH, atpF *, atpB, atpIatpF *, atpB, atpE, atpI, atpH, atpAatpH, atpE, atpB, atpI, atpF *, atpA
NADH dehydrogenasendhC, ndhH, ndhB *, ndhK, ndhA *, ndhI, ndhD, ndhG, ndhE, ndhF, ndhJndhA *, ndhJ, ndhG, ndhH, ndhE, ndhF, ndhC, ndhD, ndhI, ndhKndhH, ndhJ, ndhE, ndhG, ndhB *, ndhK, ndhF, ndhD, ndhA *, ndhC, ndhI
Self-inheritanceRibosomal proteins (SSU)rps3, rps11, rps16 *, rps12, rps4, rps14, rps18, rps8, rps7, rps19, rps15, rps2rps18, rps11, rps3, rps2, rps15, rps7, rps8, rps12, rps19, rps4, rps14, rps16 * rps3, rps2, rps19, rps4, rps18, rps14, rps11, rps15, rps12, rps16 *, rps7, rps8
Ribosomal proteins (LSU)rpl14, rpl16 *, rpl2 *, rpl36, rpl22, rpl33, rpl23, rpl20, rpl32rpl36, rpl32, rpl33, rpl20, rpl22, rpl23, rpl2 *, rpl14, rpl16 * rpl2 *, rpl23, rpl14, rpl22, rpl36, rpl16 *, rpl33, rpl20, rpl32
RNA polymeraserpoB, rpoC2, rpoC1 *, rpoArpoB, rpoC1 *, rpoA, rpoC2rpoA, rpoC2, rpoC1 *, rpoB
RubisCO large subunitrbcLrbcLrbcL
Transfer RNAstrnL-UAG, trnQ-UUG, trnS-GCU, trnA-UGC *, trnN-GUU, trnH-GUG, trnS-GGA, trnR-ACG, trnV-UAC, trnV-GAC, trnR-UCU, trnD-GUC, trnM-CAU, trnE-UUC, trnC-GCA, trnI-CAU, trnL-CAA, trnK-UUU *, trnG-GCC, trnG-UCC, trnL-UAA *, trnI-GAU *, trnF-GAA, trnfM-CAU, trnT-GGU, trnP-UGG, trnT-UGU, trnW-CCA, trnY-GUAtrnI-CAU, trnA-UGC *, trnT-GGU, trnG-GCC, trnL-CAA, trnL-UAG, trnI-GAU *, trnQ-UUG, trnV-UAC, trnfM-CAU, trnD-GUC, trnK-UUU *, trnN-GUU, trnP-UGG, trnR-UCU, trnV-GAC, trnM-CAU, trnR-ACG, trnC-GCA, trnY-GUA, trnS-GCU, trnW-CCA, trnL-UAA *, trnE-UUC, trnG-UCC, trnT-UGU, trnF-GAA, trnS-GGAtrnW-CCA, trnR-UCU, trnfM-CAU, trnG-GCC, trnL-CAA, trnK-UUU *, trnE-UUC, trnD-GUC, trnM-CAU, trnP-UGG, trnV-GAC, trnT-GGU, trnN-GUU, trnF-GAA, trnS-GCU, trnY-GUA, trnQ-UUG, trnS-GGA, trnG-UCC, trnR-ACG, trnI-GAU *, trnI-CAU, trnT-UGU, trnC-GCA, trnA-UGC *, trnL-UAG, trnV-UAC, trnL-UAA *
Ribosomal RNAsrrn16S, rrn23S, rrn4.5S, rrn5Srrn4.5S, rrn23S, rrn5S, rrn16Srrn16S, rrn23S, rrn5S, rrn4.5S
Other genesProteaseclpP ** clpP ** clpP **
MaturasematKmatKmatK
Translational initiation factorinfAinfAinfA
Other genesaccD, cemA, ccsAaccD, ccsA, cemAccsA, cemA, accD
Genes of unknown functionHypothetical chloroplast reading frames (ycf)ycf4, ycf3 **, ycf2, ycf15, ycf1ycf3 **, ycf1, ycf4, ycf15ycf4, ycf15, ycf1, ycf3 **
* Gene with one intron; ** gene with two introns.
Table 3. The lengths of the introns and exons for the splitting genes of the chloroplast genome of S. katochaete.
Table 3. The lengths of the introns and exons for the splitting genes of the chloroplast genome of S. katochaete.
GeneStrandStartEndExon IIntron IExon IIIntron IIExon III
trnK-UUU1738427138246036
rps165058617440850227
rpoC1+15,65918,4574327291638
atpF+26,35127,612145707410
trnS-CGA29,67930,4613169260
ycf341,64843,594124698230742153
trnL-UAA+46,47747,0213546050
clpP69,36871,39371808294627226
petB+74,32575,7446772642
petD+75,93377,1148699475
rpl1680,57981,99991013399
rpl283,66685,155391665434
ndhB93,75495,956777670756
trnI-GAU+101,488102,5003394040
trnA-UGC+102,565103,4583782136
ndhA118,605120,7555531059539
trnA-UGC132,680133,5733782136
trnI-GAU133,638134,6503394040
ndhB+140,182142,384777670756
rpl2+150,983152,472391665434
Table 4. The lengths of the introns and exons for the splitting genes of the chloroplast genome of S. superba.
Table 4. The lengths of the introns and exons for the splitting genes of the chloroplast genome of S. superba.
GeneStrandStartEndExon IIntron IExon IIIntron IIExon III
trnK-UUU1738427237246236
rps165059617542862213
rpoC1+15,58718,3854307291640
atpF+26,28127,542145707410
ycf341,56443,511124699230742153
trnL-UAA+46,39346,9373745850
trnV-UAC50,68851,3363857437
clpP69,27671,30271812291627226
petB+74,23475,6526771642
petD+75,84177,0228699475
rpl1680,48881,90891013399
rpl283,57585,064397662431
ndhB93,66395,865777670756
trnI-GAU+101,397102,4094393535
trnA-UGC+102,474103,3673882135
ndhA118,509120,6595531059539
trnA-UGC132,584133,4773882135
trnI-GAU133,542134,5544393535
ndhB+140,086142,288777670756
rpl2+150,887152,376397662431
Table 5. The lengths of the introns and exons for the splitting genes of S. stella.
Table 5. The lengths of the introns and exons for the splitting genes of S. stella.
GeneStrandStartEndExon IIntron IExon IIIntron IIExon III
trnK-UUU1764429438245736
rps165080619640850227
rpoC1+15,61018,4064327271638
atpF+26,31327,574145707410
trnS-CGA29,64030,4223169260
ycf341,62743,573124698230742153
trnL-UAA+46,48747,0133544250
clpP69,26871,29171806294627226
petB+74,22375,6496779642
petD+75,83877,0198699475
rpl1680,48181,90591017399
rpl283,57285,061391665434
ndhB93,66995,871777670756
trnI-GAU+101,403102,4143393940
trnA-UGC+102,479103,3723782136
ndhA118,456120,6055531058539
trnA-UGC132,528133,4213782136
trnI-GAU133,486134,4973393940
ndhB+140,029142,231777670756
rpl2+150,839152,328391665434
Table 6. Statistics for the long repetitive sequences in the chloroplast genomes of S. katochaete, S. superba, and S. stella.
Table 6. Statistics for the long repetitive sequences in the chloroplast genomes of S. katochaete, S. superba, and S. stella.
S. katochaeteS. superbaS. stella
TypeLengthTypeLengthTypeLength
F46P1222P162
P433P1581P702
P2813P425P2379
P4344P656P1590
P811P329P2898
P2161P100P442
P2917P558P658
P527P776P1747
P2340P844P2720
P795P510P90
P266P113P1692
P436P2858P419
P45P1189P1085
P863P691P829
P1001P316P176
P34P527P311
P31P1475P863
P39P612P47
P1982P36P776
P1675P52P1004
P1710P39P323
P42P735P935
P48P1141P1676
P34P358P474
P30P500P1250
P482P42
P3475P48
P39P34
P501P30
P1194
P813
P438
P42
P48
P34
P30
“F” for forward repeat, and “P” for palindromic repeat.
Table 7. SSR analysis of the chloroplast genomes of S. katochaete, S. superba, and S. stella.
Table 7. SSR analysis of the chloroplast genomes of S. katochaete, S. superba, and S. stella.
KindRepetitive SequenceS. katochaeteS. superbaS. stella
QuantityTotalQuantityTotalQuantityTotal
MononucleotideA/T109111106107107110
C/G2 1 3
DinucleotideAA/TT109153106149107152
AC/GT2 2 2
AG/CT19 19 19
AT/AT20 20 20
CC/GG2 1 3
CG/CG1 1 1
TrinucleotideAAA/TTT10151317814
AAG/CTT1 1 1
AAT/ATT3 3 4
CCC/GGG1 / 1
TetranucleotideAAAA/TTTT10211322818
AAAC/GTTT1 1 1
AAAG/CTTT2 2 2
AAAT/ATTT3 2 3
AATC/ATTG2 2 1
ATAT/ATAT2 2 2
CCCC/GGGG1 / 1
PentanucleotideAAAAA/TTTTT344523
ACTAT/AGTAT1 1 1
HexanucleotideAAAAAA/TTTTTT/11323
AATAGG/ATTCCT1 1 1
AGCAGG/CCTGCT/ 1 /
305 303 300
“/” means no such repeat type.
Table 8. SSRs in the chloroplast genome of S. katochaete.
Table 8. SSRs in the chloroplast genome of S. katochaete.
IDSSR nr.SSR TypeSSRSizeStartEnd
OQ596208.1_Saussurea_katochaete1p4(ATAA)31219561967
OQ596208.1_Saussurea_katochaete2p1(A)101043404349
OQ596208.1_Saussurea_katochaete3p1(C)141453125325
OQ596208.1_Saussurea_katochaete4p1(A)101082398248
OQ596208.1_Saussurea_katochaete5p1(A)101094259434
OQ596208.1_Saussurea_katochaete6p1(A)101013,06213,071
OQ596208.1_Saussurea_katochaete7p1(A)101018,10618,115
OQ596208.1_Saussurea_katochaete8p2(TA)51018,32918,338
OQ596208.1_Saussurea_katochaete9p2(AT)51019,32519,334
OQ596208.1_Saussurea_katochaete10p6(TATTCC)31821,28921,306
OQ596208.1_Saussurea_katochaete11p1(T)101022,92322,932
OQ596208.1_Saussurea_katochaete12p1(T)101025,71125,720
OQ596208.1_Saussurea_katochaete13p2(AT)61226,26926,280
OQ596208.1_Saussurea_katochaete14p3(TAA)41230,74330,754
OQ596208.1_Saussurea_katochaete15p5(ACTAT)42030,76630,785
OQ596208.1_Saussurea_katochaete16p3(TTC)41234,22134,232
OQ596208.1_Saussurea_katochaete17p1(A)151534,40834,422
OQ596208.1_Saussurea_katochaete18p1(T)111134,42834,438
OQ596208.1_Saussurea_katochaete19p1(A)141443,70443,717
OQ596208.1_Saussurea_katochaete20p1(T)111146,14746,157
OQ596208.1_Saussurea_katochaete21p1(T)121249,80949,820
OQ596208.1_Saussurea_katochaete22p1(T)131354,16254,174
OQ596208.1_Saussurea_katochaete23p3(AAT)41258,45758,468
OQ596208.1_Saussurea_katochaete24p1(T)101058,80958,818
OQ596208.1_Saussurea_katochaete25p1(T)111162,71662,726
OQ596208.1_Saussurea_katochaete26p1(T)171764,54164,557
OQ596208.1_Saussurea_katochaete27p1(A)101064,97164,980
OQ596208.1_Saussurea_katochaete28p2(TA)71467,29667,309
OQ596208.1_Saussurea_katochaete29p4(TATT)31267,35467,365
OQ596208.1_Saussurea_katochaete30p1(A)101068,57268,581
OQ596208.1_Saussurea_katochaete31p3(ATA)41270,64170,652
OQ596208.1_Saussurea_katochaete32p1(T)101070,93270,941
OQ596208.1_Saussurea_katochaete33p1(A)121277,28077,291
OQ596208.1_Saussurea_katochaete34p1(T)101077,52877,537
OQ596208.1_Saussurea_katochaete35p1(T)151579,96079,974
OQ596208.1_Saussurea_katochaete36p1(T)101080,54980,558
OQ596208.1_Saussurea_katochaete37p4(TTTC)31281,74281,753
OQ596208.1_Saussurea_katochaete38p1(A)1212107,033107,044
OQ596208.1_Saussurea_katochaete39p1(T)1313107,495107,507
OQ596208.1_Saussurea_katochaete40p4(TTTA)312111,711111,722
OQ596208.1_Saussurea_katochaete41p4(AGAA)312119,547119,558
OQ596208.1_Saussurea_katochaete42p4(AATC)312119,954119,965
OQ596208.1_Saussurea_katochaete43p4(GATT)312124,374124,385
OQ596208.1_Saussurea_katochaete44p4(CAAA)312127,060127,071
OQ596208.1_Saussurea_katochaete45p1(A)1313128,631128,643
OQ596208.1_Saussurea_katochaete46p1(T)1212129,094129,105
Table 9. SSRs in the chloroplast genome of S. superba.
Table 9. SSRs in the chloroplast genome of S. superba.
IDSSR nr.SSR TypeSSRSizeStartEnd
OQ596209.1_Saussurea_superba1c(ATAA)3(T)102219561977
OQ596209.1_Saussurea_superba2p1(T)101037893798
OQ596209.1_Saussurea_superba3p1(C)141453135326
OQ596209.1_Saussurea_superba4p1(A)101082308239
OQ596209.1_Saussurea_superba5p1(A)101012,99012,999
OQ596209.1_Saussurea_superba6p1(A)101018,03418,043
OQ596209.1_Saussurea_superba7p2(TA)51018,25718,266
OQ596209.1_Saussurea_superba8p2(AT)51019,25319,262
OQ596209.1_Saussurea_superba9p6(TATTCC)31821,21721,234
OQ596209.1_Saussurea_superba10p1(T)121222,85122,862
OQ596209.1_Saussurea_superba11p1(T)101025,64125,650
OQ596209.1_Saussurea_superba12p2(AT)71426,19926,212
OQ596209.1_Saussurea_superba13p3(TAA)41230,67330,684
OQ596209.1_Saussurea_superba14p5(ACTAT)31530,69630,710
OQ596209.1_Saussurea_superba15p3(TTC)41234,14634,157
OQ596209.1_Saussurea_superba16p1(A)141434,33334,346
OQ596209.1_Saussurea_superba17p1(T)101034,35234,361
OQ596209.1_Saussurea_superba18p1(A)101043,36743,376
OQ596209.1_Saussurea_superba19p1(A)151543,62143,635
OQ596209.1_Saussurea_superba20p1(T)131349,71749,729
OQ596209.1_Saussurea_superba21p1(T)131354,07154,083
OQ596209.1_Saussurea_superba22p6(CCTGCT)31858,25058,267
OQ596209.1_Saussurea_superba23p3(AAT)41258,37258,383
OQ596209.1_Saussurea_superba24p1(T)101058,71758,726
OQ596209.1_Saussurea_superba25p1(T)101062,62462,633
OQ596209.1_Saussurea_superba26p1(T)171764,44864,464
OQ596209.1_Saussurea_superba27p1(A)131364,87564,887
OQ596209.1_Saussurea_superba28p2(TA)71467,20367,216
OQ596209.1_Saussurea_superba29p4(TATT)31267,26167,272
OQ596209.1_Saussurea_superba30p1(A)111168,47968,489
OQ596209.1_Saussurea_superba31p3(ATA)41270,54970,560
OQ596209.1_Saussurea_superba32p1(T)111170,84070,850
OQ596209.1_Saussurea_superba33p1(A)121277,18977,200
OQ596209.1_Saussurea_superba34p1(T)101077,43677,445
OQ596209.1_Saussurea_superba35p1(T)161679,86879,883
OQ596209.1_Saussurea_superba36p1(T)101080,45880,467
OQ596209.1_Saussurea_superba37p4(TTTC)31281,65181,662
OQ596209.1_Saussurea_superba38p1(A)1212106,942106,953
OQ596209.1_Saussurea_superba39p1(T)1414107,404107,417
OQ596209.1_Saussurea_superba40p4(AGAA)312119,451119,462
OQ596209.1_Saussurea_superba41p4(AATC)312119,858119,869
OQ596209.1_Saussurea_superba42p4(GATT)312124,278124,289
OQ596209.1_Saussurea_superba43p4(CAAA)312126,964126,975
OQ596209.1_Saussurea_superba44p1(A)1313128,535128,547
OQ596209.1_Saussurea_superba45p1(T)1212128,998129,009
Table 10. SSRs in the chloroplast genome of S. stella.
Table 10. SSRs in the chloroplast genome of S. stella.
IDSSR nr.SSR TypeSSRSizeStartEnd
OQ596210.1_Saussurea_stella1p1(C)141453345347
OQ596210.1_Saussurea_stella2p1(A)111183438353
OQ596210.1_Saussurea_stella3p1(A)111194449454
OQ596210.1_Saussurea_stella4p1(A)101013,01313,022
OQ596210.1_Saussurea_stella5p1(A)101018,05518,064
OQ596210.1_Saussurea_stella6p2(TA)51018,27818,287
OQ596210.1_Saussurea_stella7p2(AT)51019,27419,283
OQ596210.1_Saussurea_stella8p6(TATTCC)31821,23821,255
OQ596210.1_Saussurea_stella9p1(T)111122,87222,882
OQ596210.1_Saussurea_stella10p1(T)101025,67325,682
OQ596210.1_Saussurea_stella11p2(AT)61226,23126,242
OQ596210.1_Saussurea_stella12p3(TAA)41230,70430,715
OQ596210.1_Saussurea_stella13p5(ACTAT)31530,72730,741
OQ596210.1_Saussurea_stella14p3(TTC)41234,17734,188
OQ596210.1_Saussurea_stella15p1(A)101034,36434,373
OQ596210.1_Saussurea_stella16p1(A)131335,31935,331
OQ596210.1_Saussurea_stella17p1(A)181843,68143,698
OQ596210.1_Saussurea_stella18p1(T)101045,79445,803
OQ596210.1_Saussurea_stella19p1(T)101046,15846,167
OQ596210.1_Saussurea_stella20p1(T)101049,72949,738
OQ596210.1_Saussurea_stella21p1(T)141454,05854,071
OQ596210.1_Saussurea_stella22p3(AAT)41258,36958,380
OQ596210.1_Saussurea_stella23p1(T)111158,72158,731
OQ596210.1_Saussurea_stella24p1(T)141464,44364,456
OQ596210.1_Saussurea_stella25p2(TA)61267,19367,204
OQ596210.1_Saussurea_stella26p4(TATT)31267,24967,260
OQ596210.1_Saussurea_stella27p3(ATA)41270,54070,551
OQ596210.1_Saussurea_stella28p3(AAT)41274,67574,686
OQ596210.1_Saussurea_stella29p1(T)101077,43177,440
OQ596210.1_Saussurea_stella30p1(T)141479,86379,876
OQ596210.1_Saussurea_stella31p1(T)101080,45180,460
OQ596210.1_Saussurea_stella32p4(TTTC)31281,64481,655
OQ596210.1_Saussurea_stella33p1(T)101081,68281,691
OQ596210.1_Saussurea_stella34p1(A)1212106,947106,958
OQ596210.1_Saussurea_stella35p1(T)1313107,409107,421
OQ596210.1_Saussurea_stella36p4(TTTA)312111,628111,639
OQ596210.1_Saussurea_stella37p4(ATAA)312117,553117,564
OQ596210.1_Saussurea_stella38p4(AGAA)312119,398119,409
OQ596210.1_Saussurea_stella39p4(GATT)312124,222124,233
OQ596210.1_Saussurea_stella40p1(T)1212124,837124,848
OQ596210.1_Saussurea_stella41p4(CAAA)312126,908126,919
OQ596210.1_Saussurea_stella42p1(A)1313128,479128,491
OQ596210.1_Saussurea_stella43p1(T)1212128,942128,953
Table 11. Codon usage bias analysis of the chloroplast genomes of S. katochaete, S. superba, and S. stella.
Table 11. Codon usage bias analysis of the chloroplast genomes of S. katochaete, S. superba, and S. stella.
CodonAmino AcidS. katochaeteS. superbaS. stella
QuantityTotalRSCUQuantityTotalRSCUQuantityTotalRSCU
UAA*29541.6129521.6729541.61
UAG*13 0.7212 0.6914 0.78
UGA*12 0.6711 0.6311 0.61
GCUA (Ala)53912101.7849411131.7850911541.76
GCCA (Ala)186 0.61174 0.63185 0.64
GCAA (Ala)349 1.15313 1.12328 1.14
GCGA (Ala)136 0.45132 0.47132 0.46
UGUC (Cys)1662301.441411891.491482031.46
UGCC (Cys)64 0.5648 0.5155 0.54
GAUD (Asp)6888611.65607101.585747251.58
GACD (Asp)173 0.4150 0.42151 0.42
GAAE (Glu)83711251.497319521.547529811.53
GAGE (Glu)288 0.51221 0.46229 0.47
UUUF (Phe)80912041.347029881.4270710081.4
UUCF (Phe)395 0.66286 0.58301 0.6
GGUG (Gly)49114971.3145613451.3647414101.34
GGCG (Gly)179 0.48166 0.49173 0.49
GGAG (Gly)572 1.53500 1.49523 1.48
GGGG (Gly)255 0.68223 0.66240 0.68
CAUH (His)3965101.553474461.563544521.57
CACH (His)114 0.4599 0.4498 0.43
AUUI (Ile)89618281.4777915611.581116921.49
AUCI (Ile)350 0.57284 0.55302 0.56
AUAI (Ile)582 0.96498 0.96516 0.95
AAAK (Lys)85511211.537499481.587769921.56
AAGK (Lys)266 0.47199 0.42216 0.44
UUAL (Leu)72922461.9566218942.166419422.05
UUGL (Leu)466 1.24388 1.23396 1.22
CUUL (Leu)481 1.28394 1.25406 1.25
CUCL (Leu)131 0.3596 0.3105 0.32
CUAL (Leu)299 0.8246 0.78257 0.79
CUGL (Leu)140 0.37108 0.34114 0.35
AUGM (Met)502502143543514424421
AAUN (Asn)80710271.576778541.597089001.57
AACN (Asn)220 0.43177 0.41192 0.43
CCUP (Pro)3438931.543007661.573218031.6
CCCP (Pro)159 0.71134 0.7140 0.7
CCAP (Pro)259 1.16218 1.14227 1.13
CCGP (Pro)132 0.59114 0.6115 0.57
CAAQ (Gln)5927661.555136491.585296771.56
CAGQ (Gln)174 0.45136 0.42148 0.44
CGUR (Arg)29512741.3928011181.528711481.5
CGCR (Arg)83 0.3971 0.3873 0.38
CGAR (Arg)284 1.34250 1.34262 1.37
CGGR (Arg)88 0.4176 0.4174 0.39
AGAR (Arg)385 1.81334 1.79344 1.8
AGGR (Arg)139 0.65107 0.57108 0.56
UCUS (Ser)47015981.7639312931.8239813351.79
UCCS (Ser)244 0.92181 0.84196 0.88
UCAS (Ser)319 1.2256 1.19259 1.16
UCGS (Ser)138 0.52102 0.47108 0.49
AGUS (Ser)333 1.25290 1.35294 1.32
AGCS (Ser)94 0.3571 0.3380 0.36
ACUT (Thr)43910721.643889421.654009761.64
ACCT (Thr)203 0.76185 0.79195 0.8
ACAT (Thr)335 1.25296 1.26304 1.25
ACGT (Thr)95 0.3573 0.3177 0.32
GUUV (Val)42211701.4437810401.4538610831.43
GUCV (Val)144 0.49124 0.48129 0.48
GUAV (Val)447 1.53407 1.57428 1.58
GUGV (Val)157 0.54131 0.5140 0.52
UGGW (Trp)387387133433413373371
UAUY (Tyr)6718331.615747121.615897351.6
UACY (Tyr)162 0.39138 0.39146 0.4
21,408 18,341 18,986
“*” denotes a termination codon.
Table 12. Plant samples used in this study.
Table 12. Plant samples used in this study.
SpeciesSubtribeTribeOriginAccession Number
C. tinctoriusCentaureinaeTrib. CynareaeNCBINC030783.1
C. cyanusNCBINC066898.1
A. lanceaCarlininaeNCBIMT834522.1
C. acaulisNCBINC063929.1
A. lappaCarduinaeNCBIMH375874.1
C. crispusNCBIMK652229.1
C. japonicumNCBINC053767.1
C. cardunculus var. scolymusNCBIKM035764.1
D. calophyllaNCBIMT128668.1
S. katochaeteQinghai, ChinaOQ596208.1
S. superbaQinghai, ChinaOQ596209.1
S. stellaQinghai, ChinaOQ596210.1
S. salicifoliaNCBINC057621.1
S. medusaNCBIMZ128902.1
S. sutchuenensisNCBIMK966029.1
S. apusNCBIMH926074.1
S. depsangensisNCBIMH926091.1
S. gossipiphoraNCBIMH926100.1
S. phaeanthaNCBIMT554930.1
S. inversaNCBINC050977.1
S. albifoliaNCBIMT478053.1
S. salsaNCBIMH926157
S. oligocephalaNCBINC067031.1
S. brachycephalaNCBIMH926079.1
S. amurensisNCBIMH926072.1
S. bullockiiNCBIMK966027.1
S. leucophyllaNCBIMH926118.1
S. tomentosaNCBIMH926177.1
S. komarovianaNCBIMH926112.1
H. annuus/Trib. HeliantheaeNCBINC007977.1
The three species included in this study are bolded.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

He, H.; Wang, T.; Tang, C.; Cao, Z.; Pu, X.; Li, Y.; Li, X. Complete Chloroplast Genomes of Saussurea katochaete, Saussurea superba, and Saussurea stella: Genome Structures and Comparative and Phylogenetic Analyses. Genes 2023, 14, 2002. https://doi.org/10.3390/genes14112002

AMA Style

He H, Wang T, Tang C, Cao Z, Pu X, Li Y, Li X. Complete Chloroplast Genomes of Saussurea katochaete, Saussurea superba, and Saussurea stella: Genome Structures and Comparative and Phylogenetic Analyses. Genes. 2023; 14(11):2002. https://doi.org/10.3390/genes14112002

Chicago/Turabian Style

He, Hui, Tao Wang, Chuyu Tang, Zhengfei Cao, Xiaojian Pu, Yuling Li, and Xiuzhang Li. 2023. "Complete Chloroplast Genomes of Saussurea katochaete, Saussurea superba, and Saussurea stella: Genome Structures and Comparative and Phylogenetic Analyses" Genes 14, no. 11: 2002. https://doi.org/10.3390/genes14112002

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop