Evolutionary dynamics of the chloroplast genome sequences of six Colobanthus species

Androsiuk, Piotr; Jastrzębski, Jan Paweł; Paukszto, Łukasz; Makowczenko, Karol; Okorski, Adam; Pszczółkowska, Agnieszka; Chwedorzewska, Katarzyna Joanna; Górecki, Ryszard; Giełwanowska, Irena

doi:10.1038/s41598-020-68563-5

Article
Open access
Published: 13 July 2020

Piotr Androsiuk¹,
Jan Paweł Jastrzębski¹,
Łukasz Paukszto¹,
Karol Makowczenko¹,
Adam Okorski²,
Agnieszka Pszczółkowska²,
Katarzyna Joanna Chwedorzewska³,
Ryszard Górecki¹ &
Irena Giełwanowska¹

Scientific Reports volume 10, Article number: 11522 (2020)

Abstract

The complete plastome sequences of six species were sequenced to better understand the evolutionary relationships and mutation patterns in the chloroplast genome of the genus Colobanthus. The length of the chloroplast genome sequences of C. acicularis, C. affinis, C. lycopodioides, C. nivicola, C. pulvinatus and C. subulatus ranged from 151,050 to 151,462 bp. The quadripartite circular structure of these genome sequences has the same overall organization and gene content with 73 protein-coding genes, 30 tRNA genes, four rRNA genes and five conserved chloroplast open reading frames. A total of 153 repeat sequences were revealed. Forward repeats were dominant, whereas complementary repeats were found only in C. pulvinatus. The mononucleotide SSRs composed of A/T units were most common, and hexanucleotide SSRs were detected least often. Eleven highly variable regions which could be utilized as potential markers for phylogeny reconstruction, species identification or phylogeography were identified within Colobanthus chloroplast genomes. Seventy-three protein-coding genes were used in phylogenetic analyses. Reconstructed phylogeny was consistent with the systematic position of the studied species, and the representatives of the same genus were grouped in one clade. All studied Colobanthus species formed a single group and C. lycopodioides was least similar to the remaining species.

Introduction

The genus Colobanthus in the family Caryophyllaceae contains 26 species¹. Most species are found in the Southern Hemisphere, and the greatest diversity is observed in New Zealand². Colobanthus species are low-growing perennials with a cushion growth habit, narrow and dense leaves, and inconspicuous, solitary, greenish flowers without petals, but with four to six prominent sepals^3,4. The relationships within the family Caryophyllaceae are not easy to elucidate, partly due to arbitrarily and poorly defined genera and difficulties in determining phylogenetically useful morphological characters⁵. Colobanthus is one of the least studied genera, and it is sometimes confused with the related genus Sagina (Caryophyllaceae)⁶. Many areas where Colobanthus species occur are under protection, and some of them, such as the cold-temperate South Pacific Islands, have world heritage status⁷. Moreover, in contrast to the widespread species C. quitensis, C. affinis and C. apetalus, taxa such as C. strictus, C. squarrosus and C. curtisiae (recorded in only three Tasmanian populations)^8,9,10 or C. nivicola (endemic to the alpine tract of the Mt Kosciusko area in New South Wales, Australia)¹¹ are extremely rare and are on the Australian list of rare or threatened plant species¹². In many areas of Australia where the habitats of Colobanthus species overlap, species such as C. nivicola and C. pulvinatus may be difficult to distinguish in the field¹¹. Therefore, a precise identification of these species is necessary.

The members of the genus Colobanthus are extremely rarely studied, excluding Antarctic pearlwort (Colobanthus quitensis (Kunth) Bartl) which rose to fame as the only native representative of Magnoliopsida in the maritime Antarctic¹³. Colobanthus quitensis has been intensively studied to explore the traits responsible for its high tolerance to extreme Antarctic conditions^{14,15,16,17,18,19}. Despite the above, our knowledge of the genetic diversity of this species and the entire genus Colobanthus remains limited^{20,21,22,23,24}. Only two papers presented the complete sequence of the chloroplast genomes of C. quitensis²⁵ and C. apetalus²⁶, whereas the size of the nuclear genome in C. quitensis was estimated by flow cytometry in only one study²⁷.

In recent years, chloroplast genome sequences have attracted significant interest in plant phylogenetics, phylogeography and molecular evolution research²⁸. Chloroplast genome sequences have numerous advantages, including low molecular weight, simple structure, uniparental (generally maternal) mode of inheritance, haploidy, highly conserved structure and a slower evolutionary rate of change than nuclear genomes. For this reason, chloroplast genomes are a valuable and relatively easy-to-handle source of molecular data that supports the validation of complex evolutionary relationships and detailed phylogenetic analyses at group, family or even genus level^29,30,31,32. Chloroplast sequences also have numerous applications in biotechnology^33,34 and the development of molecular markers for identifying species and distinguishing morphologically similar species^35,36,37. A high number of new chloroplast genomes have been reported ever since the complete chloroplast genome sequences of Nicotiana tabacum³⁸ and Marchantia polymorpha³⁹ were published in 1986. This phenomenal progress was made possible by the development of new high-throughput genome sequencing technologies which enable scientists to obtain high quality cp genome sequences in a more convenient and relatively inexpensive way. As a result, around 3,500 chloroplast genomes have been deposited in the database of the National Center for Biotechnology Information (NCBI). It has been recently proposed that the whole chloroplast genome sequence should be used as a universal super-barcode in the identification of plant species. This approach may overcome the limitations of the traditional two-locus barcode based mainly on sequence variation within two plastome regions (rbcL and matK) that is not always sufficient for species discrimination⁴⁰. In Colobanthus, the development of cpDNA markers has so far been restricted to the following loci: matK, trnK (C. masonae, C. affinis⁴¹), matK, trnK, trnL, trnL–trnF, trnF, rps16 (C. muscoides; unpublished data, available in NCBI) and ndhF (C. brevisepalus⁴²). Complete chloroplast genome sequences are available only for C. quitensis²⁵ and C. apetalus²⁶. The published data indicate that the Colobanthus cp genome is typical for angiosperms in terms of size (151,276 bp for C. quitensis and 151,228 bp for C. apetalus) and composition (112 genes). This genome has a conserved quadripartite circular structure with a large single copy (LSC) region, a small single copy (SSC) region and two copies of inverted repeat (IR) regions^43,44. The complete sequences of the cp genome of C. quitensis and C. apetalus provide molecular data that are essential for advanced genomic studies. However, the genomic data for other members of the genus Colobanthus are still limited.

The complete chloroplast genomes of six Colobanthus species have been sequenced and annotated for the first time in this study. The comparative study of these six chloroplast genomes as well as the previously published cp genomes for C. quitensis and C. apetalus had the following goals: (1) to determine the size and structure of Colobanthus cp genomes, (2) to identify genomic repeats, including forward, reverse, palindromic and complementary sequences among Colobanthus genomes, (3) to compare the variation of simple sequence repeats (SSRs) among Colobanthus cp genomes, (4) to verify the phylogenetic relationships among Colobanthus species and other Caryophyllaceae species for which complete chloroplast genomes are available.

Results

Organization of chloroplast genomes

Six Colobanthus species were sequenced to produce 1,986,760–7,849,218 raw reads (average read length of 150 bp), which were mapped separately to the reference genome of C. quitensis. A total of 210,882–414,160 reads were ultimately mapped with 202.7× to 396.5× coverage (Table 1). The six Colobanthus cp genome sequences were deposited in GenBank under the following accession numbers: MN273320 for C. acicularis, MN273318 for C. affinis, MN273317 for C. lycopodioides, MN273316 for C. nivicola, MN273315 for C. pulvinatus, and MN273319 for C. subulatus. The chloroplast genome sequences described in this study ranged from 151,050 (C. acicularis) to 151,462 bp (C. lycopodioides). Each chloroplast genome was assembled into a single circular, double-stranded DNA sequence. All plastomes displayed a typical quadripartite structure with a pair of IRs (25,309–25,326 bp) separated by SSC (17,177–17,256 bp) and LSC (83,198–83,645 bp) regions (Fig. 1). The overall GC content was 36.61–36.66% and it was nearly identical in all Colobanthus cp genomes (Table 1). All of the analyzed Colobanthus cp genomes contained an identical set of 112 genes composed of 73 protein-coding genes, 30 tRNA genes, four rRNA genes and five conserved chloroplast ORFs (ycf1, ycf2, ycf3, ycf4, ycf68) (Table 2). Fifty-eight protein-coding genes, 22 tRNA genes and two conserved chloroplast ORFs (ycf3 and ycf4) were located in the LSC region, whereas the SSC region contained 11 protein-coding genes and one tRNA gene. The IR region contained four rRNA genes, seven tRNA genes and eight protein-coding genes, including ycf2, ycf68 and ycf1 on the border between IR_A/IR_B and SSC. The full ycf1 sequence is located on the IR_A/SSC border, and its incomplete copy on the IR_B/SSC border acts as a pseudogene. Fifteen genes contained one intron (atpF, ndhA, ndhB, petB, petD, rpl16, rpoC1, rps16, trnI-GAU, trnA-UGC, trnK-UUU, trnG-UCC, trnL-UAA, trnV-UAC, ycf3), and two genes consisted of three exons (rps12 and clpP). The first exon of rps12 (5′ end of the sequence) was found in the LSC region, and the remaining two exons were located in the IR region. This unique feature supported the identification of rps12 as a trans-spliced gene. The introns of two further genes, trnK-UUU and trnI-GAU, include coding sequences for matK and ycf68, respectively.
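The relationship between region sizes and total plastome length can be illustrated with a short Python sketch. The function names and the example region sizes are hypothetical; the sizes were chosen inside the ranges reported above and are not the per-species values from Table 1. The depth estimate is only a rough approximation, because real mapped reads are often shorter than the nominal read length after trimming.

```python
def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


def mean_depth(mapped_reads: int, read_length: int, genome_length: int) -> float:
    """Approximate mean coverage depth as mapped bases divided by genome length."""
    return mapped_reads * read_length / genome_length


# Hypothetical region sizes (bp) inside the reported ranges, for illustration only.
total = plastome_length(lsc=83_400, ssc=17_200, ir=25_320)
print(f"total length: {total} bp")

# Hypothetical read count; an upper-bound style estimate of depth.
print(f"approximate depth: {mean_depth(300_000, 150, total):.1f}x")
```

Because the IR is present in two copies, a one-base change in IR length changes the total length by two bases, which is one reason why IR expansion and contraction affect plastome size.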

Table 1 Summary of chloroplast genome characteristics of Colobanthus.

Table 2 Genes present in six chloroplast genomes of Colobanthus species.

The total number of codons for all protein-coding genes in the cp genomes of six Colobanthus species ranged from 25,159 to 26,162. The most and least abundant codons (excluding those associated with the initiation and termination of translation) were ATT (4.33%) and TGC (0.25%), respectively (Supplementary Table S1 online). Furthermore, leucine was the dominant amino acid (10.7%), whereas cysteine was the least frequently encountered (1.2%). Since codon usage data were not available for the previously published plastomes of C. quitensis and C. apetalus, these species were also included in the analysis. Both species shared the same pattern of codon usage and amino acid frequency.
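As a minimal illustration of how codon frequencies such as those above can be tabulated, the following Python sketch counts in-frame codons in a coding sequence. The function name and the toy sequence are hypothetical, and this is not the software used in the study.

```python
from collections import Counter


def codon_frequencies(cds: str) -> dict:
    """Return the percentage frequency of each codon in an in-frame coding sequence.

    Trailing bases that do not form a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    total = sum(counts.values())
    return {codon: 100 * n / total for codon, n in counts.items()}


# Toy sequence for illustration only: ATG ATT ATT TGC
for codon, pct in sorted(codon_frequencies("ATGATTATTTGC").items()):
    print(codon, f"{pct:.1f}%")
```

In a real analysis, the coding sequences of all protein-coding genes would be concatenated or summed per genome, and start and stop codons would be excluded before ranking, as described in the text.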

The boundaries between IR and SSC/LSC regions in all Colobanthus cp genomes were identified (Fig. 2). The IR_A/SSC junction was found within the ycf1 gene (1819 bp from its 5′ end), and the boundary between IR_B and LSC region was identified within the rps19 gene (161 bp from its 5′ end). Consequently, the full ycf1 sequence is located only on the IR_A/SSC border, and its incomplete copy on IR_B/SSC border acts as a pseudogene (ψycf1). A similar situation was observed for rps19, for which the complete sequence can be found on the IR_B/LSC border, whereas the ψrps19 pseudogene is located in the IR_A region. The IR_B/SSC boundary was identified within the ndhF gene, and a 45 bp string from its 3′-end overlapped ψycf1 within IR_B. The IR_A/LSC junction was adjacent to the trnH gene. The boundaries between the IR and SSC/LSC regions of six Colobanthus species were generally found in the same positions and within the same genomic elements. One base shift in the IR_A/SSC and IR_B/SSC border position was found only in C. subulatus (Fig. 2).

Repetitive sequences and SSRs

A total of 153 repeats were observed in the plastomes of six Colobanthus species. The number of repeats was highest (39) in C. pulvinatus and lowest (21) in C. lycopodioides and C. subulatus (Supplementary Table S2A–F online). Forward repeats dominated in the identified repetitive sequences (from 38.5% in C. pulvinatus to 57.1% in C. subulatus), followed by palindromic (from 25% in C. acicularis to 42.9% in C. lycopodioides) and reverse repeats (from 4.8% in C. lycopodioides and C. subulatus to 21.4% in C. acicularis). Complementary repeats were found only in the cp genome of C. pulvinatus with a frequency of 17.9% (Fig. 3A). Most repeat sequences (77.1%) were detected in the LSC region, followed by IR (11.8%) and SSC regions (11.1%) (Fig. 3B). These sequences were found predominantly within intergenic spacers and introns with a frequency of 79.5% (C. pulvinatus) to 57.1% (C. subulatus and C. lycopodioides). Our study also revealed that many repeats shared the same locus in all Colobanthus cp genomes. Fourteen such loci were identified: psaA, psaA–ycf3, psaB, psaI–ycf4, petN–psbM, trnG-GCC, trnG-UCC, trnS-GCU, trnS-GGA and trnS-UGA in the LSC region, ndhA and ycf1 in the SSC region, and trnV-GAC-rps7 and ycf2 in the IR region.
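The four repeat categories can be distinguished by how the second copy relates to the first. The Python sketch below classifies a pair of equal-length sequences under the usual convention (forward: identical; reverse: reversed; complement: complemented; palindromic: reverse-complemented). The function name is hypothetical, exact matching is assumed, and no claim is made about the repeat-finding software or mismatch settings used in the study.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def repeat_type(first: str, second: str) -> str:
    """Classify how `second` relates to `first` (exact matches only)."""
    first, second = first.upper(), second.upper()
    complement = first.translate(_COMPLEMENT)
    if second == first:
        return "forward"
    if second == first[::-1]:
        return "reverse"
    if second == complement:
        return "complement"
    if second == complement[::-1]:
        return "palindromic"
    return "none"


# Toy sequences for illustration only.
print(repeat_type("AACG", "AACG"))  # same orientation
print(repeat_type("AACG", "CGTT"))  # reverse complement
```

Some sequences satisfy more than one definition (for example, a homopolymer is both a forward and a reverse repeat of itself); the sketch simply reports the first category that matches.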

The Phobos analysis supported the identification of 39–47 SSRs in six cp genomes (Fig. 4A), including mono-, di-, tri-, tetra-, penta- and hexanucleotides (Supplementary Table S3A–F online). The mononucleotide SSRs were most common with a frequency ranging from 46.3% in C. subulatus to 55.3% in C. lycopodioides (Fig. 4B). All mononucleotide SSRs were composed of A/T repeat units. Motifs composed of adenine and thymine were also predominant in di- and trinucleotide SSRs, where only AT/TA and AAT/TTA motifs were observed, respectively. Tetranucleotide SSRs were the second most frequent repeats that ranged from 26.7% in C. nivicola to 30.8% in C. affinis. The frequency of di-, tri- and pentanucleotide SSRs did not exceed 9.8%. Hexanucleotide SSRs were detected only in C. nivicola and C. pulvinatus with a frequency of 2.2% and 2.3%, respectively. The majority of SSRs were located in the LSC region (from 75% in C. pulvinatus to 85.1% in C. lycopodioides), followed by SSC (from 12.8% in C. lycopodioides to 20.5% in C. pulvinatus) and IR regions (from 2.1% in C. lycopodioides to 4.5% in C. pulvinatus) (Fig. 4C). Furthermore, SSRs were identified predominantly within intergenic spacers (from 70.2% in C. lycopodioides to 73.3% in C. nivicola), whereas the remaining SSRs were distributed in various proportions between exons and introns (Fig. 4D).
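To make the notion of mono- to hexanucleotide SSRs concrete, the following Python sketch finds perfect tandem repeats with a regular expression. It is a simplified stand-in and not the Phobos algorithm; the function name and the minimum repeat thresholds are assumptions chosen for illustration, since the search parameters are not given in this section.

```python
import re


def find_ssrs(seq: str, min_repeats: dict) -> list:
    """Find perfect SSRs.

    `min_repeats` maps motif length -> minimum number of tandem copies.
    Returns a sorted list of (start, motif, copies) tuples (0-based start).
    """
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in min_repeats.items():
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit (e.g. "AA").
            if any(
                unit_len % k == 0 and motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Toy sequence: a 10-bp A run and an (AT)5 stretch; thresholds are hypothetical.
toy = "GGC" + "A" * 10 + "GC" + "AT" * 5 + "CG"
print(find_ssrs(toy, {1: 10, 2: 5}))
```

Dedicated tools additionally handle imperfect repeats and scoring, so counts from a sketch like this would not necessarily match the values reported above.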

Sequence divergence

The overall sequence identity and divergent regions in the cp genomes of C. acicularis, C. affinis, C. lycopodioides, C. nivicola, C. pulvinatus, C. subulatus and the previously published plastomes of C. quitensis and C. apetalus^25,26 were determined in the MAUVE and DnaSP programs. MAUVE results are shown in Supplementary Figure S1 online. Rearrangements (inversions or translocations) were not detected in any of the eight chloroplast genome sequences. The high sequence similarity points to the conservative character of all eight cp genomes. In DnaSP, nucleotide diversity (π) in the cp genomes of Colobanthus species was determined at 0.00262. The most variable regions, i.e. regions for which π values exceeded 0.008, were identified in a sliding window analysis (Fig. 5). Divergence was generally higher in non-coding regions. In the coding region, differences were found only in the ycf1 locus. In non-coding regions, the highest divergence and the highest π value (0.01299) were observed for ndhF–rpl32, rpl32–trnL-UAG, trnK-UUU–rps16, rps16–trnQ-UUG, trnS-GCU–trnG-UCC, trnD-GUC–trnY-GUA, psaA–ycf3, trnL-UAA–trnF-GAA, trnF-GAA–ndhJ, petA–psbJ and trnV-GAC–rps7 (Supplementary Table S4 online). The majority of highly variable regions (9) were identified in LSC. There were three such regions in SSC, and none in the IR region.
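Nucleotide diversity (π) is the average number of pairwise differences per site among aligned sequences. The Python sketch below computes π for a small alignment and then over sliding windows. It is a simplified illustration, not a reimplementation of DnaSP: the function names, window size and step are hypothetical, and gap characters are compared like any other character, whereas dedicated software typically excludes gapped sites.

```python
from itertools import combinations


def nucleotide_diversity(alignment: list) -> float:
    """Mean pairwise differences per site for equal-length aligned sequences."""
    n_sites = len(alignment[0])
    pairs = list(combinations(alignment, 2))
    differences = sum(
        sum(a != b for a, b in zip(seq1, seq2)) for seq1, seq2 in pairs
    )
    return differences / (len(pairs) * n_sites)


def sliding_pi(alignment: list, window: int, step: int) -> list:
    """Return (window_start, pi) for each full window along the alignment."""
    length = len(alignment[0])
    return [
        (start, nucleotide_diversity([s[start:start + window] for s in alignment]))
        for start in range(0, length - window + 1, step)
    ]


# Toy alignment for illustration only.
toy = ["AAAA", "AAAT", "AAAT"]
print(nucleotide_diversity(toy))
print(sliding_pi(toy, window=2, step=2))
```

Windows whose π exceeds a chosen threshold (0.008 in the text) would then be flagged as highly variable regions.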

Synonymous (Ks) and non-synonymous (Ka) substitution rate analysis

The substitution rate varied widely across plastome genes in each functional group, and the values of Ka and Ks were determined in the range of 0–0.0117 and 0–0.152, respectively (Supplementary Table S5 online). The highest average value of Ks (0.0111) was noted for coding sequences associated with the large subunit of ribosome. The average value of Ks was lowest in genes related to the cytochrome b/f complex (0.0008), RubisCO large subunit (0.0015), Photosystem II (0.0022) and Photosystem I (0.0026). The genes associated with Photosystem II, Photosystem I and the cytochrome b/f complex were also characterized by the lowest average values of Ka (0.0003, 0.0005 and 0.0005, respectively), and the highest average value of Ka (0.0025) was noted in the RubisCO large subunit. In general, no differences were observed in the sequences of 18 plastome genes (Ka = 0, Ks = 0) of the studied Colobanthus species. The remaining 60 genes shared 99% similarity, but only synonymous substitutions (Ka = 0) were observed in 33 of those genes. The Ka/Ks ratio was less than 1 in all genes, excluding rpoC2 (1.5238 for C. affinis, and 1.381 for C. nivicola and C. pulvinatus) and matK (1.1333 for C. lycopodioides). The Ka/Ks ratio exceeded 1 in rpoC2 and matK, which could suggest that in the above species, these genes had undergone positive selection (adaptation to a specific environment). A Ka/Ks ratio of less than 1 points to the influence of purifying selection on the remaining 76 genes.
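The interpretation of the Ka/Ks ratio described above can be summarized in a few lines of Python. This is a sketch of the decision rule only; the function name and the example values are hypothetical, and estimating Ka and Ks themselves requires a substitution model that is not shown here.

```python
def selection_regime(ka: float, ks: float) -> str:
    """Interpret a Ka/Ks ratio; the ratio is undefined when Ks is zero."""
    if ks == 0:
        # Covers identical sequences (Ka = Ks = 0) and genes with
        # non-synonymous changes only, where the ratio cannot be computed.
        return "undefined"
    ratio = ka / ks
    if ratio > 1:
        return "positive selection"
    if ratio < 1:
        return "purifying selection"
    return "neutral"


# Hypothetical values for illustration only.
print(selection_regime(0.003, 0.002))   # ratio above 1
print(selection_regime(0.0005, 0.002))  # ratio below 1
print(selection_regime(0.0, 0.0))       # no substitutions at all
```

Genes with Ka = 0 and Ks > 0 give a ratio of 0 and fall under purifying selection in this scheme, which matches the treatment of genes with only synonymous substitutions in the text.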

RNA-editing

The results of the PREP prediction revealed 49 editing sites in 18 protein coding genes in the plastomes of Colobanthus species, excluding C. acicularis and C. lycopodioides where 48 such elements were found (Supplementary Table S6 online). One RNA editing site within the ndhF gene was missing in C. acicularis, and one editing site within the ndhB sequence was missing in C. lycopodioides. All editing events involved C to U conversion. Fifteen non-synonymous mutations were found at the first position of the codon, 34 mutations were identified at the second position, and none were found at the third position. Serine (S) to leucine (L) changes accounted for nearly a third (32.6%) of the identified mutations, whereas arginine (R) to tryptophan (W) and serine (S) to phenylalanine (F) changes were least frequently observed (4.1% for both). Each RNA editing site in the corresponding genes of the eight Colobanthus species was generally found at the same nucleotide position. Three base shifts were identified in only two RNA editing sites within the rpoB sequence in the cp genome of C. quitensis.
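The amino acid consequences of C to U editing can be illustrated by translating a codon before and after the change. The Python sketch below builds the standard codon table (the amino acid assignments are the same in the bacterial and plant plastid code) and applies a single C to T change at a given codon position. The function name and the example codons are chosen for illustration; this is not the PREP prediction method, which relies on comparison with conserved protein sequences.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def edit_effect(codon: str, position: int) -> tuple:
    """Apply C->T (C->U in RNA) at a 1-based codon position.

    Returns (amino acid before editing, amino acid after editing).
    """
    codon = codon.upper()
    if codon[position - 1] != "C":
        raise ValueError("no cytidine at the requested codon position")
    edited = codon[:position - 1] + "T" + codon[position:]
    return CODON_TABLE[codon], CODON_TABLE[edited]


# Example codons illustrating the change types mentioned in the text.
print(edit_effect("TCA", 2))  # serine to leucine
print(edit_effect("CGG", 1))  # arginine to tryptophan
print(edit_effect("TCT", 2))  # serine to phenylalanine
```

Edits at the first or second codon position usually change the encoded amino acid, which is consistent with all predicted sites falling at those two positions.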

Phylogenetic analysis

The phylogenetic trees generated by BI and ML had a consistent topology. In the BI tree, Bayesian posterior probability reached 1.0 in 92.6% of the nodes (25 out of 27). The reconstructed phylogeny is consistent with the taxonomic position of the studied species, and it revealed the following relationships: all studied Colobanthus species formed one clade; the second clade grouped four Pseudostellaria species; the third clade consisted of four Dianthus species and a solitary branch of Gypsophila vaccaria; all members of the genus Silene and Lychnis wilfordii formed the fourth clade in the proximity of a separate branch of Agrostemma githago; the most divergent branches were formed by Gymnocarpos przewalskii and Spergula arvensis (Fig. 6, Table 3).

Table 3 GenBank accession numbers and references for cp genomes used in this study.

Discussion

The length of the complete sequences of the six new chloroplast genomes of Colobanthus species ranged from 151,050 (C. acicularis) to 151,462 bp (C. lycopodioides). It was very similar to the previously sequenced plastomes of C. quitensis (151,276 bp)²⁵ and C. apetalus (151,228 bp)²⁶, and was within the size range of cp genomes of other angiosperms⁴⁵. A comparison of all Colobanthus cp genomes that have been sequenced to date also revealed considerable similarities in genome composition: all eight species had the same gene content and order. Moreover, their protein-coding sequences were characterized by low variation. Consequently, the differences in the size and organization of intergenic spacers were most probably responsible for the observed variations in the size of Colobanthus cp genomes. As previously described⁴⁶, the variation in the size of cp genomes in different plant lineages could also be attributed to the expansion and contraction of IR regions. Similarly to most angiosperms, IR boundaries were found within ycf1 and rps19 genes in the presented cp genomes of Colobanthus⁴⁷. The location of IR boundaries was nearly identical in all Colobanthus species; a one-base shift in the position of IR_A/SSC and IR_B/SSC borders was found only in C. subulatus. The size of IR regions was highly similar in all Colobanthus cp genomes that have been sequenced to date, ranging from 25,303 (C. quitensis) to 25,326 bp (C. nivicola and C. pulvinatus), which corresponds to the values reported in other dicotyledons^{48, 49}. Somewhat greater differences were observed in Colobanthus species when the length of LSC and SSC regions was considered. The differences in the length of LSC and SSC between the longest and the shortest element were determined at 447 bp (C. lycopodioides vs. C. acicularis) and 79 bp (C. affinis vs. C. lycopodioides), respectively.

The accumulation of point mutations in the form of synonymous and non-synonymous nucleotide substitutions is one of the key mechanisms of gene evolution⁵⁰. In this study, synonymous nucleotide substitutions were more frequently observed; therefore, the analyzed Colobanthus species have maintained a high degree of sequence conservation, especially in genes involved in photosynthesis. However, considerable variation was observed in several genes, including rps16 and ycf1 which are characterized by the highest number of non-synonymous nucleotide substitutions. These two elements of chloroplast genomes are very often found among the most variable chloroplast loci of numerous genera⁵¹, including Silene (Caryophyllaceae)^52,53,54. The Ka/Ks ratio was examined to determine selection pressure on protein-coding genes. The six Colobanthus cp genomes exhibited highly conserved organization, but positive selection pressure (Ka/Ks > 1) was observed in rpoC2 (C. affinis, C. nivicola and C. pulvinatus) and matK (C. lycopodioides), which suggests that these loci are undergoing essential adaptation to environmental conditions. In C. lycopodioides, C. nivicola and C. pulvinatus, the Ka/Ks ratio for the rbcL gene was only somewhat lower than 1 (0.965), which could indicate that positive selection played some role in the acceleration of the substitution rate for that locus. All three genes have also been previously reported to undergo positive selection in other plant species. The rbcL gene, which encodes the large subunit of RuBisCO, appears to undergo positive selection most often, in up to 75–88% of terrestrial plants⁵⁵. According to the literature, rbcL could be a chloroplast region that was positively selected during the evolutionary processes associated with adaptation to temperature^55,56,57,58, CO₂ concentration^55,59,60 or water deficiency^57,60. The matK gene, which encodes the maturase enzyme that catalyzes the removal of a nonautocatalytic intron from premature RNAs⁶¹, was also found to undergo positive selection in 32 plant groups⁶². Despite its important function, the rpoC2 gene encoding the β″ subunit of plastid-encoded plastid RNA polymerase (RNA polymerase type I) is also a relatively rapidly evolving chloroplast sequence⁶³ that undergoes positive selection in various groups of plants, including Lamiaceae⁶⁴, Orobanchaceae⁶⁵ and Annonaceae⁶⁶. The fact that traces of positive selection were detected in such different functional gene classes, including genes involved in photosynthesis (rbcL) and in transcription and transcript processing (matK and rpoC2), could indicate that natural selection targets different chloroplast functions. A higher rate of substitutions in these Colobanthus genes could be indicative of continuous fine-tuning to specific environmental conditions.

Repeat regions within genomes play an important role in sequence divergence and rearrangement, which is why they have to be identified, and their number and distribution have to be determined in genomic studies^67,68. In the six Colobanthus plastomes reported here, most repeat regions were identified in intergenic regions and introns (57.1–79.5%), which is highly consistent with the values previously reported in C. apetalus (76.7%) and C. quitensis (53.3%)²⁶, as well as for other Caryophyllaceae such as Silene capitata (56.0%) and Lychnis wilfordii (69.2%)⁶⁹. Our study also demonstrated that repeat regions are not randomly distributed within Colobanthus cp genomes, and they were identified mainly within highly divergent regions of rpl32–trnL-UAG, ycf1, trnK-UUU–rps16, psaA–ycf3 and petA–psbJ in the LSC.

Simple sequence repeats, also known as microsatellites, are important molecular markers with many applications, including species identification, population genetics and phylogenetic studies^70,71,72. Mononucleotide SSRs were identified most frequently (51% on average) among the microsatellites of the six analyzed cp genomes of Colobanthus species, with A/T as the prevalent motif type. The frequency of mononucleotide SSRs was highly similar in C. apetalus and C. quitensis (48.8% and 54.2%, respectively)²⁶ as well as in other plant species, including members of the family Caryophyllaceae^69,73,74. In turn, hexanucleotide SSRs were least abundant, and only one such element was identified in the cp genomes of C. nivicola and C. pulvinatus. Moreover, the majority of SSRs were located within intergenic spacers and introns, while only 13.5% (on average) were positioned in the exons of 11 genes. Four of these loci (ycf1, rpoC2, rrn23, atpA) harbored SSRs in all six Colobanthus species, whereas four other loci (petB, petD, ndhA, rpoA) contained SSRs in only one, specific species.

In terrestrial plants, cytidines are systematically converted to uridines (C to U editing) in both mitochondrial and plastid mRNA transcripts to restore conserved codons⁷⁵. This important post-transcriptional process, which appeared in the early stages of flowering plant evolution⁷⁶, is considered to be functionally significant in chloroplasts⁷⁷. In our work, potential RNA editing sites were identified in 18 out of the 34 analyzed chloroplast protein-coding genes. These sites were highly conserved in the analyzed species, excluding two sites within the rpoB gene in the cp genome of C. quitensis. In this species, the three-base shift appears to be a consequence of the insertion of three bases (CAG) which added glutamine in position 636 of the rpoB amino acid sequence. This study also demonstrated that leucine codons, including those that have potentially emerged from RNA editing, are heavily used in the cp genomes of all eight Colobanthus species. Previous research revealed a high demand for leucine biosynthesis in chloroplasts and suggested that this amino acid plays an important role in photosynthesis-related metabolism^78,79.

Colobanthus is one of the 86 genera within the family Caryophyllaceae⁴². Genus Colobanthus contains around 26 confirmed species and 24 taxa whose status has not been resolved¹. Therefore, a taxonomic inventory of the genus Colobanthus is still needed. However, this is a challenging task due to the extensive geographic distribution of the species and high variation within morphological traits. The DNA barcode has recently emerged as an effective biological tool for accurate species identification⁸⁰. Despite the above, the molecular data for Colobanthus are highly limited, and species-specific barcode sequences are not available. In the eight Colobanthus species with completely sequenced cp genomes, the genetic variation in the chloroplast regions of matK and rbcL that are widely used in the barcoding of terrestrial plants was lower than expected. However, genome-wide comparative analyses based on nucleotide diversity (π) supported the identification of 12 highly variable regions (π > 0.008) that could be utilized as a source of potential markers for species identification and reconstruction of the phylogenetic relationships within this plant group: ndhF–rpl32, rpl32–trnL-UAG, ycf1, trnK-UUU–rps16, rps16–trnQ-UUG, trnS-GCU–trnG-UCC, trnD-GUC–trnY-GUA, psaA–ycf3, trnL-UAA–trnF-GAA, trnF-GAA–ndhJ, petA–psbJ and trnV-GAC–rps7. High nucleotide diversity (π) in regions ycf1, rpl32–trnL, trnS–trnG, petA–psbJ and rps16–trnQ was also reported in other plants⁵¹. Moreover, the ndhF–rpl32–trnL-UAG region has been widely used in phylogenetic studies^81,82. Two of these highly variable regions (rps16–trnQ-UUG and trnL-UAA–trnF-GAA) were among the 5 chloroplast sequences that were analyzed by Greenberg and Donoghue⁸³ to explore the phylogenetic relationships within the family Caryophyllaceae.

It has recently been postulated that complete chloroplast genome sequences can be used as a super barcode for identifying plant species⁸⁴. This approach is particularly useful for distinguishing between closely related taxa where limited sequence variation has resulted from a low rate of genome evolution or a relatively short time since the divergence event^85,86. Considering the still unresolved status of many Colobanthus species, a phylogeny reconstruction based on whole plastome sequences seems to be highly desired for this genus. In this study, chloroplast genome sequences were used to resolve the phylogenetic relationships within the genus Colobanthus and the Caryophyllaceae family, but the analysis involved only species with completely elucidated plastome sequences. The phylogeny reconstructed based on 73 concatenated protein-coding gene sequences appeared to be consistent with the taxonomic position of the studied species and previous phylogenies of the Caryophyllaceae^41,83,87. However, in the genus Colobanthus, this was an initial step towards resolving the phylogenetic relationships within that group of plants. The phylogenetic tree with highly supported nodes revealed that all eight Colobanthus species were grouped in one clade, where C. pulvinatus and C. nivicola, C. apetalus and C. affinis as well as C. subulatus and C. quitensis formed three pairs of most similar species, whereas C. lycopodioides appeared to be most different.

Conclusion

The chloroplast genomes of C. acicularis, C. affinis, C. lycopodioides, C. nivicola, C. pulvinatus and C. subulatus were sequenced and characterized for the first time. Their plastomes have a typical quadripartite circular structure and share the same overall organization and gene content. The information regarding sequence variation, distribution and characteristics of SSR loci within the studied cp genomes could be useful in future studies on the population genetics, phylogenetics and evolution of Colobanthus. Nevertheless, further research is required to investigate whether highly variable regions or complete chloroplast genome sequences could be used as reliable and effective DNA barcodes for Colobanthus species.

Methods

Plant material, DNA extraction and chloroplast genome sequencing

Fresh leaves of five Colobanthus species were sampled from plants grown from seeds in a greenhouse of the Department of Plant Physiology, Genetics and Biotechnology at the University of Warmia and Mazury in Olsztyn, Poland. The seeds of C. affinis were obtained from the Royal Botanic Gardens, Victoria, Australia. The seeds of C. nivicola and C. pulvinatus were acquired from the Australian National Botanic Gardens, Canberra. The seeds of C. subulatus originated from the Royal Botanic Gardens, Kew, United Kingdom. The seeds of C. lycopodioides were collected in the region of Mendoza, Andes, Argentina, at an altitude of 4,024 m a.s.l. (33°10′ S; 69°50′ W). For C. acicularis only, DNA was extracted from the dried tissue of one individual, supplied in silica gel by the Royal Botanic Garden, Edinburgh, UK. Plant material was formally identified before the analyses. Professor Irena Giełwanowska performed morphological and anatomical analyses of both vegetative and generative organs harboring characteristic traits for the identification of Colobanthus species^1,88,89. Voucher specimens of each studied species have been deposited in the Vascular Plants Herbarium of the Department of Botany and Nature Protection at the University of Warmia and Mazury in Olsztyn, Poland (OLS), under the following numbers: C. acicularis (No. OLS 33824), C. affinis (No. OLS 33825), C. lycopodioides (No. OLS 33826), C. nivicola (No. OLS 33827), C. pulvinatus (No. OLS 33828) and C. subulatus (No. OLS 33829). Total genomic DNA was extracted from fresh or dry tissue of a single plant using the Maxwell 16 LEV Plant DNA Kit (Promega, Madison, WI). The quality of DNA was verified on 1% (w/v) agarose gel stained with 0.5 µg/ml ethidium bromide. The concentration and purity of DNA samples were assessed spectrophotometrically.

Genome libraries were prepared using the Nextera XT kit (Illumina Inc., San Diego, CA, USA), and genomes were sequenced on the Illumina MiSeq platform (Illumina Inc., San Diego, CA, USA) with 150 bp paired-end reads.

Annotation and genome analysis

The FastQC tool was used to check the quality of raw reads. Raw reads were trimmed (5 bp from each read end, and regions with a per-base error probability above 5%) and mapped to the reference chloroplast genome of C. quitensis (NC_028080) in Geneious v.R7 software⁹⁰ with default medium–low sensitivity settings. Subsequent steps of annotation and genome analysis followed the procedure described in our previous paper²⁶. The gene maps of the annotated cp genomes were developed with the OrganellarGenomeDRAW (OGDRAW) tool⁹¹.
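
The trimming criteria above can be made concrete with a small sketch. A 5% per-base error probability corresponds to a Phred quality of about 13, since Q = -10·log10(P). The Python code below is only an illustration under stated assumptions: the function names are hypothetical, qualities are assumed to be already decoded to Phred integers, and the simple end-trimming rule is not the algorithm implemented in Geneious.

```python
import math


def phred_from_error(p):
    """Convert a per-base error probability to a Phred quality score."""
    return -10 * math.log10(p)


def trim_read(seq, quals, clip=5, max_error=0.05):
    """Clip a fixed number of bases from both ends, then trim low-quality ends.

    seq   -- read sequence (string)
    quals -- Phred scores, one integer per base (assumed already decoded)
    This is a simplified stand-in, not the trimming routine used by Geneious.
    """
    # Fixed clipping of `clip` bases from each end of the read.
    seq = seq[clip:len(seq) - clip]
    quals = quals[clip:len(quals) - clip]
    # Bases whose error probability exceeds max_error have a Phred score below min_q.
    min_q = phred_from_error(max_error)
    start, end = 0, len(seq)
    while start < end and quals[start] < min_q:
        start += 1
    while end > start and quals[end - 1] < min_q:
        end -= 1
    return seq[start:end], quals[start:end]
```

With the default arguments, a 16-base read loses 5 bases at each end, and any remaining terminal bases with a Phred score below roughly 13 are removed as well.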

Genomic repeats and SSR analysis

The size and location of genomic repeats, including forward, reverse, palindromic and complementary sequences within the analyzed chloroplast genomes, were identified using REPuter software⁹² with the following settings: (1) Hamming distance of 3, (2) sequence identity ≥ 90%, and (3) minimum repeat size ≥ 30 bp. Phobos v.3.3.12⁹³ was used to detect chloroplast simple sequence repeats (SSRs). Only perfect SSRs with a motif size of one to six nucleotide units were considered, with standard thresholds for chloroplast SSR identification: ≥ 12 repeat units for mononucleotide SSRs, ≥ 6 repeat units for dinucleotide SSRs, ≥ 4 repeat units for trinucleotide SSRs, and ≥ 3 repeat units for tetra-, penta- and hexanucleotide SSRs⁹⁴. A single IR region was used to eliminate the influence of doubled IR regions. Redundant results in REPuter were deleted manually.
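
The SSR thresholds listed above can be illustrated with a short regular-expression scan. This Python sketch is not Phobos and makes no attempt to reproduce its output; the function names are hypothetical, and the only values taken from the text are the minimum repeat counts per motif size. Motifs that are themselves repeats of a shorter unit (for example "AA" or "ATAT") are skipped so that each SSR is reported once, under its shortest motif.

```python
import re

# Minimum number of repeat units per motif size, as listed in the text.
MIN_REPEATS = {1: 12, 2: 6, 3: 4, 4: 3, 5: 3, 6: 3}


def is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit."""
    n = len(motif)
    return not any(
        n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n)
    )


def find_perfect_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    start is a 0-based position in seq. Simplified sketch: matches are
    non-overlapping within each motif size.
    """
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_primitive(motif):
                hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)
```

For example, a sequence containing twelve consecutive A bases, six consecutive AT units and four consecutive AGC units yields one mononucleotide, one dinucleotide and one trinucleotide SSR, whereas a run of eleven A bases falls below the threshold and is not reported.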

Comparative chloroplast genome analysis

Genome synteny analysis of the eight Colobanthus plastomes (six genomes reported in this paper, and C. quitensis and C. apetalus that were previously characterized and deposited in NCBI^25,26) was performed with the use of MAUVE v.1.1.1⁹⁵. The sequences were aligned in MAFFT v.7.310⁹⁶ to perform sliding window analysis and evaluate nucleotide diversity (π) in cp genomes using DnaSP v.6.10.04⁹⁷. The step size was set to 50 base pairs, and window length was set to 800 base pairs.
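
The sliding-window calculation of nucleotide diversity can be sketched as follows. This Python code is an illustration only, not DnaSP: π is computed as the mean number of pairwise differences per site within each window, alignment columns containing gaps or ambiguous bases are excluded (an assumption about gap handling), and windows are reported by their 0-based start position. The function names are hypothetical; the default window length (800 bp) and step size (50 bp) follow the text.

```python
from collections import Counter


def window_pi(columns, n_seqs):
    """Mean pairwise differences per site over the given alignment columns."""
    n_pairs = n_seqs * (n_seqs - 1) // 2
    # Keep only columns where every sequence has an unambiguous base.
    usable = [col for col in columns if all(base in "ACGT" for base in col)]
    if not usable or n_pairs == 0:
        return 0.0
    diffs = 0
    for col in usable:
        # Pairs of sequences carrying the same base at this site.
        identical = sum(k * (k - 1) // 2 for k in Counter(col).values())
        diffs += n_pairs - identical
    return diffs / (n_pairs * len(usable))


def sliding_pi(alignment, window=800, step=50):
    """Return a list of (window_start, pi) for every full window."""
    seqs = [s.upper() for s in alignment]
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    columns = list(zip(*seqs))
    return [
        (start, window_pi(columns[start:start + window], len(seqs)))
        for start in range(0, length - window + 1, step)
    ]
```

Peaks in the resulting series would correspond to candidate highly variable regions, although the exact values may differ from DnaSP depending on how gaps and window positions are treated.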

The evolutionary rate of the plastome genes identified in all Colobanthus species (C. acicularis, C. affinis, C. apetalus, C. lycopodioides, C. nivicola, C. pulvinatus, C. quitensis and C. subulatus) was analyzed. A total of 78 genes were selected to estimate the ratio of non-synonymous (Ka) to synonymous (Ks) substitutions, with Colobanthus quitensis as the reference species. These genes were extracted and aligned separately using MAFFT v.7.310. The values of Ka and Ks in the shared genes were calculated in DnaSP v.6.10.04. For genes with non-applicable (NA) Ka/Ks ratios, the ratio was set to zero.
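
The convention of replacing non-applicable ratios with zero can be written out explicitly. The Python sketch below assumes that Ka and Ks have already been estimated elsewhere (for example, exported from DnaSP); the function names and the representation of missing values as None are assumptions for illustration.

```python
def ka_ks_ratio(ka, ks):
    """Return Ka/Ks, reporting undefined ratios as 0.0.

    A ratio is treated as non-applicable when either value is missing
    or when Ks is zero (division is undefined).
    """
    if ka is None or ks is None or ks == 0:
        return 0.0
    return ka / ks


def summarize(gene_rates):
    """Map gene name -> Ka/Ks for a dict of gene name -> (Ka, Ks)."""
    return {gene: ka_ks_ratio(ka, ks) for gene, (ka, ks) in gene_rates.items()}
```

As a general guide, ratios below 1 are usually read as purifying selection and ratios above 1 as positive selection. A zero produced by this convention therefore marks an undefined ratio rather than a measured absence of non-synonymous change, which is worth keeping in mind when such genes are plotted alongside the others.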

The chloroplast genome borders of LSC, SSC, and IRs were identified and compared based on their annotations. Codon usage data were obtained from the Geneious v.7 statistics panel.

Potential RNA editing sites in the protein-coding genes of chloroplast genomes were predicted using the Predictive RNA Editor for Plants (PREP) suite⁹⁸. The cutoff value for the analyzed Colobanthus species was set at 0.8, and 34 out of the 35 reference genes in PREP were used. rpl23 was not included in the analysis because it was not identified within the chloroplast genomes of the studied Colobanthus species. Two previously sequenced cp genomes of C. quitensis and C. apetalus^25,26 were also included in this analysis.

Phylogenetic analysis

The 24 available complete chloroplast genomes representing Caryophyllaceae lineages and the cp genome of Arabidopsis thaliana as an outgroup were downloaded from the NCBI database to investigate the phylogenetic relationships among the studied representatives of the genus Colobanthus and the genera in the family Caryophyllaceae. The cp genomes used in phylogenetic analyses are presented in Table 3. The sequences of 73 shared protein-coding genes were extracted using a custom R script and aligned in MAFFT v.7.310. Finally, the 73 concatenated protein-coding gene sequences were used for phylogeny reconstruction by the Bayesian Inference (BI) and Maximum-Likelihood (ML) methods. The best-fit model of sequence evolution was identified in MEGA v.7⁹⁹, and the GTR + G + I model was selected. The BI analysis was performed in MrBayes v.3.2.6^100,101, and the ML analysis was conducted in PhyML v.3.0¹⁰². Parameter settings were previously described by Androsiuk et al.²⁶.
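
The concatenation step can be sketched as below. The custom R script used in the study is not shown in the paper, so this Python code is a hypothetical stand-in rather than the authors' implementation. It assumes each gene has already been aligned (for example, with MAFFT) and is supplied as a mapping from taxon name to aligned sequence. Padding absent taxa with gaps is an extra convenience; it would not be needed when only genes shared by all taxa are used.

```python
def concatenate_alignments(gene_alignments):
    """Join per-gene alignments into a single supermatrix.

    gene_alignments -- dict: gene name -> {taxon name: aligned sequence}
    Returns (supermatrix, partitions), where supermatrix maps each taxon to
    its concatenated sequence and partitions lists (gene, first, last) with
    1-based inclusive coordinates. Genes are joined in alphabetical order.
    """
    taxa = sorted({t for aln in gene_alignments.values() for t in aln})
    pieces = {t: [] for t in taxa}
    partitions = []
    pos = 0
    for gene in sorted(gene_alignments):
        aln = gene_alignments[gene]
        lengths = {len(s) for s in aln.values()}
        if len(lengths) != 1:
            raise ValueError(f"{gene}: sequences are not aligned to equal length")
        n = lengths.pop()
        for t in taxa:
            # Taxa lacking this gene receive a gap-only block.
            pieces[t].append(aln.get(t, "-" * n))
        partitions.append((gene, pos + 1, pos + n))
        pos += n
    return {t: "".join(p) for t, p in pieces.items()}, partitions
```

The partition list records where each gene sits in the supermatrix, which is useful if a partitioned substitution model is explored later; the analysis described above applied a single GTR + G + I model.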

Data availability

The complete chloroplast genomes of the six Colobanthus species have been submitted to the NCBI database (https://www.ncbi.nlm.nih.gov/) under the accession numbers: MN273320 for C. acicularis, MN273318 for C. affinis, MN273317 for C. lycopodioides, MN273316 for C. nivicola, MN273315 for C. pulvinatus and MN273319 for C. subulatus.

References

The Plant List 2013. Version 1.1. https://www.theplantlist.org/browse/A/Caryophyllaceae/Colobanthus. (2019).
West, J. G. Colobanthus curtisiae (Caryophyllaceae), a new species from eastern Tasmania, Australia. R. Soc. Tasm. Hobart 124(2), 75–78 (1991).
Giełwanowska, I. et al. Biology of generative reproduction of Colobanthus quitensis (Kunth) Bartl. from King George Island, South Shetland Islands. Pol. Polar Res. 32, 139–155 (2011).
The AGS online Plant Encyclopaedia. Alpine Garden Society. http://encyclopaedia.alpinegardensociety.net/plants/Colobanthus. (2019).
Harbaugh, D. T. et al. A new lineage-based tribal classification of the family Caryophyllaceae. Int. J. Plant Sci. 171(2), 185–198 (2010).
Timaná, M. E. Sagina diffusa (Hook.f.) Timaná, comb. Nov. (Caryophyllaceae), a new combination for the flora of île St. Paul (Southern Indian Ocean), with some historical notes. Adansonia 40(3), 47–53 (2018).
Cooper, J., Cuthbert, R. J., Gremmen, N. J. M., Ryan, P. G. & Shaw, J. D. Earth, fire and water: Applying novel techniques to eradicate the invasive plant, procumbent pearlwort Sagina procumbens, on Gough Island, a World Heritage Site in the South Atlantic. International Conference on Island Invasives, Auckland, New Zealand (Gland, Switzerland: IUCN 2011).
West, J. G. Colobanthus curtisiae (Caryophyllaceae), a new species from Eastern Australia. In Aspects of Tasmanian Botany: A Tribute to Winifred Curtis (eds. Banks, M. R., Smith, S. J., Orchard, A. E. & Kantvilas, G.) (Royal Society of Tasmania, 1991).
Sneddon, B. V. The taxonomy and breeding system of Colobanthus squarrosus (Caryophyllaceae). N. Z. J. Bot. 37, 195–204 (1999).
Gilfedder, L. & Kirkpatrick, J. B. The distribution, ecology and conservation needs of Colobanthus curtisiae west. Pap. Proc. R. Soc. Tasmania 130(1), 25–30 (1996).
Gray, M. Miscellaneous notes on Australian plants. 3. Craspedia, Gnaphalium, Epacris, Tasmannia, Colobanthus and Deyeuxia. Contr. Herbarium Australiense 26, 1–11 (1976).
Briggs, J. D. & Leigh, J. H. Rare or Threatened Australian Plants (CSIRO Publishing, Clayton, 1996).
Skottsberg, C. Antarctic flowering plants. Svensk Bot. Tidskr. 51, 330–338 (1954).
Bravo, L. A. et al. Effect of cold acclimation on the photosynthetic performance of two ecotypes of Colobanthus quitensis (Kunth) Bartl. J. Exp. Bot. 58, 3581–3590 (2007).
Giełwanowska, I., Pastorczyk, M., Lisowska, M., Wegrzyn, M. & Gorecki, R. J. Cold stress effects on organelle ultrastructure in polar Caryophyllaceae species. Pol. Polar Res. 35, 627–646 (2014).
Bascunan-Godoy, L. et al. Cold-acclimation limits low temperature induced photoinhibition by promoting a higher photochemical quantum yield and a more effective PSII restoration in darkness in the Antarctic rather than the Andean ecotype of Colobanthus quitensis Kunt Bartl (Cariophyllaceae). BMC Plant Biol. 12, 114 (2012).
Navarrete-Gallegos, A. A., Bravo, L. A., Molina-Montenegro, M. A. & Corcuera, L. J. Antioxidant responses in two Colobanthus quitensis (Caryophyllaceae) ecotypes exposed to high UV-B radiation and low temperature. Rev. Chil. Hist. Nat. 85, 419–433 (2012).
Pastorczyk, M., Giełwanowska, I. & Lahuta, L. B. Changes in soluble carbohydrates in polar Caryophyllaceae and Poaceae plants in response to chilling. Acta Physiol. Plant. 36, 1771–1780 (2014).
Cuba-Díaz, M., Castel, K., Acuña, D., Machuca, Á & Cid, I. Sodium chloride effect on Colobanthus quitensis seedling survival and in vitro propagation. Antarct. Sci. 29, 45–46 (2017).
Lee, D. W. & Postle, R. L. Isozyme variation in Colobanthus quitensis (Kunth) Bartl.: Methods and preliminary analysis. Brit. Antarct. Surv. B. 41–42, 133–137 (1975).
Gianoli, E. et al. Ecotypic differentiation in morphology and cold resistance in populations of Colobanthus quitensis (Caryophyllaceae) from the Andes of Central Chile and the Maritime Antarctic. Arct. Antarct. Alp. Res. 36, 484–489 (2004).
Acuña-Rodríguez, I. S., Oses, R., Cortés-Vasquez, J., Torres-Díaz, C. & Molina-Montenegro, M. M. Genetic diversity of Colobanthus quitensis across the Drake Passage. Plant Genet. Resour. C. 12, 147–150 (2014).
Androsiuk, P., Chwedorzewska, K. J., Szandar, K. & Giełwanowska, I. Genetic variability of Colobanthus quitensis from King George Island (Antarctica). Pol. Polar Res. 36, 281–295 (2015).
Koc, J. et al. Range-wide pattern of genetic variation in Colobanthus quitensis. Polar Biol. 41, 2467–2479 (2018).
Kang, Y. et al. The complete chloroplast genome of Antarctic pearlwort Colobanthus quitensis (Kunth) Bartl. Mitochondrial DNA A. 27, 4677–4678 (2016).
Androsiuk, P. et al. The complete chloroplast genome of Colobanthus apetalus (Labill.) Druce: Genome organization and comparison with related species. PeerJ 6, e4723 (2018).
Cuba-Díaz, M., Cerda, G., Rivera, C. & Gómez, A. Genome size comparison in Colobanthus quitensis populations show differences in species ploidy. Polar Biol. 40, 1475–1480 (2017).
Dong, W., Xu, C., Cheng, T. & Zhou, S. Complete chloroplast genome of Sedum sarmentosum and chloroplast genome evolution in Saxifragales. PLoS ONE 8, e77965 (2013).
Corriveau, J. L. & Coleman, A. W. Rapid screening method to detect potential biparental inheritance of plastid DNA and results for over 200 angiosperm species. Am. J. Bot. 75(10), 1443–1458 (1988).
Jansen, R. K. et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc. Natl. Acad. Sci. USA 104, 19369–19374 (2007).
Drouin, G., Daoud, H. & Xia, J. Relative rates of synonymous substitutions in the mitochondrial, chloroplast and nuclear genomes of seed plants. Mol. Phylogenet. Evol. 49, 827–831 (2008).
Parks, M., Cronn, R. & Liston, A. Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes. BMC Biol. 7, 84 (2009).
Sabir, J. et al. Evolutionary and biotechnology implications of plastid genome variation in the inverted repeat lacking clade of legumes. Plant Biotechnol. J. 12(6), 743–754 (2014).
Ruhlman, T. & Jansen, R. The plastid genomes of flowering plants. In Chloroplast Biotechnology (ed. Maliga, P.) 3–38 (Humana Press, 2014).
Nock, C. J. et al. Chloroplast genome sequences from total DNA for plant identification. Plant Biotechnol. J. 9, 328–333 (2011).
Kane, N. et al. Ultrabarcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA. Am. J. Bot. 99, 320–329 (2012).
Park, I., Yang, S., Choi, G., Kim, W. J. & Moon, B. C. The complete chloroplast genome sequences of Aconitum pseudolaeve and Aconitum longecassidatum, and development of molecular markers for distinguishing species in the Aconitum subgenus Lycoctonum. Molecules 22, E2012 (2017).
Shinozaki, K. et al. The complete nucleotide sequence of the tobacco chloroplast genome: Its gene organization and expression. EMBO J. 5, 2043–2049 (1986).
Ohyama, K. et al. Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA. Nature 322, 572–574 (1986).
Hollingsworth, P. M. et al. A DNA barcode for land plants. Proc. Natl. Acad. Sci. USA 106, 12794–12797 (2009).
Dillenberger, M. S. & Kadereit, J. W. Maximum polyphyly: Multiple origins and delimitation with plesiomorphic characters require a new circumscription of Minuartia (Caryophyllaceae). Taxon 63(1), 64–88 (2014).
Smissen, R. D., Clement, J. C., Garnock-Jones, P. J. & Chambers, G. K. Subfamilial relationships within Caryophyllaceae as inferred from 5’ ndhF sequences. Am. J. Bot. 89(8), 1336–1341 (2002).
Palmer, J. D. Comparative organization of chloroplast genomes. Annu. Rev. Genet. 19, 325–354 (1985).
Jansen, R. K. & Ruhlman, T. A. Plastid Genomes of Seed Plants (Springer Press, Berlin, 2012).
Dong, W., Xu, C., Cheng, T., Lin, K. & Zhou, S. Sequencing angiosperm plastid genomes made easy: A complete set of universal primers and a case study on the phylogeny of Saxifragales. Genome Biol. Evol. 5(5), 989–997 (2013).
Wang, W. & Messing, J. High-throughput sequencing of three Lemnoideae (duckweeds) chloroplast genomes from total DNA. PLoS ONE 6(9), e24670 (2011).
Goulding, S. E., Olmstead, R. G., Morden, C. W. & Wolfe, K. H. Ebb and flow of the chloroplast inverted repeat. Mol. Gen. Genet. 252, 195–206 (1996).
Goremykin, V. V., Hirsch-Ernst, K. I., Wolfl, S. & Hellwig, F. H. Analysis of the Amborella trichopoda chloroplast genome sequence suggest that amborella is not a basal angiosperm. Mol. Biol. Evol. 20(9), 1499–1505 (2003).
Hupfer, H. et al. Complete nucleotide sequence of the Oenothera elata plastid genome representing plastome I of the five distinguishable Euoenothera plastomes. Mol. Gen. Genet. 263(4), 581–585 (2000).
Kimura, M. The Neutral Theory of Molecular Evolution (Cambridge University Press, Cambridge, 1983).
Dong, W., Liu, J., Yu, J., Wang, L. & Zhou, S. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLoS ONE 7(4), e35071 (2012).
Erixon, P. & Oxelman, B. Reticulate or tree-like chloroplast DNA evolution in Sileneae (Caryophyllaceae)?. Mol. Phylogenet. Evol. 48, 313–325 (2008).
Sloan, D. B., Alverson, A. J., Wu, M., Palmer, J. D. & Taylor, D. R. Recent acceleration of plastid sequence and structural evolution coincides with extreme mitochondrial divergence in the angiosperm genus Silene. Genome Biol. Evol. 4, 294–306 (2012).
Sloan, D. B. et al. A recurring syndrome of accelerated plastid genome evolution in the angiosperm tribe Sileneae (Caryophyllaceae). Mol. Phylogenet. Evol. 72, 82–89 (2014).
Kapralov, M. V. & Filatov, D. A. Widespread positive selection in the photosynthetic Rubisco enzyme. BMC Evol. Biol. 7, 73 (2007).
Kapralov, M. V. & Filatov, D. A. Molecular adaptation during adaptive radiation in the Hawaiian endemic genus Schiedea. PLoS ONE 1, e8 (2006).
Iida, S. et al. Molecular adaptation of rbcL in the heterophyllous aquatic plant Potamogeton. PLoS ONE 4, e4633 (2009).
Kapralov, M. V., Smith, J. A. C. & Filatov, D. A. Rubisco evolution in C4 eudicots: An analysis of Amaranthaceae sensu lato. PLoS ONE 7, e52974 (2012).
Christin, P. A. et al. Evolutionary switch and genetic convergence on rbcL following the evolution of C4 photosynthesis. Mol. Biol. Evol. 25, 2361–2368 (2008).
Galmes, J. et al. Environmentally driven evolution of Rubisco and improved photosynthesis and growth within the C3 genus Limonium (Plumbaginaceae). New Phytol. 203, 989–999 (2014).
Vogel, J., Borner, T. & Hess, W. Comparative analysis of splicing of the complete set of chloroplast group II introns in three higher plant mutants. Nucleic Acids Res. 27, 3866–3874 (1999).
Hao, D. C., Chen, S. L. & Xiao, P. G. Molecular evolution and positive Darwinian selection of the chloroplast maturase matK. J. Plant Res. 123, 241–247 (2010).
Logacheva, M. D., Penin, A. A., Samigullin, T. H., Vallejo-Roman, C. M. & Antonov, A. S. Phylogeny of flowering plants by the chloroplast genome sequences: In search of a “Lucky Gene”. Biochemistry 72, 1324–1330 (2007).
Krawczyk, K. & Sawicki, J. The uneven rate of the molecular evolution of gene sequences of DNA-dependent RNA polymerase I of the genus Lamium L. Int. J. Mol. Sci. 14, 11376–11391 (2013).
Zeng, S. et al. The complete chloroplast genome sequences of six Rehmannia species. Genes 8, 103 (2017).
Blazier, J. C. et al. Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement. Sci. Rep. 6, 24595 (2016).
Cavalier-Smith, T. Chloroplast evolution: Secondary symbiogenesis and multiple losses. Curr. Biol. 12, R62–R64 (2002).
Timme, R. E., Kuehl, J. V., Boore, J. L. & Jansen, R. K. A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: Identification of diverged regions and categorization of shared repeats. Am. J. Bot. 94, 302–312 (2007).
Kang, J. S., Lee, B. Y. & Kwak, M. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes. PLoS ONE 12(2), e0172924 (2017).
Yang, A. H., Zhang, J. J., Yao, X. H. & Huang, H. W. Chloroplast microsatellite markers in Liriodendron tulipifera (Magnoliaceae) and cross-species amplification in L. chinense. Am. J. Bot. 98(5), e123–e126 (2011).
Jiao, Y. et al. Development of simple sequence repeat (SSR) markers from a genome survey of Chinese bayberry (Myrica rubra). BMC Genomics. 13, 201 (2012).
Zhang, Y. et al. The complete chloroplast genome sequences of five Epimedium species: Lights into phylogenetic and taxonomic analyses. Front. Plant Sci. 7, 306 (2016).
Kuang, D. Y. et al. Complete chloroplast genome sequence of Magnolia kwangsiensis (Magnoliaceae): Implications for DNA barcoding and population genetics. Genome 54(8), 663–673 (2011).
Sonah, H., Deshmukh, R. K., Sharma, A., Singh, V. P. & Gupta, D. K. Genome-wide distribution and organization of microsatellites in plants: An insight into marker development in Brachypodium. PLoS ONE 6(6), e21298 (2011).
Knoop, V. When you can’t trust the DNA: RNA editing changes transcript sequences. Cell Mol. Life Sci. 68, 567–586 (2011).
Tsunewaki, K., Matsuoka, Y., Yamazaki, Y. & Ogihara, Y. Evolutionary dynamics of wheat mitochondrial gene structure with special remarks on the origin and effects of RNA editing in cereals. Genes Genet. Syst. 83(4), 301–320 (2008).
Sasaki, Y., Kozaki, A., Ohmori, A., Iguchi, H. & Nagano, Y. Chloroplast RNA editing required for functional acetyl-CoA carboxylase in plants. J. Biol. Chem. 276(6), 3937–3940 (2001).
Knill, T., Reichelt, M., Paetz, C., Gershenzon, J. & Binder, S. Arabidopsis thaliana encodes a bacterial-type heterodimeric isopropylmalate isomerase involved in both Leu biosynthesis and the Met chain elongation pathway of glucosinolate formation. Plant Mol. Biol. 71(3), 227–239 (2009).
He, Y. et al. A redox-active isopropylmalate dehydrogenase functions in the biosynthesis of glucosinolates and leucine in Arabidopsis. Plant J. 60(4), 679–690 (2009).
Yu, X. Q., Drew, B. T., Yang, J. B., Gao, L. M. & Li, D. Z. Comparative chloroplast genomes of eleven Schima (Theaceae) species: Insights into DNA barcoding and phylogeny. PLoS ONE 12, e0178026 (2017).
Shaw, J. et al. The tortoise and the hare II: Relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. Am. J. Bot. 92, 142–166 (2005).
Song, Y. et al. Comparative analysis of complete chloroplast genome sequences of two tropical trees Machilus yunnanensis and Machilus balansae in the family Lauraceae. Front. Plant Sci. 6, 662 (2015).
Greenberg, A. K. & Donoghue, M. J. Molecular systematics and character evolution in Caryophyllaceae. Taxon 60(6), 1637–1652 (2011).
Hebert, P. D. N., Cywinska, A., Ball, S. L. & DeWaard, J. R. Biological identifications through DNA barcodes. Proc. Biol. Sci. 270, 313–321 (2003).
Daniell, H., Lin, C. S., Yu, M. & Chang, W. J. Chloroplast genomes: Diversity, evolution and applications in genetic engineering. Genome Biol. 17(1), 134 (2016).
Tonti-Filippini, J., Nevill, P. G., Dixon, K. & Small, I. What can we do with 1000 plastid genomes?. Plant J. 90(4), 808–818 (2017).
Fior, S., Karis, P. O., Casazza, G., Minuto, L. & Sala, F. Molecular phylogeny of the Caryophyllaceae (Caryophyllales) inferred from chloroplast matK and nuclear rDNA ITS sequences. Am. J. Bot. 93, 399–411 (2006).
West, J. G. & Cowley, K. J. Colobanthus. In Flora of Victoria. Vol. 3, Dicotyledons Winteraceae to Myrtaceae (ed. Walsh, N. G. & Entwisle, T. J.) (Inkata Press, 1996).
Mantowani, A. & Vieira, R. C. Leaf micromorphology of Antarctic pearlwort Colobanthus quitensis (Kunth) Bartl. Polar Biol. 23, 531–538 (2000).
Kearse, M. et al. Geneious basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28(12), 1647–1649 (2012).
Lohse, M., Drechsel, O. & Bock, R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet 52, 267–274 (2007).
Kurtz, S. et al. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29, 4633–4642 (2001).
Mayer, C. Phobos Version 3.3.11. A tandem repeat search program. 2006–2010. https://www.rub.de/spezzoo/cm/cm_phobos.htm (2019).
Sablok, G. et al. ChloroMitoSSRDB 2.00: More genomes, more repeats, unifying SSRs search patterns and on-the-fly repeat detection. Database 2015, av084 (2015).
Darling, A. C. E., Mau, B., Blattner, F. R. & Perna, N. T. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14(7), 1394–1403 (2004).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Rozas, J. et al. DnaSP 6: DNA sequence polymorphism analysis of large datasets. Mol. Biol. Evol. 34, 3299–3302 (2017).
Mower, J. P. The PREP suite: Predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res. 37, W253–W259 (2009).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Huelsenbeck, J. P. & Ronquist, F. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17, 754–755 (2001).
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Sato, S., Nakamura, Y., Kaneko, T., Asamizu, E. & Tabata, S. Complete structure of the chloroplast genome of Arabidopsis thaliana. DNA Res. 6(5), 283–290 (1999).
Gurusamy, R., Lee, D. H. & Park, S. The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus. Mitochondrial DNA A DNA Mapp. Seq. Anal. 27(3), 2015–2017 (2016).
Yao, G. et al. Plastid phylogenomic insights into the evolution of Caryophyllales. Mol. Phylogenet. Evol. 134, 74–86 (2019).

Acknowledgements

The authors are grateful to the Royal Botanic Gardens, Victoria, Australia; the Australian National Botanic Gardens, Canberra; the Royal Botanic Gardens, Kew, UK; and the Royal Botanic Garden, Edinburgh, UK, for sharing the research material representing the genus Colobanthus. This research was supported by the National Science Centre, Poland (No. 2018/02/X/NZ8/02243). The funding agency had no role in the design of the experiment, the analysis and interpretation of data, or the writing of the manuscript.

Author information

Authors and Affiliations

Department of Plant Physiology, Genetics and Biotechnology, Faculty of Biology and Biotechnology, University of Warmia and Mazury in Olsztyn, ul. M. Oczapowskiego 1A, 10-719, Olsztyn, Poland
Piotr Androsiuk, Jan Paweł Jastrzębski, Łukasz Paukszto, Karol Makowczenko, Ryszard Górecki & Irena Giełwanowska
Department of Entomology, Phytopathology and Molecular Diagnostics, Faculty of Environmental Management and Agriculture, University of Warmia and Mazury in Olsztyn, ul. Prawocheńskiego 17, 10-720, Olsztyn, Poland
Adam Okorski & Agnieszka Pszczółkowska
Department of Agronomy, Warsaw University of Life Sciences-SGGW, ul. Nowoursynowska 166, 02-787, Warsaw, Poland
Katarzyna Joanna Chwedorzewska

Contributions

P.A. conceived and designed this study, acquired the research material, performed the repeat and phylogenetic analyses, conducted the comparative analyses, interpreted the data and wrote the manuscript; J.P.J., K.M. and Ł.P. performed genome assembly and annotation, assisted in comparative analyses, were responsible for figure preparation and GenBank submissions, and drew the cp genome map; A.O. and A.P. performed DNA extraction and were responsible for genome library preparation and DNA sequencing; K.J.C. collected C. lycopodioides, assisted in data interpretation and revised all versions of the manuscript; R.G. provided valuable comments on manuscript development and revised the manuscript; I.G. was responsible for seed germination and plant growth in the greenhouse, assisted in data interpretation and revised all versions of the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Piotr Androsiuk.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figure S1.

Supplementary Table S1.

Supplementary Table S2.

Supplementary Table S3.

Supplementary Table S4.

Supplementary Table S5.

Supplementary Table S6.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Androsiuk, P., Jastrzębski, J.P., Paukszto, Ł. et al. Evolutionary dynamics of the chloroplast genome sequences of six Colobanthus species. Sci Rep 10, 11522 (2020). https://doi.org/10.1038/s41598-020-68563-5

Received: 14 January 2020
Accepted: 25 June 2020
Published: 13 July 2020
DOI: https://doi.org/10.1038/s41598-020-68563-5

This article is cited by

The complete Chloroplast genome of Stachys geobombycis and comparative analysis with related Stachys species
- Ru Wang
- Zheng Lan
- Zhijun Deng
Scientific Reports (2024)

Cannabaceae comparative analysis based on plastid genome evolution
- Cristiane Barbosa D’Oliveira Matielo
- Geferson Fernando Metz
- Valdir Marcos Stefenon
Journal of Crop Science and Biotechnology (2024)

Genomic exploration of Sesuvium sesuvioides: comparative study and phylogenetic analysis within the order Caryophyllales from Cholistan desert, Pakistan
- Nida Javaid
- Musarrat Ramzan
- Abdurahman Hajinur Hirad
BMC Plant Biology (2023)

Complete chloroplast genomes of Cerastium alpinum, C. arcticum and C. nigrescens: genome structures, comparative and phylogenetic analysis
- Sylwia E. Milarska
- Piotr Androsiuk
- Irena Giełwanowska
Scientific Reports (2023)

Development of a molecular marker based on chloroplast gene for specific identification of Korean Hibiscus (Hibiscus syriacus ‘Simbaek’)
- Rongbo Wang
- Sang Yong Park
- Yeon-Ju Kim
Applied Biological Chemistry (2021)

