ALL Metrics
-
Views
Get PDF
Get XML
Cite
Export
Track
Genome Note

Draft genome sequence of a predatory bacterium from northern peatland soil

[version 1; peer review: awaiting peer review]
PUBLISHED 21 Oct 2025
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS AWAITING PEER REVIEW

Abstract

Predatory bacteria are abundant in soil, but their diversity and functions remain not fully understood, especially in subarctic regions. Here, we report strain 1-FT3.2, a predatory bacterium obtained from peatland soil in Northern Finland (Pallas, 68 °N). The bacterium was cultivated on Mucilaginibacter cryoferens FT3.2 as prey. Although a pure culture of strain 1-FT3.2 was not obtained, its draft genome was assembled from sequencing reads derived from the co-culture with its prey. The draft genome of 1-FT3.2 is 7.2 Mb in length and 81% complete. Genome analyses suggested that 1-FT3.2 belongs to the family Polyangiaceae (phylum Myxococcota), which comprises predatory bacteria. The genome annotation revealed (near-)complete metabolic modules of central carbon metabolism and aerobic respiration. Two proviral regions were predicted in the draft genome, both putatively representing tailed phages of the class Caudoviricetes. Several CRISPR-Cas system proteins were also identified. The draft genome sequence could be used in future comparative studies assessing the diversity of predatory bacteria in northern soils or other environments.

Keywords

subarctic soil, predatory bacteria, Myxococcota, Polyangiaceae, Mucilaginibacter cryoferens, short-read sequencing, bacterial genome

Introduction

Predatory bacteria are important players in microbial food webs (Hungate et al. 2021). Myxobacteria are a group of bacteria associated with the phylum Myxococcota, characterised by group predatory behaviour and a complex lifestyle, where rod-shaped vegetative cells can aggregate into multicellular fruiting bodies and produce spores (Saggu et al. 2023). Myxobacteria are globally distributed and especially abundant in soil (Zhou et al. 2014; Wang et al. 2021). Together with other micropredators, myxobacteria play leading roles in carbon sequestration and mineralization in soil (Lueders et al. 2006). Moreover, myxobacteria may dominate among other potential bacterivores and have been suggested to represent one of the keystone taxa in soil microbial food webs (Petters et al. 2021). Still, more data are needed to resolve their taxonomic diversity as well as metabolic and lifestyle capacities across environments, including relatively underexplored subarctic regions.

Since soil microbial communities are highly diverse, obtaining complete genomes through metagenomics may be a challenging task (Anthony et al. 2024). Cultivating soil microbes makes it possible to reconstruct their genome sequences reliably and link genetic information to the observed phenotype. In this study, we obtained strain 1-FT3.2, a predatory bacterium from northern peatland soil in the Pallas region, Finland, using Mucilaginibacter cryoferens FT3.2 (Kumar et al. 2025) as prey. M. cryoferens, recently described as a new species, was isolated from Arctic tundra soils in the Kilpisjärvi region, Finland, where it may play important roles in litter decomposition and carbon recycling together with other Mucilaginibacter species (Männistö et al. 2009; Kumar et al. 2025). Strain 1-FT3.2 remained in a mixed culture with its prey, but the analyses of its draft genome sequence obtained from the co-culture suggest that it belongs to the Polyangiaceae family.

Methods

Soil sampling, isolation and cultivation conditions

A soil sample was collected from peatland in the Pallas area, Northern Finland, in September 2022 (N67°59’ E24°13’, Figure 1A). The vegetation was mainly sedges ( Figure 1B). The sample was collected from a depth of 5 cm with sterile instruments and stored at 4°C. The pure culture of Mucilaginibacter cryoferens FT3.2 (Kumar et al. 2025), was used as the prey for isolating predatory bacteria from the soil sample. Bacteria were cultivated using R2A medium (Neogen, NCM0188A), which contained 0.5 g L−1 yeast extract, 0.5 g L−1 meat peptone, 0.5 g L−1 casamino acid, 0.5 g L−1 glucose, 0.5 g L−1 starch, 0.3 g L−1 K2PO4, 0.05 g L−1 MgSO4, and 0.3 g L−1 C3H3NaO3, and was adjusted to pH 6. For solid and top agar, 15 and 4 g L−1 of agar (Sigma-Aldrich, A4550) were added, respectively. The cultures were grown aerobically at room temperature (RT).

51486b55-ae62-42eb-9818-c2012fdc5095_figure1.gif

Figure 1. (A, B) Sampling location, Pallas, and (C) strain 1-FT3.2 growing on plate.

In (A), Kilpisjärvi, the original isolation location for the prey strain, Mucilaginibacter cryoferens FT3.2, is additionally shown. Map modified from Wikimedia Commons (NordNordWest). In (C), a representative plate with lysis zones on the M. cryoferens FT3.2 lawn after 14 days of incubation is shown, scale bar, 1 cm.

For the isolation, 5 g of the soil sample (wet weight) was resuspended in 50 ml of R2A broth and incubated on a shaker (~200 rpm) at RT for two weeks for the sample enrichment. The enriched sample was centrifuged (ThermoScientific F15-6x100y, 30 min, 2,500 g, 20°C) and 100 μl of non-diluted supernatant plated with 300 μl of the M. cryoferens FT3.2 liquid culture and 3 ml of R2A soft agar (46°C) as a top layer on R2A solid agar plates. The plates were incubated aerobically at RT. The observed growth inhibition/lysis zone was picked up with a sterile pipette tip, resuspended in R2A broth, and plated in a top agar layer as before, which was repeated three consecutive times.

DNA extraction and sequencing

The top agar layers of the semi-confluent plates were collected and resuspended in R2A broth (3 ml per plate), incubated with shaking (~200 rpm) at RT for one hour and centrifuged (ThermoScientific F15-6x100y, 30 min, 10,000 g, 4°C). The supernatant was collected and stored at 4°C. The stock titers were determined by plating serial dilutions in a top agar layer as described above. DNA was extracted with the GeneJET Genomic DNA Purification Kit (Thermo Scientific, K0721) using the manufacturer’s protocol for Gram-negative bacteria and 20 ml of the agar stock as input. Note that the agar stocks contained cells from both M. cryoferens FT3.2 and the new strain.

For sequencing, 100 ng of genomic DNA was converted to a sequencing library using the Illumina DNA prep. Samples were dual indexed using the sequencing core unit’s own Nextera primers. Seven cycles were used in the PCR step and DNA was pooled and purified using Illumina’s SPB bead purification. The Library pool was sequenced at 12 pM on the AVITI sequencer (Element Biosciences) using the AVITI 2x150 Sequencing kit Cloudbreak FreeStyle High Output. Sequencing was performed at the DNA Sequencing and Genomics Laboratory (supported by HiLIFE and Biocenter Finland funding), Institute of Biotechnology, University of Helsinki.

Genome annotation

FastQC v. 0.11.9 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used to assess the quality of reads. Raw reads were trimmed and adaptors removed with Cutadapt v. 2.7 (-m 50 --nextseq-trim 20) (Martin 2011). Read-based taxonomic profiling was performed using PhyloFlash v. 3.4.2 and SILVA138.1.eukmod database (Gruber-Vodicka et al. 2020). Since the sample contained a mixed culture of the prey strain M. cryoferens FT3.2 and a new potentially predatory strain, SPAdes v. 3.15.5 was used in the --meta mode for genome assembly (Bankevich et al. 2012). BBTools Stats was used for assessing the assembly statistics, Reformat for sorting scaffolds by their GC content, and Dedupe for dereplicating scaffolds (minidentity = 95 absorbrc = t absorbmatch = t sort = length) (sourceforge.net/projects/bbmap/). The full-length SSU rRNA gene sequences obtained from the PhyloFlash run and the assembled scaffolds of ≥10 kbp in length were searched with BLASTN (Altschul et al. 1990) against the NCBI nt database using an E-value cutoff of 0.001. The quality of the draft genome of a new strain was assessed with CheckM2 v. 1.0.1 (Chklovski et al. 2023). GTDB-Tk v. 2.3.2 with GTDB release 226 database (Chaumeil et al. 2022) was used for assigning a taxonomic classification. For the genome annotation, DRAM v. 0.1.2 (Shaffer et al. 2020) was used at KBase (Arkin et al. 2018). Putative (pro)viral sequences were predicted by geNomad v. 1.7 (Camargo et al. 2023) and their quality and completeness were assessed with CheckV v. 0.8.1 (Nayfach et al. 2021). Bowtie2 v. 2.5.3 was used for the additional mapping of reads to putative viral sequences (Langmead and Salzberg 2012).

Results

Isolation

After about two weeks of incubating the plates, growth inhibition/lysis areas of 4-5 mm were observed. In subsequent platings, the size of lytic zones reached up to ~1 cm ( Figure 1C). The central parts of these zones were clear, while edges were hazier. Agar stocks produced lysis zones on the M. cryoferens FT3.2 lawn when diluted up to 10000-fold, but no lysis zones could be observed when titrating filtered stocks (0.22 and 0.45 μm PES LLG-Syringe filters Spheros), suggesting that the origin of the observed lytic zones was not viral. Very small, almost transparent or whitish colonies growing over the lysis zones were observed ( Figure 1C), but no aggregated structures like fruiting bodies were seen. Despite our attempts, these tiny colonies could not be transferred to a fresh plate for independent growth. An alternative cultivation approach using the myxobacterium-suited CY-C10 medium ((Karwowski et al. 1996) modified by omitting antibiotics) and higher incubation temperature (28°C) for stock titration did not improve colony growth visibility. We named the strain causing lytic zones on M. cryoferens FT3.2 as 1-FT3.2.

Genome sequencing and assembly

Sequencing genomic DNA of a mixed culture resulted in 245,936,278 raw read pairs (150 bp + 150 bp), of which 245,436,350 pairs were retained after read trimming and quality control. With the read-based profiling by PhyloFlash, 225,532 reads (0.092% of all reads) could be mapped to SSU rRNA sequences in SILVA database. Of the mapped reads, 212,996 (94%) were assigned to the order Sphingobacteriales (Bacteroidota), where the genus Mucilaginibacter belongs to, and 9,050 (4%) were assigned to the order Polyangiales (Myxococcota). The rest of the hits constituted less than 0.01% of mapped reads each. Thus, read-based profiling suggested two strains present in the sample, comprising about 98% of reads together. Furthermore, full-length SSU rRNA gene sequences assembled by SPAdes, matched to SILVA database, were only two OTUs with the closest-matching references of Mucilaginibacter sp. M20-56 (Sphingobacteriales; GenBank acc. no.: KP899210.1, 99% id., 100% cov.) and Phaselicystis metagenome (Polyangiales; GenBank acc. no.: FPLS01001412.1, 95% id., 99% cov.). Additional BLASTN searches of the two detected OTUs against the NCBI nt database resulted in hits to 16S rRNA gene sequences of Mucilaginibacter sp. strain FT3.2 (100% id., 100% cov., 0 E-value) and the members of the order Polyangiales (the genera Minicystis, Sorangium, Chondromyces, Labilithrix, Polyangium, and uncultured bacterium, 91-92% id., 100% cov., 0 E-value), respectively.

The assembly of the mixed culture consisted of 6,318 scaffolds, of which 140 scaffolds were longer than 10 kbp and represented 95% of the total length of all scaffolds ( Table 1). Most scaffolds longer than 10 kb were characterised by a GC content of either 41-43% (71 scaffolds) or 64-66% (56 scaffolds) ( Figure 2). The Mucilaginibacter cryoferens FT3.2 genome GC content is known to be 42.1 % (Genbank acc. no. CP183228.1). Therefore, 56 scaffolds with a GC content of 64-66% were separated from the rest of the assembly, representing strain 1-FT3.2. Dedupe run confirmed the non-redundancy of the assembled draft genome. The total length of the 1-FT3.2 draft genome was 7,202,438 bp with the scaffolds ranging from 13,622 to 664,534 bp in length ( Table 2). Based on the CheckM2 assessment, the genome is 81.3% complete and 0.5% contaminated.

Table 1. Statistics for the mixed culture assembly, listed as of different minimal scaffold length thresholds.

Minimum scaffold length, bpNumber of scaffoldsNumber of contigsTotal scaffold length, bpTotal contig length, bp Scaffold contig coverage, %
506,3186,36915,040,30515,035,38599.97
1001,4561,50714,717,70014,712,78099.97
25029935014,552,68114,547,76199.97
50022227114,529,29114,524,39199.97
1,00020125014,513,76114,508,86199.97
2,50016821714,459,05814,454,15899.97
5,00015520314,409,14414,404,34499.97
10,00014018814,302,88314,298,08399.97
25,00011616113,895,32613,890,82699.97
50,0009313613,006,06413,001,76499.97
100,000568910,405,94110,402,64199.97
250,0006122,322,2732,321,67399.97
500,00013664,534664,33499.97
51486b55-ae62-42eb-9818-c2012fdc5095_figure2.gif

Figure 2. The distribution of GC content across assembled scaffolds longer than 10 kbp.

Table 2. 1-FT3.2 draft genome features.

Feature
Number of scaffolds56
Total length, bp7,202,438
Longest scaffold, bp664,534
Contig N50, bp171,520
Completeness, %81.29
Contamination, %0.5
Total number of coding sequences (CheckM2)5,629
GC content, %65
Number of tRNA genes67
Number of proviruses2

Genome classification and annotation

In the BLASTN search, the 1-FT3.2 draft genome scaffolds recruited numerous hits to sequences representing the phylum Myxococcota. GTDB-Tk run on the draft genome suggested classifying 1-FT3.2 within the family Polyangiaceae, order Polyangiales, class Polyangia, phylum Myxococcota. With DRAM, no rRNA encoding genes were identified in the draft genome scaffolds. DRAM-based annotations ( Figure 3) revealed a few complete metabolic modules: pentose phosphate cycle, citrate cycle (TCA cycle), glyoxylate cycle, cytochrome c oxidase, and F-type ATPase, as well as a near-complete (8/9) glycolysis module, suggesting robust central carbon metabolism and aerobic respiration. Also, arsenate reductase (glutaredoxin), acetyl-CoA synthetase, acetate kinase, and alcohol dehydrogenase were predicted, but no CAZy enzymes. The incomplete nature of the draft genome sequence precludes full understanding of metabolic capacities or the lack of those in 1-FT3.2. Among other DRAM predictions, several different CRISPR-Cas system proteins were identified (Cas1, Cas2, Cas3, CasA, CasB, CasC, CasD, CasE, Cmr1, Cmr2, Cmr3, Cmr4, Cmr5, and Cmr6). About 36% of all predicted proteins had no significant hits to any DRAM database.

51486b55-ae62-42eb-9818-c2012fdc5095_figure3.gif

Figure 3. Metabolic functions of 1-FT3.2 strain predicted with DRAM.

Using geNomad with all scaffolds from the mixed-culture assembly resulted in the prediction of two proviral sequences on scaffolds that belonged to the 1-FT3.2 draft genome: at coordinates 58-30,107 nt in NODE_10_length_239568_cov_144.087670 and 23,906-79,669 nt in NODE_67_length_79671_cov_165.578627. These proviral elements were medium-quality (80 and 53 % complete, respectively) and both assigned as tailed phages within the class Caudoviricetes. In addition, three other short scaffolds (0.2, 5.4, and 6.9 kbp), were identified as viral by geNomad, although the presence of viral genes could be confirmed by CheckV only for one of them. Mapping reads to these three short scaffolds resulted in an overall alignment rate of only 0.00002%, confirming that the nature of the observed lysis zones is unlikely to be viral.

Conclusions

The genome analysis of 1-FT3.2, the new predatory bacterium strain reported here, placed it within the family Polyangiaceae (Myxococcota). Members of this family are terrestrial isolates mainly from soil and plant decay material, characterised by large genomes and high GC content, with some strains being able to degrade cellulose and produce various secondary metabolites (Garcia and Müller 2014). Polyangiaceae representatives are rarely isolated from subarctic soils (Dawid 2000). The draft genome sequence of 1-FT3.2 could be used in future comparative studies aiming to resolve the diversity of the family Polyangiaceae and/or more broadly, predatory bacteria residing in subarctic soils. Although the reported genome is incomplete, it still contributes to increasing the sequenced space of the soil microbiome. Having the strain available for future laboratory studies makes it possible to explore its lifestyle and metabolic capacities in more detail.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 21 Oct 2025
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Demina T, Ihonen R, Männistö MK and Hultman J. Draft genome sequence of a predatory bacterium from northern peatland soil [version 1; peer review: awaiting peer review]. F1000Research 2025, 14:1153 (https://doi.org/10.12688/f1000research.171863.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status:
AWAITING PEER REVIEW
AWAITING PEER REVIEW
?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 21 Oct 2025
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.