Host DNA Depletion in Clinical Metagenomic Sequencing: Emerging Strategies, Ecological Insights, and the Hidden Challenge of Microbial Signal Recovery

An Duy Duong

doi:10.25163/microbbioacts.5210708

Microbial Bioactives

Microbial Bioactives | Online ISSN 2209-2161

402

Citations

293k

Views

187

Articles

Submit

Volume 5 Number 2 2022

Figures and Tables

REVIEWS (Open Access)

Previous Next Contents Vol 5 (2)

Host DNA Depletion in Clinical Metagenomic Sequencing: Emerging Strategies, Ecological Insights, and the Hidden Challenge of Microbial Signal Recovery

An Duy Duong¹*

+ Author Affiliations

Microbial Bioactives 5 (2) 1-8 https://doi.org/10.25163/microbbioacts.5210708

Submitted: 20 January 2022 Revised: 12 March 2022 Published: 23 March 2022

Abstract

Metagenomic next-generation sequencing (mNGS) has increasingly positioned itself as one of the more promising approaches for broad-spectrum infectious disease diagnostics. Yet the field continues to encounter a surprisingly persistent obstacle: the overwhelming dominance of host-derived nucleic acids within clinical samples. In many specimen types, particularly blood, cerebrospinal fluid, and respiratory aspirates, human DNA can comprise more than 99% of total extracted genetic material, leaving diagnostically relevant microbial reads buried within an immense genomic background. This review explores how the challenge of host DNA depletion intersects with concepts long recognized in environmental microbiology, microbial ecology, and natural-product discovery. Drawing from studies involving Streptomyces bioprospecting, marine microbial symbioses, biosynthetic gene cluster mining, and computational dereplication, the review examines how selective enrichment, sequence-guided discovery, and bioinformatic filtering collectively shape modern metagenomic workflows. Emerging depletion strategies—including differential lysis, enzymatic digestion, density-based separation, and computational host-read removal—are discussed alongside their limitations, particularly regarding viral nucleic acid preservation and interpretative ambiguity. The review further argues that host-associated material may not simply represent analytical contamination but instead reflects biologically dynamic ecological systems that influence microbial adaptation and pathogenicity. Overall, the findings suggest that the future of clinical metagenomics may depend less on sequencing depth alone and more on integrated ecological, computational, and molecular strategies capable of selectively recovering meaningful microbial signals from overwhelmingly complex genomic environments.

Keywords: Clinical metagenomics; Host DNA depletion; Metagenomic next-generation sequencing; Streptomyces; Biosynthetic gene clusters; Bioinformatic dereplication; Microbial dark matter

1. Introduction

Metagenomic next-generation sequencing (mNGS) has, over the last decade, moved from being largely an exploratory research technology to something far more clinically ambitious. What once served mainly environmental microbiologists searching for uncultured organisms in oceans, soils, or hypersaline ecosystems is now increasingly viewed as a universal diagnostic framework capable of detecting pathogens directly from patient samples. The appeal is obvious: rather than relying on targeted assays or culture-dependent techniques, clinicians can theoretically identify bacteria, viruses, fungi, and parasites simultaneously from a single specimen, even when the causative organism is unexpected or previously uncharacterized. Yet despite this promise, the practical implementation of clinical metagenomics remains constrained by a deceptively simple biological reality—the overwhelming abundance of host DNA in clinical samples.

In most clinical specimens, particularly blood, bronchoalveolar lavage fluid, cerebrospinal fluid (CSF), and tissue biopsies, human nucleic acids dominate the sequencing landscape. In some cases, host-derived DNA accounts for more than 99% of total recovered genetic material, leaving only a minute fraction attributable to microbial organisms. This imbalance creates a major analytical bottleneck because sequencing instruments devote the majority of reads to the human genome rather than to the pathogen of interest. Consequently, diagnostically relevant microbial sequences may remain undetected unless sequencing depth is increased dramatically, an approach that is expensive, computationally burdensome, and not always feasible in routine clinical settings.

The challenge resembles what microbiologists have long described as the “Great Plate Count Anomaly,” where only a small proportion of environmental microorganisms can be cultivated under laboratory conditions (Mohr, 2018). In a similar sense, clinical metagenomics often struggles with a “signal buried in noise” problem. Rare pathogen sequences must compete against billions of human base pairs, making low-abundance infections particularly difficult to detect. Hug et al. (2018) referred to related obstacles in environmental microbiology as the problem of “undesirable background,” where dominant non-target organisms obscure rare or novel taxa. Although originally discussed in the context of actinomycete discovery, the conceptual parallel to clinical sequencing is striking. In both fields, researchers are essentially searching for biologically meaningful traces hidden within overwhelmingly complex mixtures.

The urgency of overcoming this limitation has become increasingly apparent as infectious diseases grow more complex and antimicrobial resistance continues to escalate globally. Conventional diagnostic workflows, while still indispensable, often suffer from delayed turnaround times, low sensitivity after antibiotic exposure, or dependence on prior clinical suspicion. Culture-based identification, for instance, may fail entirely for fastidious, intracellular, or slow-growing pathogens (Vartoukian et al., 2010). Meanwhile, the broader antimicrobial resistance crisis has intensified the demand for rapid, unbiased pathogen detection platforms capable of guiding precision therapy (Aminov, 2010). Clinical metagenomics emerged partly in response to this need, promising a more comprehensive view of infectious disease ecology within the host.

At the same time, advances in microbial genomics and natural-product discovery have demonstrated how much biological information remains hidden within uncultured microbial communities. Studies involving rare actinomycetes, marine microorganisms, and myxobacteria have repeatedly shown that untapped microbial diversity harbors enormous biochemical potential (Genilloud et al., 2011; Subramani & Aalbersberg, 2013). Genome sequencing efforts, including the landmark sequencing of Streptomyces coelicolor, further revealed that many microbial biosynthetic pathways remain silent or poorly expressed under standard laboratory conditions (Bentley et al., 2002). These discoveries reinforced a broader scientific realization: microbial communities are vastly more complex than previously appreciated, and conventional methods capture only a fraction of their functional diversity.

This realization has directly influenced the evolution of metagenomic diagnostics. Clinical sequencing is no longer viewed solely as a pathogen-identification tool; increasingly, it is also considered a means of understanding microbial ecology, virulence potential, resistance determinants, and host–microbe interactions. However, the success of such approaches depends heavily on efficient enrichment of microbial nucleic acids. Without adequate host depletion, valuable information regarding microbial diversity, low-copy-number pathogens, or secondary metabolite pathways may remain inaccessible.

Several laboratory strategies have therefore been developed to address host DNA interference. One widely adopted approach involves differential lysis, where human cells are selectively disrupted while microbial cells remain intact. Following host-cell lysis, nucleases such as DNase I are used to degrade exposed human DNA before microbial nucleic acids are extracted. Similar enzymatic cleanup strategies have been employed in environmental sequencing workflows to enrich biologically active microbial fractions (Edgcomb & Bernhard, 2013). In clinical applications, saponin-based lysis combined with nuclease digestion has shown encouraging results, particularly for respiratory and blood-derived samples. Still, these methods are not without trade-offs. Aggressive depletion protocols may inadvertently damage fragile microbial cells or degrade viral nucleic acids, thereby reducing diagnostic sensitivity for certain pathogens.

Physical separation techniques have also gained attention as alternatives or complements to enzymatic depletion. Filtration systems, density-gradient centrifugation, and microfluidic devices attempt to separate microbial cells from host material based on size, morphology, or biophysical properties. Although conceptually appealing, these approaches often face practical limitations when dealing with polymicrobial infections or samples containing highly variable microbial loads. Nevertheless, continued refinement of these methods suggests that integrated workflows combining chemical, enzymatic, and physical separation may eventually provide more robust depletion efficiencies.

Beyond laboratory-based enrichment, bioinformatic strategies have become increasingly central to host DNA management. High-throughput sequencing generates enormous datasets, and computational dereplication pipelines are now routinely used to identify and remove host-derived reads after sequencing. Macintyre et al. (2014) emphasized the importance of dereplication and metabolomic filtering approaches in handling complex microbial datasets, particularly within marine symbiont research. In clinical metagenomics, analogous computational frameworks rapidly map sequencing reads against human reference genomes, allowing non-human sequences to be isolated for downstream analysis. While such approaches do not physically reduce host DNA before sequencing, they substantially improve interpretability and analytical efficiency.

The growing sophistication of genome-mining tools has further expanded the possibilities of clinical metagenomics. Platforms such as antiSMASH, PRISM, and ClusterFinder—originally developed for natural-product discovery—can now identify biosynthetic gene clusters, resistance genes, and virulence-associated pathways from metagenomic data (Chen et al., 2019). These tools highlight an important conceptual shift within the field. Clinical sequencing is gradually evolving from simple taxonomic detection toward functional characterization of microbial communities. In chronic infections, for example, the presence of silent or low-expression microbial genes may provide clinically relevant insights even when active pathogen replication is minimal.

Interestingly, emerging developments in biomaterials and nanotechnology may also contribute to future host depletion strategies. Guo et al. (2020) demonstrated that scaffold topography and nanomaterial properties can strongly influence cellular interactions and molecular binding dynamics. Although these concepts were explored primarily in regenerative medicine, they raise intriguing possibilities for metagenomic sample preparation. Functionalized nanomaterials capable of selectively binding human DNA, immune-cell fragments, or microbial cell walls could theoretically enhance microbial enrichment before sequencing. While such applications remain largely hypothetical, they reflect the increasingly interdisciplinary nature of modern metagenomics.

Another challenge receiving growing attention is the preservation of viral nucleic acids during host depletion workflows. Many existing depletion methods were optimized primarily for bacterial DNA enrichment, yet clinical infections are often polymicrobial and may involve RNA viruses, DNA viruses, fungi, or mixed microbial communities. Excessive nuclease treatment or harsh extraction conditions can disproportionately affect fragile viral genomes, potentially introducing diagnostic bias. Consequently, researchers are increasingly attempting to balance depletion efficiency with nucleic acid preservation, recognizing that no single method performs optimally across all pathogen classes.

Moreover, the interpretation of metagenomic findings continues to raise difficult clinical questions. Detecting microbial sequences does not necessarily establish causation, particularly when low-abundance organisms or commensal microbiota are involved. Distinguishing contamination, colonization, and true infection remains a major unresolved issue in clinical metagenomics. This uncertainty becomes even more pronounced when sequencing reveals “silent” microbial markers—genetic signatures present within the sample but not actively expressed. Whether such markers represent dormant infection, residual microbial DNA, or clinically irrelevant background remains an active area of investigation.

Taken together, these challenges underscore why host DNA depletion has become one of the defining technical priorities in clinical metagenomic sequencing. The field is progressing rapidly, yet many workflows remain insufficiently standardized, especially across diverse sample types and pathogen categories. What seems increasingly clear, however, is that successful clinical metagenomics will likely depend not on a single depletion technology but on integrated, multi-layered strategies combining selective enrichment, computational dereplication, and advanced molecular analytics.

As sequencing technologies become faster, cheaper, and more accessible, the importance of optimizing host depletion will only intensify. Clinical metagenomics sits at an unusual intersection of microbiology, genomics, bioinformatics, and translational medicine. Its future may ultimately depend on how effectively these disciplines converge to solve a persistent problem: recovering meaningful microbial signals from an overwhelming excess of host genetic material.

2. Materials and Methods

2.1 Study Design and Conceptual Framework

This narrative review was designed to synthesize current knowledge regarding host DNA depletion strategies in clinical metagenomic sequencing while integrating perspectives from environmental microbiology, microbial ecology, natural-product discovery, and bioinformatic genome mining. Rather than functioning as a conventional systematic review with strict quantitative inclusion metrics, the present work adopted an interpretative narrative approach intended to explore conceptual, technical, and translational developments shaping the evolving field of metagenomic next-generation sequencing (mNGS). The review particularly focused on the biological and analytical challenges associated with excessive host-derived nucleic acids in clinical specimens and how strategies originally developed for environmental bioprospecting may provide useful frameworks for clinical host depletion workflows.

The conceptual basis of the review emerged from the growing overlap between environmental microbiology and clinical metagenomics. Earlier studies involving unculturable microorganisms, rare actinomycetes, and marine microbial symbionts demonstrated that biologically meaningful microbial signals are often concealed within highly complex ecological backgrounds (Mohr, 2018; Subramani & Aalbersberg, 2013). Similarly, clinical mNGS workflows frequently struggle to recover pathogen-derived sequences from samples dominated by human DNA. Accordingly, this review sought to examine how ecological filtering, selective enrichment, bioinformatic dereplication, and sequence-guided discovery have evolved across both disciplines.

2.2 Literature Search Strategy

The literature used in this review was identified through a structured but flexible search strategy involving multidisciplinary scientific databases, including PubMed, Scopus, Web of Science, and Google Scholar. Additional references were identified through citation chaining from highly relevant review articles and foundational studies. The search process focused on publications related to host DNA depletion, metagenomic sequencing, microbial dark matter, biosynthetic gene cluster discovery, environmental microbiology, and bioinformatic dereplication tools.

Search terms included combinations of keywords such as “host DNA depletion,” “clinical metagenomics,” “metagenomic next-generation sequencing,” “biosynthetic gene clusters,” “Streptomyces,” “marine microbiomes,” “bioinformatic dereplication,” “antiSMASH,” “PRISM,” “sequence-guided discovery,” and “unculturable microorganisms.” Additional targeted terms such as “Great Plate Count Anomaly,” “differential lysis,” “microbial dark matter,” and “silent biosynthetic gene clusters” were also incorporated to capture studies addressing broader ecological and methodological concepts relevant to metagenomic enrichment and interpretation.

Priority was generally given to peer-reviewed studies published between 2000 and 2022, although several earlier landmark studies were retained because of their foundational importance to microbial genomics and cultivation-independent microbiology. For example, the genome sequencing study of Streptomyces coelicolor by Bentley et al. (2002) and discussions surrounding unculturable microorganisms by Vartoukian et al. (2010) and Stewart (2012) were considered essential for contextualizing the historical evolution of microbial discovery methods.

2.3 Inclusion and Exclusion Criteria

Studies were included if they addressed at least one of the following themes: (i) host DNA depletion strategies in clinical sequencing, (ii) environmental microbial enrichment and selective cultivation methods, (iii) metagenomic natural-product discovery, (iv) biosynthetic gene cluster identification, (v) bioinformatic genome-mining tools, or (vi) host-associated microbiome ecology. Both experimental and review-based studies were considered eligible if they contributed meaningful conceptual or methodological insight relevant to the objectives of this narrative review.

Studies focusing exclusively on unrelated sequencing technologies, non-microbial genomic systems, or purely clinical case reports without methodological discussion were excluded. Similarly, publications lacking sufficient methodological detail or clear relevance to metagenomic enrichment, microbial ecology, or bioinformatic analysis were not incorporated into the final synthesis.

Because the present work aimed to provide conceptual integration rather than statistical meta-analysis, methodological heterogeneity among the included studies was considered acceptable. The review intentionally incorporated research spanning environmental microbiology, marine ecology, computational biology, natural-product chemistry, and translational infectious disease diagnostics in order to capture the interdisciplinary nature of contemporary metagenomics.

2.4 Data Extraction and Thematic Synthesis

Relevant information from the selected literature was extracted manually and organized into thematic categories corresponding to major areas of investigation. These categories included: (i) biological background interference, (ii) selective enrichment and differential lysis strategies, (iii) functional versus sequence-guided screening approaches, (iv) bioinformatic dereplication and genome mining, and (v) host-associated microbial ecosystems.

Comparative synthesis tables were developed to summarize key findings regarding cultivation strategies, secondary metabolite discovery, computational pipelines, and host-associated microbiome diversity. These tables were designed not only to organize methodological information but also to illustrate conceptual parallels between environmental bioprospecting and clinical metagenomic diagnostics. Particular attention was given to studies involving Streptomyces, marine microbial symbioses, and computational biosynthetic gene cluster prediction tools such as antiSMASH, PRISM, ClusterFinder, and NaPDos (Chen et al., 2019; Mohimani et al., 2018).

2.5 Narrative Interpretation and Analytical Scope

Unlike quantitative systematic reviews, narrative reviews inherently involve interpretative synthesis. Therefore, the present study emphasized conceptual relationships, methodological trends, and emerging interdisciplinary connections rather than pooled statistical outcomes. The review particularly explored how ecological theories originally associated with microbial cultivation and environmental metagenomics may inform future host DNA depletion strategies in clinical sequencing workflows. In addition, the analysis considered broader translational implications involving antimicrobial resistance, microbial ecology, pathogen detection sensitivity, and silent biosynthetic gene cluster activation (Aminov, 2010; Montiel et al., 2015). The integration of environmental and clinical perspectives was intended to provide a more holistic understanding of host-associated microbial systems and the evolving technological frameworks used to interrogate them.

Overall, this methodology was designed to support a comprehensive and conceptually integrative evaluation of host DNA depletion strategies within the rapidly expanding landscape of clinical metagenomics.

3. Marine and Terrestrial Streptomyces Research

3.1 The Bioprospecting Paradox Across Terrestrial and Marine Ecosystems

The genus Streptomyces has long occupied a near-mythic position in natural product research, particularly because of its extraordinary capacity to produce bioactive secondary metabolites. Yet despite decades of investigation, interest in this microbial group has not diminished. If anything, recent discoveries suggest that the biosynthetic potential of Streptomyces may still be far from exhausted. Donald et al. (2022) reported that between 2015 and 2020 alone, approximately 121 Streptomyces species yielded 279 newly identified natural products, reaffirming the genus as one of the most prolific sources of antibiotic and pharmacologically relevant compounds. Still, there remains a subtle but persistent uncertainty within the field. Researchers increasingly question whether ongoing bioprospecting efforts are uncovering genuinely novel chemistry or merely rediscovering structurally related metabolites from increasingly remote ecological niches.

Traditionally, terrestrial ecosystems have served as the dominant reservoir for Streptomyces exploration. Soil-

Figure 1: Streptomyces as a Persistent Reservoir of Novel Natural Products, Bioprospecting Trends, Ecological Shifts, and Emerging Frontiers in Secondary Metabolite Discovery. Overview of the enduring importance of Streptomyces in natural product discovery, highlighting its prolific biosynthetic capacity and the continued identification of novel bioactive metabolites between 2015 and 2020.

Table 1. Isolation and Cultivation Strategies for Novel Streptomyces Strains. This table summarizes selective cultivation approaches used to isolate rare and environmentally adapted Streptomyces species from diverse ecological niches, including deserts, hypersaline soils, mine sediments, and insect-associated microbiomes. The compiled data highlight how pH adjustment, salinity modulation, and selective additives such as cycloheximide, nalidixic acid, and potassium dichromate function as ecological filters that suppress competing microorganisms while enriching resilient actinomycetes with potential biosynthetic significance.

Isolate Name	Sample Nature	Isolation Medium	pH/Salinity	Additives/Antibiotics	Country	Reference
S. boncukensis	Saltern soil	Starch Casein agar	pH 7.0–7.2	Cycloheximide, 3% NaCl	Turkey	Donald et al. (2022)
S. taklimakanensis	Desert	Gauze’s No. 1	—	Nystatin, Nalidixic acid	China	Donald et al. (2022)
S. fodineus	Acidic mine soil	Starch Casein Agar	pH 5	Cycloheximide, Nystatin	Korea	Donald et al. (2022)
S. dengpaensis	Desert soil	ISP 7	—	K₂Cr₂O₇, Ca propionate	China	Donald et al. (2022)
S. xiangtanensis	Manganese mine soil	Gause’s synthetic 1	pH 7.2	K₂Cr₂O₇ solution	China	Donald et al. (2022)
S. arcticus	Frozen soil	Mineral agar 1	—	K₂Cr₂O₇ (50 mg/L)	Arctic region	Donald et al. (2022)
S. canalis	Hypersaline soil	B7 medium	—	1.5% (w/v) NaCl	China	Donald et al. (2022)
S. capitiformicae	Head of an ant	Sodium succinate medium	pH 7.2	Cycloheximide, Nalidixic acid	China	Donald et al. (2022)
S. lasiicapitis	Head of an ant	HV agar	—	Cycloheximide	China	Donald et al. (2022)
S. sporangiiformans	Mount Song soil	HV agar	—	Cycloheximide, Nalidixic acid	China	Donald et al. (2022)

derived actinomycetes continue to account for a substantial proportion of newly described species and metabolites, with nearly 80% of isolates still originating from land-based environments (Donald et al., 2022). However, the nature of terrestrial sampling has evolved considerably. Conventional agricultural soils, once heavily relied upon during the “golden age” of antibiotic discovery, are now considered comparatively overexplored. Contemporary research instead gravitates toward ecologically stressed or extreme habitats where microbial competition may drive unusual metabolic adaptation. Hyper-arid deserts, heavy-metal-contaminated mining zones, volcanic sediments, and saline soils have become increasingly attractive targets for microbial prospecting (Okoro et al., 2009). Figure 1 summarizes the sustained significance of Streptomyces as a major source of bioactive natural products, while also illustrating current ecological and bioprospecting trends shaping modern secondary metabolite discovery.

This shift is partly motivated by the enduring challenge known as the “Great Plate Count Anomaly,” which highlights the reality that the overwhelming majority of environmental microorganisms remain unculturable using standard laboratory conditions (Mohr, 2018; Stewart, 2012). In many ecosystems, less than 1% of microbial diversity can be readily cultivated. Consequently, researchers have adopted increasingly selective isolation strategies designed to suppress fast-growing bacterial competitors while enriching resilient, spore-forming actinomycetes. Methods such as dry heat pretreatment, phenol exposure, and selective nutrient limitation have become common practice in Streptomyces isolation protocols (Donald et al., 2022). These approaches are undeniably effective at recovering rare strains, yet they also raise an uncomfortable possibility: in aggressively eliminating microbial “weeds,” researchers may inadvertently discard fragile or slow-growing organisms with equally valuable biosynthetic potential.

At the same time, marine environments have emerged as perhaps the most exciting frontier in Streptomyces research. For many years, marine-derived actinomycetes were assumed to represent little more than terrestrial contaminants transported into oceanic systems. That assumption has now largely collapsed. Marine Streptomyces are increasingly recognized as ecologically distinct microorganisms adapted to saline, high-pressure, and nutrient-variable environments. Many isolates exhibit halophilic or halotolerant characteristics and require seawater-like salinity conditions to sustain metabolic activity (Donald et al., 2022). Roughly 71% of marine isolates are recovered from deep-sea sediments, habitats characterized by intense environmental pressures that may stimulate unique biosynthetic pathways and chemical architectures (Donald et al., 2022).

The growing emphasis on host-associated microbiomes has further transformed marine bioprospecting strategies. Marine invertebrates, including sponges, corals, and tunicates, are now viewed less as individual organisms and more as complex microbial ecosystems. Almeida et al. (2023) described marine sponges and octocorals as reservoirs of metabolically versatile bacterial communities capable of producing antimicrobial compounds active against human pathogens. Similarly, Zvi-Kedem et al. (2022) emphasized the ecological complexity of marine microbial mosaics, where environmental drivers and host interactions shape microbial diversity and functional potential.

Within these symbiotic systems, Streptomyces species frequently function as defensive partners, synthesizing secondary metabolites that protect the host from microbial competitors or predatory organisms. Compounds such as ilamycins and related peptide metabolites exemplify this ecological chemistry, illustrating how symbiosis may drive the evolution of structurally diverse natural products (Donald et al., 2022). Interestingly, host-associated strains often display broader or more unusual metabolite repertoires than free-living isolates. One possible explanation is that nutrient stability within host tissues allows bacteria to allocate more metabolic energy toward secondary metabolite biosynthesis rather than basic survival.

Despite these advances, perhaps the most transformative realization in recent years has been the discovery of “silent” or cryptic biosynthetic gene clusters (BGCs). Whole-genome sequencing has revealed that individual Streptomyces genomes may contain between 25 and 70 BGCs, yet only a small subset are typically expressed under laboratory conditions (Donald et al., 2022). This genomic silence fundamentally altered the philosophy of natural product discovery. Historically, researchers relied on culture-based screening followed by chemical isolation. Increasingly, however, the field has shifted toward sequence-guided discovery, where genomes are mined computationally before compounds are experimentally observed.

Bioinformatic platforms such as antiSMASH and PRISM have become central tools in this transition. These systems allow researchers to predict biosynthetic potential directly from genomic data by identifying conserved biosynthetic signatures and metabolic architectures (Mohimani et al., 2018). In many cases, genome mining reveals the theoretical existence of compounds that have never been chemically isolated, effectively exposing a vast reservoir of hidden microbial chemistry.

Metagenomic technologies have expanded this concept even further by bypassing cultivation altogether. Instead of isolating organisms individually, environmental DNA can now be sequenced directly from soil, marine sediments, or host-associated microbiomes, enabling recovery of BGCs from previously inaccessible microorganisms (Chen et al., 2019). This approach has significantly broadened the scope of natural product discovery, particularly in environments where cultivation remains technically difficult.

Nevertheless, identifying a biosynthetic gene cluster does not guarantee successful metabolite production. One of the major unresolved challenges involves activating silent pathways under laboratory conditions. Various approaches have been explored, including promoter engineering, heterologous expression, and the use of chemical elicitors that stimulate dormant metabolic networks (Montiel et al., 2015; Moore et al., 2012). Yet activation remains highly unpredictable. Researchers can often identify promising biosynthetic machinery within a genome but still struggle to induce functional expression of the corresponding metabolite.

In many ways, modern Streptomyces research now exists at the intersection of ecology, genomics, chemistry, and bioinformatics. The field has moved beyond simple microbial isolation toward a far more integrated understanding of microbial functionality and environmental adaptation. Terrestrial and marine ecosystems each offer distinct evolutionary pressures that shape secondary metabolism in different ways, and both continue to reveal unexpected chemical diversity.

Still, the bioprospecting paradox persists. The more researchers sequence, mine, and explore microbial genomes, the clearer it becomes that biosynthetic potential vastly exceeds our current capacity to interpret or exploit it fully. We are, perhaps, no longer limited by the absence of microbial diversity but by our incomplete understanding of how to awaken and harness the silent chemistry hidden within it.

4. Selective Microbial Enrichment, Biosynthetic Discovery, and Computational Genome Mining in Host-Dominated Metagenomic Systems

4.1 Ecological Selectivity and the Expanding Landscape of Microbial Recovery

The synthesis of evidence across the reviewed studies revealed that microbial recovery in both environmental microbiology and clinical metagenomics is governed less by simple detection capability and more by the ability to selectively enrich biologically meaningful signals from overwhelmingly complex backgrounds. Interestingly, this challenge appeared remarkably consistent across vastly different ecosystems. Whether researchers were isolating rare Streptomyces strains from hyper-arid soils or attempting to identify low-abundance pathogens in host-dominated clinical samples, the same underlying obstacle repeatedly emerged: valuable microbial signatures were often concealed beneath dominant non-target genetic material.

The cultivation strategies summarized in Table 1 demonstrated how environmental microbiologists increasingly rely on ecological filtering to recover rare actinomycetes from highly competitive microbial communities. Pretreatment conditions involving salinity adjustment, selective antibiotics, pH manipulation, and oxidative stress appeared particularly effective for enriching resilient Streptomyces species adapted to extreme ecological niches. As shown in Table 1, isolates such as S. boncukensis, S. arcticus, and S. canalis were recovered using combinations of cycloheximide, potassium dichromate, or elevated NaCl concentrations, suggesting that environmental stress itself may function as a selective gateway for discovering metabolically specialized microorganisms (Donald et al., 2022). In many respects, these approaches resemble a form of controlled ecological pressure rather than straightforward laboratory cultivation.

The findings further suggested that selective enrichment strategies may unintentionally shape the diversity of organisms ultimately recovered. Dry heat treatment, antifungal suppression, and nutrient limitation effectively reduced fast-growing competitors, yet there remains some uncertainty regarding how many biologically valuable organisms are simultaneously excluded through these aggressive filtering processes. Stewart (2012) and Vartoukian et al. (2010) previously emphasized that most environmental microorganisms remain unculturable under standard laboratory conditions, and the present synthesis reinforces that concern. Selective isolation undoubtedly improves recovery efficiency for resistant taxa, but it may also narrow the observable microbial landscape in ways that remain poorly understood.

The comparative data presented in Table 1 therefore highlight an important conceptual parallel with clinical host DNA depletion workflows. Differential lysis and nuclease digestion strategies used in metagenomic sequencing operate according to nearly identical principles: fragile host cells are selectively disrupted while structurally resilient microbial cells remain intact long enough for downstream sequencing. Although developed independently, both environmental cultivation and clinical depletion strategies appear fundamentally rooted in the same biological logic—preferential survival through selective pressure.

4.2 Secondary Metabolite Discovery and the Hidden Biosynthetic Reservoir

The studies summarized in Table 2 collectively revealed that environmental and host-associated microorganisms continue to represent extraordinarily rich reservoirs of unexplored biochemical diversity. Yet the findings also suggested that conventional discovery pipelines may capture only a relatively small fraction of total biosynthetic potential. Many of the compounds identified through metagenomic and sequence-guided approaches originated from organisms that were previously inaccessible through standard cultivation methods.

Several metabolites listed in Table 2 illustrated the remarkable pharmacological diversity associated with environmental microbiomes. Teixobactin, isolated from Eleftheria terrae using iChip cultivation technology, demonstrated potent activity against Staphylococcus aureus without detectable resistance development during initial studies (Mohimani et al., 2018). Similarly, malacidin A and lassomycin highlighted the growing importance of metagenomics for recovering antimicrobial compounds from uncultured bacteria. These findings reinforce the broader realization that microbial “dark matter” may contain substantial untapped therapeutic potential.

Host-associated microbiomes appeared particularly significant in this regard. The tunicate-associated Streptomyces strains summarized in Table 2 produced metabolites such as platensimycin B with activity against methicillin-resistant pathogens including MRSA and Enterococcus faecium (Utermann et al., 2020). Likewise, marine sponge-associated bacteria described by Almeida et al. (2023) exhibited broad biosynthetic versatility and antimicrobial potential. These observations suggest that host-associated ecosystems may promote unusually specialized metabolic adaptation driven by intense microbial competition and ecological interdependence.

At the same time, the findings revealed a growing methodological transition from phenotype-dependent discovery toward sequence-guided identification strategies. Traditional culture-based screening remains valuable, yet repeated rediscovery of previously characterized metabolites continues to limit its efficiency (Genilloud et al., 2011). In contrast, metagenomic sequencing and biosynthetic gene cluster prediction increasingly allow researchers to identify theoretically valuable pathways before metabolite expression is experimentally confirmed.

This transition became especially evident in compounds such as landepoxin A, where computational sequence analysis identified biosynthetic signatures prior to functional validation (Mohimani et al., 2018). Such findings suggest that microbial genomes may encode extensive “silent” chemistry that remains inaccessible under standard laboratory conditions but detectable through genomic mining. Consequently, the distinction between discovery and prediction is becoming progressively blurred within modern metagenomic research.

The results summarized in Table 2 therefore indicate that microbial secondary metabolism extends far beyond what can be observed experimentally through conventional cultivation alone. Increasingly, metagenomics appears to function not merely as a sequencing tool but as a predictive framework for identifying hidden biochemical potential embedded within complex microbial communities.

4.3 Expansion of Bioinformatic Genome Mining and Computational Dereplication

The reviewed evidence further demonstrated that bioinformatics has evolved from a supportive analytical tool into one of the central driving forces behind contemporary metagenomic interpretation. As sequencing

Table 2. Natural Product Discovery and Biological Activities from Environmental and Host-Associated Microorganisms. This table presents representative secondary metabolites identified through culture-dependent, metagenomic, and sequence-guided discovery pipelines across terrestrial, marine, and host-associated microbial systems. The data emphasize the diversity of bioactive compounds produced by environmental microorganisms and demonstrate how approaches such as iChip cultivation, metagenomics, GC-MS, and UPLC-MS/MS have expanded access to previously hidden biosynthetic potential.

Compound Name	Chemical Class	Producing Organism	Habitat	Targeted Pathogen	Bioactivity	Detection Method	Reference
Chaxalactin A	Macrolactone PK	Streptomyces C34	Desert soil	—	Medium-dependent activity	Culture-based screening	Mohimani et al. (2018)
Teixobactin	Antibiotic	Eleftheria terrae	Soil	S. aureus	No detectable resistance	iChip	Mohimani et al. (2018)
Lassomycin	Cyclic peptide	Lentzea sp.	Soil	M. tuberculosis	ATP-dependent activity	iChip	Mohimani et al. (2018)
Malacidin A	Lipopeptide	Uncultured bacterium	Soil	Gram-positive bacteria	Antibacterial activity	Metagenomics	Mohimani et al. (2018)
Landepoxin A	Epoxyketone	Metagenomic clone	Soil	—	Proteasome inhibition	KS-tag sequencing	Mohimani et al. (2018)
Arimetamycin A	Anthracycline	Streptomyces albus	Soil	—	Cytotoxic activity	Sequence-guided screening	Mohimani et al. (2018)
TP-1161	Thiopeptide	Nocardiopsis sp.	Marine environment	Gram-positive bacteria	Vancomycin-like activity	OSMAC	Mohimani et al. (2018)
2,3-Butane diol	Diol	SP7, SP9 isolates	Saltpan	—	Antifreeze activity	GC-MS	Ramganesh et al. (2017)
Borinic acid	Organic acid	SP8 isolate	Saltpan	Acne-associated bacteria	Dermatological potential	GC-MS	Ramganesh et al. (2017)
Platensimycin B	Diterpenoid	Streptomyces sp.	Tunicate gut	MRSA, E. faecium	Antibacterial activity	UPLC-MS/MS	Utermann et al. (2020)

Table 3. Bioinformatic Tools and Pipelines for Metagenomic Mining and Biosynthetic Gene Cluster Identification. This table summarizes widely used computational platforms designed for metagenomic dereplication, biosynthetic gene cluster (BGC) prediction, peptide identification, and phylogenetic classification. These tools collectively support sequence-guided discovery by enabling rapid annotation of NRPS, PKS, RiPP, and cryptic biosynthetic pathways from increasingly large and complex metagenomic datasets.

Tool Name	Input Data	Output Type	BGC Class	Unique Feature	Accessibility	Major Advantage	Reference
antiSMASH 4.0	Genome sequence	Prediction/BLAST output	NRPS, PKS	Cluster boundary identification	Web server	Comprehensive annotation	Mohimani et al. (2018)
PRISM 3	Genome sequence	Chemical structure prediction	NRPS, PKS	Tailoring reaction prediction	Open-source	Structural identification	Mohimani et al. (2018)
SeMPI	Genome sequence	Top 10 matches	Type I PKS	Non-modified PKS analysis	Web application	Best-match comparison	Mohimani et al. (2018)
Pep2Path	Genome sequence	Mass-shift analysis	NRPS, RiPPs	Tag-BGC alignment	Open-source	Automated peptide linking	Mohimani et al. (2018)
RiPPquest	Genome sequence	Peptide identification	RiPPs	MS/MS alignment	Open-source	Peptidic compound detection	Mohimani et al. (2018)
NRPquest	Genome sequence	Peptide identification	NRPS	Mass spectrometry coupling	Open-source	NRPS characterization	Mohimani et al. (2018)
NaPDos	Sequence tags	Phylogenetic classification	NRPS, PKS	Domain-based search	Web tool	Phylogenetic analysis	Mohimani et al. (2018)
eSNAPD	Sequence tags	NPST library generation	—	Diversity survey platform	Tool set	Large dataset processing	Mohimani et al. (2018)
ClustScan	Metagenomic data	Annotation output	Modular BGCs	Homology searching	Software package	Semi-automated analysis	Chen et al. (2019)
ClusterFinder	Metagenomic data	Novel cluster prediction	Unspecified	Hidden BGC detection	Algorithm-based	Novelty-focused discovery	Chen et al. (2019)

Table 4. Host-Associated Microbiome Diversity and Metagenomic Functional Potential This table highlights the microbial diversity, biosynthetic potential, and ecological specialization observed within host-associated microbiomes and environmentally complex microbial communities. The compiled studies demonstrate how marine invertebrates, estuarine sediments, geothermal systems, and insect-associated microbiota function as reservoirs of biosynthetic gene clusters linked to antimicrobial, anticancer, cytotoxic, and ecosystem-regulating activities.

Sample Source	Location	Dominant Microbes	Genera Count	Sequenced Strains/Features	Key SM-BGCs	Functional Potential	Reference
Ciona intestinalis	Baltic/North Sea	Gammaproteobacteria	33	101 isolates	Peptides, PKs	Anticancer and antimicrobial activity	Utermann et al. (2020)
Marine sponges	Taiwan	Proteobacteria	55	70 genomes	RiPPs, NRPS, PKS	Anti-Candida activity	Almeida et al. (2023)
T. swinhoei	Marine environment	Uncultured symbionts	—	Entotheonella spp.	Polytheonamides	Cytotoxic peptide production	Mohimani et al. (2018)
Didemnum molle	Papua New Guinea	Cyanobacteria	—	Prochloron spp.	Divamides	Anti-HIV activity	Mohimani et al. (2018)
Estuary sediment	Pearl River	Archaea, Bacteria	—	ASVs quantified	Chitin turnover pathways	Environmental adaptation	Zvi-Kedem et al. (2022)
Geothermal waters	Croatia	Eukaryota	—	V9 rRNA regions	Protist diversity	Ecosystem stability
Soil microbiomes	Global (five continents)	NRPS, PKS producers	—	Diverse microbiota	PKs and NRPs	Antimicrobial resistance ecology	Chen et al. (2019)
Saltpan water	South Africa	Bacteroidetes	—	SP1–SP22 isolates	—	Hydrocarbon degradation	Ramganesh et al. (2017)
Camponotus ants	China	Streptomyces spp.	—	Ant-associated strains	NRPS, RiPPs	Symbiotic defense	Donald et al. (2022)
Marine invertebrates	Orkney, Scotland	Gammaproteobacteria	23	77 extracts	Metabolites <1000 Da	Novel metabolomic diversity	Macintyre et al. (2014)

capacity increased dramatically, the primary bottleneck shifted away from data generation and toward the identification of biologically meaningful information hidden within massive genomic datasets.

The computational platforms summarized in Table 3 collectively illustrated how modern genome-mining tools now enable rapid annotation of biosynthetic gene clusters (BGCs), peptide pathways, and cryptic metabolic signatures. antiSMASH and PRISM emerged as particularly influential platforms because of their ability to identify conserved biosynthetic architectures directly from genome sequences (Mohimani et al., 2018). These tools substantially reduced analytical redundancy by rapidly distinguishing previously characterized pathways from potentially novel biosynthetic systems.

Interestingly, different computational frameworks appeared to prioritize different forms of discovery. antiSMASH emphasized broad-spectrum annotation and comparative genomic mapping, whereas ClusterFinder focused more aggressively on detecting atypical or low-similarity pathways that might otherwise remain overlooked (Chen et al., 2019). While this increased the possibility of false-positive predictions, it also improved the likelihood of identifying genuinely novel biosynthetic chemistry.

The importance of dereplication became especially apparent in host-associated microbiome studies. Utermann et al. (2020) reported annotation efficiencies approaching 71% in tunicate-associated microbial systems, substantially accelerating prioritization of candidate pathways for downstream characterization. Similarly, NaPDos and eSNAPD provided phylogenetic and diversity-based frameworks capable of processing increasingly large environmental sequence datasets.

These findings suggest that bioinformatics is no longer functioning solely as a post-sequencing cleanup mechanism. Rather, computational dereplication now actively shapes how microbial diversity is interpreted, prioritized, and biologically contextualized. The ability to computationally suppress redundant information while enriching low-frequency genomic signals increasingly mirrors the conceptual goals of physical host DNA depletion itself.

4.4 Host-Associated Microbial Ecosystems and Functional Specialization

One of the more striking findings emerging from the reviewed literature involved the ecological complexity of host-associated microbial communities. Marine invertebrates, tunicates, and sponge-associated microbiomes consistently demonstrated unusually high levels of microbial specialization and biosynthetic diversity. These systems appear less like simple microbial colonization events and more like highly integrated ecological networks shaped by long-term coevolutionary pressures.

The microbial ecosystems summarized in Table 4 revealed substantial functional specialization across both environmental and host-associated microbiomes. Sponge-associated bacteria exhibited extensive NRPS, PKS, and RiPP biosynthetic capabilities, while tunicate-associated communities produced metabolites linked to antimicrobial and anticancer activities (Almeida et al., 2023; Utermann et al., 2020). In many cases, microbial symbionts appeared to function as biochemical defense partners that directly contribute to host survival.

Zvi-Kedem et al. (2022) further demonstrated that estuarine microbial systems display pronounced ecological structuring driven by environmental gradients and host interactions. Similarly, geothermal water microbiomes revealed unexpectedly diverse eukaryotic microbial populations capable of maintaining ecological stability under extreme physicochemical conditions.

These findings collectively reinforce the idea that host-derived material should perhaps not be viewed exclusively as undesirable analytical interference. Instead, host-associated environments appear to actively shape microbial adaptation, biosynthetic specialization, and metabolic expression. This perspective may carry important implications for clinical metagenomics, where host DNA depletion strategies have traditionally focused primarily on subtraction rather than ecological interpretation.

5. Reinterpreting Host DNA Depletion Through Ecological Filtering, Predictive Metagenomics, and Host–Microbe Interaction Dynamics

5.1 Host DNA Depletion as an Ecological Rather Than Purely Technical Problem

The findings synthesized throughout this review suggest that host DNA depletion in clinical metagenomic sequencing may be fundamentally misunderstood when framed solely as a laboratory preprocessing challenge. Increasingly, the evidence points toward a broader ecological problem involving selective visibility, microbial competition, and genomic dominance. The parallels between environmental microbiology and clinical metagenomics are difficult to ignore. In both contexts, researchers are attempting to recover biologically meaningful signals from extraordinarily complex mixtures where dominant background material obscures low-abundance organisms.

The “Great Plate Count Anomaly” described by Mohr (2018) and Stewart (2012) provides an unexpectedly useful conceptual framework for interpreting this problem. Environmental microbiologists have long recognized that only a very small fraction of microbial diversity becomes experimentally visible under conventional laboratory conditions. Clinical metagenomics appears to face a remarkably similar limitation, although the obstruction now originates from host-derived nucleic acids rather than cultivation bias alone.

The evidence presented across Tables 1–4 reinforces the idea that microbial invisibility is not simply a matter of insufficient sequencing depth. Rather, it reflects ecological imbalance. In many clinical samples, human DNA constitutes such an overwhelming proportion of extracted nucleic acids that diagnostically relevant microbial reads become statistically marginalized before sequencing analysis even begins. Consequently, the problem may be less about generating larger datasets and more about selectively reshaping the biological composition of those datasets before interpretation occurs.

5.2 Selective Enrichment and the Risks of Overcorrection

One of the more important observations emerging from this review involves the dual nature of selective enrichment. Environmental microbiology has repeatedly demonstrated that selective pressure can dramatically improve recovery of rare microbial taxa. The isolation protocols summarized in Table 1—including heat pretreatment, salinity modulation, oxidative stress, and antifungal suppression—successfully enriched resilient Streptomyces populations from highly competitive ecosystems (Donald et al., 2022). Clinical host depletion strategies appear to function according to essentially the same biological principle.

Yet selective enrichment also introduces unavoidable trade-offs. Aggressive depletion methods capable of removing large quantities of host DNA may simultaneously damage fragile microbial cells or degrade low-abundance viral genomes. This concern becomes particularly significant in polymicrobial infections where bacterial, fungal, and viral nucleic acids coexist within the same specimen. The biological “sieve” designed to remove unwanted material may inadvertently eliminate diagnostically valuable organisms as well.

There is perhaps a tendency within metagenomic optimization studies to define success primarily through depletion efficiency percentages. However, the reviewed evidence suggests that this metric alone may be misleading. A workflow capable of removing 99% of host DNA may still perform poorly if it simultaneously destroys rare pathogens or destabilizes microbial nucleic acid integrity. Future optimization strategies may therefore require a more balanced framework that evaluates both host reduction and microbial preservation simultaneously.

5.3 The Shift Toward Predictive Metagenomics

Another major theme emerging from the reviewed literature involves the growing transition from observational microbiology toward predictive metagenomics. Historically, microbial discovery relied heavily on direct phenotype observation: antimicrobial activity, colony morphology, pigmentation, or measurable metabolite production. Increasingly, however, modern metagenomics identifies microbial potential before functional activity becomes experimentally visible.

The sequence-guided discovery approaches summarized in Tables 2 and 3 illustrate this shift clearly. Biosynthetic gene cluster prediction platforms such as antiSMASH, PRISM, and ClusterFinder now allow researchers to computationally infer the existence of complex metabolic pathways directly from genomic information (Chen et al., 2019; Mohimani et al., 2018). In many cases, compounds may remain experimentally undetectable while their biosynthetic signatures are already computationally recognizable.

This transition carries substantial implications for clinical diagnostics. Metagenomic sequencing may eventually evolve beyond simple pathogen identification toward predictive characterization of virulence potential, resistance architecture, metabolic activity, and ecological adaptation. The presence of silent biosynthetic pathways or dormant microbial populations may still hold clinical relevance even when active replication is absent.

At the same time, predictive sequencing introduces considerable interpretative ambiguity. Detection of microbial DNA does not necessarily establish active infection or pathogenic causation. Residual DNA fragments, commensal colonizers, dormant organisms, and environmental contaminants may all contribute detectable genomic signals. Thus, while sequencing technologies continue to improve sensitivity, biological interpretation remains considerably more complicated.

5.4 Bioinformatics as a Second-Stage Depletion Strategy

One particularly interesting conceptual development emerging from this review is the realization that bioinformatics increasingly functions as a second-stage host depletion mechanism. Traditional depletion methods physically remove host-derived material before sequencing. Computational dereplication, by contrast, selectively suppresses irrelevant genomic information after sequencing has occurred.

The platforms summarized in Table 3 demonstrate how advanced computational filtering now shapes virtually every stage of metagenomic interpretation. antiSMASH, PRISM, NaPDos, and related pipelines not only reduce analytical redundancy but also actively determine which microbial signatures become biologically visible during downstream analysis (Mohimani et al., 2018). In this sense, computational pipelines are no longer passive analytical tools; they are becoming ecological filters in their own right.

Still, these approaches remain constrained by reference database completeness. Organisms lacking adequate genomic representation may remain effectively invisible despite successful sequencing. Consequently, bioinformatic accuracy remains inherently dependent on the quality and diversity of existing microbial databases.

5.5 Host–Microbe Interactions and the Reconsideration of “Background.”

Perhaps the most conceptually important implication of this review is the growing realization that host-derived material may not simply represent undesirable contamination. The host environment itself appears to function as a dynamic ecological system capable of shaping microbial adaptation, metabolic specialization, and virulence expression.

The host-associated microbiomes summarized in Table 4 strongly support this interpretation. Marine sponges, tunicates, and coral-associated microbial communities demonstrated highly specialized biosynthetic behavior driven by long-term ecological interactions (Almeida et al., 2023; Macintyre et al., 2014). Similar ecological pressures may operate within human-associated microbiomes, where host immune responses, tissue architecture, and nutrient availability continuously influence microbial behavior.

This perspective suggests that complete removal of host-associated information may not always be biologically desirable. Some host-derived signals could provide important contextual information regarding microbial stress responses, metabolic activation, or infection dynamics. Future host depletion strategies may therefore need to move beyond purely subtractive models toward more ecologically informed frameworks capable of preserving biologically meaningful host–microbe interactions.

Overall, the findings of this review suggest that the future of clinical metagenomics will depend less on indiscriminate sequencing expansion and more on increasingly sophisticated strategies for selective enrichment, ecological interpretation, and computational prioritization. The challenge is no longer simply detecting microbial DNA. It is learning how to recognize which microbial signals truly matter within overwhelmingly complex biological systems.

6. Limitations

This review has several limitations that should be considered when interpreting its conclusions. First, the study was designed as a narrative review rather than a formal systematic review or meta-analysis, meaning that quantitative effect estimation and statistical synthesis were not performed. Consequently, the interpretation of findings inevitably carries some degree of subjectivity, particularly when integrating evidence from highly diverse disciplines such as environmental microbiology, bioinformatics, clinical sequencing, and natural-product discovery. Second, many host DNA depletion strategies remain experimentally heterogeneous, with considerable variability in sample types, sequencing platforms, depletion efficiencies, and computational pipelines across published studies. This variability makes direct methodological comparisons difficult. In addition, several emerging technologies discussed in this review, including nanomaterial-assisted depletion systems and advanced microfluidic platforms, remain largely experimental and have not yet undergone broad clinical validation. Finally, because metagenomic databases remain incomplete, interpretations regarding silent biosynthetic gene clusters, rare taxa, or low-abundance pathogens may evolve substantially as genomic reference libraries expand over time.

7. Conclusion

Host DNA depletion has emerged as one of the defining technical and conceptual challenges in clinical metagenomic sequencing. While significant progress has been achieved through selective enrichment, computational dereplication, and sequence-guided discovery, the recovery of meaningful microbial signals remains constrained by biological complexity and interpretative uncertainty. This review highlights how principles derived from environmental microbiology and microbial ecology may help reshape future metagenomic workflows. Increasingly, successful clinical metagenomics appears likely to depend on integrated, multi-layered strategies that preserve microbial diversity while minimizing host interference. Ultimately, the field may advance not simply by sequencing more deeply, but by learning how to recover, interpret, and biologically contextualize the faint microbial signatures hidden within overwhelmingly host-dominated genomic landscapes.

References

Almeida, J. F., Marques, M., Oliveira, V., Egas, C., Mil-Homens, D., Viana, R., Cleary, D. F. R., Huang, Y. M., Fialho, A. M., Teixeira, M. C., Gomes, N. C. M., Costa, R., & Keller-Costa, T. (2023). Marine sponge- and octocoral-associated bacteria exhibit versatile potential for secondary metabolite biosynthesis and antimicrobial activity against human pathogens. Marine Drugs, 21(1), 34. https://doi.org/10.3390/md21010034

Aminov, R. I. (2010). A brief history of the antibiotic era: Lessons learned and challenges for the future. Frontiers in Microbiology, 1, 134. https://doi.org/10.3389/fmicb.2010.00134

Bentley, S. D., Chater, K. F., Cerdeño-Tárraga, A.-M., Challis, G. L., Thomson, N. R., James, K. D., Harris, D. E., Quail, M. A., Kieser, H., Harper, D., et al. (2002). Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature, 417(6885), 141–147. https://doi.org/10.1038/417141a

Chen, R., Wong, H. L., & Burns, B. P. (2019). New approaches to detect biosynthetic gene clusters in the environment. Medicines, 6(1), 32. https://doi.org/10.3390/medicines6010032

Donald, L., Pipite, A., Subramani, R., Owen, J., Keyzers, R. A., & Taufa, T. (2022). Streptomyces: Still the biggest producer of new natural secondary metabolites, a current perspective. Microbiological Research, 13(3), 418–465. https://doi.org/10.3390/microbiolres13030031

Edgcomb, V. P., & Bernhard, J. M. (2013). Heterotrophic protists in hypersaline microbial mats and deep hypersaline basin water columns. Life, 3(2), 346–362. https://doi.org/10.3390/life3020346

Genilloud, O., Gonzalez, I., Salazar, O., Martin, J., Tormo, J. R., & Vicente, F. (2011). Current approaches to exploit actinomycetes as a source of novel natural products. Journal of Industrial Microbiology & Biotechnology, 38(3), 375–389. https://doi.org/10.1007/s10295-010-0882-7

Guo, X., Chang, J., & Lin, K. (2020). Topographical properties of scaffolds for MSC differentiation. Nanomaterials, 10(9), 2070. https://doi.org/10.3390/nano10092070

Hug, J. J., Bader, C. D., Remškar, M., Cirnski, K., & Müller, R. (2018). Concepts and methods to access novel antibiotics from actinomycetes. Antibiotics, 7(2), 44. https://doi.org/10.3390/antibiotics7020044

Macintyre, L., Zhang, T., Viegelmann, C., Martinez, I. J., Dowdells, C., Cheng, C., Ramadan, U., Gernert, C., Hentschel, U., & Edrada-Ebel, R. (2014). Metabolomic tools for secondary metabolite discovery from marine microbial symbionts. Marine Drugs, 12(6), 3416–3448. https://doi.org/10.3390/md12063416

Mohimani, H., Gurevich, A., Mikheenko, A., Garg, N., Nothias, L.-F., Ninomiya, A., Takada, K., Dorrestein, P. C., & Pevzner, P. A. (2018). Metagenomics for natural product discovery. Antibiotics, 7(2), 44. https://doi.org/10.3390/antibiotics7020044

Mohr, K. I. (2018). Diversity of myxobacteria—We only see the tip of the iceberg. Microorganisms, 6(3), 84. https://doi.org/10.3390/microorganisms6030084

Montiel, D., Kang, H.-S., Chang, F.-Y., Charlop-Powers, Z., & Brady, S. F. (2015). Yeast homologous recombination-based promoter engineering for the activation of silent natural product biosynthetic gene clusters. Proceedings of the National Academy of Sciences of the United States of America, 112(29), 8953–8958. https://doi.org/10.1073/pnas.1507600112

Moore, J. M., Bradshaw, E., Seipke, R. F., Hutchings, M. I., & McArthur, M. (2012). Use and discovery of chemical elicitors that stimulate biosynthetic gene clusters in Streptomyces bacteria. Methods in Enzymology, 517, 367–385. https://doi.org/10.1016/B978-0-12-404634-4.00018-8

Okoro, C. K., Brown, R., Jones, A. L., Andrews, B. A., Asenjo, J. A., Goodfellow, M., & Bull, A. T. (2009). Diversity of culturable actinomycetes in hyper-arid soils of the Atacama Desert, Chile. Antonie van Leeuwenhoek, 95(2), 121–133. https://doi.org/10.1007/s10482-008-9295-2

Ramganesh, S., Maredza, A., & Tekere, M. (2017). Bacterial secondary metabolites in salt pans. Molecules, 22(4), 657. https://doi.org/10.3390/molecules22040657

Stewart, E. J. (2012). Growing unculturable bacteria. Journal of Bacteriology, 194(16), 4151–4160. https://doi.org/10.1128/JB.00345-12

Subramani, R., & Aalbersberg, W. (2013). Culturable rare actinomycetes: Diversity, isolation and marine natural product discovery. Applied Microbiology and Biotechnology, 97(21), 9291–9321. https://doi.org/10.1007/s00253-013-5229-7

Utermann, C., Echelmeyer, V. A., Oppong-Danquah, E., Blümel, M., & Tasdemir, D. (2020). Gut microbiota of Ciona intestinalis: Biodiversity, bioactive compounds, and secondary metabolite potential. Marine Drugs, 19(1), 6. https://doi.org/10.3390/md19010006

Vartoukian, S. R., Palmer, R. M., & Wade, W. G. (2010). Strategies for culture of “unculturable” bacteria. FEMS Microbiology Letters, 309(1), 1–7. https://doi.org/10.1111/j.1574-6968.2010.02000.x

Zvi-Kedem, T., Lalzar, M., Sun, J., Li, J., Tchernov, D., & Meron, D. (2022). Exploring the microbial mosaic: Insights into composition, diversity, and environmental drivers in the Pearl River Estuary sediments. Microorganisms, 12(7), 1273. https://doi.org/10.3390/microorganisms12071273

Microbial Bioactives

Article Contents

Host DNA Depletion in Clinical Metagenomic Sequencing: Emerging Strategies, Ecological Insights, and the Hidden Challenge of Microbial Signal Recovery

Abstract

1. Introduction

2. Materials and Methods

3. Marine and Terrestrial Streptomyces Research

4. Selective Microbial Enrichment, Biosynthetic Discovery, and Computational Genome Mining in Host-Dominated Metagenomic Systems

5. Reinterpreting Host DNA Depletion Through Ecological Filtering, Predictive Metagenomics, and Host–Microbe Interaction Dynamics

6. Limitations

7. Conclusion

References

Recommended articles

Stay connected