cover of episode A comparison of bioinformatics pipelines for compositional analysis of the human gut microbiome

A comparison of bioinformatics pipelines for compositional analysis of the human gut microbiome

2023/2/14
logo of podcast PaperPlayer biorxiv bioinformatics

PaperPlayer biorxiv bioinformatics

Shownotes Transcript

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.02.13.528280v1?rss=1

Authors: Szopinska-Tokov, J., Bloemendaal, M., Boekhorst, J., Hermes, G. D., Ederveen, T. H., Vlaming, P., Buitelaar, J. K., Franke, B., Arias-Vasquez, A.

Abstract: Investigating the impact of gut microbiome on human health is a rapidly growing area of research. A significant limiting factor in the progress in this field is the lack of consistency between study results, which hampers the correct biological interpretation of findings. One of the reasons is variation of the applied bioinformatics analysis pipelines. This study aimed to compare five frequently used bioinformatics pipelines (NG-Tax 1.0, NG-Tax 2.0, QIIME, QIIME2 and mothur) for the analysis of 16S rRNA marker gene sequencing data and determine whether and how the analytical methods affect the downstream statistical analysis results. Based on publicly available case-control analysis of ADHD and two mock communities, we show that the choice of bioinformatic pipeline does not only impact the analysis of 16S rRNA gene sequencing data but consequently also the downstream association results. The differences were observed in obtained number of ASVs/OTUs (range: 1,958 - 20,140), number of unclassified ASVs/OTUs (range: 210 - 8,092) or number of genera (range: 176 - 343). Also, the case versus control comparison resulted in different results across the pipelines. Based on our results we recommend: i) QIIME1 and mothur when interested in rare and/or low-abundant taxa, ii) NG-Tax1 or NG-Tax2 when favouring stringent artefact filtering, iii) QIIME2 for a balance between two abovementioned points, and iv) to use at least two pipelines to assess robustness of the results. This work illustrates the strengths and limitations of frequently used microbial bioinformatics pipelines in the context of biological conclusions of case-control comparisons. With this, we hope to contribute to "best practice" approaches for microbiome analysis, promoting methodological consistency and replication of microbial findings.

Copy rights belong to original authors. Visit the link for more info

Podcast created by Paper Player, LLC