cover of episode GSR-DB: a manually curated and optimised taxonomical database for 16S rRNA amplicon analysis

GSR-DB: a manually curated and optimised taxonomical database for 16S rRNA amplicon analysis

2023/4/21
logo of podcast PaperPlayer biorxiv bioinformatics

PaperPlayer biorxiv bioinformatics

Frequently requested episodes will be transcribed first

Shownotes Transcript

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.04.19.537515v1?rss=1

Authors: Molano, L.-A., Vega-Abellaneda, S., Manichanh, C.

Abstract: Amplicon-based 16S ribosomal RNA sequencing remains the most widely used method to profile microbial communities, as a low-cost and low-complexity approach. Reference databases are a mainstay for taxonomic assignments, which typically rely on popular databases such as SILVA, Greengenes, GTDB, or RDP. However, the inconsistency of the nomenclature across databases, and the presence of shortcomings in the annotation of these databases are limiting the resolution of the analysis. To overcome these limitations, we created the GSR database (Greengenes, SILVA, and RDP database), an integrated and manually curated database for bacterial and archaeal 16S amplicon taxonomy analysis. Unlike previous integration approaches, this database creation pipeline includes a taxonomy unification step to ensure consistency in taxonomical annotations. The database was validated with three mock communities and two real datasets and compared with existing 16S databases such as Greengenes, GTDB, ITGDB, SILVA, RDP, and MetaSquare. Results showed that the GSR database enhances taxonomical annotations of 16S sequences, outperforming current 16S databases at the species level. The GSR database is available for full-length 16S sequences and the most commonly used hypervariable regions: V4, V1-V3, V3-V4, and V3-V5.

Copy rights belong to original authors. Visit the link for more info

Podcast created by Paper Player, LLC