cover of episode PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science

PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science

2023/3/20
logo of podcast PaperPlayer biorxiv bioinformatics

PaperPlayer biorxiv bioinformatics

Shownotes Transcript

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.03.17.533059v1?rss=1

Authors: Cho, M. H., Cho, K.-H., No, K. T.

Abstract: The nature of taxonomic science and the scientific nomenclature system makes it difficult to use scientific names as identifiers without running into complications. To facilitate high-throughput analysis of biological data involving scientific names, we designed PhyloSophos, a Python package that takes into account the properties of scientific names and taxonomic systems to map name inputs to the entries within the reference database of choice. We would like to present three case-studies which demonstrates how our implementations, including rule-based pre-processing and recursive mapping could improve mapping performance and information availability. We expect PhyloSophos to help with the systematic processing of poorly digitized and curated biological data, such as biodiversity information and ethnopharmacological resources, thus enabling full-scale bioinformatics analysis using these data.

Copy rights belong to original authors. Visit the link for more info

Podcast created by Paper Player, LLC