genomeRxiv : a microbial whole-genome database for classification, identification, and data sharing
Pritchard, Leighton and Harrington, Bailey and Irber, Luiz and Mazloom, Reza and Pierce, Tessa and Sharma, Parul and Heath, Lenwood and Brown, C Titus and Vinatzer, Boris (2021) genomeRxiv : a microbial whole-genome database for classification, identification, and data sharing. In: Microbiology Society Annual Conference 2021, 2021-04-26 - 2021-04-30, Online. (https://doi.org/10.6084/m9.figshare.14376350.v1)
Preview |
Text.
Filename: Pritchard_etal_MSAC_2021_genomeRxiv_a_microbial_whole_genome_database_for_classification_identification_and_data_sharing.pdf
License: Download (4MB)| Preview |
Abstract
genomeRxiv is a newly-funded US-UK collaboration to provide a public, web-accessible database of public genome sequences, accurately catalogued and classified by whole-genome similarity independent of their taxonomic affiliation. Our goal is to supply the basic and applied research community with rapid, precise and accurate identification of unknown isolates based on genome sequence alone, and with molecular tools for environmental analysis. The DNA sequencing revolution enabled the use of cultured and uncultured microorganism genomes for fast and precise identification. However, precise identification is impossible without 1. reference databases that precisely circumscribe classes of microorganisms, and label these with their uniquely-shared characteristics 2. fast algorithms that can handle the volumes of genome data Our approach integrates the highly-resolved classification framework of Life Identification Numbers (LINs) with the speed and computational efficiency of sourmash and k-mer hashing algorithms, and the precision and filtering of average nucleotide identity (ANI). We aim to construct a single genome-based indexing scheme that extends from phylum to strain, enabling the unique and consistent placement of any sequenced prokaryote genome. genomeRxiv includes protocols for confidentiality, allowing groups to identify and announce the identities of newly-sequenced organisms without sharing genome data directly. This protects communities working with commercially- and ethically-sensitive organisms (e.g. production engineering strains, potential bioweapons, and to enable benefit sharing with indigenous communities). genomeRxiv will also provide online capability to design molecular diagnostic tools for metabarcoding and qPCR, to enable tracking of specific groupings of bacteria directly in the environment.
ORCID iDs
Pritchard, Leighton ORCID: https://orcid.org/0000-0002-8392-2822, Harrington, Bailey ORCID: https://orcid.org/0000-0003-3149-9345, Irber, Luiz, Mazloom, Reza, Pierce, Tessa, Sharma, Parul, Heath, Lenwood, Brown, C Titus and Vinatzer, Boris;-
-
Item type: Conference or Workshop Item(Poster) ID code: 76058 Dates: DateEvent6 April 2021PublishedSubjects: Science > Microbiology Department: Faculty of Science > Strathclyde Institute of Pharmacy and Biomedical Sciences Depositing user: Pure Administrator Date deposited: 08 Apr 2021 15:33 Last modified: 11 Nov 2024 17:03 URI: https://strathprints.strath.ac.uk/id/eprint/76058