Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology

Cock, Peter J.A. and Grüning, Björn A. and Paszkiewicz, Konrad and Pritchard, Leighton (2013) Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology. PeerJ. e167. ISSN 2167-8359 (https://doi.org/10.7717/peerj.167)

[thumbnail of Cock-etal-PeerJ-2013-Galaxy-tools-and-workflows-for-sequence-analysis-with-applications]
Preview
Text. Filename: Cock_etal_PeerJ_2013_Galaxy_tools_and_workflows_for_sequence_analysis_with_applications.pdf
Final Published Version
License: Creative Commons Attribution 3.0 logo

Download (4MB)| Preview

Abstract

The Galaxy Project offers the popular web browser-based platform Galaxy for running bioinformatics tools and constructing simple workflows. Here, we present a broad collection of additional Galaxy tools for large scale analysis of gene and protein sequences. The motivating research theme is the identification of specific genes of interest in a range of non-model organisms, and our central example is the identification and prediction of "effector" proteins produced by plant pathogens in order to manipulate their host plant. This functional annotation of a pathogen's predicted capacity for virulence is a key step in translating sequence data into potential applications in plant pathology. This collection includes novel tools, and widely-used third-party tools such as NCBI BLASTC wrapped for use within Galaxy. Individual bioinformatics software tools are typically available separately as standalone packages, or in online browserbased form. The Galaxy framework enables the user to combine these and other tools to automate organism scale analyses as workflows, without demanding familiarity with command line tools and scripting.Workflows created using Galaxy can be saved and are reusable, so may be distributed within and between research groups, facilitating the construction of a set of standardised, reusable bioinformatic protocols. The Galaxy tools and workflows described in this manuscript are open source and freely available from the Galaxy Tool Shed (http://usegalaxy.org/toolshed or http://toolshed.g2.bx.psu.edu).