Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing

Kevin M. Mendez; Leighton Pritchard; Stacey N. Reinke; David I. Broadhurst

doi:10.1007/s11306-019-1588-0

Source

Metabolomics, 2019, volume 15, issue 10, pages 1-16

Abstract

Background

A lack of transparency and reporting standards in the scientific community has led to increasing and widespread concerns about the reproducibility and integrity of results. As an omics science, which generates vast amounts of data and relies heavily on data science for deriving biological meaning, metabolomics is highly vulnerable to irreproducibility. The metabolomics community has made substantial efforts to align with FAIR data standards by promoting open data formats, data repositories, online spectral libraries, and metabolite databases. Open data analysis platforms also exist; however, they tend to be inflexible and rely on the user to adequately report their methods and results. To enable FAIR data science in metabolomics, methods and results need to be transparently disseminated in a manner that is rapid, reusable, and fully integrated with the published work. To ensure broad use within the community, such a framework also needs to be inclusive and intuitive for computational novices and experts alike.

Aim of Review

To encourage metabolomics researchers from all backgrounds to take control of their own data science, mould it to their personal requirements, and enthusiastically share resources through open science.

Key Scientific Concepts of Review

This tutorial introduces the concept of interactive web-based computational laboratory notebooks. The reader is guided through a set of experiential tutorials specifically targeted at metabolomics researchers, based around the Jupyter Notebook web application, GitHub data repository, and Binder cloud computing platform.

Identifiers

Journal ISSN: 1573-3882
Journal e-ISSN: 1573-3890
DOI: 10.1007/s11306-019-1588-0

Authors

Kevin M. Mendez

Edith Cowan University, Centre for Metabolomics & Computational Biology, School of Science, Joondalup, Australia

Leighton Pritchard

Strathclyde Institute of Pharmacy & Biomedical Sciences, University of Strathclyde, Glasgow, Scotland, UK

Stacey N. Reinke

Edith Cowan University, Centre for Metabolomics & Computational Biology, School of Science, Joondalup, Australia

David I. Broadhurst

Edith Cowan University, Centre for Metabolomics & Computational Biology, School of Science, Joondalup, Australia

Keywords

Open access; Reproducibility; Data science; Statistics; Cloud computing; Jupyter

Additional information

Publication languages: English

Data set: Springer

Publisher

Springer US
