ORCESTRA: a platform for orchestrating and sharing high-throughput pharmacogenomic analyses

bioRxiv (Cold Spring Harbor Laboratory) (2020)

Abstract
Reproducibility is essential to Open Science: findings that cannot be reproduced by independent research groups have limited relevance, regardless of their validity. It is therefore crucial for scientists to describe their experiments in sufficient detail that they can be reproduced, challenged, and built upon. However, owing to recent advances in the biological and computational sciences, it has become difficult to process, analyze, and share data with the community in a transparent manner. This has made reproducing research findings more challenging, with some researchers going as far as suggesting that the biomedical sciences are experiencing a reproducibility crisis. To overcome these issues, we created a cloud-based platform called ORCESTRA (www.orcestra.ca), which provides a flexible framework for the reproducible processing of multimodal biomedical data. The platform processes genomic and pharmacological profiles of cancer samples through automated, user-customizable pipelines executed with Pachyderm, a data versioning and orchestration tool. ORCESTRA creates an integrated and fully documented data object known as a PharmacoSet (PSet), with a persistent identifier (DOI), that can be used and shared for future analyses using the Bioconductor PharmacoGx package.
Keywords
high-throughput