Using Nextflow to create scalable and reproducible pipelines at Genomics Medicine Ireland

Simone Coughlan

Bioinformatician, Genomics Medicine Ireland, Ireland

Genomics Medicine Ireland (GMI) is an Irish life sciences company performing large-scale studies in the Irish population using whole genome sequencing (WGS) and additional omics technologies. It works in collaboration with the healthcare system, patients, researchers and industry to advance understanding of the genetic basis of multiple diseases, to aid in the discovery of new diagnostics and therapeutics. In order to do this, our data analysis pipelines must be reproducible and easily scale to large numbers of samples. At GMI, we have created multiple Nextflow pipelines, which also employ the use of containerisation technologies and Conda environments to ensure full reproducibility. These include pipelines to perform analysis of genotyping arrays, concordance analysis of genotyping and sequencing data, copy number variation calling and downstream processing of files for a validated secondary analysis pipeline. In summary, the ability to rapidly prototype workflows using any tool, resume execution upon failure, scale easily and maintain a consistent environment across runs make Nextflow well suited to our work.

Deck

Bio

Dr Simone Coughlan is a bioinformatics scientist at Genomics Medicine Ireland, an Irish Life Sciences company undertaking large scale analysis of genomics and other omics data to help discover new diagnostics and therapeutics. She has a PhD in Bioinformatics from the National University of Ireland Galway and a background in parasite and bacterial genomics before turning to humans where she is now involved in research across multiple diseases. Writing robust, scalable pipelines is an essential part of the job and Nextflow allows her to do this with minimal fuss!

More information

The event program is available at this link. For registration and other information check it out this page.