Episode 9 End of DSL1 and AI pipelines

Updates to the Nextflow language and putting ML pipelines to good use..

  • News and Views
  • 11 January 2023


In this News and Views episode, Phil Ewels, Chris Hakkaart, and Marcel Ribeiro-Dantas discuss the hottest topics in the Nextflow world.

New “fair” process directive

  • New Nextflow directive added! See nextflow-io/nextflow@60d34cf - > This commit adds the process ‘fair’ directive. When fair is set to true the process outputs are guaranteed to me emitted in the same sequence as the inputs where received instead of the first-completed-first-output semantic that’s usually used by nextflow tasks

The end of DSL1 support!

  • DSL1 now not supported in the latest edge release - 22.12.0-edge
  • You can still run old pipelines with older versions of Nextflow
    • Use NXF_VER before commands, this is good practice for reproducibility anyway
    • eg. NXF_VER=22.10.4 nextflow run [...]
  • All documentation and training should now be about DSL2. Should be less confusing…
  • Never a better time to convert to DSL2!

ChatGPT

Stable diffusion ML pipeline

  • New pipeline for Stable diffusion: a deep learning, text-to-image model released in 2022.
  • It is primarily used to generate detailed images conditioned on text descriptions, though it can also be applied to other tasks such as inpainting, outpainting, and generating image-to-image translations guided by a text prompt.
  • Evan wrote a Nextflow pipeline (evanfloden/stable-diffusion-nf) that incorporates Stable diffusion and uses Tower to run on AWS.
  • Great example of using Tower and Nextflow to execute AI on the cloud.
  • Expanding this type of application in the future.

Whisper pipeline

  • OpenAI Whisper is a “neural net that approaches human level robustness and accuracy on English speech recognition”
  • Marcel wrote a Nextflow pipeline (mribeirodantas/nf-whisper) to use Whisper pre-trained models to generate transcriptions / translations of audio content.
  • Can fetch YouTube videos and generate transcriptions in a few minutes using GPU
  • Runs on Nextflow Tower and can use Wave containers

Upcoming events

  • There’s a nf-core/bytesize talk on January 17th by Sofia Stamouli about the nf-core/taxprofiler pipeline
    • A bioinformatics best-practice analysis pipeline for taxonomic profiling of shotgun metagenomic data. It allows for in-parallel profiling with multiple profiling tools against multiple databases, produces standardised output tables.

  • Please sign up for the March 2023 Nextflow / nf-core Training!
  • Please sign up for the March 2023 nf-core hackathon!
Episode coming soon..

news and views opensource community