• Skip to main content
  • Skip to footer
CWL logo | Homepage link
  • Home
  • About
    • Vision and Mission
    • Timeline
    • Governance
    • Contributors
  • Features
  • Getting Started
  • Community
  • Ecosystem
    • Implementations
    • Development Tools
    • Repositories
    • Projects
  • Specification
  • User Gallery
  • Donate
  • CWL Matrix Space A bold lowercase "m" inside thin square brackets

CWL User Gallery

The CWL standards have become central to a diverse community of users across multiple fields of science and engineering. Below are listed some adopters of the CWL standards.

The 2021 and 2022 CWL Conferences showcased CWL users and developers. To see adopters talk about their use of CWL, go to CWLcon 2021 and CWLcon 2022 to watch the conference videos hosted on YouTube and ConfTube. Additionally, see the list of over 500 citations and mentions of CWL in published papers and reports.

To nominate a user or project for this page, please fill out this form.

Note: Listing does not imply endorsements of the CWL standards by the entity nor endorsement of the entity by the CWL project.

Shortcuts

  • Life Sciences
    • US Food and Drug Administration (FDA)
    • US National institute of Health
    • Children's Hospital of Philadelphia
    • European Open Science Cloud - Life (EOSC-Life)
    • BioExcel, Centre of Excellence for Computational Biomolecular Research
    • ELIXIR
    • Wellcome Trust Sanger Institute
    • Memorial Sloan Kettering Cancer Center
    • UNLOCK
    • PubSeq
  • Geospatial
    • Open Geospatial Consortium
  • Hydrology
    • The European Open Science Cloud
  • Radio Astronomy
    • The European Open Science Cloud for Research Pilot Project
  • Cross-Domain
    • The [Netherlands] National Plan Open Science
    • University of Manchester, eScience Lab

Life Sciences §

US Food and Drug Administration (FDA) §

BioCompute Object / IEEE Std 2791™--2020FDA approved the IEEE Std 2791™--2020 (also known as "BioCompute Object" ) standard for use in regulatory submissions. IEEE 2791™--2020 recommends that "[i]f execution portability is desired, then the included script should be in the Common Workflow Language v1.0 or later format.”

Additional Info:

https://standards.ieee.org/standard/2791-2020.html
https://www.govinfo.gov/content/pkg/FR-2020-07-22/pdf/2020-15907.pdf
US Food and Drug Administration (FDA) logo

US National Institutes of Health (NIH) §

RAPT §

NCBI Insights : Read assembly and Annotation Pipeline Tool (RAPT) is available for use and testingRAPT is a NCBI pipeline designed for assembling and annotating short genomic sequencing reads obtained from bacterial or archaeal isolates. RAPT is written using docker for maximal portability. A RAPT Docker container includes SKESA, a high-accuracy assembler for short reads, PGAP, the annotation pipeline written in the common workflow language (CWL) and used by RefSeq, and cwltool, the reference implementation for CWL.

NCI Cancer Genomics Cloud §

NCI Cancer Genomics CloudThe Seven Bridges Cancer Genomics Cloud (CGC) by the US National Cancer Institute (NCI) enables researchers to rapidly access and collaborate on massive public cancer genomic datasets, including The Cancer Genome Atlas. Within the first 15 months, over 1,900 researchers have registered on the platform, representing 150 institutions across 30 countries. CGC users have deployed more than 5,000 tools or workflows and performed 80,000 executions, representing over 97 years of total computation. All tools on CGC packaged within Docker containers with execution instructions described using Common Workflow Language.

NCI Cancer Genomics Cloud logo

NIH BioData Catalyst §

NHLBI BioData CatalystThe BioData Catalyst powered by Seven Bridges offers researchers collaborative workspaces for analyzing genomics data at scale. Researchers can find and analyze the hosted TOPMed studies by using hundreds of optimized analysis tools and workflows (pipelines); creating their own workflows; or interactive analysis.

NCI Genomic Data Commons §

https://gdc.cancer.gov/The National Cancer Institute’s (NCI’s) Genomic Data Commons (GDC) is to provide the cancer research community with a data repository of uniformly processed genomic and associated clinical data that enables data sharing and collaborative analysis in the support of precision medicine. All major GDC data production pipelines are written in the Common Workflow Language.

Gabriella Miller Kids First Data Resource Center §

Gabriella Miller Kids First Data Resource CenterThe NIH Common Fund-supported Gabriella Miller Kids First Data Resource Center enables researchers, clinicians, and patients to work together to accelerate research and promote new discoveries for children affected with cancer and structural birth defects. The group collects patient data and processes the data with CWL workflows in order to create a dataset that is ready for further research to better know the diseases and ultimately improve outcomes for children with similar problems in the future.

Example workflows:

  • https://github.com/kids-first/kf-alignment-workflow
  • https://github.com/kids-first/kf-rnaseq-workflow
  • https://github.com/kids-first/kf-somatic-workflow
  • https://github.com/kids-first/kf-jointgenotyping-workflow
  • https://github.com/kids-first/kf-genotype-refinement-workflow
Gabriella Miller Kids First Data Resource Center logo

Children's Hospital of Philadelphia §

CAVATICACAVATICA is a cloud-based platform for collaboratively accessing, sharing, and analyzing pediatric cancer data. Cavatica was created as part of the White House’s launch of the Precision Medicine Initiative. Cavatica enables researchers to analyze data through portable, shareable, and reproducible CWL workflows to discover the shared mutations or characteristics across many different diseases. Cavactica was created by the Center for Data-Driven Discovery in Biomedicine in collobration with Seven Bridges and interoperates with the Cancer Genomics Cloud. Cavatica was awarded Best-in-Show at the 2017 Bio-IT World Conference and was a highlighted partner of the Biden Cancer Moonshot.

Children's Hospital of Philadelphia logo

European Open Science Cloud - Life (EOSC-Life) §

European Open Science Cloud - Life (EOSC-Life)EOSC-Life brings together the 13 Life Science ‘ESFRI’ research infrastructures (LS RIs) to create an open, digital and collaborative space for biological and medical research. The EOSC-Life Workflow Hub is a workflow registry designed around FAIR principles. Beta-released in Sept 2020, the Hub now holds nearly 100 workflows, including 36 curated COVID-19 workflows. Use of the Common Workflow Language is encouraged in the Workflow Hub since CWL provides a canonical description of the workflow.

Additional Info:

https://about.workflowhub.eu/About-WorkflowHub
European Open Science Cloud - Life (EOSC-Life) logo

BioExcel, Centre of Excellence for Computational Biomolecular Research §

BioExcel, Centre of Excellence for Computational Biomolecular ResearchThe BioExcel Center of Excellence supports academia and industry with the usage of advanced techniques for high-end computing. The CWL Viewer was created as a third-year project at The University of Manager and further developed as part of the BioExcel project. The CWL Viewer is available to visualize any CWL workflow definitions, show their annotations and composition.

BioExcel, Centre of Excellence for Computational Biomolecular Research logo

ELIXIR §

Interoperability PlatformELIXIR coordinates and develops life science resources across Europe so that researchers can more easily find, analyse and share data, exchange expertise, and implement best practices. This makes it possible for them to gain greater insights into how living organisms work. ELIXIR's activities are divided into five areas called 'Platforms'. These are Data, Tools, Compute, Interoperability, and Training. The Interoperability Platform focuses on developing and encouraging the adoption of standards. One of the tasks of this plafrom is to develop a FAIR service infrastructure that incorporates tools that are fit-for-purpose. CWL is supported in this infrastructure to allow for workflow interoperability.

Additional Info:

  • Marine Metagenomics Community
  • 2018 Implementation Study: Enabling the reuse, extension, scaling, and reproducibility of scientific workflows
  • 2018 Implementation Study: Compatible Cloud Analysis Platform
  • ELIXIR Cloud CWL-WES
ELIXIR logo

Memorial Sloan Kettering Cancer Center §

MSK-ACCESS teamMemorial Sloan Kettering Cancer Center has devoted more than 135 years to exceptional patient care, innovative research, and outstanding educational programs. It is one of 51 National Cancer Institute–designated Comprehensive Cancer Centers, with state-of-the-art science flourishing side by side with clinical studies and treatment. Genome scientists, bioinformaticians, and molecular pathologists at Memorial Sloan Kettering have developed MSK-Access. The MSK-ACCESS assay is a comprehensive liquid biopsy test that offers noninvasive cancer genomic profiling and disease monitoring. It is designed to detect genetic alterations in cfDNA (cell-free DNA) specimens, such as blood and other body fluids. CWL was used to create the MSK-ACCESS analysis workflows.

Memorial Sloan Kettering Cancer Center logo

UNLOCK §

UNLOCK-HomeUNLOCK is an open infrastructure for exploring new horizons for research on microbial communities. It is composed of three complementary experimental platforms for high-throughput discovery and characterization of microbial communities and a FAIR-data platform for large scale data storage, data extraction and analysis of high-throughput data in a cloud-based infrastructure.

UNLOCK logo

PubSeq §

COVID-19 PubSeq: Public SARS-CoV-2 Sequence ResourcePubSeq is a public data and workflow initiative targeting COVID19 (initially). COVID-19 PubSeq is a free and open online bioinformatics public sequence resource with federated data using unique identifiers and with unique metadata, such as disambiguated Geo localisation. On COVID-19 PubSeq the data, metadata, and analysis tools live together, publicly and freely. PubSeq comes with on-the-fly analysis of sequenced SARS-CoV-2 samples that allows for a quick turnaround in identification of new virus strains. PubSeq allows anyone to upload sequence material in the form of FASTA or FASTQ files with accompanying metadata through a web interface or REST API. COVID-19 PubSeq uses CWL workflows and is backed by Arvados.

PubSeq logo

Geospatial §

Open Geospatial Consortium §

Open Geospatial ConsortiumThe Open Geospatial Consortium (OGC) is "an international consortium of more than 500 businesses, government agencies, research organizations, and universities driven to make geospatial (location) information and services FAIR - Findable, Accessible, Interoperable, and Reusable."

Additional Info:

Webinar: https://register.gotowebinar.com/recording/4458989551587584774

~5 minutes each promotional videos:
  1. Terradue: Earth Observations Applications Architecture
  2. OGC EO Apps pilot - Applications
  3. OGC EO Apps pilot - Platforms
  4. EO Applications Architecture for Application Developers - Urban app deployment and execution (DLR)
Open Geospatial Consortium logo

Hydrology §

The European Open Science Cloud for Research Pilot Project §

Science Demonstrator: Earth Sciences – Hydrology: Switching on the EOSC for Reproducible Computational Hydrology by FAIRifying eWaterCycle and SWITCH-ON

The European Open Science Cloud will offer 1.7 million European researchers and 70 million professionals in science and technology a virtual environment with open and seamless services for storage, management, analysis and re-use of research data, across borders and scientific disciplines by federating existing scientific data infrastructures, today scattered across disciplines and Member States. The EOSCpilot project has been funded to support the first phase in the development of the European Open Science Cloud (EOSC). Within this pilot, science demonstrators show the relevance and usefulness of EOSC Services and how they enable data reuse, and will drive EOSC development.

The European Open Science Cloud for Research Pilot Project logo

Radio Astronomy §

The European Open Science Cloud for Research Pilot Project §

Science Demonstrator: LOFAR data

The European Open Science Cloud will offer 1.7 million European researchers and 70 million professionals in science and technology a virtual environment with open and seamless services for storage, management, analysis and re-use of research data, across borders and scientific disciplines by federating existing scientific data infrastructures, today scattered across disciplines and Member States. The EOSCpilot project has been funded to support the first phase in the development of the European Open Science Cloud (EOSC). Within this pilot, science demonstrators show the relevance and usefulness of EOSC Services and how they enable data reuse, and will drive EOSC development.

The European Open Science Cloud for Research Pilot Project logo

Cross-Domain §

The [Netherlands] National Plan Open Science §

National Plan Open Science - Feb. 2019The National Programme Open Science (NPOS) was established to join and coordinate efforts towards open science and promote its importance. The National Plan Open Science serves to implement a transition towards an open science system in the Netherlands. The National Plan Open Science contains an inventory of existing initiatives. The eScience center is recongnized for the research activity support it provides. The eScience Center ensures that the entire scientific community has public access, on an open-source fully documented basis, to research data related to all research projects involving Dutch universities, through the eScience Technology Platform and open platforms such as GitHub and Zenodo. The eScience Center also makes an active contribution to the continued development and Dutch use of the most promising software and tools in the field of open science. Examples include the Common Workflow Language, for the description of analyses and the use of software, and Docker, which can be used to create a durable record of software for reuse. Both assist with the reproducible performance of analyses irrespective of the computer or environment.

The [Netherlands] National Plan Open Science logo

University of Manchester, eScience Lab §

eScience Lab - Common Workflow LanguageThe eScience Lab team is focused on research and development around a set of tools designed for data driven and computational research. The eScience Lab has participated in the CWL project since its early days, contributing with our workflow and reproducibility expertise from Apache Taverna, Research Objects and myExperiment. The CWL Viewer, developed by the eScience Lab with support from BioExcel, provides a graphical visualization of CWL workflows.

University of Manchester, eScience Lab logo

The ACROSS European Project §

HPC Big Data Artificial Intelligence Cross Stack Platform Towards ExascaleThe ACROSS project (G.A. 955648) is an European High-Performance Computing Joint Undertaking that will co-design and develop an HPC, BD, and Artificial Intelligence (AI) convergent platform, supporting applications in the Aeronautics, Climate and Weather, and Energy domains. The project will leverage on effective mechanisms to describe and manage complex workflows for the next generation of heterogeneous pre-exascale infrastructures.

The ACROSS European Project logo

Copyright © 2014-2022 • Common Workflow Language (CWL) • Code of Conduct