Methylation Sequencing

You can find this application in the demos folder of your Jupyter notebook environment.

samplesheet.csv
methylseq_workflow.ipynb

Methylation sequencing (Methyl-Seq) is a key tool for studying DNA methylation, an essential epigenetic modification influencing gene expression and cellular function. The nf-core/methylseq pipeline simplifies the analysis of Methyl-Seq data, offering a robust and reproducible workflow. Running this pipeline on the Camber platform, powered by the Nextflow Engine, makes it easy to perform complex bioinformatics tasks with minimal effort, providing scalability and efficiency for researchers.

The first step is to import the nextflow package:

from camber import nextflow

Here’s an example of how to setup configurations and execute a job:

pipeline="nf-core/methylseq": specify pipeline to run.
engine_size="MICRO": indicate engine size to perform the job.
num_engines=4: indicate number of engines to run workflow tasks in parallel.

Pipeline parameters must be defined in params argument. To ensure the pipeline works as expected, please take note that:

"--input": "./samplesheet.csv": the relative path of samplesheet.csv file to the current notebook. In case of using local FastQ files, the locations of them in samplesheet.csv file content are relative also.
"--outdir": "/camber_outputs": the location stores output data of the job.

nf_methylseq_job = nextflow.create_job(
    pipeline="nf-core/methylseq",
    engine_size="MICRO",
    num_engines=4,
    params={
        "--input": "./samplesheet.csv",
        "--outdir": "/camber_outputs",
        "-r": "3.0.0",
        "--genome": "GRCh37",
    },
)

This step is to check job status:

nf_methylseq_job.status

To monitor job exectution, you can show job logs in real-time by read_logs method:

nf_methylseq_job.read_logs()

When the job is done, you can discover and download the results and logs of the job by two ways:

Browser data directly in notebook environment:

Go to the Stash UI:

This tutorial highlights how Camber simplifies running the nf-core/methylseq pipeline, enabling efficient execution and result retrieval, making it a great choice for Methyl-Seq analysis.