Rna multisample

Processing unimodal multi-sample RNA transcriptomics data.

Info

ID: rna_multisample
Namespace: multiomics

Links

Source

Example commands

You can run the pipeline using nextflow run.

View help

You can use --help as a parameter to get an overview of the possible parameters.

nextflow run openpipelines-bio/openpipeline \
  -r 0.10.0 -latest \
  -main-script ./workflows/multiomics/rna_multisample/main.nf \
  --help

Run command

Example of params.yaml

# Inputs
id: # please fill in - example: "concatenated"
input: # please fill in - example: "dataset.h5mu"

# Output
# output: "$id.$key.output.h5mu"

# Filtering highly variable genes
filter_with_hvg_var_output: "filter_with_hvg"
filter_with_hvg_obs_batch_key: "sample_id"
filter_with_hvg_flavor: "seurat"
# filter_with_hvg_n_top_genes: 123

# QC metrics calculation options
var_qc_metrics: ["filter_with_hvg"]
top_n_vars: [50, 100, 200, 500]

# Nextflow input-output arguments
publish_dir: # please fill in - example: "output/"
# param_list: "my_params.yaml"

nextflow run openpipelines-bio/openpipeline \
  -r 0.10.0 -latest \
  -profile docker \
  -main-script ./workflows/multiomics/rna_multisample/main.nf \
  -params-file params.yaml

Note

Replace -profile docker with -profile podman or -profile singularity depending on the desired backend.

Argument groups

Inputs

Name	Description	Attributes
`--id`	ID of the concatenated file	`string`, required, example: `"concatenated"`
`--input`	Path to the samples.	`file`, required, example: `"dataset.h5mu"`

Output

Name	Description	Attributes
`--output`	Destination path to the output.	`file`, required, example: `"output.h5mu"`

Filtering highly variable genes

Name	Description	Attributes
`--filter_with_hvg_var_output`	In which .var slot to store a boolean array corresponding to the highly variable genes.	`string`, default: `"filter_with_hvg"`
`--filter_with_hvg_obs_batch_key`	If specified, highly-variable genes are selected within each batch separately and merged. This simple process avoids the selection of batch-specific genes and acts as a lightweight batch correction method. For all flavors, genes are first sorted by how many batches they are a HVG. For dispersion-based flavors ties are broken by normalized dispersion. If flavor = ‘seurat_v3’, ties are broken by the median (across batches) rank based on within-batch normalized variance.	`string`, default: `"sample_id"`
`--filter_with_hvg_flavor`	Choose the flavor for identifying highly variable genes. For the dispersion based methods in their default workflows, Seurat passes the cutoffs whereas Cell Ranger passes n_top_genes.	`string`, default: `"seurat"`
`--filter_with_hvg_n_top_genes`	Number of highly-variable genes to keep. Mandatory if filter_with_hvg_flavor is set to ‘seurat_v3’.	`integer`

QC metrics calculation options

Name	Description	Attributes
`--var_qc_metrics`	Keys to select a boolean (containing only True or False) column from .var. For each cell, calculate the proportion of total values for genes which are labeled ‘True’, compared to the total sum of the values for all genes.	List of `string`, default: `"filter_with_hvg"`, example: `"ercc,highly_variable"`, multiple_sep: `","`
`--top_n_vars`	Number of top vars to be used to calculate cumulative proportions. If not specified, proportions are not calculated. `--top_n_vars 20,50` finds cumulative proportion to the 20th and 50th most expressed vars.	List of `integer`, default: `50, 100, 200, 500`, multiple_sep: `","`

Authors

Dries De Maeyer (author)
Robrecht Cannoodt (author, maintainer)
Dries Schaumont (author)

Visualisation

flowchart LR
    p0(Input)
    p3(toSortedList)
    p5(flatMap)
    p7(toSortedList)
    p9(Output)
    p15(normalize_total)
    p17(join)
    p25(log1p)
    p27(join)
    p35(delete_layer)
    p37(join)
    p45(filter_with_hvg)
    p47(join)
    p55(rna_calculate_qc_metrics)
    p57(join)
    p64(Output)
    p0-->p3
    p3-->p5
    p5-->p7
    p7-->p9
    p5-->p17
    p5-->p15
    p15-->p17
    p17-->p27
    p17-->p25
    p25-->p27
    p27-->p37
    p27-->p35
    p35-->p37
    p37-->p47
    p37-->p45
    p45-->p47
    p47-->p57
    p47-->p55
    p55-->p57
    p57-->p64