Binning
Conversion of (pre-processed) expression count data into relative values (bins) to address scale differences across sequencing batches
Info
ID: binning
Namespace: scgpt
Example commands
You can run the pipeline using nextflow run.
View help
You can use --help as a parameter to get an overview of the possible parameters.
nextflow run openpipelines-bio/openpipeline \
-r 2.1.1 -latest \
-main-script target/nextflow/scgpt/binning/main.nf \
--help
Run command
Example of params.yaml
# Inputs
input: # please fill in - example: "input.h5mu"
modality: "rna"
# input_layer: "foo"
var_input: "id_in_vocab"
n_input_bins: 51
# Outputs
# output: "$id.$key.output.h5mu"
# output_compression: "gzip"
output_obsm_binned_counts: "binned_counts"
# seed: 123
# Nextflow input-output arguments
publish_dir: # please fill in - example: "output/"
# param_list: "my_params.yaml"
# Arguments
nextflow run openpipelines-bio/openpipeline \
-r 2.1.1 -latest \
-profile docker \
-main-script target/nextflow/scgpt/binning/main.nf \
-params-file params.yaml
Note
Replace -profile docker with -profile podman or -profile singularity depending on the desired backend.
Argument groups
Inputs
| Name | Description | Attributes |
|---|---|---|
| --input | Input h5mu file. | file, required, example: "input.h5mu" |
| --modality | | string, default: "rna" |
| --input_layer | MuData layer (key from .layers) to use as input data for binning. If not specified, .X is used. | string |
| --var_input | The name of the adata.var column containing a boolean mask for vocabulary cross-checked and/or highly variable genes. | string, default: "id_in_vocab" |
| --n_input_bins | The number of bins to discretize the data into. When no value is provided, data won’t be binned. | integer, default: 51 |
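To illustrate what converting counts into relative values means, here is a minimal Python sketch. It is not the component's actual code: the function name `bin_cell` is hypothetical, and the per-cell quantile scheme (zeros stay in bin 0, nonzero values are ranked against quantile edges of that cell's own nonzero values) is an assumption modelled on scGPT-style preprocessing.

```python
import numpy as np


def bin_cell(row: np.ndarray, n_bins: int) -> np.ndarray:
    """Map one cell's non-negative expression values to integer bins.

    Hypothetical helper for illustration only. Zeros stay in bin 0;
    nonzero values land in bins 1 .. n_bins - 1.
    """
    binned = np.zeros(row.shape, dtype=np.int64)
    nonzero = row > 0
    if not nonzero.any():
        return binned
    values = row[nonzero]
    # Edges are quantiles of this cell's own nonzero values, so the
    # result depends on the ranking within the cell, not on absolute scale.
    edges = np.quantile(values, np.linspace(0, 1, n_bins - 1))
    # np.digitize returns i with edges[i-1] <= v < edges[i]:
    # the smallest nonzero value gets 1, the largest gets n_bins - 1.
    binned[nonzero] = np.digitize(values, edges)
    return binned


cell = np.array([0.0, 1.0, 5.0, 2.0, 0.0, 9.0, 3.0])
shallow = bin_cell(cell, n_bins=51)
deep = bin_cell(cell * 2, n_bins=51)  # same cell, twice the depth
print(np.array_equal(shallow, deep))
```

Because the edges are computed per cell, multiplying a cell's counts by a constant leaves its bins unchanged, which is the sense in which binning addresses scale differences across sequencing batches.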
Outputs
| Name | Description | Attributes |
|---|---|---|
| --output | The output h5mu file containing the binned data. | file, required, example: "output.h5mu" |
| --output_compression | The compression algorithm to use for the output h5mu file. | string, example: "gzip" |
| --output_obsm_binned_counts | The name of the .obsm field to write the binned data to. | string, default: "binned_counts" |
| --seed | Seed for random number generation. | integer |
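This page does not say where randomness enters the binning. One plausible source, assumed here and not confirmed by this page, is random tie-breaking: when many values in a cell are identical, several quantile edges coincide and a value can legitimately fall into any of several bins. The sketch below shows that idea and why fixing a seed makes the result reproducible. The function name `digitize_with_random_ties` and the use of `numpy.random.default_rng` are illustrative choices, not the component's API.

```python
import numpy as np


def digitize_with_random_ties(values, edges, rng):
    """Assign bins, picking at random among valid bins for tied values.

    Hypothetical helper for illustration only.
    """
    left = np.digitize(values, edges)               # counts edges <= v
    right = np.digitize(values, edges, right=True)  # counts edges < v
    # left >= right; they differ only where a value equals one or more edges.
    draws = rng.random(len(values))                 # uniform in [0, 1)
    return np.ceil(draws * (right - left) + left).astype(np.int64)


# Four tied values produce repeated quantile edges.
values = np.array([1.0, 1.0, 1.0, 1.0, 2.0, 7.0])
edges = np.quantile(values, np.linspace(0, 1, 5))

first = digitize_with_random_ties(values, edges, np.random.default_rng(123))
second = digitize_with_random_ties(values, edges, np.random.default_rng(123))
print(np.array_equal(first, second))  # same seed, same bins
```

Under this assumption, untied values always receive the same bin, while tied values are spread over the bins they straddle. Passing the same `--seed` would then give identical output across runs.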