Differential RNA splicing analysis with single-cell/nucleus RNA-seq data¶
Before you start¶
- Perform mapping of sc(sn)RNA-seq reads to the reference genome using STARsolo.
- You can download a test input file mapped by STARsolo on the mouse genome from here.
- Download a gene annotataion file of your interest in GTF format.
Installation¶
1 2 3 4 |
|
scShiba¶
1. Prepare inputs¶
experiment.tsv
: A tab-separated text file of barcode file and STAR solo raw output directory.
1 2 3 4 5 |
|
barcodes.tsv
is a tab-separated text file of barcode and group name like this:
1 2 3 4 5 6 7 8 9 10 |
|
Make sure to use tabs
If you copy and paste the above example, your experiment.tsv file may contain spaces instead of tabs, which will causes an error when you run scShiba. Please make sure that you are using a tab character between the columns.
config.yaml
: A yaml file of the configuration.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
|
- The working directory where the output files will be saved. Please make sure that you have write permission to this directory.
- The path to the gene annotation file in GTF format.
- The path to the
experiment.tsv
file. - Set to
True
if you want to skip the differential analysis and only calculate PSI values for each sample. - Significance threshold for differential splicing analysis.
- Minimum difference in PSI values between groups to be considered significant.
- Reference group for differential splicing analysis.
- Alternative group for differential splicing analysis.
- Minimum number of reads required to calculate PSI values.
- Set to
True
if you want to generate a file of splicing analysis results in excel format.
2. Run¶
1 |
|
You are going to use 4 threads for parallelization. You can change the number of threads by changing the -p
option.
Did you encounter any problems?
You can run scShiba with the --verbose
option to see the debug log. This will help you to find the problem.
1 |
|
SnakeScShiba¶
A snakemake-based workflow of scShiba. This is useful for running scShiba on a cluster. Snakemake automatically parallelizes the jobs and manages the dependencies between them.
1. Prepare inputs¶
experiment.tsv
: A tab-separated text file of sample ID, path to fastq files, and groups for differential analysis. This is the same as the input for scShiba.
config.yaml
: A yaml file of the configuration. This is the same as the input for scShiba but with the addition of the container
field.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
|
- The working directory where the output files will be saved. Please make sure that you have write permission to this directory.
- The Docker image of Shiba.
- The path to the gene annotation file in GTF format.
- The path to the
experiment.tsv
file. - Set to
True
if you want to skip the differential analysis and only calculate PSI values for each sample. - Significance threshold for differential splicing analysis.
- Minimum difference in PSI values between groups to be considered significant.
- Reference group for differential splicing analysis.
- Alternative group for differential splicing analysis.
- Minimum number of reads required to calculate PSI values.
- Set to
True
if you want to generate a file of splicing analysis results in excel format.
2. Run¶
Please make sure that you have installed Snakemake and Singularity and cloned the Shiba repository on your system.
1 2 3 4 5 |
|