Overview of this workflow

Structural variant calling from single-cell Strand-seq data Snakemake pipeline.

Overview of this workflow

This workflow uses Snakemake to execute all steps of MosaiCatcher in order. The starting point are single-cell BAM files from Strand-seq experiments and the final output are SV predictions in a tabular format as well as in a graphical representation. To get to this point, the workflow goes through the following steps:

Binning of sequencing reads in genomic windows of 200kb via mosaic
Strand state detection
[Optional]Normalization of coverage with respect to a reference sample
Multi-variate segmentation of cells (mosaic)
Haplotype resolution via StrandPhaseR
Bayesian classification of segmentation to find SVs using MosaiClassifier
Visualization of results using custom R plots


MosaiCatcher snakemake pipeline and visualisations examples
ashleys-qc-pipeline

📘 Documentation

📆 Roadmap

Technical-related features

Bioinformatic-related features

Small issues to fix

replace input_bam_location by data_location (harmonization with ashleys-qc-pipeline)
List of commands available through list_commands parameter (1.8.6
Move pysam / SM tag comparison script to snakemake rule (2.2.0)

🛑 Troubleshooting & Current limitations

Do not change the structure of your input folder after running the pipeline, first execution will build a config dataframe file (OUTPUT_DIRECTORY/config/config.tsv) that contains the list of cells and the associated paths
Do not change the list of chromosomes after a first execution (i.e: first execution on chr17, second execution on all chromosomes)

💂‍♂️ Authors (alphabetical order)

Ashraf Hufash
Cosenza Marco
Ebert Peter
Ghareghani Maryam
Grimes Karen
Gros Christina
Höps Wolfram
Jeong Hyobin
Kinanen Venla
Korbel Jan
Marschall Tobias
Meiers Sasha
Porubsky David
Rausch Tobias
Sanders Ashley
Van Vliet Alex
Weber Thomas (maintainer and current developer)

📕 References

MosaiCatcher v2 publication: Weber Thomas, Marco Raffaele Cosenza, and Jan Korbel. 2023. ‘MosaiCatcher v2: A Single-Cell Structural Variations Detection and Analysis Reference Framework Based on Strand-Seq’. Bioinformatics 39 (11): btad633. https://doi.org/10.1093/bioinformatics/btad633.

Strand-seq publication: Falconer, E., Hills, M., Naumann, U. et al. DNA template strand sequencing of single-cells maps genomic rearrangements at high resolution. Nat Methods 9, 1107–1112 (2012). https://doi.org/10.1038/nmeth.2206

scTRIP/MosaiCatcher original publication: Sanders, A.D., Meiers, S., Ghareghani, M. et al. Single-cell analysis of structural variations and complex rearrangements with tri-channel processing. Nat Biotechnol 38, 343–354 (2020). https://doi.org/10.1038/s41587-019-0366-x

ArbiGent publication: Porubsky, David, Wolfram Höps, Hufsah Ashraf, PingHsun Hsieh, Bernardo Rodriguez-Martin, Feyza Yilmaz, Jana Ebler, et al. 2022. “Recurrent Inversion Polymorphisms in Humans Associate with Genetic Instability and Genomic Disorders.” Cell 185 (11): 1986-2005.e26. https://doi.org/10.1016/j.cell.2022.04.017.

scNOVA publication: Jeong, Hyobin, Karen Grimes, Kerstin K. Rauwolf, Peter-Martin Bruch, Tobias Rausch, Patrick Hasenfeld, Eva Benito, et al. 2022. “Functional Analysis of Structural Variants in Single Cells Using Strand-Seq.” Nature Biotechnology, November, 1–13. https://doi.org/10.1038/s41587-022-01551-4.

Name		Name	Last commit message	Last commit date
Latest commit History 3,161 Commits
.github		.github
.tests		.tests
afac		afac
config		config
docs		docs
envs		envs
github-actions-runner		github-actions-runner
watchdog_pipeline		watchdog_pipeline
workflow		workflow
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.gitpod.Dockerfile		.gitpod.Dockerfile
.gitpod.yml		.gitpod.yml
.gitpod.yml.bak		.gitpod.yml.bak
.pre-commit-config.yaml		.pre-commit-config.yaml
.snakemake-workflow-catalog.yml		.snakemake-workflow-catalog.yml
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md

License

friendsofstrandseq/mosaicatcher-pipeline

Folders and files

Latest commit

History

Repository files navigation

Overview of this workflow

📘 Documentation

📆 Roadmap

Technical-related features

Bioinformatic-related features

Small issues to fix

🛑 Troubleshooting & Current limitations

💂‍♂️ Authors (alphabetical order)

📕 References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages