rna-seq pipeline #4932

Open
chubukov opened this issue May 16, 2024 · 2 comments

Comments

@chubukov

chubukov commented May 16, 2024

I'm wondering if there is concise documentation somewhere for reproducing the exact analysis of the TCGA/GTEx data in this paper (but with new samples).

https://github.com/BD2KGenomics/toil-rnaseq/tree/master does not seem to be maintained any more, and I was unable to pip install toil-rnaseq with either Python 2.7 or Python 3.11.

Thanks.

┆Issue is synchronized with this Jira Story
┆Issue Number: TOIL-1569

@adamnovak
Member

adamnovak commented May 16, 2024

It looks like toil-rnaseq did not set a maximum Toil version:
https://github.com/BD2KGenomics/toil-rnaseq/blob/23045b896a2e08a61284d63c822ccfdcd3dab0a7/version.py#L17-L18
So installing it with pip install will pull in versions of Toil that could be too new to be compatible with it.
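
To illustrate the point about the missing upper bound, here is a minimal sketch of how a "newest release that satisfies the bounds" rule behaves with and without a maximum. It is not code from toil-rnaseq or pip; the helper names and every version number in it are hypothetical placeholders chosen for illustration.

```python
# Illustrative sketch only: not code from toil-rnaseq or pip.
# All helper names and version numbers below are hypothetical.


def parse_version(text):
    """Turn a dotted version such as '3.12.0' into a comparable tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def is_allowed(candidate, minimum, maximum=None):
    """True if minimum <= candidate, and candidate < maximum when a maximum is set."""
    version = parse_version(candidate)
    if version < parse_version(minimum):
        return False
    if maximum is not None and version >= parse_version(maximum):
        return False
    return True


def pick_newest(available, minimum, maximum=None):
    """Mimic a resolver's preference for the newest release inside the bounds."""
    allowed = [v for v in available if is_allowed(v, minimum, maximum)]
    return max(allowed, key=parse_version) if allowed else None


# Hypothetical release history for a dependency.
releases = ["3.5.0", "3.12.0", "5.0.0", "6.1.0"]

# Lower bound only: the newest release wins, even if it is incompatible.
unbounded_choice = pick_newest(releases, minimum="3.5.0")

# Lower and upper bound: the choice stays inside the known-good range.
bounded_choice = pick_newest(releases, minimum="3.5.0", maximum="4.0.0")

print(unbounded_choice, bounded_choice)
```

With only a minimum, the newest hypothetical release is selected; adding a maximum keeps the selection inside the older range, which is the constraint the package's requirements do not express.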

It looks like the workflow has a Docker container available, along with instructions for using it, so that is probably the easiest way to run the workflow on new samples. The container seems to bundle an appropriate version of Toil. You might need to make sure that the Docker daemon you are using is sufficiently old: Docker v26 turned off the ability to pull images in the older formats that predate v2 schema 2, and some images the pipeline requires, such as docker.io/jvivian/rsem_postprocess:latest, are in schema 1 format.
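
One way to check for the Docker version issue up front is sketched below, under stated assumptions. The v26 cutoff is taken from the comment above rather than verified independently, the function names are hypothetical, and the daemon query relies on the `docker version --format` command being available on the machine.

```python
import subprocess

# Assumption taken from the comment above: starting with Docker v26,
# pulling schema 1 images is turned off.
FIRST_MAJOR_WITHOUT_SCHEMA1_PULL = 26


def major_version(version_text):
    """Extract the leading major number from a string such as '26.0.0'."""
    return int(version_text.strip().split(".")[0])


def may_refuse_schema1(version_text):
    """True if a daemon with this version is at or past the assumed cutoff."""
    return major_version(version_text) >= FIRST_MAJOR_WITHOUT_SCHEMA1_PULL


def local_daemon_version():
    """Ask the local Docker daemon for its version.

    Requires the docker CLI and a reachable daemon; raises
    subprocess.CalledProcessError if the command fails.
    """
    result = subprocess.run(
        ["docker", "version", "--format", "{{.Server.Version}}"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()
```

On a machine with Docker installed, `may_refuse_schema1(local_daemon_version())` would indicate whether pulling an image such as docker.io/jvivian/rsem_postprocess:latest is likely to hit the schema 1 restriction.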

How the original workflow was run is described in "Supplementary Note 6" of the paper, but the procedure given there may no longer work, because the published versions of packages installed from the Internet may have changed. I think the AWS API may also have changed, so Toil 3.12 and/or the unmaintained cgcloud might not be able to talk to it anymore.

@chubukov
Author

Thanks @adamnovak, I didn't appreciate that I could just run the docker container without the python wrapper. I'll give it a shot.
