Kevin Kunzmann / impute-gene-expression
Commit 599605f5, authored Sep 02, 2019 by Kevin Kunzmann
Merge branch 'master' of https://github.com/kkmann/impute-gene-expression
Parents: cb431c22, 87dfc01d
Showing 1 changed file with 24 additions and 12 deletions
README.md (+24, -12)
-# Impute gene expression data for CENTER-TBI using PrediXcan
+# Impute gene expression for CENTER-TBI with PrediXcan
-The singularity container is available for download under https://doi.org/10.5281/zenodo.3376504.
-Data currently needs to be accessed manually due to access restrictions, this workflow should work for essentially any
-vcf.gz file with dosage (DS) information.
+The singularity container with most software dependencies is available at
+https://doi.org/10.5281/zenodo.3376504.
+Data currently needs to be accessed manually due to access restrictions.
+This workflow is designed for *.vcf.gz files with dosage (DS) information.
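Since the workflow expects dosage (DS) information in its input, a quick pre-check of a candidate file can save a failed run. The following is only a minimal sketch and not part of the repository: the function name `vcf_has_dosage` is hypothetical, and it assumes a standard tab-separated VCF whose ninth column is FORMAT. It inspects only the first variant record.

```python
import gzip


def vcf_has_dosage(path):
    """Return True if the first variant record lists DS in its FORMAT column.

    Hypothetical helper for illustration; it only looks at the first record.
    """
    with gzip.open(path, "rt") as handle:
        for line in handle:
            if line.startswith("#"):
                continue  # skip meta-information and header lines
            fields = line.rstrip("\n").split("\t")
            # VCF columns: CHROM POS ID REF ALT QUAL FILTER INFO FORMAT samples...
            if len(fields) < 9:
                return False  # no FORMAT column, hence no per-sample fields
            return "DS" in fields[8].split(":")
    return False  # file contains no variant records
```

A file for which this returns `False` would likely need to be re-exported with dosages before being passed to the workflow.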
-More information on PrediXcan can be found here https://github.com/hakyimlab/PrediXcan and here in the publication:
+More information on PrediXcan can be found here https://github.com/hakyimlab/PrediXcan and in:
> Gamazon ER†, Wheeler HE†, Shah KP†, Mozaffari SV, Aquino-Michaels K, Carroll RJ, Eyler AE, Denny JC,
Nicolae DL, Cox NJ, Im HK. (2015) A gene-based association method for mapping traits using reference transcriptome data.
Nat Genet. doi:10.1038/ng.3367.
-We use snakemake to organize the workflow (also pre-installed in the container) and support cluster execution.
-> Johannes Köster, Sven Rahmann, Snakemake—a scalable bioinformatics workflow engine, Bioinformatics,
-Volume 28, Issue 19, 1 October 2012, Pages 2520–2522, https://doi.org/10.1093/bioinformatics/bts480
## Dependencies
1. linux shell (`bash`), possibly via virtual machine on Windows/Mac
2. `wget` (pre-installed or via distribution package manager)
-3. `singularity` container software (tested on 3.3.0) https://sylabs.io/guides/3.3/user-guide
+3. `singularity` container software (tested on 3.3.0, https://sylabs.io/guides/3.3/user-guide)
-4. `git`
+4. `git` (https://git-scm.com/book/en/v2/Getting-Started-Installing-Git)
### Optional
5. python 3.7+ and snakemake
6. slurm cluster
+We use snakemake to organize the workflow (also pre-installed in the container) and support cluster execution.
+Snakemake is available as a `pip` package for python 3.7.
+> Johannes Köster, Sven Rahmann, Snakemake—a scalable bioinformatics workflow engine, Bioinformatics,
+Volume 28, Issue 19, 1 October 2012, Pages 2520–2522, https://doi.org/10.1093/bioinformatics/bts480
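Before starting, it can help to confirm that the tools listed under Dependencies are on the `PATH`. The sketch below is illustrative and not part of the repository: the helper name `missing_tools` is hypothetical, and using `sbatch` as the indicator for a slurm installation is an assumption.

```python
import shutil

# Command names taken from the dependency list above.
REQUIRED = ["bash", "wget", "singularity", "git"]
# Assumption: `sbatch` is used here as a stand-in for "slurm is available".
OPTIONAL = ["snakemake", "sbatch"]


def missing_tools(names, which=shutil.which):
    """Return the subset of `names` that cannot be found on the PATH."""
    return [name for name in names if which(name) is None]


if __name__ == "__main__":
    print("missing required tools:", missing_tools(REQUIRED))
    print("missing optional tools:", missing_tools(OPTIONAL))
```

An empty list for the required tools suggests the basic setup is in place; the check does not verify versions such as the tested singularity 3.3.0.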
## Execution
Download and extract the contents of this repository (might be access restricted)
...
@@ -46,4 +48,14 @@ Optionally, if snakemake is installed, the workflow can be run in parallel via
    snakemake --use-singularity -j 8 impute
where '8' can be replaced by the number of available cores.
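The core count in the command above can also be filled in programmatically. This is a rough sketch only: the function name `snakemake_command` is hypothetical, and `os.cpu_count()` reports the logical cores of the local machine, which may differ from what a scheduler actually grants.

```python
import os


def snakemake_command(cores=None, target="impute"):
    """Build the snakemake call shown above as an argument list."""
    if cores is None:
        # Fall back to 1 if the core count cannot be determined.
        cores = os.cpu_count() or 1
    return ["snakemake", "--use-singularity", "-j", str(cores), target]


if __name__ == "__main__":
    # Print the command instead of running it, so it can be inspected first.
    print(" ".join(snakemake_command()))
```

The resulting list could be handed to `subprocess.run(..., check=True)` from the repository root once the printed command looks right.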
+Cluster execution is enabled via the `scripts/slurm_snakemake.sh` script as
+    bash scripts/slurm_snakemake.sh impute
+## Results
+Intermediate files (PrediXcan dosage files and raw outputs) are stored in the `outputs/` subfolder of the working directory.
+The file `output/gene_expressions_combined.rds` combines imputed gene expression levels across
+all available brain regions in a compressed .rds file (R data set).