modify Snakefile and add option in conf file to run dekupl on ensembl… #30
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hello, in the past I changed the code of the dekupl-run Snakefile and dekupl-annotation to use it on ensembl official gff/gtf, making possible to run de-kupl on non-human models. My initial tests were based on the zebrafish annotation for example. I tried to make this version with the minimum of modifications and exclusively on dekupl-run. To be brief the genes names and other informations are stored differentially in the fasta and I changed the extraction in consequence. The config file contain now the "annotation_type" that can be "gencode" or "ensembl", it works as an option. Finally the gff file of ensembl don't contain the ENSGXXXXXXXX .X information, so I removed it when the gene reference is extracted form the fasta. Otherwise dekupl-annotation getSwitches.R code will output nothing. DEKUPL-ANNOTATION IS NOT FULLY TESTED WITH THIS VERSION. I'm afraid that i must remove the .X information in transcript references too, even if I'm not sure that transcript reference is used in gff management of dekupl-annotation.
ps : it lack a space in annotation download commande, do not accept this modification.