Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
462 commits
Select commit Hold shift + click to select a range
82495b2
Initial commit of tools/MergeAlignmentsToCsv.py
Oct 10, 2017
7236148
Added references for BWA & bowtie
Oct 31, 2017
d590e27
Added manual
Oct 31, 2017
72a48ca
Update README.md
ChrisHIV Oct 31, 2017
c8456c6
Update README.md
ChrisHIV Oct 31, 2017
50d3541
Initial commit of tools/QuantifyPairwiseIndels.py
Nov 6, 2017
6064586
Added --reference option to specify coords with respect to a named se…
Nov 6, 2017
d2f4329
Updated comment
Nov 6, 2017
577f531
Update README.md
ChrisHIV Oct 31, 2017
3b8fe51
Added comment
Nov 6, 2017
3739c8a
Bug fix: recent change of output style from ConstructBestRef broke so…
Nov 6, 2017
9b55dbc
Changed progress reporting interval
Nov 7, 2017
0134be8
Initial commit of tools/CompareLengthsInPairwiseAln.py
Nov 14, 2017
da74e7e
Extra options used for the published version of the paper.
Nov 14, 2017
bf43650
Upped the default MinGapSizeToSplitGontig from 100 to 160
Nov 14, 2017
fb336a2
Added the option to randomly select a subset of sequences of the spec…
Nov 17, 2017
f57e83a
Bug fix, for when two distinct parts of the same contig blast to an i…
Nov 17, 2017
5e67a02
More helpful error message for /1 or /2 in read names.
Dec 4, 2017
77bf85c
Redirecting error message to stderr
Dec 4, 2017
5309916
We now crash if the specified reads are not in the working dir and fi…
Dec 4, 2017
cb3a99b
Trivial: 80 char margin rule, deleted unused code, white space.
Dec 4, 2017
6b1d289
Initial commit of trimmomatic short cut
Dec 6, 2017
8c9cfbb
Manual update
Dec 6, 2017
b063d0a
Changed temp file names to show which is mafft --add and which is --a…
ChrisHIV Dec 19, 2017
aebf06a
Update MacInstallationNotes.txt
ChrisHIV Dec 6, 2017
2fd2586
Rename MacInstallationNotes.txt to InstallationNotes.sh
ChrisHIV Dec 7, 2017
d45212c
Update InstallationNotes.sh
ChrisHIV Dec 7, 2017
62211be
Update README.md
ChrisHIV Dec 7, 2017
14e9bf8
Update InstallationNotes.sh
ChrisHIV Dec 7, 2017
193de2b
Update InstallationNotes.sh
ChrisHIV Dec 7, 2017
db3c27e
Update README.md
ChrisHIV Dec 7, 2017
61547d6
Update README.md
ChrisHIV Dec 7, 2017
d527836
Update README.md
ChrisHIV Dec 7, 2017
1ff8ca0
Made FindSeqsInFasta take either names or a file of names as an optio…
ChrisHIV Feb 2, 2018
e6f0526
Initial commit of tools/AddAllPossibleSNPsToSeqs.py
ChrisHIV Mar 8, 2018
706078e
Keep all original seq names in the output (not using a mutated name w…
ChrisHIV Mar 8, 2018
db16284
Initial commit of tools/ExactBLAST.py
ChrisHIV Mar 8, 2018
8141038
Added option to trim primers containing a single SNP (turned off by d…
ChrisHIV Mar 9, 2018
891574d
Added two new read pairs for testing primer trimming.
ChrisHIV Mar 9, 2018
4c837a1
Bug fix: from Feb 2nd 2018 till now, removal of contaminant reads bef…
ChrisHIV Mar 9, 2018
d3fee54
Set default TrimPrimerWithOneSNP=false
ChrisHIV Mar 9, 2018
198b9ee
The pairwise alignment for HXB2 coords now uses linsi be default rath…
ChrisHIV Mar 9, 2018
9a8d185
Changed the criterion for selecting between mafft --addfragments or -…
Mar 10, 2018
b06028a
Bug fix: the user supplying the mapping reference HXB2, named exactly…
ChrisHIV Mar 14, 2018
cb4eb49
shiver_align_contigs.sh now returns an exit status of 2 if and only i…
ChrisHIV Mar 14, 2018
4d0a65b
Changed 'no HIV contigs' exit code from 2 to 3, as 2 has a special me…
ChrisHIV Mar 16, 2018
67de6f7
Initial commit of tools/EstimateAmbiguousBases.py
ChrisHIV Mar 23, 2018
a1a9390
Added count information where ambiguous bases were interpreted.
ChrisHIV Mar 29, 2018
79dc77e
Added --keep-gaps-by-missing and --use-n-for-missing options
ChrisHIV Apr 9, 2018
a5bb197
Added CallAmbigBaseIfNeeded func
ChrisHIV Apr 9, 2018
be047de
Added --skip-ref-in-output option
ChrisHIV Apr 9, 2018
b6c086b
Bug fix: now returns exit status 3 instead of 1 if no good HIV contig…
ChrisHIV Apr 11, 2018
974ebea
Return an exit status of 3 if no reads mapped.
ChrisHIV Apr 11, 2018
992bd52
Now return an exit status of 3 if there are no reads initally, after …
ChrisHIV Apr 11, 2018
55310c2
Added --ignore-n option
ChrisHIV Apr 11, 2018
2826347
Initial commit of tools/CleanGlobalAln.py
ChrisHIV May 1, 2018
54c622c
Removed unused code
ChrisHIV May 1, 2018
9580529
Removed hard-coded skipping of BEE0006-2
ChrisHIV May 1, 2018
9a39749
Added comments
ChrisHIV May 1, 2018
5238f03
When splitting up into separate alignments for each region, made the …
ChrisHIV May 1, 2018
a24ebd2
Initial commit of info/ToDoListForImprovingShiver.txt
ChrisHIV May 1, 2018
a0aa12b
By default we now prepend temp_ to the pre-mapping read filenames; th…
ChrisHIV May 2, 2018
2c74413
Changed the suffix for the reads after cleaning from 'clean' to 'PreM…
ChrisHIV May 2, 2018
50fad35
Made the file of contigs passing the length threshold temporary.
ChrisHIV May 2, 2018
dfcb31b
Exit with status 3 instead of 1 if splitting the aligned corrected co…
ChrisHIV May 4, 2018
740f8ae
Added --trim-missing-ends option
ChrisHIV May 4, 2018
889bbcd
Updated manual
ChrisHIV May 4, 2018
b55ee36
Updated my email address
ChrisHIV May 4, 2018
b27984d
Email address typo
ChrisHIV May 4, 2018
a8ff46f
Restructuring ahead of option for individual consensuses: seqs now re…
ChrisHIV May 8, 2018
134925a
Moved private TranslateSeqCoordsToAlnCoords function to ShiverFuncs
ChrisHIV May 9, 2018
437d934
Rewrite of tools/CleanGlobalAln.py to allow individual consensus file…
ChrisHIV May 10, 2018
1ec17a8
File renamed
ChrisHIV May 10, 2018
d075b49
Made the --split-amplicons option provide the usual output as well.
ChrisHIV May 10, 2018
426b826
Changed output files to be labelled by seq id instead of beehive id
ChrisHIV May 10, 2018
e68dc4b
Bug fix: individual consensuses files that only became wholly undeter…
ChrisHIV May 11, 2018
d1e8a27
Added --skip-string option
ChrisHIV May 11, 2018
cdb68ad
bug fix - missing arg namespace
ChrisHIV May 11, 2018
f9e972d
Added --undetermined-start and --undetermined-end options
ChrisHIV May 15, 2018
aa35ccd
Added --longest-gap option
ChrisHIV May 15, 2018
0744a26
Made multiple input fasta files possible
ChrisHIV May 15, 2018
19c6e08
Added --first-seq-only option
ChrisHIV May 15, 2018
c53777e
Initial commit of tools/AlignBaseFreqFiles_ByConsensuses.py
ChrisHIV May 21, 2018
6088ea4
File renamed
ChrisHIV May 21, 2018
9184bef
Discussion of the AlignBaseFreqFiles differences
ChrisHIV May 21, 2018
caed0af
Update README.md
ChrisHIV May 17, 2018
e040ec4
Update README.md
ChrisHIV May 21, 2018
998b27e
Updated manual
ChrisHIV May 21, 2018
48de6c4
Updated to-do list
ChrisHIV May 21, 2018
3b555fe
Updated diagram
ChrisHIV May 21, 2018
1de4467
Initial commit of tools/CallGlobalConsensusFromCsv.py
ChrisHIV Jun 6, 2018
7c3f23d
Update README.md
ChrisHIV May 21, 2018
451a2e2
Critical bug fix for Mac OS - it's sort --merge --unique command seem…
ChrisHIV Jun 22, 2018
3334201
Added check that the length of the most common kmer is the most commo…
ChrisHIV Jun 22, 2018
666957d
smalt brew command updated
ChrisHIV Jun 21, 2018
633fe8b
mafft now installable from brew
ChrisHIV Jun 21, 2018
7564d1b
Removed superfluous reordering of blast hits from testing
ChrisHIV Jun 25, 2018
8b3b22f
Moved RemoveBlankColumns function into ShiverFuncs. Defined GetSeqSta…
ChrisHIV Jun 26, 2018
a75b853
Added --trim-overhangs option to CutAlignedContigs, and made it remov…
ChrisHIV Jun 26, 2018
f8eec65
Made it the default to trim contigs that overhang the edge of the ali…
ChrisHIV Jun 26, 2018
1c7755f
Renamed tools/SplitGappyContigs.py to tools/CutAlignedContigs.py
ChrisHIV Jun 26, 2018
d04f260
Clarified message
ChrisHIV Jun 26, 2018
50a871f
Updated manual
ChrisHIV Jun 26, 2018
dad131f
Bug fix: the AlignContigsToConsensus option gave an error due to an o…
ChrisHIV Jul 2, 2018
b262e3d
Added --header option to tools/KeepBestLinesInDataFile.py
ChrisHIV Jul 5, 2018
1bb8e1c
In tools/KeepBestLinesInDataFile.py, exit(1) if no data found.
ChrisHIV Jul 5, 2018
1cfe281
Fix for bug from minutes ago: tested emptiness of wrong object.
ChrisHIV Jul 5, 2018
5bea03b
Updated comment
ChrisHIV Jul 5, 2018
1c4c576
Updated manual
ChrisHIV Sep 14, 2018
3124a56
Trivial bug fix for newer Biopython that doesn't allow comparison of …
ChrisHIV Oct 22, 2018
0b8e58d
Made output arg compulsory. Changed expected header of seq-based blac…
ChrisHIV Nov 9, 2018
682aad2
More generic shebang
ChrisHIV Nov 9, 2018
7badc7f
Added option to clean base freq files
ChrisHIV Nov 20, 2018
cd8fce3
Now add to our blacklist those seqs that were skipped because they we…
ChrisHIV Nov 20, 2018
0f062f2
Removed print hangover from testing
ChrisHIV Nov 20, 2018
1a37627
Changed default behaviour to be masking any bases before the first or…
ChrisHIV Nov 21, 2018
031000b
Trivial fix of recent bug that would have stopped the individual cons…
ChrisHIV Nov 21, 2018
967d5f6
For the split_amplicons output for individual consensuses, leading an…
ChrisHIV Nov 21, 2018
ba51237
Moved post-mapping steps into the function ProcessBam. Trivial bug fi…
ChrisHIV Dec 3, 2018
d8bb53f
Initial commit of tools/process_bam.sh
ChrisHIV Dec 3, 2018
cc574d2
Trivial movement of variable assignment to avoid repetition
ChrisHIV Jan 8, 2019
fd93f16
Minor bug fix for tools/CleanConsensuses.py: now masks anything befor…
ChrisHIV Jan 14, 2019
30f4594
Trivial bug fix: one occurence of samtools instead of meant it would…
ChrisHIV Jan 24, 2019
7cb1c7a
Initial commit of tools/GetCoverageFromBamFile.py
ChrisHIV Feb 4, 2019
9d2fa28
samtools BAQ calculation now turned off by deafult, as seems to be ap…
ChrisHIV Feb 5, 2019
528664f
Bug fix / improvement: previously contigs that were discarded for bei…
ChrisHIV Feb 7, 2019
035589d
Updated shiver diagram
ChrisHIV Feb 7, 2019
c98afa6
Added --dont_blacklist_missingness to tools/CleanConsensuses.py
ChrisHIV Feb 18, 2019
b87b779
Initial commit of tools/RecoverAlteredUpperLowerCase.py
ChrisHIV Feb 21, 2019
8e3d7a3
Allow full control of which blast options are used for the contigs (n…
ChrisHIV Mar 12, 2019
4f03180
tools/CorrectContigs.py: added option to output remaining blast hits
ChrisHIV Mar 12, 2019
eb8d4b3
tools/CorrectContigs.py: don't bother converting evalue and pident fr…
ChrisHIV Mar 12, 2019
492bfe4
Gave tools/CorrectContigs.py the compulsory option OverlapFracToMerge
ChrisHIV Mar 21, 2019
abc0dcb
Added the flexibility to try multiple different blast task modes when…
ChrisHIV Mar 21, 2019
5ff3148
Clarified comment
ChrisHIV Mar 21, 2019
e096d2e
Improvement for rare edge case in tools/EstimateAmbiguousBases.py: if…
ChrisHIV Mar 27, 2019
95895e4
Deleting shiver_reprocess_bam.sh because it's superseded by tools/pro…
ChrisHIV Apr 8, 2019
a97a825
Minor bug fix for a crash caused by a very large number of contaminan…
ChrisHIV May 15, 2019
b45b727
Trivial fix: redirecting an error message from stdout to stderr
ChrisHIV May 15, 2019
35b6874
Clarified error message if a position after the end of the reference …
ChrisHIV May 30, 2019
44e089b
Added file name safety check to tools/TranslateSeqForGlobalAln.py to …
ChrisHIV May 30, 2019
e3155e2
Updated manual
ChrisHIV Jun 17, 2019
51a8acd
Changed tools/GetCoverageFromBamFile.py to output reference positions…
ChrisHIV Jun 17, 2019
64132a3
Minor bug fix: now when cleaning the reads, if none of the forward or…
ChrisHIV Jun 28, 2019
37f5516
Redirected an error message from stdout to stderr
ChrisHIV Jun 28, 2019
8e9c6ce
Minor bugfix: if there was precisely 1 pair of reads blasting better …
ChrisHIV Aug 1, 2019
1f254d5
We no longer sort reads as a prelude to read cleaning. This is more r…
ChrisHIV Aug 5, 2019
28f0ec8
Minor bugfix resulting from recent change, where precisely zero conta…
ChrisHIV Aug 5, 2019
9c5f51a
Added a contaminant read pair to the example data that trigger cleaning.
ChrisHIV Aug 5, 2019
516b640
Update README.md
ChrisHIV Aug 5, 2019
ee1e8f2
Added --allow-duplicates option to tools/FindSeqsInFasta.py
ChrisHIV Aug 15, 2019
b20fe87
Print how the script was called, and what the config file parameter v…
ChrisHIV Nov 4, 2019
57183f3
Added clarifying comment
ChrisHIV Nov 4, 2019
c8b6231
In tools/LinkIdentityToCoverage.py, changed the default behaviour to …
ChrisHIV Nov 5, 2019
abf1f46
Initial commit of tools/LinkIdentityToCoverage_CombineBams.py
ChrisHIV Nov 5, 2019
289bf5f
In tools/CleanConsensuses.py, made the patient_based_blacklist argume…
ChrisHIV Nov 18, 2019
49b9f2a
Added script explanatory message
ChrisHIV Nov 18, 2019
5758625
Initial commit of tools/CleanConsensusesOxford.py
ChrisHIV Nov 18, 2019
72a932c
In tools/CleanConsensusesNoAmplicons.py, made the dates input file op…
ChrisHIV Nov 18, 2019
354f19c
Call all python scripts with user-specified python exectuable.
ChrisHIV Mar 24, 2021
4860588
Update InstallationNotes.sh
ChrisHIV May 18, 2021
a6350ca
Update InstallationNotes.sh
ChrisHIV Oct 10, 2022
5e0b62e
Update InstallationNotes.sh
ChrisHIV Oct 10, 2022
682ea36
Update InstallationNotes.sh
ChrisHIV Oct 13, 2022
5d49d0f
Minor bug fix: the optional description field on fastq identifier lin…
ChrisHIV Jan 26, 2023
c654af2
Update README.md
ChrisHIV Jan 26, 2023
76a477a
Updated funcs.sh and map_reads.sh
susannah-b Jan 15, 2024
285eba9
Modified config and FindContaminantReadPairs to allow unpaired
susannah-b Jan 15, 2024
7fcc11b
FindContaminantReadPairs.py bash script for unpaired data, plus fixes
susannah-b Jan 19, 2024
4af4f40
Further edits across map_reads and funcs to correctly handle unpaired…
susannah-b Jan 22, 2024
9e99e0f
Added a check for /1 or /2 suffixes in the unpaired data CheckReadNam…
susannah-b Jan 23, 2024
1505764
Renamed unpaired reads variable, moved some checks, restructured sect…
Jan 24, 2024
2b7d8a7
Consolidated unpaired/paired functions, moved trimmomatic sanity chec…
Jan 25, 2024
ca6efba
Some fatal bugs fixed; others remain
ChrisHIV Jan 25, 2024
73257b0
sam_to_bam bug fixed
ChrisHIV Jan 26, 2024
e0b792e
Testing done - minor edits made to CheckConfig
susannah-b Jan 26, 2024
0750de3
More informative error messages for bad config file options for unpai…
ChrisHIV Jan 26, 2024
127ac47
Checks for samtools -f flag values incompatible with unpaired data
Jan 30, 2024
8309702
More informative error messages for samtoolsReadFlags
ChrisHIV Jan 30, 2024
8100722
Used CheckForPaired instead of Paired inside CheckConfig for consiste…
ChrisHIV Feb 2, 2024
b075eb0
Moved code that was present in both 'if paired' and 'if unpaired' out…
ChrisHIV Feb 2, 2024
6d24915
Trivial bug fix: don't create the MappedContaminantReads file if the …
ChrisHIV Feb 6, 2024
b22bc96
Updated manual re: paired/unpaired
ChrisHIV Feb 8, 2024
504a319
Update README.md
ChrisHIV Feb 6, 2024
beaad80
Update README.md
ChrisHIV Feb 12, 2024
50a40f6
Codon correction update for shiver, including standalone CodonCorrect…
Mar 7, 2024
2a48f3b
Improved handling of sequences with missing coverage
Mar 7, 2024
999f6fc
Improve handling of arguments and errors. Separated gene extraction p…
susannah-b Apr 26, 2024
1c3a889
Corrected indel detection, added a check for single sequences in the …
susannah-b Apr 26, 2024
87ac11b
Added call_virulign function to simplify code. Added a function to ap…
susannah-b Apr 26, 2024
92a9923
Added colours to more easily identify warnings/errors
susannah-b Apr 26, 2024
a9bc1f1
Removed debug failed directory as the alignments that failed did not …
susannah-b Apr 26, 2024
a950f02
Modified init script to instead create a BLASTn database per gene, an…
susannah-b Apr 26, 2024
d4e5524
Replaced the whole-genome approach with BLAST-ing to individual genes…
susannah-b Apr 26, 2024
98e4c62
Added frameshift positional information, improved python error handli…
susannah-b Apr 26, 2024
f7921e1
Updated for Python 3
susannah-b Apr 26, 2024
fc0fe35
Fixed grep issue when MissingCoverage.txt does not exist.
Apr 22, 2024
8adaa3c
Only Python2-style print
ChrisHIV Apr 1, 2024
e2fc88a
from __future__ needed for previous print change
ChrisHIV Apr 1, 2024
4c505b2
modernize itertools_six
ChrisHIV Apr 2, 2024
2937375
modernize map
ChrisHIV Apr 2, 2024
ce7ed49
modernize xrange_six
ChrisHIV Apr 2, 2024
be33c31
modernize dict_six
ChrisHIV Apr 2, 2024
7b1fed3
modernize classic_division
ChrisHIV Apr 2, 2024
e732830
2to3 manual correction of modernize classic_division
ChrisHIV Apr 2, 2024
7eecf58
modernize idioms
ChrisHIV Apr 2, 2024
453dd6f
Prepended '_' to all internal variable names for comprehensions to be…
ChrisHIV Apr 2, 2024
196a47b
Changed Biopython's deprecated ungap() for replace()
ChrisHIV Apr 2, 2024
66d2cee
Extra(previously missed) change of Biopython's deprecated ungap() for…
ChrisHIV Apr 2, 2024
6804faa
Try both .ungap() and .replace() on seq objects for flexibility with …
ChrisHIV Apr 3, 2024
c9e2830
Replaced fastaq conversion from fastq to fasta with own code
ChrisHIV Apr 8, 2024
f5fbc31
Changed reference construction to break ties between which contig to …
ChrisHIV Apr 9, 2024
582b224
Changed default python from 2 to 3. Renamed python2 variable to python.
ChrisHIV Apr 10, 2024
a39ce5e
Fixed bug affecting only temp file temp_GappyRefWithExtraSeq.fasta: t…
ChrisHIV Apr 10, 2024
a09aeda
renamed variable for clarity
ChrisHIV Apr 10, 2024
8f3e65a
Moved imports to the top. Enforced importation from a given module on…
ChrisHIV Apr 10, 2024
d4be22c
Update README.md
ChrisHIV Apr 10, 2024
6f3265b
Changed ContigMinBlastOverlapToMerge in config to switch on some merg…
ChrisHIV Apr 12, 2024
9df294f
Added --help and --test options to the main three bash scripts
ChrisHIV Apr 18, 2024
bfef458
Added to all executable Python scripts if __name__ == "__main__":
ChrisHIV Apr 18, 2024
9dff8b2
Initial commit of tools/TestPythonModulesInstalled.py
ChrisHIV Apr 18, 2024
2097560
Changed python2 shebangs to python
ChrisHIV Apr 18, 2024
aa9dd94
Removed unnecessary import
ChrisHIV Apr 18, 2024
a17b167
Removed tools/trimmomatic and instead added instructions how to make …
ChrisHIV Apr 19, 2024
e5e7204
Deleted example contigs for full auto mode
ChrisHIV Apr 22, 2024
6f1a57f
Updated manual
ChrisHIV Apr 22, 2024
9c06b2b
Updated manual
ChrisHIV Apr 22, 2024
abe2170
Updated manual
ChrisHIV Apr 22, 2024
b196794
Update README.md
ChrisHIV Apr 22, 2024
a224ff3
Update README.md
ChrisHIV Apr 22, 2024
acefe5b
Update README.md
ChrisHIV Apr 22, 2024
d6878e6
Adjusted file locations to align with new shiver structure.
susannah-b Apr 26, 2024
98f4119
Adjusted file structure for /shiver/bin
susannah-b Apr 26, 2024
4315997
Implemented the ungap auxiliary function for biopython version compat…
susannah-b Apr 26, 2024
3958d17
Basic shiver implementation WIP - currently returns an Init Directory…
susannah-b Apr 26, 2024
843b325
Restructured files for merging and removed CC_Extract_Genes which is …
susannah-b May 1, 2024
6736513
Latest merge and some additional comments
susannah-b May 1, 2024
d1f2d36
Bug fix in codon correction Python funcs: none of the caught errors w…
ChrisHIV May 3, 2024
3bc8cc4
In CodonCorrectionInit.sh: removed check for file extensions, used fu…
ChrisHIV May 3, 2024
ca2efa3
Bug fix in CodonCorrectionInit.sh: check for 1+ sequences in the refe…
ChrisHIV May 3, 2024
59996f4
For codon correction initalisation, do not allow references with sequ…
ChrisHIV May 3, 2024
318edb0
Relaxed previous commit from an error to a warning (ready for eventua…
ChrisHIV May 3, 2024
f280d3f
Bug fix for codon correction initalisation: we now forbid gene end po…
ChrisHIV May 3, 2024
2ddaa70
Made error message more consistent with others
ChrisHIV May 3, 2024
71c437d
For codon correction initialisation, require gene lengths (in nucleot…
ChrisHIV May 3, 2024
1d39b57
Added missing whitespace in message
ChrisHIV May 3, 2024
eddbafc
For codon correction initialisation, redirect blast database messages…
ChrisHIV May 3, 2024
e4e4d4b
Added speech marks where needed and removed where not needed; removed…
ChrisHIV May 3, 2024
e4c3930
Fixed bug I introduced with -> , by using whitespace
ChrisHIV May 3, 2024
44f23f8
In codon correction initialisation, check references are not duplicat…
ChrisHIV May 3, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1,791 changes: 1,791 additions & 0 deletions bin/CodonCorrectionReferences/AnnotatableGenePositions.txt

Large diffs are not rendered by default.

Large diffs are not rendered by default.

15 changes: 15 additions & 0 deletions bin/config.sh
Original file line number Diff line number Diff line change
Expand Up @@ -252,6 +252,21 @@ KeepPreMappingReads=false
MaxContigGappiness=0.05
MinContigHitFrac=0.9

# CodonCorrection.sh variables:
virulign=~/VIRULIGN/virulign # testing - set to a more 'default' variable before finalising CC.sh
# Shall we codon-correct the shiver output before remapping?
CodonCorrect=true
# The genes that can be analysed by VIRULIGN:
genes="GAG POL VIF VPR VPU ENV NEF"
# Which data should be output by VIRULIGN? Separated by ',' and enclosed by double quotes (see CodonCorrection.sh help for more details)
VirulignOptions="Nucleotides,Mutations,AminoAcids"
# Which genes should be corrected? Separated by ',' and enclosed by double quotes (see CodonCorrection.sh help for more details)
GenesToCorrect="all"
# Shall we delete temporary files within the CodonCorrection.sh output?
CC_DeleteTemp=false
# Should key messages in the CodonCorrection output be printed in colour?
CC_EnableColours=true

# Suffixes we'll append to the sample ID for output files.
# If you change the extension (whatever follows the dot) you might break
# something.
Expand Down
2 changes: 2 additions & 0 deletions bin/shiver_funcs.sh
Original file line number Diff line number Diff line change
Expand Up @@ -27,6 +27,8 @@ Code_PrintSeqLengths="$ToolsDir/PrintSeqLengths.py"
Code_AddSNPsToSeqs="$ToolsDir/AddAllPossibleSNPsToSeqs.py"
Code_KeepBestLinesInDataFile="$ToolsDir/KeepBestLinesInDataFile.py"
Code_ConvertFastqToFasta="$ToolsDir/ConvertFastqToFasta.py"
Code_CodonCorrectionInit="$ToolsDir/CodonCorrectionInit.sh"
Code_CodonCorrection="$ToolsDir/CodonCorrection.sh"

# Only needed if GiveHXB2coords is set to true in the config file
HXB2file=$(dirname "$ThisDir")'/data/external/B.FR.83.HXB2_LAI_IIIB_BRU.K03455.fasta'
Expand Down
15 changes: 15 additions & 0 deletions bin/shiver_init.sh
Original file line number Diff line number Diff line change
Expand Up @@ -147,3 +147,18 @@ mkdir -p "$IndividualRefDir" || \
exit 1; }
"$python" "$Code_SplitFasta" -G "$RefAlignment" "$IndividualRefDir" || { echo "Problem" \
"splitting $RefAlignment into one file per sequence. Quitting." >&2 ; exit 1; }

# Make the CodonCorrection init directory
# NB This assumes that the /CodonCorrectionReferences is part of shiver to avoid changing the init arguments for now.
# If this is not the case, then the reference file and gene coordinates will need to be supplied as an init argument.
if [[ "$CodonCorrect" == "true" ]]; then
CC_RefFile="$ThisDir"/'CodonCorrectionReferences/HIV1_ALL_2020_genome_DNA_no_ambiguity_only_annotated_aligned_no_gap_columns_aligned.fasta'
CC_GeneCoords="$ThisDir"/'CodonCorrectionReferences/AnnotatableGenePositions.txt'
CC_Init="$OutDir"/'CodonCorrectionInit'
"$Code_CodonCorrectionInit" "$CC_RefFile" "$CC_Init" "$CC_GeneCoords" || \
{ echo "Problem initialising CodonCorrection. Quitting." >&2 ; exit 1; }
fi




8 changes: 8 additions & 0 deletions bin/shiver_map_reads.sh
Original file line number Diff line number Diff line change
Expand Up @@ -745,4 +745,12 @@ if [[ "$remap" == "true" ]]; then
{ echo 'Problem remapping to the consensus from the first round of mapping.'\
'Quitting.' >&2 ; exit 1 ; }
fi

# Codon-correct the round 2 consensus
CC_Init="$InitDir"/'CodonCorrectionInit'
CC_Output=./'CodonCorrection'
CC_Consensus="$NewSID"'_consensus_MinCov_'"$MinCov1"'_'"$MinCov2.fasta"
"$Code_CodonCorrection" "$CC_Consensus" "$virulign" "$CC_Init" "$VirulignOptions" "$GenesToCorrect" "$CC_Output" || \
{ echo 'Problem codon-correcting the consensus before remapping. Quitting.' >&2 ; exit 1 ; }

fi
Loading