Here you will find the materials for the Physalia course Reproducibility in Bioinformatics.
The course will be held online on April 15-17, 2024.
In this course we want to raise awareness of reproducibility and introduce strategies for improving the reproducibility of bioinformatic analyses. Through a mixture of theoretical blocks and hands-on exercises, the instructors will guide participants in developing the skills needed to make bioinformatic analyses and workflows more reproducible using containers, versioning and virtual environments.
The target audience for this course is graduate students and researchers who work with large datasets. Basic working knowledge of the Linux command line (e.g. navigating the file system, creating files and folders, executing commands) is required, and experience with working on remote systems (via ssh) is an advantage. Basic knowledge of a scripting language (e.g. Python or Perl) is also beneficial.
- Basic concepts and techniques for modern reproducible bioinformatics data analyses
- Data organization, documentation and software versioning
- Setting up and working in virtual software environments
- Software containerization strategies and caveats - how to use and build containers
- Knowledge of how to use common workflow management systems

.. toctree::
   :caption: Course overview
   :name: overview
   :maxdepth: 1
   :hidden:

   introduction/about.rst
   introduction/setup.rst

.. toctree::
   :caption: Day 1
   :name: day1
   :maxdepth: 1
   :hidden:

   day-1/exercise-1-shell-intro.rst
   day-1/exercise-2-reproducible-software-installation.rst
   day-1/exercise-3-data-organization.rst
   day-1/exercise-4-git.rst

.. toctree::
   :caption: Day 2
   :name: day2
   :maxdepth: 1
   :hidden:

   day-2/exercise-1-conda.rst
   day-2/exercise-2-docker-intro.rst
   day-2/exercise-3-advanced-docker.rst
   day-2/exercise-4-docker-pitfalls.rst

.. toctree::
   :caption: Day 3
   :name: day3
   :maxdepth: 1
   :hidden:

   day-3/exercise-1-workflows.rst
   day-3/exercise-2-phylogenomics.rst