Workshop · 2 days · 4 hours each2026

Applied Bioinformatics

A two-day, hands-on workshop that takes you from raw FASTQ files to a finished variant and RNA-seq analysis. Eight modules, nine Jupyter notebooks, runnable shell pipelines, and a synthetic chr22 capstone dataset.

8

Modules

9

Notebooks

5

Pipeline scripts

1

Capstone dataset

What you'll leave with

  • Run a complete short-read pipeline: QC → trim → align → call → annotate → interpret.
  • Read and manipulate FASTA, FASTQ, SAM/BAM, VCF, BED, and GFF3 files with confidence.
  • Drive command-line tools (BWA, SAMtools, GATK, Trimmomatic, fastp) from reproducible shell and Python scripts.
  • Run a differential expression analysis end-to-end with DESeq2.
  • Diagnose pipeline failures from FastQC and MultiQC reports.

Who it's for

  • ·PhD students and postdocs entering bioinformatics from a wet-lab or stats background.
  • ·Data scientists rotating into a genomics team.
  • ·Research software engineers who need to read and own existing pipelines.

Prerequisites

  • ·Comfortable opening a terminal and running basic commands.
  • ·Reading-level Python (loops, dicts, simple functions).
  • ·No prior bioinformatics experience required.
  • ·Laptop with conda or Docker — see conda_env.yml.

Day 1 · Raw data to aligned reads

Day 1 slides ↗

Day 2 · From aligned reads to biology

Day 2 slides ↗

Downloads

Book this workshop for your lab or team

Public cohorts, private corporate training, and self-paced licenses available. Custom modules on single-cell, ATAC-seq, or long-read sequencing on request.