Genomic Sequence Data Analysis  
Click here to register for this workshop

This is a virtual workshop, you can participate by reading workshop materials, watching video presentations and carrying out exercises on an assigned BioHPC Cloud server (server assignments are listed here). If you need help, please sign up for our Office Hours. The server assigned to you for hands-on exercises will be available for a limited time, see server assignment page or bottom of this page for details. Please note, the server is assigned to you for hands-on exercises, NOT heavy computing, using the server for workshop unrelated computing is not allowed and will result in assignment termination.

This workshop covers three commonly used genomic sequence data analysis methods: (1) de novo whole genome assembly with long or short reads, (2) model-based gene prediction, and (3) reference genome guided variant calling. Each workshop participant will be assigned to a Linux server, and will have three weeks to finish several data analysis projects.   (Qi Sun, Robert Bukowski)

Workshop videos:

Session 1
Session 2

Session 3
Session 5
Session 6

Workshop server assignment


Server assignment length 10 days  
Session 1 Oct 12 2020 3:30PM - 5:00PM Zoom
De novo genome assembly with long or short reads;  Assembly quality assessment. 
Session 2 Oct 14 2020 3:30PM - 5:00PM Zoom
Hands-on session
Session 3 Oct 19 2020 3:30PM - 5:00PM Zoom
Gene prediction using Braker.
Session 4 Oct 21 2020 3:30PM - 5:00PM Zoom
Hands-on session
Session 5 Oct 26 2020 3:30PM - 5:00PM Zoom
Variant calling with Illumina whole genome shotgun sequence data.  
Session 6 Oct 28 2020 3:30PM - 5:00PM Zoom
Hands-on session
Website credentials: login  Web Accessibility Help