A weeklong look at cloud computing and GWAS analysis
Join us July 27–31 from 9 am to noon each day for this short intensive course, taught by Dr. Quanhu "Tiger" Sheng, associate professor of biostatistics, VANGARD deputy technical director, and associate director of advanced computing at CQS, with assistance from Dr. Hua-Chang Chen.
The course will be held in person at 2525 West End Avenue in Nashville, Tennessee. It is open only to Vanderbilt Health and Vanderbilt University employees and trainees, and space is limited to 20 participants. Registration will open on May 1.
Tuition for this course (in USD)
Vanderbilt University / Vanderbilt Health faculty and staff: $700
Vanderbilt University / Vanderbilt Health students, trainees, and postdocs: $450
A 20% early bird discount will be applied to employee registrations received between May 1 and May 31.
About this course
This course provides an in-depth exploration of Terra-based cloud computing with a focus on genome-wide association studies (GWAS) analysis using BioVU whole genome sequencing data. Students will use Visual Studio Code to navigate course materials and engage in hands-on exercises. The curriculum introduces key concepts and tools, including the Terra environment, Docker image creation, workflow description language (WDL), cohort building with the BioVU synthetic derivative BigQuery database, and GWAS analysis using Regenie4. Through practical activities, participants will develop skills in cloud-based GWAS analysis, covering environment setup, software packaging, cohort construction, and data processing.
Prerequisites
- Knowledge of genomics and GWAS fundamentals
- Familiarity with Python and Jupyter Notebook
- Basic proficiency in SQL
- Experience using Linux command-line interfaces
- Visual Studio Code installed on your computing device (see "Preparing for this course" below for more instructions)
- GitHub, Docker Hub, Google Cloud Platform, and Terra accounts set up by July 20 (one week before the start of the course). See below for more information.
Preparing for this course
This course uses Visual Studio Code (VS Code) for navigating course materials and conducting hands-on exercises.
- The community version of VS Code is free and can be downloaded at https://code.visualstudio.com/Download
- Required extensions: Remote-SSH, Python, Jupyter, R
Students must have the following accounts set up by the start of this course:
- GitHub – for code version control
- Personal GitHub accounts created with non-VU/non-VUMC emails are acceptable.
- You must provide your GitHub account username to Dr. Sheng at least one week prior to the start of the course.
- Docker Hub – for publishing Docker images
- Personal Docker accounts created with non-VU or non-VUMC emails are acceptable.
- Google Cloud Platform (GCP)
- Your GCP account must be registered to your VU (vanderbilt.edu) or VUMC (vumc.org) email address.
- You must provide your Google Cloud Platform (GCP) account email to Dr. Sheng at least one week prior to the start of the course.
- Terra
- Your Terra account must be linked to your GCP account (with your VU or VUMC email address).
- When you register for Terra, select “Sign in with Google” (not “Sign in with Microsoft”) to ensure proper access to AGD (Alliance for Genomic Discovery) genomics data, the Synthetic Derivative (SD) BigQuery database, and the introductory data portal.
For students with limited or no experience in Python or SQL, Dr. Sheng strongly recommends reviewing the following resources prior to the start of the course:
- Wes McKinney’s Python for Data Analysis, 3rd Edition. Python is utilized in the course for data manipulation and querying the SD BigQuery database.
- SQL. You will use SQL during the course to extract and analyze phenotype data from the SD BigQuery database.
Need to brush up on the prerequisites? Dr. Sheng recommends the following:
- Knowledge of genomics and GWAS fundamentals
- Familiarity with Python and Jupyter Notebook
- Basic proficiency in SQL
- Experience using Linux command-line interfaces
Details subject to change without notice