2026 CQS Summer Institute

Terra-Based Cloud Computing

A weeklong look at cloud computing and GWAS analysis

Join us July 27–31 from 9 am to noon each day for this short intensive course, taught by Dr. Quanhu "Tiger" Sheng, associate professor of biostatistics, VANGARD deputy technical director, and associate director of advanced computing at CQS, with assistance from Dr. Hua-Chang Chen.

The course will be held in person at 2525 West End Avenue in Nashville, Tennessee. It is open only to Vanderbilt Health and Vanderbilt University employees and trainees, and space is limited to 20 participants. Registration will open on May 1.

Tuition for this course (in USD)

Vanderbilt University / Vanderbilt Health faculty and staff: $700

Vanderbilt University / Vanderbilt Health students, trainees, and postdocs: $450

A 20% early bird discount will be applied to employee registrations received between May 1 and May 31.

About this course

This course provides an in-depth exploration of Terra-based cloud computing with a focus on genome-wide association study (GWAS) analysis using BioVU whole genome sequencing data. Students will use Visual Studio Code to navigate course materials and engage in hands-on exercises. The curriculum introduces key concepts and tools, including the Terra environment, Docker image creation, Workflow Description Language (WDL), cohort building with the BioVU Synthetic Derivative BigQuery database, and GWAS analysis using Regenie4. Through practical activities, participants will develop skills in cloud-based GWAS analysis, covering environment setup, software packaging, cohort construction, and data processing.

Prerequisites

  • Knowledge of genomics and GWAS fundamentals
  • Familiarity with Python and Jupyter Notebook
  • Basic proficiency in SQL
  • Experience using Linux command-line interfaces
  • Visual Studio Code installed on your computing device (see "Preparing for this course" below for more instructions)
  • GitHub, Docker Hub, Google Cloud Platform, and Terra accounts set up by July 20 (one week before the start of the course). See below for more information.

Preparing for this course

This course uses Visual Studio Code (VS Code) for navigating course materials and conducting hands-on exercises.

Students must have the following accounts set up by July 20, one week before the start of the course:

  • GitHub – for code version control
    • Personal GitHub accounts created with non-VU or non-VUMC emails are acceptable.
    • You must provide your GitHub account username to Dr. Sheng at least one week prior to the start of the course.
  • Docker Hub – for publishing Docker images
    • Personal Docker accounts created with non-VU or non-VUMC emails are acceptable.
  • Google Cloud Platform (GCP)
    • Your GCP account must be registered to your VU (vanderbilt.edu) or VUMC (vumc.org) email address.
    • You must provide your GCP account email to Dr. Sheng at least one week prior to the start of the course.
  • Terra
    • Your Terra account must be linked to your GCP account (with your VU or VUMC email address).
    • When you register for Terra, select “Sign in with Google” (not “Sign in with Microsoft”) to ensure proper access to AGD (Alliance for Genomic Discovery) genomics data, the Synthetic Derivative (SD) BigQuery database, and the introductory data portal.

For students with limited or no experience in Python or SQL, Dr. Sheng strongly recommends reviewing the following resources prior to the start of the course:

Details subject to change without notice