Contact information

Instructor: Shishi Luo, shishi.luo@berkeley.edu

Connector assistant: Moulay Draidia, mzdraidia@berkeley.edu

Office Hours

Thu 10:00-11:00 AM, 418 Evans Hall

Course Description

All course materials can be found at the following website or in bCourses:

https://github.com/shishiluo/Genomics-DataScience

Genomics is triggering a revolution in medical discovery. Students will explore genomic data, including HIV genomics, personal genomics, and DNA forensics, as well as related legal and ethical issues. Biology background not required.

In this connector course, we will interact with a variety of genomic datasets, with specific emphasis on HIV genomics, personal genomics, and DNA forensics. We will learn to use tools, both from the foundations course and introduced in this course, to perform exploratory analyses of genomic metadata and DNA sequence data. Students will also explore the legal and ethical issues concerning the collection and use of genetic data through readings of news articles. By the end of this course, students will be able to perform a quantitative analysis of variation in a gene or genomic region and communicate the results of this analysis effectively to a non-expert audience.

This connector course will make use of the skills taught in the Foundations of Data Analysis (Data 8) as well as introduce material specific to the application of data science to genomic data. For example, students will apply their knowledge of tables and data visualization to explore quantitative characteristics of genomes, such as genome length, across different species. They will apply Bayes Rule to the problem of quantifying whether a defendant is guilty given DNA evidence against them. In the module on personal genomics, they will be introduced to genome-wide association studies, a high-dimensional version of multiple regression.