UCSF Department of Epidemiology & Biostatistics UCSF School of Medicine UCSF Search UCSF
 BIOSTAT 202
 Schedule
 Syllabus
 Roster
 Objectives
 Prerequisites
 Faculty
 Format
 Materials
 Enrollment
 

Opportunities and Challenges of
Complex Biomedical Data:
Introduction to the Science of "Big Data"
BIOSTAT 202 Summer 2017 (3 unit)



Application Deadline: July 17, 2017

OBJECTIVES

The growing availability of large amounts of data -- obtained either through research or electronic capture of everyday activity -- has been termed "big data". This course introduces the opportunities and challenges of using biological and health-related "big data" to perform biomedical research. We will distinguish big data from non-big data and explore the phases of data science: obtaining data, cleaning data, visualizing data, analyzing data, and drawing conclusions.

At the conclusion of this course, students will be able to:

  • Access public use (and non-public) sources of data such as NHANES, and social media data;
  • Use software to manipulate and clean “big data”;
  • Generate effective graphical displays of data;
  • Describe the advantages and disadvantages of different approaches to both supervised (classification and regression) and unsupervised predictive modeling (clustering and data reduction);
  • Describe the issues that arise when trying to use "big data" observational studies to derive causal conclusions; and
  • Describe the features of pragmatic clinical trials and how they are different from more usual clinical trials.
PREREQUISITES

None.

FACULTY

Course Director:

John Kornak, PhD
Phone: 415-514-8028
email: john.kornak@ucsf.edu

Lecturer:

Elaine Allen, PhD
email: isabel.allen@ucsf.edu

Charles McCulloch, PhD
email: charles.mcculloch@ucsf.edu

Mark Pletcher, MD, MPH
email: mpletcher@epi.ucsf.edu

Teaching Assistants: Zara Izadi, MPharm
email: zara.izadi@ucsf.edu

John Sy, MD
email: john.sy@ucsf.edu

 

FORMAT

Twice weekly lectures introduce the substantive materials for each module, which is subsequently reinforced in weekly applied homework problem sets. Weekly computer lab sessions give students guided problems to work through and the opportunity to learn to use the software, ask questions, and have more interaction with faculty.

  1. Lectures: Mondays and Thursdays: 1:00 to 2:30 PM.
    Lecture recordings will be available online later in the day. To determine if you have sufficient bandwidth to view online lectures, please visit our demonstration site.


  2. Computer Labs: In-person labs: Thursdays: 2:45 to 4:15 PM.
    You will need to bring a laptop to all lab sessions.

In addition, all students will be required to submit a final project in which they manipulate, clean, and analyze data emanating from a large data source. Students will be given a choice of datasets and guidelines for performing the project.

All students are expected to attend or watch all the lectures. Computer labs are an integral part of the course. All students are expected to turn in their weekly homework problem sets through the assigned online portal.

All course materials will be posted on the course's online syllabus.

MATERIALS

The software package IBM SPSS Modeler is used in this course. This software is available free for students on the course. Instructions on how to obtain the free license for this software and install it will be provided by the Course Director prior to the course start date.

GRADING

Grades will be based on the Computer Lab assignments and the Final Project. Lab assignments will be due by the start of lecture the following week. Homework problem sets will account for 70% of the points for the course. The final project, based on course supplied data sets, will account for 30% of the points possible for the course.

Students must hand in all homework problem sets (even if late), complete a satisfactory Final Project, and receive at least 80% of the total number of points assigned during the quarter to receive a Satisfactory (if taking Satisfactory/Unsatisfactory) or B (if taking for a letter grade) in the course.

Students not in full-year TICR Programs who satisfactorily pass all course requirements will, upon request, receive a Certificate of Course Completion.

UCSF Graduate Division Policy on Disabilities

ENROLLMENT

To apply for this course, please fill out and submit the application below. Please see our fee page for cost information. The deadline for application is July 17, 2017. Only one application needs to be completed for all courses desired during the quarter.

The application is best completed using the latest version of Firefox, Chrome or Safari.

APPLICATION

Information for how to pay;
please read before applying