Day 2 - Introduction to GitHub, Correlation, and Basic Clustering

Managing source code and first steps towards data analysis.

Topics

Introduction to GitHub

  • Rationale for version control and collaboration
  • Source code version control
  • Collaboration using GitHub

Correlation

  • Measures of correlation
  • Causality vs correlation

Introduction to Clustering

  • Basic clustering workflow
  • R data structures for hierarchical clustering

Materials

Notes Exercises Solutions

You need to be logged into your GitHub account to be able to access materials hosted in the private class repository.

Logistics

Date 16 August 2016

Time 10 am to 12 pm (Instruction) / 1 pm to 2 pm (Intro to Exercises and Q&A) / 4 pm to 5 pm (Q&A)

Room Countway 403 (morning) and Countway 424 (afternoon)

Teaching Staff

Nils Gehlenborg (Course Director)

Chamith Fonseka (Teaching Assistant)