COLUMBIA UNIVERSITY COMS 6113

Information

Prereqs

Grading

(Tentative)

Overview

This course is intended as an advanced graduate-level course in database systems research.
The content will cover classic and modern database systems research. Topics will range from classic database system design, modern optimizations in single-node and multi-node settings, data cleaning and explanation, and data provenance.

The class places a heavy emphasis on paper reading and writing good paper reviews. The point is to practice reading papers critically, writing proper reviews, implementing ideas in research papers, and conducting research. As such, students will be expected to read papers in depth, complete assignments based ideas from the readings, and conduct a semester-long research project.

Students are expected to be comfortable with a range of programming languages, reading code, actively participate in discussions, and presenting.

Course capped at 25. If waitlist is huge, an assignment will be used to choose participants.

Recent Announcements

Schedule

Date

Topic

Notes

Readings

Assigned

Due

L1: 22-Jan Intro Notes Readings HW0 (weds)
L2: 24-Jan System R overview Notes Readings HW0 1/27 11:59PM
L3: 29-Jan Ingres/Postgres
Presenter: Wu
Notes Readings HW1 (weds)
L4: 31-Jan Column Stores
Presenter: Wu
Notes Readings
L5: 5-Feb OLTP engines
Presenter: Wu
Notes Readings
L6: 7-Feb Query Compilation
Presenter: Wu
Notes Readings HW2 HW1
L7: 12-Feb Multi-dim Indexes
Presenter: Wu
Notes Readings
L8: 14-Feb Single machine joins
Presenter: -
Readings Prospectus
L9: 19-Feb Distributed joins
Presenter: Xinyue Wang
Readings
L10: 21-Feb Volcano Exchange
Presenter: -
Readings
L11: 26-Feb Eddies
Presenter: Jennifer Bi
Readings HW3 HW2
L12: 28-Feb Hybrid Caching/UDFs
Presenter: -
Readings
L13: 5-Mar Volcano/Cascades Optimizer
Presenter: Haneen
Readings
L14: 7-Mar Large-scale Dataflow Basics
Presenter: Wu
Readings
L15: 12-Mar Large-scale Dataflow Basics2
Presenter: Wu
Readings
L16: 14-Mar Naiad
Presenter: Wu
Notes Readings HW3
L17: 22-Mar - Readings Paper Draft Due
L18: 26-Mar Datalog and Recursion
Presenter: Wu
Notes Readings
L19: 28-Mar Lineage
Presenter: Yiliang Shi
Readings
L20: 29-Mar - Readings PC Reviews
L21: 2-Apr Mock PC
Presenter: -
Readings Mock PC
L22: 4-Apr Mock PC
Presenter: -
Readings Mock PC
L23: 9-Apr Materialized Views
Presenter: Mari Husain
Readings
L24: 11-Apr Datacubes
Presenter: Yiru Chen
Readings
L25: 16-Apr Visualization/Exploration
Presenter: Ziao Wang
Readings
L26: 18-Apr Self-tuning DBs
Presenter: Dean Deng
Readings
L27: 23-Apr More Lineage
Presenter: Wu
Readings
L28: 25-Apr Fast Scans
Presenter: Alan Du
Readings
L29: 30-Apr Using Lineage/Last Lecture
Presenter: Amita Shukla
Readings
L30: 2-May Readings Project Report

Tentative list of papers for last lectures