COLUMBIA UNIVERSITY COMS 6113

Reading Tips

Ask the following questions while reading

Papers on how to read papers

Some papers on reviewing papers

The Papers

Review

Required

Further Reading

Indexes

Required

Further reading

Joins

Required

Further reading

Query Optimization

Readings

Further reading

Cost Estimation

Required

Further Reading

Main-memory Databases (Vectorization)

Required (Edited 2/16: swapped the background and main topic papers)

Further Reading

Some things to think about when reading

Main-memory Databases (Compilation)

Required

Further reading

Data Flow

Required

Further Reading

Some notes to guide your reading and thinking.

Incrementally Maintaining Materialized Views

Required

Further Reading

In-DBMS ML

Required

Further Reading

In-DBMS ML over Joins

Required

Further Reading

Hybrid Caching/UDFs

Required

For this week, you do NOT need to create slides for your team roles. Instead, your team will play around with Convex, relate it with concepts from the readings, and prepare questions and observations to share with Sujay before his presentation. Specifically

Further Reading

Data Quality

Required

Further Reading

Data Markets

Required

Further Reading

Unscheduled Topics

System R Overview

Readings

Questions to consider

INGRES/POSTGRES

Readings

Questions to consider

Concurrency Control

Readings

OLTP Stores

Column Stores

Readings

Cloud-scale Analytics

DB and Query Representations

Distributed Consistency under Replication

Materialized Views

Further Reading

Streaming

Datalog and Recursion

Readings

Questions

Lineage

Question to comment on:

Lineage Systems

Read one of the two required papers:

Serverless Querying

Self-tuning DBs

Approximate Query Processing

Windows and Streaming

Fast Scans

Data Cubes

Oblivious Databases

Adaptive Query Processing

Readings

Further Reading

Explanation

Readings

Further Reading