| Supplementary Readings |
|
This page contains supplemental readings for the course.
|
| General |
|
The Data Warehousing Information
Center
Data Warehousing and OLAP:
A Research-Oriented Bibliography
Data
Warehousing Resource Site
The OLAP Report
|
| Week 1 - Data Warehousing Basics |
|
The Case For Data
Warehousing
The PANDA Project Data cube: a relational aggregation operator generalizing group-by, cross-tabs and subtotals, by J. Gray, A. Bosworth, A. Layman, and H. Pirahesh, Data Mining and Knowledge Discovery 1:1, 1997. (A PDF version.) An overview of data warehousing and OLAP technology, by Surajit Chaudhuri and Umesh Dayal, ACM SIGMOD Record 26:1, 1997.
Research
problems in data warehousing, by Jennifer Widom, Int'l Conference
on Information and Knowledge Management (CIKM), 1995.
|
| Weeks 2-3 - Dimensional Modeling |
|
Data Warehousing Articles
from Ralph Kimball
Design Tips
from Ralph Kimball
|
| Data Cleaning |
|
Data
Integration Course Web Page
Record Linkage: Current Practice and Future Directions
by L. Gu, R. Baxter, D. Vickers, and C. Rainsford
Data
Cleaning: Problems and Current Approaches
by Erhard Rahm and Hong Hai Do (2000)
Robust
and Efficient Fuzzy Match for Online Data Cleaning by S. Chaudhuri, K.
Ganjam, V. Ganti, and R. Motwani (2003)
A Comparison of String-Distance Metrics for Name-Matching Tasks
by W. Cohen, P. Ravikumar, and S. Fienberg (2003)
Eliminating
Fuzzy Duplicates in Data Warehouses
by R. Ananthakrishna, S. Chaudhuri, and V. Ganti (2002)
An Efficient
Domain-Independent Algorithm for Detecting
Approximately Duplicate Database Records by A. Monge and C. Elkan
(1996)
The
Merge/Purge Problem for Large Databases
by M. Hernandez and S. Stolfo (1995)
|
| Weeks 4-5 - Query Processing |
|
SQL
Server Query Processor Overview
Star
Queries in Oracle8
The Value of Merge Join
and Hash Join in SQL Server by Graefe (1999)
Hash Joins and
Hash Teams in Microsoft SQL Server by Graefe, Bunker,
and Cooper (1998)
|
| Bitmap Indexes and Compression |
|
Improved Query
Performance with Variant Indexes by O'Neil and Quass (1997)
Performance
Measurements of Compressed Bitmap Indices by
Johnson (1999)
Compressing Bitmap
Indexes for Faster Search Performance by Wu, Otoo, and Shoshani (2002)
|