Courses (2011-2012)
Research
My research focuses broadly on data-oriented systems and the way they drive computing. Current projects include:
- BOOM and bloom: Orders Of Magnitude simpler code for the Cloud.
- d^p ("deep"): Data to the People
More information on current and past research here.
Selected Talks
- Consistency Analysis in Bloom: A CALM and Collected Approach, CIDR
2011. [.pptx], [.pdf]
- The Declarative Imperative: Experiences and Conjectures in Distributed Logic. Keynote, ACM PODS, 2010. [.key.zip], [pdf], [video]
- MAD Skills: New Practices for Big Data. VLDB, 2009. [pptx], [pdf]
- Quantitative Data Cleaning for Large Databases. Keynote,
QDB, 2009. [.key.zip], [pdf]
- Bricolage: Data at Play. Keynote, ICDM 2007. [.key.zip] [.mov] [pdf]
-
The Marvelous Structure of Reality. Keynote, WebDB 2003 [PDF], [.mov]
[.key.sit]
Recent Papers
- Distributed GraphLab: A Framework for Machine Learning in the Cloud (with Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, and C. Guestrin). VL
DB 2012. [pdf]
- Probabilistically Bounded Staleness for Practical Partial Quorums (with P. Bailis, S. Venkataraman, M. J. Franklin and I. Stoica). VLDB 2012. [pdf]
- Integrated Statistical Analysis and Visualization for Data Profiling (with S. Kandel, R. Parikh, A. Paepcke and J. Heer). AVI 2012.
- Shreddr: Pipelined Paper Digitization for Low-Resource Organizations (with Kuang Chen, A. Kannan, Y. Yano and T. S. Parikh). ACM DEV 2012. [pdf]
- Dedalus: Datalog in Time and Space (with P. Alvaro, W. R. Marczak, N. Conway, D. Maier and R. Sears). Datalog 2.0 2011. [pdf]
- Proactive Wrangling: Mixed-Initiative End-User Programming of Data Transformation Scripts (with P. J. Guo, S. Kandel and J. Heer). UIST 2011. [pdf]
- Hybrid In-Database Inference for Declarative Information Extraction (with D. Z. Wang, M. J. Franklin, M. Garofalakis and M. L. Wick). SIGMOD 2011. [pdf]
- Wrangler: Interactive Visual Specification of Data Transformation Scripts (with S. Kandel, A. Paepcke, and J. Heer). CHI 2011. [PDF]
- CommentSpace: Structured Support for Collaborative Visual Analysis (with W. Willett, J. Heer, and M. Agrawala). CHI 2011.
[PDF]
- FATE and DESTINI: A Framework for Cloud Recovery Testing (with H.S. Gunawi, et al.). NSDI 2011.
- Consistency Analysis in Bloom: a CALM and Collected Approach (with P. Alvaro, N. Conway, and W.R. Marczak). CIDR 2011. [PDF]
- Data in the First Mile (with K. Chen and T. Parikh). CIDR 2011 [PDF].
 |
Selected Publications
- The Declarative Imperative: Experiences and Conjectures in Distributed Logic. SIGMOD Record 39:1, Sep. 2010. [pdf]
- Consistency Analysis in Bloom: a CALM and Collected Approach (with P. Alvaro, N. Conway, and W.R. Marczak). CIDR 2011. [PDF]
- Data in the First Mile (with K. Chen and T. Parikh). CIDR 2011 [PDF].
- Wrangler: Interactive Visual Specification of Data Transformation Scripts (with S. Kandel, A. Paepcke, and J. Heer). CHI 2011. [PDF]
- Declarative Networking (with B. T. Loo, T. Condie, M. Garofalakis, D. E. Gay, P. Maniatis, R. Ramakrishnan, T. Roscoe and I. Stoica). Research Highlights, CACM 52(11), 2009. [Intro by Peter Druschel] [pdf].
- Quantitative Data Cleaning for Large Databases. White paper, United Nations Economic Commission for Europe, February, 2008. [PDF]
- Architecture of a Database System. (with M. Stonebraker and J. Hamilton). Foundations and Trends in Databases 1(2). [PDF]
- Implementing Declarative Overlays. (with B. T. Loo,
T. Condie, P. Maniatis, T. Roscoe, and I. Stoica). In 20th SOSP, 2005. [PDF]
- TinyDB: An Acqusitional Query Processing System for Sensor Networks. (with S. Madden, M. Franklin, and Wei Hong). ACM TODS. [PDF]
- Model-Driven Data Acquisition in Sensor Networks
(with
A. Deshpande, C. Guestrin, S. Madden and W. Hong.) VLDB 2004
[PDF]
- TelegraphCQ: Continuous Dataflow Processing for an
Uncertain World (with the Telegraph team). CIDR 03 [pdf]
- Commencement Address. Computer Science, College
of Letters and Science, UC Berkeley, May 26, 2002. [pdf]
- On a Model of Indexability and its Bounds for Range
Queries (with
E. Koutsoupias, D. Miranker, C. Papadimitriou, and V. Samoladas). JACM
49(1) (2002). [pdf]
- Potter's Wheel: An Interactive Data Cleaning System
(with V.
Raman). VLDB 2001. [PDF]
-
Eddies: Continuously Adaptive Query Processing (with
R. Avnur).
SIGMOD 2000. [PDF]
[PS].
-
Interactive Data Analysis with CONTROL (with many
others). IEEE
Computer, August 1999. [PDF]
-
Generalized Search Trees for Database Systems (with J.
F. Naughton
and A. Pfeffer.) VLDB 1995. [PS]
-
Readings
in Database Systems, Fourth Edition.
J. M. Hellerstein and M. Stonebraker, eds.
MIT Press, 2005.
[Supplemental material]
 |
|