UC Berkeley
 
Home
Biography
Publications
Talks
Research
Students
Courses
Quotes and Links
Campus Directions
Campus Map
Directions to Soda Hall
Blog

Joseph M. Hellerstein

Chancellor's Professor
EECS Computer Science Division
UC Berkeley

<lastname>@cs.berkeley.edu
http://www.linkedin.com/in/joehellerstein

Joseph M. Hellerstein
MADlib Magnetic Agile Deep

Courses (2011-2012)

Research

My research focuses broadly on data-oriented systems and the way they drive computing. Current projects include:

  • BOOM and bloom: Orders Of Magnitude simpler code for the Cloud.
  • d^p ("deep"): Data to the People

More information on current and past research here.

Selected Talks

  • Consistency Analysis in Bloom: A CALM and Collected Approach, CIDR 2011. [.pptx], [.pdf]
  • The Declarative Imperative: Experiences and Conjectures in Distributed Logic. Keynote, ACM PODS, 2010. [.key.zip], [pdf], [video]
  • MAD Skills: New Practices for Big Data. VLDB, 2009. [pptx], [pdf]
  • Quantitative Data Cleaning for Large Databases. Keynote, QDB, 2009. [.key.zip], [pdf]
  • Bricolage: Data at Play. Keynote, ICDM 2007. [.key.zip] [.mov] [pdf]
  • The Marvelous Structure of Reality. Keynote, WebDB 2003 [PDF], [.mov] [.key.sit]

Recent Papers

  • Distributed GraphLab: A Framework for Machine Learning in the Cloud (with Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, and C. Guestrin). VL DB 2012. [pdf]
  • Probabilistically Bounded Staleness for Practical Partial Quorums (with P. Bailis, S. Venkataraman, M. J. Franklin and I. Stoica). VLDB 2012. [pdf]
  • Integrated Statistical Analysis and Visualization for Data Profiling (with S. Kandel, R. Parikh, A. Paepcke and J. Heer). AVI 2012.
  • Shreddr: Pipelined Paper Digitization for Low-Resource Organizations (with Kuang Chen, A. Kannan, Y. Yano and T. S. Parikh). ACM DEV 2012. [pdf]
  • Dedalus: Datalog in Time and Space (with P. Alvaro, W. R. Marczak, N. Conway, D. Maier and R. Sears). Datalog 2.0 2011. [pdf]
  • Proactive Wrangling: Mixed-Initiative End-User Programming of Data Transformation Scripts (with P. J. Guo, S. Kandel and J. Heer). UIST 2011. [pdf]
  • Hybrid In-Database Inference for Declarative Information Extraction (with D. Z. Wang, M. J. Franklin, M. Garofalakis and M. L. Wick). SIGMOD 2011. [pdf]
  • Wrangler: Interactive Visual Specification of Data Transformation Scripts (with S. Kandel, A. Paepcke, and J. Heer). CHI 2011. [PDF]
  • CommentSpace: Structured Support for Collaborative Visual Analysis (with W. Willett, J. Heer, and M. Agrawala). CHI 2011.
  • [PDF]
  • FATE and DESTINI: A Framework for Cloud Recovery Testing (with H.S. Gunawi, et al.). NSDI 2011.
  • Consistency Analysis in Bloom: a CALM and Collected Approach (with P. Alvaro, N. Conway, and W.R. Marczak). CIDR 2011. [PDF]
  • Data in the First Mile (with K. Chen and T. Parikh). CIDR 2011 [PDF].

Selected Publications

  • The Declarative Imperative: Experiences and Conjectures in Distributed Logic. SIGMOD Record 39:1, Sep. 2010. [pdf]
  • Consistency Analysis in Bloom: a CALM and Collected Approach (with P. Alvaro, N. Conway, and W.R. Marczak). CIDR 2011. [PDF]
  • Data in the First Mile (with K. Chen and T. Parikh). CIDR 2011 [PDF].
  • Wrangler: Interactive Visual Specification of Data Transformation Scripts (with S. Kandel, A. Paepcke, and J. Heer). CHI 2011. [PDF]
  • Declarative Networking (with B. T. Loo, T. Condie, M. Garofalakis, D. E. Gay, P. Maniatis, R. Ramakrishnan, T. Roscoe and I. Stoica). Research Highlights, CACM 52(11), 2009. [Intro by Peter Druschel] [pdf].
  • Quantitative Data Cleaning for Large Databases. White paper, United Nations Economic Commission for Europe, February, 2008. [PDF]
  • Architecture of a Database System. (with M. Stonebraker and J. Hamilton). Foundations and Trends in Databases 1(2). [PDF]
  • Implementing Declarative Overlays. (with B. T. Loo, T. Condie, P. Maniatis, T. Roscoe, and I. Stoica). In 20th SOSP, 2005. [PDF]
  • TinyDB: An Acqusitional Query Processing System for Sensor Networks. (with S. Madden, M. Franklin, and Wei Hong). ACM TODS. [PDF]
  • Model-Driven Data Acquisition in Sensor Networks (with A. Deshpande, C. Guestrin, S. Madden and W. Hong.) VLDB 2004 [PDF]
  • TelegraphCQ: Continuous Dataflow Processing for an Uncertain World (with the Telegraph team). CIDR 03 [pdf]
  • Commencement Address. Computer Science, College of Letters and Science, UC Berkeley, May 26, 2002. [pdf]
  • On a Model of Indexability and its Bounds for Range Queries (with E. Koutsoupias, D. Miranker, C. Papadimitriou, and V. Samoladas). JACM 49(1) (2002). [pdf]
  • Potter's Wheel: An Interactive Data Cleaning System (with V. Raman). VLDB 2001. [PDF]
  • Eddies: Continuously Adaptive Query Processing (with R. Avnur). SIGMOD 2000. [PDF] [PS].
  • Interactive Data Analysis with CONTROL (with many others). IEEE Computer, August 1999. [PDF]
  • Generalized Search Trees for Database Systems (with J. F. Naughton and A. Pfeffer.) VLDB 1995. [PS]
  • Readings in Database Systems, Fourth Edition. J. M. Hellerstein and M. Stonebraker, eds. MIT Press, 2005. [Supplemental material]
The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Other restrictions to copying individual documents may apply.
Last modified: $Date: 2012/04/27 00:17:52 $ by Joe Hellerstein