Current Software Projects
Bud is Bloom Under Development: a Ruby DSL implementation of the Bloom programming language for distributed systems. Friends-and-family alpha to be released Spring, 2011.
MADLib is an open-source library for scalable in-database analytics. Code available at http://madlib.net.
DataWrangler is a tool for interactive data transformation. Code and online service available here.
CommentSpace is a service for collaborative visual data analysis. Online service at http://commentspace.net.
Listed roughly in reverse chronological order. These projects are no longer actively supported.
HOP: Hadoop Online Prototype
A variant of the Hadoop MapReduce system that provides non-blocking pipelined flows, with support for continuous queries and online aggregation. Code available here.
BOOM Analytics is a compact Overlog-based declarative implementation of the Hadoop FileSystem API, with a version of Hadoop that contains a compact Overlog scheduler. Embeds the JOL Java-based Overlog Runtime system. Code available here.
DSN: Declarative Sensor Networks
DSN is a full-service declarative networking and application stack for the Berkeley mote sensor platform, supporting the SNLog language. It can be downloaded here.
P2 is a declarative networking engine supporting the Overlog language. It can be downloaded here.
TelegraphCQ is an adaptive data stream processor. The TelegraphCQ 0.2 release can be downloaded here.
PIER is an internet-scale query processor. Source code for PIER can be downloaded here.
The YFilter filtering engine allows users or applications to submit multiple queries that are to be continuously executed against streaming XML messages. The YFilter 1.0 code can be downloaded here.
TinyDB is a query processing system for extracting information from a network of TinyOS sensors. The latest version of TinyDB can be downloaded here.
BigSur is a scientific database workflow system. Subprojects include:
- BigSur: the NASA Earth Observing System Data and Information System (EOSDIS) Alternative Architecture prototype.
- ESMDIS: the Earth System Model Data Information System prototype.
- Ocean ESIP: part of a NASA experiment to create a federation of data producers.
Tioga Datasplash project
A database visualization system, the Tioga project designed and prototyped a database browser and visual programming language (using a “boxes and arrows” paradigm) for scientific applications. The ideas behind DataSplash were commercialized in the Visionary product of Informix Software, Inc.
An agoric distributed database system, Mariposa designed and prototyped a scalable (10000+ site) wide-area distributed data manager. Mariposa was commercialized by Cohera Corp. (now a part of Oracle).
A group of volunteers significantly enhanced University POSTGRES 4.2 (SQL support, working aggregates, etc.), resulting in PostgreSQL (at one time known as POSTGRES95).
The University POSTGRES project, which demonstrated the practicality of object-relational technology, was discontinued in 1994. POSTGRES was commercialized by Illustra Information Technologies, which was purchased by Informix Corporation, which in turn was purchased by IBM.
The University INGRES project, one of the pioneering relational database management systems, was discontinued in 1985. INGRES was commercialized by Relational Technology, Inc. (a.k.a. Ingres Corp., a division of The ASK Group; now part of Computer Associates, Inc.).
- DB seminar 12/02/11, 12:30 – Michael Carey: Towards a Scalable, Semistructured Data Platform for Evolving World Models
- DB seminar 11/18/11, 12:30 – Jignesh Patel: Implications of Evolving Hardware Trends: Towards Energy-Conscious Main-Memory Data Processing
- 11/04/11, 12:30 – Carlo Zaniolo: Query Languages & Systems for Continuous Analytics and Data Mining: the Stream Mill Experience
- DB seminar 10/28/11, 12:30 – Anastasia Ailamaki: Scaling up database transactions
- DB seminar 10/21/11, 12:30 – John Wu: Parallel Index and Query for Large Scale Data Analysis