Work

System R

project · 1974

Computing Databases

System R was IBM’s research project to build the first complete implementation of Edgar Codd’s relational database model. Developed at IBM San Jose Research Laboratory from 1974 to 1979, it proved that relational databases could be practical and efficient, and it created SQL—the query language that became an industry standard.

Origins

After Edgar Codd published his relational model paper in 1970, IBM management was skeptical. Their existing IMS database was profitable; why invest in an unproven approach? A research project was approved to test whether relational databases could achieve acceptable performance[1].

The System R team, led by Donald Chamberlin and Raymond Boyce, set out to build a fully functional relational database system from scratch.

Key Innovations

SQL (Originally SEQUEL): Chamberlin and Boyce designed Structured English Query Language, later shortened to SQL due to trademark issues. SQL allowed users to describe what data they wanted without specifying how to retrieve it—a revolutionary abstraction[2].

Query Optimization: System R pioneered automatic query optimization. The system analyzed SQL statements and determined the most efficient execution strategy, choosing optimal join orders and access paths.

Transaction Management: The project implemented ACID transactions (Atomicity, Consistency, Isolation, Durability), ensuring database reliability even during system failures.

Indexing and Access Methods: B-tree indexes and sophisticated access path selection made relational queries competitive with navigational databases.

Two Phases

System R developed in two phases:

Commercial Impact

Though IBM chose to develop SQL/DS and DB2 as commercial products rather than productizing System R directly, the research’s influence was immediate. Larry Ellison’s Relational Software (later Oracle) implemented SQL based on published System R papers, beating IBM to market. Ingres at Berkeley, though using a different query language initially, also built on relational concepts[1].

Recognition

System R demonstrated that relational databases were not just theoretically elegant but practically viable. The project’s leaders received the ACM Software System Award in 1988. SQL became an ANSI/ISO standard and remains the dominant database query language.


Sources

  1. Wikipedia. “IBM System R.” History and development.
  2. Chamberlin, Donald. “Early History of SQL.” IEEE Annals of the History of Computing, 2012.