SlideShare a Scribd company logo
1 of 53
Download to read offline
Elephant Roads


               17 Years of Postgres Forks and Variants




Josh Berkus
October 2009
JDCon West
NULL = ?
E.F. Codd
Relational Theory




 The First Day
    (1970)
Michael Stonebraker




INteractive Graphics REtrieval
            System
Michael Stonebraker




     INGRES
  The Second Day
      (1974)
Michael Stonebraker




    POSTGRES
   The Third Day
      (1986)
Stonebraker Marches On ...
             ●   Fourth Day: Cohera
             ●   Fifth Day: Streambase
             ●   Sixth Day: Vertica
             ●   Seventh Day: SciDB
             ●   Eighth Day: VoltDB
             ●   …?
Illustra: The First Fork
●   Stonebraker and a team from UC Berkeley
    forked POSTGRES in 1992
    ●   added SQL support
    ●   added new object-relational features
●   Started a new company called Miró
    ●   Miró became Montage
    ●   Montage became Illustra
Informix Swallows Illustra
●   1997: Informix buys Illustra
●   Illustra features added to Informix OnLine
    ●   and DataBlades ran Illustra
●   2000: Informix Universal Server 9

●   2001: IBM Eats Informix
PostgreSQL: The Second Fork
●   1995: a rag-tag band of POSTGRES users and
    students decided to save the DBMS by taking it
    off-campus.
●   It became: Postgres95
●   1996: they put it on a public CVS server
    ●   ported it to SQL
●   It became: PostgreSQL
    ●   you know the rest from here ...
Postgres Begat Many More Forks
●   Because of the      ●   Because our code is
    license                 clean and easy to
●   Because there's a       modify
    history of forks    ●   Because Postgres is
                            modular and easy to
                            break up or add to
                        ●   Because the
                            community is OK with
                            forks and variants
Why Change PostgreSQL?
●   To experiment with new DB technology
●   To commercialize it
●   To bundle it with useful tools
●   To specialize it for specific tasks
●   To add features the community doesn't want
    ●   or that aren't ready for OSS yet
●   Because you can!
But Aren't Forks Bad?



       No.
Forks Are Good
●   Open source means freedom to fork
    ●   If nobody forks a project, then it's not
        widely used or actively developed.
●   Most forks and their owners
    contribute to core PostgreSQL
    ●   money, code, ideas
●   Some forks develop code to be
    integrated into main Postgres
    ●   best way to try out really challenging
        ideas
4 Types of Postgres Variants
●   Forks: incompatible or proprietary major
    changes to the core code.
●   Patches: compatible, open source major
    changes to the core code.
●   Add-Ons: major middleware or plugins which
    greatly enhance or change Postgres's
    functionality
●   Redistributions: repackaging Postgres under a
    different name and/or license
Expired Forks
Great Bridge PostgreSQL
●   What: PostgreSQL plus some advance patches
    and support.
●   Type: Redistribution
●   Licensing: BSD
●   Versions: Forked from 7.0, merged to 7.1
●   R.I.P.: Great Bridge LLC died in 2001
●   Contributed: lots of code in 1999-2001
    ●   most of Core worked for Great Bridge
Red Hat Database
●   What: PostgreSQL redistribution with
    management tools by Red Hat Inc.
●   Type: Re-distribution
●   Licensing: BSD & GPL
●   Versions: 7.2 to 7.3
●   R.I.P.: Abandoned by Red Hat in 2003
●   Contributed: Tom Lane's salary
    ●   plus visual query analyzer to pgAdmin
NuSphere UltraSQL
●   What: Native Windows Port of PostgreSQL
●   Type: Fork
●   Licensing: Proprietary
●   Versions: 7.2 and 7.3
●   R.I.P.: Lost relevance with release of
    PostgreSQL 8.0
    ●   still technically available
●   Contributed: code to PostgreSQL 8.0
PowerGres & PowerGres Plus
●   What: Native Windows Port of PostgreSQL by
    SRA Inc., with optional custom storage
●   Type: Fork
●   Licensing: Proprietary
●   Versions: 7.3, maybe 7.4
●   R.I.P.: Lost relevance with release of
    PostgreSQL 8.0, HA version still around
●   Contributed: code to PostgreSQL 8.0
Pervasive Postgres
●   What: Regular PostgreSQL with support.
●   Type: Re-distribution
●   Licensing: BSD
●   Versions: 8.0
●   R.I.P.: Pervasive dropped Postgres in 2006
●   Contributed: money to the community, mainly
    for events
Bizgres
●   What: PostgreSQL plus BI and large database
    features.
●   Type: Fork
●   Licensing: BSD
●   Versions: 8.1, 8.2
●   R.I.P.: Project died out in 2008
    ●   Greenplum stopped merging code
●   Contributed: code, money (Greenplum's)
Core Forks & Redistributions
Mammoth PostgreSQL
●   What: PostgreSQL with Mammoth Replicator
●   Owner: CommandPrompt Inc.
●   Type: Fork
●   Licensing: open source
●   Compatibility: High, merges with main code
●   Status: current production & development
    ●   Contributes: money & code
    ●   Used to be a proprietary fork
Postgres Plus
●   What: PostgreSQL with open source add-ons
●   Owner: EnterpriseDB
●   Type: Redistribution
●   Licensing: open source (varies)
●   Compatibility: High, merges with main code
●   Status: current production & development
    ●   Contributes: money & code
Postgres Plus Advanced Server
●   What: PostgreSQL with Oracle Compatibility
    and proprietary tools
●   Owner: EnterpriseDB
●   Type: Fork
●   Licensing: Proprietary
●   Compatibility: High, merges with main code
●   Status: current production & development
    ●   Contributes: money & code
    ●   Formerly EnterpriseDB Advanced Server
Fujitsu Supported PostgreSQL
●   What: PostgreSQL with custom storage engine
●   Owner: Fujitsu Australia
●   Type: Fork
●   Licensing: Proprietary
●   Compatibility: High with 8.2
●   Status: current production
    ●   Contributes: money, occasionally
PostgreSQL for Solaris
●   What: PostgreSQL packaged for Solaris with
    tools and support
●   Owner: Sun Microsystems
●   Type: Redistribution
●   Licensing: open source (varies)
●   Compatibility: High, 8.3
●   Status: current production
    ●   Contributes: code
    ●   Likely to be terminated when Oracle deal completes
Exotic Features
PostGIS
●   What: PostgreSQL plus Geographic support
    ●   #1 open source geo database
●   Type: Add-On
●   Licensing: BSD & GPL
●   Compatibility: High, 8.4
●   Status: current production & development
    ●   Contributes: code, users
    ●   Used to be a package under GPL, now an add-on
Fork: Truviso
●   2000: TelegraphCQ Project started at UC
    Berkeley
    ●   “Continuous Query” processing for PostgreSQL
●   2006: UCB Profs. launch startup Amalgamated
    Insight
    ●   commercialize TelegraphCQ
    ●   re-merged with current PostgreSQL code
●   2008: Amalgamated Insight renamed Truviso
    ●   first commercial releases
Truviso
●   What: PostgreSQL plus Streaming Database
    ●   good for high-volume data streams where you don't
        care about old data
●   Type: Fork
●   Licensing: Proprietary
●   Compatibility: High
    ●   Version: Forked from 7.1, merged to 8.4
●   Status: current production & development
    ●   Contributes: money & code
SEPostgres
●   What: PostgreSQL integrated with SELinux
    label-based access control
●   Type: Patch
●   Owner: SEPostgres project / NEC
●   Licensing: BSD
●   Compatibility: High, 8.4
●   Status: production, working on merge with core
●   Contributes: code
Clustering Forks & Add-Ons
Postgres-R
●   What: PostgreSQL with Group Communication
    support for clustering.
●   Owner: Postgres-R project
●   Type: Patch
●   Licensing: open source (BSD)
●   Compatibility: High, merges
●   Status: alpha, current development
    ●   Contributes: code
pgCluster (and CyberCluster)
●   What: PostgreSQL with Statement Replication
    and controller support for clustering.
●   Owner: pgCluster project, Cybertech.AT
●   Type: Fork
●   Licensing: open source (BSD)
●   Compatibility: High, 8.0
●   Status: beta, development halted
    ●   Contributes: not anymore
pgCluster-II
●   What: PostgreSQL with shared memory &
    storage support for clustering ala RAC
●   Owner: pgCluster project, NTT
●   Type: Fork
●   Licensing: open source (BSD)
●   Compatibility: High, 8.3
●   Status: alpha
    ●   Contributes: code
PostgresForest
●   What: PostgreSQL with middleware-based
    statement replication clustering
●   Owner: NTT (I think)
●   Type: Add-On
●   Licensing: open source
●   Compatibility: Not sure
●   Status: beta, development halted
    ●   Contributes: code
pgPool-II
●   What: statement-replication and partitioning
    middleware for PostgreSQL
●   Owner: pgPool project, SRA
●   Type: Add-On
●   Licensing: open source
●   Compatibility: High, 8.4
●   Status: beta, active development
    ●   Contributes: code, money (SRA)
Continuent
●   What: statement-replication, mangagement and
    partitioning middleware for PostgreSQL
●   Owner: Continuent corporation
●   Type: Add-On
●   Licensing: part open source, part proprietary
●   Compatibility: High, 8.4
●   Status: old version production, Tungsten RSN
    ●   Contributes: code, money
GridSQL
●   What: statement-replication, mangagement and
    partitioning middleware for PostgreSQL
●   Owner: EnterpriseDB
●   Type: Add-On
●   Licensing: GPL
●   Compatibility: High, 8.4
●   Status: production, active development
    ●   Contributes: code, money
    ●   Used to be “ExtenDB”, integrated into Postgres Plus
Data Warehousing / BI
Netezza
●   What: Some PostgreSQL code (30-50%) plus
    proprietary code and hardware package
●   Type: Fork
●   Owner: Netezza Co.
●   Licensing: Proprietary
●   Compatibility: Low
    ●   Forked from 7.2, no longer compatible
●   Status: current production
●   Contributes: nothing
Greenplum
●   What: Mostly PostgreSQL with proprietary MPP
    plus BI and VLDB tools
    ●   great for huge data mining
●   Owner: Greenplum
●   Type: Fork
●   Licensing: Proprietary
●   Compatibility: Medium, 8.2
●   Status: production, active development
    ●   Contributes: code, money
    ●   used to be “Bizgres MPP”
Vertica
●   What: Proprietary code with a few PostgreSQL
    libraries for multi-TB column store database
    ●   great for OLAP
●   Owner: Vertica Co.
●   Type: Fork (sort of)
●   Licensing: Proprietary
●   Compatibility: Low
●   Status: production, active development
    ●   Stonebraker project (no contributions)
Paraccel
●   What: Part-Postgres, part-proprietary in-
    memory clustered pseudo-column store.
    ●   made for “online data analysis”
●   Owner: Paraccel Co.
●   Type: Fork
●   Licensing: Proprietary
●   Compatibility: Medium
●   Status: production, active development
    ●   Contributions: none to date
AsterDB
●   What: Patches and proprietary middleware for
    map/reduce queries on many Postgres servers.
●   Owner: Aster Data Co.
    ●   including former Continuent developer
●   Type: Patch, Add-On
●   Licensing: Proprietary
●   Compatibility: High, 8.4
●   Status: production, active development
    ●   Contributions: code, money
Everest
●   What: PostgreSQL with proprietary storage,
    MPP and column store
    ●   scales to 10's of petabytes
●   Owner: Yahoo!
●   Type: Fork
●   Licensing: Not available to public
●   Compatibility: N/A
●   Status: production
    ●   Contributions: threw a nice party for us
HadoopDB
●   What: Middleware for running map/reduce
    queries over many Postgres instances.
    ●   brand-new
●   Owner: Yale University
●   Type: Add-On
●   Licensing: unclear
●   Compatibility: High, 8.4
●   Status: alpha, active development
    ●   Contributions: none yet
List of Variants
●   Exotic                       ●   Clustering           ●   BI/DW
                                     ●   Postgres-R
    ●   Truviso                                               ●   Netezza
                                     ●   pgCluster
    ●   PostGIS                      ●   pgCluster-II         ●   Greenplum
                                     ●   PostgresForest
    ●   SEPostgres                   ●   pgPool-II
                                                              ●   Vertica
                                         Continuent
    Core Variants                                                 Paraccel
                                     ●
                                                              ●
●
                                     ●   GridSQL
                                                              ●   AsterDB
    ●   Mammoth
                                                              ●   HadoopDB
    ●   Postgres Plus
                                                              ●   Everest
    ●   P.P. Advanced Server
    ●   Fujistu Supported PG
    ●   PostgreSQL for Solaris
So, what are you waiting for?




  Go make your own fork!
Contact
●   Josh Berkus: josh@pgexperts.com
    ●   blog: it.toolbox.com/blogs/database-soup
    ●   www.pgexperts.com/presentations.html
●   PostgreSQL: www.postgresql.org

●   PostgreSQL Experts Inc.:
    www.pgexperts.com


              This talk is copyright 2009 Josh Berkus, and is licensed under the Creative Commons Attribution License.
              Most images are the property of their respective copyright holders. All images are used here for
              purposes of satire or reference to trademarked products.

More Related Content

What's hot

PostgreSQL Replication in 10 Minutes - SCALE
PostgreSQL Replication in 10  Minutes - SCALEPostgreSQL Replication in 10  Minutes - SCALE
PostgreSQL Replication in 10 Minutes - SCALEPostgreSQL Experts, Inc.
 
High Availability PostgreSQL with Zalando Patroni
High Availability PostgreSQL with Zalando PatroniHigh Availability PostgreSQL with Zalando Patroni
High Availability PostgreSQL with Zalando PatroniZalando Technology
 
On The Building Of A PostgreSQL Cluster
On The Building Of A PostgreSQL ClusterOn The Building Of A PostgreSQL Cluster
On The Building Of A PostgreSQL ClusterSrihari Sriraman
 
PostgreSQL HA
PostgreSQL   HAPostgreSQL   HA
PostgreSQL HAharoonm
 
Logical replication with pglogical
Logical replication with pglogicalLogical replication with pglogical
Logical replication with pglogicalUmair Shahid
 
Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...
Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...
Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...Command Prompt., Inc
 
Replication Solutions for PostgreSQL
Replication Solutions for PostgreSQLReplication Solutions for PostgreSQL
Replication Solutions for PostgreSQLPeter Eisentraut
 
Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)
Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)
Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)Masao Fujii
 
Problems with PostgreSQL on Multi-core Systems with MultiTerabyte Data
Problems with PostgreSQL on Multi-core Systems with MultiTerabyte DataProblems with PostgreSQL on Multi-core Systems with MultiTerabyte Data
Problems with PostgreSQL on Multi-core Systems with MultiTerabyte DataJignesh Shah
 
Streaming huge databases using logical decoding
Streaming huge databases using logical decodingStreaming huge databases using logical decoding
Streaming huge databases using logical decodingAlexander Shulgin
 
pgpool-II demonstration
pgpool-II demonstrationpgpool-II demonstration
pgpool-II demonstrationelliando dias
 
Massively Scaled High Performance Web Services with PHP
Massively Scaled High Performance Web Services with PHPMassively Scaled High Performance Web Services with PHP
Massively Scaled High Performance Web Services with PHPDemin Yin
 
Autovacuum, explained for engineers, new improved version PGConf.eu 2015 Vienna
Autovacuum, explained for engineers, new improved version PGConf.eu 2015 ViennaAutovacuum, explained for engineers, new improved version PGConf.eu 2015 Vienna
Autovacuum, explained for engineers, new improved version PGConf.eu 2015 ViennaPostgreSQL-Consulting
 
MySQL HA with PaceMaker
MySQL HA with  PaceMakerMySQL HA with  PaceMaker
MySQL HA with PaceMakerKris Buytaert
 
MySQL HA with Pacemaker
MySQL HA with  PacemakerMySQL HA with  Pacemaker
MySQL HA with PacemakerKris Buytaert
 

What's hot (20)

The Accidental DBA
The Accidental DBAThe Accidental DBA
The Accidental DBA
 
PostgreSQL Replication in 10 Minutes - SCALE
PostgreSQL Replication in 10  Minutes - SCALEPostgreSQL Replication in 10  Minutes - SCALE
PostgreSQL Replication in 10 Minutes - SCALE
 
High Availability PostgreSQL with Zalando Patroni
High Availability PostgreSQL with Zalando PatroniHigh Availability PostgreSQL with Zalando Patroni
High Availability PostgreSQL with Zalando Patroni
 
Fail over fail_back
Fail over fail_backFail over fail_back
Fail over fail_back
 
On The Building Of A PostgreSQL Cluster
On The Building Of A PostgreSQL ClusterOn The Building Of A PostgreSQL Cluster
On The Building Of A PostgreSQL Cluster
 
PostgreSQL HA
PostgreSQL   HAPostgreSQL   HA
PostgreSQL HA
 
Logical replication with pglogical
Logical replication with pglogicalLogical replication with pglogical
Logical replication with pglogical
 
Tuning Linux for MongoDB
Tuning Linux for MongoDBTuning Linux for MongoDB
Tuning Linux for MongoDB
 
Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...
Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...
Building tungsten-clusters-with-postgre sql-hot-standby-and-streaming-replica...
 
Replication Solutions for PostgreSQL
Replication Solutions for PostgreSQLReplication Solutions for PostgreSQL
Replication Solutions for PostgreSQL
 
JahiaOne - Performance Tuning
JahiaOne - Performance TuningJahiaOne - Performance Tuning
JahiaOne - Performance Tuning
 
Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)
Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)
Streaming Replication (Keynote @ PostgreSQL Conference 2009 Japan)
 
Problems with PostgreSQL on Multi-core Systems with MultiTerabyte Data
Problems with PostgreSQL on Multi-core Systems with MultiTerabyte DataProblems with PostgreSQL on Multi-core Systems with MultiTerabyte Data
Problems with PostgreSQL on Multi-core Systems with MultiTerabyte Data
 
Streaming huge databases using logical decoding
Streaming huge databases using logical decodingStreaming huge databases using logical decoding
Streaming huge databases using logical decoding
 
pgpool-II demonstration
pgpool-II demonstrationpgpool-II demonstration
pgpool-II demonstration
 
Massively Scaled High Performance Web Services with PHP
Massively Scaled High Performance Web Services with PHPMassively Scaled High Performance Web Services with PHP
Massively Scaled High Performance Web Services with PHP
 
Autovacuum, explained for engineers, new improved version PGConf.eu 2015 Vienna
Autovacuum, explained for engineers, new improved version PGConf.eu 2015 ViennaAutovacuum, explained for engineers, new improved version PGConf.eu 2015 Vienna
Autovacuum, explained for engineers, new improved version PGConf.eu 2015 Vienna
 
MySQL HA with PaceMaker
MySQL HA with  PaceMakerMySQL HA with  PaceMaker
MySQL HA with PaceMaker
 
How to Monitor MySQL
How to Monitor MySQLHow to Monitor MySQL
How to Monitor MySQL
 
MySQL HA with Pacemaker
MySQL HA with  PacemakerMySQL HA with  Pacemaker
MySQL HA with Pacemaker
 

Similar to Elephant Roads: PostgreSQL Patches and Variants

Delivering a bleeding edge community-led openstack distribution: RDO
Delivering a bleeding edge community-led openstack distribution: RDO Delivering a bleeding edge community-led openstack distribution: RDO
Delivering a bleeding edge community-led openstack distribution: RDO Chandan Kumar
 
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff Davis
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff DavisDeep Postgres Extensions in Rust | PGCon 2019 | Jeff Davis
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff DavisCitus Data
 
Beyond Postgres: Interesting Projects, Tools and forks
Beyond Postgres: Interesting Projects, Tools and forksBeyond Postgres: Interesting Projects, Tools and forks
Beyond Postgres: Interesting Projects, Tools and forksSameer Kumar
 
An evening with Postgresql
An evening with PostgresqlAn evening with Postgresql
An evening with PostgresqlJoshua Drake
 
GitBucket: Open source self-hosting Git server built by Scala
GitBucket: Open source self-hosting Git server built by ScalaGitBucket: Open source self-hosting Git server built by Scala
GitBucket: Open source self-hosting Git server built by Scalatakezoe
 
High performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodbHigh performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodbWei Shan Ang
 
Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...
Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...
Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...Igalia
 
Upcoming features in Airflow 2
Upcoming features in Airflow 2Upcoming features in Airflow 2
Upcoming features in Airflow 2Kaxil Naik
 
When it all GOes right
When it all GOes rightWhen it all GOes right
When it all GOes rightPavlo Golub
 
Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?DoKC
 
Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?DoKC
 
CON6423: Scalable JavaScript applications with Project Nashorn
CON6423: Scalable JavaScript applications with Project NashornCON6423: Scalable JavaScript applications with Project Nashorn
CON6423: Scalable JavaScript applications with Project NashornMichel Graciano
 
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...DoKC
 
Scalable Clusters On Demand
Scalable Clusters On DemandScalable Clusters On Demand
Scalable Clusters On DemandBogdan Kyryliuk
 
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)Gabriele Bartolini
 
Benchmarking for postgresql workloads in kubernetes
Benchmarking for postgresql workloads in kubernetesBenchmarking for postgresql workloads in kubernetes
Benchmarking for postgresql workloads in kubernetesDoKC
 

Similar to Elephant Roads: PostgreSQL Patches and Variants (20)

Delivering a bleeding edge community-led openstack distribution: RDO
Delivering a bleeding edge community-led openstack distribution: RDO Delivering a bleeding edge community-led openstack distribution: RDO
Delivering a bleeding edge community-led openstack distribution: RDO
 
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff Davis
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff DavisDeep Postgres Extensions in Rust | PGCon 2019 | Jeff Davis
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff Davis
 
Go at Skroutz
Go at SkroutzGo at Skroutz
Go at Skroutz
 
Beyond Postgres: Interesting Projects, Tools and forks
Beyond Postgres: Interesting Projects, Tools and forksBeyond Postgres: Interesting Projects, Tools and forks
Beyond Postgres: Interesting Projects, Tools and forks
 
Go at uber
Go at uberGo at uber
Go at uber
 
An evening with Postgresql
An evening with PostgresqlAn evening with Postgresql
An evening with Postgresql
 
GitBucket: Open source self-hosting Git server built by Scala
GitBucket: Open source self-hosting Git server built by ScalaGitBucket: Open source self-hosting Git server built by Scala
GitBucket: Open source self-hosting Git server built by Scala
 
High performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodbHigh performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodb
 
Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...
Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...
Multimedia support in WebKitGTK and WPE, current status and plans (GStreamer ...
 
Upcoming features in Airflow 2
Upcoming features in Airflow 2Upcoming features in Airflow 2
Upcoming features in Airflow 2
 
When it all GOes right
When it all GOes rightWhen it all GOes right
When it all GOes right
 
Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?
 
Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?Why run Postgres in Kubernetes?
Why run Postgres in Kubernetes?
 
PostgreSQL Development Today: 9.0
PostgreSQL Development Today: 9.0PostgreSQL Development Today: 9.0
PostgreSQL Development Today: 9.0
 
CON6423: Scalable JavaScript applications with Project Nashorn
CON6423: Scalable JavaScript applications with Project NashornCON6423: Scalable JavaScript applications with Project Nashorn
CON6423: Scalable JavaScript applications with Project Nashorn
 
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
 
Grafana 7.0
Grafana 7.0Grafana 7.0
Grafana 7.0
 
Scalable Clusters On Demand
Scalable Clusters On DemandScalable Clusters On Demand
Scalable Clusters On Demand
 
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
 
Benchmarking for postgresql workloads in kubernetes
Benchmarking for postgresql workloads in kubernetesBenchmarking for postgresql workloads in kubernetes
Benchmarking for postgresql workloads in kubernetes
 

More from PostgreSQL Experts, Inc.

More from PostgreSQL Experts, Inc. (20)

HowTo DR
HowTo DRHowTo DR
HowTo DR
 
Give A Great Tech Talk 2013
Give A Great Tech Talk 2013Give A Great Tech Talk 2013
Give A Great Tech Talk 2013
 
Pg py-and-squid-pypgday
Pg py-and-squid-pypgdayPg py-and-squid-pypgday
Pg py-and-squid-pypgday
 
Five steps perform_2013
Five steps perform_2013Five steps perform_2013
Five steps perform_2013
 
PWNage: Producing a newsletter with Perl
PWNage: Producing a newsletter with PerlPWNage: Producing a newsletter with Perl
PWNage: Producing a newsletter with Perl
 
10 Ways to Destroy Your Community
10 Ways to Destroy Your Community10 Ways to Destroy Your Community
10 Ways to Destroy Your Community
 
Open Source Press Relations
Open Source Press RelationsOpen Source Press Relations
Open Source Press Relations
 
5 (more) Ways To Destroy Your Community
5 (more) Ways To Destroy Your Community5 (more) Ways To Destroy Your Community
5 (more) Ways To Destroy Your Community
 
Preventing Community (from Linux Collab)
Preventing Community (from Linux Collab)Preventing Community (from Linux Collab)
Preventing Community (from Linux Collab)
 
Development of 8.3 In India
Development of 8.3 In IndiaDevelopment of 8.3 In India
Development of 8.3 In India
 
PostgreSQL and MySQL
PostgreSQL and MySQLPostgreSQL and MySQL
PostgreSQL and MySQL
 
50 Ways To Love Your Project
50 Ways To Love Your Project50 Ways To Love Your Project
50 Ways To Love Your Project
 
8.4 Upcoming Features
8.4 Upcoming Features 8.4 Upcoming Features
8.4 Upcoming Features
 
Writeable CTEs: The Next Big Thing
Writeable CTEs: The Next Big ThingWriteable CTEs: The Next Big Thing
Writeable CTEs: The Next Big Thing
 
9.1 Mystery Tour
9.1 Mystery Tour9.1 Mystery Tour
9.1 Mystery Tour
 
Postgres Open Keynote: The Next 25 Years
Postgres Open Keynote: The Next 25 YearsPostgres Open Keynote: The Next 25 Years
Postgres Open Keynote: The Next 25 Years
 
9.1 Grand Tour
9.1 Grand Tour9.1 Grand Tour
9.1 Grand Tour
 
Keyvil Lightning Talk
Keyvil Lightning TalkKeyvil Lightning Talk
Keyvil Lightning Talk
 
Safe Data is Happy Data
Safe Data is Happy DataSafe Data is Happy Data
Safe Data is Happy Data
 
Unit Test Your Database! (PgCon 2009)
Unit Test Your Database! (PgCon 2009)Unit Test Your Database! (PgCon 2009)
Unit Test Your Database! (PgCon 2009)
 

Elephant Roads: PostgreSQL Patches and Variants

  • 1. Elephant Roads 17 Years of Postgres Forks and Variants Josh Berkus October 2009 JDCon West
  • 3.
  • 4. E.F. Codd Relational Theory The First Day (1970)
  • 6. Michael Stonebraker INGRES The Second Day (1974)
  • 7. Michael Stonebraker POSTGRES The Third Day (1986)
  • 8. Stonebraker Marches On ... ● Fourth Day: Cohera ● Fifth Day: Streambase ● Sixth Day: Vertica ● Seventh Day: SciDB ● Eighth Day: VoltDB ● …?
  • 9. Illustra: The First Fork ● Stonebraker and a team from UC Berkeley forked POSTGRES in 1992 ● added SQL support ● added new object-relational features ● Started a new company called Miró ● Miró became Montage ● Montage became Illustra
  • 10. Informix Swallows Illustra ● 1997: Informix buys Illustra ● Illustra features added to Informix OnLine ● and DataBlades ran Illustra ● 2000: Informix Universal Server 9 ● 2001: IBM Eats Informix
  • 11. PostgreSQL: The Second Fork ● 1995: a rag-tag band of POSTGRES users and students decided to save the DBMS by taking it off-campus. ● It became: Postgres95 ● 1996: they put it on a public CVS server ● ported it to SQL ● It became: PostgreSQL ● you know the rest from here ...
  • 12. Postgres Begat Many More Forks ● Because of the ● Because our code is license clean and easy to ● Because there's a modify history of forks ● Because Postgres is modular and easy to break up or add to ● Because the community is OK with forks and variants
  • 13. Why Change PostgreSQL? ● To experiment with new DB technology ● To commercialize it ● To bundle it with useful tools ● To specialize it for specific tasks ● To add features the community doesn't want ● or that aren't ready for OSS yet ● Because you can!
  • 14. But Aren't Forks Bad? No.
  • 15. Forks Are Good ● Open source means freedom to fork ● If nobody forks a project, then it's not widely used or actively developed. ● Most forks and their owners contribute to core PostgreSQL ● money, code, ideas ● Some forks develop code to be integrated into main Postgres ● best way to try out really challenging ideas
  • 16. 4 Types of Postgres Variants ● Forks: incompatible or proprietary major changes to the core code. ● Patches: compatible, open source major changes to the core code. ● Add-Ons: major middleware or plugins which greatly enhance or change Postgres's functionality ● Redistributions: repackaging Postgres under a different name and/or license
  • 18. Great Bridge PostgreSQL ● What: PostgreSQL plus some advance patches and support. ● Type: Redistribution ● Licensing: BSD ● Versions: Forked from 7.0, merged to 7.1 ● R.I.P.: Great Bridge LLC died in 2001 ● Contributed: lots of code in 1999-2001 ● most of Core worked for Great Bridge
  • 19. Red Hat Database ● What: PostgreSQL redistribution with management tools by Red Hat Inc. ● Type: Re-distribution ● Licensing: BSD & GPL ● Versions: 7.2 to 7.3 ● R.I.P.: Abandoned by Red Hat in 2003 ● Contributed: Tom Lane's salary ● plus visual query analyzer to pgAdmin
  • 20. NuSphere UltraSQL ● What: Native Windows Port of PostgreSQL ● Type: Fork ● Licensing: Proprietary ● Versions: 7.2 and 7.3 ● R.I.P.: Lost relevance with release of PostgreSQL 8.0 ● still technically available ● Contributed: code to PostgreSQL 8.0
  • 21. PowerGres & PowerGres Plus ● What: Native Windows Port of PostgreSQL by SRA Inc., with optional custom storage ● Type: Fork ● Licensing: Proprietary ● Versions: 7.3, maybe 7.4 ● R.I.P.: Lost relevance with release of PostgreSQL 8.0, HA version still around ● Contributed: code to PostgreSQL 8.0
  • 22. Pervasive Postgres ● What: Regular PostgreSQL with support. ● Type: Re-distribution ● Licensing: BSD ● Versions: 8.0 ● R.I.P.: Pervasive dropped Postgres in 2006 ● Contributed: money to the community, mainly for events
  • 23. Bizgres ● What: PostgreSQL plus BI and large database features. ● Type: Fork ● Licensing: BSD ● Versions: 8.1, 8.2 ● R.I.P.: Project died out in 2008 ● Greenplum stopped merging code ● Contributed: code, money (Greenplum's)
  • 24. Core Forks & Redistributions
  • 25. Mammoth PostgreSQL ● What: PostgreSQL with Mammoth Replicator ● Owner: CommandPrompt Inc. ● Type: Fork ● Licensing: open source ● Compatibility: High, merges with main code ● Status: current production & development ● Contributes: money & code ● Used to be a proprietary fork
  • 26. Postgres Plus ● What: PostgreSQL with open source add-ons ● Owner: EnterpriseDB ● Type: Redistribution ● Licensing: open source (varies) ● Compatibility: High, merges with main code ● Status: current production & development ● Contributes: money & code
  • 27. Postgres Plus Advanced Server ● What: PostgreSQL with Oracle Compatibility and proprietary tools ● Owner: EnterpriseDB ● Type: Fork ● Licensing: Proprietary ● Compatibility: High, merges with main code ● Status: current production & development ● Contributes: money & code ● Formerly EnterpriseDB Advanced Server
  • 28. Fujitsu Supported PostgreSQL ● What: PostgreSQL with custom storage engine ● Owner: Fujitsu Australia ● Type: Fork ● Licensing: Proprietary ● Compatibility: High with 8.2 ● Status: current production ● Contributes: money, occasionally
  • 29. PostgreSQL for Solaris ● What: PostgreSQL packaged for Solaris with tools and support ● Owner: Sun Microsystems ● Type: Redistribution ● Licensing: open source (varies) ● Compatibility: High, 8.3 ● Status: current production ● Contributes: code ● Likely to be terminated when Oracle deal completes
  • 31. PostGIS ● What: PostgreSQL plus Geographic support ● #1 open source geo database ● Type: Add-On ● Licensing: BSD & GPL ● Compatibility: High, 8.4 ● Status: current production & development ● Contributes: code, users ● Used to be a package under GPL, now an add-on
  • 32. Fork: Truviso ● 2000: TelegraphCQ Project started at UC Berkeley ● “Continuous Query” processing for PostgreSQL ● 2006: UCB Profs. launch startup Amalgamated Insight ● commercialize TelegraphCQ ● re-merged with current PostgreSQL code ● 2008: Amalgamated Insight renamed Truviso ● first commercial releases
  • 33. Truviso ● What: PostgreSQL plus Streaming Database ● good for high-volume data streams where you don't care about old data ● Type: Fork ● Licensing: Proprietary ● Compatibility: High ● Version: Forked from 7.1, merged to 8.4 ● Status: current production & development ● Contributes: money & code
  • 34. SEPostgres ● What: PostgreSQL integrated with SELinux label-based access control ● Type: Patch ● Owner: SEPostgres project / NEC ● Licensing: BSD ● Compatibility: High, 8.4 ● Status: production, working on merge with core ● Contributes: code
  • 36. Postgres-R ● What: PostgreSQL with Group Communication support for clustering. ● Owner: Postgres-R project ● Type: Patch ● Licensing: open source (BSD) ● Compatibility: High, merges ● Status: alpha, current development ● Contributes: code
  • 37. pgCluster (and CyberCluster) ● What: PostgreSQL with Statement Replication and controller support for clustering. ● Owner: pgCluster project, Cybertech.AT ● Type: Fork ● Licensing: open source (BSD) ● Compatibility: High, 8.0 ● Status: beta, development halted ● Contributes: not anymore
  • 38. pgCluster-II ● What: PostgreSQL with shared memory & storage support for clustering ala RAC ● Owner: pgCluster project, NTT ● Type: Fork ● Licensing: open source (BSD) ● Compatibility: High, 8.3 ● Status: alpha ● Contributes: code
  • 39. PostgresForest ● What: PostgreSQL with middleware-based statement replication clustering ● Owner: NTT (I think) ● Type: Add-On ● Licensing: open source ● Compatibility: Not sure ● Status: beta, development halted ● Contributes: code
  • 40. pgPool-II ● What: statement-replication and partitioning middleware for PostgreSQL ● Owner: pgPool project, SRA ● Type: Add-On ● Licensing: open source ● Compatibility: High, 8.4 ● Status: beta, active development ● Contributes: code, money (SRA)
  • 41. Continuent ● What: statement-replication, mangagement and partitioning middleware for PostgreSQL ● Owner: Continuent corporation ● Type: Add-On ● Licensing: part open source, part proprietary ● Compatibility: High, 8.4 ● Status: old version production, Tungsten RSN ● Contributes: code, money
  • 42. GridSQL ● What: statement-replication, mangagement and partitioning middleware for PostgreSQL ● Owner: EnterpriseDB ● Type: Add-On ● Licensing: GPL ● Compatibility: High, 8.4 ● Status: production, active development ● Contributes: code, money ● Used to be “ExtenDB”, integrated into Postgres Plus
  • 44. Netezza ● What: Some PostgreSQL code (30-50%) plus proprietary code and hardware package ● Type: Fork ● Owner: Netezza Co. ● Licensing: Proprietary ● Compatibility: Low ● Forked from 7.2, no longer compatible ● Status: current production ● Contributes: nothing
  • 45. Greenplum ● What: Mostly PostgreSQL with proprietary MPP plus BI and VLDB tools ● great for huge data mining ● Owner: Greenplum ● Type: Fork ● Licensing: Proprietary ● Compatibility: Medium, 8.2 ● Status: production, active development ● Contributes: code, money ● used to be “Bizgres MPP”
  • 46. Vertica ● What: Proprietary code with a few PostgreSQL libraries for multi-TB column store database ● great for OLAP ● Owner: Vertica Co. ● Type: Fork (sort of) ● Licensing: Proprietary ● Compatibility: Low ● Status: production, active development ● Stonebraker project (no contributions)
  • 47. Paraccel ● What: Part-Postgres, part-proprietary in- memory clustered pseudo-column store. ● made for “online data analysis” ● Owner: Paraccel Co. ● Type: Fork ● Licensing: Proprietary ● Compatibility: Medium ● Status: production, active development ● Contributions: none to date
  • 48. AsterDB ● What: Patches and proprietary middleware for map/reduce queries on many Postgres servers. ● Owner: Aster Data Co. ● including former Continuent developer ● Type: Patch, Add-On ● Licensing: Proprietary ● Compatibility: High, 8.4 ● Status: production, active development ● Contributions: code, money
  • 49. Everest ● What: PostgreSQL with proprietary storage, MPP and column store ● scales to 10's of petabytes ● Owner: Yahoo! ● Type: Fork ● Licensing: Not available to public ● Compatibility: N/A ● Status: production ● Contributions: threw a nice party for us
  • 50. HadoopDB ● What: Middleware for running map/reduce queries over many Postgres instances. ● brand-new ● Owner: Yale University ● Type: Add-On ● Licensing: unclear ● Compatibility: High, 8.4 ● Status: alpha, active development ● Contributions: none yet
  • 51. List of Variants ● Exotic ● Clustering ● BI/DW ● Postgres-R ● Truviso ● Netezza ● pgCluster ● PostGIS ● pgCluster-II ● Greenplum ● PostgresForest ● SEPostgres ● pgPool-II ● Vertica Continuent Core Variants Paraccel ● ● ● ● GridSQL ● AsterDB ● Mammoth ● HadoopDB ● Postgres Plus ● Everest ● P.P. Advanced Server ● Fujistu Supported PG ● PostgreSQL for Solaris
  • 52. So, what are you waiting for? Go make your own fork!
  • 53. Contact ● Josh Berkus: josh@pgexperts.com ● blog: it.toolbox.com/blogs/database-soup ● www.pgexperts.com/presentations.html ● PostgreSQL: www.postgresql.org ● PostgreSQL Experts Inc.: www.pgexperts.com This talk is copyright 2009 Josh Berkus, and is licensed under the Creative Commons Attribution License. Most images are the property of their respective copyright holders. All images are used here for purposes of satire or reference to trademarked products.