This document discusses the PostgreSQL ecosystem and provides an overview of various tools and extensions. It begins with introductions of the speaker and Taiwan PostgreSQL User Group. The agenda then covers topics like load balancers (PgBouncer, Pgpool-II), replication (Postgres BDR), REST APIs (PostgREST), schema models (PostGIS, PipelineDB, Apache AGE), scalability (Greenplum DB, Postgres-XC/XC2/XL), foreign data wrappers, and other tools like Apache MADlib. The speaker hopes to promote wider adoption of PostgreSQL in Taiwan and collaboration with local system integrators.
Postgres indexing and toward big data application柏瑀 黃
Postgres has evolved from its origins as an academic project in the 1980s. It uses indexes like a book index to quickly find data by pointing to specific rows. As more data is added, updated, or deleted, indexes also need updating to accurately reflect the table data. Toward handling big data, Postgres supports parallelism, database partitioning, and sharding to distribute data across multiple machines.
Database-Migration and -Upgrade with Transportable TablespacesMarkus Flechtner
This document discusses using transportable tablespaces (TTS) to migrate a large telecommunications database from HP-UX to Linux with an Oracle upgrade. Key points:
- The customer has 4 databases totaling over 15TB that need to be migrated with downtime under 6 hours. TTS was chosen for the migration.
- Tuning efforts included resizing files, compression, and parallelizing file transfers and conversions across RAC nodes.
- Challenges included long metadata export times. The issue was addressed by splitting exports across multiple self-contained tablespace subsets in parallel.
- Automation scripts were created to coordinate the distributed migration work across RAC nodes.
This document summarizes the results of a benchmark test comparing the performance of GeoServer and MapServer web map server (WMS) implementations against different data backends and workloads. Key findings include that GeoServer was generally faster than MapServer at reading shapefiles and rendering plain polygons. Performance was similar between the two when using PostGIS and Oracle spatial backends. MapServer showed improved performance for labelled roads rendering compared to previous tests. Areas for potential improvement in future tests are also discussed.
Setup oracle golden gate 11g replicationKanwar Batra
How to setup Oracle Goldengate Replication between 11gR2 RAC or Single node instances. For RAC setup the GoldenGate custom cluster service . Not part of this document
The document discusses PostgreSQL's write-ahead log (WAL), which records database changes before writing them to disk for crash safety. The WAL allows for features like online backups by archiving WAL records, point-in-time recovery by restoring from backups and replaying WAL, and replication by transmitting WAL to standby servers. It works by writing each change as a WAL record before updating data pages, and replaying the log during recovery to reconstruct unfinished transactions after a crash.
Trivadis is a market leader in IT consulting, system integration, solution engineering, and IT services focusing on technologies in Switzerland, Germany, Austria and Denmark. It offers services in strategic business fields including Trivadis Services which takes over the interactive operation of IT systems. Trivadis has over 600 employees across 14 branches in Europe.
Real Application Cluster (RAC) allows multiple computers to simultaneously run Oracle RDBMS while accessing a single database, providing clustering. RAC provides high availability, scalability, and ease of administration by making multiple instances transparent to users. Nodes must have identical environments. Oracle Clusterware manages node additions and removals. Instances from different nodes write to the same physical database. The presentation covers RAC architecture, components, startup sequence, single instance configuration, node eviction, and tips for monitoring and improving the RAC environment.
This document discusses the PostgreSQL ecosystem and provides an overview of various tools and extensions. It begins with introductions of the speaker and Taiwan PostgreSQL User Group. The agenda then covers topics like load balancers (PgBouncer, Pgpool-II), replication (Postgres BDR), REST APIs (PostgREST), schema models (PostGIS, PipelineDB, Apache AGE), scalability (Greenplum DB, Postgres-XC/XC2/XL), foreign data wrappers, and other tools like Apache MADlib. The speaker hopes to promote wider adoption of PostgreSQL in Taiwan and collaboration with local system integrators.
Postgres indexing and toward big data application柏瑀 黃
Postgres has evolved from its origins as an academic project in the 1980s. It uses indexes like a book index to quickly find data by pointing to specific rows. As more data is added, updated, or deleted, indexes also need updating to accurately reflect the table data. Toward handling big data, Postgres supports parallelism, database partitioning, and sharding to distribute data across multiple machines.
Database-Migration and -Upgrade with Transportable TablespacesMarkus Flechtner
This document discusses using transportable tablespaces (TTS) to migrate a large telecommunications database from HP-UX to Linux with an Oracle upgrade. Key points:
- The customer has 4 databases totaling over 15TB that need to be migrated with downtime under 6 hours. TTS was chosen for the migration.
- Tuning efforts included resizing files, compression, and parallelizing file transfers and conversions across RAC nodes.
- Challenges included long metadata export times. The issue was addressed by splitting exports across multiple self-contained tablespace subsets in parallel.
- Automation scripts were created to coordinate the distributed migration work across RAC nodes.
This document summarizes the results of a benchmark test comparing the performance of GeoServer and MapServer web map server (WMS) implementations against different data backends and workloads. Key findings include that GeoServer was generally faster than MapServer at reading shapefiles and rendering plain polygons. Performance was similar between the two when using PostGIS and Oracle spatial backends. MapServer showed improved performance for labelled roads rendering compared to previous tests. Areas for potential improvement in future tests are also discussed.
Setup oracle golden gate 11g replicationKanwar Batra
How to setup Oracle Goldengate Replication between 11gR2 RAC or Single node instances. For RAC setup the GoldenGate custom cluster service . Not part of this document
The document discusses PostgreSQL's write-ahead log (WAL), which records database changes before writing them to disk for crash safety. The WAL allows for features like online backups by archiving WAL records, point-in-time recovery by restoring from backups and replaying WAL, and replication by transmitting WAL to standby servers. It works by writing each change as a WAL record before updating data pages, and replaying the log during recovery to reconstruct unfinished transactions after a crash.
Trivadis is a market leader in IT consulting, system integration, solution engineering, and IT services focusing on technologies in Switzerland, Germany, Austria and Denmark. It offers services in strategic business fields including Trivadis Services which takes over the interactive operation of IT systems. Trivadis has over 600 employees across 14 branches in Europe.
Real Application Cluster (RAC) allows multiple computers to simultaneously run Oracle RDBMS while accessing a single database, providing clustering. RAC provides high availability, scalability, and ease of administration by making multiple instances transparent to users. Nodes must have identical environments. Oracle Clusterware manages node additions and removals. Instances from different nodes write to the same physical database. The presentation covers RAC architecture, components, startup sequence, single instance configuration, node eviction, and tips for monitoring and improving the RAC environment.
PostgreSQL has advanced in many ways but bloat remains a challenge. A solution for this in development is zheap, a new storage format in which only the latest version of the data is kept in main storage and the old version will be moved to an undo log. In this presentation delivered at Postgres Vision 2018, Robert Haas, a Major Contributor to the PostgreSQL project who is leading development of zheap at EnterpriseDB, where he is Vice President, Chief Database Architect, explains the project.
Tungsten Use Case: How Gittigidiyor (a subsidiary of eBay) Replicates Data In...Continuent
Gittigidiyor, a subsidiary of eBay, needed to replicate data in real time from their MySQL database to an Oracle database to power their data warehouse. Continuent Tungsten was used to provide heterogeneous replication between the databases. The schema was translated from MySQL to Oracle using ddlscan. Initial data was exported and loaded into Oracle using parallel apply. Ongoing real-time replication now occurs between MySQL and Oracle using Tungsten Replicator with custom filters to handle data type translations.
Devrim Gunduz gives a presentation on Write-Ahead Logging (WAL) in PostgreSQL. WAL logs all transactions to files called write-ahead logs (WAL files) before changes are written to data files. This allows for crash recovery by replaying WAL files. WAL files are used for replication, backup, and point-in-time recovery (PITR) by replaying WAL files to restore the database to a previous state. Checkpoints write all dirty shared buffers to disk and update the pg_control file with the checkpoint location.
WMS Benchmarking presentation and results, from the FOSS4G 2011 event in Denver. 6 different development teams participated in this exercise, to display common data through the WMS standard the fastest. http://2011.foss4g.org/sessions/web-mapping-performance-shootout
The document summarizes performance tests comparing the open source WMS servers MapServer and GeoServer. It describes testing various configurations including PostGIS versus shapefiles, concurrent requests, and reprojection. The results show that both servers can achieve fast performance but require optimization of settings like using FastCGI for MapServer and Java 6 for GeoServer. Overall it aims to identify areas for improvement in both software packages to enhance WMS performance.
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff DavisCitus Data
Postgres relies heavily on an extension ecosystem, but that is almost 100% dependent on C; which cuts out developers, libraries, and ideas from the world of Postgres. postgres-extension.rs changes that by supporting development of extensions in Rust. Rust is a memory-safe language that integrates nicely in any environment, has powerful libraries, a vibrant ecosystem, and a prolific developer community.
Rust is a unique language because it supports high-level features but all the magic happens at compile-time, and the resulting code is not dependent on an intrusive or bulky runtime. That makes it ideal for integrating with postgres, which has a lot of its own runtime, like memory contexts and signal handlers. postgres-extension.rs offers this integration, allowing the development of extensions in rust, even if deeply-integrated into the postgres internals, and helping handle tricky issues like error handling. This is done through a collection of Rust function declarations, macros, and utility functions that allow rust code to call into postgres, and safely handle resulting errors.
Oracle Database 12c includes several new features:
1) Online statistics gathering improves optimizer performance by gathering statistics for new objects during creation instead of requiring a full data scan later.
2) Invisible columns allow adding a column to a table without showing it in SELECT queries or the table definition unless explicitly specified.
3) Multiple indexes on the same column are now supported if they differ in characteristics like being unique/non-unique or using different index types.
PostgreSQL 9.6 introduced wait events and PostgreSQL 10 progressed them, but what are they? What do they mean? How do I find them and how do I make them go away? Wait events are one of the most significant advancements in observability for PostgreSQL databases; their usefulness is unparalleled in troubleshooting performance. This talk will go into all that and more as we explore the world of PostgreSQL wait events.
The document discusses Oracle 12c's new "multi-process multi-threaded" model. This new feature allows Oracle database processes on Linux/Unix systems to run as operating system threads rather than processes. This reduces resource consumption by eliminating redundant memory and CPU usage from separate processes. Background processes and local client connections now run as threads within larger processes. Remote clients still use dedicated processes that connect via a connection broker thread.
DUCAT imparts training in a way that is more practical and efficient. Anyone who wants to learn ORACLE 11G RAC needs to look no further than DUCAT. It gives training so that after the training ORACLE 11G RAC becomes a part of you. ORACLE 11G RAC starts where Oracle Parallel Server ends. It allows someone to approach the same database simultaneously. It ensures full tolerance, load balancing and of course performance. The heart of this technology is a concept of shared disk subsystem.
The document discusses FDW-based sharding in PostgreSQL. It provides an overview of what database sharding and FDW-based sharding are. It then demonstrates FDW-based sharding in PostgreSQL 9.6, covering how data is inserted and queried across foreign child tables. The document outlines several challenges for FDW-based sharding like distributed transactions and asynchronous execution. It also reviews key techniques being developed for PostgreSQL 10 like push down optimizations and declarative partitioning.
Oracle Data Guard provides several key benefits: continuous database service during disasters or data failures, complete data protection against corruptions and loss, and offloading of queries and backups from primary systems. It uses redo transport to transfer redo logs from a primary database to one or more standby databases, and log apply to maintain synchronization. Different protection modes like maximum protection or maximum performance allow balancing data protection against primary performance.
12cR2 Single-Tenant: Multitenant Features for All EditionsFranck Pachot
Multitenant architecture is available even without Oracle's multitenant option. In this session take a look at the overhead and the 12.2 new features so that you can choose among single-tenant or non-container databases. These features include agility in data movement, easy flashback, and fast upgrade.
The document compares the performance of SQL Server Integration Services (SSIS) packages between SQL Server 2005 and 2008. It describes a study that loaded and aggregated over 1 billion rows of data from flat files using SSIS packages on high-end Unisys servers. The study found that SSIS 2008 packages with the optimized dataflow engine were over 3 times faster than equivalent SSIS 2005 packages for the same workload. Hardware upgrades, SQL Server 2008 configuration changes, and SSIS package optimizations all contributed to improved performance.
Postgres Toolkit is a collection of scripts and utilities that allows database administrators to perform complicated PostgreSQL management tasks with single commands. It focuses on frequent tasks like monitoring performance, checking configuration, and managing backups. The open source toolkit currently contains 13 scripts that work on Linux systems and PostgreSQL versions 9.0 through 9.4. It can be installed with a single curl command and includes utilities like pt-config to manage configuration files and pt-session-profiler to monitor long-running queries.
Toro DB- Open-source, MongoDB-compatible database, built on top of PostgreSQLInMobi Technology
Toro DB- Open-source, MongoDB-compatible database, built on top of PostgreSQL
By Álvaro Hernández at India PostgreSQL UserGroup Meetup, Bangalore
at InMobi.
http://technology.inmobi.com/events/india-postgresql-usergroup-meetup-bangalore
This document provides information on upgrading Oracle Clusterware from version 11gR2 to 12cR1. It begins with an introduction to the presenter and their experience. The agenda then outlines discussing introduction to Clusterware, prerequisites for upgrade, differences between traditional and Flex clusters, the upgrade process, recovering from failures, downgrade process, and tips for monitoring the RAC environment.
The document describes the initialization phase of an IMPORT job in Sqoop 2. It shows that the SqoopInputFormat gets splits from the database, which are then passed to the Partitioner to determine how to partition the data among mapper tasks. This establishes the basic workflow of reading data from the source and partitioning it for import into HDFS.
2022 COSCUP - Let's speed up your PostgreSQL services!.pptxJosé Lin
This document discusses ways to speed up PostgreSQL services. It begins with an introduction of the speaker and covers three main topics: architecture design, query optimization, and parameter configuration. For architecture design, it recommends using connection pools like PgBouncer and read-write separation with Pgpool-II. For query optimization, it suggests finding slow queries, diagnosing abnormal SQL, using EXPLAIN, and parallel queries. For parameter configuration, it provides guidance on tuning background writer settings and memory-related parameters. The goal is to avoid bottlenecks being discovered only during emergencies and optimize performance without focusing on hardware.
PostgreSQL has advanced in many ways but bloat remains a challenge. A solution for this in development is zheap, a new storage format in which only the latest version of the data is kept in main storage and the old version will be moved to an undo log. In this presentation delivered at Postgres Vision 2018, Robert Haas, a Major Contributor to the PostgreSQL project who is leading development of zheap at EnterpriseDB, where he is Vice President, Chief Database Architect, explains the project.
Tungsten Use Case: How Gittigidiyor (a subsidiary of eBay) Replicates Data In...Continuent
Gittigidiyor, a subsidiary of eBay, needed to replicate data in real time from their MySQL database to an Oracle database to power their data warehouse. Continuent Tungsten was used to provide heterogeneous replication between the databases. The schema was translated from MySQL to Oracle using ddlscan. Initial data was exported and loaded into Oracle using parallel apply. Ongoing real-time replication now occurs between MySQL and Oracle using Tungsten Replicator with custom filters to handle data type translations.
Devrim Gunduz gives a presentation on Write-Ahead Logging (WAL) in PostgreSQL. WAL logs all transactions to files called write-ahead logs (WAL files) before changes are written to data files. This allows for crash recovery by replaying WAL files. WAL files are used for replication, backup, and point-in-time recovery (PITR) by replaying WAL files to restore the database to a previous state. Checkpoints write all dirty shared buffers to disk and update the pg_control file with the checkpoint location.
WMS Benchmarking presentation and results, from the FOSS4G 2011 event in Denver. 6 different development teams participated in this exercise, to display common data through the WMS standard the fastest. http://2011.foss4g.org/sessions/web-mapping-performance-shootout
The document summarizes performance tests comparing the open source WMS servers MapServer and GeoServer. It describes testing various configurations including PostGIS versus shapefiles, concurrent requests, and reprojection. The results show that both servers can achieve fast performance but require optimization of settings like using FastCGI for MapServer and Java 6 for GeoServer. Overall it aims to identify areas for improvement in both software packages to enhance WMS performance.
Deep Postgres Extensions in Rust | PGCon 2019 | Jeff DavisCitus Data
Postgres relies heavily on an extension ecosystem, but that is almost 100% dependent on C; which cuts out developers, libraries, and ideas from the world of Postgres. postgres-extension.rs changes that by supporting development of extensions in Rust. Rust is a memory-safe language that integrates nicely in any environment, has powerful libraries, a vibrant ecosystem, and a prolific developer community.
Rust is a unique language because it supports high-level features but all the magic happens at compile-time, and the resulting code is not dependent on an intrusive or bulky runtime. That makes it ideal for integrating with postgres, which has a lot of its own runtime, like memory contexts and signal handlers. postgres-extension.rs offers this integration, allowing the development of extensions in rust, even if deeply-integrated into the postgres internals, and helping handle tricky issues like error handling. This is done through a collection of Rust function declarations, macros, and utility functions that allow rust code to call into postgres, and safely handle resulting errors.
Oracle Database 12c includes several new features:
1) Online statistics gathering improves optimizer performance by gathering statistics for new objects during creation instead of requiring a full data scan later.
2) Invisible columns allow adding a column to a table without showing it in SELECT queries or the table definition unless explicitly specified.
3) Multiple indexes on the same column are now supported if they differ in characteristics like being unique/non-unique or using different index types.
PostgreSQL 9.6 introduced wait events and PostgreSQL 10 progressed them, but what are they? What do they mean? How do I find them and how do I make them go away? Wait events are one of the most significant advancements in observability for PostgreSQL databases; their usefulness is unparalleled in troubleshooting performance. This talk will go into all that and more as we explore the world of PostgreSQL wait events.
The document discusses Oracle 12c's new "multi-process multi-threaded" model. This new feature allows Oracle database processes on Linux/Unix systems to run as operating system threads rather than processes. This reduces resource consumption by eliminating redundant memory and CPU usage from separate processes. Background processes and local client connections now run as threads within larger processes. Remote clients still use dedicated processes that connect via a connection broker thread.
DUCAT imparts training in a way that is more practical and efficient. Anyone who wants to learn ORACLE 11G RAC needs to look no further than DUCAT. It gives training so that after the training ORACLE 11G RAC becomes a part of you. ORACLE 11G RAC starts where Oracle Parallel Server ends. It allows someone to approach the same database simultaneously. It ensures full tolerance, load balancing and of course performance. The heart of this technology is a concept of shared disk subsystem.
The document discusses FDW-based sharding in PostgreSQL. It provides an overview of what database sharding and FDW-based sharding are. It then demonstrates FDW-based sharding in PostgreSQL 9.6, covering how data is inserted and queried across foreign child tables. The document outlines several challenges for FDW-based sharding like distributed transactions and asynchronous execution. It also reviews key techniques being developed for PostgreSQL 10 like push down optimizations and declarative partitioning.
Oracle Data Guard provides several key benefits: continuous database service during disasters or data failures, complete data protection against corruptions and loss, and offloading of queries and backups from primary systems. It uses redo transport to transfer redo logs from a primary database to one or more standby databases, and log apply to maintain synchronization. Different protection modes like maximum protection or maximum performance allow balancing data protection against primary performance.
12cR2 Single-Tenant: Multitenant Features for All EditionsFranck Pachot
Multitenant architecture is available even without Oracle's multitenant option. In this session take a look at the overhead and the 12.2 new features so that you can choose among single-tenant or non-container databases. These features include agility in data movement, easy flashback, and fast upgrade.
The document compares the performance of SQL Server Integration Services (SSIS) packages between SQL Server 2005 and 2008. It describes a study that loaded and aggregated over 1 billion rows of data from flat files using SSIS packages on high-end Unisys servers. The study found that SSIS 2008 packages with the optimized dataflow engine were over 3 times faster than equivalent SSIS 2005 packages for the same workload. Hardware upgrades, SQL Server 2008 configuration changes, and SSIS package optimizations all contributed to improved performance.
Postgres Toolkit is a collection of scripts and utilities that allows database administrators to perform complicated PostgreSQL management tasks with single commands. It focuses on frequent tasks like monitoring performance, checking configuration, and managing backups. The open source toolkit currently contains 13 scripts that work on Linux systems and PostgreSQL versions 9.0 through 9.4. It can be installed with a single curl command and includes utilities like pt-config to manage configuration files and pt-session-profiler to monitor long-running queries.
Toro DB- Open-source, MongoDB-compatible database, built on top of PostgreSQLInMobi Technology
Toro DB- Open-source, MongoDB-compatible database, built on top of PostgreSQL
By Álvaro Hernández at India PostgreSQL UserGroup Meetup, Bangalore
at InMobi.
http://technology.inmobi.com/events/india-postgresql-usergroup-meetup-bangalore
This document provides information on upgrading Oracle Clusterware from version 11gR2 to 12cR1. It begins with an introduction to the presenter and their experience. The agenda then outlines discussing introduction to Clusterware, prerequisites for upgrade, differences between traditional and Flex clusters, the upgrade process, recovering from failures, downgrade process, and tips for monitoring the RAC environment.
The document describes the initialization phase of an IMPORT job in Sqoop 2. It shows that the SqoopInputFormat gets splits from the database, which are then passed to the Partitioner to determine how to partition the data among mapper tasks. This establishes the basic workflow of reading data from the source and partitioning it for import into HDFS.
2022 COSCUP - Let's speed up your PostgreSQL services!.pptxJosé Lin
This document discusses ways to speed up PostgreSQL services. It begins with an introduction of the speaker and covers three main topics: architecture design, query optimization, and parameter configuration. For architecture design, it recommends using connection pools like PgBouncer and read-write separation with Pgpool-II. For query optimization, it suggests finding slow queries, diagnosing abnormal SQL, using EXPLAIN, and parallel queries. For parameter configuration, it provides guidance on tuning background writer settings and memory-related parameters. The goal is to avoid bottlenecks being discovered only during emergencies and optimize performance without focusing on hardware.
String Comparison Surprises: Did Postgres lose my data?Jeremy Schneider
Comparisons are fundamental to computing - and comparing strings is not nearly as straightforward as you might think. Come learn about the history, nuance and surprises of “putting words in order” that you never knew existed in computer science, and how that nuance impacts both general programming and SQL programming. Next, walk through a few actual scenarios and demonstrations using PostgreSQL as a user and administrator, which you can re-run yourself later for further study, including one way you could easily corrupt your self-managed PostgreSQL database if you aren't prepared. Finally we’ll dive into an explanation of the surprising behaviors we saw in PostgreSQL, and learn more about user and administrative features PostgreSQL provides related to localized string comparison.
This was my second presentation taken on MySQL User Camp ( Bangalore ) held on Nov-08 2013. I have a made a presentation about the percona tools mostly used percona tools.
Using open source tools for network device dataplane testing.
Our experiences from redGuardian DDoS mitigation scrubber testing.
Presented at PLNOG 20 (2018).
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...PROIDEA
Wybór docelowej platformy sieciowej (np. routera, firewalla, scrubbera DDoS) jest często poprzedzony jej testami. Jednym z celów testów jest sprawdzenie, czy parametry wydajnościowe deklarowane przez producenta odpowiadają rzeczywistości. Zespół rozwijający redGuardian Anty DDoS testuje rozwiązanie regresyjnie i wydajnościowo w sposób zautomatyzowany od początku jego istnienia. W czasie prezentacji przeanalizujemy aspekty, na które warto zwrócić uwagę w czasie testów wydajnościowych urządzeń IP oraz przyjrzymy się narzędziom open source pomocnym w realizacji tego zadania.
The document summarizes the new features in PostgreSQL 11, including enhancements to table partitioning such as hash partitioning, default partitions, updating partition keys, automatically creating indexes on partitions, adding unique constraints to partitions, and enabling partition-wise joins and aggregates. It also provides examples of using various partitioning features and explains the benefits of partition-wise processing.
The document discusses IO subsystem architecture in Linux. It contains the following key points:
1. It describes the different layers of the IO subsystem including the block layer, DM layer, request queue/elevator, and scheduler.
2. It explains various probes and tracepoints that can be used to trace IO requests as they pass through different layers like ioblock.request, ioscheduler.elv_add_request, etc.
3. It discusses tools like blktrace, btt, and stap that can be used to analyze block level traces and understand the lifecycle of an IO request through the various layers of the IO stack.
The document discusses IO subsystem architecture in Linux. It contains 3 layers: block layer, DM layer and request queue/elevator. The block layer handles generic block IO requests and completion events. The DM layer consists of components like LVM2 and EVMS. The request queue schedules requests using algorithms like deadline and anticipatory. It also contains probes and tracepoints to monitor IO events.
Cassandra was chosen over other NoSQL options like MongoDB for its scalability and ability to handle a projected 10x growth in data and shift to real-time updates. A proof-of-concept showed Cassandra and ActiveSpaces performing similarly for initial loads, writes and reads. Cassandra was selected due to its open source nature. The data model transitioned from lists to maps to a compound key with JSON to optimize for queries. Ongoing work includes upgrading Cassandra, integrating Spark, and improving JSON schema management and asynchronous operations.
The document discusses optimizations made to Infinispan to improve performance and consistency for a trading application at the Chicago Board Options Exchange (CBOE). Some key points include:
- Early adoption of Infinispan led to some performance and consistency issues that required troubleshooting logs, cache contents, and test applications.
- Optimizations focused on asynchronous communication, queue flushing, and separating state provider/consumer roles to balance performance and consistency across cache tiers.
- Log analysis was important for detecting issues like out-of-order operations between active and passive nodes. Configuration tweaks like increasing queue size also helped.
Bruno Decraene - Improving network availability through the graceful shutdown...PROIDEA
This document proposes a solution called "BGP graceful shutdown" to improve network availability when shutting down BGP sessions. It does so by allowing a graceful transition of traffic to an alternate path before fully removing the route, if an alternate path exists. The solution aims to minimize packet loss during the shutdown process. It involves preconfiguring policies on routers to tag outgoing routes for shutdown with a community and set local preference to 0 to redirect traffic. Evaluation on a test bed showed it can achieve the goal of zero packet loss compared to an abrupt route removal.
The document discusses several new features and enhancements in Oracle Database 11g Release 1. Key points include:
1) Encrypted tablespaces allow full encryption of data while maintaining functionality like indexing and foreign keys.
2) New caching capabilities improve performance by caching more results and metadata to avoid repeat work.
3) Standby databases have been enhanced and can now be used for more active purposes like development, testing, reporting and backups while still providing zero data loss protection.
The document discusses new features in Oracle Database 11g Release 1. Key points include:
1. Encrypted tablespaces allow encryption of data at the tablespace level while still supporting indexing and queries.
2. New caching capabilities improve performance by caching more results in memory, such as function results and query results.
3. Standby databases have enhanced capabilities and can now be used for more active purposes like development, testing and reporting for increased usability and value.
Reproducible Computational Pipelines with Docker and Nextflowinside-BigData.com
This document summarizes a presentation about using Docker and Nextflow to create reproducible computational pipelines. It discusses two major challenges in computational biology being reproducibility and complexity. Containers like Docker help address these challenges by creating portable and standardized environments. Nextflow is introduced as a workflow framework that allows pipelines to run across platforms and isolates dependencies using containers, enabling fast prototyping. Examples are given of using Nextflow with Docker to run pipelines on different systems like HPC clusters in a scalable and reproducible way.
Confoo.ca conference talk February 24th 2021 on MySQL new features found in version 8.0 including server and supporting utility updates for those who may have missed some really neat new features
The document provides an overview of PostgreSQL best practices, including installation, configuration, performance optimization, and security. It discusses setting up a PostgreSQL cluster, optimizing the operating system, installing PostgreSQL, securing the database with configuration files, tuning main PostgreSQL parameters, and performing backups and recovery. It also outlines an OLTP performance benchmark comparing PostgreSQL to Oracle configurations and results.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Zilliz
Join us to introduce Milvus Lite, a vector database that can run on notebooks and laptops, share the same API with Milvus, and integrate with every popular GenAI framework. This webinar is perfect for developers seeking easy-to-use, well-integrated vector databases for their GenAI apps.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
14. 1) Deduplication in B-Tree Indexes
Taiwan PostgreSQL User Group 142020/8/3
https://www.researchgate.net/figure/Overview-of-deduplication-processing_fig1_305801856
https://www.highgo.ca/2020/07/06/features-in-pg13-deduplication-in-b-tree-indexes/
Deduplication Implement Concept (general for files)
15. 1) Deduplication in B-Tree Indexes
實測 (create table、insert 100萬筆、 create index)
Taiwan PostgreSQL User Group 152020/8/3
postgres=# CREATE TABLE btree_dups AS (SELECT GENERATE_SERIES(1, 1000000)::BIGINT
AS val);
SELECT 1000000
postgres=# CREATE INDEX btree_idx ON btree_dups(val);
CREATE INDEX
postgres=# SELECT c.relname , c.relkind, pg_size_pretty(pg_relation_size(c.oid))
FROM pg_class c
WHERE c.relname = 'btree_dups'
OR c.oid IN (
SELECT i.indexrelid FROM pg_index i WHERE i.indrelid =
'btree_dups'::regclass );
relname | relkind | pg_size_pretty
------------+---------+----------------
btree_dups | r | 35 MB
btree_idx | i | 21 MB
(2 rows)
postgres=# UPDATE btree_dups SET val = val + 1;
UPDATE 1000000
16. 1) Deduplication in B-Tree Indexes
差異比較 (效能)
Taiwan PostgreSQL User Group 162020/8/3
-- 12.2
postgres=# EXPLAIN SELECT val FROM
btree_dups;
QUERY PLAN
--------------------------------------
Seq Scan on btree_dups
(cost=0.00..23275.00 rows=1000000
width=8)
(1 row)
Time: 2.415 ms
postgres=# DO
postgres-# $$BEGIN
postgres$# PERFORM * FROM btree_dups;
postgres$# END;
postgres$# $$;
DO
Time: 190.101 ms
-- 13 Beta 2
postgres=# EXPLAIN SELECT val FROM
btree_dups;
QUERY PLAN
--------------------------------------
Seq Scan on btree_dups
(cost=0.00..23274.00 rows=1000000
width=8)
(1 row)
Time: 2.221 ms
postgres=# DO
postgres-# $$BEGIN
postgres$# PERFORM * FROM btree_dups;
postgres$# END;
postgres$# $$;
DO
Time: 174.843 ms
17. 1) Deduplication in B-Tree Indexes
差異比較 (空間)
Taiwan PostgreSQL User Group 172020/8/3
-- 12.2
relname | relkind | pg_size_pretty
------------+---------+---------------
-
btree_dups | r | 104 MB
btree_idx | i | 86 MB
(2 rows)
-- 13 Beta 2
relname | relkind | pg_size_pretty
------------+---------+---------------
-
btree_dups | r | 104 MB
btree_idx | i | 62 MB
(2 rows)
18. 2) Logical Replication Partitioning
https://severalnines.com/database-blog/logical-replication-partitioning-postgresql-13
Taiwan PostgreSQL User Group 182020/8/3
No Partitioning Partitioned
新增 Partitioned Table 的 Logical Replication 功能
19. 2) Logical Replication Partitioning
Taiwan PostgreSQL User Group 192020/8/3
Now
支援Partitioning
Past
20. 2) Logical Replication Partitioning
差異比較 (SQL)
Taiwan PostgreSQL User Group 202020/8/3
-- 12.2
CREATE PUBLICATION rep_part_pub FOR
TABLE stock_sales
WITH (publish_via_partition_root);
ERROR: "stock_sales" is a partitioned
table
DETAIL: Adding partitioned tables to
publications is not supported.
HINT: You can add the table
partitions individually.
-- 13 Beta 2
CREATE PUBLICATION rep_part_pub FOR
TABLE stock_sales
WITH (publish_via_partition_root);
CREATE SUBSCRIPTION rep_part_sub
CONNECTION 'host=192.168.56.101
port=5432 user=rep_usr
password=rep_pwd dbname=postgres'
PUBLICATION rep_part_pub;
21. 3) Incremental sort
Taiwan PostgreSQL User Group 212020/8/3
新增 enable_incrementalsort 參數
-- 13 Beta, enable_incrementalsort=on
postgres=# set enable_incrementalsort=on ;
SET
postgres=# SHOW enable_incrementalsort ;
enable_incrementalsort
------------------------
on
(1 row)
22. 3) Incremental sort
實測 (建 Table, data, index)
Taiwan PostgreSQL User Group 222020/8/3
-- CREATE
CREATE TABLE t_is(a int4,b
int4,ctime timestamp(6) without
time zone);
-- INSERT 2 times
INSERT INTO t_is(a,b,ctime) SELECT
n,round(random()*100000000),
clock_timestamp() FROM
generate_series(1,1000000) n;
INSERT INTO t_is(a,b,ctime) SELECT
n,round(random()*100000000),
clock_timestamp() FROM
generate_series(1,1000000) n;
-- CREATE INDEX
CREATE INDEX idx_t_is_a ON t_is
USING BTREE(a);
-- Check
postgres=# SELECT * FROM t_is ORDER BY a,b
LIMIT 10;
a | b | ctime
---+----------+----------------------------
1 | 60379526 | 2020-07-21 16:28:42.034869
1 | 73197294 | 2020-07-21 16:28:45.496297
2 | 943408 | 2020-07-21 16:28:45.496346
2 | 27584454 | 2020-07-21 16:28:42.036121
3 | 31616182 | 2020-07-21 16:28:45.496348
3 | 88997913 | 2020-07-21 16:28:42.036134
4 | 21557231 | 2020-07-21 16:28:45.49635
4 | 23206459 | 2020-07-21 16:28:42.036136
5 | 13268559 | 2020-07-21 16:28:45.496351
5 | 33672766 | 2020-07-21 16:28:42.036137
(10 rows)
https://postgres.fun/20200721193000.html
23. 3) Incremental sort
差異比較 (13版, 啟用incrementalsort)
Taiwan PostgreSQL User Group 232020/8/3
-- 13 Beta, enable_incrementalsort=on
postgres=# EXPLAIN ANALYZE SELECT * FROM t_is ORDER BY a,b LIMIT 10;
QUERY PLAN
----------------------------------------------------------------------------
-------------------------------------------------------------
Limit (cost=0.51..1.16 rows=10 width=16) (actual time=0.042..0.044 rows=10
loops=1)
-> Incremental Sort (cost=0.51..130115.72 rows=2000000 width=16)
(actual time=0.041..0.042 rows=10 loops=1)
Sort Key: a, b
Presorted Key: a
Full-sort Groups: 1 Sort Method: quicksort Average Memory: 25kB
Peak Memory: 25kB
-> Index Scan using idx_t_is_a on t_is (cost=0.43..58848.31
rows=2000000 width=16) (actual time=0.021..0.027 rows=11 loops=1)
Planning Time: 0.106 ms
Execution Time: 0.064 ms
(8 rows)
26. 3. Incremental sort
Taiwan PostgreSQL User Group 262020/8/3
差異比較
-- 13 Beta, enable_incrementalsort=on
Execution Time: 0.064 ms
-- 13 Beta, enable_incrementalsort=off
Execution Time: 160.027 ms
-- 12 no enable_incrementalsort parameter
Execution Time: 147.247 ms
27. 4. parallel vacuum performance
實測說明
BACKGROUND
For testing the parallel vacuum performance we have constructed a scenario where vacuum is at
the verge of freezing by executing 50 million (vacuum_freeze_min_age) transactions. We
executed non-in place updates which will create huge bloat in the table as well as
indexes . After this point, we have maintained the copy of the database and executed the
vacuum with different numbers of workers on the same state of database and measured the
execution time.
OBSERVATION
We have observed that when the database is in dire need of completing the vacuum the non-
parallel vacuum took more than an hour to execute which we are able to complete in just 16 mins
with parallel vacuum which is nearly 4 times faster.
https://www.enterprisedb.com/postgres-tutorials/what-parallel-vacuum-postgresql-13
https://www.highgo.ca/2020/02/28/parallel-vacuum-in-upcoming-postgresql-13/
Taiwan PostgreSQL User Group 272020/8/3
28. 4. parallel vacuum performance
Taiwan PostgreSQL User Group 282020/8/3
Parameters for parallel vacuum
Min_parallel_index_scan_size 512kB
Max_parallel_maintenance_workers 8
Other Performance Parameters
Shared_buffers 128GB
Maintenance_work_mem 1GB
參數設定 建立Table
CREATE TABLE pgbench_accounts (
aid bigint,
bid bigint,
abalance bigint,
filler1 text DEFAULT md5(random()::text),
filler2 text DEFAULT md5(random()::text),
filler3 text DEFAULT md5(random()::text),
filler4 text DEFAULT md5(random()::text),
filler5 text DEFAULT md5(random()::text),
filler6 text DEFAULT md5(random()::text),
filler7 text DEFAULT md5(random()::text),
filler8 text DEFAULT md5(random()::text),
filler9 text DEFAULT md5(random()::text),
filler10 text DEFAULT md5(random()::text),
filler11 text DEFAULT md5(random()::text),
filler12 text DEFAULT md5(random()::text)
);
29. 4. parallel vacuum performance
建立 DATA & INDEXES
Taiwan PostgreSQL User Group 292020/8/3
INSERT INTO pgbench_accounts select i,i%10,0 FROM
generate_series(1,100000000) as i;
CREATE UNIQUE INDEX pgb_a_aid ON pgbench_accounts(aid);
CREATE INDEX pgb_a_bid ON pgbench_accounts(bid);
CREATE INDEX pgb_a_abalance ON pgbench_accounts(abalance);
CREATE INDEX pgb_a_filler1 ON pgbench_accounts(filler1);
CREATE INDEX pgb_a_filler2 ON pgbench_accounts(filler2);
CREATE INDEX pgb_a_filler3 ON pgbench_accounts(filler3);
CREATE INDEX pgb_a_filler4 ON pgbench_accounts(filler4);
CREATE INDEX pgb_a_filler5 ON pgbench_accounts(filler5);
CREATE INDEX pgb_a_filler6 ON pgbench_accounts(filler6);
CREATE INDEX pgb_a_filler7 ON pgbench_accounts(filler7);
CREATE INDEX pgb_a_filler8 ON pgbench_accounts(filler8);
CREATE INDEX pgb_a_filler9 ON pgbench_accounts(filler9);
CREATE INDEX pgb_a_filler10 ON pgbench_accounts(filler10);
CREATE INDEX pgb_a_filler11 ON pgbench_accounts(filler11);
CREATE INDEX pgb_a_filler12 ON pgbench_accounts(filler12);
30. 4). parallel vacuum performance
WORKLOAD
Taiwan PostgreSQL User Group 302020/8/3
./pgbench -c32 -j32 -t15000000 -M prepared -f script.sql postgres
set aid random(1, 100000000)
set bid random(1, 100000000)
set delta random(-5000, 5000)
BEGIN;
UPDATE pgbench_accounts SET bid=:bid WHERE aid = :aid;
END;
SCRIPT
31. 4). parallel vacuum performance
Taiwan PostgreSQL User Group 312020/8/3
postgres=# VACUUM (PARALLEL 4, VERBOSE) pgbench_accounts;
INFO: vacuuming "public.pgbench_accounts"
INFO: launched 2 parallel vacuum workers for index vacuuming
(planned: 2)
...
...
VACUUM
32. 5. Others
a) Modified catalogs: 13 Added, 5 Dropped, 3 changed
b) Data types: 64-bit transaction IDs…
c) Disk based hash aggregation
d) Backup manifests:with consistency checks.
e) libpq connection string:ssl_min_protocol_version, ssl_max_protocol_versionparameter
f) Column trigger:now executed on the subscription side.
Taiwan PostgreSQL User Group 322020/8/3
PostgreSQL 13 performs storage based hash aggregation if the hash table cannot be stored in work memory.
This feature can be controlled by the parameter enable_hashagg_disk (default ‘on’) and the hashagg_disk
(default ‘on’) and the parameter enable_groupingsets_hash_disk (default 'off') used in the GROUPING SETS
clause. In parameter enable_groupingsets_hash_disk (default 'off') used in the GROUPING SETS clause.
34. 03. SQL Statement (1/3)
a) ALTER NO DEPENDS ON (for functions, indexes, materialized views, and triggers)
b) ALTER TABLE DROP EXPRESSION statement
c) ALTER VIEW
d) CREATE DATABASE
e) CREATE INDEX
Taiwan PostgreSQL User Group 342020/8/3
ALTER FUNCTION func1 DEPENDS ON EXTENSION cube ;
ALTER FUNCTION func1 NO DEPENDS ON EXTENSION cube ;
CREATE TABLE gen1(c1 INT, c2 INT, c3 INT GENERATED ALWAYS AS (c1 + c2) STORED) ;
ALTER TABLE gen1 ALTER COLUMN c3 DROP EXPRESSION IF EXISTS ;
ALTER VIEW view_name RENAME COLUMN old_name TO new_name ;
CREATE DATABASE database_name [[WITH] LOCALE [=] locale_name
CREA TE INDEX idx2_dedup ON data1(c3) WITH (deduplicate_items = on) ;
35. 03. SQL Statement (2/3)
f) CREATE TABLE (added parameters)
Taiwan PostgreSQL User Group 352020/8/3
ALTER TABLE data1 SET (toast.vacuum_index_cleanup = OFF) ;
ALTER TABLE data1 SET (autovacuum_vacuum_insert_threshold = 10000) ;
36. 03. SQL Statement (3/3)
g) INSERT OVERRIDING USER VALUE
h) JSON
• Allow Unicode escape
• Datetime method of jsonpath
i) MAX/MIN pg_lsn
j) ROW expressions
k) Distance operators (<->)
l) gen_random_uuid() ;
Taiwan PostgreSQL User Group 362020/8/3
38. 04. Utilities (1/2)
a) dropdb
b) pg_basebackup
a) added manifest related parameters
b) added checksum related parameters
c) pg_dump
d) pg_verifybackup
• Manifest file version
• The checksum of the manifest file itself
• File size
• File checksum
• WAL file integrityWAL file integrity
Taiwan PostgreSQL User Group 382020/8/3
$ dropdb --force --echo demodb
$pg_dump –d demodb –-include-foreign-data=svr1
39. 04. Utilities (2/2)
e) reindexdb
f) vacuumdb
Taiwan PostgreSQL User Group 392020/8/3
$ reindexdb --concurrently --jobs 2 postgres
reindexdb: warning: cannot reindex system catalogs concurrently,
skipping all
$vacuumdb --parallel=4 postgres
vacuumdb: vacuuming database "postgres"