Oracle JSON treatment evolution - from 12.1 to 18 AOUG-2018

JSON treatment challenges
FROM ORACLE 12.1 TO 18C

“
”
Oracle Database provides all of the benefits of
SQL and relational databases to JSON data, which
you store and manipulate in the same ways and
with the same confidence as any other type of
database data.
Oracle JSON developer guide
Disclaimer: The contents of this presentation are for informal guidance and discussion purposes only.

Agenda
 JSON in RDBMS
 Configuration
 Storage
 Ingestion
 Retrieval
 Generation
 Maintenance
 Q&A

JSON in RDBMS
Why
 Consistency (integrity, transaction ACIDity)
 complex objects denormalization
What
 Logs
 Configuration data, key/value user preferences
 Unstructured/semi-structured objects
Please forget about JSON-based analytics

DB configuration
 JSON patches on regular base
 sql scripts to check old issues

DB configuration 12.1
These patches MUST be installed
1. Patch 20080249: JSON Patch Bundle 1
Proactive bundle patches with JSON fixes - Doc ID 1937782.1

DB configuration 12.2
 Convince DBA to install latest Database Release Update with JSON fixes -
Doc ID 2285557.1

DB configuration 18C
Do not care!
Oracle 18C

RFC 4627 JSON requirements
Encoding
JSON text SHALL be encoded in Unicode. The default encoding is UTF-8.

JSON storage
CLOB = UCS2 => Size = characters count * 2 bytes!

JSON constraints
JAVA processing fails!

JSON constraints
Constraint works fine but JAVA still fails 

JSON ingestion performance
N, run
with json strict
unique names,
seconds
with json strict,
seconds
with json lax,
seconds
without
seconds
1 132 115 121 83
2 142 119 117 80
3 132 119 115 91
4 136 115 110 90
5 138 117 125 92
6 135 122 117 90
7 134 116 117 88
8 142 127 120 81
9 152 115 125 80
10 147 118 114 83
AVG,
seconds 139 118,3 118,1 85,8
2x performance degradation!

JSON ingestion performance
N, run
with json strict
and unique
names, seconds
with json
seconds
With json strict
and cache,
seconds
with json lax,
seconds
without
seconds
without
constraints
and with
cache
1 132 115 110 121 83 78
2 142 119 110 117 80 78
3 132 119 106 115 91 84
4 136 115 111 110 90 75
5 138 117 109 125 92 78
6 135 122 108 117 90 75
7 134 116 102 117 88 75
8 142 127 105 120 81 78
9 152 115 105 125 80 77
10 147 118 110 114 83 77
AVG,
seconds 139 118 107 118 85 77

Storage recommendations
Data types
 Small values up to 4000 characters – VARCHAR2
 More than 4000 - BLOB
BLOB
 2x less space consumption
 2x less I/O due less space
 No implicit character-set conversion if the database character set is not AL32UTF8
Constraints
 JSON STRICT without unique keys
Columns settings
 CACHE=YES

JSON limitations
 Path length - 4000 bytes
 In-memory JSON length – 32767 bytes max
 sql json functions return value length – 32Kb max

Ingestion
 JSON = string
 Insert works fine
 Update = full rewrite

Ingestion nuances
JSON is mentioned nowhere!!!

Ingestion nuances
IS JSON on source = no direct path! Oracle 18 behave the same!

Loading from files
Single row only 

Retrieval
 Parse on application server
 SQL Oracle JSON features parsing
 Virtual columns
 Oracle JSON features parsing in views

Virtual columns
Switch Adaptive Optimization off
or
Patch 24563796: WRONG RESULT IN COLUMN VALUE IN 12C should be applied!

Retrieval
Each JSON_VALUE reparses JSON!
Visible on documents more 4000 characters
In Oracle 12.2 it works fine!

Retrieval Common issues
 Views often become non mergeable
 ORA-600 and ORA-7445 No Data to be read from socket
 COUNT(distinct) = ORA-7445 No Data to be read from socket
 2 or more json_table = wrong results in aggregates

Remediation
 dbms_utility.expand_sql_text + sql plan
 use single json_table
 /*+ NO_MERGE*/ hint
 /*+ NO_QUERY_TRANSFORMATION*/ hint
 materialize JSON - materialize hint
 materialize JSON - by CTAS
 apex_json package + json-to-xml tranformation

Materialized views 12.1
Use 12.2 and later….

Real-time materialized views 12.2
Doesn’t work nowhere 

Real-time materialized views 12.2
18c and later…

Materialized views 18c
No commit!!!

JSON Validation
ORA-01722: invalid number
Validator 

JSON search
. notation is not supported again 12.1, 12.2, 18c!

Search quiz 2
Who stolen titles?!

Search quiz 2
Never do when troubleshoot JSON!!!

Search quiz 3
Task: find json with value = ‘640’ in class type ‘Country’.

Search quiz 3
5 steps
Index is used

Search expressions
Filter Anchor
Inside one object

Search expressions 18c
abs()
boolean()
booleanOnly()
ceiling()
date()
double()
floor()
length()
lower()
number()
numberOnly()
size()
string()
stringOnly()
timestamp()
type()
upper()

Ingestion
Execution time: ~100 seconds

Ingestion
Refresh job execution time: ~4 seconds

Ingestion

Ingestion
= 1
TRANSACTIONAL doesn’t work with JSON!

$G table recommendations
 Pin $G table in KEEP pool
 Setup stage_itab_max_row – 12.2 only

Ingestion
Standard tables: 5 seconds
$G KEEP pool: 2 seconds
Merge job: 2 seconds

Ingestion with Text Index
3 sessions simultaneously
Sequential processing on DR$INDEXNAME$R! table!

Sequential processing on DR$INDEXNAME$R! table!
Split count – 20 000 000 record!

Semi-parallel processing on DR$INDEXNAME$R! table!
3x time less locks!
3 rows

Ingestion with Text Index Oracle 12.1

Fast search KEEP pool
1. Allocate KEEP pool memory area using DB_KEEP_CACHE_SIZE
2. Pin in KEEP pool properly all:
 DR$ tables
 DR$ indexes
 LOB segments of DR$ tables
3. Set CACHE for DR$ tables
4. Load data in KEEP pool once by stored procedure or do nothing – keep pool will be
populated by itself during queries
5. Be happy with 5x performance boost
Until server reboot 

Extra fast search
Never do it!
Changes will be lost after rebuild!

Extra fast search
Repeat the same for indexes using i_index_clause!

Ultra-fast search InMemory option
 Oracle 12.2
 Extended data types should be enabled
 IMMEMORY_EXPRESSIONS_USAGE=ENABLE
 IMMEMORY_VIRTUAL_COLUMNS=ENABLE
 IS JSON check constraint is a must
 The whole table should be marked as INMEMORY

JSON storage InMemory option
 Stores JSON in binary OSON format (32 Kb max)
 Tries to create in-memory virtual columns
 Doesn’t affect json_textcontains operator
Use JSON-based InMemory materialized views instead !

Generation Plain JSON
N, run Concatenation Function UDF function JSON_OBJECT
1 0,156 0,39 0,375 0,187
2 0,188 0,343 0,39 0,203
3 0,156 0,36 0,391 0,203
4 0,171 0,359 0,359 0,188
5 0,156 0,359 0,359 0,219
6 0,172 0,36 0,343 0,204
7 0,188 0,344 0,344 0,204
8 0,156 0,344 0,36 0,188
9 0,172 0,391 0,359 0,203
10 0,172 0,36 0,359 0,188
AVG, seconds 0,17 0,38 0,36 0,20
Overhead, percent
% 15,10

Generation Nested JSON
N, run Concatenation Function UDF function XMLAGG JSON_OBJECT
1 0,328 0,5 0,49 6,047 0,171
2 0,328 0,525 0,52 5,98 0,156
3 0,344 0,526 0,52 5,340 0,156
4 0,344 0,598 0,51 6,015 0,188
5 0,344 0,56 0,55 6,000 0,141
6 0,328 0,65 0,65 6,102 0,156
7 0,344 0,599 0,6 5,894 0,203
8 0,382 0,6 0,6 5,25 0,172
9 0,328 0,687 0,54 5,998 0,156
10 0,328 0,642 0,53 5,97 0,156
AVG, seconds 0,34 0,59 0,55 5,82 0,17
Profit, percent 51,29

Data structure maintenance
 Do not base any checks on DBA_JSON_COLUMNS view!
 Posfix/prefix columns with JSON data via _JSON like INVOICE_JSON
 Create daily checks:
 JSON format (strict/lax)
 Field type (clob/blob/varchar2)
 CACHE option

DBA_JSON_COLUMNS Quiz
NO JSON there!

DBA_JSON_COLUMNS Quiz
The behavior is consistent between 12.1, 12. 2 and 18C!
No data at all!

1. Gather index statistics via CTX_REPORT.INDEX_STAT
2. Collect fragmented indexes - estimated row fragmentation.
3. Collect indexes with many deleted rows - estimated garbage size
4. Run ctx_ddl.optimize_index in FULL mode: SERIAL or PARALLEL
5. Oracle 18C Automatic Background Index Maintenance doesn’t optimize the index
Index optimization
No optimization = up to 10x search performance degradation!

Maintenance
 Check index statuses
 Rebuild the index

Maintenance for STAGE_ITAB option
12.1-12.2
 Enable AUTO_OPTIMIZE option
12.2-18
 If you under 12.2 proper parallelism should be setup via stage_itab_max_parallel
 If you under 12.2 setup how often optimization starts via stage_itab_max_rows
18
 Disable AUTO_OPTIMIZE option
 Use preference stage_itab_auto_opt

Maintenance for STAGE_ITAB option
12.1-12.2

Conclusion
 JSON = tradeoff
 row-per-row scenario is safe
 knowledge of Oracle Text is required
 .notation isn’t production ready
 Only Oracle 18 looks mature
 Dedicated JSON search solutions are faster than Oracle

THANK YOU FOR YOUR TIME!
Alexander Tokarev
Database expert
DataArt
shtock@mail.ru
https://github.com/shtock
https://www.linkedin.com/in/alexander-tokarev-14bab230

Oracle JSON treatment evolution - from 12.1 to 18 AOUG-2018

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Oracle JSON treatment evolution - from 12.1 to 18 AOUG-2018

Similar to Oracle JSON treatment evolution - from 12.1 to 18 AOUG-2018 (20)

More from Alexander Tokarev

More from Alexander Tokarev (20)

Recently uploaded

Recently uploaded (20)

Oracle JSON treatment evolution - from 12.1 to 18 AOUG-2018

Editor's Notes