SQL, Data Plans and Gettings Things Fast From Our Database -- SkiPHP Presentation

  • 306 views
Uploaded on

Many developers are stumped at how to make SQL really perform and this presentation covers some of the logic behind SQL, how to read EXPLAIN output, and using VISUAL EXPLAIN. Many folks wonder why an …

Many developers are stumped at how to make SQL really perform and this presentation covers some of the logic behind SQL, how to read EXPLAIN output, and using VISUAL EXPLAIN. Many folks wonder why an entire table needs to be read when you use a LIMIT or why indexes need maintanance.

More in: Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
306
On Slideshare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
10
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. SQL, Data Plans and Gettings Things Fast From Our Database Very few programmers really understand SQL or how to speed queries to their databases. This session covers that basics of relational calculus (no actual math/calculus will be demanded of attendees), how a RDMS like MySQL tries to optimize the query, and introduces query tuning. Dave Stokes david.stokes@oracle.com @stoker slideshare.net/davestokes
  • 2. SQL Structured Query Language Structured Query Language (/ˈɛs kjuː ˈɛl/,[4] or /ˈsiːkwəl/; (SQL)[5][6][7][8]) is a special-purpose programming language designed for managing data held in a relational database management system (RDBMS). Originally based upon relational algebra and tuple relational calculus, SQL consists of a data definition language and a data manipulation language. The scope of SQL includes data insert, query, update and delete, schema creation and modification, and data access control. Although SQL is often described as, and to a great extent is, a declarative language (4GL), it also includesprocedural elements. relational algebra is an offshoot of first-order logic and of algebra of sets concerned with operations over finitary relations, usually made more convenient to work with by identifying the components of a tuple by a name (called attribute) rather than by a numeric column index, which is called a relation in database terminology. --Wikipedia
  • 3. You send SQL to the server … The mysqld process will take your input and parse it for VALID syntax. Then it will build a query plan on how best to retrieve the data. Finally it goes to fetch the data. * MySQL’s NoSQL queries that skip these steps are MUCH faster
  • 4. GOALs 1. Get the data that you need and only what you need as fast as possible. No ‘SELECT * FROM’ 2. Avoid unnecessary disk/memory reads and disk writes. 3. Make data as compact as is usefull, no BIGINTs for zipcodes.
  • 5. Cost Based Optimizer C.5.6. Optimizer-Related Issues MySQL uses a cost-based optimizer to determine the best way to resolve a query. In many cases, MySQL can calculate the best possible query plan, but sometimes MySQL does not have enough information about the data at hand and has to make “educated” guesses about the data. So MySQL wants to get your data as cheaply as possible and plans accordingly.
  • 6. Query Plan not lockable with MySQL Each time MySQL gets a query it will optimise it! It builds a list of statistics over time to help keep track of data & speed retrieval of the data (5.6 lets you save/restore this information)
  • 7. EXPLAIN EXPLAIN is a tool to ask the server how it wants to optimize the query. Prepend to a QUERY
  • 8. Example Table
  • 9. Example Query
  • 10. Example EXPLAIN
  • 11. Example EXPLAIN with G Query# Reads all records in table! No keys (indexes)
  • 12. A Quick Word on Indexes Indexes allow you to go directly to the record(s) you want (think SSN) instead of reading all records to find the one(s) wanted. But they require maintenance and overhead. Not a panacea!
  • 13. select_type
  • 14. Using WHERE Only 274 records read!
  • 15. Previous query w/o INDEX No index used and all records in table read to find 274 records wanted!
  • 16. How to find index(es) already in use
  • 17. OR ...
  • 18. A more common example Has to read ALL records in Country Could use the PRIMARY KEY but doesn’t! Uses CountryCode Optimer estimates 8 reads to match all records - 8x239 = 1,912
  • 19. Slightly more complex query SELECT a.Name as 'City', b.Name as 'Country', a.population FROM City a JOIN Country b ON (a.CountryCode = b.Code) WHERE a.population > 3000000 AND b.LifeExpectancy > 66 ORDER BY b.name, a.Population LIMIT 20;
  • 20. Gee, we added all that stuff after the where and we are still doing the 239x8 reads! But to GET the records we need to GET the records and then filter!
  • 21. Visual Explain MySQL 5.6 and Workbench 6 use JSON format output to generate diagram. Costs published with 5.7 and 6.1!
  • 22. Yet a little deeper into complexity SELECT CONCAT(customer.last_name, ', ', customer.first_name) AS customer, address.phone, film.title FROM rental INNER JOIN customer ON rental.customer_id = customer.customer_id INNER JOIN address ON customer.address_id = address.address_id INNER JOIN inventory ON rental.inventory_id = inventory.inventory_id INNER JOIN film ON inventory.film_id = film.film_id WHERE rental.return_date IS NULL AND rental_date + INTERVAL film.rental_duration DAY < CURRENT_DATE()
  • 23. *************************** 4. row *************************** id: 1 select_type: SIMPLE table: customer type: eq_ref possible_keys: PRIMARY,idx_fk_address_id key: PRIMARY key_len: 2 ref: sakila.rental.customer_id rows: 1 Extra: NULL *************************** 5. row *************************** id: 1 select_type: SIMPLE table: address type: eq_ref possible_keys: PRIMARY key: PRIMARY key_len: 2 ref: sakila.customer.address_id rows: 1 Extra: NULL 5 rows in set (0.00 sec) *************************** 1. row *************************** id: 1 select_type: SIMPLE table: film type: ALL possible_keys: PRIMARY key: NULL key_len: NULL ref: NULL rows: 1000 Extra: NULL *************************** 2. row *************************** id: 1 select_type: SIMPLE table: inventory type: ref possible_keys: PRIMARY,idx_fk_film_id key: idx_fk_film_id key_len: 2 ref: sakila.film.film_id rows: 2 Extra: Using index *************************** 3. row *************************** id: 1 select_type: SIMPLE table: rental type: ref possible_keys: idx_fk_inventory_id,idx_fk_customer_id key: idx_fk_inventory_id key_len: 3 ref: sakila.inventory.inventory_id rows: 1 Extra: Using where
  • 24. A little easier to understand
  • 25. Compound Indexes We can use this index searching on 1. City, State, and Zip 2. City, State 3. City
  • 26. Musts for better queries 1. Read chapter 8 of the MySQL Manual 2. Join on like data types, INTs with INTS 3. Keep columns as small as practical (PROCEDURE ANALYSE) 4. Maintain B-tree index with ANALYSE TABLE when things are quiet 5. Keep looking for improvments
  • 27. Hard to teach all in a few minutes
  • 28. MySQL Connect Four days with the MySQL Engineers, innovative customers (Facebook, Twitter, Playful Play, Verizon, Paypal, & you), and the top professionals from the MySQL Community. Starts September 27th in San Francisco!
  • 29. Questions and Answers David.Stokes@Oracle.com @stoker Slideshare.net/davestokes