0% found this document useful (0 votes)

15 views

29-Query Optimization-04-10-2024

vfsvzfv

Uploaded by

Hemesh R

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views

29-Query Optimization-04-10-2024

vfsvzfv

Uploaded by

Hemesh R

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 35

Query Processing

Heuristic Query Optimization

• σ marks >60 (π marks (student_marks))

• π marks ( σ marks >60 (student_marks))

Processing a Query
• Tasks in processing a high-level query
1. Scanner scans the query and identifies the language tokens
2. Parser checks syntax of the query
3. The query is validated by checking that all attribute names and
relation names are valid
4. An intermediate internal representation for the query is created
(query tree or query graph)
5. Query execution strategy is developed
6. Query optimizer produces an execution plan
7. Code generator generates the object code
8. Runtime database processor executes the code

• Query processing and query optimization

Processing a Query
• Input:
– A query written in SQL is given as input to the
query processor.
• Parsing
– In this step, the parser of the query processor
module checks the syntax of the query, the user’s
privileges to execute the query, the table names
and attribute names, etc. The correct table names,
attribute names and the privilege of the users can
be taken from the system catalog (data
dictionary).
• Translation
– If we have written a valid query, then it is
converted from high level language SQL to low
level instruction in Relational Algebra.
– For example, our SQL query can be converted into
a Relational Algebra equivalent as follows;
– SELECT Ename FROM Employee, Proj_Assigned
WHERE Employee.Eno = Proj_Assigned.Eno AND
DOP > 10;
– πEname(σDOP>10 Λ Employee.Eno=Proj_Assigned.Eno (Employee X
Prof_Assigned))
• Optimizer
– Optimizer uses the statistical data stored as part of
data dictionary. The statistical data are information
about the size of the table, the length of records,
the indexes created on the table, etc. Optimizer also
checks for the conditions and conditional attributes
which are parts of the query.
• Execution Plan
– A query can be expressed in many ways. The query
processor module, at this stage, using the
information collected in step 3 to find different
relational algebra expressions that are equivalent
and return the result of the one which we have
written already.
– For our example, the query written in Relational
algebra can also be written as the one given below;
– πEname(Employee ⋈Eno (σDOP>10 (Prof_Assigned)))
– So far, we have got two execution plans. Only
condition is that both plans should give the same
result.
• Evaluation
– At this stage, we choose one execution plan of the
several we have developed. This Execution plan
accesses data from the database to give the final
result.
– In our example, the second plan may be good. In the
first plan, we join two relations (costly operation)
then apply the condition (conditions are considered
as filters) on the joined relation. This consumes more
time as well as space.
– In the second plan, we filter one of the tables
(Proj_Assigned) and the result is joined with the
Employee table. This join may need to compare less
number of records. Hence, the second plan is the
best (with the information known, not always).
Heuristic Optimization of Query Trees

Heuristic: Rule that leads to the least cost in most

cases

Query Tree (relational algebra expression)

leaf node :relations

Internal node :relational algebra operations
execution of query trees: post order traversal of tree
Heuristic Optimization of Query Trees-
Example
Heuristic Optimization of Query Trees-
Example
• Query
"Find the last names of employees born after
1957 who work on a project named ‘Aquarius’."

• SQL
SELECT LNAME
FROM EMPLOYEE, WORKS_ON, PROJECT
WHERE PNAME=‘Aquarius’ AND PNUMBER=PNO
AND ESSN=SSN AND BDATE.‘1957-12-31’;
Steps in converting a query tree during heuristic optimization.

a) Initial (canonical) query tree for SQL query Q.

b) Moving SELECT operations down the query tree.

c) Applying the more restrictive SELECT operation first.

d) Replacing CARTESIAN PRODUCT and SELECT with JOIN operations.

e) Moving PROJECT operations down the query tree.

SELECT LNAME
FROM EMPLOYEE, WORKS_ON, PROJECT
WHERE PNAME=“Aquarius’ AND PNUMBER=PNO
AND ESSN=SSN AND BDATE > ‘DEC-31-1957’

Canonical
query tree

6-16 18-17
a) Executing this tree directly first creates a very large file
containing the CARTESIAN PRODUCT of the entire
EMPLOYEE, WORKS_ON, and PROJECT files. That is
why the initial query tree is never executed, but is
transformed into another equivalent tree that is efficient to
execute. This particular query needs only one record from
the PROJECT relation— for the ‘Aquarius’ project—and
only the EMPLOYEE records for those whose date of birth
is after ‘1957-12-31’.
Moving SELECT operations
down the query tree

6-16 18-19
(b) shows an improved query tree that first applies the
SELECT operations to reduce the number of tuples that
appear in the CARTESIAN PRODUCT.
(c) Applying more
restrictive SELECT operation first

SELECT LNAME
FROM EMPOYEE, WORKS_ON, PROJECT
WHERE PNAME=‘Aquarius’ AND
PUMBER=PNO AND
ESSN=SSN AND
BDATE > ‘DEC-31-1957’

6-17 18-21
c) A further improvement is achieved by switching the
positions of the EMPLOYEE and PROJECT relations in the
tree, as shown in Figure (c). This uses the information that
Pnumber is a key attribute of the PROJECT relation, and
hence the SELECT operation on the PROJECT relation will
retrieve a single record only.
Replacing CARTESIAN PRODUCT and SELECT with JOIN

6-17 18-23
d) We can further improve the query tree by replacing any
CARTESIAN PRODUCT operation that is followed by a join
condition with a JOIN operation, as shown in Figure(d).
Moving PROJECT operations down

Transformation should keep equivalence

6-18 18-25
e) Another improvement is to keep only the attributes
needed by subsequent operations in the intermediate
relations, by including PROJECT (π) operations as early as
possible in the query tree, as shown in Figure (e). This
reduces the attributes (columns) of the intermediate
relations, whereas the SELECT operations reduce the
number of tuples (records).
SQL Query with an Uncorrelated
Subquery
Find the movies with stars born in 1960
MovieStar(name, address, gender, birthdate)
StarsIn(title, year, starName)

SELECT title
FROM StarsIn
WHERE starName IN (
SELECT name
FROM MovieStar
WHERE birthdate LIKE ‘%1960’
);
Parse Tree
<Query>

<SFW>

SELECT <SelList> FROM <FromList> WHERE <Condition>

<Attribute> <RelName> <Tuple> IN <Query>

title StarsIn <Attribute> ( <Query> )

starName <SFW>

SELECT <SelList> FROM <FromList> WHERE <Condition>

<Attribute> <RelName> <Attribute> LIKE <Pattern>

name MovieStar birthDate ‘%1960’

Generating Relational Algebra
title
Two-argument selection

StarsIn <condition>

<tuple> IN name
<attribute> birthdate LIKE ‘%1960’
starName MovieStar
Applying the Rewrite Rule

title title
starName=name

StarsIn <condition>

<tuple> IN name StarsIn δ

<attribute> birthdate LIKE ‘%1960’ name

birthdate LIKE ‘%1960’
starName MovieStar

MovieStar
Improving the Logical Query Plan
title
title
starName=name
 starName=name

StarsIn δ
StarsIn name
name
birthdate LIKE ‘%1960’
birthdate LIKE ‘%1960’
MovieStar
MovieStar
SQL Queries and Relational Algebra
(1)
• Example
SELECT Lname, Fname
FROM EMPLOYEE
WHERE Salary > ( SELECT MAX(Salary)
FROM EMPLOYEE
WHERE Dno = 5 )
• Inner block and outer block

ICS 424 - 01 (072) Query Processing and Optimization 32

Translating SQL Queries into Relational Algebra
SELECT LNAME, FNAME
FROM EMPLOYEE
WHERE > ( SELECT MAX (SALARY)
SALARY FROM EMPLOYEE
WHERE DNO = 5);

SELECT LNAME, FNAME SELECT MAX (SALARY)

FROM EMPLOYEE FROM EMPLOYEE
WHERE SALARY > C WHERE DNO = 5

πLNAME, FNAME (σSALARY>C(EMPLOYEE)) ℱMAX SALARY (σDNO=5 (EMPLOYEE))

ICS 424 - 01 (072) Query Processing and Optimization 33

SQL Queries and Relational Algebra
(2)

• Uncorrelated nested queries Vs Correlated nested queries

• Example
Retrieve the name of each employee who works on all the projects controlled
by department number 5.

SELECT FNAME, LNAME

FROM EMPLOYEE
WHERE ( (SELECT PNO
FROM WORKS_ON
WHERE SSN=ESSN)
CONTAINS
(SELECT PNUMBER
FROM PROJECT
WHERE DNUM=5) )
ICS 424 - 01 (072) Query Processing and Optimization 34
SQL Queries and Relational Algebra
• Example (3)
For every project located in ‘Stafford’, retrieve the project number,
the controlling department number and the department
manager’s last name, address and birthdate.
• SQL query:
SELECT P.NUMBER,P.DNUM,E.LNAME, E.ADDRESS, E.BDATE

FROM PROJECT AS P,DEPARTMENT AS D, EMPLOYEE AS E

WHERE P.DNUM=D.DNUMBER AND D.MGRSSN=E.SSN AND
P.PLOCATION=‘STAFFORD’;

• Relation algebra:
PNUMBER, DNUM, LNAME, ADDRESS, BDATE (((PLOCATION=‘STAFFORD’(PROJECT))
DNUM=DNUMBER (DEPARTMENT)) MGRSSN=SSN (EMPLOYEE))

ICS 424 - 01 (072) Query Processing and Optimization 35

SQL Queries and Relational Algebra
(4)

ICS 424 - 01 (072) Query Processing and Optimization 36

DLP - Examen - Revisado
80% (5)
DLP - Examen - Revisado
20 pages
Interview Help Guide
100% (3)
Interview Help Guide
7 pages
Chapter - 1 - Query Optimization
No ratings yet
Chapter - 1 - Query Optimization
38 pages
Module - 4
No ratings yet
Module - 4
60 pages
Chapter 2 Querry Proccessing
No ratings yet
Chapter 2 Querry Proccessing
7 pages
Chapter 1 Query Processing
No ratings yet
Chapter 1 Query Processing
58 pages
Chapter 2 - Query Processing and Optimization
100% (1)
Chapter 2 - Query Processing and Optimization
28 pages
CH - 2 Query Process
No ratings yet
CH - 2 Query Process
44 pages
Ch-2 Query Processing and Optimization
No ratings yet
Ch-2 Query Processing and Optimization
26 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
63 pages
Query Processing
No ratings yet
Query Processing
5 pages
Chapter 1 Query Processing
100% (1)
Chapter 1 Query Processing
63 pages
ADB Chapter 2
No ratings yet
ADB Chapter 2
40 pages
AMSAL
No ratings yet
AMSAL
58 pages
Chapter 5
No ratings yet
Chapter 5
45 pages
query_optimization_part1
No ratings yet
query_optimization_part1
52 pages
Chapter Two Query Processing (2)
No ratings yet
Chapter Two Query Processing (2)
60 pages
CH - 1 Query Process SW
No ratings yet
CH - 1 Query Process SW
43 pages
Query Processing and Optimization: Dessalegn Mequanint
No ratings yet
Query Processing and Optimization: Dessalegn Mequanint
31 pages
ADBChapter 1
No ratings yet
ADBChapter 1
32 pages
Chapter 2 Query processing and optimization [Autosaved]
No ratings yet
Chapter 2 Query processing and optimization [Autosaved]
35 pages
QUERY Processing and Relational Algebra
No ratings yet
QUERY Processing and Relational Algebra
27 pages
Ch-2 Query Processing and Optimization
No ratings yet
Ch-2 Query Processing and Optimization
21 pages
Lecture 20+Query+Processing+ +opt
No ratings yet
Lecture 20+Query+Processing+ +opt
22 pages
Chapter 2-Query Processing and Optimi
No ratings yet
Chapter 2-Query Processing and Optimi
43 pages
Query Optimization
No ratings yet
Query Optimization
103 pages
Chapter 4 Query Optimization
100% (2)
Chapter 4 Query Optimization
35 pages
CO3 Session 7
No ratings yet
CO3 Session 7
32 pages
ch2. pdf
No ratings yet
ch2. pdf
72 pages
Chapter 1 - Query Processing and Optimization
No ratings yet
Chapter 1 - Query Processing and Optimization
62 pages
2 Algorithms For Query Processing Optimization
No ratings yet
2 Algorithms For Query Processing Optimization
46 pages
DE_Module5_QueryOptimization
No ratings yet
DE_Module5_QueryOptimization
11 pages
Presentation9 - Query Processing and Query Optimization in DBMS
No ratings yet
Presentation9 - Query Processing and Query Optimization in DBMS
36 pages
CH 02
No ratings yet
CH 02
127 pages
Module 4 - 3 Bhargavi
No ratings yet
Module 4 - 3 Bhargavi
56 pages
Adb_ch2
No ratings yet
Adb_ch2
72 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
61 pages
2 Chapter 3 Query Optimization
No ratings yet
2 Chapter 3 Query Optimization
29 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
61 pages
Query Processing
No ratings yet
Query Processing
28 pages
Mod 7 - Query Optimization
No ratings yet
Mod 7 - Query Optimization
29 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
31 pages
Chapter 1 Query Processing
100% (1)
Chapter 1 Query Processing
45 pages
Chapter 2 Query Processing
No ratings yet
Chapter 2 Query Processing
21 pages
chapter 2
No ratings yet
chapter 2
47 pages
Module-4
No ratings yet
Module-4
8 pages
Chapter 1 Query Processing and Optimization
No ratings yet
Chapter 1 Query Processing and Optimization
40 pages
Unit-5 Query Processing and Optimization
No ratings yet
Unit-5 Query Processing and Optimization
40 pages
Itm661 Lecture03 Part2 2015
No ratings yet
Itm661 Lecture03 Part2 2015
47 pages
Advanced Database Systems: Chapter 3:query Processing and Evaluation
100% (1)
Advanced Database Systems: Chapter 3:query Processing and Evaluation
36 pages
Query Processing Concepts
No ratings yet
Query Processing Concepts
99 pages
ADBMS Notes
67% (3)
ADBMS Notes
48 pages
Chapter 2 - Query Optimization
No ratings yet
Chapter 2 - Query Optimization
40 pages
Chapter 2 Query Optimization
No ratings yet
Chapter 2 Query Optimization
31 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
24 pages
ADB Chapter 2 DB Part1
No ratings yet
ADB Chapter 2 DB Part1
10 pages
Chapter 1 Query Processing and Optimization
No ratings yet
Chapter 1 Query Processing and Optimization
108 pages
Chapter 1 Query Processing and Optimization
No ratings yet
Chapter 1 Query Processing and Optimization
129 pages
Chapter 6 - Query Processing and Optimization Algorithm
No ratings yet
Chapter 6 - Query Processing and Optimization Algorithm
27 pages
1 Intro Select Project
No ratings yet
1 Intro Select Project
28 pages
DBMS Lab Manual
From Everand
DBMS Lab Manual
Jitendra Patel
1.5/5 (3)
Basic DBA Query v.1: Oracle Database
From Everand
Basic DBA Query v.1: Oracle Database
Oraclesql-plsql
5/5 (1)
9-Keys in DBMS-05-08-2024
No ratings yet
9-Keys in DBMS-05-08-2024
14 pages
24-Multi-Level Indexing, Dynamic Multilevel Indexing, B-Tree-11-09-2024
No ratings yet
24-Multi-Level Indexing, Dynamic Multilevel Indexing, B-Tree-11-09-2024
40 pages
25-Hashing Techniques - 16-09-2024
No ratings yet
25-Hashing Techniques - 16-09-2024
39 pages
1 Unnamed 03 01 2024
No ratings yet
1 Unnamed 03 01 2024
10 pages
Forms and Event Handlers
No ratings yet
Forms and Event Handlers
5 pages
Web Programming
No ratings yet
Web Programming
2 pages
Web Technology Notes Btech Csvtu 6th Sem
No ratings yet
Web Technology Notes Btech Csvtu 6th Sem
14 pages
XCT 7 40vcc617fqee
No ratings yet
XCT 7 40vcc617fqee
294 pages
PL1 Language Reference Oct82
No ratings yet
PL1 Language Reference Oct82
235 pages
Test 1 CS111 Solution Ver 1
No ratings yet
Test 1 CS111 Solution Ver 1
13 pages
Unit2 1 PDF
No ratings yet
Unit2 1 PDF
14 pages
Memory and IO Interfacing - 1
No ratings yet
Memory and IO Interfacing - 1
7 pages
The Digital Age Odyssey: A Journey Through The Evolution and Impact of Computers
No ratings yet
The Digital Age Odyssey: A Journey Through The Evolution and Impact of Computers
2 pages
X20BC8083 Eng
No ratings yet
X20BC8083 Eng
5 pages
EZSource Training
No ratings yet
EZSource Training
32 pages
CN Module 3 PDF
No ratings yet
CN Module 3 PDF
52 pages
Unit 4
No ratings yet
Unit 4
10 pages
VCP WinCE50 ReleaseNotes
No ratings yet
VCP WinCE50 ReleaseNotes
3 pages
IBM Mainframe COBOL DB2 JCL CICS Tutorials
No ratings yet
IBM Mainframe COBOL DB2 JCL CICS Tutorials
21 pages
Dump State
No ratings yet
Dump State
10 pages
Frequently Asked Questions: 475 Field Communicator
No ratings yet
Frequently Asked Questions: 475 Field Communicator
5 pages
Sas Laptop History
No ratings yet
Sas Laptop History
1 page
Analogicas Opcionales Slot de Comunicaciones MAB221-ADB21-DAB21
No ratings yet
Analogicas Opcionales Slot de Comunicaciones MAB221-ADB21-DAB21
18 pages
Network and F Issue S
No ratings yet
Network and F Issue S
17 pages
Worksheet 3.1 - Internet Safety, Cyber Securit
No ratings yet
Worksheet 3.1 - Internet Safety, Cyber Securit
2 pages
Working With The HP T5710 Thin Client
No ratings yet
Working With The HP T5710 Thin Client
7 pages
Silantharajula Govinda Sai Mohan: Career Objective
No ratings yet
Silantharajula Govinda Sai Mohan: Career Objective
2 pages
Checkpoint.156-215.80.V2020-02-18.Q110: Show Answer
No ratings yet
Checkpoint.156-215.80.V2020-02-18.Q110: Show Answer
41 pages
Anr 6.01 (60100007) 20240129 193703 1611174316
No ratings yet
Anr 6.01 (60100007) 20240129 193703 1611174316
10 pages
106-Subsets With Duplicates (Easy) - Grokking The Coding Interview - Patterns For Coding Questions
No ratings yet
106-Subsets With Duplicates (Easy) - Grokking The Coding Interview - Patterns For Coding Questions
6 pages
Campus Area Network
No ratings yet
Campus Area Network
9 pages
ConclusionsPaper IMA India TechRupt Workshop March 2022
No ratings yet
ConclusionsPaper IMA India TechRupt Workshop March 2022
21 pages
How To Rock An Algorithms Interview
No ratings yet
How To Rock An Algorithms Interview
3 pages
Oracle Golden Gate (GG) vs. Oracle Stream: Sanjay Naik
No ratings yet
Oracle Golden Gate (GG) vs. Oracle Stream: Sanjay Naik
4 pages

29-Query Optimization-04-10-2024

Uploaded by

29-Query Optimization-04-10-2024

Uploaded by

Query Processing

Heuristic Query Optimization

• π marks ( σ marks >60 (student_marks))

• Query processing and query optimization

Heuristic: Rule that leads to the least cost in most

Query Tree (relational algebra expression)

leaf node :relations

a) Initial (canonical) query tree for SQL query Q.

b) Moving SELECT operations down the query tree.

c) Applying the more restrictive SELECT operation first.

d) Replacing CARTESIAN PRODUCT and SELECT with JOIN operations.

e) Moving PROJECT operations down the query tree.

Transformation should keep equivalence

SELECT <SelList> FROM <FromList> WHERE <Condition>

<Attribute> <RelName> <Tuple> IN <Query>

title StarsIn <Attribute> ( <Query> )

SELECT <SelList> FROM <FromList> WHERE <Condition>

<Attribute> <RelName> <Attribute> LIKE <Pattern>

name MovieStar birthDate ‘%1960’

<attribute> birthdate LIKE ‘%1960’ name

ICS 424 - 01 (072) Query Processing and Optimization 32

SELECT LNAME, FNAME SELECT MAX (SALARY)

πLNAME, FNAME (σSALARY>C(EMPLOYEE)) ℱMAX SALARY (σDNO=5 (EMPLOYEE))

ICS 424 - 01 (072) Query Processing and Optimization 33

• Uncorrelated nested queries Vs Correlated nested queries

SELECT FNAME, LNAME

FROM PROJECT AS P,DEPARTMENT AS D, EMPLOYEE AS E

ICS 424 - 01 (072) Query Processing and Optimization 35

ICS 424 - 01 (072) Query Processing and Optimization 36

You might also like