Indexes

📘1. Indexing Overview


🧬 Origin of indexes
🟦 Structured Format ?
🧱 2. Types of Indexes
🌳 B-Tree Index
🌳 B+ Tree Index (enhanced version of B-Tree)
1.​ Clustered
2. Non-Clustered index
3.​ Composite (multi-column) index
4.​ Unique index
🧮 Bitmap Index
🗂️3. How Data is Stored Using Indexes
📊4. How do EXPLAIN and EXPLAIN ANALYZE help
in query analysis?
⚙️CPU cost
💽 I/O cost
🔢 Number of rows read
🔁 Execution steps (loops, join types)
🔍 Scan types (full table scan, index scan, index seek)
⚠️ 5. What are common indexing mistakes and
how to avoid them?

Note: I used the MySQL database with MySQL Workbench.

👍
Created By Ashokachari P, Good Luck
Origin of indexes

📂 Data Structures
This is the top-level category in computer science that refers to ways we organize and store data
for efficient access and modification.​
Examples include arrays, linked lists, stacks, queues, and trees.

└── 📂 Indexing Structures


A category of data structures used to speed up the retrieval of records in large
datasets—commonly used in database systems.​
Indexes reduce the need for full table scans, improving performance of queries.

└── 📂 Tree-Based Indexes


These are indexes that rely on tree-shaped hierarchical structures.​
They provide fast lookup, insert, delete, and range operations, typically in O(log n) time.

└── 📂 Search Trees


Search trees are specialized for storing sorted data and allow efficient searching.​
Their design ensures values are positioned in a way that supports binary-style decisions at each
level of the tree.
└── 📂 Balanced Trees
Balanced trees ensure that no branch of the tree becomes too deep, which could degrade
performance.​
These trees automatically balance themselves when data is inserted or deleted.

Balanced trees ensure:

●​ Optimal performance: O(log n) time for search/insert/delete.


●​ Better memory and I/O efficiency.​

├── 🌿 AVL Tree


●​ Named after inventors Adelson-Velsky and Landis.
●​ First self-balancing binary search tree.
●​ Rebalances the tree using rotations after insertions or deletions.
●​ Maintains a strict balance factor (height difference between left and right subtrees is at
most 1).
● Suitable for in-memory applications, but less suitable for disk-based systems due to rebalancing overhead.

├── 🌿 Red-Black Tree


●​ Another self-balancing binary search tree, but less strict than AVL.
●​ Balances the tree using coloring rules instead of strict height checking.
●​ Allows faster insertion/deletion than AVL (because it rebalances less often).
●​ Widely used in:
○​ Java TreeMap
○​ Linux kernel
○​ C++ STL map/set

└── 🌳 B-Tree (General)


●​ A multi-way search tree, where nodes can have more than two children.
●​ Designed specifically for disk-based systems to reduce I/O operations.
●​ Keeps data sorted and allows searches, sequential access, insertions, and deletions in
logarithmic time.
●​ Every node can contain multiple keys and children, unlike binary trees.

├── 🧱 B-Tree (Traditional)


●​ Both internal nodes and leaf nodes store keys and values.
●​ Searching can end at internal nodes.
● Supports point queries well but is less efficient for range queries.
●​ Used in some older file systems and early DBMS implementations.

└── 🧱 B+ Tree (Enhanced B-Tree)


●​ Most widely used tree structure in modern RDBMS (e.g., MySQL InnoDB, PostgreSQL,
SQL Server).
●​ Internal nodes contain only keys, no actual data.
●​ Leaf nodes store all data and are linked together for fast sequential/range access.
●​ Advantages:
○​ Efficient range scans
○​ Great for disk-based storage
○​ Supports index-only scans (since data is in leaves)

└── 📂 Bitmap-Based Indexes


Bitmap indexing is a technique used especially in data warehouses and OLAP systems.

●​ Unlike tree indexes, these use bitmaps (arrays of bits) to represent the presence of
values in a column.
●​ Best suited for columns with low cardinality (e.g., gender, status, yes/no).

└── 🧮 Bitmap Index


●​ For each distinct value in a column, a bit vector is created.​
Example: A gender column with values Male and Female will have two bitmaps.
●​ A 1 in a bitmap indicates the row contains that value; 0 means it does not.
●​ Efficient for:
○​ Complex Boolean conditions (AND, OR, NOT)
○​ Aggregation
○​ Filtering large datasets
●​ Great for analytical queries, but:
○​ Not ideal for frequent updates
○​ Not suited for high-cardinality columns
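
MySQL itself does not support bitmap indexes; they appear in databases such as Oracle. A minimal sketch of the Oracle-style syntax, assuming a hypothetical employee table with a gender column:

CREATE BITMAP INDEX idx_emp_gender ON employee(gender);  -- Oracle builds one bitmap per distinct gender value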

Don't Confuse:
Note: ✅ B+ Tree indexes are an enhanced version of B-Tree indexes, with important structural improvements that make them faster and more efficient for database indexing.
Clustered, non-clustered, unique, and composite indexes are all built on top of B+/B-Tree structures.
🔍 What Are Indexes?
Indexes are data structures used by relational databases to quickly locate rows without
scanning the entire table.
They work like a book's index — pointing you to the exact page (row) for a specific keyword
(column value).​
Indexes dramatically speed up SELECT queries but can add overhead to INSERT, UPDATE, and
DELETE operations.

1. Clustered index:
●​ Data rows themselves are stored in the leaf nodes of the B-Tree.
●​ The table's physical order on disk matches the index order.

🔹 Page
●​ A physical block of storage on disk or in memory.
● Typically a fixed size (e.g., 16 KB by default in MySQL/InnoDB; 8 KB in SQL Server and PostgreSQL)
●​ It is the smallest unit of I/O — the database reads/writes pages, not individual
rows.
●​ All B-Tree nodes (root, internal, leaf) are stored in pages.

👉 Think of a page as a container of rows or keys.


🔹 Node
●​ A logical structure inside the B-Tree: Root node, Internal node, Leaf node.
●​ Each node is stored in a page.
●​ Nodes contain:
■​ Keys (index values)
■​ Pointers to other nodes or to rows
■​ Metadata (e.g., number of keys, sibling pointers, etc.)

👉 A node is the role or structure, a page is the container that holds it.

How to create clustered index:

✅ In MySQL
●​ You CANNOT create a clustered index explicitly using CREATE INDEX.
●​ The clustered index is always tied to the PRIMARY KEY.
●​ You can only create it implicitly by declaring a PRIMARY KEY.
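
A minimal sketch, assuming a hypothetical employee table; declaring the PRIMARY KEY is what implicitly builds the clustered index:

CREATE TABLE employee (
    emp_id   INT NOT NULL,
    emp_name VARCHAR(100),
    dept_id  INT,
    email    VARCHAR(100),
    PRIMARY KEY (emp_id)      -- InnoDB builds the clustered B+ Tree on emp_id
);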
This is how the data is stored internally:

Note: This picture shows the enhanced B-Tree (B+ Tree) storage structure.

💡 Explanation of Diagram Structure:


●​ Root Node: Entry point, contains index keys and pointers to internal nodes.
●​ Internal Nodes: Help navigate to the right leaf page.
●​ Leaf Pages: Store actual table rows sorted by emp_id.

Each leaf page holds multiple rows, and the rows are physically sorted by emp_id.
2. Non-Clustered Index

✅ Key Idea:
●​ The index is separate from the table data.
●​ The leaf nodes store keys + row locators (not actual rows).

📦 How It Stores:

Example:
CREATE INDEX idx_dept_id ON employee(dept_id);
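
A query like the one below (an illustrative sketch) can then use idx_dept_id to find the matching row locators, and InnoDB looks up the full rows through the clustered index:

SELECT * FROM employee WHERE dept_id = 10;  -- index seek on idx_dept_id, then primary key lookup for each row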

3. Unique Index

✅ Key Idea:
●​ A Unique Index ensures that no two rows in a table have the same value in the indexed
column(s).
●​ It’s a constraint and a performance feature.

🧠 Use Case:
●​ Email addresses
●​ Social Security Numbers (SSNs)
●​ Mobile numbers
●​ Any other business key that must be unique​

📦 How It Stores:
●​ Same structure as clustered/non-clustered index
●​ Database rejects inserts/updates if duplicate keys are attempted.

🔍 Example: CREATE UNIQUE INDEX idx_email ON employee(email);
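
Once the unique index exists, a duplicate value is rejected; a hedged sketch with made-up values:

INSERT INTO employee (emp_id, email) VALUES (1, 'a@example.com');   -- succeeds
INSERT INTO employee (emp_id, email) VALUES (2, 'a@example.com');   -- fails with ERROR 1062: Duplicate entry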


4. Composite (Multi-Column) Index

✅ Key Idea:
●​ A composite index (also called a multi-column index) is an index created on two or
more columns of a table. It helps improve performance for queries that filter, join, or sort
using those columns together.

Benefits of Composite Indexes:

●​ Faster filtering on multiple columns


●​ Optimized JOIN, ORDER BY, and GROUP BY when columns match the index
●​ Reduce the need for multiple single-column indexes

📌 Use Composite Index When:


●​ Your query often filters by two or more specific columns
●​ You want to avoid creating multiple separate indexes (which take more space and
maintenance)

📦 How It Stores:

Example:
CREATE INDEX index_name ON table_name (column1, column2, ...);
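
A concrete sketch, assuming a hypothetical hire_date column on employee:

CREATE INDEX idx_dept_hire ON employee (dept_id, hire_date);

SELECT * FROM employee WHERE dept_id = 10 AND hire_date >= '2024-01-01';  -- can use both columns
SELECT * FROM employee WHERE dept_id = 10;                                -- can use the leading column
SELECT * FROM employee WHERE hire_date >= '2024-01-01';                   -- cannot use this index (leading column missing)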

Primary and foreign keys from an index perspective:


Primary key:
● When we create a table with a primary key, a unique index (BTREE) is created automatically.
● The primary key index is also the clustered index, meaning:
○ Table data is physically stored in the order of the primary key.

Foreign key:
● When we create a table with a foreign key, a non-unique index (BTREE) is created automatically on the foreign key column.
● Other indexes (called secondary indexes) refer to the primary key to locate full rows.
○ You can see which index is the primary and which are secondary with SHOW INDEXES FROM table_name;
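
A minimal sketch of both behaviors, using simplified hypothetical customers and orders tables:

CREATE TABLE customers (
    customer_id INT PRIMARY KEY            -- unique BTREE index, also the clustered index
);

CREATE TABLE orders (
    order_id INT PRIMARY KEY,
    customer_id INT,
    FOREIGN KEY (customer_id) REFERENCES customers(customer_id)  -- InnoDB adds a non-unique BTREE index on customer_id if none exists
);

SHOW INDEXES FROM orders;   -- shows PRIMARY plus the secondary index created for the foreign key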
Statistics
SHOW INDEXES FROM customers;

●​ Table: The table name.


●​ Non_unique: Whether the index can have duplicates (0 = unique, 1 = not unique).
●​ Key_name: The name of the index (e.g., PRIMARY, or a custom index name).
●​ Seq_in_index: The sequence of the column in the index (useful for multi-column
indexes).
●​ Column_name: The name of the indexed column.
●​ Collation: How the column is sorted in the index (A for ascending).
●​ Cardinality: Estimate of the number of unique values.
●​ Index_type: The type of index (e.g., BTREE, FULLTEXT, HASH).

✅ EXPLAIN Plan Output :


●​ Id: Step in the plan (1 = first)
●​ Select_type: Type of query (e.g., SIMPLE)
●​ Table: Table being accessed
●​ Type: Access type (const, ref, range, ALL, etc.)
●​ Possible_keys: Indexes that could be used
●​ Key: Index that is actually used
●​ Key_len: Length of the index used
●​ Ref: Column compared to the index
●​ Rows: Estimated number of rows scanned (1 is best for PK lookups)
●​ Filtered: Percentage of rows filtered (ideally 100.00)
● Extra: Any additional info (e.g., Using index)
●​ Note : there are 5000 records in the customer table.
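
A hedged example of producing such a plan (exact numbers depend on your data):

EXPLAIN SELECT * FROM customer WHERE customer_id = 100;
-- expected: type = const, key = PRIMARY, rows = 1 when customer_id is the primary key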
✅ EXPLAIN ANALYZE Plan Output :

●​ 🔍 Iterator / Operation → e.g., Index scan, Table scan, Nested loop​


●​ 📋 Table → The table being accessed​
●​ 📊 Rows examined → Actual rows read from table/index​
●​ 📤 Rows produced → Actual rows returned by this step​
●​ 🎯 Estimated rows → MySQL's estimate for rows​
●​ ⏱️ Actual time → Real start and end time for each step​
●​ 🔁 Loops → Number of times this operation was executed​
●​ 🧭 Access type → e.g., const, ref, eq_ref, ALL (full scan)​
●​ 🧮 Index used → Shows the index used for that step (if any)​
●​ 🔗 Join type → e.g., Nested Loop Join, Hash Join​
●​ 🎯 Filtered % → % of rows that passed the condition​
●​ 💰 Cost info → Includes estimated:​
○​ 🧠 CPU cost​

○​ 📦 I/O cost​

○​ 📈 Subtree cost​

●​ 🧠 Execution order → Step-by-step logical flow​


●​ 💾 Buffer info → Indicates if data was read from memory (buffer pool) or disk​
●​ 🧩 Condition pushed down → Whether filter conditions are pushed to storage engine
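
A hedged example (EXPLAIN ANALYZE is available from MySQL 8.0.18 and actually executes the query):

EXPLAIN ANALYZE SELECT * FROM customer WHERE customer_id = 100;
-- output is an iterator tree showing actual time, rows, and loops for each step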
Explain plan for the query with index:

Explain Analyze plan for the query with index:

Explain plan for the query without index:


Created another table by copying the customer table to distinguish.

CREATE TABLE customer_copy AS SELECT * FROM customer;

Note: we can see the attribute information in the above explain plan.
Explain Analyze plan for the query without index:

✅ What is Cost in MySQL?


Query cost is an estimated number that represents how "expensive" a query is to execute.​
The MySQL optimizer uses it to choose the fastest execution plan.

→ Think of cost as a score — lower is better.
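
One way to inspect the optimizer's cost estimates is the JSON plan format (a sketch; field names can vary slightly by MySQL version):

EXPLAIN FORMAT=JSON SELECT * FROM customer WHERE customer_id BETWEEN 100 AND 200;
-- the "cost_info" section reports read_cost, eval_cost, and query_cost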

💡 Factors That Affect Query Cost


●​ Number of Rows Examined​
The more rows the query scans, the higher the cost.​

●​ Table Size​
Larger tables take more time to process, increasing the cost.​

●​ Index Usage​
Using indexes reduces the cost. If no index is used, a full table scan increases cost
significantly.​

●​ Join Operations​
Queries with multiple joins, especially on large tables, increase the cost.​

●​ Sorting and Grouping​


Using ORDER BY, GROUP BY, or DISTINCT adds CPU and memory overhead.​

●​ Subqueries and Nested Queries​


Complex subqueries may cause the optimizer to perform more work, increasing the cost.​

●​ Functions on Columns​
Using functions in WHERE or JOIN clauses (e.g., YEAR(date_column) = 2024) can
prevent index use, increasing cost.​

●​ Data Distribution (Selectivity)​


Highly selective conditions (e.g., customer_id = 1) cost less than broad ones (e.g.,
gender = 'M' if 50% of rows match).​

●​ Temporary Tables / Disk Usage​


If MySQL needs to create temp tables or use disk (e.g., due to a big sort or join), cost goes
up.​

●​ Loops and Execution Steps​


More operations (like multiple nested loops) make the query more expensive to execute.

1. Table Scans
🔍 1. Full Table Scan:
🔸 What is it?
●​ MySQL reads every row in the table.
●​ Happens when there is no usable index for the condition.

⚠️ Performance:
●​ Slowest, especially on large tables.
●​ Avoid for large datasets unless absolutely needed.

🧪 Example:
SELECT * FROM customer WHERE first_name = 'John';

Note: If first_name has no index → full table scan.

🔍 2. Full Index Scan (type = index)


🔸 What is it?
●​ MySQL reads every entry in an index instead of the full table.
●​ Rows are read in index order (efficient for ORDER BY).
●​ Still reads all rows, just through the index structure (e.g., B-Tree).

⚠️ Performance:
●​ Better than a full table scan because index data is smaller and already sorted.
●​ Still not ideal for filtering — it’s just reading everything in index order.

🧪 Example:
SELECT first_name FROM customer;
Note: If first_name is indexed and the query only needs columns in that index, MySQL can scan the index instead of the table.

🔍 3. Index Range Scan (type = range)


🔸 What is it?
●​ MySQL uses an index to read a range of rows.
●​ Common with conditions like >, <, BETWEEN, or LIKE 'abc%'.

✅ Performance:
●​ Very efficient if the range is selective.

🧪 Example:
SELECT * FROM customer WHERE customer_id BETWEEN 100 AND 200;

🔍 4. Index Lookup (Exact Match) (type = ref, eq_ref, or const)


🔸 What is it?
●​ MySQL uses an index to directly find matching rows.
●​ Most efficient — especially when using PRIMARY or UNIQUE keys.

✅✅ Best Performance
🧪 Example:
SELECT * FROM customer WHERE customer_id = 100;

💾2. I/O Cost (Input/Output Cost)


🔍 What is it?
●​ I/O cost is the estimated cost of reading data from disk or memory.
●​ MySQL uses disk I/O when it reads rows, index blocks, or temporary tables that are not in
the buffer pool (memory cache).​

📚 Examples of I/O operations:


●​ Reading table rows (full scan or range scan)
●​ Reading index pages (B-Tree nodes)
●​ Sorting large datasets into temporary files
●​ Creating temporary tables on disk for GROUP BY, ORDER BY, etc.

⚠️ High I/O cost = more disk access = slower query


🧪 Real-world analogy:
Imagine searching for a page in a physical book (disk) vs a digital note on your phone
(memory). Disk is slower.

✅ How to reduce I/O cost:


●​ Use indexes to reduce row scans
●​ Increase InnoDB buffer pool size to cache more data in memory
● Use COVERING INDEXES so MySQL doesn’t need to go back to the table (see the sketch after this list)
●​ Optimize queries to avoid temporary tables and filesorts
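
A sketch of a covering index for the earlier first_name query; because every selected column is in the index, MySQL can answer it with an index-only scan:

CREATE INDEX idx_first_name ON customer (first_name);
SELECT first_name FROM customer WHERE first_name = 'John';  -- EXPLAIN shows "Using index" in the Extra column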

🧠3. CPU Cost


🔍 What is it?
●​ CPU cost is the estimated processing time required by MySQL to:
○​ Evaluate expressions and conditions
○​ Apply filters (WHERE)
○​ Perform sorting, aggregation, grouping
○​ Process joins and result sets

🧮 CPU cost is mainly affected by:


●​ Number of rows read
●​ Complexity of filtering and joining logic
●​ Functions or expressions (e.g., LOWER(name))
●​ Sorting or grouping large datasets

🧪 Real-world analogy:
Disk = slow bookshelf (I/O), CPU = your brain processing the info.

✅ How to reduce CPU cost:


●​ Avoid complex expressions and functions in WHERE, JOIN
●​ Fetch only needed columns
●​ Use indexed filters to reduce rows early
●​ Avoid sorting/grouping large sets unless needed
🔁4. Loop Count (Nested Loop Iterations)
🔍 What is it?
●​ Shows how many times a part of the plan is repeated, usually in a nested loop join.
●​ If you're joining two tables, and the inner loop runs for each row in the outer loop, this count
can grow large.

📌 Example:
Suppose you join customer (1000 rows) with orders (10000 rows):

Example:

SELECT * FROM customer

JOIN orders ON customer.customer_id = orders.customer_id;

●​ MySQL might loop over 1000 customers, and for each one, scan matching orders.
●​ If poorly indexed, it might check orders 1000 times = high loop count.​

⚠️ High loop count = performance bottleneck


✅ How to reduce loops:
● Add indexes on join keys (see the sketch after this list)
●​ Avoid nested subqueries that loop per row
●​ Flatten queries when possible (use joins instead of correlated subqueries)
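
For the customer/orders join above, a minimal sketch of the first suggestion (assuming orders.customer_id is not yet indexed, e.g., because no foreign key was declared):

CREATE INDEX idx_orders_customer_id ON orders (customer_id);
-- the nested loop can now do an index lookup per customer instead of scanning orders repeatedly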

🔹5. Index Maintenance & Storage Cost


While indexes improve read performance, they come with costs that affect writes, storage, and
maintenance. Understanding these trade-offs is key.

⚙️ A. Write Performance Impact


Whenever you do:

●​ INSERT
●​ UPDATE (on indexed column)
●​ DELETE

MySQL has to:


●​ Update each index associated with the table.
●​ Re-balance B-Trees (used by most indexes, including BTREE).
●​ Possibly lock resources longer if concurrency is high.​

📌 Example:​
If you have 5 indexes on a table, each INSERT triggers 5 separate index updates.

🧱 B. Storage Overhead
Each index:

●​ Requires disk space.


●​ Grows with table size.
●​ May consume RAM in buffer pool.

🧮 Estimate:
●​ A table with 10 million rows and a 3-column composite index could consume hundreds of
MBs or more just for that one index.

🛠️ C. Extra Maintenance Requirements


When your data changes often (OLTP systems):

●​ You may need to regularly optimize or rebuild indexes.


●​ Use:

ANALYZE TABLE table_name; -- updates index statistics

OPTIMIZE TABLE table_name; -- reorganizes storage and index pages

📈 D. Too Many Indexes = Slower Writes + Larger Storage


📌 Rule of thumb:
Don’t index everything. Index only what helps your queries.

🔹6. Indexing Pitfalls (Common Mistakes to Avoid)


❌ A. Indexing Low Cardinality Columns
Low cardinality = few unique values​
E.g., gender, status, country_code

👉 Indexes are ineffective here:


●​ The optimizer knows filtering won’t reduce row access significantly.
●​ Full or large index scans are nearly as bad as full table scans.

📌 Use only in composite indexes where other columns are selective.

❌ B. Over-Indexing
Common mistake:

●​ Indexing every column "just in case"

Results in:

●​ Slower writes
●​ Confusing query optimizer (chooses suboptimal plans)
●​ High storage cost
●​ Maintenance burden​

👉 Audit your indexes:


SHOW INDEX FROM table_name;

And drop unused ones.

❌ C. Wrong Column Order in Composite Index


For index (col1, col2):

●​ Works for WHERE col1 = ? or WHERE col1 = ? AND col2 = ?


●​ ❌ Doesn't work for WHERE col2 = ? only
🧠 Remember: MySQL uses leftmost prefix rule.
❌ D. Functions Prevent Index Use
If you use a function on an indexed column:

WHERE LOWER(email) = '[email protected]'

👉 MySQL won’t use the index on email.


✅ Rewrite as:
WHERE email = '[email protected]'
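
If a case-insensitive lookup is genuinely required, MySQL 8.0.13+ supports functional index parts (a hedged sketch; note the extra parentheses):

CREATE INDEX idx_email_lower ON employee ((LOWER(email)));
-- now WHERE LOWER(email) = '...' can use this index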

❌ E. LIKE with Wildcard at the Start


WHERE name LIKE '%john'

●​ Index not used


●​ Full scan happens​

✅ Use:
WHERE name LIKE 'john%'

●​ Index can be used

❌ F. Multiple Indexes Matching a Query


If you have:

●​ Index A on (first_name)
●​ Index B on (last_name)​

Query:

SELECT * FROM users WHERE first_name = 'Ashok' AND last_name = 'Punugoti';

⚠️ MySQL may pick just one index and not use both effectively.
✅ Solution:
Create a composite index: INDEX (first_name, last_name)
