0% found this document useful (0 votes)

27 views8 pages

Day 2 - Functions and Grouping Data Deep Dive

Uploaded by

Linda

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views8 pages

Day 2 - Functions and Grouping Data Deep Dive

Uploaded by

Linda

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

🗓️ Day 2: Functions and Grouping Data Deep Dive 📊

The focus of Day 2 is on transforming and summarizing data, which is essential for reporting and
analysis. This involves using Functions (to change individual data values) and Aggregate
Functions (to calculate summaries of groups of data).

2.1 Single-Row Functions 🔢

Single-row functions operate on one row at a time and return one result per row. They can be
used anywhere a column or expression can be used (in SELECT, WHERE, ORDER BY).

Function
Function Description Example
Type
UPPER(), LOWER(), Changes casing of a SELECT UPPER(last_name)
Character FROM employees;
INITCAP() string.
SUBSTR(str, start, SELECT SUBSTR('Oracle', 1,
len) Extracts a substring. 3) FROM DUAL; (Returns 'Ora')

LENGTH()
Returns the number of SELECT LENGTH('SQL') FROM
characters. DUAL;
Rounds a number to a SELECT ROUND(45.923, 1)
Numeric ROUND(n, precision) specified number of
FROM DUAL; (Returns 45.9)
decimal places.
TRUNC(n, precision)
Truncates (cuts off) a SELECT TRUNC(45.923, 1)
number. FROM DUAL; (Returns 45.9)
Returns the current date
Date SYSDATE SELECT SYSDATE FROM DUAL;
and time on the server.
Returns the number of
MONTHS_BETWEEN(d1,
d2) months between two
dates.
Converts a date or
TO_CHAR(value, TO_CHAR(hire_date, 'YYYY-
Conversion format) number to a character MM-DD')
string.
TO_DATE(string, Converts a character TO_DATE('13-NOV-2025', 'DD-
format) string to a date. MON-YYYY')

SQL Functions and Grouping Data: A Deep Dive

Page 1: Introduction - The "What" and "Why"

At its core, SQL is a language for managing and manipulating sets of data. While simple SELECT
statements can retrieve raw data, the true analytical power of SQL is unlocked through
Functions and Grouping. These features transform SQL from a simple data retrieval tool into a
powerful engine for aggregation, summarization, and transformation.

The Core Problem They Solve:

Imagine a database table with millions of sales records. A question like "What was our total
revenue?" is impossible to answer by looking at individual rows. You need a way to collapse all
those rows into a single, meaningful value. This is the fundamental purpose of aggregation and
grouping.

 Functions perform operations on data, either on individual values (scalar functions) or on

sets of values (aggregate functions), to produce a new result.
 Grouping (GROUP BY) allows you to partition your dataset into distinct subsets, and then
apply aggregate functions to each subset, enabling comparisons and summaries across
categories.

Together, they allow you to answer complex business questions:

 "What is the average salary for each department?"

 "What is the total sales by region and by quarter?"
 "Who are our top 10 customers by total order value?"

This deep dive will dissect the types of functions, the mechanics of GROUP BY and HAVING, and
culminate in advanced grouping operations.

Page 2: A Taxonomy of SQL Functions

SQL functions are broadly categorized by their operating domain: single values vs. sets of
values.

1. Scalar Functions (Row-by-Row)

Scalar functions operate on a single value from a single row and return a single result for each
row processed. They do not change the number of rows returned.

 String Functions:
o UPPER(column_name), LOWER(column_name): Change case.
o LENGTH(column_name): Returns the length of a string.
o SUBSTRING(column_name, start, length): Extracts a portion of a string.
o TRIM(column_name): Removes leading and trailing spaces.
 Numeric Functions:
o ROUND(column_name, decimals): Rounds a number.
o CEIL(), FLOOR(): Rounds up or down to the nearest integer.
o ABS(column_name): Returns the absolute value.
 Date/Time Functions:
o YEAR(date_column), MONTH(), DAY(): Extract parts of a date.
o DATEADD(interval, number, date): Adds to a date.
o DATEDIFF(interval, start_date, end_date): Calculates the difference
between two dates.
o GETDATE(), NOW(): Returns the current date and time.

Example:

sql

SELECT
first_name,
UPPER(last_name) AS last_name_upper,
YEAR(birth_date) AS birth_year
FROM employees;

This processes each row individually, transforming the data without summarizing it.

2. Aggregate Functions (Set-Based)

Aggregate functions operate on a set of rows (a column from multiple rows) and return a single,
summarizing value. They are the cornerstone of data analysis in SQL.

 COUNT(*): Counts the number of rows in the set, including NULLs.

 COUNT(column_name): Counts the number of non-NULL values in a specific column.
 SUM(column_name): Calculates the total sum of a numeric column.
 AVG(column_name): Calculates the average of a numeric column.
 MIN(column_name), MAX(column_name): Finds the minimum and maximum value.
 STRING_AGG(column_name, separator): (In some DBMS like PostgreSQL/SQL
Server) Concatenates values from multiple rows into a single string.

Crucial Point: When you use an aggregate function in a SELECT clause without a GROUP BY, it
collapses the entire result set into a single row.

Example:

sql

SELECT
COUNT(*) AS total_employees,
AVG(salary) AS average_salary,
MAX(salary) AS highest_salary
FROM employees;

This query returns exactly one row, summarizing the entire employees table.
Page 3: The Mechanics of GROUP BY - Creating Subsets

The GROUP BY clause is what allows you to apply aggregate functions to subsets of your data. It
partitions the result set into groups of rows that have matching values in the specified column(s).
The aggregate function is then calculated for each group independently.

Syntax and Logic:

sql

SELECT column1, aggregate_function(column2)

FROM table
GROUP BY column1;

The Mental Model:

1. FROM: The database reads the entire table.

2. WHERE: (Optional) Filters out individual rows that do not meet the criteria.
3. GROUP BY: The remaining rows are sorted into "buckets" or "groups." Each unique
combination of the GROUP BY columns gets its own bucket.
4. SELECT: For each bucket, the SELECT clause outputs:
o The value of the GROUP BY column(s).
o The result of the aggregate function calculated only on the rows within that
bucket.

Example: Total Sales by Region

sql

SELECT
region,
SUM(sale_amount) AS total_sales
FROM sales
GROUP BY region;

Visualizing the Process:

sale_id region sale_amount

1 North 100

2 South 150

3 North 200

4 South 50
sale_id region sale_amount

The GROUP BY region creates two buckets:

 North Bucket: Rows 1 & 3 -> SUM(sale_amount) = 300

 South Bucket: Rows 2 & 4 -> SUM(sale_amount) = 200

Result:

region total_sales

North 300

South 200

Page 4: The HAVING Clause - The Filter for Groups

The WHERE clause filters rows before they are aggregated. But what if you want to filter the
results of the aggregation? This is the job of the HAVING clause.

WHERE vs. HAVING: A Critical Distinction

 WHERE: Filters individual rows based on column values. It cannot use aggregate functions.
 HAVING: Filters groups based on the results of aggregate functions. It cannot use regular
column values (unless they are in the GROUP BY).

Use Case: Find regions with total sales greater than 250.

sql

SELECT
region,
SUM(sale_amount) AS total_sales
FROM sales
GROUP BY region
HAVING SUM(sale_amount) > 250; -- Filter on the aggregate result

Following our previous example, the HAVING clause would eliminate the "South" group
(total_sales = 200) and only return the "North" group.

You can use both together: Find the total sales for the 'North' and 'South' regions, but only
show them if their total sales exceed 250.
sql

SELECT
region,
SUM(sale_amount) AS total_sales
FROM sales
WHERE region IN ('North', 'South') -- Row-level filter
GROUP BY region
HAVING SUM(sale_amount) > 250; -- Group-level filter

The Complete Logical Query Processing Order:

Understanding this order is key to mastering SQL:

1. FROM & JOINs

2. WHERE
3. GROUP BY
4. HAVING
5. SELECT (including window functions, which we'll touch on)
6. ORDER BY

Page 5: Advanced Grouping Concepts

1. Grouping Sets, ROLLUP, and CUBE

Sometimes, you need multiple levels of aggregation in a single query. Modern SQL provides
extensions to GROUP BY for this.

 GROUPING SETS: Allows you to specify multiple grouping lists. It's the foundation for
ROLLUP and CUBE.

sql

-- Get totals by (region), by (product), and a grand total (())

SELECT region, product, SUM(sales)
FROM sales_data
GROUP BY GROUPING SETS (
(region),
(product),
() -- Grand Total
);

ROLLUP: Creates a hierarchy of aggregates, from the most detailed to a grand total. It's perfect for
subtotals.

sql

-- Gets: (Year, Quarter), (Year), and Grand Total

SELECT YEAR(order_date) AS OrderYear, QUARTER(order_date) AS OrderQtr,
SUM(amount)
FROM orders
GROUP BY ROLLUP (OrderYear, OrderQtr);

 Result:

OrderYear OrderQtr SUM(amount)

2023 1 1000

2023 2 1500

2023 NULL 2500 <-- Subtotal for 2023

NULL NULL 2500 <-- Grand Total

 CUBE: Generates all possible combination of aggregates for the specified columns.

sql

-- Gets all combinations: (Region, Product), (Region), (Product), Grand Total.

SELECT Region, Product, SUM(sales)
FROM sales_data
GROUP BY CUBE (Region, Product);

2. The OVER() Clause - Window Functions (A Brief Preview)

While not strictly "grouping," the OVER() clause is the next evolutionary step in aggregation. It
allows you to perform aggregate calculations without collapsing the result set. You get aggregate
results alongside the original row-level data.

sql

SELECT
employee_id,
department,
salary,
AVG(salary) OVER (PARTITION BY department) AS avg_department_salary
FROM employees;

This query returns every employee, their salary, and alongside it, the average salary for their
entire department. The PARTITION BY within the OVER() clause acts like a "soft" GROUP BY that
doesn't reduce the rows.
Page 6: Summary and Best Practices

Summary:

 Scalar Functions transform data row-by-row.

 Aggregate Functions (SUM, AVG, COUNT) summarize a set of rows into a single value.
 GROUP BY is used to apply aggregate functions to subsets of data defined by one or more
columns.
 HAVING is the only way to filter the results of aggregate functions, acting as a filter for
groups created by GROUP BY.
 Advanced Grouping (ROLLUP, CUBE) and Window Functions (OVER()) provide
powerful tools for multi-level analysis and row-level aggregates.

Common Pitfalls and Best Practices:

1. GROUP BY Mismatch: Every column in the SELECT list that is not an argument to an
aggregate function must be included in the GROUP BY clause. This is the most common
error.
o Wrong: SELECT region, product, SUM(sales) FROM sales GROUP BY
region;
o Right: SELECT region, product, SUM(sales) FROM sales GROUP BY
region, product;
2. Filtering with HAVING instead of WHERE: Using HAVING to filter on non-aggregated
columns is inefficient. Always use WHERE for row-level filters to reduce the number of
rows the database has to group.
3. COUNT(*) vs. COUNT(column_name): Remember that COUNT(*) counts all rows, while
COUNT(column_name) counts only non-NULL values in that column. Choose the one that
matches your intent.
4. NULLs in Grouping: GROUP BY treats all NULL values as a single, separate group. Be
aware of this, as it can sometimes lead to an unexpected "NULL" group in your results.

By deeply understanding these concepts, you move from simply writing queries to architecting
them, allowing you to extract profound insights and build robust reporting directly from your
database.

Advanced SQL Techniques for Data Science
No ratings yet
Advanced SQL Techniques for Data Science
38 pages
SQL Aggregate Functions Avg, Max, Min, Count
No ratings yet
SQL Aggregate Functions Avg, Max, Min, Count
6 pages
LECTURE - 9 Aggregation and Grouping
No ratings yet
LECTURE - 9 Aggregation and Grouping
39 pages
SQL Grouping, Functions, and Joins - 251021 - 053852
No ratings yet
SQL Grouping, Functions, and Joins - 251021 - 053852
7 pages
Lecture 11 DMS
No ratings yet
Lecture 11 DMS
15 pages
DDD Lab 01 Support Material
No ratings yet
DDD Lab 01 Support Material
9 pages
SQL Group By & Aggregate Guide
No ratings yet
SQL Group By & Aggregate Guide
67 pages
Ora Final Material 2024
No ratings yet
Ora Final Material 2024
41 pages
SQL Notes PDF
No ratings yet
SQL Notes PDF
23 pages
Aggregation
No ratings yet
Aggregation
8 pages
Assignmet 3
No ratings yet
Assignmet 3
7 pages
Introduction to SQL Basics
No ratings yet
Introduction to SQL Basics
22 pages
Week11 Relational Algebra & SQL - Aggregation and Grouping Operation
No ratings yet
Week11 Relational Algebra & SQL - Aggregation and Grouping Operation
23 pages
What Are The Benefits of Using Cloud Services? How Does The DISTINCT Keyword Work in SQL? What Are Common Aggregate Functions in SQL?
No ratings yet
What Are The Benefits of Using Cloud Services? How Does The DISTINCT Keyword Work in SQL? What Are Common Aggregate Functions in SQL?
3 pages
Database Query Using SQL
No ratings yet
Database Query Using SQL
23 pages
CNG351 Lecture 10 DML Part 1
No ratings yet
CNG351 Lecture 10 DML Part 1
19 pages
SQL Notes
100% (1)
SQL Notes
42 pages
F. Aggregate Functions
No ratings yet
F. Aggregate Functions
3 pages
Dbms Lab 4
No ratings yet
Dbms Lab 4
7 pages
Module 6 - Notes
No ratings yet
Module 6 - Notes
5 pages
Introduction To Oracle Functions and Group by Clause
100% (2)
Introduction To Oracle Functions and Group by Clause
62 pages
Advanced SQL Concepts Explained
No ratings yet
Advanced SQL Concepts Explained
5 pages
Aggregate Functions in SQL Explained
No ratings yet
Aggregate Functions in SQL Explained
6 pages
SQL 1
No ratings yet
SQL 1
58 pages
An Introduction To SQL Functions (Slides)
No ratings yet
An Introduction To SQL Functions (Slides)
13 pages
Structured Query Language: Next Slide
No ratings yet
Structured Query Language: Next Slide
14 pages
SQL Notes
No ratings yet
SQL Notes
5 pages
Ch-1 IP Notes
No ratings yet
Ch-1 IP Notes
7 pages
SQL Aggregate Functions - Explore 5 Types of Functions
No ratings yet
SQL Aggregate Functions - Explore 5 Types of Functions
27 pages
SQL Functions for Developers
No ratings yet
SQL Functions for Developers
13 pages
Oup Func
No ratings yet
Oup Func
19 pages
DBMS Exp-4
No ratings yet
DBMS Exp-4
8 pages
Database Functions Lab Guide
No ratings yet
Database Functions Lab Guide
6 pages
Oracle SQL Built-in Functions Overview
No ratings yet
Oracle SQL Built-in Functions Overview
23 pages
Database Nest Quiz
No ratings yet
Database Nest Quiz
22 pages
Grouping and Aggregating Data: Module Overview
No ratings yet
Grouping and Aggregating Data: Module Overview
24 pages
DBMS Unit 4
No ratings yet
DBMS Unit 4
31 pages
Lab - 4 - Retrieving Data From Multiple Tables
No ratings yet
Lab - 4 - Retrieving Data From Multiple Tables
16 pages
IP XII Quick Notes - Querying in MYSQL
No ratings yet
IP XII Quick Notes - Querying in MYSQL
11 pages
SQL Guide Detailed
No ratings yet
SQL Guide Detailed
3 pages
IDAB Assignment 3: 1. Explain SQL Subqueries
No ratings yet
IDAB Assignment 3: 1. Explain SQL Subqueries
6 pages
Crack Your Data Engineering SQL Round
100% (1)
Crack Your Data Engineering SQL Round
112 pages
Chapter 11
No ratings yet
Chapter 11
35 pages
Interview - 7 - IMP
No ratings yet
Interview - 7 - IMP
26 pages
Self-Notes: Data Manipulation Using SQL
No ratings yet
Self-Notes: Data Manipulation Using SQL
4 pages
Learn SQL - Aggregate Functions Cheatsheet - Codecademy
No ratings yet
Learn SQL - Aggregate Functions Cheatsheet - Codecademy
3 pages
Exp 6 - 7 - 8
No ratings yet
Exp 6 - 7 - 8
26 pages
SQL Question
No ratings yet
SQL Question
99 pages
SQL Group By and Having Functions Guide
No ratings yet
SQL Group By and Having Functions Guide
12 pages
Clauses
No ratings yet
Clauses
8 pages
Group by
No ratings yet
Group by
3 pages
Labsheet 11
No ratings yet
Labsheet 11
10 pages
Functions
No ratings yet
Functions
13 pages
SQL PL SQL Queries q1 To q13
No ratings yet
SQL PL SQL Queries q1 To q13
15 pages
SQL Group By, Having, and Functions Guide
No ratings yet
SQL Group By, Having, and Functions Guide
25 pages
Dbms Lab
No ratings yet
Dbms Lab
36 pages
Blog_ Personalizing AP Invoice Routing in Oracle Cloud
No ratings yet
Blog_ Personalizing AP Invoice Routing in Oracle Cloud
8 pages
Oracle Application's Blog_ Report Bursting in Oracle Fusion
No ratings yet
Oracle Application's Blog_ Report Bursting in Oracle Fusion
6 pages
Oracle Application's Blog_ Oracle Fusion Tax Implementation_ How to Implement the Taxation in Oracle Fusion
No ratings yet
Oracle Application's Blog_ Oracle Fusion Tax Implementation_ How to Implement the Taxation in Oracle Fusion
6 pages
The AI Creator Course
0% (1)
The AI Creator Course
49 pages
Day 5 - Database Objects & Advanced Queries (DDL)
No ratings yet
Day 5 - Database Objects & Advanced Queries (DDL)
8 pages
Day 4-Data Manipulation & Transactions (DML)
No ratings yet
Day 4-Data Manipulation & Transactions (DML)
10 pages
Renting Out Assets
No ratings yet
Renting Out Assets
7 pages
Day 1-Getting Started & The SELECT Foundation Deep Dive
No ratings yet
Day 1-Getting Started & The SELECT Foundation Deep Dive
8 pages
Oracle Fusion Payroll Implementation Checklist To Apply
No ratings yet
Oracle Fusion Payroll Implementation Checklist To Apply
6 pages
Implementing Oracle Fusion Payroll Retropay
No ratings yet
Implementing Oracle Fusion Payroll Retropay
4 pages
Oracle Fusion Cash Management Setup
No ratings yet
Oracle Fusion Cash Management Setup
3 pages
Oracle Fusion Cloud Procurement: Purchasing With Redwood Interface
No ratings yet
Oracle Fusion Cloud Procurement: Purchasing With Redwood Interface
8 pages
Example of Modifying Invoice Approval Workflow Notifications Using Oracle Analytics Publisher
No ratings yet
Example of Modifying Invoice Approval Workflow Notifications Using Oracle Analytics Publisher
7 pages
Absence Attendance Type Customization
No ratings yet
Absence Attendance Type Customization
3 pages
SAE ARP 4754 - Certification Considerations For Aircraft Systems
100% (3)
SAE ARP 4754 - Certification Considerations For Aircraft Systems
88 pages
Grounding System
No ratings yet
Grounding System
3 pages
Netsoft PS9200T 1
No ratings yet
Netsoft PS9200T 1
10 pages
Bn81-23593e-01 Web G55TQB Eu Eng 231102.0
No ratings yet
Bn81-23593e-01 Web G55TQB Eu Eng 231102.0
39 pages
Biochemistry Equipment List
No ratings yet
Biochemistry Equipment List
15 pages
Causes and Solutions of Overfitting
No ratings yet
Causes and Solutions of Overfitting
1 page
Is Technology A Boon or A Curse
No ratings yet
Is Technology A Boon or A Curse
2 pages
63 9243 Rev E VLP 16 User Manual
No ratings yet
63 9243 Rev E VLP 16 User Manual
140 pages
Induction Generator
No ratings yet
Induction Generator
12 pages
Quickspecs: Aruba Airwave™ Visualrf™ Aruba Airwave™ Visualrf™ Product Overview
No ratings yet
Quickspecs: Aruba Airwave™ Visualrf™ Aruba Airwave™ Visualrf™ Product Overview
5 pages
hAP Lite - User Manuals - MikroTik Documentation
No ratings yet
hAP Lite - User Manuals - MikroTik Documentation
1 page
CCSP Exam Cram Domain 6 Handout
No ratings yet
CCSP Exam Cram Domain 6 Handout
142 pages
2010 Lancer PDF
No ratings yet
2010 Lancer PDF
592 pages
Digital Thesis Universitas Kristen Petra
No ratings yet
Digital Thesis Universitas Kristen Petra
5 pages
The Power Point - The Effective Email and Report Writing 2022
No ratings yet
The Power Point - The Effective Email and Report Writing 2022
72 pages
Batch and Serial Numbers
No ratings yet
Batch and Serial Numbers
38 pages
1991 - Eigenfaces For Recognition
No ratings yet
1991 - Eigenfaces For Recognition
16 pages
APS REPORT Harsh
No ratings yet
APS REPORT Harsh
16 pages
Qa10 Pi PC 200-8 (Lower Area)
No ratings yet
Qa10 Pi PC 200-8 (Lower Area)
1 page
Krones Replacement Parts Catalog
No ratings yet
Krones Replacement Parts Catalog
3 pages
Sai Fiber
100% (1)
Sai Fiber
3 pages
Printer-Friendly Grimdark Millennium - 40K Edition (Beta 18012025)
No ratings yet
Printer-Friendly Grimdark Millennium - 40K Edition (Beta 18012025)
35 pages
List Perlengkapan Newborn
No ratings yet
List Perlengkapan Newborn
7 pages
MPP A2 Final
No ratings yet
MPP A2 Final
26 pages
Calibration Unit 01.03.2011
No ratings yet
Calibration Unit 01.03.2011
428 pages
Rubina Pradhan: Dev0ps
No ratings yet
Rubina Pradhan: Dev0ps
4 pages
IT Cooling Full Product Catalogue 2022 2023
No ratings yet
IT Cooling Full Product Catalogue 2022 2023
27 pages
Chapter-2-Methods of Data Presentation
No ratings yet
Chapter-2-Methods of Data Presentation
16 pages
Web-Based Attendance System with QR Code
No ratings yet
Web-Based Attendance System with QR Code
9 pages
2015 Guide To WAN Architecture and Design
No ratings yet
2015 Guide To WAN Architecture and Design
44 pages

Day 2 - Functions and Grouping Data Deep Dive

Uploaded by

Day 2 - Functions and Grouping Data Deep Dive

Uploaded by

🗓️ Day 2: Functions and Grouping Data Deep Dive 📊

2.1 Single-Row Functions 🔢

SQL Functions and Grouping Data: A Deep Dive

The Core Problem They Solve:

 Functions perform operations on data, either on individual values (scalar functions) or on

Together, they allow you to answer complex business questions:

 "What is the average salary for each department?"

Page 2: A Taxonomy of SQL Functions

1. Scalar Functions (Row-by-Row)

2. Aggregate Functions (Set-Based)

 COUNT(*): Counts the number of rows in the set, including NULLs.

Syntax and Logic:

SELECT column1, aggregate_function(column2)

The Mental Model:

1. FROM: The database reads the entire table.

Example: Total Sales by Region

Visualizing the Process:

sale_id region sale_amount

The GROUP BY region creates two buckets:

 North Bucket: Rows 1 & 3 -> SUM(sale_amount) = 300

Page 4: The HAVING Clause - The Filter for Groups

WHERE vs. HAVING: A Critical Distinction

The Complete Logical Query Processing Order:

1. FROM & JOINs

Page 5: Advanced Grouping Concepts

1. Grouping Sets, ROLLUP, and CUBE

-- Get totals by (region), by (product), and a grand total (())

-- Gets: (Year, Quarter), (Year), and Grand Total

OrderYear OrderQtr SUM(amount)

2023 NULL 2500 <-- Subtotal for 2023

NULL NULL 2500 <-- Grand Total

-- Gets all combinations: (Region, Product), (Region), (Product), Grand Total.

2. The OVER() Clause - Window Functions (A Brief Preview)

 Scalar Functions transform data row-by-row.

Common Pitfalls and Best Practices:

You might also like