0% found this document useful (0 votes)

63 views3 pages

Data Warehouse Concepts and SQL Functions

The document discusses data warehousing concepts including dimensional modeling, fact and dimension tables, and schemas like star, snowflake, and hybrid. It defines dimensional modeling as subject-oriented, integrated, time-variant, and non-volatile. Fact tables contain numeric facts and dimension tables contain descriptive attributes used to analyze facts. Dimension tables are typically de-normalized while fact tables are highly normalized.

Uploaded by

RajeshCuddapah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views3 pages

Data Warehouse Concepts and SQL Functions

Uploaded by

RajeshCuddapah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

According to D.W.

Inmon :
DWH

Subject Oriented
Integrated
Time Varient
Non Volatile

D.W Implementation
Approach

Top Down Approach : D.W.Inmon

Bottom up Approach : Ralph Kimball

SQL Set Operators

Desgined Subject oriented to design

Analysis
Business info collected from various
sources
Allows to anlysis the data with time Eg:
Month, YOY
Once the data entered in DW cannot
change
First Develop EDW then Devlop
Datamarts
First Develop Datamarts then Develp
EDW

Syntax

UNION[ALL]
,MINUS,INTERSECT
String Functions
CONCAT

CONCAT(String1,String2 [..])

RTRIM

RTRIM('String')

LTRIM

LTRIN('String')

TRIM

TRIM('String')
SUBSTRING('String', Start integer, Length of
String)

SUBSTRING
Analytic function
COALESCE Similar to Case
IsNULL (allows only two
arguments)

Returns the first non-null expression in

the list
Replaces NULL with the specified
replacement value.

ROW_NUMBER

COALESCE ( expression [ ,...n ] )

ISNULL ( check_expression,
replacement_value )
ROW_NUMBER() OVER(ORDER BY Column
nam)

RANK

RANK() OVER (ORDER BY Col )

DENSE_RANK
ROW_NUMBER with
PARTITION

DENSE_RANK() OVER (ORDER BY Col )

ROW_NUMBER() OVER(PARTITION BY Col
ORDER BY Col ASC)

Ranking within your ordered partition

No ranks are skipped if there are ranks
with
multiple items
This is kinda like using a Row_number
with Group by

eg:CONVERT(VARCHAR(19),GETDATE())

General function that

converts an expression of one data
type to another

CONVERT()

Returns the sequential number

Null Functions
ISNULL(), NVL(), IFNULL() and
COALESCE()
Joins
Inner Join

Returns all rows when there is at least

one match in BOTH tables

Dimension Table

Return all rows from the left table, and

the matched rows from the right table
Return all rows from the right table,
and the matched rows from the left
table
Return all rows when there is a match
in ONE of the tables
If a table contains primary keys and it
gives the detailed info about business
then such a table
- Entry Points to the fact tables
- Typically in De-Normalized form
- Generally Static and descriptive fields
- Typically used by Group by in SQL
- Typically either Primarkey and
Dimensional attribu

Fact Table

A fact table which contains foreign keys

to dimension tables and numeric facts
- The term FACT represents a single
business measure. E.g. Sales, Qty Sold
- Facts can be detailed level facts or
summarized facts
- Typically the MOST NORMALIZED
TABLE in a dimensional model
- contain HUGE DATA VOLUMES
running into millions of rows

LEFT JOIN

RIGHT JOIN
FULL JOIN

Types of Dimension tables

Degnerated Dimension

Junk Dimeonsions

Slowley Changing Dimension

Conformed Dimension
Fast Chaning Dimensions

data that is dimensional in nature but

stored in a fact table
contain miscellaneous data like flags,
gender, text values etc which is not
useful for reporting
If the data values are changed slowly in
a column or in a row over the period of
time then that dimension table
If Dimension table shared with multiple
fact tables
Changes very fast Eg: Acc Bal,Income
etc

Types of Facts
Additive
Semi Additive
Non Additive
Transformations

Measures that can be added across all

dimensions
Measures that can be added across few
dimensions and not with others
Measures that cannot be added across
all dimensions
It is the process of transforming the
data into a required business format

Data Aggregation

process of integrating the data from

multiple input
Process of removing unwanted/error
out/inaccurate data
multiple detailed values are
summarized into a single unit

Data Purging

Earsing of the data completely

Data Profiling

examining data available from an

existing information source,collecting
stastics and summerise the data with
Aggrate functions (Min,MAX,AVG)

Data Merging
Data Cleansing/Scrubbing

Schemas

Star Schema

Snowflake Schema
Hybrid Schema/Galaxy
Schema/
Fact constellation

It has single fact table connected to

dimension tables like a star.
- The star schema is highly
denormalized
- Simple structure -> easy to
understand schema
- Relatively long time of loading data
into dimension tables (reduendent of
data)
- Performance less compare to Snow
flake
It is an extension of the star schema.In
snowflake schema, very large
dimension tables are normalized into
multiple tables. It is used when a
dimensional table becomes very big
- Highly normalized
- Complex compare to star schema
- Less time to load (due to normalized
data)
- Very good in performance
Combination of Star and Snowflake
schema

Data Warehouse Design Techniques Explained
No ratings yet
Data Warehouse Design Techniques Explained
38 pages
Data Warehouse Design Principles
No ratings yet
Data Warehouse Design Principles
75 pages
SQL Set Operations and Data Management
No ratings yet
SQL Set Operations and Data Management
11 pages
Data Warehouse Interview Guide
No ratings yet
Data Warehouse Interview Guide
7 pages
Data Warehouse Course Overview and Schema Types
No ratings yet
Data Warehouse Course Overview and Schema Types
38 pages
Dimensional Modeling Guide
No ratings yet
Dimensional Modeling Guide
9 pages
Advanced SQL Techniques for Efficiency
No ratings yet
Advanced SQL Techniques for Efficiency
12 pages
Understanding Data Warehousing Concepts
No ratings yet
Understanding Data Warehousing Concepts
5 pages
SQL and QlikView Data Modeling Guide
No ratings yet
SQL and QlikView Data Modeling Guide
6 pages
Factless Fact Tables Explained
No ratings yet
Factless Fact Tables Explained
5 pages
Unit 3 OLAP and OLTP
No ratings yet
Unit 3 OLAP and OLTP
64 pages
SSAS Interview Prep Guide
No ratings yet
SSAS Interview Prep Guide
7 pages
SQL Server BI Interview Questions Guide
No ratings yet
SQL Server BI Interview Questions Guide
8 pages
Data Warehouse Concepts and Approaches
No ratings yet
Data Warehouse Concepts and Approaches
39 pages
Data Warehouse Ques
No ratings yet
Data Warehouse Ques
10 pages
Data Warehousing & BI Insights
No ratings yet
Data Warehousing & BI Insights
6 pages
Data Warehouse Design and Concepts
No ratings yet
Data Warehouse Design and Concepts
37 pages
SQL Interview Questions Explained
No ratings yet
SQL Interview Questions Explained
4 pages
SQL and ETL Concepts Overview
No ratings yet
SQL and ETL Concepts Overview
7 pages
Understanding Data Warehousing Concepts
No ratings yet
Understanding Data Warehousing Concepts
11 pages
Interview Questions and Answar
No ratings yet
Interview Questions and Answar
22 pages
BI - Chap 3 - Data Warehouses Design
No ratings yet
BI - Chap 3 - Data Warehouses Design
54 pages
ETL and Data Warehouse Concepts Explained
50% (2)
ETL and Data Warehouse Concepts Explained
149 pages
Data Warehousing Essentials
No ratings yet
Data Warehousing Essentials
29 pages
DWH Unit 2
No ratings yet
DWH Unit 2
13 pages
Power BI DAX Training Manual
No ratings yet
Power BI DAX Training Manual
12 pages
DWBI Interview Questions & Answers Guide
100% (1)
DWBI Interview Questions & Answers Guide
9 pages
Business Intelligence Interview Questions and Answer
No ratings yet
Business Intelligence Interview Questions and Answer
12 pages
Data Warehouse Schema Types Explained
No ratings yet
Data Warehouse Schema Types Explained
12 pages
Data Modeling and Warehouse Schemas
No ratings yet
Data Modeling and Warehouse Schemas
11 pages
Data Warehousing Concepts Overview
No ratings yet
Data Warehousing Concepts Overview
41 pages
DWM Unit-Ii Notes
No ratings yet
DWM Unit-Ii Notes
27 pages
Difference Between in SQL Interview Questions: Primary Key Foreign Key
No ratings yet
Difference Between in SQL Interview Questions: Primary Key Foreign Key
4 pages
Understanding Data Warehousing Concepts
100% (1)
Understanding Data Warehousing Concepts
44 pages
Data Warehouse Fundamentals Explained
No ratings yet
Data Warehouse Fundamentals Explained
24 pages
Data Warehousing: Dimension & Fact Tables
No ratings yet
Data Warehousing: Dimension & Fact Tables
2 pages
Data Warehouse Schema
No ratings yet
Data Warehouse Schema
10 pages
OBIEE - Quick Guide
No ratings yet
OBIEE - Quick Guide
78 pages
Data Warehousing Essentials
No ratings yet
Data Warehousing Essentials
28 pages
Understanding Data Warehousing Concepts
No ratings yet
Understanding Data Warehousing Concepts
14 pages
Star Schema and Data Warehousing Guide
No ratings yet
Star Schema and Data Warehousing Guide
15 pages
Datawarehousing Top50 Interview Questions
No ratings yet
Datawarehousing Top50 Interview Questions
10 pages
Data Warehousing Exam Guide
No ratings yet
Data Warehousing Exam Guide
4 pages
Expt 2 - 2-1
No ratings yet
Expt 2 - 2-1
31 pages
SQL Normalization and Query Techniques
No ratings yet
SQL Normalization and Query Techniques
58 pages
ETL Testing - Concepts - V24
No ratings yet
ETL Testing - Concepts - V24
60 pages
U.S. Healthcare Industry DAX Functions Guide
No ratings yet
U.S. Healthcare Industry DAX Functions Guide
14 pages
Understanding Data Warehousing Concepts
No ratings yet
Understanding Data Warehousing Concepts
11 pages
Understanding Data Warehousing Concepts
No ratings yet
Understanding Data Warehousing Concepts
11 pages
Understanding Database Schemas and Cubes
No ratings yet
Understanding Database Schemas and Cubes
25 pages
Denormalization and Star Schema Guide
No ratings yet
Denormalization and Star Schema Guide
27 pages
SQL Server 2012 BI Course Guide
No ratings yet
SQL Server 2012 BI Course Guide
69 pages
MVA Implementing A Data Warehouse With SQL Jump Start Mod 1 Final
No ratings yet
MVA Implementing A Data Warehouse With SQL Jump Start Mod 1 Final
37 pages
Understanding Database Management Systems
No ratings yet
Understanding Database Management Systems
44 pages
1.1 (Dimensional Modelling)
No ratings yet
1.1 (Dimensional Modelling)
51 pages
Very Short Notes
No ratings yet
Very Short Notes
13 pages
100 BI Analyst Interview Questions
No ratings yet
100 BI Analyst Interview Questions
109 pages
Dimensional Modeling Fundamentals Guide
100% (1)
Dimensional Modeling Fundamentals Guide
19 pages
Apollo MX20 User's Guide Overview
No ratings yet
Apollo MX20 User's Guide Overview
82 pages
Compal HEL80/81 Schematics Overview
No ratings yet
Compal HEL80/81 Schematics Overview
43 pages
Huawei NetEngine AR600 Series Enterprise Routers Datasheet
No ratings yet
Huawei NetEngine AR600 Series Enterprise Routers Datasheet
8 pages
Understanding Symbolic Logic Basics
No ratings yet
Understanding Symbolic Logic Basics
22 pages
Types of Distributed DBMSs
No ratings yet
Types of Distributed DBMSs
10 pages
Pinnerformer: Sequence Modeling For User Representation at Pinterest
No ratings yet
Pinnerformer: Sequence Modeling For User Representation at Pinterest
11 pages
Research On Consumer's Impulse Buying Behaviour.: Required
No ratings yet
Research On Consumer's Impulse Buying Behaviour.: Required
3 pages
ADC Services 5.10.4.1 Reference - 20240415
No ratings yet
ADC Services 5.10.4.1 Reference - 20240415
122 pages
Image Restoration Techniques Explained
No ratings yet
Image Restoration Techniques Explained
3 pages
Contrarian ETF Strategy Using VRP
No ratings yet
Contrarian ETF Strategy Using VRP
16 pages
Bugtong, Cyrene M. - Reader Response No.1
No ratings yet
Bugtong, Cyrene M. - Reader Response No.1
1 page
Part One
No ratings yet
Part One
10 pages
CSS Text Changing Animation Guide
No ratings yet
CSS Text Changing Animation Guide
4 pages
Ethics of Artificial Intelligence and Robotics
100% (1)
Ethics of Artificial Intelligence and Robotics
41 pages
SPLDS-PPT 1
100% (1)
SPLDS-PPT 1
92 pages
Observation Made During Inspection at Amiga Informatis Pvt. Ltd. On 29-11-2024
No ratings yet
Observation Made During Inspection at Amiga Informatis Pvt. Ltd. On 29-11-2024
2 pages
Dialect Harmonization Using Text-To-Speech-Audio T
No ratings yet
Dialect Harmonization Using Text-To-Speech-Audio T
5 pages
ArtiosCAD 24 - Complete Documentation
100% (1)
ArtiosCAD 24 - Complete Documentation
2,187 pages
Mini Tesla Coil Project PDF
No ratings yet
Mini Tesla Coil Project PDF
11 pages
B Auto 200 Service Manual
No ratings yet
B Auto 200 Service Manual
53 pages
1MRK504159-UEN B en Technical Manual Transformer Protection RET650 2.1
No ratings yet
1MRK504159-UEN B en Technical Manual Transformer Protection RET650 2.1
762 pages
Electribe Sampler PG E3 PDF
No ratings yet
Electribe Sampler PG E3 PDF
20 pages
Business Object Model
No ratings yet
Business Object Model
7 pages
Data Sheet: Graphics Displays and Interface/adaptor Cards
No ratings yet
Data Sheet: Graphics Displays and Interface/adaptor Cards
8 pages
BS en Iso 18589-7-2016
No ratings yet
BS en Iso 18589-7-2016
66 pages
E-Waste Management Trends in India
No ratings yet
E-Waste Management Trends in India
28 pages
Knowledge Sharing
No ratings yet
Knowledge Sharing
5 pages
Wireless Value Realization: Group
No ratings yet
Wireless Value Realization: Group
15 pages
L9000 Light Source-Service Guide-P11780 - A
No ratings yet
L9000 Light Source-Service Guide-P11780 - A
32 pages
Designing Switch Routers
No ratings yet
Designing Switch Routers
351 pages

Data Warehouse Concepts and SQL Functions

Uploaded by

Data Warehouse Concepts and SQL Functions

Uploaded by

According to D.W.

Top Down Approach : D.W.Inmon

SQL Set Operators

Desgined Subject oriented to design

Returns the first non-null expression in

COALESCE ( expression [ ,...n ] )

RANK() OVER (ORDER BY Col )

DENSE_RANK() OVER (ORDER BY Col )

Ranking within your ordered partition

General function that

Returns the sequential number

Returns all rows when there is at least

Return all rows from the left table, and

A fact table which contains foreign keys

Types of Dimension tables

Slowley Changing Dimension

data that is dimensional in nature but

Measures that can be added across all

process of integrating the data from

Earsing of the data completely

examining data available from an

It has single fact table connected to

You might also like