A Data Pipeline Should Address These Issues:: Topics To Study
No. | Topic | Links
1. | Indexes | https://docs.microsoft.com/en-us/sql/relational-databases/indexes/heaps-tables-without-clustered-indexes?view=sql-server-2017
   |  | https://www.red-gate.com/simple-talk/sql/learn-sql-server/sql-server-index-basics/
   |  | https://www.red-gate.com/simple-talk/sql/database-administration/brads-sure-guide-to-indexes/
2. | Data warehouse concepts | https://www.1keydata.com/datawarehousing/dimensional.html
3. | Practicing SQL | HackerRank (SQL)
   |  | LeetCode (SQL): premium is worth paying for, since many of the SQL questions are premium-only. I took premium for a month.
   |  | https://pgexercises.com/
o DISTSTYLE in Redshift
o Difference between relational data modelling and dimensional data modelling
o How to distribute storage when creating a table
o If I have a data model with a lot of dimensions, how can I simplify it?
https://stackoverflow.com/questions/27690617/star-schema-structure-to-many-dimensions
o SCD types. If I have a table with many attribute columns but only a few of them
change frequently, how can I capture these changes?
o Difference between OLTP and master data
https://metamug.com/article/difference-between-master-and-transaction-table.html
o How can we implement normalization?
o Table Questions
Find the cumulative sum of value from a table with columns dept, item and value.
From the same table, find the item with the maximum value in each dept.
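One possible approach to both table questions uses window functions. The sketch below runs the SQL through Python's sqlite3 module (window functions need SQLite 3.25 or later). The table name sales and the sample rows are hypothetical, and it assumes the cumulative sum is wanted per dept in item order, which the question does not specify.

```python
import sqlite3

# Hypothetical sample data; only the column names dept, item, value come from the question.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (dept TEXT, item TEXT, value INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("d1", "pen", 10), ("d1", "ink", 30), ("d2", "cup", 20), ("d2", "mug", 5)],
)

# Running total of value within each dept, accumulated in item order (an assumed ordering).
cumulative = conn.execute(
    """
    SELECT dept, item, value,
           SUM(value) OVER (PARTITION BY dept ORDER BY item) AS running_total
    FROM sales
    ORDER BY dept, item
    """
).fetchall()

# Item with the maximum value in each dept: rank rows per dept and keep rank 1.
top_items = conn.execute(
    """
    SELECT dept, item, value
    FROM (
        SELECT dept, item, value,
               RANK() OVER (PARTITION BY dept ORDER BY value DESC) AS rnk
        FROM sales
    )
    WHERE rnk = 1
    ORDER BY dept
    """
).fetchall()

print(cumulative)
print(top_items)
```

RANK() keeps ties, so two items sharing the maximum value in a dept would both be returned; ROW_NUMBER() would pick just one of them.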
o Create a table of fixtures from the table of countries below
Country
Ind
Aus
SA
Result:
c1 | c2
ind | aus
aus | sa
sa | ind
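The Result above pairs each country with the next one in the list and wraps the last back to the first. A minimal sketch that reproduces exactly that output follows, run through Python's sqlite3 module. The pos column is an assumption added to fix the row order (the source table has only Country), and the values are lowercased to match the Result. For more than three countries this does not give a full round-robin, which would need a self-join instead.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# pos is a hypothetical ordering column; SQL tables have no inherent row order.
conn.execute("CREATE TABLE countries (pos INTEGER, country TEXT)")
conn.executemany(
    "INSERT INTO countries VALUES (?, ?)",
    [(1, "ind"), (2, "aus"), (3, "sa")],
)

# LEAD fetches the next country; for the last row it is NULL,
# so COALESCE falls back to the first country (the wrap-around fixture).
fixtures = conn.execute(
    """
    SELECT country AS c1,
           COALESCE(LEAD(country) OVER (ORDER BY pos),
                    FIRST_VALUE(country) OVER (ORDER BY pos)) AS c2
    FROM countries
    ORDER BY pos
    """
).fetchall()

print(fixtures)
```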
o INPUT:
Output:
asin start_day end_day is_instock
a1 1 2 0
a1 3 4 1
a1 5 5 0
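The input table is not shown above, so the following is only a sketch under a loud assumption: the input is taken to be one row per asin and day with an is_instock flag (the table name daily_stock and its rows are hypothetical). Under that assumption, the Output looks like a "gaps and islands" result, which can be produced by grouping consecutive days that share the same flag.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Assumed input shape: one row per (asin, day); not given in the source.
conn.execute("CREATE TABLE daily_stock (asin TEXT, day INTEGER, is_instock INTEGER)")
conn.executemany(
    "INSERT INTO daily_stock VALUES (?, ?, ?)",
    [("a1", 1, 0), ("a1", 2, 0), ("a1", 3, 1), ("a1", 4, 1), ("a1", 5, 0)],
)

# Within each (asin, is_instock), day minus its row number is constant
# across a run of consecutive days, so it identifies each "island".
ranges = conn.execute(
    """
    SELECT asin, MIN(day) AS start_day, MAX(day) AS end_day, is_instock
    FROM (
        SELECT asin, day, is_instock,
               day - ROW_NUMBER() OVER (
                   PARTITION BY asin, is_instock ORDER BY day
               ) AS grp
        FROM daily_stock
    )
    GROUP BY asin, is_instock, grp
    ORDER BY asin, start_day
    """
).fetchall()

print(ranges)
```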
o There is a list of countries, say IND, PAK, CHN, AFG, SRI, BNG. Create
combinations of countries from this list using one query.
How do you handle duplicates such as IND-PAK and PAK-IND? This is where people
get stuck and cannot arrive at a solution or approach.
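One common way around the IND-PAK / PAK-IND duplicate is a self-join with a strict inequality, so each unordered pair appears once and no country is paired with itself. A minimal sketch via Python's sqlite3 module; the table name countries and column name code are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE countries (code TEXT)")
conn.executemany(
    "INSERT INTO countries VALUES (?)",
    [(c,) for c in ["IND", "PAK", "CHN", "AFG", "SRI", "BNG"]],
)

# a.code < b.code keeps only one orientation of every pair
# and also excludes self-pairs such as IND-IND.
pairs = conn.execute(
    """
    SELECT a.code AS c1, b.code AS c2
    FROM countries AS a
    JOIN countries AS b ON a.code < b.code
    ORDER BY c1, c2
    """
).fetchall()

print(len(pairs), pairs)
```

Six countries give 6 * 5 / 2 = 15 pairs. Replacing < with <> would return both orientations (30 rows).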
o Which date range has the most visitors?
TBL1: <start_dt> <end_dt>
TBL2: <date> <num_of_visitors>
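A possible approach is to join each visitor row to the range that contains its date, sum per range, and take the largest total. The sketch below uses Python's sqlite3 module with hypothetical sample rows; the date column of TBL2 is renamed visit_date here, and dates are stored as ISO strings so that BETWEEN compares them correctly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tbl1 (start_dt TEXT, end_dt TEXT)")
# visit_date stands in for the <date> column of TBL2 (renamed for clarity).
conn.execute("CREATE TABLE tbl2 (visit_date TEXT, num_of_visitors INTEGER)")
conn.executemany(
    "INSERT INTO tbl1 VALUES (?, ?)",
    [("2024-01-01", "2024-01-03"), ("2024-01-04", "2024-01-05")],
)
conn.executemany(
    "INSERT INTO tbl2 VALUES (?, ?)",
    [("2024-01-01", 10), ("2024-01-02", 20), ("2024-01-04", 50), ("2024-01-05", 5)],
)

# Sum visitors per range, then keep the range with the highest total.
busiest = conn.execute(
    """
    SELECT r.start_dt, r.end_dt, SUM(v.num_of_visitors) AS total_visitors
    FROM tbl1 AS r
    JOIN tbl2 AS v ON v.visit_date BETWEEN r.start_dt AND r.end_dt
    GROUP BY r.start_dt, r.end_dt
    ORDER BY total_visitors DESC
    LIMIT 1
    """
).fetchone()

print(busiest)
```

LIMIT 1 returns a single range even when totals tie; a RANK() window would be needed to return all tied ranges.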
o How do you delete duplicate records from a table that has no primary key?
For example, consider the table below
id
1
1
1
2
2
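One way to do this depends on the engine exposing some hidden row identifier. The sketch below assumes SQLite, whose implicit rowid distinguishes otherwise identical rows. Other engines would need a different handle (for example a ROW_NUMBER() over a CTE), so treat this as an illustration rather than a portable answer. The table name t is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# No primary key is declared; SQLite still assigns an implicit rowid to each row.
conn.execute("CREATE TABLE t (id INTEGER)")
conn.executemany("INSERT INTO t VALUES (?)", [(1,), (1,), (1,), (2,), (2,)])

# Keep the row with the smallest rowid for each id and delete the rest.
conn.execute(
    """
    DELETE FROM t
    WHERE rowid NOT IN (SELECT MIN(rowid) FROM t GROUP BY id)
    """
)

remaining = conn.execute("SELECT id FROM t ORDER BY id").fetchall()
print(remaining)
```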
o You have two tables:
A
id
1
1
1
1
1
B
id
1
1
SELECT COUNT(*) FROM A INNER JOIN B ON A.id = B.id
[ans] 2; the correct answer is 10
SELECT COUNT(*) FROM A LEFT OUTER JOIN B ON A.id = B.id
[ans] 5; the correct answer is 10
SELECT COUNT(*) FROM A RIGHT OUTER JOIN B ON A.id = B.id
[ans] 2; the correct answer is 10
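The count of 10 follows because every one of the 5 rows in A matches both rows in B (5 * 2 = 10), and no row on either side is left unmatched, so the outer joins add nothing. A quick check with Python's sqlite3 module is sketched below. Because older SQLite builds do not support RIGHT JOIN, the right outer join is written as the equivalent B LEFT OUTER JOIN A.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE A (id INTEGER)")
conn.execute("CREATE TABLE B (id INTEGER)")
conn.executemany("INSERT INTO A VALUES (?)", [(1,)] * 5)  # five rows with id = 1
conn.executemany("INSERT INTO B VALUES (?)", [(1,)] * 2)  # two rows with id = 1

inner_count = conn.execute(
    "SELECT COUNT(*) FROM A INNER JOIN B ON A.id = B.id"
).fetchone()[0]

left_count = conn.execute(
    "SELECT COUNT(*) FROM A LEFT OUTER JOIN B ON A.id = B.id"
).fetchone()[0]

# Equivalent to A RIGHT OUTER JOIN B: all rows of B are preserved.
right_count = conn.execute(
    "SELECT COUNT(*) FROM B LEFT OUTER JOIN A ON A.id = B.id"
).fetchone()[0]

print(inner_count, left_count, right_count)
```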
o You have table i.e. customer with details