Lecture 1 - Introduction To Data Science
Lecture 1 - Introduction To Data Science
Data Science
Data Science
What Is Data Science
• Data Science is about data gathering, analysis and decision-
making.
• Data science is a collection of techniques used to extract value 2
from data
• Data Science is about finding patterns in data, through analysis,
and make future predictions.
• By using Data Science, companies are able to make:
I. Better decisions (should we choose A or B)
II. Predictive analysis (what will happen next?)
III. Pattern discoveries (find pattern, or maybe hidden information
in the data)
Where is Data Science Needed?
▹ Machine Learning
▹ 4
Statistics
▹ Programming (Python or R)
▹ Mathematics
▹ Databases
Cont.
Here is how a Data Scientist works:
▹ Ask the right questions - To understand the business
problem.
▹ Explore and collect data - From database, web logs,
customer feedback, etc.
▹ Extract the data - Transform the data to a standardized 5
format.
▹ Clean the data - Remove erroneous values from the data.
▹ Find and replace missing values - Check for missing values
and replace them with a suitable value (e.g. an average
value).
▹ Normalize data - Scale the values in a practical range (e.g.
140 cm is smaller than 1,8 m. However, the number 140 is
larger than 1,8. - so scaling is important).
▹ Analyze data, find patterns and make future predictions.
▹ Represent the result - Present the result with useful
insights in a way the "company" can understand.
AI, MACHINE LEARNING, AND DATA
SCIENCE