Intro To Data and Data Science
Intro To Data and Data Science
@nandini-gangrade
1 The Different Data Science Fields
• Importance of Data:
• Professional Evolution:
• Statisticians from 25 years ago can seamlessly integrate into diverse fields with
modern technologies.
2: "Analysis vs Analytics"
• Analysis:
• Analytics:
• Data science relies on data, incorporating complex tools. Business analytics doesn't
solely rely on data.
• Infographic Overview:
• Rows answer key questions: When, Why, What techniques, Where applicable, How
implemented, Who does it.
• Columns:
5. Using ML techniques
• Infographic Purpose:
1: “When are Traditional data, Big Data, BI, Traditional Data Science and ML applied”
• Data Definition: Information in digital format, the basis for analysis and
decision-making.
• Types of Data:
• Example Explanation:
• Think of traditional data as a neatly organized Excel sheet and big data as a
vast, rapidly updating stream of information. Data science combines these
through Business Intelligence, traditional methods, and machine learning to
extract valuable insights.
• Key Points:
• Traditional Methods vs. Machine Learning: Both aim for predictive insights
but differ in the era of technology.
• Example Explanation:
Example: Analyzing survey responses about product preferences using various data sources.
Example: Extracting insights from vast social media text data for sentiment analysis.
Reports, Dashboards(KPI)
Example: Using BI tools to analyze historical sales data and optimize future strategies.
• Clustering and Time Series: Grouping data for meaningful patterns, tracking
values over time.
Example: Predicting future sales trends using time series data and regression models.
1. Programming Languages & Software Employed in Data Science – All the tools you need
• Software: Tools like Excel, SPSS, and specialized software address domain-
specific challenges.
• Limitations: Python and R might not suffice for certain domains, e.g.,
relational database management systems; SQL is preferred.
Example: Using Python for statistical computations and SQL for relational database queries
in data science projects.
1. Data Architect:
2. Data Engineer:
3. Database Administrator:
• Responsibility: Analyzes and reports past historical data for business insights.
5. BI Consultant:
6. BI Developer:
8. Data Analyst:
Example: A Data Engineer processing raw customer data for a Data Scientist to build a
predictive model.
• Correction: Big Data involves variety, variability, velocity, veracity, and other
characteristics, not just sheer volume.
• Clarification: Qualitative methods like SWOT are not quantitative but play a
crucial role in business strategy.
• Reality Check: These tools are successfully used in many companies by data
science teams.
Example: Understanding that Big Data is more than just a large volume and recognizing the
role of qualitative analysis in business intelligence.