0% found this document useful (0 votes)
68 views5 pages

Dev Unit I

The document is a question bank for the course AD3301 – Data Exploration and Visualization at Anna University, focusing on exploratory data analysis (EDA). It covers fundamental concepts, significance, software tools, and various techniques related to EDA, including data transformation, aggregation, and visualization aids. The question bank includes both short answer and essay-type questions to assess understanding of EDA principles and practices.

Uploaded by

Divya Priya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
68 views5 pages

Dev Unit I

The document is a question bank for the course AD3301 – Data Exploration and Visualization at Anna University, focusing on exploratory data analysis (EDA). It covers fundamental concepts, significance, software tools, and various techniques related to EDA, including data transformation, aggregation, and visualization aids. The question bank includes both short answer and essay-type questions to assess understanding of EDA principles and practices.

Uploaded by

Divya Priya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd

2332 ACET

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA


SCIENCE

[Link]. – Artificial Intelligence and Data

Science Anna University Regulation:

2021

AD3301 – Data Exploration and

Visualization

II Year / III

Semester

QUESTION

BANK

AD3301_DEV
QUESTION BANK
AD3301 – DATA EXPLORATION AND VISUALIZATION

UNIT I EXPLORATORY DATA ANALYSIS


EDA fundamentals – Understanding data science – Significance of EDA – Making sense of
data – Comparing EDA with classical and Bayesian analysis – Software tools for EDA -
Visual Aids for EDA- Data transformation techniques-merging database, reshaping and
pivoting, Transformation techniques - Grouping Datasets - data aggregation – Pivot tables
and cross-tabulations.

PART – A

1. Define Exploratory Data Analysis (EDA)?

EDA is the process of examining and visualizing data to uncover patterns, trends, and
insights before more advanced analyses.
2. What is the significance of EDA in data science?
EDA is crucial in data science as it helps identify patterns, outliers, and data quality issues,
providing a foundation for further analysis.
3. Differentiate EDA from classical statistical analysis?
EDA focuses on visual exploration, while classical statistical analysis involves hypothesis
testing and parameter estimation.
4. Why is making sense of data important in EDA?
Making sense of data involves extracting meaningful information, enabling informed
decisions and insights.
5. Compare EDA with Bayesian analysis?

AD3301_DEV
EDA is non-parametric and exploratory, while Bayesian analysis incorporates prior
knowledge and updates probabilities based on new data.
6. Name two software tools commonly used for EDA?
Pandas and Matplotlib are commonly used tools for EDA in Python.
7. Define data transformation techniques in EDA?
Data transformation techniques include normalization, scaling, and handling missing values
to prepare data for analysis.

AD3301_DEV
8. What is the purpose of merging databases in EDA?
Merging databases combines datasets based on common identifiers to create a unified dataset
for analysis.
9. Differentiate between reshaping and pivoting in EDA?
Reshaping transforms data between wide and long formats, while pivoting reorganizes data to
create a new structure.
10. Define data aggregation in EDA?
Data aggregation involves summarizing grouped data using functions like sum, mean, or
count.
11. How do pivot tables aid in EDA?
Pivot tables facilitate multidimensional analysis and summarization of data in a tabular
format.
12. What visual aids are commonly used in EDA?
Histograms, box plots, scatter plots, and heatmaps are common visual aids in EDA for
understanding data distributions and relationships.
13. Define the concept of grouping datasets in EDA?
Grouping datasets involves creating subsets based on certain criteria, enabling focused
analysis on specific segments.
14. Why is cross-tabulation useful in EDA?
Cross-tabulation is useful in EDA for displaying the frequency distribution of variables in a
contingency table.
15. Name a transformation technique in EDA for handling outliers?
Winsorizing is a transformation technique that involves replacing extreme values with less
extreme values to handle outliers.
16. Define the term "data normalization" in EDA?
Data normalization in EDA is the process of rescaling variables to a standard range, typically
between 0 and 1.
17. What is the role of visual aids like violin plots in EDA?
Violin plots display the distribution of data, providing insights into both central tendency and
spread.
18. Define the concept of data scaling in EDA?
Data scaling in EDA involves transforming variables to have a similar scale, preventing
dominance by certain features.

AD3301_DEV
19. How does EDA contribute to data science projects?
EDA contributes by providing an initial understanding of data, guiding subsequent modeling
and analysis decisions.
20. Why are pivot tables and cross-tabulations useful in summarizing data?
Pivot tables and cross-tabulations provide a concise summary of data, making it easier to
identify patterns and trends across different dimensions.

PART – B

1. Explain the Purpose of EDA


2. Differentiate EDA from Classical Analysis
3. Illustrate Visual Aids in EDA
4. Describe Data Transformation in EDA
5. Explore the Significance of Grouping Datasets and how it aids in focused analysis.
6. Explain the Role of Data Aggregation
7. Illustrate the Application of Pivot Tables
8. Compare EDA with Bayesian Analysis:

AD3301_DEV

You might also like