0% found this document useful (0 votes)

63 views3 pages

Python Scenario Based Interview QA

The document presents various scenario-based Python interview questions tailored for freshers applying for data analysis roles. Each scenario includes a specific data-related challenge, such as cleaning data, analyzing sales, preparing datasets for churn prediction, identifying outliers, merging datasets, visualizing trends, and preparing categorical data for machine learning. The document provides sample code snippets and methodologies to address these scenarios effectively.

Uploaded by

gauri pingat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views3 pages

Python Scenario Based Interview QA

Uploaded by

gauri pingat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Scenario-Based Python Interview Questions for Data Analysis Roles (Freshers)

1. Scenario: You receive a CSV file with missing values, inconsistent casing, and duplicate

rows. How would you clean this data using Python?

Answer:

import pandas as pd

df = pd.read_csv('[Link]')

# Remove duplicates

df = df.drop_duplicates()

# Standardize casing (e.g., for a 'Name' column)

df['Name'] = df['Name'].[Link]()

# Handle missing values

df = [Link](method='ffill')

2. Scenario: You have a sales dataset with columns: Date, Product, and Revenue. How would

you find the top 3 products with the highest average monthly revenue?

Answer:

df['Date'] = pd.to_datetime(df['Date'])

df['Month'] = df['Date'].dt.to_period('M')

monthly_avg = [Link](['Month', 'Product'])['Revenue'].mean().reset_index()

top_products =

monthly_avg.groupby('Product')['Revenue'].mean().sort_values(ascending=False).head(3)

3. Scenario: How would you prepare customer data with demographic info and activity logs
for a churn prediction model?

Answer:

- Handle missing values

- Convert categorical features to numeric using pd.get_dummies()

- Normalize/scale numerical features

- Merge datasets if activity logs are separate

- Label churn (e.g., Churn = 1 if customer left, else 0)

4. Scenario: You suspect some products have incorrect prices in a dataset. How would you

identify and handle outliers?

Answer:

Q1 = df['Price'].quantile(0.25)

Q3 = df['Price'].quantile(0.75)

IQR = Q3 - Q1

outliers = df[(df['Price'] < Q1 - 1.5IQR) | (df['Price'] > Q3 + 1.5IQR)]

df = df[~[Link]([Link])]

5. Scenario: You have two datasets: [Link] and [Link]. How would you

combine them to analyze total spending per user?

Answer:

users = pd.read_csv('[Link]')

transactions = pd.read_csv('[Link]')

merged = [Link](users, transactions, on='user_id')

spending = [Link]('user_id')['amount'].sum()

6. Scenario: You have daily temperature data. How would you visualize trends and seasonal

patterns?
Answer:

import [Link] as plt

df['Date'] = pd.to_datetime(df['Date'])

df.set_index('Date', inplace=True)

[Link](figsize=(10,5))

[Link](df['Temperature'])

[Link]('Daily Temperature Trends')

[Link]('Date')

[Link]('Temperature')

[Link]()

7. Scenario: You have a column 'Country' with many categories. How would you prepare this

for machine learning?

Answer:

# Use OneHotEncoder or pd.get_dummies

df = pd.get_dummies(df, columns=['Country'], drop_first=True)

8. Scenario: Your dataset has a column 'Join_Date'. What features can you extract from it?

Answer:

df['Join_Date'] = pd.to_datetime(df['Join_Date'])

df['Year'] = df['Join_Date'].[Link]

df['Month'] = df['Join_Date'].[Link]

df['Weekday'] = df['Join_Date'].dt.day_name()

df['Join_Quarter'] = df['Join_Date'].[Link]

Bharat Dynamics
0% (1)
Bharat Dynamics
18 pages
IGCSE 2020 Jan Pastpaper!! Feel Free To Doenload
No ratings yet
IGCSE 2020 Jan Pastpaper!! Feel Free To Doenload
28 pages
Economics
No ratings yet
Economics
24 pages
IB Solar 3 KW
No ratings yet
IB Solar 3 KW
75 pages
HR Induction
No ratings yet
HR Induction
17 pages
May 22 P2R QP
No ratings yet
May 22 P2R QP
20 pages
June 2021 QP
No ratings yet
June 2021 QP
24 pages
4ec1 01r Que 20250515
No ratings yet
4ec1 01r Que 20250515
21 pages
INDICES
No ratings yet
INDICES
126 pages
Economics: Tuesday 21 May 2024
No ratings yet
Economics: Tuesday 21 May 2024
24 pages
FN7397605248
No ratings yet
FN7397605248
1 page
Studio Bagru: Social Entrepreneurship Insights
No ratings yet
Studio Bagru: Social Entrepreneurship Insights
2 pages
Unit-5 SEO
No ratings yet
Unit-5 SEO
14 pages
Derrick Oisd STD 190 Modified
No ratings yet
Derrick Oisd STD 190 Modified
33 pages
Top Citation Sources by Region
No ratings yet
Top Citation Sources by Region
32 pages
ACCA Brochure09.09.2024
No ratings yet
ACCA Brochure09.09.2024
15 pages
Kek
No ratings yet
Kek
5 pages
Corizo Bengaluru: Google My Business Tips
No ratings yet
Corizo Bengaluru: Google My Business Tips
64 pages
Valentine's Marketing Guide
No ratings yet
Valentine's Marketing Guide
35 pages
Office Cleaning 3
100% (1)
Office Cleaning 3
2 pages
Attire May
No ratings yet
Attire May
2 pages
Social Media Marketing Mastery
No ratings yet
Social Media Marketing Mastery
15 pages
The Rise of Empires
No ratings yet
The Rise of Empires
7 pages
Skin Care in China
No ratings yet
Skin Care in China
16 pages
Skin Care in Thailand
No ratings yet
Skin Care in Thailand
17 pages
Afshan's Target List - 12 LOD
No ratings yet
Afshan's Target List - 12 LOD
29 pages
This Is A Sample of IGCSE Exam Paper Idk
No ratings yet
This Is A Sample of IGCSE Exam Paper Idk
24 pages
SEO - Proposal - 24072024 Ver1
No ratings yet
SEO - Proposal - 24072024 Ver1
4 pages
Comment Backlink 21-8-22
No ratings yet
Comment Backlink 21-8-22
4 pages
April Attire
No ratings yet
April Attire
3 pages
150 High DA Image Submission Sites List Free Download 2025
No ratings yet
150 High DA Image Submission Sites List Free Download 2025
2 pages
Why Digital Marketing Is Essential in 2025
No ratings yet
Why Digital Marketing Is Essential in 2025
12 pages
Slide Share Backlinks
No ratings yet
Slide Share Backlinks
3 pages
Culinary Skills for Hospitality Students
No ratings yet
Culinary Skills for Hospitality Students
2 pages
TMP - 6075-Spanish Cooking445861693
No ratings yet
TMP - 6075-Spanish Cooking445861693
39 pages
1.1 Gabriella Tamasi Understanding Complying IATA LAR IATA PCR
No ratings yet
1.1 Gabriella Tamasi Understanding Complying IATA LAR IATA PCR
21 pages
Live Animal Transport On WY Information Form
No ratings yet
Live Animal Transport On WY Information Form
2 pages
Disposable Suppliers Nationwide Partial
No ratings yet
Disposable Suppliers Nationwide Partial
2 pages
Lahore's Best Local Food Spots Guide
No ratings yet
Lahore's Best Local Food Spots Guide
3 pages
50+ Video Submission Websites With High DA
No ratings yet
50+ Video Submission Websites With High DA
2 pages
Science Worksheet 3
No ratings yet
Science Worksheet 3
7 pages
MH Jazz Bowing E Book
No ratings yet
MH Jazz Bowing E Book
6 pages
Social Media Management Proposal
No ratings yet
Social Media Management Proposal
1 page
Mar Payslips
No ratings yet
Mar Payslips
1 page
Freelance Invoice Chisom Emmanuella Nwosu
No ratings yet
Freelance Invoice Chisom Emmanuella Nwosu
2 pages
Top 200 High DA Forum Submission Sites Lists 2024
No ratings yet
Top 200 High DA Forum Submission Sites Lists 2024
6 pages
150 High DA Image Submission Sites List With
No ratings yet
150 High DA Image Submission Sites List With
2 pages
75 HQ Profiles - Kokitoto
No ratings yet
75 HQ Profiles - Kokitoto
6 pages
Satvic Food Book Mini PDF
No ratings yet
Satvic Food Book Mini PDF
48 pages
AJ19
No ratings yet
AJ19
24 pages
IGCSE Economics Exam Paper
No ratings yet
IGCSE Economics Exam Paper
24 pages
Morocco 7 Day
No ratings yet
Morocco 7 Day
9 pages
Spelling 1
No ratings yet
Spelling 1
14 pages
50 High DA Image Submission Sites
No ratings yet
50 High DA Image Submission Sites
2 pages
23-24 S1 Winter Booklet - Ss
No ratings yet
23-24 S1 Winter Booklet - Ss
24 pages
FN0177601801
No ratings yet
FN0177601801
1 page
A - B Testing
No ratings yet
A - B Testing
3 pages
Attendance Sheet Format
No ratings yet
Attendance Sheet Format
7 pages
Architects - Builders Jaipur - 1315.xlsx - Google Sheets
No ratings yet
Architects - Builders Jaipur - 1315.xlsx - Google Sheets
3 pages
B Tech-AIML-question Bank-2 Answer Key
No ratings yet
B Tech-AIML-question Bank-2 Answer Key
9 pages
Personal Selling
No ratings yet
Personal Selling
8 pages
Bicol Savings and Loan Association vs. Court of Appeals: - Second Division
No ratings yet
Bicol Savings and Loan Association vs. Court of Appeals: - Second Division
4 pages
Ifeanyi's CV
No ratings yet
Ifeanyi's CV
3 pages
Salt Content Analysis in Oil Products
No ratings yet
Salt Content Analysis in Oil Products
4 pages
MAINBOARD MANUAL P4M-865G MAX P4M-865PE ... - Maxdata
No ratings yet
MAINBOARD MANUAL P4M-865G MAX P4M-865PE ... - Maxdata
64 pages
Change Theories in Nursing
100% (3)
Change Theories in Nursing
7 pages
Data Warehousing Schemas Guide
No ratings yet
Data Warehousing Schemas Guide
18 pages
Accounting Basics for Students
No ratings yet
Accounting Basics for Students
4 pages
Periodontology 2000 - 2023 - Calciolari - Efficacy of Biomaterials For Lateral Bone Augmentation Performed With Guided Bone
No ratings yet
Periodontology 2000 - 2023 - Calciolari - Efficacy of Biomaterials For Lateral Bone Augmentation Performed With Guided Bone
30 pages
Health and Safety Policy
No ratings yet
Health and Safety Policy
2 pages
PIL Research Paper PDF
No ratings yet
PIL Research Paper PDF
14 pages
The Complete Poems of Percy Bysshe Shelley Percy Bysshe Shelley Download
100% (1)
The Complete Poems of Percy Bysshe Shelley Percy Bysshe Shelley Download
10 pages
Rainscreen Cladding Installation Guide
100% (1)
Rainscreen Cladding Installation Guide
20 pages
AROGYA ADVANCE - Pdfdisplayname AROGYA ADVANCE
100% (1)
AROGYA ADVANCE - Pdfdisplayname AROGYA ADVANCE
2 pages
Comprehensive Database of Indian Corporates
No ratings yet
Comprehensive Database of Indian Corporates
4 pages
Classification Trees Testing
No ratings yet
Classification Trees Testing
33 pages
Test Your Understanding - Classes and Objects (Copy) - Attempt Review
No ratings yet
Test Your Understanding - Classes and Objects (Copy) - Attempt Review
8 pages
Hello
No ratings yet
Hello
10 pages
Document 36
No ratings yet
Document 36
9 pages
SitRep 01 - Maharashtra Flood 26-09-2025
No ratings yet
SitRep 01 - Maharashtra Flood 26-09-2025
2 pages
B.tech-8th Sem (1ST, 2ND, 3RD & 4TH Marksheet) New
No ratings yet
B.tech-8th Sem (1ST, 2ND, 3RD & 4TH Marksheet) New
3 pages
Vibxpert II Catalog
No ratings yet
Vibxpert II Catalog
74 pages
Philippine Heart Center 2020 Audit Report
No ratings yet
Philippine Heart Center 2020 Audit Report
5 pages
BookingReceipt JOPJUS
No ratings yet
BookingReceipt JOPJUS
2 pages
Vision 2018
No ratings yet
Vision 2018
20 pages
Death Obituary Cause of Death Ookht PDF
No ratings yet
Death Obituary Cause of Death Ookht PDF
4 pages
Effect of Credit Risk Management On The Performance of Commercial Banks in Nigeria 1 To 3
No ratings yet
Effect of Credit Risk Management On The Performance of Commercial Banks in Nigeria 1 To 3
57 pages
Mid-Term Examination CB-E
No ratings yet
Mid-Term Examination CB-E
2 pages
Airbrush Kit Assembly Guide
0% (1)
Airbrush Kit Assembly Guide
11 pages
High-Performance Mulching Attachments
No ratings yet
High-Performance Mulching Attachments
2 pages

Python Scenario Based Interview QA

Uploaded by

Python Scenario Based Interview QA

Uploaded by

Scenario-Based Python Interview Questions for Data Analysis Roles (Freshers)

rows. How would you clean this data using Python?

# Standardize casing (e.g., for a 'Name' column)

# Handle missing values

monthly_avg = [Link](['Month', 'Product'])['Revenue'].mean().reset_index()

- Handle missing values

- Convert categorical features to numeric using pd.get_dummies()

- Normalize/scale numerical features

- Merge datasets if activity logs are separate

- Label churn (e.g., Churn = 1 if customer left, else 0)

identify and handle outliers?

outliers = df[(df['Price'] < Q1 - 1.5*IQR) | (df['Price'] > Q3 + 1.5*IQR)]

combine them to analyze total spending per user?

merged = [Link](users, transactions, on='user_id')

import [Link] as plt

[Link]('Daily Temperature Trends')

for machine learning?

# Use OneHotEncoder or pd.get_dummies

df = pd.get_dummies(df, columns=['Country'], drop_first=True)

You might also like

outliers = df[(df['Price'] < Q1 - 1.5IQR) | (df['Price'] > Q3 + 1.5IQR)]