
Uploaded by

Kartik Sharma

Micron Interview Questions Summary

# Question 1: Parsing the HTML webpages


For parsing the HTML pages I used Beautiful Soup along with pandas for the DataFrame objects. I read the tables in the section relevant to the question using the soup HTML parser, then located the rows and the cell data associated with them. To process the column headers and values, I separated the two, cleaned them of stray whitespace, and merged them afterwards. Finally, as required, I packed the lists containing the column headers and values into a dict.
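The steps above can be sketched roughly as follows. The HTML snippet and its column names are placeholders of my own, not the actual page from the question:

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for the webpage from the question.
html = """
<table>
  <tr><th> Car Name </th><th> Price </th></tr>
  <tr><td>Toyota Corolla</td><td>50000</td></tr>
  <tr><td>Honda Civic</td><td>48000</td></tr>
</table>
"""

soup = BeautifulSoup(html, "html.parser")
table = soup.find("table")

# Separate headers and cell values, cleaning stray whitespace.
headers = [th.get_text(strip=True) for th in table.find_all("th")]
rows = [
    [td.get_text(strip=True) for td in tr.find_all("td")]
    for tr in table.find_all("tr")
    if tr.find_all("td")
]

# Pack the column headers and values into a dict of lists.
data = {h: [row[i] for row in rows] for i, h in enumerate(headers)}
print(data)
```

From here the dict can be handed straight to `pandas.DataFrame(data)` for further processing.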

# Question 2
a. Check if there are new lines in 'NewData.csv', and append them to the existing 'MasterDB.csv', as long as the 'Status' in the row is 'Available', and the 'Price' and 'COE' columns are not 'N.A.' (i.e. have a value).

Initially read the data and check the condition given for appending the new data

rows_to_be_updated = new_data[(new_data['COE'] != "N.A.") &
    (new_data['Price'] != "N.A.") & (new_data['Status'] == 'Available')]

After fetching the above records, the missing values have to be treated. To compare the rows from the master data with the fetched rows, pandas offers a `compare` method, which I avoided because it is resource-intensive and not supported in some pandas versions, which could be a bottleneck. Instead, I took the last index of the master data and appended the fetched rows after it.
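A minimal sketch of this filter-and-append step, using small in-memory frames in place of MasterDB.csv and NewData.csv (the column names follow the question; the sample values are my own):

```python
import pandas as pd

# Hypothetical stand-ins for MasterDB.csv and NewData.csv.
master = pd.DataFrame({
    "Car Name": ["Toyota Corolla"],
    "Price": ["50000"], "COE": ["30000"], "Status": ["Available"],
})
new_data = pd.DataFrame({
    "Car Name": ["Honda Civic", "Mazda 3"],
    "Price": ["48000", "N.A."], "COE": ["29000", "28000"],
    "Status": ["Available", "Available"],
})

# Keep only rows that satisfy the stated conditions.
rows_to_be_updated = new_data[
    (new_data["COE"] != "N.A.") &
    (new_data["Price"] != "N.A.") &
    (new_data["Status"] == "Available")
]

# Continue the index from the last index of the master data,
# then append the fetched rows after it.
start = master.index[-1] + 1
rows_to_be_updated.index = range(start, start + len(rows_to_be_updated))
master = pd.concat([master, rows_to_be_updated])
print(master)
```

Here "Mazda 3" is dropped because its 'Price' is 'N.A.', and "Honda Civic" is appended at index 1.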

b. For the existing lines, see if 'NewData.csv' contains any changes. If yes, update the changes in 'MasterDB.csv'.

I used a left outer join to compare the overlapping rows and removed the unwanted rows. Several alternative methods could have achieved the same result.
c. If the column 'Status' in 'NewData.csv' is 'Sold', then remove those lines from 'MasterDB.csv'.

I simply checked the condition 'Status' not equal to 'Sold' and kept the remaining rows in the master.
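A short sketch of this removal, again with hypothetical frames and 'Car Name' assumed as the key:

```python
import pandas as pd

# Hypothetical stand-ins for MasterDB.csv and NewData.csv.
master = pd.DataFrame({"Car Name": ["Toyota Corolla", "Honda Civic"]})
new_data = pd.DataFrame({"Car Name": ["Honda Civic"], "Status": ["Sold"]})

# Collect the names marked 'Sold' in the new file, then keep only
# the master rows that are not among them.
sold = new_data.loc[new_data["Status"] == "Sold", "Car Name"]
master = master[~master["Car Name"].isin(sold)].reset_index(drop=True)
print(master)
```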

# Question 3
a. Develop a script that can split Column ‘Car Name’ to get the following
attributes

i. Car Make

ii. Car Model Name

iii. COE End Date

I used lambda functions to split the column on spaces. For the COE end date, I fetched the last elements and extracted the date from them. Lambda functions can be used in plain Python as well as in Spark, which provides better performance.
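A sketch of the split, assuming (my assumption, not stated in the question) that 'Car Name' looks like "&lt;Make&gt; &lt;Model...&gt; (COE &lt;date&gt;)", with the make as the first token and the date in parentheses at the end:

```python
import pandas as pd

# Hypothetical sample data; the exact column format is an assumption.
df = pd.DataFrame({"Car Name": [
    "Toyota Corolla Altis (COE 12/2027)",
    "Honda Civic (COE 05/2026)",
]})

# Split on spaces with a lambda, then pick the pieces apart.
parts = df["Car Name"].apply(lambda s: s.split())
df["Car Make"] = parts.apply(lambda p: p[0])
df["Car Model Name"] = parts.apply(lambda p: " ".join(p[1:-2]))
# The date sits in the last element; strip the closing parenthesis.
df["COE End Date"] = parts.apply(lambda p: p[-1].rstrip(")"))
print(df[["Car Make", "Car Model Name", "COE End Date"]])
```

If the real format differs (e.g. multi-word makes), the slice boundaries would need adjusting.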

b. Build a statistical model for every car make (e.g. Toyota)

i. Mean, Median, Mode

ii. +- 3 Sigma Value

In order to extract all the above statistics, I formatted the data into the appropriate dtypes. Then I used pandas groupby together with aggregation to compute all the values.
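The groupby-and-aggregate step can be sketched as below; the sample prices and column names are my own placeholders:

```python
import pandas as pd

# Hypothetical price data per car make.
df = pd.DataFrame({
    "Car Make": ["Toyota", "Toyota", "Toyota", "Honda", "Honda"],
    "Price": ["50000", "52000", "52000", "48000", "46000"],
})

# Cast to a numeric dtype first, then aggregate per make.
df["Price"] = pd.to_numeric(df["Price"])
stats = df.groupby("Car Make")["Price"].agg(
    mean="mean",
    median="median",
    mode=lambda s: s.mode().iloc[0],  # first mode if there are ties
    std="std",
)

# +/- 3 sigma bounds around the mean.
stats["upper_3sigma"] = stats["mean"] + 3 * stats["std"]
stats["lower_3sigma"] = stats["mean"] - 3 * stats["std"]
print(stats)
```

Note that `mode` can return several values when there are ties; taking the first is one simple convention.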
