0% found this document useful (0 votes)
25 views

Modules Data Science-IITK

Uploaded by

Nikhil Kumar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views

Modules Data Science-IITK

Uploaded by

Nikhil Kumar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

IIT-Kanpur Professional

Certificate course in

Data Science
Now learn in Hindi

3 Months / 5 Months (Weekday/Weekend)

Live Classes with Assured Job Opportunities


About IIT-Kanpur

IIT-Kanpur has a rich history of academic excellence and


innovation. IIT-Kanpur has played a pivotal role in
catalyzing GUVI's mission to provide tech education in
regional languages. The collaboration between IIT-Kanpur
and GUVI showcases the commitment of both
establishments to bridge the gap between quality
education and language accessibility.

GUVI In A Glance

Grab Ur Vernacular Imprint-GUVI is World’s First


Vernacular Ed-Tech Learning Platform. Introduced by

Ex PayPal Employees, GUVI empowers students to


master programming skills with the comfort of their
native language. Its mission is to impart technical skills
to all through focussed pedagogical tools.
About the Co-Founders

Co - Founder at GUVI

20+ years of Technical Expertise

& more

Tech Women Entrepreneur who was selected

For Google Developers’ Launchpad Program


Sridevi

CEO & Founder at GUVI

20+ years of Technical Expertise

& more

Built 7 Products from Scratch Mentored 1000+

Arun Prakash
students Hosted 200+ sessions & 25+ webinars

Co - Founder at GUVI

17+ years of experience with IT industry

Technologist with 9+ years of Entrepreneurial

experience & Member of the Syllabus Sub-


Bala Murugan
Committee at Anna University

About GUVI’s IIT-K Data Science Course

GUVI s IIT-K Data Science Zen Class is your gateway to a


world-class upskilling experience, meticulously designed
for ambitious job seekers in the dynamic field of emerging
tech careers. Our Data Science course with IIT-K Certified
Advanced Programming is your key to career
enhancement. This boot camp-style upskilling program
offers unmatched flexibility with learning in हिदी.
Experience a comprehensive curriculum equipping you
with top-tier skills such as Python, MySQL, MongoDB,
Pandas, NumPy, NLP & much more with expert-led
mentorship, hands-on data science projects in real-time
environments, and industry- recognized certifications that
serve as a catalyst for your journey to excel in the thriving
Data Science industry.
Why Choose

Data Science as
Your Career?

Average base salary


₹10 LPA (Ambitionbox)

Annual Growth Rate


27.7% from 2021-2026, creating 88,000 jobs
on average

Job Vacancies
13000+ Job Openings by 2028 makes it a

highly demanding market

Used across Industries


E-commerce, AI & ML, Government, Healthcare,
Finance, Marketing, Energy

Work with Companies


Apple, IBM, Google, Microsoft, Spotify,
Hotstar, Flipkart, Salesforce

Highest Salary
25 LPA- 40 LPA

Scale Success with Lucrative


Career Opportunities After
Course Completion

Data Scientist · Data Analyst · Data Architect ·


Data Storyteller · Machine Learning Scientist ·
Machine Learning Engineer

High Demand
84,000+ Data Science Jobs in India

1,34,000+ Data Science Jobs in the US


Highlights of Guvi’s
IIT-K Data Science
Course

175% Highest Hike ₹21 Lakhs Highest Salary

40% Hike in Demand 600+ Hiring Partners

360+ Best Industry

Assured Job Opportunites


Experts

Highlights of Guvi’s
IIT-K Data Science
Course

Weekdays/Weekend LIVE Classes

Hands-on Industry Projects + Bi-weekly

Hackathons

Technical Mentorship by Industry Experts +

Periodic developer Meetups

Practise on Coding Practise Platforms CodeKata,

WebKata, MicroArc

Mock interviews + Group Discussion Test

Live Cumulative Coding Test + Capstone Project

Proceed to Placement Phase


How GUVI’s

Helps You Get a Job

Post Course completion

Receive Exclusive Job Notifications in

Data Science Domain

Company-Specific Expectations are

Set Interviews Scheduled

Tech Guidance and Support

+ Assured Job Opportunities


How Does GUVI Work?

Book a Seat with ₹8000

(100% Refundable)

Attend Pre-Bootcamp Session



(2 sessions)

Take Assessment

If Selected/
If Not Selected/

Interested Not Interested

Pay the remaining

amount and proceed Immediate Refund

to the program of ₹8000

Top skills you'll learn in


Guvi’s IIT-K Data Science
Course
Best Statistical programming language skills with

Python.

Excellent database querying skills.

Good understanding of Analytical tools and statistics.

Conceptual clarity towards Predictive performance &

algorithm optimization.

Master data visualization & communication skills.


Data Science

Course Curriculum
Module-1
We will go through the basics of Python with all essential
beginner-friendly concepts of Python programming like
datatypes, loops, data structures, and functions, followed
by assessments and assignments.

Python - Basics

Why Python

Python IDE

Hello World Program

Variables & Names

String Basics

List

Tuple

Dictionaries

Conditional Statements

For and While Loop

Functions

Numbers and Math Functions

Common Errors in Python

Module-2
Since we have the essential basics of python we will see
some advanced concepts like Comprehensions, File
handling, Regular Expressions, Object-oriented
Programming, Pickling and many more essential
concepts.

Python - Advanced

Functions as Arguments

List Comprehension

File Handling

Debugging in Python

Class and Objects

Lambda, Filters and Map

Regular Expressions

Python PIP

Read Excel Data in Python

Python MySQL

Iterators

Pickling

Python JSON
Module-3
We will explore the need for Algorithmic Thinking and the
necessity of efficient coding, we will drive through Data
Structures and Algorithms along with Memory
Management Techniques

Algorithmic thinking with Python

Introduction to algorithmic Thinking

Algorithm Efficiency and time complexity

Example algorithms - binary search,

Euclid’s algorithm

Data structures - stack, heap and binary trees

Memory Management/Technologies

Best Practices – Keeping it simple, dry code,

naming Conventions, Comments and docs.

Assessment

Module-4
Since we need to handle huge amounts of data, we will
be implementing data handling techniques with Pandas
library. And we will explore the different miscellaneous
functions of Pandas library in detail.

Data handling in Python - Pandas & MongoDB

Introduction to Pandas

Series Data Structure - Querying and Indexing

DataFrame Data Structure - Querying,

Indexing and loading

Merging data frames

Group by operation

Pivot table

Date/Time functionality

Example: Manipulating DataFrame


Module-5
Now we are going to explore the popular NoSQL
databases like MongoDB and their importance as well as
their key components. We will perform several
operations like CRUD operations, etc.

MongoDB

No Schema

Install MongoDB

How MongoDB Works?

Insert First Data

CRUD Operations

Insert Many

Update and Update Many

Delete and Delete Many

Module-6
We will continue with MongoDB beyond basics like
embed documents, dealing with types of data, schema
types and Data relationships in MongoDB, etc.

MongoDB - Continued

Diving Deep into find Difference

between update and update many

Projection

Intro to Embed Documents

Embed Documents in Action

Adding Arrays

Fetching Data From Structured Data

Schema Types

Types of Data in MongoDB

Relationship between data's

One to One using Embed Method

One to One using ReferenceMany

One to Many Embed

One to ManyReferenceMethod

Assessment - MongoDB
Module-7
We will go through Probability and Statistics whereas
they are key to understand, process and interpret the
vast amount of data, we will deal with the basics of
probability and statistics like Probability theory, Bayes
theorem, distributions etc and their importance. Besides
that, we will do hands-on with Numpy upon those
concepts

Probability and Statistics with Numpy

Why counting and probability theory?

Basics of sample and event space

Axioms of probability

Total Probability theorem and Bayes Theorem

Random variables, PMF and CDF

Discrete Distributions - Bernoulli, Binomial

and Geometric

Expectation and its properties

Variance and its properties

Continuous Distributions - uniform, exponential

and normal

Sampling from continuous distributions

Simulation techniques - simulating in NumPy

Assessment

Module-8
We will continue with statistics and probability and
we will deal with descriptive and inferential statistics
along with Hypothesis testing and a lot of other
relevant statistics methods

Probability and Statistics with Numpy - Continued

Inferential statistics - sample vs population


CLT and it’s proof

Chi-squared distribution and its properties

Point and Interval Estimators

Estimation technique - MLE

Interval Estimator of μ with unknown σ

Examples of estimators

Hypothesis Testing - I

Hypothesis Testing - II

Hypothesis Testing - III

Assessment
Module-9
Data Visualization is used to understand data in a visual
context so that the patterns, trends and correlations in
the data can be understood. We will do a lot of
visualization with libraries like Seaborn, Matplotlib etc in
turn that leads to effective storytelling.

Data Visualisation in Python (Matplotlib, Seaborn)

Read Complex JSON files

Styling Tabulation

Distribution of Data - Histogram

Box Plot

Data Visualization - Recap

Pie Chart

Donut Chart

Stacked Bar Plot

Relative Stacked Bar Plot

Stacked Area Plot

Scatter Plots

Bar Plot

Continuous vs Continuous Plot

Line Plot

Line Plot Covid Data

Assessment

Module-10
Plotly’s Dash is a famous open-source data visualization
library which we will be using to build custom data
visualization projects which allows for better storytelling.

Data Visualisation with Plotly’s dash

Dash by Plotly setup

Dash core components

Style our Dash Application

Callbacks,

Adding interactivity to our Dash Apps using


Callbacks

Module-11

It is always needed to analyze the data and preprocess

it , since the real-world data is not always industry ready,

so in this week we will be dealing with a lot of data

cleaning and Exploratory data Analysis techniques which

is a very crucial stage for any data science project

Data Engineering with Python

Handling missing data

Techniques to impute missing values

Encoding the data

Outlier detection and correction

Meaningful data transformation

Assessment

Module-12

Real-World data will not always be of the numerical

form, we need to know how to handle a lot of other

forms of data like image and text. In this week we are

going to see all essential data analysis techniques on

image and text data.

Data Analysis on Image and text data

How computers process and understand images,

Pixel

Basic Properties of Images

Greyscale, Processing Pixel Values

Masking

Image Processing

Text data preprocessing

Cleaning Text Data

Exploratory Data Analysis on Image and text data

Assessment

Module-13

We are going to explore the need for machine learning

and its types, Algorithms when to use and how to use

essential mathematical intuition along with Evaluation

metrics. We will see in detail about regression algorithms.

Machine Learning with Sklearn

Introduction to machine learning

Expert systems and 6 Jars

Supervised Learning - Regression and

Classification

Evaluation metrics and measuring accuracy

Introduction to regression

Interpreting models

Feature selection

Regularisation - Ridge and Lasso

Assessment
Module-14
We are going to explore the need for machine learning
and its types, Algorithms when to use and how to use
essential mathematical intuition along with Evaluation
metrics. We will see in detail about regression algorithms.

Machine Learning with Sklearn - Continued

Introduction to classification

Evaluation metrics - TP, FP and AUC

Classification using logistic regression

Classification using KNN

SVM

Assessment

Module-15
We are going to explore classification algorithms like
tree-based algorithms in detail like how to interpret
trees, pruning and ensemble methods like bragging and
boosting, etc.

Machine Learning with Sklearn - Continued

Introduction to decision trees

Building, pruning and interpreting trees

Ensemble techniques - Bagging and boosting

Random forests

Boosted trees - Gradient boosting

Assessment

Module-16
After dealing with a lot of Supervised Machine Learning
Algorithms we will compare and get to know when to use
what, Besides that we will deal with the do’s and don'ts
while training an ML model.

Machine Learning with Sklearn - Continued

Comparison of supervised

techniques - when to use what?

Do’s and Dont’s while training ML models

Handling imbalanced data

Undersampling

Oversampling

Other methods - ROSE, SMOTE, etc.

Assessment

Module-17
Now we will explore Unsupervised learning algorithms,
why unsupervised ?, when to use them and as well as the
essential mathematical intuition

Machine Learning with Sklearn - Continued

Introduction to unsupervised learning

Market Basket Analysis

K means algorithm

Assessment
Module-18
We are going to explore Natural Language Processing
(NLP). At first, we will see Syntactic analysis or parsing to
analyze text using basic grammar rules to identify
sentence structure, how words are organized, and how
words relate to each other. Some of them are
Tokenization, Part of Speech Tagging (PoS Tagging),
Lemmatization and Stemming, Stop word removal etc

Natural Language Processing

Syntactic Analysis

Tokenization

Part of Speech Tagging (PoS Tagging)

Lemmatization and Stemming

Stop word removal

Module-19
Now we will see that Semantic analysis focuses on
capturing the meaning of the text. First, it studies the
meaning of each individual word (lexical semantics).
Then, it looks at the combination of words and what they
mean in context. Some of the sub-tasks of semantic
analysis which we are going to explore are Word sense
disambiguation, Relationship extraction. Besides that, we
will explore Sentiment Analysis, Text extraction etc

Natural Language Processing - Continued

Semantic Analysis

Word sense disambiguation

Relationship extraction

Sentiment Analysis, Text extraction

Module-20
This whole week we are going to work on industry
projects which are currently in demand under the
guidance of industry experts

Putting it together - Solving DS problems

Case Study - I : Credit Card Fraud detection

Case Study - II : Airline Customer segmentation

Case Study - III : Product recommendation

engine

Module-21
Eventually, it's time to attend the mock interviews which
will be conducted by the industry experts like Data
scientists, IIT professors and renowned HR’s in order to
mould you in every area possible

Mock Interviews
Final Project and Course
wrap-up
20+ projects with Industry experts mentorship

Oil Price Prediction

A time-dependent model capable enough to take the

modeling decisions predicting around three months

price of the crude oil. Sound understanding of Data

engineering and Data visualizing. Hands-on experience

on Model fitting and Model Validation.

COVID-19 Data Visualization

A pictorial representation of the current COVID-19 data

on a map, presenting a quick update on the situation.

Collect and visualize a map version of the coronavirus

outbreak. Agile know-how on the crisis, pinned with a

glimpse at its history.

Gender and Age Detection

A model that can detect any human being’s age &

gender through analyses of single face detection via

an image. Developed on the UTK Face dataset.

It is widely used in applications like access control,

human-computer interaction, law enforcement, etc.


Hear it from our learners

After I graduated, I worked in a core domain After my graduation in ECE, I was looking to start my
as an Engineer. But soon, I decided to shift career in Data Science and I enrolled in GUVI since
my domain to IT for a better growth and one of my friends recommended me .With the
salary, that’s when I enrolled in GUVI’s Data trainer's simple explanations coding became easy to
Science program. I would like to thank the learn. Their project-based learning approach
placement team that continuously worked to strengthened my skill set in the data science field
get me placed in a good company. Thanks to and I was able to improve my communication skills as
GUVI, I’m working as a Data Scientist at Axa well with the guidance of the placement team to get
Insurance.

placed in GoFrugal as a Data Scientist.

Kishore Kumar D G Vamsi Krishna

“I have attended several classes of Masters in Data


“Guvi helps me to improve my self-confidence science course conducted by Guvi. It is really
in coding skills . The zoom classes are totally helpful to gain knowledge as it is different from
comfortable,friendly and easy to learn .It helps other online courses. Here, we have mentors in live
me to understand the basic and the core sessions, so we will be more concentrated than
concepts and it helped me to. Build logical other online courses where we watch pre
skills.I got great mentor's which helped me to recorded videos. Also we are getting weekly tasks
bridge between the academics. I'm very proud that would make us learn even if there is no class.

Thanks to Guvi.”

I am thankful for all the people in Guvi for building


up such a valuable program for our career.”

Gokila Gokul

I never thought cracking a tech job would be “I always liked coding but I didn't really get a
so easy with industry-ready skills. Guvi good platform to learn things as per industrial
transformed me completely. I had zero coding requirements. When I was in search I got to know
skills. But with strong guidance and support, I about Guvi, I really felt trustworthy by their
was able to learn from the basics to the response When I joined the Data Science course
advanced level in Data Science, everything the weekend live classes and recorded course
was taught with an ease that helped me to videos has made learning easy to me. Eventually
secure a place as a senior analyst at EY. 
I started spending more time practicing in
Codekata. I loved the way Guvi took care of
clarifying doubts asap. Thank you!.”

Gokak Mohd Ishtiyaque Sonia kola

“Hello folks, if you are thinking of a career transition in


the ‘Data Science’ field then, “GUVI” is the best
“The datascience course is very good,
platform to get nourished, indulged and protruded in
the concepts are being explained in a
this upcoming field and also, it doesn’t matter from
crisp manner. The instructors have good
which engineering background you are or whether
depth in the subject and solve every
you are a working fellow. The best thing I found here
doubt one might have. Thanks to GUVI for
is you will always get motivated unknowingly and
setting a great structured program.”

become curious to learn more & more from the


tutorial videos conducted by the IITM professors.
GUVI helps me to think about the problem in
multidimensional ways. Thanks to the GUVI team”

Shubham Nehete Diliban Sibi

“This course is designed being dynamic, interactive


“The course videos help you to learn the tools by and range of materials to refer. This is very well
yourself and you can track the progress.The structured in such a way that it makes the
mentors are very patient and ensure that participants to perform, discuss, and to participate in
students understand the concept, sometimes assessments that will help the participants to
going the extra mile and explaining. Sometimes maximize the utilization. This program is suitable for all
the mentors try to teach in your native language, if students, freshers and working professionals. This
needed. The practice platforms are easy to learn course is excellent for those who would like to learn
and practice. By completing this data science the basics of program like Python and would like to
course, sure you can become a Data Scientist.”

broaden their knowledge in Data Science. I enjoyed


seeing videos in GUVI website from experts that also
explains the concepts in a detailed manner. ”

Sridharan K Anbazhagan
Instructors
Learn from India’s Top Industry Leaders

Mr Koushik Krishnan Shabarinath


Independant Trading ResPro Labs - Manager
Desk - Quant Trader

Vinish Vivek Shyam Kumar


Freelance- Consultant - Skit.ai - Machine Learning
Python Solutions Lead

Neeru Dubey Neeru Dubey


Salesken AI - Research Salesken AI - Research
Scientist Scientist
Our Placements

"I got a 57% Hike,

Thank you GUVI Team"

Sonia Kola

Data Scientist

Watch Video

“Every topic was

covered from scratch”

Rakesh

Python Developer

Watch Video

Watch Video
Program Details

3-Months (Weekday)/5-Months (Weekend)

Please contact our Data Science coordinator

Deepak: +91-9736097320

Total Course Fee ₹1,30,000 + GST

Pre-BootCamp Booking Fees -₹8000

Remaining Fee ₹1,22,000 + GST

Upto 24 Months*

Note: Valid documents are required for EMI Process.


Additional processing fee will be applied. EMI Amount
might vary with Vendors

No - Eligibility / Restrictions!!!

Any interested individual who is aspiring to get a job in


the IT industry as a Data Scientist

Working Professionals who wish to switch their career


into Data Science.
GUVI believes in zero restrictions! Anybody
can enroll and start learning from scratch.
Especially helpful for freshers & working
professionals, seeking opportunities to
upskill their data science proficiency for
faster career growth.

Accelerate your

Upskilling Journey

with GUVI and

Become a Data Science

Professional with IIT-K

Certification in Advanced
Programming!
*Just within 5 Months

For further information:


IITM Research park - phase 2

deepak@guvi.in
module #9, 3rd floor, D block,

+91 9736097320 Kanagam Rd, Tharamani, Chennai,

Tamil Nadu, India. 600113

You might also like