Data Science
Blossom Corporate Training’s Data Science Course is a transformative course that prepares
professionals to break into data careers.
As a graduate, you’ll leave poised to succeed in a range of Data Science and Analytics roles,
creating predictive models that drive decision-making and strategy across organizations of
all kinds.
PREREQUISITE
This course is designed for intermediate learners. We recommend that students arrive with a
mathematical foundation or familiarity with Python and programming fundamentals. Some of our
successful graduates joined our program with technical backgrounds, such as a degree in
computer science or mathematics or work experience in research or analysis. Other students
engage in self-learning to build a foundation ahead of class.
CAPSTONE PROJECT
The Capstone Project spans the entire duration of the program (12 weeks). Students pursue an
independent project on a question or problem of their choice, subject to approval from
Blossom Corporate Training LLC.
Approval ensures the project can be completed within the stipulated period.
The capstone project is both a valuable intellectual experience and a vehicle through which
students can demonstrate their competency to prospective employers.
DURATION
16 Weeks
PRE-WORK
Data Science Introduction.
Dive into a series of self-paced lessons on the essentials of Python programming and applied
math for data science before the course begins.
● Explore fundamental Python programming concepts, including variables, lists, loops,
dictionaries, and data sets.
● Leverage programming tools like GitHub and the command-line interface to manage data
science projects.
● Practice solving coding challenges similar to the questions used in task-based data
science interviews.
● Write and run Python functions using multiple arguments.
● Discover how key math concepts like statistical significance and probability distribution
are applied throughout data science.
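As a minimal sketch of two of the pre-work ideas combined, the hypothetical function below takes multiple arguments (including a keyword argument with a default) and computes a z-score, a basic building block of statistical reasoning. The function name, parameter names, and sample values are illustrative assumptions, not course material.

```python
from statistics import mean, pstdev, stdev


def z_score(value, samples, use_sample_std=True):
    """Return how many standard deviations `value` lies from the mean of `samples`."""
    centre = mean(samples)
    # Sample standard deviation divides by n - 1; population divides by n.
    spread = stdev(samples) if use_sample_std else pstdev(samples)
    return (value - centre) / spread


# Hypothetical measurements, chosen only for illustration.
samples = [2, 4, 4, 4, 5, 5, 7, 9]

# Positional and keyword arguments can be mixed in one call.
population_z = z_score(9, samples, use_sample_std=False)
sample_z = z_score(9, samples)
print(population_z, sample_z)
```

The sample-based score comes out smaller than the population-based one here because the sample standard deviation is slightly larger.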
CONTENT
Unit 0 Intro to Data Science
This unit introduces the basic concepts of data science to ground your understanding of the
rest of the course.
● What is Data Science?
● Setting up a Python environment
Unix commands
● Utilize UNIX commands to navigate file systems and
modify files.
● Learn to track changes.
● Manipulate files.
Guided Project
● Analyze data with Python, write conclusions, and make
recommendations based on analytical findings.
● GitHub review
Predictive modeling
● Overview of predictive modeling.
● Types of predictive modeling.
Unguided Project
Solve real-world problems with predictive models and forecast
the future for actionable decision-making.
Unit 3 Machine Learning Models
Build machine learning models. Explore the difference between supervised and unsupervised
learning via clustering, natural language processing, and neural networks.
Supervised learning
● Data preprocessing for machine learning.
● Feature engineering.
● Feature selection.
● Data scaling.
● Machine learning algorithms.
● Building regression models.
● Building classification models.
● Ensembles.
● Hyperparameter tuning.
● Pipelines.
● Model evaluation and model selection.
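Several of the supervised learning topics above (data scaling, building a regression model, and model evaluation) can be sketched end to end in plain Python. This is a toy illustration under stated assumptions: the data values and all function names are hypothetical, and in practice a machine learning library would typically handle these steps. The source does not name one.

```python
from statistics import mean, pstdev


def fit_scaler(values):
    """Learn the mean and standard deviation from the training values only."""
    return mean(values), pstdev(values)


def scale(values, mu, sigma):
    """Standardize values using previously learned statistics."""
    return [(v - mu) / sigma for v in values]


def fit_line(xs, ys):
    """Ordinary least squares for a single feature: returns (slope, intercept)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def mean_squared_error(actual, predicted):
    return mean((a - p) ** 2 for a, p in zip(actual, predicted))


# Hypothetical toy data following y = 2x + 1 exactly.
feature = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
target = [3.0, 5.0, 7.0, 9.0, 11.0, 13.0]

# Hold out the last two rows to evaluate on data the model has not seen.
x_train, x_test = feature[:4], feature[4:]
y_train, y_test = target[:4], target[4:]

# Fit the scaler on the training split, then apply it to both splits.
mu, sigma = fit_scaler(x_train)
slope, intercept = fit_line(scale(x_train, mu, sigma), y_train)

predictions = [slope * x + intercept for x in scale(x_test, mu, sigma)]
mse = mean_squared_error(y_test, predictions)
print(f"test MSE: {mse:.6f}")
```

Fitting the scaler on the training split alone keeps information from the held-out rows out of the model, which is the same reason scaling and modeling steps are often bundled into a single pipeline.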
Guided Project
● Build regression models to solve real-world problems,
with guidance.
● Build classification models to solve real-world problems,
with guidance.
Unsupervised learning
● Application of clustering in customer segmentation.
● Data preparation for unsupervised learning.
● Dimensionality reduction (PCA, t-SNE).
● Clustering (K-means clustering, hierarchical clustering).
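To make the clustering topic concrete, here is a minimal K-means sketch in plain Python applied to a toy customer-segmentation case. The customer values, the function name, and the choice to initialise centroids from the first k points (made so the result is deterministic) are all assumptions for illustration. Library implementations usually use smarter initialisation.

```python
from math import dist
from statistics import mean


def k_means(points, k, iterations=10):
    """Cluster points into k groups; returns (centroids, clusters)."""
    centroids = list(points[:k])  # assumption: deterministic initialisation
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            tuple(mean(axis) for axis in zip(*cluster)) if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical customers as (monthly spend, monthly visits), in arbitrary units.
customers = [(1.0, 1.0), (9.0, 9.0), (1.5, 2.0), (8.0, 8.5), (1.0, 0.5), (9.5, 8.0)]

centroids, clusters = k_means(customers, k=2)
for centroid, members in zip(centroids, clusters):
    print(centroid, members)
```

On this toy data the two segments separate into low-spend and high-spend customers, which is the kind of grouping a targeted marketing campaign could act on.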
Guided Project
● Apply unsupervised machine learning to segment
customers for targeted marketing.
Deploying ML models
● Turning code into production code (modular
programming).
● Introduction to Streamlit.
● Preparing models for deployment.
● Deploying models with Streamlit.
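The modular programming idea above can be sketched as follows: notebook-style code is split into small functions with a single entry point, so that a separate front end (for example, a Streamlit app) could import `predict` without re-running everything else. The function names, the JSON parameter format, and the linear model are hypothetical and chosen only for illustration. No Streamlit code is shown here.

```python
import json


def load_model(raw_json):
    """Parse saved model parameters (assumed format: weights plus intercept)."""
    params = json.loads(raw_json)
    return params["weights"], params["intercept"]


def predict(model, features):
    """Apply a linear model to one row of feature values."""
    weights, intercept = model
    return sum(w * x for w, x in zip(weights, features)) + intercept


def main():
    # In a real project the parameters would come from a file saved after training.
    model = load_model('{"weights": [2.0, 0.5], "intercept": 1.0}')
    print(predict(model, [3.0, 4.0]))


if __name__ == "__main__":
    # Runs only when executed as a script, not when imported by another module.
    main()
```

Keeping loading and prediction in separate functions also makes each piece easy to test on its own.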
Capstone Project
Choose a data set to explore and model, providing a detailed
notebook of your technical approach and a public presentation
of your findings.