Data Science

Blossom Corporate Training's 16-week Data Science course prepares students to enter data science and analytics roles. The course covers collecting, cleaning, analyzing, and visualizing data using tools like SQL, Python, and machine learning algorithms. Students complete hands-on projects, build a portfolio, and must finish a 12-week capstone project to graduate. Prerequisites include intermediate Python knowledge or a related technical background.

Uploaded by

anthony

Blossom Corporate Training LLC

Kwabenya, Point One
Accra-Ghana
Phone: +233 59 498 2555

75 9th Ave. 6th Floor
New York, NY. 10011
Phone: +1 838 245 6782

Data Science Curriculum


OVERVIEW

Blossom Corporate Training’s Data Science Course is a transformative course that prepares
professionals to break into data careers.

As a graduate, you’ll leave poised to succeed in various Data Science and Analytics roles,
creating predictive models that drive decision-making and strategy throughout organizations of
all kinds.

Throughout the expert-designed course, you’ll:


● Collect, extract, query, clean, and aggregate data for analysis.
● Gather, store, and organize data using SQL and Git.
● Perform visual and statistical analysis on data using Python and its associated libraries
and tools.
● Craft and share compelling narratives through data visualizations.
● Build and implement appropriate machine learning models and algorithms to evaluate
data science problems spanning finance, e-commerce, and more.
● Compile clear stakeholder reports to communicate the nuances of your analysis.
● Apply question, modeling, and validation problem-solving processes to data sets from
various industries to provide insights into real-world problems and solutions.
● Prepare for the world of work, compiling a professional-grade portfolio of solo, group, and
client projects.

PREREQUISITE
This course is designed for intermediate learners. We recommend that students arrive with a
mathematical foundation or familiarity with Python and programming fundamentals. Some of our
successful graduates joined our program with technical backgrounds, such as a degree in
computer science or mathematics or work experience in research or analysis. Other students
engage in self-learning to build a foundation ahead of class.

The course also requires


● Your own laptop (PC or Mac). It must be no more than four years old and able to run the
most recent operating system.
● If remote, a webcam, headphones, and good access to the internet are required.

We recommend students who are new to data science pursue our Data Analytics course prior to
taking this course.

CAPSTONE PROJECT
The Capstone Project spans the entire duration of the program (12 weeks), during which students
pursue independent projects on a question or problem of their choice, subject to approval from
Blossom Corporate Training LLC.

Students are strongly encouraged to choose a project

1. for which at least 70% of the relevant topics are covered within the program, and
2. for which the availability of data is at least 70% certain.

This is to ensure the project can be completed within the stipulated period.

The capstone project is both a valuable intellectual experience and a vehicle through which
students can demonstrate their competency to prospective employers.

Note: Completion of the Capstone Project is a requirement for graduation.


Each project is designed to provide hands-on experience with as many data science concepts as
possible.

DURATION
16 Weeks

PRE-WORK
Data Science Introduction.

Dive into a series of self-paced lessons on the essentials of Python programming and applied
math for data science before the course begins.
● Explore fundamental Python programming concepts, including variables, lists, loops,
dictionaries, and data sets.
● Leverage programming tools like GitHub and the command-line interface to manage data
science projects.
● Practice solving coding challenges similar to the questions used in task-based data
science interviews.
● Write and run Python functions using multiple arguments.
● Discover how key math concepts like statistical significance and probability distribution
are applied throughout data science.
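
As a rough illustration of the pre-work topics above (dictionaries, lists, loops, and functions with multiple arguments), here is a minimal sketch. The function name, its parameters, and the sample data are hypothetical and are not taken from the course materials.

```python
def summarize_scores(scores, passing_mark=50, label="score"):
    """Split the names in a name -> score dictionary into passed and failed."""
    summary = {"label": label, "passed": [], "failed": []}
    for name, value in scores.items():
        # Pick the list to append to based on the passing mark
        key = "passed" if value >= passing_mark else "failed"
        summary[key].append(name)
    return summary


# Hypothetical sample data
quiz_scores = {"Ama": 72, "Kofi": 45, "Esi": 88}

# Positional and keyword arguments can be mixed in one call
result = summarize_scores(quiz_scores, passing_mark=60, label="quiz")
print(result)
```

The default values make the second and third arguments optional, so `summarize_scores(quiz_scores)` is also a valid call.
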
CONTENT

Unit 0 Intro to Data Science

This unit will introduce you to the basic concepts of data science to enhance your
understanding of the course.
● What is Data Science?
● Setting up a Python environment

Unit 1 Fundamentals

Python Revision


● Data types and Data Structures.
● Functions.
● Object-oriented programming (classes, methods, inheritance).
● Modular programming.
● Scripting with Python.
● Data manipulation with NumPy and pandas.
● Data Visualization.
● Exploratory data analysis.

Descriptive Statistics crash course


● Measures of central tendency.
● Measures of variation.
● Basic Statistical concepts.
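
The measures listed above can be sketched with Python's built-in `statistics` module. This is only an illustration on a made-up sample. The course itself may use NumPy or pandas for the same calculations.

```python
import statistics

# Hypothetical sample: daily order counts for one week
orders = [12, 15, 15, 18, 20, 22, 45]

# Measures of central tendency
mean = statistics.mean(orders)
median = statistics.median(orders)
mode = statistics.mode(orders)

# Measures of variation
value_range = max(orders) - min(orders)
sample_variance = statistics.variance(orders)  # divides by n - 1
sample_std = statistics.stdev(orders)          # square root of the sample variance

print(mean, median, mode, value_range, round(sample_std, 2))
```

The single large value (45) pulls the mean above the median, which is why both measures are usually reported together.
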

Inferential Statistics crash course


● Review of hypothesis testing.
● Review of statistical testing concepts.
● Implementation of hypothesis testing concepts (p-values,
confidence intervals, correlation/causation) with SciPy
and statsmodels.
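
To make the idea of a p-value concrete, here is a minimal sketch of a two-sample permutation test written with the standard library only. It is a stand-in for illustration, not the SciPy or statsmodels workflow the unit names. The function name, the sample data, and the number of resamples are all assumptions.

```python
import random
import statistics


def permutation_p_value(group_a, group_b, n_resamples=2000, seed=0):
    """Estimate a two-sided p-value for a difference in means by shuffling labels."""
    rng = random.Random(seed)
    observed = abs(statistics.mean(group_a) - statistics.mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)
        diff = abs(statistics.mean(pooled[:n_a]) - statistics.mean(pooled[n_a:]))
        if diff >= observed:
            extreme += 1
    # Add one to numerator and denominator so the estimate is never exactly zero
    return (extreme + 1) / (n_resamples + 1)


# Hypothetical measurements for a control group and a treatment group
control = [10.1, 9.8, 10.3, 10.0, 9.9, 10.2]
treatment = [11.0, 11.4, 10.9, 11.2, 11.1, 11.3]

p_value = permutation_p_value(control, treatment)
p_same = permutation_p_value(control, control)
print(p_value, p_same)
```

A small p-value says that a difference this large is rare when the group labels are assigned at random. It does not, on its own, show that the treatment caused the difference.
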

Unix commands
● Utilize UNIX commands to navigate file systems and
modify files.
● Learn to track changes.
● Manipulate files.

Git and GitHub


● Introduction to Git and GitHub.
● What are Git and GitHub?
● Installing Git.
● Creating a GitHub account and setting up your profile.
● Connecting your local repository to GitHub.
● Uploading your projects to GitHub.
● Creating and managing branches.

Guided project
● Analyze data with Python, write conclusions, and make
recommendations based on analytical findings.
● GitHub review.

Unit 2 Statistical Modeling

Phases of a data science project


● The organizing phase.
● The development phase.
● The deployment/operation phase.

Review of Exploratory Data Analysis (EDA)


● Importance of EDA
● Steps in EDA
● Non-visual data exploration
● Visual data exploration
● Putting it all together

Predictive modeling
● Overview of predictive modeling.
● Types of predictive modeling.

Regression with statsmodels


● Introduction to linear regression.
● Preparing data for modeling.
● Simple linear regression.
● Multiple linear regression.
● Metrics and model selection.
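
As a sketch of what simple linear regression computes, the example below fits a line by ordinary least squares using the closed-form formulas and reports R-squared as a fit metric. It uses plain Python rather than statsmodels so the arithmetic stays visible. The function names and the data points are hypothetical.

```python
def fit_simple_linear_regression(x, y):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope = covariance of x and y divided by the variance of x
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Share of the variance in y explained by the fitted line."""
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


# Hypothetical data: advertising spend (x) and sales (y)
spend = [1, 2, 3, 4, 5]
sales = [3.1, 4.9, 7.2, 8.8, 11.0]

slope, intercept = fit_simple_linear_regression(spend, sales)
r2 = r_squared(spend, sales, slope, intercept)
print(round(slope, 2), round(intercept, 2), round(r2, 4))
```

Multiple linear regression extends the same idea to several predictors, at which point a library such as statsmodels is the practical choice.
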

Logistic regression with statsmodels


● Logistic regression.
● Metrics and model selection.

Time series forecasting


● Introduction to time series.
● Time series data manipulation.
● Time series data visualization.
● Time series models (ARMA, ARIMA, SARIMA, SARIMAX).
● Metrics and model selection.

Unguided Project

Solve real-world problems with predictive models and forecast
the future for actionable decision-making.

Unit 3 Machine Learning Models

Build machine learning models. Explore the difference between supervised and
unsupervised learning via clustering, natural language processing, and neural networks.

Introduction to Machine Learning


● What is machine learning?
● Types of machine learning.
● Important machine learning concepts.
● Machine learning project design.

Supervised learning
● Data preprocessing for machine learning.
● Feature engineering.
● Feature selection.
● Data scaling.
● Machine learning algorithms.
● Building regression models.
● Building classification models.
● Ensembles.
● Hyperparameter tuning.
● Pipelines.
● Model evaluation and model selection.

Guided Project
● Solve real-world problems with regression models
through guidance.
● Solve real-world problems with classification models
through guidance.

Unsupervised learning
● Application of clustering in customer segmentation.
● Data preparation for unsupervised learning.
● Dimensionality reduction (PCA, t-SNE).
● Clustering (K-means clustering, Hierarchical clustering).
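
To show what K-means does when segmenting customers, here is a minimal pure-Python sketch of the algorithm. It alternates between assigning points to the nearest centroid and moving each centroid to the mean of its points. The function names, the fixed iteration count, and the customer data are assumptions made for illustration. In practice a library implementation would normally be used.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, k, n_iterations=20, seed=0):
    """Cluster points into k groups; returns (labels, centroids)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    labels = [0] * len(points)
    for _ in range(n_iterations):
        # Assignment step: label each point with its nearest centroid
        labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return labels, centroids


# Hypothetical customers: (monthly spend, visits per month)
customers = [(20, 2), (25, 3), (22, 2), (200, 15), (210, 18), (190, 16)]

labels, centroids = k_means(customers, k=2)
print(labels)
print(centroids)
```

The two features here are on very different scales, so spend dominates the distance. Scaling the features first is a common data preparation step for this reason.
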

Guided Project
● Apply unsupervised machine learning to segment
customers for target marketing.

Unit 4 ChatGPT

Get introduced to ChatGPT and understand prompt engineering in relation to data science.
Implement what you’ve learned (EDA, data cleaning, feature engineering, machine
learning) on ChatGPT.
● Read junk files in ChatGPT to have a glance at the data.
● Generate accurate prompts to assist ChatGPT in
providing you with responses (checking missing values,
etc.).
● Build and apply data analysis techniques to data.
● Practice and polish reports using insights from data.

Unguided Project

Choose a dataset of your own or use an existing real-world
dataset to explore diverse forms of analysis and create a
report using ChatGPT.

Unit 5 Deep Learning

Introduction to Deep Learning
● TensorFlow basics.
● Deep learning concepts (activation functions, cross-entropy,
backpropagation).

Deep Learning algorithms


● Multilayer Perceptron (MLP).
● Convolutional Neural Network (CNN) for image
classification.
● Recurrent Neural Network (RNN) for time series.
● Natural language processing (NLP) for sentiment analysis.

Unit 6 Deployment

Introduction to MLOps


● Overview of ML operations.
● Tools for ML operations.

Deploying ML models
● Turning code into production code (modular
programming).
● Introduction to Streamlit.
● Preparing models for deployment.
● Deploying models with Streamlit.

Capstone Project

Choose a data set to explore and model, providing a detailed
notebook of your technical approach and a public presentation
on your findings.
