CAPSTONE PROJECT
A capstone project is the final project of an academic program, typically integrating
all of the learning from the program. Students research a topic independently
to develop a deep understanding of the subject matter. It gives the student an
opportunity to integrate all of their knowledge and demonstrate it through a
comprehensive project.
Objectives of the Capstone Project:
Application of Learning: The goal is to apply theoretical
knowledge to practical, real-world issues. This
demonstrates your ability to translate academic concepts
into actionable solutions.
Example: If you’ve learned about neural networks, you
should be able to apply them to a project such as image
classification.
Communicating Solutions: It’s important to present
your findings in a way that non-technical stakeholders
can understand. Explaining complex algorithms in
simple, clear language is key.
Example: When explaining a model’s predictions to a
business audience, you would avoid jargon like
“backpropagation” and instead focus on how the model
benefits the business.
Choosing the Right Algorithm: You need to analyze the
problem carefully to determine the most appropriate
algorithm to solve it.
Example: For predicting stock prices (a regression task),
you might choose linear regression or a more complex
algorithm like a neural network, depending on the dataset
and problem complexity (a minimal regression sketch follows this list).
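As an illustrative (not prescriptive) sketch, the snippet below fits a simple linear regression with scikit-learn on a few made-up price values; any real stock-prediction project would need far more data and features.

```python
# A minimal, illustrative sketch (not a full stock-prediction system):
# fit a simple linear regression on a few made-up price values.
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical feature: previous day's closing price; target: next day's close
X = np.array([[101.2], [102.5], [103.1], [104.0], [105.3]])
y = np.array([102.5, 103.1, 104.0, 105.3, 106.1])

model = LinearRegression()
model.fit(X, y)

# Predict the next close from the latest observed close (illustration only)
print(model.predict(np.array([[106.1]])))
```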
THESE ARE SOME SIMPLE CAPSTONE PROJECT IDEAS
1. Stock Prices Predictor
2. Develop A Sentiment Analyzer
3. Movie Ticket Price Predictor
4. Student Results Predictor
5. Human Activity Recognition using Smartphone Dataset
6. Classifying humans and animals in a photo
1. Understanding The Problem
Artificial Intelligence is perhaps the most transformative technology available
today. At a high level, every AI
project follows these six steps:
1. Problem definition - Clearly define the issue you’re addressing.
2. Data gathering - Collect the right data for training your model.
3. Feature definition - Identify the key factors (features) that influence the
outcome.
4. Al model construction - Build and train a suitable AI model.
5. Evaluation & refinements - Assess the model’s performance and make
improvements.
6. Deployment - Implement the solution in a real-world setting.
2. Decomposing The Problem Through DT Framework
Design Thinking is a design methodology that provides a
solution-based approach to solving problems. It's extremely
useful in tackling complex problems that are ill-defined or
unknown.
The five stages of Design Thinking are as follows:
1. Empathize
Observe consumers to gain a deeper understanding of the problem
Observation must be made with empathy
Use the 5W1H method to ask the right questions:
Who, What, When, Where, Why & How
Empathy Map
It is a collaborative visualization used to clarify
our understanding of a specific type of user.
2. Define
Define the problem statement
Determine the cause of the problem
Brainstorm to generate possible solutions
Select the most suitable solution
3. Ideate
Gather ideas to solve the problem you defined
Brainstorm to arrive at various creative solutions
4. Prototype
A prototype is a simple experimental model for a proposed solution
Build representations (charts, models) of one or more ideas
5. Test
Test the prototype and gain user feedback
Iterate (design thinking is an iterative process)
Problem decomposition steps
1. Understand the problem and then restate the problem in your own
words
Know what the desired inputs and outputs are
Ask questions for clarification
2. Break the problem down into a few large pieces.
3. Break complicated pieces down into smaller pieces. Keep doing this until
all of the pieces are small.
4. Code one small piece at a time.
Think about how to implement it
Write the code/query
Test it ... on its own.
Fix problems, if any (see the sketch below)
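A hypothetical sketch of this workflow in Python: each small piece is its own function, tested on its own before the pieces are combined (the function names and data are invented for illustration).

```python
# Hypothetical illustration of "code one small piece at a time":
# each piece is a small function that can be written and tested on its own.

def clean_score(raw):
    """Convert a raw score string like ' 85 ' to an int, clamped to 0-100."""
    value = int(raw.strip())
    return max(0, min(100, value))

def average(scores):
    """Average a list of numeric scores."""
    return sum(scores) / len(scores)

# Test each small piece on its own
assert clean_score(" 85 ") == 85
assert clean_score("120") == 100
assert average([80, 90, 100]) == 90

# Only then combine the pieces into the larger solution
raw_scores = ["85", " 78", "92 "]
print(average([clean_score(s) for s in raw_scores]))
```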
3. Analytic Approach
Those who work in the domain of AI and Machine Learning solve problems and answer
questions through data every day. They build models to predict outcomes or discover
underlying patterns, all to gain insights leading to actions that will improve future outcomes.
Pick the analytic approach based on the type of question:
Descriptive
• Current status
Diagnostic (Statistical Analysis)
• What happened?
• Why is this happening?
Predictive (Forecasting)
• What if these trends continue?
• What will happen next?
Prescriptive
• How do we solve it?
• If the question is to determine probabilities of an action,
then a predictive model might be used.
• If the question is to show relationships, a descriptive
approach may be required.
• Statistical analysis applies to problems that require counts:
if the question requires a yes/no answer, then a
classification approach to predicting a response would be
suitable.
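For example, a minimal sketch of such a classification approach, using scikit-learn's logistic regression on made-up "hours studied vs. pass/fail" data:

```python
# A minimal sketch of a classification approach to a yes/no question,
# using logistic regression on made-up "hours studied -> pass/fail" data.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[1], [2], [3], [4], [5], [6], [7], [8]])  # hours studied
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])                  # 0 = fail, 1 = pass

clf = LogisticRegression()
clf.fit(X, y)

print(clf.predict([[4.5]]))        # the yes/no answer (predicted class)
print(clf.predict_proba([[4.5]]))  # the probability behind that answer
```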
DATA REQUIREMENT
• If the issue at hand is "a recipe," so to speak, and data is
"an ingredient," the data scientist must determine the following:
1. which ingredients are required?
2. how to source or collect them?
3. how to understand or work with them?
4. and how to prepare the data to meet the desired
outcome?
• Prior to undertaking the data collection and data preparation stages of the
methodology, it's vital to define the data requirements for decision-tree
classification. This includes identifying the necessary data content,
formats and sources for initial data collection.
• In this phase the data requirements are revised and decisions are made
as to whether or not the collection requires more or less data. Once the
data ingredients are collected, the data scientist will have a good
understanding of what they will be working with.
• Techniques such as descriptive statistics and visualization can be
applied to the data set to assess its content, quality, and initial insights
(a small sketch follows this list). Gaps in the data will be identified, and
plans to either fill them or make substitutions will have to be made.
• In essence, the ingredients are now sitting on the cutting board.
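A small sketch, using pandas on made-up data, of how descriptive statistics can surface the content, quality, and gaps mentioned above:

```python
# A small sketch (made-up data) of using descriptive statistics to assess
# the content, quality, and gaps in a dataset before modelling.
import pandas as pd

df = pd.DataFrame({
    "hours_studied": [5, 3, None, 8, 2],
    "test_score":    [85, 78, 92, None, 80],
})

print(df.describe())    # summary statistics: count, mean, std, min, max, ...
print(df.isna().sum())  # gaps in the data that must be filled or substituted

# One simple substitution strategy: fill gaps with the column mean
df_filled = df.fillna(df.mean(numeric_only=True))
print(df_filled)
```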
MODELING APPROACH
Modeling
• In what way can the data be visualized to get to the answer that is required?
Evaluation
• Does the model used really answer the initial question or does it need to be
adjusted?
• Data Modeling focuses on developing models that are either
descriptive or predictive.
• An example of a descriptive model might examine things like:
if a person did this, then they're likely to prefer that.
• A predictive model tries to yield yes/no, or stop/go type
outcomes. These models are based on the analytic approach
that was taken, either statistically driven or machine learning
driven.
• The data scientist will use a training set for predictive modelling. A
training set is a set of historical data in which the outcomes are already
known. The training set acts like a gauge to determine if the model
needs to be calibrated. In this stage, the data scientist will play around
with different algorithms to ensure that the variables in play are
actually required.
• The success of data compilation, preparation and modelling, depends
on the understanding of the problem at hand, and the appropriate
analytical approach being taken. The data supports the answering of
the question, and like the quality of the ingredients in cooking, sets the
stage for the outcome.
Constant refinement, adjustments and tweaking are necessary
within each step to ensure the outcome is one that is solid. The
framework is geared to do 3 things:
• First, understand the question at hand.
• Second, select an analytic approach or method to solve the
problem.
• Third, obtain, understand, prepare, and model the data.
The end goal is to move the data scientist to a point where a data
model can be built to answer the question.
Model Validation Techniques
Train-Test Split:
• In this technique, you split the data into two parts: a training
set and a test set. You train the model on the training data
and then test its performance on the test data. The
performance is usually measured using metrics like accuracy,
precision, or RMSE.
• Common Ratios: The typical split is 80% training and 20%
testing, but other ratios like 70-30% or even 50-50% can be
used depending on the dataset size.
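A minimal sketch of an 80/20 train-test split using scikit-learn, on a small synthetic dataset (the feature, target, and noise values are invented for illustration):

```python
# A minimal sketch of an 80/20 train-test split with scikit-learn,
# on a small synthetic dataset (feature, target, and noise are invented).
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error

X = np.arange(1, 21).reshape(-1, 1)              # e.g. hours studied
y = 5 * X.ravel() + np.random.normal(0, 2, 20)   # e.g. noisy test scores

# 80% of the rows for training, 20% held out for testing
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

model = LinearRegression().fit(X_train, y_train)
print(mean_squared_error(y_test, model.predict(X_test)))  # test-set MSE
```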
Cross-Validation:
• Cross-validation involves splitting the data into ‘k’ folds, training
the model on some folds, and testing it on the remaining fold.
This process repeats for each fold, and the results are
averaged for more reliable performance metrics.
• Example: In a 5-fold cross-validation, the data is divided into 5
equal parts. Each time, 4 parts are used for training, and 1 part
is used for testing. This is repeated 5 times, and the model’s
performance is averaged across all runs.
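A minimal sketch of 5-fold cross-validation with scikit-learn's cross_val_score, reusing the same kind of synthetic data as above:

```python
# A minimal sketch of 5-fold cross-validation with scikit-learn:
# every fold takes one turn as the test set, and the scores are averaged.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LinearRegression

X = np.arange(1, 21).reshape(-1, 1)
y = 5 * X.ravel() + np.random.normal(0, 2, 20)

scores = cross_val_score(LinearRegression(), X, y,
                         cv=5, scoring="neg_mean_squared_error")
print(-scores)         # MSE on each of the 5 folds
print(-scores.mean())  # averaged for a more reliable estimate
```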
Model Quality Metrics – Measuring Success
When building AI models, especially regression models, it’s crucial to evaluate
how well your model’s predictions match the actual data. Two commonly used
metrics for this purpose are Mean Squared Error (MSE) and Root Mean
Squared Error (RMSE).
1. Mean Squared Error (MSE)
• Definition: MSE measures the average of the squares of the errors—that is,
the average squared difference between the predicted values and the actual
values.
• Formula: MSE = (1/n) Σᵢ (yᵢ − ŷᵢ)², where yᵢ is the actual value, ŷᵢ is the
predicted value, and n is the number of observations.
Interpretation:
• A lower MSE indicates better model performance; it means the
predictions are closer to the actual values.
• Since errors are squared, larger errors have a disproportionately
large effect on MSE, making it sensitive to outliers.
Example:
• Suppose we’re predicting the test scores of students based on the
number of hours they studied.
• Actual Test Scores: [85, 78, 92, 75, 80]
• Predicted Test Scores: [83, 76, 95, 70, 82]
Calculating MSE:
Calculate the squared errors:
Student 1: (83 – 85)² = (-2)² = 4
Student 2: (76 – 78)² = (-2)² = 4
Student 3: (95 – 92)² = (3)² = 9
Student 4: (70 – 75)² = (-5)² = 25
Student 5: (82 – 80)² = (2)² = 4
Sum the squared errors:
Total = 4 + 4 + 9 + 25 + 4 = 46
Calculate MSE:
MSE = 46/5 = 9.2
Interpretation:
• The Mean Squared Error is 9.2, indicating the average squared difference
between the predicted and actual test scores.
2. Root Mean Squared Error (RMSE)
Definition: RMSE is the square root of the MSE. It provides the error metric in
the same units as the target variable, making it more interpretable.
• Formula: RMSE = √MSE = √[(1/n) Σᵢ (yᵢ − ŷᵢ)²]
Example:
Using the MSE calculated above:
1. Calculate RMSE:
RMSE = √9.2 ≈ 3.033
Interpretation:
•The RMSE of approximately 3.033 means that, on average, the model’s
predictions are about 3 points off from the actual test scores.
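The worked example above can be checked with a few lines of NumPy (same actual and predicted scores as in the example):

```python
# Checking the worked example above with NumPy: MSE = 9.2, RMSE ≈ 3.033.
import numpy as np

actual    = np.array([85, 78, 92, 75, 80])
predicted = np.array([83, 76, 95, 70, 82])

mse = np.mean((predicted - actual) ** 2)
rmse = np.sqrt(mse)

print(mse)   # 9.2
print(rmse)  # ≈ 3.033
```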