Ajeenkya DY Patil School of Engineering, Pune
Department of Artificial Intelligence & Data Science Engineering
AY: 2024-25 Class: TE SEM-I
A Presentation
on
AI Assistant for the Handicapped
Presented by:
Gopika Fatthepurkar
Pranav Gadage
Soham Chattar
Guided by:
Rohan Satpute
Agenda
Introduction
Motivation
Problem Statement
Literature Survey
System Architecture/Algorithm
Conclusion
Future Scope
References
Introduction
The project demonstrates the integration of advanced speech
recognition, natural language processing, and text-to-speech
technologies to create a voice-activated assistant. The assistant is
designed to recognize user speech, process it using OpenAI’s GPT-3,
and provide real-time responses through voice. This system leverages
the power of GPT-3 for intelligent, human-like conversations and
incorporates speech-to-text and text-to-speech components to ensure
smooth, hands-free interaction.
Motivation
The primary goal of this project is to develop a voice-activated assistant that
can recognize spoken commands, process them using OpenAI’s GPT-3 model, and
then respond with human-like speech. The motivation behind this project stems from
the desire to make technology more accessible and intuitive through natural language
interaction. In addition, there is a growing trend in the tech industry towards voice
interfaces, with virtual assistants such as Siri, Alexa, and Google Assistant becoming
an integral part of everyday life. However, while these assistants are functional, many
are still limited in their conversational depth and understanding.
Voice interfaces provide a significant advantage in terms of ease of use,
especially for individuals who may face physical challenges that make typing or using
traditional input devices difficult. For example, people with disabilities, the elderly, or
those in hands-free environments can benefit greatly from a system like this. By
integrating speech-to-text and text-to-speech technologies, the system becomes
accessible to a wider audience.
Problem Statement
Voice assistants like Siri, Alexa, and Google Assistant excel at simple tasks but
struggle with complex, context-aware interactions, causing user frustration. The
challenge is to develop a system that can handle diverse, spontaneous requests with
relevant, intelligent responses.
Literature Survey

1. Veeresh Ambe; Prayag Gokhale; Vaishnavi Patil; Rajamani M. Kulkarni; Preetam R. Kalburgimath
Problem: As per the World Health Organization (WHO), 285 million people are visually impaired, of whom 39 million are completely blind. Though enough remedies exist for assisting visually impaired individuals to read, there is a need for an intelligent text reader that is economical, accurate, and easily accessible for day-to-day activities.
Approach: The paper proposes an intelligent text reader built in Python on a Raspberry Pi module with a connected camera that captures the input image; the image is enhanced using image-processing techniques.
Results: The text is converted to speech by a Python-based TTS (text-to-speech) unit embedded in the Raspberry Pi; finally, the audio output is fed to an audio amplifier to be read out.

2. Hasan U. Zaman; Saif Mahmood; Sadat Hossain; Iftekharul Islam Shovon
Problem: The idea of this paper is to build an automated virtual reader; in the modern era there is an urge for an automated reader that is cost-effective, accurate, and portable at the same time.
Approach: The whole bodywork is integrated with Optical Character Recognition (OCR), Text-to-Speech (TTS), and a speaker.
Results: Text-to-speech conversion can also be done in MATLAB, but that would not be portable and user-friendly.

3. Elan Markowitz; Zheng Chen; Ziyan Jiang; Fan Yang; Greg Ver Steeg; Xing Fan; Aram Galstyan
Problem: Multi-domain recommendation leverages users' interactions in previous domains to improve recommendations in a new one.
Approach: The authors propose to unify these approaches, using information from interactions in other domains as well as external knowledge graphs to make predictions in a new domain that would not be possible with either information source alone.
Results: The proposed methods proved effective in a scalable, real-world AI assistant use case, bringing significant benefits to personalized AI assistants.

4. Naveenkumar T. Rudrappa; Mallamma V. Reddy; M. Hanumanthappa
Problem: Speech processing embeds the recording of speech, which is a huge container of private, confidential, and business records, used for a wide variety of applications such as health care, customer services, and individual identification.
Approach: Research on speech processing mandates recording, storing, playing, and analyzing a wide variety of spoken languages, specifically in the Indian context.
Results: The method utilizes operating-system functionality to activate the microphone for recording, the hard disk for storage, and a speaker for voice output.
Algorithm/Mathematical Model
Algorithm for Voice-Activated Assistant
1. Speech-to-Text (STT):
- Capture audio from the microphone.
- Convert audio to text using a speech recognition model (e.g., Google Speech API).
2. Natural Language Understanding (NLU):
- Tokenize the text.
- Recognize entities (e.g., dates, locations).
- Identify user intent (e.g., query, command).
- Analyze context.
3. Contextual Query Processing with GPT-3:
- Format the input as a prompt for GPT-3.
- GPT-3 generates a response based on the input.
4. Text-to-Speech (TTS):
- Convert GPT-3’s response into speech using a TTS engine.
- Play the audio response to the user.
5. Continuous Loop:
- Wait for new input.
- Repeat the process until the user ends the session.
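The five steps above can be sketched as a single control loop. The following is an illustrative sketch, not the project's actual code: the STT, GPT-3, and TTS components are passed in as callables (in the real system these would wrap, for example, the Python SpeechRecognition library, the OpenAI API, and a TTS engine), so only the loop's logic is shown. The names `build_prompt`, `run_assistant`, and `stop_phrase` are hypothetical.

```python
# Illustrative sketch of the assistant's main loop (steps 1-5).
# listen(), respond(), and speak() stand in for the real STT, GPT-3,
# and TTS components; they are injected so the control flow can run
# without a microphone or an API key.

def build_prompt(text, history):
    """Step 3: format the user's input (plus recent turns) as a GPT-3 prompt."""
    context = "\n".join(history[-3:])        # keep only the last few turns
    return f"{context}\nUser: {text}\nAssistant:".lstrip()

def run_assistant(listen, respond, speak, stop_phrase="goodbye"):
    """Capture -> process -> answer -> speak, until the user ends the session."""
    history = []
    turns = 0
    while True:                               # step 5: continuous loop
        text = listen()                       # step 1: audio -> text
        if not text:                          # nothing recognized, listen again
            continue
        turns += 1
        if stop_phrase in text.lower():       # user ends the session
            speak("Goodbye!")
            return turns
        reply = respond(build_prompt(text, history))  # steps 2-3: NLU + GPT-3
        history += [f"User: {text}", f"Assistant: {reply}"]
        speak(reply)                          # step 4: text -> speech
```

With stubbed components the loop can be exercised directly, for example by feeding transcripts from a list and collecting the spoken output, which makes the pipeline testable before any audio hardware is attached.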
Conclusion
The voice-activated assistant system leverages state-of-the-art technologies such as
speech recognition, natural language processing, and text-to-speech synthesis to create a
seamless and interactive user experience. By integrating powerful models like OpenAI’s
GPT-3, the system is capable of understanding and generating human-like responses to a
wide variety of voice commands, enabling dynamic conversations with users. The
continuous loop of capturing audio, processing it for meaning, generating an appropriate
response, and then converting that response back into speech allows for natural and
intuitive interactions. Despite the impressive advancements, challenges remain in
improving context retention, handling ambiguous queries, and ensuring ethical responses.
However, the future of such voice-driven systems is promising, with potential applications
across numerous industries, from customer service and healthcare to smart home
management and education. As research progresses, these systems will become
increasingly adaptive, intelligent, and capable of understanding complex, real-world
interactions.
Future Scope
The future scope of voice-activated assistant systems is vast and continually evolving,
driven by advancements in artificial intelligence, machine learning, and natural language
processing. As speech recognition accuracy improves, these systems will be able to
understand a wider range of accents, dialects, and languages, making them more
accessible to a global audience. Furthermore, with the integration of more advanced
contextual understanding and emotional intelligence, future systems will be able to
engage in more meaningful and empathetic conversations. Enhanced voice assistants
could also have broader applications in various fields such as healthcare, where they can
assist with diagnostics, patient monitoring, and elderly care, or in education, providing
personalized tutoring and learning experiences. With advancements in multi-modal AI,
where voice assistants integrate with visual data (e.g., via cameras or augmented
reality), they could expand their functionalities to handle more complex tasks, such as
object recognition or real-time translation. The future holds great promise for
developing more intuitive, secure, and intelligent voice-driven interfaces, enabling them
to become an integral part of daily life and transforming industries worldwide.
References
1. OpenAI GPT-3 Documentation
OpenAI. (2021). "GPT-3: Language Models are Few-Shot Learners." Retrieved from
[Link]
2. Google Cloud Speech-to-Text API Documentation
Google Cloud. (2021). "Speech-to-Text Documentation." Retrieved from
[Link]
3. TechCrunch: "The Future of Voice Assistants: Trends to Watch" (2022). Retrieved from
[Link]
4. Tacotron 2: Generating Human-like Speech from Text
Wang, Y., et al. (2017). "Tacotron: Towards End-to-End Speech Synthesis." Google Research. Retrieved
from [Link]
5. Python SpeechRecognition Library
SpeechRecognition Documentation. (2021). "SpeechRecognition: Recognizing Speech from Audio."
Retrieved from [Link]