Computer Vision Introduction

Computer vision is a subfield of artificial intelligence that deals with acquiring, processing, analyzing, and making sense of visual data such as digital images and videos. It is also one of the forms of artificial intelligence that we encounter most often in our daily routines.

Computer vision draws on our understanding of the human vision system and trains computer systems to interpret and gain a high-level understanding of digital images and videos. In the early days, building a machine with human-like intelligence was only a dream, but advances in artificial intelligence and machine learning have made it possible. Intelligent systems have now been developed that can "see" and interpret the world around them much as human eyes do. The fiction of yesterday has become the fact of today.

Artificial intelligence, more broadly, is the branch of computer science concerned with creating smart, intelligent systems that can behave and think like the human brain. Put simply, if artificial intelligence enables computer systems to think intelligently, computer vision makes them capable of seeing, analyzing, and understanding.

History of Computer Vision


Computer vision is not a new technology; scientists and experts have been trying to develop machines that can see and understand visual data for roughly six decades. The evolution of computer vision can be summarized as follows:

 1959: The first experiment with computer vision was initiated in 1959, where they
showed a cat as an array of images. Initially, they found that the system reacts first to
hard edges or lines, and scientifically, this means that image processing begins with
simple shapes such as straight edges.
 1960: In 1960, artificial intelligence was added as a field of academic study to solve
human vision problems.
 1963: This was another great achievement for scientists when they developed
computers that could transform 2D images into 3-D images.
 1974: This year, optical character recognition (OCR) and intelligent character
recognition (ICR) technologies were successfully discovered. The OCR has solved
the problem of recognizing text printed in any font or typeface, whereas ICR can
decrypt handwritten text. These inventions are one of the greatest achievements in
document and invoice processing, vehicle number plate recognition, mobile
payments, machine translation, etc.
 1982: In this year, the algorithm was developed to detect edges, corners, curves, and
other shapes. Further, scientists also developed a network of cells that could recognize
patterns.
 2000: In this year, scientists worked on a study of object recognition.
 2001: The first real-time face recognition application was developed.
 2010: The ImageNet data set became available to use with millions of tagged images,
which can be considered the foundation for recent Convolutional Neural Network
(CNN) and deep learning models.
 2012: CNN has been used as an image recognition technology with a reduced error
rate.
 2014: COCO has also been developed to offer a dataset for object detection and
support future research.

How does Computer Vision Work?


Computer vision is a set of techniques for extracting information from visual data such as images and videos. Although computer vision is often said to work the way the human eye and brain do, how the brain actually operates and solves visual object recognition remains one of the biggest open questions for IT professionals.
At a certain level, computer vision is all about pattern recognition: machine systems are trained to recognize patterns in visual data such as images and videos.

First, a vast amount of labeled visual data is provided to the machine to train it. This labeled data enables the machine to analyze the patterns across all the data points and relate them to the labels. For example, suppose we provide millions of labeled dog images. The computer learns from this data, analyzing each photo's shapes, the distances between shapes, colors, and so on, identifies the patterns common to dogs, and builds a model. As a result, the model can then accurately detect whether each new input image contains a dog, as sketched below.
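
To make this concrete, here is a minimal training sketch in Python (using PyTorch and torchvision, one common choice among many). The folder layout data/train/dog and data/train/not_dog, the tiny network, and all hyperparameters are illustrative assumptions, not part of the original text.

```python
# Minimal sketch: training an image classifier on labeled data.
# Assumes a hypothetical folder layout data/train/<label>/*.jpg.
import torch
from torch import nn
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

transform = transforms.Compose([transforms.Resize((64, 64)), transforms.ToTensor()])
train_set = datasets.ImageFolder("data/train", transform=transform)  # labels come from folder names
loader = DataLoader(train_set, batch_size=32, shuffle=True)

# A very small convolutional model: enough to illustrate the training loop.
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(32 * 16 * 16, 2),               # two classes: dog / not_dog
)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(5):                        # repeated passes over the labeled data
    for images, labels in loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
```

Once trained this way, calling the model on a new image batch returns two scores, and the higher one indicates whether the model believes the image contains a dog.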

Computer vision applications use input from sensing devices, artificial intelligence, machine learning, and deep learning to replicate the way the human vision system works. Computer vision applications run on algorithms that are trained on massive amounts of visual data or images in the cloud. They recognize patterns in this visual data and use those patterns to determine the content of other images.
How an image is analyzed with computer vision
 A sensing device captures an image. The sensing device is often just a camera, but it could be a video camera, a medical imaging device, or any other type of device that captures an image for analysis.
 The image is then sent to an interpreting device. The interpreting device uses pattern recognition to break the image down, compare the patterns in the image against its library of known patterns, and determine whether any of the content in the image is a match. The pattern could be something general, like the appearance of a certain type of object, or it could be based on unique identifiers such as facial features.
 A user requests specific information about the image, and the interpreting device provides the information requested based on its analysis of the image. These three steps are sketched in code below.
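
The three steps above can be sketched in a few lines of Python, again only as an illustration: OpenCV stands in for the sensing device, and a pretrained torchvision classifier stands in for the interpreting device with its library of known patterns (the classes it was trained on). The camera index and model choice are assumptions.

```python
# Sketch of the capture -> interpret -> report steps described above.
import cv2
import torch
from torchvision import models

weights = models.ResNet18_Weights.DEFAULT
model = models.resnet18(weights=weights).eval()
preprocess = weights.transforms()                  # resizing/normalization expected by the model

# 1. A sensing device captures an image (here: the default camera).
camera = cv2.VideoCapture(0)
ok, frame = camera.read()
camera.release()
if not ok:
    raise RuntimeError("No frame captured from the camera")

# 2. The interpreting device matches the image against its library of known patterns
#    (the classes the network was trained on).
rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
batch = preprocess(torch.from_numpy(rgb).permute(2, 0, 1)).unsqueeze(0)
with torch.no_grad():
    scores = model(batch).softmax(dim=1)

# 3. The user requests information about the image: here, the most likely class.
top = scores.argmax(dim=1).item()
print(weights.meta["categories"][top], float(scores[0, top]))
```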

Deep learning and computer vision


 Modern computer vision applications are shifting away from statistical methods for analyzing images and increasingly relying on what is known as deep learning. With deep learning, a computer vision application runs on a type of algorithm called a neural network, which allows it to deliver even more accurate analyses of images. In addition, deep learning allows a computer vision program to retain the information from each image it analyzes, so it gets more and more accurate the more it is used.

Computer vision capabilities


There are four main functions through which computer vision programs process images and return information:

Object classification
Object identification
Object tracking
Optical character recognition
1. Object classification

The system classifies the objects in an image according to a defined category. For example, with object classification, a computer could distinguish people from objects in a photo and determine how many people appear in the photo.

2. Object identification
The system identifies a particular object in a photo, video, or image. For
example, with object identification, the system would be able to not only
distinguish people in a photo, but also analyze their appearance to determine the
identity or traits of those people.
3. Object tracking

The system analyzes a video to process the location of a moving object over
time. For example, with object tracking, a parking lot surveillance camera could
identify cars in a parking lot and provide information about the location and
movements of those cars over time.
4. Optical character recognition

The system identifies letters and numbers in images and converts that text into machine-encoded text that can be read by other computer applications or edited by users.
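
As a small illustration of OCR, the sketch below uses the open-source Tesseract engine through the pytesseract package; this is just one of many OCR tools, and the file name invoice.png is a hypothetical example. Tesseract itself must be installed on the system.

```python
# Minimal OCR sketch using the Tesseract engine via pytesseract.
from PIL import Image
import pytesseract

image = Image.open("invoice.png")          # scanned document or photo of text
text = pytesseract.image_to_string(image)  # letters and numbers -> machine-encoded text
print(text)
```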

How Computer Vision Is Used

Computer vision systems use machine learning and deep learning models to train the system to recognize aspects of an image or video and make predictions about them. Types of computer vision models include:

 Image classification for inspecting an image and assigning it a class label based on the content. For example, an image classification model can be used to predict which images contain a dog, cat, or angry customer.
 Image segmentation for identifying objects and extracting them from their background, such as isolating a tumor from surrounding brain tissue in X-ray results.
 Object detection for scanning images or videos and finding target objects. Object detection models commonly highlight multiple objects simultaneously and can be used for tasks such as identifying items on shelves for improved inventory management or spotting anomalies in items on a production line (see the detection sketch after this list).
 Object tracking for tracking the movements of detected objects as they navigate an environment. For example, object tracking can be used in autonomous driving to track pedestrians on sidewalks or as they cross the road.
 Feature extraction for isolating useful characteristics captured in an image or video and sharing them with a second AI algorithm, such as a search-and-retrieve image matching system. For example, feature extraction can be used to automate traffic monitoring and incident detection.
 Optical character recognition for extracting and converting text from an image into a format that a machine can read. This is often used in banking and healthcare to process important documents and patient records.
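
As an illustration of the object detection capability listed above, the following sketch runs a pretrained detection model from torchvision (a Faster R-CNN trained on the COCO dataset mentioned in the history section). The image file name and the confidence threshold are illustrative assumptions.

```python
# Sketch of object detection with a pretrained model.
import torch
from PIL import Image
from torchvision import models
from torchvision.transforms.functional import to_tensor

weights = models.detection.FasterRCNN_ResNet50_FPN_Weights.DEFAULT
model = models.detection.fasterrcnn_resnet50_fpn(weights=weights).eval()

image = to_tensor(Image.open("shelf.jpg").convert("RGB"))
with torch.no_grad():
    detections = model([image])[0]          # boxes, labels, scores for one image

labels = weights.meta["categories"]
for box, label, score in zip(detections["boxes"], detections["labels"], detections["scores"]):
    if score > 0.8:                          # keep only confident detections
        print(labels[int(label)], [round(v, 1) for v in box.tolist()], round(float(score), 2))
```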

Computer Vision Applications across Industries


Computer vision is one of the most advanced innovations in artificial intelligence and machine learning. With the growing demand for AI and machine learning technologies, computer vision has become a center of attention across many sectors. It greatly impacts different industries, including retail, security, healthcare, automotive, agriculture, and more.

Computer vision enables a range of new use cases, helping companies across industries solve real-world problems like reducing operational costs, unlocking business automation, and creating new services or revenue streams. Here are the top industries using computer vision and the exciting ways they apply this technology.

Industrial Automation and Manufacturing

Manufacturers use computer vision to enable automation, which helps make production processes more efficient, reduce human error, improve worker safety, and produce higher outputs at lower costs. Some common applications of computer vision in manufacturing include:
 Automated product inspection: Visual product
inspections are critical to quality control. By
automating optical inspections using production line
cameras, AI models for defect classification and
anomaly detection, and edge computing,
manufacturers can improve quality assurance
accuracy and speed.
 Safety monitoring: Computer vision can be used to
monitor factory floors to help ensure employee safety.
For example, analysis of real-time video can help
identify and alert staff to accidents or spills or detect
access to restricted, hazardous areas.

Healthcare

From preventive care to cancer treatment planning, computer vision is used by healthcare organizations in a variety of ways to help improve patient outcomes, enhance accuracy, accelerate disease detection, and more. Examples of how computer vision is being applied in healthcare include:

 Medical imaging: Equipping CT scanners, X-ray systems, endoscopy cameras, and other medical imaging technology with computer vision systems can help enable rapid processing of massive amounts of data, streamlined workflows, and accurate and efficient image evaluation. Deep learning technology is being applied to assist with whole slide imaging in digital pathology.
 Remote patient monitoring: Cameras and sensors
equipped with computer vision applications can be
used to collect and analyze data about patient
movement, such as gait or body positioning, to
identify deviations from established norms and alert
care team members to possible urgent needs.
Retail

From understanding where to place products, to the optimal time to restock inventory, to in-store customer behavior tracking, computer vision can help retailers discover powerful insights about their operations for more-informed business decision-making. Some applications of computer vision in retail include:

 Loss prevention: Computer vision models can analyze data from existing store cameras or self-checkout kiosks to identify suspicious behavior and send real-time alerts to managers so they can intervene and help stop fraud.
 Touchless self-service checkout terminals:
Retailers looking to increase efficiency and enhance
the customer experience can leverage 3D smart scan
technology and computer vision models to capture,
detect, and recognize nonbarcoded food items,
enabling quick and convenient checkout with minimal
staff intervention.

Smart Cities

Smart city technologies can help gather video feeds from street cameras so city leaders can make more-informed operational decisions to help improve citizen safety, mobility, and quality of life. Here are a few ways computer vision can be applied in smart cities:

 Traffic management: City governments can implement computer vision systems to monitor and analyze street intersections and traffic patterns and detect and track vehicles and pedestrians to optimize traffic flow and help enhance safety at intersections.
 Infrastructure maintenance: Computer vision
models can be trained to recognize road and bridge
problems, such as potholes or cracked pavement,
throughout a city or entire county and inform crews of
locations needing maintenance.

How Computer Vision Works


Computer vision combines components like edge
computing, cloud computing, software, and AI deep
learning models to enable computers to “see” data
collected from cameras and videos; quickly recognize
specific objects, people, and patterns; make predictions
about them; and take action if necessary.

The Role of Convolutional Neural Networks

Computer vision systems use deep learning models from a family of algorithms known as convolutional neural networks (CNNs) to guide image processing and analysis. These deep learning models analyze the RGB values embedded in digital image pixels to detect identifiable patterns. CNNs can be developed to evaluate pixels based on a wide range of features, including color distribution, shape, texture, and depth, and to accurately recognize and classify objects.
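
The sketch below shows, in simplified form, what such a network can look like in code: convolutional layers scan the RGB pixel values for local patterns such as edges and textures, pooling layers downsample the result, and a final fully connected layer maps the extracted features to class scores. The layer sizes and class count are illustrative, not taken from the text.

```python
# A minimal convolutional neural network sketch (PyTorch).
import torch
from torch import nn

class SmallCNN(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, padding=1),   # 3 input channels: R, G, B
            nn.ReLU(),
            nn.MaxPool2d(2),                              # downsample, keep strongest responses
            nn.Conv2d(16, 32, kernel_size=3, padding=1),  # deeper layers combine simple patterns
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 8 * 8, num_classes)  # assumes 32x32 input images

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)
        return self.classifier(torch.flatten(x, 1))

scores = SmallCNN()(torch.randn(1, 3, 32, 32))  # one 32x32 RGB image -> 10 class scores
```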

Training Computer Vision Models

Before a computer vision system can be put to work, data scientists and developers must train the system's deep learning model for its specific use case. This requires inputting large amounts of application-specific data that the model can use to recognize what it has been developed to identify. For example, for a computer vision application designed to recognize a dog, the model must first learn what a dog looks like. It does this by being trained on thousands, maybe even millions, of images of dogs of different breeds, sizes, colors, and characteristics.
Most commonly, training takes place in data centers or
cloud environments. For especially complex training
initiatives, GPUs and AI accelerators can be applied to
expedite the process and better handle the increased
number of parameters involved. Once the model has
completed the training phase, it has the knowledge needed
to interpret and infer information from digital images. The
model may also be further fine-tuned or retrained over
time.
It’s also important to note that those seeking to build
computer vision solutions can use off-the-shelf,
foundational models as starting points for fine-tuning to
accelerate development times and avoid starting from
scratch.
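
A common way to apply this fine-tuning approach, sketched below under illustrative assumptions (a ResNet-50 backbone and five application-specific classes), is to freeze the pretrained feature extractor and retrain only a new final layer on the application's own labeled data.

```python
# Sketch of fine-tuning an off-the-shelf pretrained model for a specific use case.
import torch
from torch import nn
from torchvision import models

model = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)  # off-the-shelf starting point

for param in model.parameters():                # freeze the pretrained feature extractor
    param.requires_grad = False

model.fc = nn.Linear(model.fc.in_features, 5)   # new head for the application's own classes

# Only the new head's parameters are updated during fine-tuning.
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()
# ...then train on the application-specific labeled data, as in the earlier training sketch.
```

Freezing the backbone keeps training fast and reduces the amount of labeled data needed; unfreezing more layers is an option when more application-specific data is available.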

Deploying Computer Vision Models

Once trained, computer vision models can be deployed to computer systems to perform inference and interpret conditions in the field, continuously assessing image and video data to extract insights and information. While computer vision solutions can run inferencing workloads in the cloud or data center, many organizations today are exploring edge AI applications, where computer vision models run closer to where data is generated, on lightweight, optimized edge hardware or embedded devices (a minimal export sketch follows the list of benefits below).
Moving AI inferencing capabilities closer to the edge can
offer several key benefits:

 Increased speed and lower latency: Moving data processing and analysis to where it is generated helps speed system response, enabling faster transactions and better experiences that are vital in many computer vision applications.
 Improved network traffic
management: Minimizing the amount of data sent
over a network to the cloud can reduce the bandwidth
and costs of transmitting and storing large volumes of
data.
 Greater reliability: The amount of data that
networks can transmit simultaneously is limited. For
locations with subpar internet connectivity, storing
and processing data at the edge improves reliability.
 Enhanced security: With proper implementation, an
edge computing solution may increase data security
by limiting data transmission over the internet.
 Privacy compliance requirements: Some
governments, customers, or industries may require
that data being used for computer vision applications
remain in the jurisdiction where it was created. Edge
computing can help businesses stay compliant with
such rules and regulations.
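
As one illustrative path to edge deployment (among many possible runtimes and optimizations), a trained model can be exported to the ONNX format and executed locally with ONNX Runtime on the edge device, so image data never needs to leave it. The file name and input size below are assumptions.

```python
# Sketch: export a trained model to ONNX and run it locally with ONNX Runtime.
import numpy as np
import onnxruntime as ort
import torch
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()
example = torch.randn(1, 3, 224, 224)                       # dummy input defining the shape
torch.onnx.export(model, example, "model.onnx", input_names=["image"], output_names=["scores"])

# On the edge device: load the exported file and run inference locally.
session = ort.InferenceSession("model.onnx")
scores = session.run(None, {"image": np.random.rand(1, 3, 224, 224).astype(np.float32)})[0]
print(scores.shape)                                         # (1, 1000) class scores
```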
