0% found this document useful (0 votes)
61 views26 pages

Lecture1 Introduction CVML

This document discusses an introduction to machine learning and computer vision using machine learning. It provides an overview of the syllabus which includes topics like convolutional neural networks, recurrent neural networks, and using deep learning techniques like CNNs and RNNs to solve computer vision problems. It also lists some reference books and papers related to the topics covered in the course.

Uploaded by

Michael Corleone
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
0% found this document useful (0 votes)
61 views26 pages

Lecture1 Introduction CVML

This document discusses an introduction to machine learning and computer vision using machine learning. It provides an overview of the syllabus which includes topics like convolutional neural networks, recurrent neural networks, and using deep learning techniques like CNNs and RNNs to solve computer vision problems. It also lists some reference books and papers related to the topics covered in the course.

Uploaded by

Michael Corleone
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 26

Computer Vision Using Machine

Learning

Lecture Slides
Acknowledgement
We thankfully acknowledge following Professors for using their
Lecture Slides, Tutorials, video lectures for preparing the course
materials

• Prof. Andrew Ng, Stanford University


• Prof. Fei Fei Li, Stanford Vision Lab
• Prof. Mubarak Shah, UCF
• Prof. Geoffrey E. Hinton, University of Toronto
• Prof. P K Biswas, IIT Kharagpur
and many more…
INTRODUCTION
Syllabus
• Introduction to Computer Vision: Image representation and filtering,
Interest point detection, SIFT, HOG, Object recognition /
categorization, Segmentation, Bag of words;

• Introduction to Machine Learning: Linear regression, Logistic


regression, Over-fitting and Regularization, Support Vector
Machine; Neural Network

• Convolution Neural Network: Architectures, Auto-encoders, Deep


Belief Network, Restricted Boltzmann Machine; Beyond image
classification: localization, detection, segmentation

• Recurrent Neural Networks and LSTM

• Computer vision problems using deep learning aka CNN , RNN etc.
Books
• Text Books:

– Image and video text recognition using convolutional neural networks:


Study of new CNNs architectures for binarization, segmentation and
recognition of text images, by Zohra Saidane, Publisher: LAP
LAMBERT Academic Publishing (April 5, 2011)

– Face Image Analysis with Convolutional Neural Networks, by Stefan


Duffner , Publisher: GRIN Verlag (October 18, 2013)

– Fundamentals of Deep Learning, Designing Next-Generation Artificial


Intelligence Algorithms By Nikhil Buduma, Publisher: O'Reilly Media
Books
• Reference Books/Papers:
– Neural Networks and Deep Learning, by Michael Nielsen, Online Book:
http://neuralnetworksanddeeplearning.com.

– ImageNet Classification with Deep Convolutional Neural Networks, Alex


Krizhevsky, Ilya Sutskever, Geoffrey E Hinton, NIPS 2012.

– Going Deeper with Convolutions, Christian Szegedy, Wei Liu, Yangqing Jia,
Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent
Vanhoucke, Andrew Rabinovich, 19-Sept-2014.

– Learning Hierarchical Features for Scene Labeling, Clement Farabet,


Camille Couprie, Laurent Najman and Yann LeCun, IEEE Transactions on
Pattern Analysis and Machine Intelligence, 2013.

– Learning Convolutional Feature Hierachies for Visual Recognition, Koray


Kavukcuoglu, Pierre Sermanet, Y-Lan Boureau, Karol Gregor, Michaël
Mathieu and Yann LeCun, Advances in Neural Information Processing Systems
(NIPS 2010), 23, 2010.

– Ciresan, Dan, Ueli Meier, and Jürgen Schmidhuber. “Multi-column deep


neural networks for image classification.” Computer Vision and Pattern
Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012.
Grades
• Mid-Semester
– Percentage of grade : 20%

• End-Semester
– Percentage of grade : 30%

• Assignments:
– Percentage of grade : 50%

• Attendance below 75% --- F Grade


• Random Attendance
What is (computer) vision?
• When we “see” something, what does it mean?

• An color image, it is just an matrix of RGB values


(color pixels)

• Semantic definition of the image by computer


Task

Interpretation:
A Bird / blue sky and green land /
clouds / sign of peace
Why important now?
• In every 60 seconds, approximately 150 hours video
data are uploaded

• For a human observer, its almost impossible to


annotate such large amount of data

• Intelligent machine vision such as automatic image


or video understanding is essential

• A large space of applications


Application Areas
Field of study that gives computers the
ability to learn without being explicitly
programmed.

By: Arthur Samuel (1959)


Applications
• Database mining:
– Web click data, medical records, biology, engineering etc.

• Application that can’t be programmed by hand


– Autonomous helicopter, handwriting recognition, NLP,
computer vision

• Self Customizing Programs Recommendation


– Amazon, Netflix

• Understanding human learning


– Brain, real AI
Learning Problem
• Well-posed learning problem:
A computer program is said to learn from
experience E with respect to some task T and some
performance measure P, if its performance on T, as
measured by P, improves with experience E.

– By Tome Mitchell (1998)


Example
Your Email program watches which emails you do
or do not mark as a spam and based on that learns
how to better filter spam.

• Task (T): Classifying the emails as spam or not

• Experience (E): Watching you label emails as spam or not


spam

• Performance (P): The number of emails correctly classified


as spam / not spam
Machine Learning Algorithms
• Supervised Learning
• Un-supervised Learning

• Others: Reinforcement learning, recommender


system

• Where to apply which algorithms


Supervised Learning

Supervised Learning: Regression:


Right answer given Predict continuous valued output
Supervised Learning

Fruit : Apple or not?


Classification
Discrete values
1(Y) output (0 or 1)

Binary classifier

Multi-class classifier

0(N)
Fruit size
Supervised Learning
Fruit : Apple or not?

• Features:

Weight • color
• texture
• cost
• …..
Fruit size

Two features examples


Supervised Learning

x2

x1
Un-supervised Learning
No label

Clustering Algorithm
x2

x1
Application of Clustering Algorithms
• Organizing computer clusters

• Social network analysis

• Market segmentation

• Astronomical image/data analysis

• Speaker recognition and many more…

You might also like