The document covers fundamental concepts of Computer Vision, including definitions of resolution, CNN, image convolution, and pixel values. It discusses the objectives of Computer Vision, its training methods, and various applications such as facial recognition and image classification. Additionally, it highlights the role of Computer Vision in fields like healthcare and autonomous vehicles, emphasizing its significance in interpreting visual data.
Download as DOCX, PDF, TXT or read online on Scribd
Download as docx, pdf, or txt
0 ratings0% found this document useful (0 votes)
3 views4 pages
Std 10 Unit 5 - Computer Vision
The document covers fundamental concepts of Computer Vision, including definitions of resolution, CNN, image convolution, and pixel values. It discusses the objectives of Computer Vision, its training methods, and various applications such as facial recognition and image classification. Additionally, it highlights the role of Computer Vision in fields like healthcare and autonomous vehicles, emphasizing its significance in interpreting visual data.
Download as DOCX, PDF, TXT or read online on Scribd
Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1/ 4
Computer Vision
A. Short answer type questions.
1. What do you mean by resolution? Ans: The number of pixels in an image is called its resolution. It determines the level of detail or clarity of images displayed on screens or captured by cameras. 2. Define CNN. Ans: A Convolutional Neural Network is a deep Learning algorithm that is commonly used in image recognition and processing. CNN analyses an image, extracts the best features and reduces its size to make it manageable, while still preserving its important features. This helps it to differentiate one image from the other. 3. What is meant by image convolution? Ans: An image convolution is simply an element-wise multiplication of the image array with another array called the kernel followed by sum. This results in forming a convolution matrix or filtered image. 4. What are considered good features of an image during image processing? Ans: The features like corners are easy to find as their exact location can be pinpointed in the image. Thus, corners are always good features to extract from an image followed by the edges. 5. What are pixels? Ans: The word "pixel" stands for "picture element". Every digital photograph is made up of tiny elements called pixels. A pixel is the smallest unit of information that makes up a text, image or video on a computer. Even a small image can contain millions of pixels of different colours. Pixels are usually arranged in a 2-dimensional grid and are often in round or square shape. 6. State the objective of Computer Vision. Ans: The objective of computer vision is to replicate both the way humans see and the way humans make sense of what they see. 7. How are Computer Vision algorithms trained to identify image content? Ans: Computer vision models are trained on massive amounts of visual data. Once a large amount of data is fed through the model, the computer will "look" at the data and teach itself to differentiate one image from other using deep learning algorithms. 8. Explain the concept of feature extraction in Computer Vision. Ans: In image processing, the image can have features like a blob, an edge or a corner. These features help us to perform certain tasks and analysis. Feature extraction refers to the process of automatically extracting relevant and meaningful features from raw input images. The features like corners are easy to find as their exact location can be pinpointed in the image, whereas the patches that are spread over a line or an edge look the same all along. B. Long answer type questions. 1. How do computers store RGB images? Ans: Every RGB image is stored on a computer in the form of three different channels – the Red channel, the Green channel and the Blue channel. Each channel contains a number of pixels, with the value of each pixel ranging from 0 to 255. When all channels are combined together, they form a coloured image. This means that in an RGB image, each pixel has a set of three different values which together give colour to that particular pixel. 2. Explain CV and provide any two applications of it. Ans: The Computer Vision domain of artificial intelligence enables machines to interpret visual data, process it and analyse it using algorithms and methods to interpret real-world phenomena. It helps machines derive meaningful information from digital images, videos and other visual inputs and take actions based on that information. Applications of Computer Vision are: Face filters: This is one of the popular applications used in apps like Instagram and Snapchat. A face filter is a filter applied to photographs, or videos in real time, to make the face look more attractive. You can also use it to combine a face with animal features to give it a funny appearance. Facial recognition: With smart homes becoming more popular, computer vision is being used for making homes more secure. Computer Vision facial recognition is used to verify the identity of the visitors and guests and to maintain a log of the visitors. This technology is also used in social networking applications for detecting faces and tagging friends. 3. What is meant by the pixel value of a digital image? Ans: Each pixel in a digital image on a computer has a pixel value which determines its brightness or colour. The most common pixel format is the byte image, where this value is stored as an 8-bit integer having a range of possible values from 0 to 255. Typically, zero is considered as no colour or black and 255 is considered to be full colour or white. 4. Describe the CV tasks for a single object in an image. Ans: The CV tasks for a single object in an image are: Image Classification: This task involves assigning a label to the entire image based on its content. Image Classification plus Localisation: This is the task which involves both processes of identifying what object is present in the image and at the same time identifying at what location that object is present in that image. 5. State the difference between CV and human vision. Ans: Humans see an image with the help of their eyes, and then the brain processes and identifies the image through learning and experience. In computer vision, AI first perceives the image with a sensing device, and then computer vision and other AI algorithms identify and classify the elements in the image to recognise it. 6. The Face Lock feature of a smart phone is an example of Computer Vision. Briefly discuss localization? Give an example to illustrate this difference. Ans: The face-lock feature on smartphones uses computer vision to analyse and identify facial features. When a user activates the face-lock feature, the smartphone's CV system compares the facial features with pre-registered photographs stored on the device. If the facial characteristics match, the device grants access2 to the user. This authentication method offers convenience and security for the user. 7. What is the difference between ‘image classification’ and ‘image classification and localization’. Give an example to illustrate this difference. Ans: Image classification involves identifying the main object category in a photo, while image classification with localisation determines both the object's category and its precise location within the image, often by drawing a bounding box around it. For example, in an image showing a cat, the image classification algorithm will identify and label the image as a cat. Whereas, the image classification with localization algorithm will not only identify the cat, but will also draw a box to indicate the location of the cat in the image. 8. Explain the role and function of the convolution operator is CNNs Ans: The convolution operator is a fundamental mathematical operation used in image processing. It involves multiplying two arrays of numbers, element-wise, to produce a third array. In image processing, convolution is a common tool used for image editing to apply filters or effects, such as blurring, sharpening, outlining or embossing on an image. An image convolution is simply an element-wise multiplication of the image array with another array called the kernel followed by sum. This results in forming a convolution matrix or filtered image. 9. List the different types of pooling that can be performed on an image in a CNN. Ans: There are two types of pooling which can be performed on an image. They are: a. Max Pooling: This returns the maximum value from the portion of the image covered by the Kernel. b. Average Pooling: This returns the average value from the portion of the image covered by the Kernel. 10. Apart from text- based search, search engines also carry out image-based searches. How does Computer Vision help in this task? Ans: Visual search algorithms in search engines use computer vision technology to help you search for different objects using real world images. CV compares different features of the input image to its database of images, analyses the image features and gives us the search result. Computer vision, combined with machine learning allows the device not only to see the image, but also to interpret what is in the picture, helping make decisions based on it. 11. How is Computer Vision used in the following fields? Ans: Role of computer vision in the following fields: a. Healthcare: Medical imaging has greatly benefited from computer vision. It not only creates and analyses images, but also acts as an assistant and helps doctors to better understand a patient’s health condition. CV analyses X-Rays, CT scans and MRI is used to read and convert 2D scanned images into interactive 3D models. This results in an increase in the accuracy and efficiency of diagnosis since the machines can identify details invisible to the human eye. b. Warehouses: CV can be used in warehouses to remove human error during the receiving and storing process of products by automating the scanning and data entry process for inventory management. Robots equipped with CV accurately pick parcels and pack them. CV also automatically checks the order against the contents. 12. “Computer Vision is the basis of autonomous vehicles.” Justify. Ans: Autonomous driving involves identifying objects, getting navigational routes and monitoring the surroundings. Automated cars from companies like Tesla can detect the 360-degree movements of pedestrians, vehicles, road signs and traffic lights and create 3D maps. CV helps them detect and analyse objects in real-time and take decisions like breaking, stopping or keep driving.