Computer Vision
Computer Visions (CV) enables machines to detect patterns and gain a high-level understanding of images and videos
Neural Networks are made up of layers, Input layer, Hidden layer(s), Output layer
Input Layer receives the data
Hidden Layer finds important features in the input data that can be used for prediction
Output Layer generates the result
Deep Learning enables machines to learn the patterns present in the data through feature extraction.
Input Layer: The first layer in a neural network. This layer receives all data that passes through the neural network.
Hidden Layer: A layer that occurs between the output and input layers. Hidden layers are tailored to a specific task.
Output Layer: The last layer in a neural network. This layer is where the predictions are generated based on the information captured in the hidden layers.
Three main components of neural networks
Input Layer: This layer receives data during training and when inference is performed after the model has been trained.
Hidden Layer: This layer finds important features in the input data that have predictive power based on the labels provided during training.
Output Layer: This layer generates the output or prediction of your model.
Modern computer vision
Modern-day applications of computer vision use neural networks call convolutional neural networks or CNNs.
In these neural networks, the hidden layers are used to extract different information about images. We call this process feature extraction.
Computer Vision Applications
Use Cases
Process Automation
Content Moderation
Safety Monitoring
Visual Search
Medical Imaging
Computer Vision Tasks
Image Classification
Image classification can be used to answer questions like What's in this image?
Sorting
Text- Detection(OCR)
Content Filtering (Moderation)
Object Detection
Object detection is closely related to image classification, but it allows users to gather more granular detail about an image. For example, rather than just knowing whether an object is present in an image, a user might want to know if there are multiple instances of the same object present in an image, or if objects from different classes appear in the same image.
Self-Driving Cars
Semantic Segmentation
Segmentation tells us the pixel level detail of where an object is in an image
Satellite Imagery via Image Segmentation
Cellular Segmentation
Activity Recognition
Activity recognition is an application of computer vision that is based around videos rather than just images. Video has the added dimension of time and, therefore, models are able to detect changes that occur over time.
Explore computer vision
Analyze images with the Computer Vision service
Classify images with the Custom Vision service
You can use a machine learning classification technique to predict which category, or class, something belongs to. Classification machine learning models use a set of inputs, which we call features, to calculate a probability score for each possible class and predict a label that indicates the most likely class that an object belongs to.
For example, The features of a flower might include the measurements of its petals, stem, sepals, and other quantifiable characteristics. A machine learning model could be trained by applying an algorithm to these measurements that calculates the most likely species of the flower - its class.
Detect objects in images with the Custom Vision service
Object detection is a form of machine learning based computer vision in which a model is trained to recognize individual types of objects in an image, and to identify their location in the image.
Last updated