Object Recognition using Python

Object Recognition is a technology that lies under the broader domain of Computer Vision. This technology is capable of identifying objects that exist in images and videos and tracking them. Object Recognition also known as Object Detection, has various applications like face recognition, vehicle recognition, pedestrian counting, self-driving vehicles, security systems, and a lot more.

The two significant objectives of object recognition involve:

Identification of all objects that exist in an image
Filtration of the object that seeks attention

In the following tutorial, we will understand how to perform Object Recognition in the Python programming language using the ImageAI library. We will create a basic object recognition model using the ImageAI library in Python by the end of this tutorial.

So, let’s get begun.

Deep Learning for Object Recognition

Techniques of Deep learning have been shown state of the art for different problems related to Object Recognition. Some of the generally used approaches of deep learning for object recognition are as follows:

ImageAI
Single Shot Detectors
YOLO (You Only Look Once)
Region-based Convolutional Neural Networks

However, in this tutorial, we will understand what ImageAI is and how we can use it in performing Object Recognition.

Understanding the ImageAI library

Python offers a library built to empower programmers and developers for building applications and systems with self-contained deep learning and Computer Vision capabilities with the help of some lines of simple coding script. ImageAI consists of Python implementation of nearly all state-of-the-art deep learning algorithms such as RetinaNet, YOLOv3, and TinyYOLOv3.

ImageAI makes use of several APIs that work offline – it has object detection, video detection, and object tracking APIs that can be called without accessing the Internet. ImageAI uses a pre-trained model and can easily be customized.

The ObjectDetection class of the ImageAI library consists of methods in order to perform object detection on any image or set of images with the help of pre-trained models. With ImageAI, we can detect and recognize eighty distinct types of common, everyday objects.

Setting up the Environment

In this section of the tutorial, we will consider working through the installation of required libraries, including ImageAI.

In order to utilize ImageAI, we have to install some dependencies. The initial step is to have Python installed on the system. We can download and install Python 3 from Python’s official website: https://www.python.org/.

Once we have installed Python on the system successfully, we have to install the following dependencies with the help of the pip installer:

OpenCV
TensorFlow
Keras
ImageAI

The installation command for the same is shown below:

Syntax:

  # installing OpenCV  $ pip install opencv-python    # installing TensorFlow  $ pip install tensorflow    # installing Keras  $ pip install keras    # installing ImageAI  $ pip install imageAI  

Now we have to download the TinyYOLOv3 model file containing the classification method that we will use for object recognition.

The link for the same can be found below:

https://github.com/OlafenwaMoses/ImageAI/releases/download/1.0/yolo-tiny.h5

Performing Object Recognition using ImageAI

In this section, we will discuss how we can utilize the ImageAI library in Python. The procedure of performing Object Recognition is divided into several steps for better understanding and clarity.

Step 1

The initial step is to create the necessary folders. For this tutorial, we will need the folders as shown below:

Object_Recognition: This will be the root folder.
Models: This folder will store the pre-trained model.
Input: This folder will store the image file on which we have to perform object detection.
Output: This folder will store the image file with detected objects.

Once we created the necessary folder, the Object Recognition folder should have the following sub-folders:

Step 2

For the second step, we will open the preferred text editor, which is Visual Studio Code, in this case, to write a Python script and create a new file recognizer.py

Step 3

Now, let us begin importing ObjectDetection class from the ImageAI library. The syntax for the same is shown below:

File: recognizer.py

Step 4

Now that the required ImageAI library is imported and the ObjectDetection class, the next thing is to create an instance of the class ObjectDetection. Let us consider the following snippet of code for the same.

File: recognizer.py

Step 5

Let us specify the path from the model, input image, and output image using the following snippet of code.

File: recognizer.py

  # defining the paths  path_model = “./Models/yolo-tiny.h5”  path_input = “./Input/images.jpg”  path_output = “./Output/newimage.jpg”  

Step 6

Once, we instantiated the ObjectDetection class we can now call different functions from the class. The class consists of the following functions in order to call pre-trained models:

setModelTypeAsRetinaNet()
setModelTypeAsYOLOv3()
setModelTypeAsTinyYOLOv3()

For this tutorial’s purpose, we will utilize the pre-trained TinyYOLOv3 model, and thus, we will be using the setModelTypeAsTinyYOLOv3() function in order to load the model.

Let us consider the following snippet of code for the same:

File: recognizer.py

  # using the setModelTypeAsTinyYOLOv3() function  recognizer.setModelTypeAsTinyYOLOv3()  

Step 7

Now, we will be going to call the function setModelPath(). This function will accept a string that consists of the path to the pre-trained model.

Let us consider the following snippet of code for the same:

File: recognizer.py

  # setting the path to the pre-trained Model  recognizer.setModelPath(path_model)  

Step 8

In this step, we will call the loadModel() function from the recognizer instance. This function will load the model from the path specified above with the help of the setModelPath() class method.

Let us consider the following snippet of code for the same.

File: recognizer.py

Step 9

We have to call the detectObjectsFromImage() function with the help of the recognizer object that we created earlier.

This function accepts two parameters: input_image and output_image_path. The input_image parameter is the path where the image we recognise is situated, whereas the output_image_path parameter is the path storing the image with detected objects. This function will return a diction containing the names and percentage probabilities of every object detected in the image.

The syntax for the same is shown below:

File: recognizer.py

  # calling the detectObjectsFromImage() function  recognition = recognizer.detectObjectsFromImage(      input_image = path_input,      output_image_path = path_output      )  

Step 10

At last, we can access the dictionary elements by iterating through each element present in the dictionary.

The syntax for the same is shown below:

File: recognizer.py

  # iterating through the items found in the image  for eachItem in recognition:      print(eachItem[“name”] , ” : “, eachItem[“percentage_probability”])  

Complete Python script for Object Recognition model

Let us consider the following script for the Object Recognition model.

File: recognizer.py

  # importing the required library  from imageai.Detection import ObjectDetection    # instantiating the class  recognizer = ObjectDetection()    # defining the paths  path_model = “./Models/yolo-tiny.h5”  path_input = “./Input/images.jpg”  path_output = “./Output/newimage.jpg”    # using the setModelTypeAsTinyYOLOv3() function  recognizer.setModelTypeAsTinyYOLOv3()  # setting the path of the Model  recognizer.setModelPath(path_model)  # loading the model  recognizer.loadModel()  # calling the detectObjectsFromImage() function  recognition = recognizer.detectObjectsFromImage(      input_image = path_input,      output_image_path = path_output      )    # iterating through the items found in the image  for eachItem in recognition:      print(eachItem[“name”] , ” : “, eachItem[“percentage_probability”])  

Output:

car  :  88.85036110877991  car  :  85.83406209945679  bus  :  70.04978060722351  car  :  80.88288903236389  car  :  55.334705114364624  person  :  61.084866523742676  car  :  68.46083402633667  person  :  56.165677309036255  person  :  71.58655524253845  car  :  59.49597954750061  person  :  55.276620388031006  person  :  69.08922791481018  person  :  59.92640256881714  car  :  82.73208141326904  person  :  54.69227433204651  person  :  67.25137233734131  car  :  68.9003050327301  person  :  77.32996344566345  person  :  53.02640199661255  person  :  81.33729696273804  person  :  83.60849618911743  person  :  50.34937262535095

Actual Image:

Object Recognition using Python

Image after Object Recognition:

Object Recognition using Python

At last, we can observe that ImageAI has successfully identified cars and persons in the image.

Next TopicPython VLC module

Object Recognition using Python

Object Recognition using Python

Deep Learning for Object Recognition

Understanding the ImageAI library

Setting up the Environment

Performing Object Recognition using ImageAI

Step 1

Step 2

Step 3

Step 4

Step 5

Step 6

Step 7

Step 8

Step 9

Step 10

Complete Python script for Object Recognition model

Asserts in Postman

Python Wand library

You may also like