Object Detection¶
This document will explain how to use the "Object Detection" module in the Model Training and Inference Library under Mind+ > Programming > Real-Time Mode to apply an object detection model you have trained yourself and complete an object detection project.
Features¶
Using the object detection module, users can load pre-trained object detection models to perform inference and recognition on local images or live camera feeds. This allows them to obtain results such as the object’s label, confidence score, center point X and Y coordinates, width, and height, which can then be used for subsequent logical decisions, interactive control, or visual display.
With this, users can not only quickly apply pre-trained object detection models to create various object detection projects, but also gain an intuitive understanding of the entire application process—from image input to model inference to result output—enabling them to build AI projects with the capabilities to “see, judge, and interact,” thereby providing foundational support for both educational instruction and practical applications.
Preparations¶
Hardware Preparation¶
- a computer
- A webcam (either the one built into your computer or a USB webcam)
Software Preparation¶
Install Mind+ version 2.0.4 or later. Click here to view the Mind+ installation guide. For instructions on how to check your software version, see the FAQ.
Model Preparation¶
Before creating an object detection project, you must first train and export an object detection model. You can use the Object Detection module in the Mind+ V2.0 model training tool to train the model and export it for subsequent inference. The exported object detection model is a compressed file with the extension **.zip. In subsequent projects, this compressed file will be used directly to load the object detection model and perform inference for object detection tasks.
Please refer to the tutorial below to prepare an object detection model for use in the upcoming project.
- Object Detection Model Training Tutorial: Object Detection—Training the Model
- Tutorial on Exporting Object Detection Models: Object Detection—Model Export
Load the model training and inference library¶
Open Mind+ version 2.0.4 or later, and tap to enter "RealTime Mode."
In RealTime mode, click "Extensions" in the lower-left corner, locate "Model Training and Inference " in the Stage Extensions, and click "Load."
Once loading is complete, return to the real-time programming page and click "Object Detection" under "Model Inference" to find the object detection building blocks, as shown below.
Usage Instructions¶
Project 1: Local Image Object Detection¶
This project demonstrates how to use a pre-trained object detection model to recognize a local image and obtain the corresponding detection results.
In practice, you can replace the example model with a target detection model that you have trained yourself or one you already have, while keeping the rest of the code flow the same.
Sample Program¶
Runtime Results¶
After running the program, a window displaying the model’s inference results will pop up. The detection results will be overlaid on the original image, with boxes drawn around the recognized traffic signs; the corresponding labels and confidence scores will be displayed in the upper-right corner of each box.
Project 2: Real-Time Object Detection with a Camera¶
This project demonstrates how to use a pre-trained object detection model to continuously detect objects in real-time video feed from a camera and obtain detection results—such as the number of objects, object labels, and the coordinates of the object centers—in real time.
The model used in this example is the same as the one in Project 1. You can replace it with an object detection model you’ve trained yourself or one you already have; the rest of the code flow remains the same.
Sample Program¶
Runtime Results¶
After running the program, observe the camera feed displayed on the stage. Once the object detection model has finished loading, press the spacebar to start real-time traffic sign detection. The detection results will be overlaid on the video feed in real time, with boxes drawn around all traffic signs that appear.
Based on the detection data it receives, the Mind+ character will move sequentially to the center of each corresponding marker and say the corresponding label.



























