Coding with the AI Vision Sensor in VEXcode V5 Blocks

Make sure you have Color Signatures and Color Codes configured with your AI Vision Sensor so they can be used with your blocks. To learn more about how to configure them, you can read the articles below:

The AI Vision Sensor can also detect AI Classifications and AprilTags. To learn how to enable these detection modes, go here:

To learn more about these individual Blocks and how to use them in VEXcode, go to the API site.


Take Snapshot


The Take Snapshot block takes a picture of what the AI Vision Sensor is currently seeing and pulls data from that snapshot that can then be used in a project. When a snapshot is taken, you need to specify what type of object the AI Vision Sensor should collect data on:

  • Color Signature
  • Color Code
  • AI Classifications
  • AprilTags

Taking a snapshot will create an array of all of the detected objects that you specified. For instance, if you wanted to detect a "Red" Color Signature, and the AI Vision Sensor detected 3 different red objects, data from all three would be put in the array.

For more information on how to select a specific object from the array, go to the "Set Object Item" section in this article.

A light blue coding block with the command to take a snapshot of an object or color. There are two dropdown options: one labeled AIVision2 and the other labeled Blue. The block is designed for use in a block-based coding environment, where it captures a snapshot from an AI Vision sensor and tracks an object or color defined as Blue. The block has slight curves, typical of coding interfaces that use modular blocks.

In this example, it will only detect objects that match its configured “Blue” Color Signature and nothing else.
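If you're working in VEXcode V5 Python instead of Blocks, a minimal sketch of taking a snapshot might look like the following. The names ai_vision_1 and ai_vision_1__Blue are assumptions standing in for whatever your device configuration generates; check the API site for the exact names in your project.

```python
# Minimal sketch, assuming ai_vision_1 and its "Blue" Color Signature
# (ai_vision_1__Blue) come from the VEXcode device configuration.
from vex import *

# Take a snapshot; only objects matching "Blue" are returned,
# ordered from largest to smallest.
objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)

if objects:
    # Python lists are 0-indexed, so objects[0] is the Blocks "object index 1".
    largest = objects[0]
    print("Detected", len(objects), "blue object(s)")
```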

Data Taken From a Snapshot

Keep in mind that the AI Vision Sensor will use its last taken snapshot for any Blocks that come after. To make sure you're always getting the most up-to-date information from your AI Vision Sensor, retake your snapshot every time you want to pull data from it. 

Width and Height

These are the width and height of the detected object, in pixels.

The image shows a blue Buckyball with a white square outline tracking it. The top left corner has a label indicating it is a blue object, with coordinates X:176, Y:117, and dimensions W:80, H:78. Red arrows highlight the width and height of the object.

The width and height measurements help identify different objects. For example, a Buckyball will have a larger height than a Ring.

Two blue cubic objects being tracked by a visual recognition system. The upper cube has a white outline with a label indicating its position as X:215, Y:70 and dimensions W:73, H:84. The lower cube has a similar white outline with the label displaying X:188, Y:184 and dimensions W:144, H:113. Each cube has a centered white cross, likely indicating the focal point for tracking. The labels highlight the measurements and tracking data for each object.

Width and height also indicate an object's distance from the AI Vision Sensor. Smaller measurements usually mean the object is farther away, while larger measurements suggest it's closer.

The program starts with the block when started, followed by a forever loop. Inside the loop, the program takes a snapshot using the AI Vision sensor (AIVision1) to detect a blue object. If the object exists, the program checks if the width of the object is less than 250 pixels. If true, the robot drives forward; otherwise, it stops driving. The blocks are stacked together, indicating the flow of the program in a modular coding environment.

In this example, the width of the object is used for navigation. The robot will approach the object until the width has reached a specific size before stopping.
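A Python sketch of that same navigation pattern is shown below. It assumes a configured drivetrain along with the hypothetical ai_vision_1 device names used earlier; the 250-pixel threshold is just the value from the Blocks example.

```python
# Sketch: approach a blue object until it appears at least 250 pixels wide.
# Assumes ai_vision_1, ai_vision_1__Blue, and drivetrain come from the
# VEXcode device configuration.
from vex import *

while True:
    objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)
    if objects and objects[0].width < 250:
        # The object still looks small, so it is far away - keep driving.
        drivetrain.drive(FORWARD)
    else:
        # Close enough, or nothing detected - stop.
        drivetrain.stop()
    wait(20, MSEC)  # brief pause before the next snapshot
```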

CenterX and CenterY

These are the center coordinates of the detected object, in pixels.

A blue Buckyball being tracked by a computer vision system. The object is outlined with a white square, and inside the outline is a smaller red square surrounding a centered white cross. In the top-left corner of the image, a label indicates the object is blue, with coordinates X:176, Y:117, and dimensions W:80, H:78.

CenterX and CenterY coordinates help with navigation and positioning. The AI Vision Sensor has a resolution of 320 x 240 pixels.

Two blue cubic objects tracked by a vision system. The upper object is labeled with coordinates X:215, Y:70, and dimensions W:73, H:84, with a white outline and a centered white cross. The lower object is labeled with coordinates X:188, Y:184, and dimensions W:144, H:113, also outlined in white with a centered white cross.

You can see that an object closer to the AI Vision Sensor will have a higher CenterY coordinate than an object that is farther away, since Y coordinates increase toward the bottom of the image.

A block-based coding sequence starting with when started followed by a forever loop. Inside the loop, the program takes a snapshot using AIVision1 to detect a blue object. If an object exists, the program turns until the object is centered in the AI Vision sensor's view. The object is considered centered if its centerX value is between 150 and 170. If the object is not centered, the robot turns right; otherwise, it stops driving. The blocks indicate the flow and logic of the visual program.

In this example, because the center of the AI Vision Sensor's view is (160, 120), the robot will turn right until a detected object's centerX coordinate is greater than 150 pixels, but less than 170 pixels.
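A hedged Python sketch of the same centering logic follows; the property name centerX mirrors the Blocks dropdown (confirm it against the API site), and the 150 to 170 window comes straight from the example above.

```python
# Sketch: turn right until the largest blue object is roughly centered.
# The AI Vision Sensor's view is 320x240, so the horizontal center is x = 160.
from vex import *

while True:
    objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)
    if objects:
        if 150 < objects[0].centerX < 170:
            drivetrain.stop()        # within ~10 pixels of center
        else:
            drivetrain.turn(RIGHT)   # keep rotating toward the object
    wait(20, MSEC)
```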

Angle


Angle is a property available only for Color Codes and AprilTags. It reports how the detected Color Code or AprilTag is rotated in the AI Vision Sensor's view.

A stack of two cubes, one green on top and one blue on the bottom, being tracked by a vision system. A white outline surrounds both cubes, with a white cross centered on the green cube. The label at the bottom of the image displays Green_Blue A:87°, indicating the detected colors and an angle measurement. Below that, the coordinates are listed as X:117, Y:186, with dimensions W:137, H:172, representing the position and size of the stacked cubes in the frame.

You can see whether the robot is oriented differently in relation to the Color Code or AprilTag and make navigation decisions accordingly.

Two cubes, one green and one blue, placed side by side and tracked by a vision system. A white outline surrounds both cubes with a white cross at the center. The top-left label indicates Green_Blue A:0°, referencing the detected colors and an angle measurement. Below that, the coordinates are shown as X:150, Y:102, with dimensions W:179, H:109, representing the position and size of the cubes within the frame.

For instance, if a Color Code isn't detected at a proper angle, the robot may not be able to pick up the object it represents properly.
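For example, a Python sketch like the one below could gate a pickup on the reported angle. The Color Code name ai_vision_1__Green_Blue is hypothetical, and the 10-degree tolerance is an arbitrary choice.

```python
# Sketch: only attempt a pickup when the Color Code is close to upright.
from vex import *

# ai_vision_1__Green_Blue is a hypothetical configured Color Code.
objects = ai_vision_1.take_snapshot(ai_vision_1__Green_Blue)
if objects:
    angle = objects[0].angle
    if angle < 10 or angle > 350:   # within ~10 degrees of upright
        print("Color Code is upright - safe to pick up")
    else:
        print("Rotated", angle, "degrees - reposition first")
```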

OriginX and OriginY

OriginX and OriginY are the coordinates of the top-left corner of the detected object, in pixels.

A blue Buckyball being tracked by a vision system. A white outline surrounds the object, with a centered white cross inside the outline. The top-left label indicates the object's color as blue, along with coordinates X:176, Y:117, and dimensions W:80, H:78. A small red square highlights the object's top-left corner.

OriginX and OriginY coordinates help with navigation and positioning. By combining this coordinate with the object's Width and Height, you can determine the size of the object's bounding box. This can help with tracking moving objects or navigating between objects.

A block-based coding sequence beginning with when started followed by a forever loop. Inside the loop, the program takes a snapshot using AIVision1 to detect a blue object. If the object exists, the program will draw a rectangle on the Brain screen based on the object's position and size. The rectangle is drawn using the object's originX, originY, width, and height values, which are provided by the AIVision1 sensor. This program visually tracks and highlights the detected object on the Brain screen.

In this example, a rectangle will be drawn on the Brain using the exact coordinates of its origin, width, and height.
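A Python sketch of that drawing pattern, under the same naming assumptions, might look like this. Note that the Brain screen is 480 x 240 pixels while the sensor's view is 320 x 240, so the drawn box is a close, but not exact, overlay.

```python
# Sketch: outline the detected object on the Brain screen each frame.
# Assumes brain, ai_vision_1, and ai_vision_1__Blue come from the
# VEXcode device configuration.
from vex import *

while True:
    brain.screen.clear_screen()
    objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)
    if objects:
        obj = objects[0]
        # The bounding box is the top-left corner plus width and height.
        brain.screen.draw_rectangle(obj.originX, obj.originY, obj.width, obj.height)
    wait(50, MSEC)
```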

tagID

The tagID is only available for AprilTags. This is the ID number for the specified AprilTag.

Three square cards with AprilTags being tracked by a vision system. Each card is labeled with an ID number and associated tracking data. The card on the left is labeled ID:0, showing coordinates A:350°, X:110, Y:96, W:41, H:41. The middle card, labeled ID:3, has coordinates A:1°, X:187, Y:180, W:57, H:57. The card on the right is labeled ID:9, with coordinates A:3°, X:237, Y:89, W:38, H:38. Each card has a white outline, and the system is tracking their positions and orientations.

Identifying specific AprilTags allows for selective navigation. You can program your robot to move towards certain tags while ignoring others, effectively using them as signposts for automated navigation.
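In Python, a sketch of that "signpost" idea could scan every detected tag for a particular ID. The AprilTag snapshot selector and the .id property name used here are assumptions; confirm the exact names against the API site.

```python
# Sketch: look through every detected AprilTag for the one with ID 3.
from vex import *

objects = ai_vision_1.take_snapshot(AiVision.ALL_TAGS)  # assumed selector name
for obj in objects:
    if obj.id == 3:                  # the "signpost" tag we care about
        print("Tag 3 seen at x =", obj.centerX)
```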

Score

The score property is used when detecting AI Classifications with the AI Vision Sensor.

The image shows four objects being tracked by a vision system: two balls and two rings. The red ball is labeled with coordinates X:122, Y:84, W:67, H:66, and a score of 99%. The blue ball has X:228, Y:86, W:70, H:68, with a score of 99%. The green ring has coordinates X:109, Y:186, W:98, H:92, and a score of 99%. The red ring is labeled X:259, Y:187, W:89, H:91, with a score of 99%. Each object is outlined in white, indicating tracking accuracy.

The confidence score indicates how certain the AI Vision Sensor is about its detection. In this image, it's 99% confident in identifying these four objects' AI Classifications. You can use this score to ensure your robot only focuses on highly confident detections.
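A hedged Python sketch of score filtering is below; the ALL_AIOBJS selector name is an assumption, and 95 is an arbitrary cutoff.

```python
# Sketch: act only on high-confidence AI Classification detections.
from vex import *

objects = ai_vision_1.take_snapshot(AiVision.ALL_AIOBJS)  # assumed selector name
for obj in objects:
    if obj.score >= 95:              # ignore low-confidence detections
        print("Confident detection, score =", obj.score)
```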


Set Object Item

When an object is detected by the AI Vision Sensor, it's put into an array. By default, the AI Vision Sensor will pull data from the first object in the array, or the object with the index of 1. If your AI Vision Sensor has only detected one object, then that object will be selected by default.

When your AI Vision Sensor has detected multiple objects at once, however, you'll need to use the Set Object Item block to specify which object you want to pull data from.

A light blue coding block. It contains a command to set the object item for AIVision1 to 1. This block is part of a block-based coding environment, typically used to define which object or item the AI Vision sensor should focus on or track. The shape of the block has slight curves, fitting into the modular nature of the visual coding platform.

When multiple objects are detected by the AI Vision Sensor, they are arranged in the array from largest to smallest. That means the largest detected object will always be at object index 1, and the smallest object will always be at the highest index.

The AI Vision Utility interface with two blue cubes detected on the left side, each marked with their X and Y coordinates and dimensions. The system is connected, and AprilTags are toggled on, while AI Classification is off. On the right, the Blue color settings are displayed with adjustable hue and saturation ranges, set at 22 and 0.34, respectively. There is an option to add or set color and freeze video. The firmware is up to date, running version 1.0.0.b16, and a close button is available at the bottom.

In this example, two objects have been detected with the Color Signature "Blue". They both will be put in the array when the Take Snapshot block is used.

The AI Vision Utility interface, tracking two blue cubes labeled with their X, Y, and dimension data. The left cube has coordinates X:127, Y:179, and dimensions W:136, H:123, while the right cube has coordinates X:233, Y:74, and dimensions W:78, H:87. The system is connected, AprilTags is turned on, and AI Classification is off. The blue color settings have a hue range of 22 and a saturation of 0.34. A Freeze Video button and firmware information (version 1.0.0.b16) are displayed at the bottom.

Here, the object in the front would become object index 1, since it is the largest object, and the smallest object would become object index 2.
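In a Python project there is no Set Object Item block; the snapshot is simply a list, so you pick an object by index (0-based, unlike the 1-based Blocks index). A sketch under the same naming assumptions:

```python
# Sketch: the snapshot list is ordered from largest to smallest.
from vex import *

objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)
if len(objects) >= 2:
    front = objects[0]   # Blocks "object index 1": the largest, closest cube
    back = objects[1]    # Blocks "object index 2": the smaller cube
    print("front width:", front.width, " back width:", back.width)
```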


Object Exists

Before pulling any data from a snapshot, it's important to first check whether the AI Vision Sensor detected any objects in that snapshot. This is where the Object Exists block comes into play.

A light blue hexagonal coding block with the text AIVision1 object exists? This block is part of a block-based coding environment, typically used to check if an object is detected by the AI Vision sensor labeled as AIVision1. The block is designed to fit within a modular coding structure, with the slight curves and shape characteristic of such environments.

This block reports True or False depending on whether any objects were detected in the last taken snapshot.

Always use this block to ensure you're not trying to pull data from a potentially empty snapshot.

A block-based coding sequence that starts with when started followed by a forever loop. Inside the loop, the AI Vision sensor (AIVision2) takes a snapshot to detect the color Blue. If an object with the Blue visual signature is detected, the robot will drive forward. If no object is detected, the robot will stop driving. The blocks are stacked to represent the conditional logic of the program, where the presence of a detected object controls the robot's movement.

For instance, here the robot will be constantly taking snapshots with the AI Vision Sensor. If it identifies any object with the “Blue” Color Signature, it will drive forward.


If any snapshot does not have the “Blue” Color Signature, the robot will stop moving.
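The Python equivalent of this guard is a simple truthiness check on the snapshot list, as in this sketch (same assumed device names as above):

```python
# Sketch: only drive while the last snapshot actually contains a blue object.
from vex import *

while True:
    objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)
    if objects:                 # the "object exists?" check
        drivetrain.drive(FORWARD)
    else:
        drivetrain.stop()
    wait(20, MSEC)
```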


Object Count

A light blue, rounded coding block labeled AIVision1 object count. This block is used in a block-based coding environment to retrieve the number of objects detected by the AI Vision sensor labeled as AIVision1. The block fits within a modular structure, commonly used in visual programming interfaces for robotics or vision systems.

Using the Object Count block allows you to see how many objects of a specific Color Signature the AI Vision Sensor detected in its last snapshot.

The AI Vision Utility interface with two blue cubes detected on the left. The system is connected, with AprilTags enabled and AI Classification turned off. The interface displays settings for detecting the color blue, with hue set to 22 and saturation to 0.34. Buttons for freezing the video and adding or setting the color are present. Firmware is indicated as up to date, running version 1.0.0.b16. There is also a button to disconnect the connection or close the utility.

Here, we see the AI Vision Sensor has the configured Color Signature “Blue”, and is detecting two objects.

A block-based coding sequence starting with when started, followed by a forever loop. Inside the loop, the program takes a snapshot using AIVision2 to detect the blue visual signature. It clears and resets the console before checking if any blue objects are detected. If a blue object exists, the object count is printed to the console. The program then waits for two seconds before repeating the process. The blocks visually represent a continuous check for blue objects, displaying the results in the console.

In this code, the AI Vision Sensor would take a snapshot and print “2” on the VEXcode console, since it only detects two “Blue” Color Signatures.
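A Python sketch of the same count-and-print loop, with the usual naming assumptions, could look like this; len() of the snapshot list plays the role of the Object Count block.

```python
# Sketch: print how many blue objects each snapshot contains.
from vex import *

while True:
    objects = ai_vision_1.take_snapshot(ai_vision_1__Blue)
    if objects:
        print(len(objects))     # would print 2 for the scene shown above
    wait(2, SECONDS)
```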


Object

A dropdown menu for selecting object properties related to AIVision1. The selected option is width, and the menu lists other options including height, centerX, centerY, angle, originX, originY, tagID, and score. These options allow the user to retrieve specific data points from the object detected by the AI Vision sensor, providing flexibility for tracking or measuring various attributes of the object.

The Object block reports a property of your specified object. This lets you use any of the available data pulled from the most recently taken snapshot.

Object properties that can be pulled from taken snapshots are:

  • width
  • height
  • centerX
  • centerY
  • angle
  • originX
  • originY
  • tagID
  • score

Read the "Data Taken from Snapshot" section of this article for more information on these properties.


Detected AprilTag is

A light blue hexagonal block from a coding interface. It contains the command to check whether the detected AprilTag by AIVision1 matches the value 1. This block is used in a block-based coding environment and is designed to evaluate whether a specific AprilTag is present. The shape and structure allow it to fit within other logic blocks, typically used in visual programming for robotics or AI vision tasks.

The Detected AprilTag is block is only available when the AprilTag Detection Mode is turned on.

This block will report True or False depending on whether the specified object is a certain AprilTag.

Three AprilTags being tracked by a vision system. Each tag has an ID and associated coordinates. The left tag is labeled ID:0, with coordinates X:110, Y:96, W:41, H:41. The center tag is labeled ID:3, with coordinates X:187, Y:180, W:57, H:57. The right tag is labeled ID:9, with coordinates X:237, Y:89, W:38, H:38.

When multiple AprilTags are detected in a single snapshot, they are arranged in the array based on their identified ID, not by size.

In this image, three AprilTags are detected with IDs 0, 3, and 9. They will be organized in ascending order of their ID in the array. The object at index 1 would correspond to the AprilTag with ID 0, at index 2 to the AprilTag with ID 3, and at index 3 to the AprilTag with ID 9.
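Because tags are sorted by ID, checking for a specific tag in Python can be as simple as the sketch below (the selector and property names are assumptions, as before).

```python
# Sketch: is the AprilTag with ID 0 anywhere in the current snapshot?
from vex import *

objects = ai_vision_1.take_snapshot(AiVision.ALL_TAGS)  # assumed selector name
if any(obj.id == 0 for obj in objects):
    print("AprilTag 0 is in view")
```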

For more information on what AprilTags are and how to enable their detection with the AI Vision Sensor, read this article.


AI Classification is

A light blue hexagonal block from a block-based coding environment. The block contains the command AIVision1 AI classification is BlueBall? This block checks whether the AI Vision sensor (AIVision1) has classified the detected object as a BlueBall. The block features dropdown options, allowing users to select different objects or AI classifications for tracking and decision-making purposes in a visual programming interface.

The AI Classification is block is only available when the AI Classification Detection Mode is turned on.

This block will report True or False depending on whether the specified object is a certain AI Classification.
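A very rough Python sketch of this check is below. How a classification is identified in text projects (for example, by a numeric class ID) depends on the model and API version, so treat every name here as an assumption.

```python
# Sketch: react only when a detection is classified as the class we want.
from vex import *

BLUE_BALL_ID = 0  # assumed class ID for "BlueBall" - depends on your model

objects = ai_vision_1.take_snapshot(AiVision.ALL_AIOBJS)  # assumed selector
for obj in objects:
    if obj.id == BLUE_BALL_ID:
        print("Blue Ball detected, score =", obj.score)
```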

The AI Classifications that can be detected by the AI Vision Sensor vary depending on which model you are using. For more information on what AI Classifications are available and how to enable their detection with the AI Vision Sensor, read this article.

For more information, help, and tips, check out the many resources at VEX Professional Development Plus.
