logo
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Readme
Files and versions

1.5 KiB

Object Detection with Yolo

author: shiyu22


Description

Object Detection is a computer vision technique that locates and identifies people, items, or other objects in an image. Object detection has applications in many areas of computer vision, including image retrieval, image annotation, vehicle counting, object tracking, etc.

This operator uses PyTorch.yolov5 to detect the object.


Code Example

Writing the pipeline in the simplified way

from towhee.dc2 import pipe, ops

p = (
    pipe.input('url')
        .map('url', 'img', ops.image_decode.cv2_rgb())
        .flat_map('img', ('boxes', 'class', 'score'), ops.object_detection.yolo())
        .output('class', 'score')
    )

res = p('test.png')
res.get()


Factory Constructor

Create the operator via the following factory method

object_detection.yolo()


Interface

The operator takes an image as input. It first detects the objects appeared in the image, and gives the bounding box of each object.

Parameters:

img: numpy.ndarray

​ Image data in ndarray format.

Return: List[List[(int, int, int, int)], ...], List[str], List[float]]

The return value is a tuple of (boxes, classes, scores). The boxes is a list of bounding boxes. Each bounding box is represented by the top-left and the bottom right points, i.e. (x1, y1, x2, y2). The classes is a list of prediction labels. The scores is a list of the confidence scores.

1.5 KiB

Object Detection with Yolo

author: shiyu22


Description

Object Detection is a computer vision technique that locates and identifies people, items, or other objects in an image. Object detection has applications in many areas of computer vision, including image retrieval, image annotation, vehicle counting, object tracking, etc.

This operator uses PyTorch.yolov5 to detect the object.


Code Example

Writing the pipeline in the simplified way

from towhee.dc2 import pipe, ops

p = (
    pipe.input('url')
        .map('url', 'img', ops.image_decode.cv2_rgb())
        .flat_map('img', ('boxes', 'class', 'score'), ops.object_detection.yolo())
        .output('class', 'score')
    )

res = p('test.png')
res.get()


Factory Constructor

Create the operator via the following factory method

object_detection.yolo()


Interface

The operator takes an image as input. It first detects the objects appeared in the image, and gives the bounding box of each object.

Parameters:

img: numpy.ndarray

​ Image data in ndarray format.

Return: List[List[(int, int, int, int)], ...], List[str], List[float]]

The return value is a tuple of (boxes, classes, scores). The boxes is a list of bounding boxes. Each bounding box is represented by the top-left and the bottom right points, i.e. (x1, y1, x2, y2). The classes is a list of prediction labels. The scores is a list of the confidence scores.