Image Captioning with ExpansionNet v2

author: David Wang

Description

This operator generates the caption with ExpansionNet v2 which describes the content of the given image. ExpansionNet v2 introduces the Block Static Expansion which distributes and processes the input over a heterogeneous and arbitrarily big collection of sequences characterized by a different length compared to the input one. This is an adaptation from jchenghu/ExpansionNet_v2.

Code Example

Load an image from path './image.jpg' to generate the caption.

Write the pipeline in simplified style:

import towhee

towhee.glob('./image.jpg') \
      .image_decode() \
      .image_captioning.expansionnet_v2(model_name='expansionnet_rf') \
      .show()

Write a same pipeline with explicit inputs/outputs name specifications:

import towhee

towhee.glob['path']('./image.jpg') \
      .image_decode['path', 'img']() \
      .image_captioning.expansionnet_v2['img', 'text'](model_name='expansionnet_rf') \
      .select['img', 'text']() \
      .show()

Factory Constructor

Create the operator via the following factory method

expansionnet_v2(model_name)

Parameters:

model_name: str

The model name of ExpansionNet v2. Supported model names:

expansionnet_rf

Interface

An image captioning operator takes a towhee image as input and generate the correspoing caption.

Parameters:

data: towhee.types.Image (a sub-class of numpy.ndarray)

The image to generate caption.

Returns: str

The caption generated by model.

wxywb 81f242e57d update the operator. Signed-off-by: wxywb <xy.wang@zilliz.com>			7 Commits
models		update the operator.	3 years ago
utils		update the operator.	3 years ago
weights		upload the weight.	3 years ago
.gitattributes	1.1 KiB	Initial commit	3 years ago
README.md	1.8 KiB	update the image for expansionnet_v2.	3 years ago
__init__.py	715 B	init the operator.	3 years ago
cap.png	10 KiB	update the image for expansionnet_v2.	3 years ago
demo_coco_tokens.pickle	238 KiB	init the operator.	3 years ago
expansionnet_v2.py	6.1 KiB	update the operator.	3 years ago
requirements.txt	44 B	update the requirement.	3 years ago
tabular.png	90 KiB	update the image for expansionnet_v2.	3 years ago