diff --git a/README.md b/README.md index a832f4a..987c5c8 100644 --- a/README.md +++ b/README.md @@ -64,14 +64,14 @@ Create the operator via the following factory method ## Interface -An image-text embedding operator takes a [towhee image](link/to/towhee/image/api/doc) as input and generate the correspoing caption. +An image captioning operator takes a [towhee image](link/to/towhee/image/api/doc) as input and generate the correspoing caption. **Parameters:** ​ ***img:*** *towhee.types.Image (a sub-class of numpy.ndarray)* -​ The image to generate embedding. +​ The image to generate caption. diff --git a/requirements.txt b/requirements.txt index ca9f2ee..f4f40e9 100644 --- a/requirements.txt +++ b/requirements.txt @@ -3,3 +3,6 @@ torch torchvision timm towhee +ftfy +yacs +numpy