timm/README.md

# Image Embedding with Timm

*author: [Jael Gu](https://github.com/jaelgu), Filip*

<br />

## Description

An image embedding operator generates a vector given an image.
This operator extracts features for image with pre-trained models provided by [Timm](https://github.com/rwightman/pytorch-image-models).
Timm is a deep-learning library developed by [Ross Wightman](https://twitter.com/wightmanr),
who maintains SOTA deep-learning models and tools in computer vision.

<br />

## Code Example

Load an image from path './towhee.jpeg'
and use the pre-trained ResNet50 model ('resnet50') to generate an image embedding.

 *Write the pipeline in simplified style:*

```python
import towhee

towhee.glob('./towhee.jpeg') \
      .image_decode() \
      .image_embedding.timm(model_name='resnet50') \
      .show()
```
<img src="./result1.png" height="50px"/>

*Write a same pipeline with explicit inputs/outputs name specifications:*

```python
import towhee

towhee.glob['path']('./towhee.jpeg') \
      .image_decode['path', 'img']() \
      .image_embedding.timm['img', 'vec'](model_name='resnet50') \
      .select['img', 'vec']() \
      .show()
```
<img src="./result2.png" height="150px"/>

<br />

## Factory Constructor

Create the operator via the following factory method:

***image_embedding.timm(model_name='resnet34', num_classes=1000, skip_preprocess=False)***

**Parameters:**

***model_name:*** *str*

The model name in string. The default value is "resnet34".
Refer to [Timm Docs](https://fastai.github.io/timmdocs/#List-Models-with-Pretrained-Weights) to get a full list of supported models.

***num_classes:*** *int*

The number of classes. The default value is 1000.
It is related to model and dataset.

***skip_preprocess:*** *bool*

The flag to control whether to skip image pre-process.
The default value is False.
If set to True, it will skip image preprocessing steps (transforms).
In this case, input image data must be prepared in advance in order to properly fit the model.

<br />

## Interface

An image embedding operator takes a towhee image as input.
It uses the pre-trained model specified by model name to generate an image embedding in ndarray.

**Parameters:**

***data:*** *Union[List[towhee._types.Image], towhee._types.Image]*

The decoded image data in numpy.ndarray. It allows both single input and a list for batch input.


**Returns:** *numpy.ndarray*

If only 1 image input, then output is an image embedding in shape of (feature_dim,).
If a list of images as input, then output is a numpy.ndarray in shape of (batch_num, feature_dim).
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`# Image Embedding with Timm`

Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`author: [Jael Gu](https://github.com/jaelgu), Filip`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`<br />`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`## Description`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`An image embedding operator generates a vector given an image.`
[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`This operator extracts features for image with pre-trained models provided by [Timm](https://github.com/rwightman/pytorch-image-models).`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`Timm is a deep-learning library developed by [Ross Wightman](https://twitter.com/wightmanr),`
[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`who maintains SOTA deep-learning models and tools in computer vision.`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`<br />`
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`## Code Example`

Update test Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 2 years ago			`Load an image from path './towhee.jpeg'`
[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`and use the pre-trained ResNet50 model ('resnet50') to generate an image embedding.`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Modify README 3 years ago			`Write the pipeline in simplified style:`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
			```python
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`import towhee`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update test Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 2 years ago			`towhee.glob('./towhee.jpeg') \`
Modify README 3 years ago			`.image_decode() \`
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`.image_embedding.timm(model_name='resnet50') \`
			`.show()`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			```
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`<img src="./result1.png" height="50px"/>`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`Write a same pipeline with explicit inputs/outputs name specifications:`

			```python
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`import towhee`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update test Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 2 years ago			`towhee.glob['path']('./towhee.jpeg') \`
Modify README 3 years ago			`.image_decode['path', 'img']() \`
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`.image_embedding.timm['img', 'vec'](model_name='resnet50') \`
Modify README 3 years ago			`.select['img', 'vec']() \`
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`.show()`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			```
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`<img src="./result2.png" height="150px"/>`
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`<br />`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`## Factory Constructor`

[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`Create the operator via the following factory method:`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`*image_embedding.timm(model_name='resnet34', num_classes=1000, skip_preprocess=False)*`

			`Parameters:`

Modify README 3 years ago			`*model_name:* str`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`The model name in string. The default value is "resnet34".`
[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`Refer to [Timm Docs](https://fastai.github.io/timmdocs/#List-Models-with-Pretrained-Weights) to get a full list of supported models.`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Modify README 3 years ago			`*num_classes:* int`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`The number of classes. The default value is 1000.`
Update Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`It is related to model and dataset.`

Modify README 3 years ago			`*skip_preprocess:* bool`
Fix typo in README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
[DOC] Refine Readme Signed-off-by: LocoRichard <lichen.wang@zilliz.com> 3 years ago			`The flag to control whether to skip image pre-process.`
Optimize README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`The default value is False.`
			`If set to True, it will skip image preprocessing steps (transforms).`
			`In this case, input image data must be prepared in advance in order to properly fit the model.`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`<br />`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
			`## Interface`

Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`An image embedding operator takes a towhee image as input.`
Update README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago			`It uses the pre-trained model specified by model name to generate an image embedding in ndarray.`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
			`Parameters:`

Allow list as input Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 2 years ago			`*data:* Union[List[towhee._types.Image], towhee._types.Image]`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Allow list as input Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 2 years ago			`The decoded image data in numpy.ndarray. It allows both single input and a list for batch input.`
Refactor Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago

Modify README 3 years ago			`Returns: numpy.ndarray`
Optimize README Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 3 years ago
Allow list as input Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> 2 years ago			`If only 1 image input, then output is an image embedding in shape of (feature_dim,).`
			`If a list of images as input, then output is a numpy.ndarray in shape of (batch_num, feature_dim).`