# Image Embedding with Timm

*author: [Jael Gu](https://github.com/jaelgu), Filip*

<br />

## Description
An image embedding operator generates a vector representation of a given image.

This operator extracts features from an image with pre-trained models provided by [Timm](https://github.com/rwightman/pytorch-image-models).
Timm is a deep-learning library developed by [Ross Wightman](https://twitter.com/wightmanr),
who maintains SOTA deep-learning models and tools in computer vision.

<br />

## Code Example
Load an image from the path './towhee.jpeg'
and use the pre-trained ResNet50 model ('resnet50') to generate an image embedding.

*Write the pipeline in simplified style:*

```python
import towhee

towhee.glob('./towhee.jpeg') \
      .image_decode() \
      .image_embedding.timm(model_name='resnet50') \
      .show()
```

<img src="./result1.png" height="50px"/>
*Write the same pipeline with explicit input/output name specifications:*

```python
import towhee

towhee.glob['path']('./towhee.jpeg') \
      .image_decode['path', 'img']() \
      .image_embedding.timm['img', 'vec'](model_name='resnet50') \
      .select['img', 'vec']() \
      .show()
```

<img src="./result2.png" height="150px"/>
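To consume the embeddings programmatically instead of rendering a table, the same pipeline can collect its results into a list. A minimal sketch, assuming the `to_list()` method and attribute-style field access of the towhee 0.x DataCollection API:

```python
import towhee

# Same pipeline as above, but collect the results instead of displaying them.
dc = (
    towhee.glob['path']('./towhee.jpeg')
          .image_decode['path', 'img']()
          .image_embedding.timm['img', 'vec'](model_name='resnet50')
)
for entity in dc.to_list():
    print(entity.vec.shape)  # numpy.ndarray, e.g. (2048,) for resnet50
```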
<br />

## Factory Constructor

Create the operator via the following factory method:

***image_embedding.timm(model_name='resnet34', num_classes=1000, skip_preprocess=False)***
**Parameters:**

***model_name:*** *str*

The model name in string. The default value is "resnet34".
Refer to [Timm Docs](https://fastai.github.io/timmdocs/#List-Models-with-Pretrained-Weights) for a full list of supported models.

***num_classes:*** *int*

The number of classes. The default value is 1000.
It depends on the model and the dataset it was trained on.

***skip_preprocess:*** *bool*

The flag to control whether to skip image pre-processing. The default value is False.
If set to True, the operator skips the image preprocessing steps (transforms).
In this case, input image data must be prepared in advance in order to properly fit the model.
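The defaults can be overridden when the operator is constructed. A sketch based on the pipeline example above; 'vit_base_patch16_224' is simply one of the Timm model names, used here for illustration:

```python
import towhee

# Use a ViT backbone instead of the default resnet34,
# keeping the built-in preprocessing transforms enabled.
towhee.glob('./towhee.jpeg') \
      .image_decode() \
      .image_embedding.timm(model_name='vit_base_patch16_224',
                            num_classes=1000,
                            skip_preprocess=False) \
      .show()
```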
<br />

## Interface

An image embedding operator takes a towhee image as input.
It uses the pre-trained model specified by the model name to generate an image embedding as an ndarray.
**Parameters:**

***data:*** *towhee.types.Image*

The decoded image data as a towhee Image (a subclass of numpy.ndarray).

**Returns:** *numpy.ndarray*

The image embedding generated by the model, with shape (feature_dim,).
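The operator can also be invoked on its own, outside a pipeline. A minimal sketch, assuming the standalone-operator API of towhee 0.x, where `ops.image_decode()` yields the towhee Image expected as input:

```python
from towhee import ops

# Instantiate the decode and embedding operators from the Towhee hub.
decode = ops.image_decode()
embed = ops.image_embedding.timm(model_name='resnet50')

img = decode('./towhee.jpeg')  # towhee.types.Image
vec = embed(img)               # numpy.ndarray with shape (feature_dim,)
print(vec.shape)               # e.g. (2048,) for resnet50
```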
<br />

## Towhee Serve

The models supported by towhee.serve are listed below.

**Model List**

models | models | models
--------- | ---------- | ------------
adv_inception_v3 | bat_resnext26ts | beit_base_patch16_224
beit_base_patch16_224_in22k | beit_base_patch16_384 | beit_large_patch16_224
beit_large_patch16_224_in22k | beit_large_patch16_384 | beit_large_patch16_512
botnet26t_256 | cait_m36_384 | cait_m48_448
cait_s24_224 | cait_s24_384 | cait_s36_384
cait_xs24_384 | cait_xxs24_224 | cait_xxs24_384
cait_xxs36_224 | cait_xxs36_384 | coat_lite_mini
coat_lite_small | coat_lite_tiny | convit_base
convit_small | convit_tiny | convmixer_768_32
convmixer_1024_20_ks9_p14 | convmixer_1536_20 | convnext_base
convnext_base_384_in22ft1k | convnext_base_in22ft1k | convnext_base_in22k
convnext_large | convnext_large_384_in22ft1k | convnext_large_in22ft1k
convnext_large_in22k | convnext_small | convnext_small_384_in22ft1k
convnext_small_in22ft1k | convnext_small_in22k | convnext_tiny
convnext_tiny_384_in22ft1k | convnext_tiny_hnf | convnext_tiny_in22ft1k
convnext_tiny_in22k | convnext_xlarge_384_in22ft1k | convnext_xlarge_in22ft1k
convnext_xlarge_in22k | cs3darknet_focus_l | cs3darknet_focus_m
cs3darknet_l | cs3darknet_m | cspdarknet53
cspresnet50 | cspresnext50 | darknet53
deit3_base_patch16_224 | deit3_base_patch16_224_in21ft1k | deit3_base_patch16_384
deit3_base_patch16_384_in21ft1k | deit3_huge_patch14_224 | deit3_huge_patch14_224_in21ft1k
deit3_large_patch16_224 | deit3_large_patch16_224_in21ft1k | deit3_large_patch16_384
deit3_large_patch16_384_in21ft1k | deit3_small_patch16_224 | deit3_small_patch16_224_in21ft1k
deit3_small_patch16_384 | deit3_small_patch16_384_in21ft1k | deit_base_distilled_patch16_224
deit_base_distilled_patch16_384 | deit_base_patch16_224 | deit_base_patch16_384
deit_small_distilled_patch16_224 | deit_small_patch16_224 | deit_tiny_distilled_patch16_224
deit_tiny_patch16_224 | densenet121 | densenet161
densenet169 | densenet201 | densenetblur121d
dla34 | dla46_c | dla46x_c