BERT Text Embedding Operator (PyTorch)
Authors: Kyle He
Overview
This operator transforms text into an embedding using BERT[1], which stands for Bidirectional Encoder Representations from Transformers.
Interface
__call__(self, text: str)
Args:
- text:
  - the text to be embedded
  - supported types: str
Returns:
The operator returns a tuple Tuple[('embs', numpy.ndarray)] containing the following fields:
- embs:
  - embedding of the text
  - data type: numpy.ndarray
  - shape: (768,)
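To make the return contract above concrete, here is an illustrative sketch; the field name, dtype, and shape come from the description above, while the class name `Outputs` is hypothetical and not taken from this repository:

```python
from typing import NamedTuple
import numpy

# Illustrative return contract only: a named tuple whose single 'embs'
# field holds the text embedding. The class name 'Outputs' is assumed;
# the field name, dtype, and shape follow the interface described above.
class Outputs(NamedTuple):
    embs: numpy.ndarray  # embedding of the input text, shape (768,)
```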
Requirements
You can install the required Python packages listed in requirements.txt.
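For example, a standard install from the repository root:

```bash
pip install -r requirements.txt
```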
How it works
The towhee/torch-bert operator is based on Hugging Face Transformers[2].
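For illustration, here is a minimal sketch of how such a BERT text embedding can be computed with the Transformers library. The checkpoint ('bert-base-uncased') and the mean-pooling strategy are assumptions for the sketch; the operator's actual implementation may differ:

```python
# Minimal sketch of a BERT text embedding via Hugging Face Transformers.
# The checkpoint and pooling strategy below are assumptions; this is not
# necessarily the exact implementation used by this operator.
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained('bert-base-uncased')
model.eval()

inputs = tokenizer('Hello, Towhee!', return_tensors='pt')
with torch.no_grad():
    outputs = model(**inputs)

# Mean-pool the token states into one 768-dimensional numpy.ndarray.
embs = outputs.last_hidden_state.mean(dim=1).squeeze(0).numpy()
print(embs.shape)  # (768,)
```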
Reference
[1]. https://arxiv.org/pdf/1810.04805.pdf
[2]. https://huggingface.co/docs/transformers
More Resources
- The guide to text-embedding-ada-002 model | OpenAI: text-embedding-ada-002: OpenAI's legacy text embedding model; average price/performance compared to text-embedding-3-large and text-embedding-3-small.
- Sentence Transformers for Long-Form Text - Zilliz blog: Deep diving into modern transformer-based embeddings for long-form text.
- What is BERT (Bidirectional Encoder Representations from Transformers)? - Zilliz blog: Learn what Bidirectional Encoder Representations from Transformers (BERT) is and how it uses pre-training and fine-tuning to achieve its remarkable performance.
- Training Your Own Text Embedding Model - Zilliz blog: Explore how to train your own text embedding model using the sentence-transformers library and generate training data by leveraging a pre-trained LLM.
- The guide to gte-base-en-v1.5 | Alibaba: gte-base-en-v1.5: specialized for English text; built upon the transformer++ encoder backbone (BERT + RoPE + GLU).
- Training Text Embeddings with Jina AI - Zilliz blog: In a recent talk by Bo Wang, he discussed the creation of Jina text embeddings for modern vector search and RAG systems. He also shared methodologies for training embedding models that effectively encode extensive information.