
Build neural data processing pipelines simply and fast
Towhee is an open-source Python framework for building neural data processing and embedding pipelines that transform unstructured data — images, video, audio, text, and molecular data — into vector embeddings. It provides 700+ pre-trained models across computer vision, NLP, multimodal, audio, and medical domains, along with ready-to-use pipelines for tasks like RAG, image search, and video deduplication. Created by Zilliz, the company behind the Milvus vector database, Towhee serves as a lightweight ETL layer for AI applications.
State-of-the-art models spanning computer vision, NLP, multimodal, audio, and medical domains including BERT, CLIP, ViT, and SwinTransformer
300+ ready-to-use pipelines for image, audio, text, face, and multimodal embeddings
Pythonic API for building, prototyping, and running data transformation pipelines with minimal code
Ready-to-use ETL pipelines for Retrieval-Augmented Generation workflows including prompt management and knowledge retrieval
Handles images, video clips, audio, text, and molecular structures in a unified pipeline
Pipelines composed of operators wired as directed acyclic graphs for complex multi-step processing
Adapts to different large language models and supports hosting open-source models locally
Extract image embeddings and store in a vector DB to enable searching for visually similar images at scale
Build ETL pipelines that chunk, embed, and index documents for LLM-powered Q&A and chatbot applications
Detect duplicate or near-duplicate video clips using multimodal embeddings across large video libraries
Generate audio embeddings from music or speech files for music discovery or audio deduplication
Community hub with reusable, shareable operators across tasks and architectures
Serves models via Triton Inference Server for high-concurrency production deployments
First-class integration with Milvus vector database for storing and querying generated embeddings
Process medical imaging and molecular structure data using domain-specific pre-trained models for research

Unified platform for building, deploying, and scaling generative AI and ML models