Embetter
Visit Toolembetter provides scikit-learn compatible embeddings for computer vision and text, enabling rapid proof-of-concept building. It integrates seamlessly into scikit-learn pipelines for quick development.
At a glance
Trending
embetter provides scikit-learn compatible embeddings for computer vision and text, enabling rapid proof-of-concept building. It integrates seamlessly into scikit-learn pipelines for quick development.
Trending
About
embetter is an open-source Python library designed to provide useful embeddings for scikit-learn pipelines, making it easy to quickly build proof of concepts for machine learning tasks. It offers scikit-learn compatible embeddings for both computer vision and text data, simplifying the integration of advanced embedding techniques into existing workflows. The library is particularly helpful for bulk labeling efforts and plays well with tools like scikit-partial for handling out-of-core datasets. It includes components for grabbing data from pandas DataFrames, various encoders for images (TimmEncoder, ColorHistogramEncoder) and text (SentenceEncoder, MatryoshkaEncoder), and multi-modal models like ClipEncoder. Additionally, it supports finetuning components and external embedding providers requiring API keys, such as Cohere and OpenAI.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending