SimCSE
Visit ToolSimCSE is an open-source framework for simple contrastive learning of sentence embeddings, supporting both unlabeled and labeled data. It provides pre-trained models and an easy-to-use sentence embedding tool.
At a glance
Trending
SimCSE is an open-source framework for simple contrastive learning of sentence embeddings, supporting both unlabeled and labeled data. It provides pre-trained models and an easy-to-use sentence embedding tool.
Trending
About
SimCSE is an open-source framework that implements simple contrastive learning for generating high-quality sentence embeddings. It supports both unsupervised learning, using only standard dropout as noise, and supervised learning, incorporating annotated pairs from NLI datasets. The framework offers pre-trained models, including BERT-based and RoBERTa-based versions, and provides an easy-to-use Python package for encoding sentences and computing similarities. Users can also integrate SimCSE models with HuggingFace's Transformers library. The repository includes comprehensive training and evaluation code, allowing researchers and developers to reproduce results or train custom models on their own datasets. It is particularly useful for tasks requiring robust semantic textual similarity and downstream transfer capabilities.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending