Embedding Models Overview
Transform text into numerical vectors that capture semantic meaning, enabling powerful search, similarity comparison, and classification capabilities.Available Models
OpenAI Embeddings
- text-embedding-3-large: Highest quality embeddings (3,072 dimensions)
- text-embedding-3-small: Fast and efficient (1,536 dimensions)
- text-embedding-ada-002: Legacy model, still reliable (1,536 dimensions)
Cohere Embeddings
- embed-english-v3.0: Optimized for English text
- embed-multilingual-v3.0: Support for 100+ languages
- embed-english-light-v3.0: Lightweight for speed
Voyage AI
- voyage-large-2: High-performance general embeddings
- voyage-code-2: Specialized for code embeddings
- voyage-law-2: Optimized for legal documents
Google Embeddings
- textembedding-gecko: Versatile multilingual embeddings
- textembedding-gecko-multilingual: Enhanced multilingual support
Model Capabilities
Semantic Search
Find relevant content based on meaning, not just keywords
Similarity Comparison
Measure semantic similarity between texts
Classification
Categorize content based on embedded features
Clustering
Group similar content automatically
Embeddings API
Convert text to numerical vectors:Basic Example
Response Format
Batch Processing
Process multiple texts efficiently:Similarity Search Implementation
Cosine Similarity Calculation
Vector Database Integration
Pinecone Integration
Weaviate Integration
Model Comparison
Model | Dimensions | Languages | Strengths | Price/1K tokens |
---|---|---|---|---|
text-embedding-3-large | 3,072 | English+ | Highest quality | $0.00013 |
text-embedding-3-small | 1,536 | English+ | Speed & efficiency | $0.00002 |
cohere-embed-v3 | 1,024 | 100+ | Multilingual | $0.00010 |
voyage-large-2 | 1,536 | English+ | General purpose | $0.00012 |
Advanced Use Cases
Semantic Search Engine
Content Classification
Duplicate Detection
Best Practices
Choosing the Right Model
- text-embedding-3-large: Best quality, higher cost
- text-embedding-3-small: Good balance of quality and speed
- cohere-embed-v3: For multilingual content
- voyage-code-2: For code and technical content
Optimization Tips
- Batch processing: Send multiple texts in one request
- Caching: Store embeddings to avoid recomputation
- Preprocessing: Clean and normalize text before embedding
- Chunking: Split long documents into smaller segments
Vector Database Selection
- Pinecone: Managed, easy to use, great for production
- Weaviate: Open source, supports multiple data types
- Chroma: Lightweight, good for development
- Qdrant: High performance, supports filtering
Rate Limits
Embedding model limits by plan:Plan | Requests/Min | Tokens/Min | Daily Limit |
---|---|---|---|
Free | 60 | 150,000 | 1M tokens |
Pro | 3,000 | 1,000,000 | 10M tokens |
Enterprise | Custom | Custom | Custom |
Common Use Cases
Search & Retrieval
Document search, FAQ systems, knowledge bases
Recommendation Systems
Content recommendations, similar item suggestions
Content Moderation
Detect inappropriate content, spam filtering
Data Analysis
Clustering, topic modeling, trend analysis