voyage-4-large
The Voyage 4 family from Voyage AI is a generation of text embedding models built around a shared embedding space, which Voyage AI describes as an industry first. Every model in the series produces compatible vectors, so developers can embed documents with one model and queries with another to balance accuracy, latency, and cost. The family includes voyage-4-large (the flagship, a mixture-of-experts model that Voyage AI reports delivers state-of-the-art retrieval accuracy at about 40% lower serving cost than comparable dense models), voyage-4 (balancing quality and efficiency), voyage-4-lite (high-quality embeddings with fewer parameters and lower compute cost), and the open-weight voyage-4-nano (released under an Apache 2.0 license and suited to local development and prototyping). Because embeddings from the four variants are interchangeable, the family supports asymmetric retrieval strategies, such as indexing documents with a larger model and embedding queries with a smaller one.
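As a rough illustration of asymmetric retrieval in a shared embedding space, the sketch below ranks documents against a query by cosine similarity. The vectors are made-up placeholders, not real model output: the assumption is that the document vectors came from a larger model at indexing time and the query vector from a smaller model at query time. No Voyage AI API is called, and the helper names `cosine` and `rank_documents` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return document ids sorted by descending similarity to the query."""
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in doc_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [doc_id for doc_id, _ in scored]


# Placeholder vectors (assumption): imagine these were produced offline
# by a larger model in the family.
doc_vecs = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.9, 0.1],
    "doc_c": [0.0, 0.2, 0.9],
}

# Placeholder vector (assumption): imagine this came from a smaller,
# cheaper model at query time. The comparison is only meaningful because
# both models are assumed to share one embedding space.
query_vec = [0.8, 0.2, 0.1]

ranking = rank_documents(query_vec, doc_vecs)
print(ranking)
```

The design point is that the expensive model runs once per document at indexing time, while the cheap model runs on every query.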
Codestral Embed
Codestral Embed is Mistral AI's first embedding model. It is specialized for code and optimized for high-performance code retrieval and semantic understanding. According to Mistral AI, it significantly outperforms leading code embedders such as Voyage Code 3, Cohere Embed v4.0, and OpenAI's large embedding model. Codestral Embed can output embeddings at different dimensions and precisions; Mistral AI reports that even at 256 dimensions with int8 precision it still outperforms those competing models. The embedding dimensions are ordered by relevance, so users can keep only the first n dimensions for a smooth trade-off between quality and cost. The model is aimed at retrieval over real-world code data, with results highlighted on benchmarks such as SWE-Bench, which is based on real-world GitHub issues and their corresponding fixes, and Text2Code (GitHub), which is relevant to supplying context for code completion or editing.
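The trade-off described above, keeping the first n dimensions and storing them at int8 precision, can be sketched on the client side as follows. This is a minimal illustration with a made-up vector. Renormalizing after truncation and the symmetric per-vector scaling scheme are both assumptions, not a description of how Codestral Embed computes its reduced outputs, and the function names are hypothetical.

```python
import math


def truncate_and_normalize(vec, n):
    """Keep the first n dimensions and rescale the result to unit length."""
    head = vec[:n]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head
    return [x / norm for x in head]


def quantize_int8(vec):
    """Map floats to integers in [-127, 127] using one scale per vector.

    Assumed scheme for illustration only; returns (codes, scale) so that
    codes[i] * scale approximates vec[i].
    """
    max_abs = max(abs(x) for x in vec)
    if max_abs == 0.0:
        return [0] * len(vec), 1.0
    scale = max_abs / 127
    return [round(x / scale) for x in vec], scale


# Made-up 8-dimensional vector standing in for a full embedding.
full = [0.6, -0.3, 0.2, 0.1, -0.05, 0.02, 0.01, -0.01]

short = truncate_and_normalize(full, 4)   # first 4 dimensions only
codes, scale = quantize_int8(short)       # one byte per dimension
print(codes, scale)
```

Truncation only preserves quality when the leading dimensions carry the most information, which is the property the relevance ordering is meant to provide.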
voyage-3-large
voyage-3-large is a general-purpose, multilingual embedding model from Voyage AI. Voyage AI reports that it leads across eight evaluated domains, including law, finance, and code, outperforming OpenAI-v3-large and Cohere-v3-English by averages of 9.74% and 20.71%, respectively, in evaluations spanning 100 datasets. Trained with Matryoshka learning and quantization-aware training, it supports embeddings of 2048, 1024, 512, and 256 dimensions, along with several quantization options: 32-bit floating point, signed and unsigned 8-bit integer, and binary precision. Choosing a smaller dimension or lower precision reduces vector database storage costs, with what Voyage AI describes as minimal impact on retrieval quality. voyage-3-large also offers a 32K-token context length, compared with 8K tokens for OpenAI and 512 tokens for Cohere.
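To make the storage savings concrete, the sketch below computes raw bytes per vector for the dimension and precision options listed above. It counts only the vector payload; index structures and metadata in a real vector database add overhead that is not modeled here. The names `BITS_PER_DIM` and `bytes_per_vector` are hypothetical.

```python
# Bits used per dimension for each precision option named in the text.
BITS_PER_DIM = {"float32": 32, "int8": 8, "uint8": 8, "binary": 1}


def bytes_per_vector(dimensions, precision):
    """Raw payload size of one embedding, rounded up to whole bytes."""
    bits = dimensions * BITS_PER_DIM[precision]
    return (bits + 7) // 8


baseline = bytes_per_vector(2048, "float32")  # largest option
smallest = bytes_per_vector(256, "binary")    # smallest option
ratio = baseline // smallest

for dims in (2048, 1024, 512, 256):
    for precision in ("float32", "int8", "binary"):
        print(dims, precision, bytes_per_vector(dims, precision))
```

Under these assumptions a 2048-dimensional float32 vector takes 8192 bytes and a 256-dimensional binary vector takes 32 bytes, a 256-fold difference in raw payload.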
Voyage AI
Voyage AI provides embedding models and rerankers for search and retrieval over unstructured data. Its technology supports Retrieval-Augmented Generation (RAG) by improving how relevant context is retrieved before a response is generated. Voyage AI offers general-purpose, domain-specific, and company-specific models to cover a wide range of use cases. The models are optimized for accuracy and low latency, and they reduce cost in part by using lower-dimensional vectors. Long-context support of up to 32K tokens lets the models process long, complex documents in a single pass. The platform is modular and is designed to integrate with any vector database or large language model. Voyage AI states that industry leaders use its models to deliver reliable, factual AI outputs at scale.