Jina AI’s 8K Embedding Models Hit AWS Marketplace for On-Prem Deployment
Jina AI launches 8K token length embedding v2 models on AWS marketplace, elevating enterprise AI deployments with EU-engineered innovation and a commitment to data sovereignty.
Berlin, Germany - November 20, 2023 - Catering to enterprise customers, Jina AI has released Embeddings v2 on AWS SageMaker, a milestone in accessible, top-tier AI solutions. Enterprise users can now search for jina-embeddings-v2-base/small
on the AWS Marketplace and deploy them directly to their own AWS accounts. As a part of the AWS Startups program, this release underscores the collaboration between Jina AI's innovation and AWS's commitment to supporting groundbreaking startups, marking a significant advancement in AI development.
Superior Models on a Robust Platform
- SageMaker Integration: With global availability on the AWS SageMaker Marketplace, Jina AI underscores its dedication to enterprise users, providing them with an effortless way to build applications using our advanced embedding models.
- Seamless Deployment: Enterprises can now easily deploy Jina Embedding v2 models as SageMaker endpoints, bypassing the complexity associated with custom infrastructure setups.
- Cost-Effective Licensing: The English base and small models are available without licensing fees. Clients incur costs only for their AWS instances, ensuring a privacy-first, cost-effective solution within their VPC.
Tailored Solutions for Varied Use Cases
- Model Diversity: With a 0.27GB base model and a 0.07GB small model, Jina AI provides tailored solutions for various needs, from in-depth analytics to lightweight applications.
- Use Cases: The base model is designed for comprehensive semantic representation, ideal for enterprise search and content discovery, while the small model caters to mobile and edge devices, optimizing for speed and efficiency.
Commenting on this significant milestone, Dr. Han Xiao, CEO of Jina AI, offered the following insights:
Launching Jina AI's 8K Context Length v2 Embedding Models on AWS Marketplace, we advance industry standards for private AI solutions. Developed in Germany, this pivotal release emphasizes data sovereignty and customer-centric innovation, addressing today's needs and shaping future secure, private AI deployments.
Jina AI aims to make continuous strides towards privacy-aware, state-of-the-art AI, as evident from its plans.
Why Jina Embeddings v2: A Leap in AI Capability
- Extended Context Length: Jina Embeddings v2 models support an unprecedented 8K (8192 tokens) context length, allowing for a full understanding of longer documents.
- Open Source Pioneer: Jina AI takes pride in offering the only open-source model with a context length that matches OpenAI’s proprietary models, broadening access to advanced AI.
- Benchmark Leadership: On the Massive Text Embedding Benchmark (MTEB) leaderboard, our models boast performance on par with industry-leading models, attesting to our commitment to excellence.
Performance vs. OpenAI's text-embedding-ada002
Below is a comparative performance snapshot that showcases the robust capabilities of Jina Embeddings v2:
Model | text-embedding-ada-002 | jina-embeddings-v2-base-en |
Rank | 18 | 21 |
Model Size (GB) | Unknown | 0.27 |
Average (56 datasets) | 60.99 | 60.38 |
Embedding Dimensions | 1536 | 768 |
Sequence Length | 8191 | 8192 |
Classification Average (12 datasets) | 70.93 | 73.45 |
Pair Classification Average (3 datasets) | 84.89 | 85.38 |
Summarization Average (1 dataset) | 30.8 | 31.6 |
Retrieval Average (15 datasets) | 49.25 | 47.87 |
Jina AI's base model excels particularly in Classification and Pair Classification tasks, underscoring its value in diverse applications ranging from document analysis to recommendation systems.
Get Started with Jina Embeddings v2 on AWS
To begin using Jina Embeddings v2, visit the AWS Marketplace listings and select the model that best fits your needs.
These sample notebooks can help users get started with Jina Embeddings v2 models:
Coming Soon: Multilingual Embeddings and More
Looking ahead, Jina AI is already deep in developing multilingual embedding models, making them available to its enterprise clients for private deployment on various cloud service providers (CSPs). With the imminent launch of these models, Jina AI is set to bridge language barriers, unlocking global opportunities for its clients.
About Jina AI GmbH
Located at Ohlauer Str. 43 (1st floor), zone A, 10999 Berlin, Germany, Jina AI is at the vanguard of reshaping the landscape of multimodal artificial intelligence. For inquiries, please reach out at contact@jina.ai.