Jina AI’s 8K Embedding Models Hit AWS Marketplace for On-Prem Deployment

Jina AI launches 8K token length embedding v2 models on AWS marketplace, elevating enterprise AI deployments with EU-engineered innovation and a commitment to data sovereignty.

Black abstract background with "EMBEDDINGS 2" and "lawsmarketplace" text, hinting at a sleek, modern promotional design

Berlin, Germany - November 20, 2023 - Catering to enterprise customers, Jina AI has released Embeddings v2 on AWS SageMaker, a milestone in accessible, top-tier AI solutions. Enterprise users can now search for jina-embeddings-v2-base/small on the AWS Marketplace and deploy them directly to their own AWS accounts. As a part of the AWS Startups program, this release underscores the collaboration between Jina AI's innovation and AWS's commitment to supporting groundbreaking startups, marking a significant advancement in AI development.

AWS Marketplace page for Jina AI with search options, product listings, and descriptions of two Jina Embeddings models
Enterprise users can now search jina-embeddings-v2-base/small on the AWS marketplace and deploy them directly on their own AWS account.

Superior Models on a Robust Platform

  • SageMaker Integration: With global availability on the AWS SageMaker Marketplace, Jina AI underscores its dedication to enterprise users, providing them with an effortless way to build applications using our advanced embedding models.
  • Seamless Deployment: Enterprises can now easily deploy Jina Embedding v2 models as SageMaker endpoints, bypassing the complexity associated with custom infrastructure setups.
  • Cost-Effective Licensing: The English base and small models are available without licensing fees. Clients incur costs only for their AWS instances, ensuring a privacy-first, cost-effective solution within their VPC.

Tailored Solutions for Varied Use Cases

  • Model Diversity: With a 0.27GB base model and a 0.07GB small model, Jina AI provides tailored solutions for various needs, from in-depth analytics to lightweight applications.
  • Use Cases: The base model is designed for comprehensive semantic representation, ideal for enterprise search and content discovery, while the small model caters to mobile and edge devices, optimizing for speed and efficiency.

Commenting on this significant milestone, Dr. Han Xiao, CEO of Jina AI, offered the following insights:

Launching Jina AI's 8K Context Length v2 Embedding Models on AWS Marketplace, we advance industry standards for private AI solutions. Developed in Germany, this pivotal release emphasizes data sovereignty and customer-centric innovation, addressing today's needs and shaping future secure, private AI deployments.

Jina AI aims to make continuous strides towards privacy-aware, state-of-the-art AI, as evident from its plans.

Why Jina Embeddings v2: A Leap in AI Capability

  • Extended Context Length: Jina Embeddings v2 models support an unprecedented 8K (8192 tokens) context length, allowing for a full understanding of longer documents.
  • Open Source Pioneer: Jina AI takes pride in offering the only open-source model with a context length that matches OpenAI’s proprietary models, broadening access to advanced AI.
  • Benchmark Leadership: On the Massive Text Embedding Benchmark (MTEB) leaderboard, our models boast performance on par with industry-leading models, attesting to our commitment to excellence.

Performance vs. OpenAI's text-embedding-ada002

Below is a comparative performance snapshot that showcases the robust capabilities of Jina Embeddings v2:

Modeltext-embedding-ada-002jina-embeddings-v2-base-en
Rank1821
Model Size (GB)Unknown0.27
Average (56 datasets)60.9960.38
Embedding Dimensions1536768
Sequence Length81918192
Classification Average (12 datasets)70.9373.45
Pair Classification Average (3 datasets)84.8985.38
Summarization Average (1 dataset)30.831.6
Retrieval Average (15 datasets)49.2547.87

Jina AI's base model excels particularly in Classification and Pair Classification tasks, underscoring its value in diverse applications ranging from document analysis to recommendation systems.

Get Started with Jina Embeddings v2 on AWS

To begin using Jina Embeddings v2, visit the AWS Marketplace listings and select the model that best fits your needs.

AWS Marketplace: Jina AI

These sample notebooks can help users get started with Jina Embeddings v2 models:

jina-sagemaker/notebooks/Real-time inference.ipynb at main · jina-ai/jina-sagemaker
Jina Embedding Models on AWS SageMaker. Contribute to jina-ai/jina-sagemaker development by creating an account on GitHub.

Real-time inference on SageMaker

jina-sagemaker/notebooks/Batch transform.ipynb at main · jina-ai/jina-sagemaker
Jina Embedding Models on AWS SageMaker. Contribute to jina-ai/jina-sagemaker development by creating an account on GitHub.

Embedding multiple sentences in a batch with SageMaker

Coming Soon: Multilingual Embeddings and More

Looking ahead, Jina AI is already deep in developing multilingual embedding models, making them available to its enterprise clients for private deployment on various cloud service providers (CSPs). With the imminent launch of these models, Jina AI is set to bridge language barriers, unlocking global opportunities for its clients.

About Jina AI GmbH

Located at Ohlauer Str. 43 (1st floor), zone A, 10999 Berlin, Germany, Jina AI is at the vanguard of reshaping the landscape of multimodal artificial intelligence. For inquiries, please reach out at contact@jina.ai.