Embedding Models
Embedding models convert text into numerical vectors that capture semantic meaning, enabling powerful semantic search and similarity matching in AINexLayer.

Overview
Embedding models are the foundation of semantic search in AINexLayer. They transform text into high-dimensional vectors that capture the meaning and context of your content, enabling the AI to find relevant information based on meaning rather than just keywords.
How Embeddings Work
Text to Vector Conversion
Text Input: Raw text from your documents
Tokenization: Break text into tokens (words, subwords)
Model Processing: Neural network processes tokens
Vector Output: Numerical representation of text meaning
Storage: Vectors are stored in a vector database (see the sketch below)
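A minimal sketch of this pipeline, using the sentence-transformers library (covered under Local Embedding Models below). Tokenization, model processing, and vector output all happen inside a single encode() call; storage is up to your vector database:

```python
from sentence_transformers import SentenceTransformer

# Steps 2-4 (tokenization, model processing, vector output) happen inside encode().
model = SentenceTransformer("all-MiniLM-L6-v2")

text = "Embedding models turn text into vectors."  # step 1: text input
vector = model.encode(text)                        # steps 2-4
print(vector.shape)                                # (384,) floats, ready to store (step 5)
```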
Semantic Understanding
Meaning Capture: Vectors represent semantic meaning
Context Awareness: Understands word context and relationships
Similarity Matching: Similar concepts have similar vectors
Cross-Language: Multilingual models can match meaning across different languages
Supported Embedding Models
OpenAI Embeddings
Best for: General-purpose semantic search, high accuracy
Available Models
text-embedding-ada-002: Previous-generation general-purpose model
text-embedding-3-small: Smaller, faster, lower-cost current-generation model
text-embedding-3-large: Largest and most accurate model
Configuration
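AINexLayer's own configuration keys are not reproduced here; as a reference point, this is how the underlying OpenAI Python SDK (openai >= 1.0) generates an embedding with any of the models above:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.embeddings.create(
    model="text-embedding-3-small",  # or text-embedding-ada-002 / text-embedding-3-large
    input="What is semantic search?",
)
vector = response.data[0].embedding  # 1536 floats for this model
```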
Specifications
Dimensions: 1536 (ada-002), 1536 (3-small), 3072 (3-large)
Context Length: 8191 tokens maximum input
Languages: 100+ languages supported
Pricing: ~$0.0001/1K tokens for ada-002; 3-small is cheaper and 3-large more expensive (check OpenAI's current pricing)
Azure OpenAI Embeddings
Best for: Enterprise deployments, compliance requirements
Available Models
text-embedding-ada-002: Enterprise-grade embedding model
text-embedding-3-small: Enterprise small model
text-embedding-3-large: Enterprise large model
Configuration
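A minimal sketch using the same SDK's AzureOpenAI client; the endpoint, key, and deployment name below are placeholders for your own Azure resource:

```python
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://YOUR-RESOURCE.openai.azure.com",  # placeholder
    api_key="YOUR_AZURE_OPENAI_KEY",                          # placeholder
    api_version="2024-02-01",
)

# On Azure, `model` is the name of *your* deployment of the embedding model.
response = client.embeddings.create(
    model="your-embedding-deployment",  # placeholder deployment name
    input="What is semantic search?",
)
vector = response.data[0].embedding
```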
Cohere Embeddings
Best for: Multilingual support, business applications
Available Models
embed-english-v3.0: English-optimized model
embed-multilingual-v3.0: Multilingual model
embed-english-light-v3.0: Lightweight English model
Configuration
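A sketch using Cohere's Python SDK; note that the v3.0 models require an input_type so the model can distinguish indexed documents from search queries:

```python
import cohere

co = cohere.Client("YOUR_COHERE_API_KEY")  # placeholder key

response = co.embed(
    texts=["What is semantic search?"],
    model="embed-english-v3.0",
    input_type="search_query",  # use "search_document" when indexing content
)
vector = response.embeddings[0]  # 1024 floats
```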
Specifications
Dimensions: 1024 (v3.0 models), 384 (light-v3.0)
Context Length: 512 tokens
Languages: 100+ languages
Pricing: $0.0001/1K tokens
Local Embedding Models
Best for: Privacy, offline use, cost control
Sentence Transformers
Available Models
all-MiniLM-L6-v2: Fast, general-purpose model
all-mpnet-base-v2: High-quality English model
paraphrase-multilingual-MiniLM-L12-v2: Multilingual model
distilbert-base-nli-mean-tokens: Distilled BERT model
Configuration
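A sketch of loading and running a Sentence Transformers model; everything executes on your own hardware, and the model weights are downloaded once and then cached:

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2", device="cpu")  # or "cuda"

vectors = model.encode(
    ["first document", "second document"],
    batch_size=32,
    normalize_embeddings=True,  # unit vectors: dot product equals cosine similarity
)
print(vectors.shape)  # (2, 384)
```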
Ollama Embeddings
Best for: Local deployment, custom models
Available Models
nomic-embed-text: High-quality local embedding
mxbai-embed-large: Large local embedding model
all-minilm: Lightweight local model
Configuration
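A sketch against Ollama's local REST API, which listens on port 11434 by default:

```python
import requests

response = requests.post(
    "http://localhost:11434/api/embeddings",
    json={"model": "nomic-embed-text", "prompt": "What is semantic search?"},
    timeout=60,
)
response.raise_for_status()
vector = response.json()["embedding"]
```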
Installation
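Install the Ollama runtime from https://ollama.com (on Linux, `curl -fsSL https://ollama.com/install.sh | sh`), then pull an embedding model before first use, e.g. `ollama pull nomic-embed-text`. The other models listed above are pulled the same way.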
Embedding Model Selection
By Use Case
General Document Search
Recommended: OpenAI text-embedding-3-small
Why: Good balance of speed and accuracy
Use Cases: General document retrieval, Q&A
High-Accuracy Search
Recommended: OpenAI text-embedding-3-large
Why: Highest accuracy for complex queries
Use Cases: Research, complex analysis
Multilingual Content
Recommended: Cohere embed-multilingual-v3.0
Why: Optimized for multiple languages
Use Cases: International documents, multilingual search
Privacy-Sensitive
Recommended: Local models (Sentence Transformers)
Why: Data stays on your infrastructure
Use Cases: Sensitive documents, compliance
Cost-Optimized
Recommended: Local models, OpenAI text-embedding-3-small
Why: Lowest cost per embedding (3-small is cheaper than ada-002)
Use Cases: High-volume processing, budget constraints
By Performance Requirements
Speed Priority
Fastest: Local models, OpenAI 3-small
Medium: Cohere models, OpenAI ada-002
Slower: OpenAI 3-large, complex local models
Accuracy Priority
Highest: OpenAI 3-large, Cohere multilingual
High: OpenAI 3-small, Cohere English
Good: Local models, OpenAI ada-002
Cost Priority
Cheapest: Local models (no per-token fees), OpenAI 3-small
Moderate: OpenAI ada-002, Cohere models
Expensive: OpenAI 3-large
Configuration Management
Environment Variables
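Provider API keys use the SDKs' standard variable names; the AINEX_* names below are hypothetical placeholders, since AINexLayer's real variable names are not reproduced here:

```python
import os

# Standard provider keys, read automatically by the respective SDKs.
openai_key = os.environ["OPENAI_API_KEY"]
cohere_key = os.environ.get("CO_API_KEY")

# Hypothetical AINexLayer-specific settings (placeholder names).
provider = os.environ.get("AINEX_EMBEDDING_PROVIDER", "openai")
model = os.environ.get("AINEX_EMBEDDING_MODEL", "text-embedding-3-small")
```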
Model Configuration
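A hypothetical configuration mapping to illustrate the settings that typically matter; AINexLayer's real schema may differ:

```python
EMBEDDING_CONFIG = {
    "provider": "openai",               # openai | azure | cohere | local | ollama
    "model": "text-embedding-3-small",
    "dimensions": 1536,                 # must match your vector index
    "batch_size": 64,                   # texts embedded per request
    "cache_embeddings": True,           # skip re-embedding unchanged text
}
```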
Performance Optimization
Embedding Generation
Batch Processing: Process multiple texts together (see the sketch after this list)
Parallel Processing: Use multiple workers
Caching: Cache embeddings for repeated text
Optimization: Use appropriate model for task
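A sketch combining batching and caching, assuming the OpenAI SDK: embed a whole batch in one request, reusing cached vectors for texts seen before:

```python
import hashlib
from openai import OpenAI

client = OpenAI()
_cache: dict[str, list[float]] = {}  # text hash -> embedding

def embed_batch(texts: list[str], model: str = "text-embedding-3-small") -> list[list[float]]:
    """Embed texts in a single API call, reusing cached vectors for repeats."""
    keys = [hashlib.sha256(t.encode()).hexdigest() for t in texts]
    missing = [t for t, k in zip(texts, keys) if k not in _cache]
    if missing:
        response = client.embeddings.create(model=model, input=missing)
        for t, item in zip(missing, response.data):  # data preserves input order
            _cache[hashlib.sha256(t.encode()).hexdigest()] = item.embedding
    return [_cache[k] for k in keys]
```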
Storage Optimization
Vector Compression: Compress vectors for storage
Indexing: Efficient vector indexing
Quantization: Reduce vector precision (sketched below)
Deduplication: Remove duplicate embeddings
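A minimal sketch of scalar quantization, one common way to reduce vector precision: float32 components are mapped to int8, cutting storage 4x at a small accuracy cost:

```python
import numpy as np

def quantize_int8(vectors: np.ndarray) -> tuple[np.ndarray, float]:
    """Map float32 vectors to int8; return the scale needed to decode."""
    scale = float(np.abs(vectors).max()) / 127.0
    return np.round(vectors / scale).astype(np.int8), scale

def dequantize(quantized: np.ndarray, scale: float) -> np.ndarray:
    return quantized.astype(np.float32) * scale
```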
Search Optimization
Index Optimization: Optimize vector indexes
Similarity Metrics: Choose appropriate similarity function
Query Optimization: Optimize search queries
Result Caching: Cache search results
Vector Dimensions
Dimension Trade-offs
Higher Dimensions: Better accuracy, more storage
Lower Dimensions: Faster search, less storage
Optimal Range: 384-1536 dimensions for most use cases
Common Dimensions
384: Fast, lightweight models
768: Balanced performance
1024: Good accuracy
1536: High accuracy (OpenAI standard)
3072: Maximum accuracy (OpenAI text-embedding-3-large; see the dimension-reduction sketch below)
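The OpenAI text-embedding-3-* models also accept a dimensions parameter that truncates the output vector (e.g. 3072 down to 1024) while preserving most retrieval quality, letting you trade accuracy against storage without switching models:

```python
from openai import OpenAI

client = OpenAI()
response = client.embeddings.create(
    model="text-embedding-3-large",
    input="vector size trade-offs",
    dimensions=1024,  # truncated from the native 3072
)
print(len(response.data[0].embedding))  # 1024
```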
Similarity Metrics
Cosine Similarity
Best for: General semantic similarity
Range: -1 to 1
Advantages: Scale-invariant, good for text
Use Cases: Most document search applications
Euclidean Distance
Best for: Geometric similarity
Range: 0 to infinity
Advantages: Intuitive distance measure
Use Cases: Clustering, classification
Dot Product
Best for: Fast computation
Range: -infinity to infinity
Advantages: Very fast computation
Use Cases: High-performance applications (all three metrics are sketched below)
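All three metrics in a few lines of NumPy, for reference:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def euclidean_distance(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.linalg.norm(a - b))

def dot_product(a: np.ndarray, b: np.ndarray) -> float:
    # Equal to cosine similarity when both vectors are unit-normalized.
    return float(np.dot(a, b))
```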
Troubleshooting
Common Issues
Embedding Generation Failures
API Errors: Check API keys and quotas
Model Errors: Verify model availability
Text Length: Check text length limits
Network Issues: Verify network connectivity
Poor Search Results
Model Selection: Try different embedding models
Text Quality: Improve source text quality
Chunking Strategy: Optimize text chunking
Similarity Threshold: Adjust similarity thresholds
Performance Issues
Slow Generation: Use faster models or batch processing
Memory Issues: Monitor memory usage
Storage Issues: Optimize vector storage
Search Speed: Optimize vector indexes
Error Handling
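A sketch of retrying transient failures with exponential backoff, assuming the OpenAI SDK; rate limits and connection errors are retried, everything else propagates immediately:

```python
import time
from openai import OpenAI, RateLimitError, APIConnectionError

client = OpenAI()

def embed_with_retry(text: str, model: str = "text-embedding-3-small",
                     retries: int = 3) -> list[float]:
    for attempt in range(retries):
        try:
            return client.embeddings.create(model=model, input=text).data[0].embedding
        except (RateLimitError, APIConnectionError):
            if attempt == retries - 1:
                raise
            time.sleep(2 ** attempt)  # back off: 1s, 2s, 4s ...
```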
Best Practices
Model Selection
Start Simple: Begin with OpenAI text-embedding-3-small or a local model
Test Performance: Evaluate models for your specific use case
Consider Costs: Balance accuracy with cost
Plan for Scale: Consider scaling requirements
Text Preparation
Clean Text: Remove noise and formatting issues
Appropriate Length: Use optimal text chunk sizes
Context Preservation: Maintain document context
Language Consistency: Use consistent language
Performance Optimization
Batch Processing: Process multiple texts together
Caching: Cache embeddings for repeated content
Indexing: Use efficient vector indexes
Monitoring: Monitor embedding performance
Security and Privacy
API Key Security: Secure embedding API keys
Data Privacy: Consider data privacy requirements
Local Models: Use local models for sensitive data
Access Control: Implement proper access controls
Integration Examples
Python Integration
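An end-to-end sketch using a local Sentence Transformers model: embed a few documents, embed a query, and rank by cosine similarity:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

docs = [
    "Embedding models map text to numerical vectors.",
    "Paris is the capital of France.",
]
doc_vectors = model.encode(docs, normalize_embeddings=True)

query_vector = model.encode("How do embeddings work?", normalize_embeddings=True)
scores = util.cos_sim(query_vector, doc_vectors)[0]

best = int(scores.argmax())
print(docs[best], float(scores[best]))
```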
API Integration
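The endpoint and payload below are hypothetical placeholders; consult the AINexLayer API reference for the real route and field names:

```python
import requests

response = requests.post(
    "https://your-ainexlayer-host/api/v1/embeddings",  # placeholder URL
    headers={"Authorization": "Bearer YOUR_API_KEY"},  # placeholder key
    json={"model": "text-embedding-3-small", "input": "hello world"},
    timeout=30,
)
response.raise_for_status()
print(response.json())
```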
🔢 Embedding models are the foundation of semantic search. Choose the right model for your needs to achieve optimal search performance and accuracy.