Core Components

Confidential AI Inference Engines

Klave AI Engines run open-source models (LLMs, SLMs, VLMs) within TEEs using optimised inference frameworks like llama.cpp and Bitnet. Each engine maintains a unique cryptographic identity that can be remotely verified.

Attestation Process

Before processing begins, clients can verify that:

The correct model is loaded and unmodified
The inference engine runs in a genuine TEE
No unauthorised code can access model weights or user data
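
The three checks above can be sketched from the client's side as follows. This is a minimal illustration, not Klave's actual attestation API: the `AttestationReport` type, its field names, and `verify_report` are hypothetical, and a real client would validate the hardware vendor's signature chain rather than read a boolean.

```python
import hashlib
import hmac
from dataclasses import dataclass


@dataclass(frozen=True)
class AttestationReport:
    """Hypothetical, simplified stand-in for a TEE attestation report."""
    enclave_measurement: str  # hex hash of the code loaded in the enclave
    model_digest: str         # hex SHA-256 of the loaded model weights
    signature_valid: bool     # assumed result of checking the vendor signature chain


def verify_report(report, expected_measurement, expected_model_digest):
    """Return True only if every client-side check passes."""
    # 1. The report must come from a genuine TEE (signature chain verified).
    if not report.signature_valid:
        return False
    # 2. The enclave must run the expected code (no unauthorised code).
    if not hmac.compare_digest(report.enclave_measurement, expected_measurement):
        return False
    # 3. The loaded model must be the expected, unmodified one.
    return hmac.compare_digest(report.model_digest, expected_model_digest)


# Example values, computed locally for illustration only.
expected_code = hashlib.sha256(b"engine-build-1").hexdigest()
expected_model = hashlib.sha256(b"model-weights").hexdigest()
report = AttestationReport(expected_code, expected_model, signature_valid=True)
print(verify_report(report, expected_code, expected_model))
```

A client would send prompts or data only after `verify_report` returns `True`.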

Bring Your Own Model (BYOM)

Model owners can deploy proprietary models by:

Spinning up a new Klave AI Engine instance
Verifying the enclave through remote attestation
Transferring encrypted model weights
Decrypting only within the verified enclave

Private Retrieval-Augmented Generation (RAG)

Private RAG lets users draw on private and confidential knowledge sources to improve LLM output. It gives LLMs access to data beyond their training set, leading to more accurate and contextually relevant responses. The RAG process retrieves relevant information from external sources, augments the user's request with that data, and feeds the result to the LLM to generate a more accurate response.
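
The retrieve-then-augment loop described above can be illustrated with a toy example. It is a sketch under stated assumptions: the bag-of-words `embed` function stands in for a real embedding model, and `retrieve` and `augment` are illustrative names, not part of Klave AI.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words 'embedding'; a real system would use an embedding model."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, top_k=1):
    """Return the top_k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
    return ranked[:top_k]


def augment(query, context):
    """Prepend the retrieved context to the user's question."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Use the context below to answer.\n\nContext:\n{joined}\n\nQuestion: {query}"


docs = [
    "The refund policy allows returns within 30 days",
    "Office hours are 9 to 5 on weekdays",
]
question = "What is the refund policy?"
print(augment(question, retrieve(question, docs)))
```

The augmented prompt, rather than the bare question, is what gets sent to the LLM.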

Traditional RAG systems expose private knowledge bases to administrators and infrastructure providers. Klave AI's Private RAG runs entirely within Intel SGX enclaves and leverages Klave DB, ensuring that:

Document content never appears in plaintext outside the enclave
Vector embeddings are generated and stored confidentially
Retrieval operations maintain data privacy

Architecture

Fig. 1 - Private RAG Overview.

Document-to-Text Extraction

The extraction engine converts diverse document formats into clean, searchable text while preserving semantic structure and metadata.

Supported Formats:

PDF documents (including scanned/OCR processing)
Microsoft Office files (Word, Excel, PowerPoint)
HTML and web content
Plain text and markdown files
Email formats (PST, EML, MSG)
Image files with OCR capabilities

Processing Pipeline

Document Ingestion: Files are processed within SGX enclaves
Embedding Generation: Text is converted to vectors using confidential models
Secure Storage: Vectors are encrypted and stored in Klave DB
Private Retrieval: User queries retrieve relevant context without exposing source data
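
Between ingestion and embedding generation, RAG pipelines commonly split extracted text into overlapping chunks so that each vector covers a focused passage. The source does not say whether or how Klave AI chunks documents, so the sketch below is an assumption about a typical approach; `chunk_text` and its parameters are hypothetical.

```python
def chunk_text(text, chunk_size=50, overlap=10):
    """Split text into word-based chunks that share `overlap` words with the previous chunk."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once a chunk reaches the end, to avoid a trailing chunk of pure overlap.
        if start + chunk_size >= len(words):
            break
    return chunks


sample = " ".join(str(i) for i in range(10))
for chunk in chunk_text(sample, chunk_size=4, overlap=1):
    print(chunk)
```

Each chunk would then be embedded and stored as its own vector.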

In Klave AI, users can simply drag and drop documents to enable RAG.

System Prompt Enhancement

Klave AI lets you customise the system prompt to tailor and optimise the context provided to LLMs and so shape the model's output, allowing organisations to:

Define custom system prompts with proprietary guidelines
Inject context from private data sources
Maintain consistent model behaviour across deployments
Audit prompt modifications for compliance
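
As an illustration of the capabilities above, a system prompt can be assembled from guidelines and private context, with each version fingerprinted for auditing. This is a minimal sketch: `build_system_prompt`, `audit_record`, and the record fields are hypothetical and do not reflect Klave AI's actual interface.

```python
import hashlib
from datetime import datetime, timezone


def build_system_prompt(guidelines, context_snippets):
    """Combine proprietary guidelines and injected context into one system prompt."""
    parts = ["Guidelines:"] + [f"- {g}" for g in guidelines]
    if context_snippets:
        parts += ["", "Context:"] + [f"- {c}" for c in context_snippets]
    return "\n".join(parts)


def audit_record(prompt, author):
    """Fingerprint a prompt version so later modifications can be detected."""
    return {
        "author": author,
        "sha256": hashlib.sha256(prompt.encode("utf-8")).hexdigest(),
        "timestamp": datetime.now(timezone.utc).isoformat(),
    }


prompt = build_system_prompt(
    ["Answer in British English", "Never reveal customer identifiers"],
    ["Support tickets are handled within two business days"],
)
print(prompt)
print(audit_record(prompt, author="compliance-team"))
```

Because the hash changes whenever the prompt text changes, comparing records is enough to show that a prompt was modified.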

System Prompt Enhancement is facilitated by AI Agents.

Confidential AI Agents

Klave AI Agents are Klave Apps and therefore inherit all their characteristics. They run in TEEs, communicate with other agents via SA2A, and support fine-grained identity, role-based access control (RBAC), and auditability through attestation.

They provide the following capabilities:

Confidential logic and data handling
MCP clients for interactions with MCP servers
Interactions with other Agents
Interactions with Klave AI Engines

Klave AI Agents communicate using an attested version of the Agent-to-Agent (A2A) protocol called Secure-enclave A2A (SA2A).

SA2A (Secure-enclave Agent-to-Agent) Protocol

SA2A is Klave AI's purpose-built agent communication protocol. It follows the standardised A2A communication model (JSON-RPC over HTTP(S)) and augments it with TEE attestation to enforce encrypted communication with identified Agents running within secure enclaves.

In addition to the A2A capabilities (Agent discovery, flexible interaction, etc.), SA2A covers:

Attestation exchange before data sharing
Proof of authorised access to specific data sources
Cryptographic audit trails for regulatory compliance
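
The idea of exchanging attestation before sharing data can be sketched as a JSON-RPC 2.0 message that carries attestation evidence, which the receiving agent checks before doing any work. The SA2A wire format is not described here, so the `attestation` field, the `measurement` key, the method name, and both function names are assumptions made for illustration.

```python
import hashlib
import json


def make_request(request_id, method, params, attestation):
    """Build a JSON-RPC 2.0 request with a hypothetical 'attestation' field."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": method,
        "params": params,
        "attestation": attestation,  # assumed extension, not part of JSON-RPC or A2A
    })


def handle_request(raw, trusted_measurements):
    """Reject the call unless the sender's enclave measurement is trusted."""
    msg = json.loads(raw)
    attestation = msg.get("attestation") or {}
    if attestation.get("measurement") not in trusted_measurements:
        # -32000 lies in the JSON-RPC range reserved for implementation-defined server errors.
        return {"jsonrpc": "2.0", "id": msg.get("id"),
                "error": {"code": -32000, "message": "attestation rejected"}}
    return {"jsonrpc": "2.0", "id": msg.get("id"), "result": {"accepted": True}}


# Illustrative measurement of a known agent build.
known_agent = hashlib.sha256(b"agent-build-1").hexdigest()
request = make_request(1, "message/send", {"text": "hello"}, {"measurement": known_agent})
print(handle_request(request, trusted_measurements={known_agent}))
```

A production protocol would also verify the evidence cryptographically and bind it to the encrypted channel; this sketch only shows where the check sits in the request flow.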

Model Context Protocol Server Apps

Model Context Protocol (MCP) enables AI agents to interact with external services. Klave AI provides the industry's first fully encrypted and attestable MCP infrastructure. Klave Apps can easily be turned into fully fledged MCP servers to augment AI Agents' capabilities. In addition to running within secure enclaves, they enhance MCP by communicating with AI Agents through an attested MCP named Secure-enclave MCP (SMCP).

Enhanced MCP Protocol (SMCP - Secure-enclave MCP):

SMCP is Klave AI's purpose-built MCP. It leverages TEE capabilities to ensure secure communication through attestation.

All MCP communications use attestation-based encryption
Server capabilities are cryptographically verified
Service execution occurs within confidential environments
Data lineage is maintained across all MCP interactions
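
One common way to keep a tamper-evident lineage of interactions is a hash-chained log, where each entry commits to the one before it. The source does not specify how SMCP records lineage, so the sketch below is an assumption about one possible technique; `append_entry`, `verify_log`, and the entry layout are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder 'previous hash' for the first entry


def _entry_hash(prev_hash, event):
    """Hash an event together with the hash of the preceding entry."""
    payload = json.dumps({"prev": prev_hash, "event": event}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def append_entry(log, event):
    """Append an event, linking it to the previous entry's hash."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    log.append({"prev": prev_hash, "event": event, "hash": _entry_hash(prev_hash, event)})


def verify_log(log):
    """Recompute the chain and report whether every link is intact."""
    prev_hash = GENESIS
    for entry in log:
        if entry["prev"] != prev_hash or entry["hash"] != _entry_hash(prev_hash, entry["event"]):
            return False
        prev_hash = entry["hash"]
    return True


log = []
append_entry(log, {"tool": "search", "input_digest": "abc"})
append_entry(log, {"tool": "summarise", "input_digest": "def"})
print(verify_log(log))
```

Changing any recorded event invalidates its own hash and every later link, which is what makes the trail auditable.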
