Redproof

OWASP LLM Top 10 · LLM08

Vector and Embedding Weaknesses

Attacks and leaks in the RAG layer: the embeddings, vector store, and retrieval that ground your model.

LLM08OWASP LLM Top 10AI red-teaming

What it is

Retrieval-augmented generation adds a new attack surface most security reviews miss: the embedding model, the vector database, and the retrieval logic. Weaknesses here cause cross-context leakage, let attackers steer what gets retrieved, or expose sensitive data encoded in embeddings.

How it shows up in real apps

A concrete example

Scenario

A shared vector index serves multiple customers from one collection.

Attack

A user phrases queries to pull chunks that belong to a different customer's documents.

Result

Confidential content crosses the tenant boundary through retrieval rather than through the model itself.

How we test for it

We test the retrieval layer directly: partitioning and access control across tenants and users, whether crafted queries can surface out-of-scope chunks, and whether deletion and permissions actually propagate to the index. This is where the model is fine but the plumbing leaks.

How to reduce the risk

EU AI Act: commonly maps to Art. 10 (data governance) and Art. 15 (robustness). Redproof reports findings as independent testing evidence, not a conformity verdict.

Test this on your own AI before someone else does

Redproof is independent red-teaming for LLM and AI-agent products. We probe your system for vector and embedding weaknesses and the rest of the OWASP LLM Top 10, hand you severity-ranked findings with reproductions, fixes, and EU AI Act mapping, and re-test after you patch. That is the evidence your self-assessment needs, before a regulator or customer asks.