Tagged SAP

4 posts on SAP.

HNSW or IVF-PQ? What I Actually Chose at 2M Documents

8 min read

The recall-vs-memory decision behind a GDPR-compliant RAG platform. Why I stopped reaching for IVF-PQ, what the graph index cost me in RAM, and the one thing about HNSW's insert path I didn't see coming until we crossed a million vectors.

RAG, Vector Search, HNSW, SAP

Redact at Retrieval, Not at Ingest: A GDPR-Compliant RAG Architecture

9 min read

The naive PII strategy is to scrub the corpus at index time. It's also the strategy that quietly destroys recall on every query that legitimately mentions a public entity. Here's why I moved the redaction pass downstream of retrieval — and how a DeBERTa PII model, an HNSW index, and a cross-encoder reranker fit inside a sub-2s p95 budget without stepping on each other.

RAG, GDPR, Architecture, SAP

Why We Fine-Tuned DeBERTa-base and Not XLM-R for German PII

10 min read

The multilingual model was the obvious pick and it lost by six F1 points. Why tokenizer coverage beat parameter breadth on Bundesdatenschutzgesetz entities, what disentangled attention did to German compound nouns, and the cost of specializing a model inside a regulated pipeline.

NLP, DeBERTa, Fine-tuning, GDPR

A Dependency-Free Go Binary Is the Right Answer for a 9,000-Server Fleet

9 min read

Why I stopped shipping a Python client to 9,000 Linux servers for a security-critical credential fetch, what static linking actually buys you at fleet scale, and the surprising moment a stripped Go binary weighed less than the Python container image it replaced.

Go, Infrastructure, Security, SAP