The 8purple blog

Field notes on AI & infrastructure

Practical writing on applied AI, GPU compute, platforms and the craft of shipping software that survives contact with production.

Vector Databases, Explained Without the Hype
Data & Infrastructure · 3 min read

Do you need a dedicated vector database, or is your existing one enough? A practical look at what these systems actually do and when they earn their keep.

How to Actually Evaluate an LLM Feature
Applied AI · 3 min read

You cannot ship what you cannot measure. Evaluating generative systems is harder than traditional software testing — and skipping it is how good demos become bad products.

The Real Economics of Running GPUs
Data & Infrastructure · 3 min read

GPU sticker prices get the headlines, but the bill that matters is driven by utilisation, power, and idle time. A field guide to what AI compute really costs.

MLOps Foundations: From Notebook to Reliable Service
Data & Infrastructure · 3 min read

A model that works in a notebook is a science project. A model that serves real traffic reliably is an engineering system. Bridging the two is what MLOps is for.

Prompt Injection and the New AI Attack Surface
Security · 3 min read

When your application takes instructions in plain language, attackers can write instructions too. Prompt injection is the vulnerability class that traditional security never prepared us for.

Model Quantization: Smaller, Faster, Almost as Good
Data & Infrastructure · 3 min read

Quantization shrinks a model by storing its numbers with less precision. Done well, it cuts memory and cost dramatically while barely touching quality. Here is the intuition and the tradeoffs.
