Technical Articles

Structured knowledge from client work, explorations, and deployment challenges.

AI Systems

How Does ChatGPT Work?

Breaks down the ChatGPT product stack: user interface, transformer model, and GPU-backed infrastructure for real-time inference.

chatgpt, transformer, llm, api, product, gpu

Memory in Agentic AI

Introduces structured memory layers for AI agents — from per-interaction to long-term memory — and their implications for system design.

agent, memory, context, vector, architecture

RAG Strategy and Tooling

In this article, we walk through three types of RAG strategies, the tradeoffs behind each, and how to choose appropriate tooling for your context.

rag, llm, context, vector, semantic search

Why are Vector Databases Difficult? A Deep Dive

This article explores the core difficulties of vector databases.

vector, semantic search, databases, technical

Categorical Encoding for Machine Learning

A guide to choosing the right categorical encoding method for your ML model.

categorical encoding, feature engineering, machine learning, preprocessing, target encoding, one-hot encoding

LLM Evaluation Metrics

Evaluating large language models remains one of the more challenging aspects of deploying AI systems in production.

LLM, metrics, evaluation, machine learning

Sampling Strategies

Imbalanced datasets create a fundamental challenge: machine learning models naturally gravitate towards predicting the majority class.

data, sampling, unbalanced data

LLM Training Fundamentals: From Tokens to Human Preference

Explores the mathematical foundations of language model training (tokens, embeddings, softmax) and how human preference ranking introduces alignment biases that cannot be captured by statistical metrics.

LLM, training, RLHF, alignment

Filtered Vector Search

Examines how metadata integration in vector indexes affects query execution and how multiple ranking signals can be combined in production systems.

vector, semantic search, databases

Product & Strategy

Cost of ML

Breaks down the F(X)=Y structure of ML projects, covering data, compute, labelling, and how to make financially sound deployment decisions.

ml costs, infrastructure, hosting, openai, roi

Machine Learning Design Patterns

Outlines reusable solution patterns in ML engineering — like classification, recommendation, and segmentation — and how they relate to business goals.

design patterns, forecasting, anomaly detection, similarity

Alignment: Semantics in the AI/ML Era

Explores how abstraction in AI systems has shifted semantic responsibility from code syntax to goal alignment, and how engineers must adapt to this new landscape.

alignment, semantics, ai safety, developer guidance

5 Mistakes to Avoid When Building Your First AI Product

Highlights common pitfalls in early AI product development, from overbuilding to ignoring deployment and cost structure.

ai product, startup advice, common errors, first build

How to Brief a Developer on a Feature Without Being Technical

A practical guide for business and product owners to communicate clearly with AI engineers without needing to write code.

briefing, communication, stakeholders, alignment, requirements

MLE vs SWE

In software engineering, success is defined by delivery. Machine learning projects rarely follow this pattern. Success is based on iteration.

engineering, machine learning, project planning

Metrics for Business Goals

How to choose evaluation metrics that align with business objectives. Covers precision vs recall trade-offs, cost-sensitive evaluation, and business-specific metrics with practical examples.

model evaluation, precision, recall, business metrics, ml metrics, cost-sensitive

The Indirection Problem: How Programming Concepts Explain Modern Frustrations

Explores how C programming's indirection pattern mirrors modern platform economy issues with accountability and complexity.

software patterns, platform economy, accountability, systems thinking

Capital, Labour, and the Productivity Question

The capital-labour productivity dynamic, the distinction between exploration and exploitation phases, and what this framework reveals about automation adoption and economic structure.

productivity

Data Strategy

What Data Do I Need?

Provides a structured approach for assessing data availability, modality, labelling strategy, and how to turn datasets into ML-ready resources.

data audit, labelling, metrics, modality, compliance

Leveraging AI for Organisational Intelligence

Shows how communication metadata and structured extraction can reveal organisational structure, workforce engagement, and leadership alignment.

organisations, internal analytics, graphs, workforce, visualisation

What is Data Maturity?

Data maturity refers to the degree to which an organisation can trust, access, and make use of its data in consistent, scalable, and repeatable ways.

organisations, data, maturity, metrics, data audit, internal analytics

How to Isolate Bad Training Data

Using cross-validation as a diagnostic tool to systematically identify problematic records in training datasets.

cross-validation, data quality, outliers, model evaluation

SQL Schema Documentation for RAG Pipelines

Well-documented schemas serve as a practical form of data governance and provide a semantic layer between raw database structures and their business meaning.

data governance, semantic layer, documentation

Security & Resilience

Security Risks in AI Systems with Public Access

Summarises the threat landscape for public-facing AI: prompt injection, inference attacks, persistent memory exploits, and adversarial queries.

security, prompt injection, rag, memory attacks, api abuse

Architecture & Deployment

SaaS Architecture 101 for Scalability - A Guide for Non-CTOs

Provides non-technical founders with a structured overview of architectural choices in SaaS products, including stack selection, microservices, and cost trade-offs.

saas, architecture, cloud, infrastructure, startup guide

AI Tooling and the Separation Between Coders and Programming

AI tooling makes the separation between writing code and building systems more visible and, sometimes, more problematic.

architecture, tools, pace

Platforms vs Pipelines: Engineering for Compounding Returns

Choose to build platforms rather than optimise pipelines. The distinction matters because it determines whether engineering effort produces linear or exponential returns.

platforms, pipelines, product strategy, user experience, architecture

Scaling Databases: Lessons from Facebook's MySQL Journey

Most businesses will never face the scaling challenges that Facebook encountered in its early years. Yet the decisions Facebook made while scaling from thousands to millions of users contain lessons relevant to any growing system

architecture, databases, deepdive

Infrastructure

Docker Deep Dive

Explores the foundational Linux primitives, internals of container images, and key deployment considerations for using Docker in real environments.

docker, containers, chroot, cgroups, cloud, deployment

Deploying Models to AWS Lambda

Shows how to wrap an ML model in FastAPI, package it with Docker, and deploy it as a container-based Lambda function with Terraform.

fastapi, lambda, aws, docker, ml ops

Developer Tools

Developing with APIs Without Rate Limits: Introducing APICache

Describes a lightweight caching proxy for structured API development, with examples from OpenAI, GitHub, and public APIs.

api, cache, ratelimit, offline, testing

Perspectives

The Problem with Faster Horses

Reframes foundation models as non-deterministic data stores rather than accelerated human workers, and asks what operations that framing permits.

foundation models, llm, data store, reframing, distribution

PMFOps: The Missing Pipeline

Proposes a third operational layer alongside DevOps and MLOps, built around the synthetic audience as a versioned artefact for testing product-market-fit.

pmfops, devops, mlops, synthetic audience, product, evals, roles