Technical Articles
Structured knowledge from client work, explorations, and deployment challenges.
AI Systems
How Does ChatGPT Work?
Breaks down the ChatGPT product stack: user interface, transformer model, and GPU-backed infrastructure for real-time inference.
chatgpt, transformer, llm, api, product, gpu
Memory in Agentic AI
Introduces structured memory layers for AI agents — from per-interaction to long-term memory — and their implications for system design.
agent, memory, context, vector, architecture
RAG Strategy and Tooling
In this article, we walk through three types of RAG strategies, the tradeoffs behind each, and how to choose appropriate tooling for your context.
rag, llm, context, vector, semantic search
Why are Vector Databases Difficult? A Deep Dive
This article explores the core difficulties of vector databases.
vector, semantic search, databases, technical
Categorical Encoding for Machine Learning
A guide to choosing the right categorical encoding method for your ML model.
categorical encoding, feature engineering, machine learning, preprocessing, target encoding, one-hot encoding
LLM Evaluation Metrics
Evaluating large language models remains one of the more challenging aspects of deploying AI systems in production.
LLM, metrics, evaluation, machine learning
Sampling Strategies
Imbalanced datasets create a fundamental challenge: machine learning models naturally gravitate towards predicting the majority class.
data, sampling, unbalanced data
LLM Training Fundamentals: From Tokens to Human Preference
Explores the mathematical foundations of language model training (tokens, embeddings, softmax) and how human preference ranking introduces alignment biases that cannot be captured by statistical metrics.
LLM, training, RLHF, alignment
Filtered Vector Search
Examines how metadata integration in vector indexes affects query execution and how multiple ranking signals can be combined in production systems.
vector, semantic search, databases
Product & Strategy
Cost of ML
Breaks down the F(X)=Y structure of ML projects, covering data, compute, labelling, and how to make financially sound deployment decisions.
ml costs, infrastructure, hosting, openai, roi
Machine Learning Design Patterns
Outlines reusable solution patterns in ML engineering — like classification, recommendation, and segmentation — and how they relate to business goals.
design patterns, forecasting, anomaly detection, similarity
Alignment: Semantics in the AI/ML Era
Explores how abstraction in AI systems has shifted semantic responsibility from code syntax to goal alignment, and how engineers must adapt to this new landscape.
alignment, semantics, ai safety, developer guidance
5 Mistakes to Avoid When Building Your First AI Product
Highlights common pitfalls in early AI product development, from overbuilding to ignoring deployment and cost structure.
ai product, startup advice, common errors, first build
How to Brief a Developer on a Feature Without Being Technical
A practical guide for business and product owners to communicate clearly with AI engineers without needing to write code.
briefing, communication, stakeholders, alignment, requirements
MLE vs SWE
In software engineering, success is defined by delivery. Machine learning projects rarely follow this pattern. Success is based on iteration.
engineering, machine learning, project planning
Metrics for Business Goals
How to choose evaluation metrics that align with business objectives. Covers precision vs recall trade-offs, cost-sensitive evaluation, and business-specific metrics with practical examples.
model evaluation, precision, recall, business metrics, ml metrics, cost-sensitive
The Indirection Problem: How Programming Concepts Explain Modern Frustrations
Explores how C programming's indirection pattern mirrors modern platform economy issues with accountability and complexity.
software patterns, platform economy, accountability, systems thinking
Capital, Labour, and the Productivity Question
The capital-labour productivity dynamic, the distinction between exploration and exploitation phases, and what this framework reveals about automation adoption and economic structure.
productivity
Data Strategy
What Data Do I Need?
Provides a structured approach for assessing data availability, modality, labelling strategy, and how to turn datasets into ML-ready resources.
data audit, labelling, metrics, modality, compliance
Leveraging AI for Organisational Intelligence
Shows how communication metadata and structured extraction can reveal organisational structure, workforce engagement, and leadership alignment.
organisations, internal analytics, graphs, workforce, visualisation
What is Data Maturity?
Data maturity refers to the degree to which an organisation can trust, access, and make use of its data in consistent, scalable, and repeatable ways.
organisations, data, maturity, metrics, data audit, internal analytics
How to Isolate Bad Training Data
Using cross-validation as a diagnostic tool to systematically identify problematic records in training datasets.
cross-validation, data quality, outliers, model evaluation
SQL Schema Documentation for RAG Pipelines
Well-documented schemas serve as a practical form of data governance and provide a semantic layer between raw database structures and their business meaning.
data governance, semantic layer, documentation
Security & Resilience
Security Risks in AI Systems with Public Access
Summarises the threat landscape for public-facing AI: prompt injection, inference attacks, persistent memory exploits, and adversarial queries.
security, prompt injection, rag, memory attacks, api abuse
Architecture & Deployment
SaaS Architecture 101 for Scalability - A Guide for Non-CTOs
Provides non-technical founders with a structured overview of architectural choices in SaaS products, including stack selection, microservices, and cost trade-offs.
saas, architecture, cloud, infrastructure, startup guide
AI Tooling and the Separation Between Coders and Programming
AI tooling makes the separation between writing code and building systems more visible and, sometimes, more problematic.
architecture, tools, pace
Platforms vs Pipelines: Engineering for Compounding Returns
Choose to build platforms rather than optimise pipelines. The distinction matters because it determines whether engineering effort produces linear or exponential returns.
platforms, pipelines, product strategy, user experience, architecture
Scaling Databases: Lessons from Facebook's MySQL Journey
Most businesses will never face the scaling challenges that Facebook encountered in its early years. Yet the decisions Facebook made while scaling from thousands to millions of users contain lessons relevant to any growing system
architecture, databases, deepdive
Infrastructure
Docker Deep Dive
Explores the foundational Linux primitives, internals of container images, and key deployment considerations for using Docker in real environments.
docker, containers, chroot, cgroups, cloud, deployment
Deploying Models to AWS Lambda
Shows how to wrap an ML model in FastAPI, package it with Docker, and deploy it as a container-based Lambda function with Terraform.
fastapi, lambda, aws, docker, ml ops
Developer Tools
Developing with APIs Without Rate Limits: Introducing APICache
Describes a lightweight caching proxy for structured API development, with examples from OpenAI, GitHub, and public APIs.
api, cache, ratelimit, offline, testing
Perspectives
The Problem with Faster Horses
Reframes foundation models as non-deterministic data stores rather than accelerated human workers, and asks what operations that framing permits.
foundation models, llm, data store, reframing, distribution
PMFOps: The Missing Pipeline
Proposes a third operational layer alongside DevOps and MLOps, built around the synthetic audience as a versioned artefact for testing product-market-fit.
pmfops, devops, mlops, synthetic audience, product, evals, roles