Blog
Writing on AI systems, deployment, and evaluation
These articles focus on how AI systems are designed, tested, deployed, and governed in practice, with an emphasis on trade-offs that matter in real engineering work.
SLMs for Swiss Re: Supervised, Structured, and Serving
For insurers, the winning pattern is rarely “largest model everywhere.” The winners are smaller, supervised, schema-bound models embedded into claims, underwriting, and compliance workflows, with humans still controlling the decision.
ParaEval and the CRP/HDP Model: Bayesian Nonparametric Trigger Calibration for Parametric Insurance
Parametric insurance triggers are only as good as the statistical model behind them. This post walks through ParaEval — a decision-evaluation platform for parametric claims — and its CRP/HDP sub-model, which uses Bayesian nonparametric clustering to discover latent peril regimes and calibrate triggers with lower basis risk than standard actuarial baselines.
Your Resume Is Also a Prompt: Why Prompt Injection Is the Defining Security Problem in Real-World LLM Systems
The most dangerous misunderstanding in enterprise AI is treating documents as data when LLMs treat them as instructions. This post examines the research on resume-based prompt injection, connects it to broader attack surfaces, and argues that interface design is now security design.
Parametric Insurance Deep Dive: Speed, Basis Risk, and Trigger Design
Parametric insurance replaces loss adjustment with an observable trigger, but the hard part is not speed. It is picking a defensible index, reducing basis risk, and governing the data path from event to payout.
Operating AI in Regulated Environments: HIPAA, GDPR, PCI DSS & Beyond
The moment an AI system touches health, payment, or EU personal data, architecture turns into compliance choreography. This guide translates the major regulations into the engineering artifacts and process controls they demand.
OWASP Top 10 for LLM Apps: Real Attacks, Real Fixes
For LLM apps, the attack often arrives as plain language rather than obviously malicious code. This guide walks through the OWASP risks as real failure stories, then shows the concrete controls that stop them.
Security & Compliance Standards for AI Systems
AI security begins where ordinary app security stops: the attack can be a dataset, a gradient, or a paragraph that looks harmless. This guide maps that wider threat surface and the controls regulated teams need.
AI Agents: From ReAct to Multi-Agent Systems
An agent is what happens when an LLM stops answering once and starts acting repeatedly in the world. This guide traces the control loops, tool use, and guardrails that separate a demo agent from a dependable one.
AI Governance and Regulations: From EU AI Act to ISO 42001
AI governance is the moment the story meets law: models leave the lab and enter a world of risk tiers, audits, and named obligations. This guide maps the major frameworks and what they require teams to actually build.
Data Warehouse, Data Lake, and Lakehouse: A Visual Architecture Guide
Warehouses, lakes, and lakehouses are really three answers to one question: when should raw data be forced into shape? This guide turns that architectural choice into concrete diagrams and decision rules.
DevOps to MLOps: Building the Shared Delivery Muscle
DevOps taught teams to ship code like a disciplined factory line; MLOps adds a third moving part, data, and suddenly the factory floor shifts under your feet. This guide shows what transfers cleanly and what breaks.
Federated Learning: Training Models Without Moving Data
Federated learning flips the usual gravity of ML: instead of hauling sensitive data to one warehouse, it sends the model out like a traveling teacher and brings back only the lessons. This guide explains the math and the operational trade-offs.
LLM Fine-Tuning: LoRA, QLoRA, DPO, and Mixture-of-Experts
A base LLM is a general instrument; fine-tuning changes how tightly it resonates with your task. This guide maps the adaptation spectrum from prompting to MoE, with the math behind each trade-off.
MLOps Systems Blueprint for Reliable AI
Production ML behaves like a three-body problem: code, data, and live behavior all pull in different directions. This guide shows how to turn that motion into a stable, self-correcting delivery loop.
Neural Architectures Decoded: FFNN, RNN, and Transformers
Feedforward nets, RNNs, and transformers are three different ways of teaching machines to notice patterns: layers for shape, recurrence for memory, and attention for selective focus. This guide compares them without losing the math.
Responsible AI: Safety, Fairness, and Trustworthy Systems
Getting a model to work is only the opening scene; the harder plot begins when it must stay fair, explainable, safe, and accountable under pressure. This guide maps the pillars and practices that keep trust from collapsing.
Retrieval-Augmented Generation: Architecture, Evaluation, and Production
RAG gives an LLM a memory it can check instead of bluffing from a frozen past. This guide follows the full pipeline from chunking to evaluation so a prototype can grow into a production system.