Conversations on data, AI, and the systems behind them.
Long-form interviews with engineers, researchers, and operators working at the production edge of AI.
I'm Muhammad Afzaal — 20+ years building enterprise software, the last five on production LLM and data platforms. I help teams move from prototype to production without losing the plot.
Three projects that show what I bring: pragmatic system design, a bias toward measurable outcomes, and code that survives in real environments.
Built the d365fo-client Python library + MCP server: 29 tools, OData query, FTS5 metadata search. AI assistants can now operate D365 F&O via natural language.
Production RAG with chunking, hybrid retrieval, and reranking. Cost dropped 60% on the same answer quality after evaluation-driven optimization.
Mapped 6 OSS data-governance frameworks against NIST AI RMF; published the comparison + architecture guide that anchors team decisions.
Lessons from real systems — RAG cost economics, MCP integration, governance frameworks, evaluation pipelines that don't lie.
A deep dive into why Large Language Models are auto-regressive Markov chains, how GPU floating-point non-associativity and FlashAttention break determinism, and why agent pipelines behave like controlled stochastic systems.
Naive Service Principal access exposes sensitive ERP data to LLM context windows. Learn how to architect zero-trust AI agents using OAuth2 On-Behalf-Of (OBO) token exchange and database-enforced Row-Level Security (RLS).
Enterprise APIs are too massive for LLM prompts. Discover context engineering patterns like JIT schema pruning, semantic routing, and session compression to build efficient agents.
Brownian motion had been observed, priced, explained, and made rigorous. Kiyosi Itô did something stranger: he built a calculus for paths too jagged to differentiate.
Long-form interviews with engineers, researchers, and operators working at the production edge of AI.
Open-source assistant with LangGraph orchestration and a clean theme system. The same widget that powers the chat on this site.
I work with a small number of teams each quarter — usually on production LLM pipelines, MCP integrations with enterprise systems, or evaluation frameworks that survive contact with real data.