How to Think About AI Systems in 2026

Why GenAI, Predictive Models, and Agents Need to Work Together I’ve been working in AI since well before the GenAI wave, and one thing that has become almost annoying in the last couple of years is how quickly every problem gets thrown at an LLM—with the expectation that it will magically solve everything. It won’t. … Continue reading How to Think About AI Systems in 2026

Why Do Multi-Agent LLM Systems Fail? Insights from Recent Research

As GEN AI evolves, multi-agent systems (MASs) are gaining traction, yet many remain at the PoC stage. So this is where research papers and surveys help. Over the weekend, I explored their failure points and found a fascinating study worth sharing. MASs promise enhanced collaboration and problem-solving, but ensuring consistent performance gains over single-agent frameworks … Continue reading Why Do Multi-Agent LLM Systems Fail? Insights from Recent Research

Evaluation and Experiment Tracking for LLM’s

Have you ever done text generation with an LLM and though: I wish I could track experiments and evaluate different LLM results in terms of cost, sample comparison and memory together like below? If yes this article for you. Experimentation tracking is one critical piece of model development which is very important as it helps … Continue reading Evaluation and Experiment Tracking for LLM’s