Dynamic Agent Context with AIContextProvider

How to use AIContextProvider in Microsoft Agent Framework to inject dynamic memory, reduce tool-token overhead, add guardrails, and extend agent context at runtime.

April 30, 2026 · 7 min · 1314 words

Controlling Token Growth with Chat Reducers

How to control token growth in Microsoft Agent Framework with message-count and summarizing chat reducers, including setup, tradeoffs, and when each approach fits.

April 28, 2026 · 5 min · 1016 words

State Management and Chat History

How to manage short-term and persistent conversation state in Microsoft Agent Framework using AgentSession, StateBag, and a custom ChatHistoryProvider.

April 23, 2026 · 4 min · 770 words

Chat vs. Streaming: Don't Keep Your Users Waiting

When to use RunAsync vs. RunStreamingAsync in Microsoft Agent Framework, and why streaming improves chat UX while blocking calls still fit structured outputs and background work.

April 20, 2026 · 3 min · 638 words

Zero to First Agent

How to initialize the Microsoft Agent Framework, connect to Azure, OpenAI or local Ollama models and execute your first asynchronous agent run.

April 16, 2026 · 5 min · 930 words

Microsoft Agent Framework: Introduction

How Microsoft Agent Framework builds on Microsoft.Extensions.AI, when it supersedes Semantic Kernel for new .NET agent systems, and where MCP, context providers, and workflows fit.

April 14, 2026 · 7 min · 1416 words

Stop RAG Hallucinations with the Short-Circuit Pattern

How to reduce RAG hallucinations by short-circuiting generation when retrieval returns weak evidence, with a simple C# threshold check.

March 22, 2026 · 2 min · 316 words

Local LLMs in .NET

A minimal .NET starter for running local LLMs with Ollama + OllamaSharp behind IChatClient—no API keys, streaming chat, system prompts, and capped conversation history.

February 8, 2026 · 3 min · 630 words