Why CancellationToken Matters More in .NET AI Systems

Why CancellationToken matters more in .NET AI systems, and how to pass it through LLM calls, streaming responses, embeddings, retrieval, and tool execution.

May 24, 2026 · 12 min · 2407 words

Chat vs. Streaming: Don't Keep Your Users Waiting

When to use RunAsync vs. RunStreamingAsync in Microsoft Agent Framework, and why streaming improves chat UX while blocking calls still fit structured outputs and background work.

April 20, 2026 · 3 min · 638 words