<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Tool Calling on 卓琪的开发笔记</title>
    <link>https://zhuoqidev.com/tags/tool-calling/</link>
    <description>Recent content in Tool Calling on 卓琪的开发笔记</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>zh-CN</language>
    <copyright>© 2026 Liu ZhuoQi</copyright>
    <lastBuildDate>Sat, 13 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://zhuoqidev.com/tags/tool-calling/index.xml" rel="self" type="application/rss+xml" />
    
    <item>
      <title>Claude&#39;s Tool Calling Paradigm Shift: A Deep Dive into Programmatic Tool Calling and Dynamic Filtering</title>
      <link>https://zhuoqidev.com/en/posts/claude-programmatic-tool-calling-dynamic-filter/</link>
      <pubDate>Sat, 13 Jun 2026 00:00:00 +0000</pubDate>
      
      <guid>https://zhuoqidev.com/en/posts/claude-programmatic-tool-calling-dynamic-filter/</guid>
      <description>&lt;h2 class=&#34;relative group&#34;&gt;Background: The Cost Problem in Agent Tool Calling&#xA;    &lt;div id=&#34;background-the-cost-problem-in-agent-tool-calling&#34; class=&#34;anchor&#34;&gt;&lt;/div&gt;&#xA;    &#xA;    &lt;span&#xA;        class=&#34;absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none&#34;&gt;&#xA;        &lt;a class=&#34;text-primary-300 dark:text-neutral-700 !no-underline&#34; href=&#34;#background-the-cost-problem-in-agent-tool-calling&#34; aria-label=&#34;Anchor&#34;&gt;#&lt;/a&gt;&#xA;    &lt;/span&gt;&#xA;    &#xA;&lt;/h2&gt;&#xA;&lt;p&gt;In traditional agent tool-calling, every tool invocation requires a full cycle of &amp;ldquo;model inference → tool execution → result return → model re-inference.&amp;rdquo; This seemingly natural loop breaks down at scale in three ways:&lt;/p&gt;&#xA;&lt;ol&gt;&#xA;&lt;li&gt;&lt;strong&gt;Context Pollution&lt;/strong&gt;: Every tool result is injected verbatim into the context window. Fetch expense reports for 20 employees, and 2,000+ line items enter context — even though you only need to know &amp;ldquo;which 3 people exceeded their budget.&amp;rdquo;&lt;/li&gt;&#xA;&lt;li&gt;&lt;strong&gt;Inference Overhead&lt;/strong&gt;: Each tool call demands a full model inference pass. Five tools = five inference passes, each costing hundreds of milliseconds to seconds.&lt;/li&gt;&#xA;&lt;li&gt;&lt;strong&gt;Noise Degrades Accuracy&lt;/strong&gt;: When the context window is packed with intermediate results, the model must find signal in noise. &lt;a href=&#34;https://arxiv.org/abs/2509&#34;  target=&#34;_blank&#34; rel=&#34;noreferrer&#34;&gt;Context Rot research&lt;/a&gt; shows LLM performance on complex tasks drops 50-70% as context grows.&lt;/li&gt;&#xA;&lt;/ol&gt;&#xA;&lt;p&gt;As Florian Bruniaux puts it in the &lt;a href=&#34;https://cc.bruniaux.com/guide/architecture/&#34;  target=&#34;_blank&#34; rel=&#34;noreferrer&#34;&gt;Claude Code Architecture Guide&lt;/a&gt;: &lt;strong&gt;&amp;ldquo;The Outer Loop — everything outside the model: context management, tool invocation, verification, memory consolidation — increasingly determines system quality more than model inference itself.&amp;rdquo;&lt;/strong&gt;&lt;/p&gt;</description>
      
    </item>
    
  </channel>
</rss>
