AI Agent

How Agents Remember You: Human Memory Science and a Code Audit of Six Open-Source Systems

2026-07-30·5564 words· 27 min

Deep Dives AI Agent LLM Memory Memory Systems Cognitive Science Open-Source Architecture

Almost every agent project now claims to provide “long-term memory.” For one project, that means embedding chat history. For another, it means maintaining a user profile. A third lets the model edit Markdown files. A fourth builds a bitemporal knowledge graph. All four use the word memory, but they are not the same system and should not be placed on one undifferentiated leaderboard. To decide whether a system genuinely remembers, I would rather ask three questions:

Agent 如何记住你：人脑记忆史与六大开源系统代码审计

2026-07-30·1755 字· 9 分钟

深度调研 AI Agent LLM Memory 记忆系统认知科学开源架构

几乎每个 Agent 项目都说自己有「长期记忆」。有的意思是把聊天记录做 embedding，有的意思是维护一份用户画像，有的意思是让模型自己修改 Markdown，还有的已经做到了双时序知识图谱。它们都叫 memory，却不是同一种东西，也不该放在一张跑分榜上直接比较。要判断一个系统是不是真的「会记」，我更愿意问三个问题：一次经历之后，系统里的什么状态发生了变化？这个状态存在哪里，谁能修改，什么时候失效？下一次行动前，它如何被准确、合规地带回来？这篇文章从这三个问题出发。前半段把人类记忆科学与 Agent 记忆技术放在同一条历史轴上；后半段直接读代码，对照 Mem0、Letta、Graphiti、LangMem、Cognee 与 MemoryOS 的宣传卖点、实际数据流、系统边界和对应的记忆范式。先给结论：今天主流的 Agent 并没有获得一种像人脑那样的统一「记忆器官」。工程上真正有效的是一条闭环：经历 → 写入门控 → 表征 → 存储 → 检索 → 上下文组装 → 行动反馈 → 巩固 / 修订 / 遗忘。不同开源项目，只是选择接管这条闭环的不同部分。下面这张图不是某个产品的组件架构，而是全文共用的判断坐标系。它要回答的不是「数据放在哪」，而是「一次过去的经历如何真正影响下一次行动」。阅读时先沿中间的七步主环看信息如何从经历变成行动；再看左侧三种载体，区分当前任务、跨会话记忆与真实世界状态；右侧说明每一步完成的变换，底部则展示长期运行后必须发生的巩固、修订与遗忘。这样能避免把数据库、Context、缓存和真实状态都笼统地叫作“记忆”。

OpenClaw 实战：一行路径省掉 84% 的工具调用——Cron Job 排障实录

2026-06-20·更新于: 2026-07-30·561 字· 3 分钟

AI Agent 工程 OpenClaw AI Agent Cron Job SKILL.md Prompt Engineering 性能优化

OpenClaw 的 daily-ai-news 定时任务连续超时。根因不是模型不够强——是 SKILL.md 里少写了一行绝对路径，导致 Agent 每次花 15 次 exec 搜索工具位置。消息数 165→54，exec 调用 44→7，一行路径比任何算法调优都管用。

OpenClaw 记忆实战：从「向量搜索挂了也能用」到用 NVIDIA 免费 API 补全最后一块拼图

2026-06-20·更新于: 2026-07-30·544 字· 3 分钟

AI Agent 工程 OpenClaw AI Agent 记忆系统 Embedding NVIDIA 向量搜索 BM25

OpenClaw 记忆系统的向量检索默认不可用——但 BM25 文本搜索兜底让系统照常运转了两周。当你发现「不配 embedding 也能跑」，到底要不要修？怎么用 NVIDIA 免费 API 零成本补上？

OpenClaw Memory in Practice: From 'Vector Search Is Down But Everything Still Works' to Zero-Cost NVIDIA Embeddings

2026-06-20·更新于: 2026-07-30·1713 words· 9 min

Agent Engineering OpenClaw AI Agent Memory System Embedding NVIDIA Vector Search BM25

OpenClaw’s vector retrieval silently failed — but BM25 text search kept the memory system running for two weeks unnoticed. Should you even bother fixing it? Here’s how I used NVIDIA’s free embedding API to complete the picture at zero cost.

OpenClaw in Practice: One File Path Eliminated 84% of Tool Calls — A Cron Job Debugging Story

2026-06-20·更新于: 2026-07-30·1606 words· 8 min

Agent Engineering OpenClaw AI Agent Cron Job SKILL.md Prompt Engineering Performance

OpenClaw’s daily-ai-news cron job kept timing out. The root cause: a missing absolute path in the SKILL.md caused the Agent to spend 15 exec calls searching for a tool every run. Messages 165→54, exec calls 44→7 — one file path beat any algorithm optimization.

Claude's Tool Calling Paradigm Shift: A Deep Dive into Programmatic Tool Calling and Dynamic Filtering

2026-06-13·更新于: 2026-07-30·2655 words· 13 min

Agent Engineering Claude AI Agent Agent Architecture Tool Calling Context Engineering Programmatic Tool Calling Dynamic Filtering Code Execution

The important change is not “two more tool features.” It is the movement of multi-step orchestration into a code-execution environment, with only a compact result returning to model context. Background: The Cost Problem in Agent Tool Calling # In traditional agent tool-calling, every tool invocation requires a full cycle of “model inference → tool execution → result return → model re-inference.” This seemingly natural loop breaks down at scale in three ways:

Claude 工具调用范式转移：Programmatic Tool Calling 与 Dynamic Filter 深度解读

2026-06-13·更新于: 2026-07-30·1258 字· 6 分钟

AI Agent 工程 Claude AI Agent Agent 架构工具调用上下文工程 Programmatic Tool Calling Dynamic Filtering 代码执行

真正的变化不是“又多了两个工具功能”，而是 Agent 开始把多步工具编排移入代码执行环境，并只把压缩后的结果带回模型上下文。背景：Agent 工具调用的成本困境 # 在传统 Agent 工具调用模型中，每调用一个工具都需要完成一次"模型推理 → 工具执行 → 结果返回 → 模型再推理"的完整回合。这个看似自然的循环，在工具调用变多时会暴露出三个致命问题：上下文污染：每个工具的结果都被原封不动地注入上下文窗口。查 20 个员工的报销记录，2000+ 条费用明细全部进入 context，即使你只需要知道"哪 3 个人超预算了"。推理开销：每个工具调用都需要一次完整的模型推理。5 个工具调用 = 5 次推理 pass，每次几百毫秒到几秒不等。噪声导致准确率下降：当上下文窗口塞满了中间结果，模型不得不在大量噪声中寻找信号。Context Rot 研究表明，LLM 在复杂任务上的性能会随上下文增长而下降 50-70%。正如 Bruno 在 Claude Code Architecture Guide 中所指出的：“Outer Loop（模型外的一切：上下文管理、工具调用、验证、记忆巩固）开始比模型推理本身更决定系统质量。” Anthropic 在 2025 年 11 月到 2026 年 2 月间陆续推出的一系列工具使用增强功能，本质上都是为了解决 Outer Loop 的效率问题。其中 Programmatic Tool Calling (PTC) 和 Dynamic Filtering 是最具范式转移意义的两项。

OpenClaw 生产踩坑：当最先进的记忆系统遇到最静默的失败

2026-05-27·更新于: 2026-07-30·1660 字· 8 分钟

AI Agent 工程 OpenClaw AI Agent 飞书记忆系统 Compaction 排障

从部署到排障，记录 OpenClaw 从启动失败、飞书消息静默吞回复到 production 稳定的全链路实战经验——compaction safeguard、五层排查法、model-harness-fit 与记忆系统对比。

OpenClaw in Production: When the Most Advanced Memory System Meets the Quietest Failure

2026-05-27·更新于: 2026-07-30·4174 words· 20 min

Agent Engineering OpenClaw AI Agent Feishu Memory System Compaction Debugging

A full-chain production battle log: from startup failures and Feishu message silent drops to production stability — compaction safeguard, five-layer debugging, model-harness fit, and memory system comparison.

↑