deerflow2

History

Ryker_Feng 167ef4512f feat(memory): add memory.token_counting config to avoid tiktoken network dependency (#3429) (#3465)	2026-06-10 23:26:15 +08:00

* feat(memory): add memory.token_counting config to avoid tiktoken network dependency (#3429)

  Add a `memory.token_counting` option (`tiktoken` | `char`) so deployments in network-restricted environments can opt out of tiktoken entirely. In `char` mode the memory-injection budget uses a network-free character-based estimate and never triggers the BPE download from openaipublic.blob.core.windows.net, which could otherwise block for tens of minutes (see #3402).

  Also harden the default `tiktoken` path:
  - cache an in-flight LOADING sentinel so concurrent callers fall back immediately instead of spawning more blocking get_encoding threads when the first load is still running (e.g. under the 5s startup warm-up timeout);
  - cache failures with a timestamp and retry after a cooldown so a transient network outage self-heals back to accurate counting without a restart;
  - skip startup warm-up entirely in char mode.

  The new config is surfaced via the memory config API and config.example.yaml (config_version bumped). Default remains `tiktoken`, so existing deployments are unaffected.

* fix(memory): use CJK-aware char token estimate and address review feedback

  - Replace the flat len(text)//4 fallback with a CJK-aware estimate so Chinese/Japanese/Korean memory content does not over-fill the injection budget
  - Document the internal tiktoken retry cooldown and char-mode escape hatch
  - Sync CLAUDE.md / config.example.yaml / MEMORY_IMPROVEMENTS.md wording
  - Fix MemoryConfigResponse mocks/assertions and add CJK estimate tests
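The CJK-aware character estimate mentioned in the commit message could look roughly like the sketch below. This is not the repository's code: the function name `estimate_tokens`, the code point ranges, and the ratios (about one token per CJK character, about four characters per token otherwise) are all assumptions chosen for illustration.

```python
import math

# Hypothetical code point ranges treated as CJK; the real implementation
# may use a different set.
_CJK_RANGES = (
    (0x3040, 0x30FF),  # Hiragana and Katakana
    (0x3400, 0x4DBF),  # CJK Unified Ideographs Extension A
    (0x4E00, 0x9FFF),  # CJK Unified Ideographs
    (0xAC00, 0xD7AF),  # Hangul Syllables
    (0xF900, 0xFAFF),  # CJK Compatibility Ideographs
)


def _is_cjk(ch: str) -> bool:
    """Return True if the character falls in one of the assumed CJK ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in _CJK_RANGES)


def estimate_tokens(text: str) -> int:
    """Network-free token estimate (assumed ratios, not measured ones).

    CJK characters are counted as one token each; all other characters
    are counted at four characters per token, rounded up.
    """
    cjk = sum(1 for ch in text if _is_cjk(ch))
    other = len(text) - cjk
    return cjk + math.ceil(other / 4)
```

With a flat `len(text) // 4`, a four-character Chinese string would be counted as one token, which is the under-count that lets CJK content over-fill the injection budget; the sketch counts it as four.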
..
agents	feat(memory): add memory.token_counting config to avoid tiktoken network dependency (#3429) (#3465)	2026-06-10 23:26:15 +08:00
community	fix(web_fetch): support proxy for Jina reader in restricted networks (#3418) (#3430)	2026-06-08 23:25:29 +08:00
config	feat(memory): add memory.token_counting config to avoid tiktoken network dependency (#3429) (#3465)	2026-06-10 23:26:15 +08:00
guardrails	feat(guardrails): add pre-tool-call authorization middleware with pluggable providers (#1240)	2026-03-23 18:07:33 +08:00
mcp	fix(mcp): close stdio sessions on their owning loop to avoid cross-task cancel-scope error (#3379) (#3392)	2026-06-07 21:37:30 +08:00
models	feat(models): add StepFun reasoning model adapter (#3461)	2026-06-09 18:01:43 +08:00
persistence	fix: harden run finalization persistence (#3155)	2026-05-23 00:09:06 +08:00
reflection	refactor: split backend into harness (deerflow.) and app (app.) (#1131)	2026-03-14 22:55:52 +08:00
runtime	fix runtime journal run lifecycle events (#3470)	2026-06-10 08:33:29 +08:00
sandbox	fix(sandbox): make missing sandbox.mounts host_path a loud ERROR (#3244) (#3250)	2026-06-09 23:16:14 +08:00
skills	fix(skills): harden slash skill activation across chat channels (#3466)	2026-06-09 23:07:17 +08:00
subagents	feat(subagents): extend deferred MCP tool loading to subagents (#3432)	2026-06-08 23:17:22 +08:00
tools	feat(subagents): extend deferred MCP tool loading to subagents (#3432)	2026-06-08 23:17:22 +08:00
tracing	fix(tracing): propagate session_id and user_id into Langfuse traces (#2944)	2026-05-21 16:49:31 +08:00
uploads	fix upload file size contract (#3408)	2026-06-06 15:12:17 +08:00
utils	fix(skills): harden slash skill activation across chat channels (#3466)	2026-06-09 23:07:17 +08:00
__init__.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131)	2026-03-14 22:55:52 +08:00
client.py	feat(memory): add memory.token_counting config to avoid tiktoken network dependency (#3429) (#3465)	2026-06-10 23:26:15 +08:00