deerflow2

History

DanielWalnut 3b105d1e5f fix(suggestions): strip inline <think> reasoning before parsing follow-up questions (#3435 ) Reasoning models such as MiniMax-M3 inline their chain-of-thought into the message content as <think>...</think> (reasoning_split defaults to false) instead of a separate reasoning_content field. The follow-up-suggestions endpoint extracted the JSON array via find('[') / rfind(']'), which silently broke whenever the reasoning text contained '[' or ']' — or when long thinking hit max_tokens and truncated before the array was emitted — returning empty suggestions. - Add _strip_think_blocks() and apply it before JSON extraction; it removes complete <think>...</think> blocks (case-insensitive) and drops an unclosed <think> left by max_tokens truncation. - Document the MiniMax thinking toggle in config.example.yaml (when_thinking_enabled: adaptive / when_thinking_disabled: disabled) so thinking_enabled=False actually disables reasoning on M3; note that M2.x models always think and rely on the defensive strip above. - Tests cover complete/unclosed think blocks, brackets-inside-think, think + code-fence, and an end-to-end suggestions case reproducing the empty-result bug. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>		2026-06-08 15:48:00 +08:00
..
__init__.py	feat(gateway): implement LangGraph Platform API in Gateway, replace langgraph-cli (#1403 )	2026-03-30 16:02:23 +08:00
agents.py	feat(agent): add custom-agent self-updates with user isolation (#2713 )	2026-05-05 23:17:42 +08:00
artifacts.py	fix(gateway): cap skill artifact preview size (#2963 )	2026-05-15 22:15:58 +08:00
assistants_compat.py	feat(gateway): implement LangGraph Platform API in Gateway, replace langgraph-cli (#1403 )	2026-03-30 16:02:23 +08:00
auth.py	fix(auth): replace setup-status 429 rate limit with cached response (#2915 )	2026-05-18 22:07:01 +08:00
channels.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
feedback.py	feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )	2026-04-26 11:09:55 +08:00
mcp.py	fix(security): harden MCP config endpoint (#3425 )	2026-06-08 12:21:02 +08:00
memory.py	feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )	2026-04-26 11:13:01 +08:00
models.py	refactor: thread release config through lead path (#2612 )	2026-04-28 14:53:18 +08:00
runs.py	fix: load paginated run history messages (#3305 )	2026-06-01 15:50:39 +08:00
skills.py	refactor(skills): Unified skill storage capability (#2613 )	2026-05-01 13:23:26 +08:00
suggestions.py	fix(suggestions): strip inline <think> reasoning before parsing follow-up questions (#3435 )	2026-06-08 15:48:00 +08:00
thread_runs.py	fix: load paginated run history messages (#3305 )	2026-06-01 15:50:39 +08:00
threads.py	perf(harness): push thread metadata filters into SQL (#2865 )	2026-05-12 23:21:22 +08:00
uploads.py	fix upload file size contract (#3408 )	2026-06-06 15:12:17 +08:00