deerflow2

History

He Wang e9deb6c2f2 perf(harness): push thread metadata filters into SQL (#2865 ) * perf(harness): push thread metadata filters into SQL Replace Python-side metadata filtering (5x overfetch + in-memory match) with database-side json_extract predicates so LIMIT/OFFSET pagination is exact regardless of match density. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com> * fix(harness): add dialect-aware JsonMatch compiler for type-safe metadata SQL filters Replace SQLAlchemy JSON index/comparator APIs with a custom JsonMatch ColumnElement that compiles to json_type/json_extract on SQLite and jsonb_typeof/->>/-> on PostgreSQL. Tighten key validation regex to single-segment identifiers, handle None/bool/numeric value types with json_type-based discrimination, and strengthen test coverage for edge cases and discriminability. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com> * fix(harness): address Copilot review comments on JSON metadata filters - Use json_typeof instead of jsonb_typeof in PostgreSQL compiler; the metadata_json column is JSON not JSONB so jsonb_typeof would error at runtime on any PostgreSQL backend - Align _is_safe_json_key with json_match's _KEY_CHARSET_RE so keys containing hyphens or leading digits are not silently skipped - Add thread_id as secondary ORDER BY in search() to make pagination deterministic when updated_at values collide; remove asyncio.sleep from the pagination regression test Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com> * fix(harness): address remaining review comments on metadata SQL filters - Remove _is_safe_json_key() and reuse json_match ValueError to avoid validator drift (Copilot #3217603895, #3217411616) - Raise ValueError when all metadata keys are rejected so callers never get silent unfiltered results (WillemJiang) - Fix integer precision: split int/float branches, bind int as Integer() with INTEGER/BIGINT CAST instead of float() coercion (Copilot #3217603972) - Fix jsonb_typeof -> json_typeof on JSON column (Copilot #3217411579) - Replace manual _cleanup() calls with async yield fixture so teardown always runs (Copilot #3217604019) - Remove asyncio.sleep(0.01) pagination ordering; use thread_id secondary sort instead (Copilot #3217411636) - Add type annotations to _bind/_build_clause/_compile_* and remove EOL comments from _Dialect fields (coding.mdc) - Expand test coverage: boolean/null/mixed-type/large-int precision, partial unsafe-key skip with caplog assertion Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(harness): address third-round Copilot review comments on JsonMatch - Reject unsupported value types (list, dict, ...) in JsonMatch.__init__ with TypeError so inherit_cache=True never receives an unhashable value and callers get an explicit error instead of silent str() coercion (Copilot #3217933201) - Upgrade int bindparam from Integer() to BigInteger() to align with BIGINT CAST and avoid overflow on large integers (Copilot #3217933252) - Catch TypeError alongside ValueError in search() so non-string metadata keys are warned and skipped rather than raising unexpectedly (Copilot #3217933300) - Add three tests: json_match rejects unsupported value types, search() warns and raises on non-string key, search() warns and raises on unsupported value type Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(harness): address fourth-round Copilot review comments on JsonMatch - Add CASE WHEN guard for PostgreSQL integer matching: json_typeof returns 'number' for both ints and floats; wrap CAST in CASE with regex guard '^-?[0-9]+$' so float rows never trigger CAST error (Copilot #3218413860) - Validate isinstance(key, str) before regex match in JsonMatch.__init__ so non-string keys raise ValueError consistently instead of TypeError from re.match (Copilot #3218413900) - Include exception message in metadata filter skip warning so callers can distinguish invalid key from unsupported value type (Copilot #3218413924) - Update tests: assert CASE WHEN guard in PG int compilation, cover non-string key ValueError in test_json_match_rejects_unsafe_key Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(harness): align ThreadMetaStore.search() signature with sql.py implementation Use `dict[str, Any]` for `metadata` and `list[dict[str, Any]]` as return type in base class and MemoryThreadMetaStore to resolve an LSP signature mismatch; also correct a test docstring that cited the wrong exception type. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(harness): surface InvalidMetadataFilterError as HTTP 400 in search endpoint Replace bare ValueError with a domain-specific InvalidMetadataFilterError (subclass of ValueError) so the Gateway handler can catch it and return HTTP 400 instead of letting it bubble up as a 500. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com> * fix(harness): sanitize metadata keys in log output to prevent log injection Use ascii() instead of %r to escape control characters in client-supplied metadata keys before logging, preventing multiline/forged log entries. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * fix(harness): validate metadata filters at API boundary and dedupe key/value rules - Add Pydantic ``field_validator`` on ``ThreadSearchRequest.metadata`` so unsafe keys / unsupported value types are rejected with HTTP 422 from both SQL and memory backends (closes Copilot review 3218830849). - Export ``validate_metadata_filter_key`` / ``validate_metadata_filter_value`` (and ``ALLOWED_FILTER_VALUE_TYPES``) from ``json_compat`` and have ``JsonMatch.__init__`` reuse them — the Gateway-side validator and the SQL-side ``JsonMatch`` constructor now share one admission rule and cannot drift. - Format ``InvalidMetadataFilterError`` rejected-keys list as a comma-separated plain string instead of a Python list repr so the surfaced HTTP 400 detail is readable (closes Copilot review 3218830899). - Update router tests to cover both 422 boundary paths plus the 400 defense-in-depth path when a backend still raises the error. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(harness): harden JsonMatch compile-time key validation against __init__ bypass Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com> * fix: address review feedback on metadata filter SQL push-down - Add signed 64-bit range check to validate_metadata_filter_value; give out-of-range ints a distinct TypeError message. - Replace assert guards in _compile_sqlite/_compile_pg with explicit if/raise so they survive python -O optimisation. Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4 <noreply@anthropic.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> Co-authored-by: Cursor <cursoragent@cursor.com>		2026-05-12 23:21:22 +08:00
..
_agent_e2e_helpers.py	fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )	2026-05-12 23:18:54 +08:00
_router_auth_helpers.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
conftest.py	refactor(skills): Unified skill storage capability (#2613 )	2026-05-01 13:23:26 +08:00
test_acp_config.py	feat(acp): add env field to ACPAgentConfig for subprocess env injection (#1447 )	2026-03-27 20:03:30 +08:00
test_aio_sandbox_local_backend.py	[security] fix(sandbox): bind local Docker ports to loopback (#2633 )	2026-04-30 11:40:28 +08:00
test_aio_sandbox_provider.py	feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )	2026-04-26 11:13:01 +08:00
test_aio_sandbox.py	fix(sandbox): pass no_change_timeout to exec_command to prevent 120s premature termination (#2685 )	2026-05-01 22:27:02 +08:00
test_app_config_reload.py	fix(config): reset config-backed singletons on hot reload (#2588 )	2026-05-06 10:17:55 +08:00
test_artifacts_router.py	feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )	2026-04-26 11:08:11 +08:00
test_auth_config.py	feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )	2026-04-26 11:08:11 +08:00
test_auth_errors.py	feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )	2026-04-26 11:08:11 +08:00
test_auth_middleware.py	feat: implement process-local internal authentication for Gateway and enhance CSRF handling	2026-04-26 22:20:57 +08:00
test_auth_type_system.py	feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )	2026-04-26 11:08:11 +08:00
test_auth.py	fix(security): harden auth system and fix run journal logic bug (#2593 )	2026-04-28 11:34:07 +08:00
test_channel_file_attachments.py	[security] fix(upload): reject symlinked upload destinations (#2623 )	2026-05-02 15:19:28 +08:00
test_channels.py	fix(channels): authenticate gateway command requests (#2742 )	2026-05-06 15:27:34 +08:00
test_check_script.py	fix(check): windows pnpm version detection in check script (#2189 )	2026-04-14 10:29:44 +08:00
test_checkpointer_none_fix.py	feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )	2026-04-26 11:09:55 +08:00
test_checkpointer.py	fix(packaging): add postgres extra for store/checkpointer supportFix postgres extra install guidance (#2584 )	2026-05-09 09:49:08 +08:00
test_clarification_middleware.py	fix(backend): make clarification messages idempotent (#2350 ) (#2351 )	2026-04-19 22:00:58 +08:00
test_claude_provider_oauth_billing.py	fix(oauth): Harden Claude OAuth cache-control handling (#1583 )	2026-03-30 07:41:18 +08:00
test_claude_provider_prompt_caching.py	fix: cap prompt caching breakpoints at 4 to prevent API 400 errors (#2449 )	2026-04-25 19:40:06 +08:00
test_cli_auth_providers.py	fix(provider): preserve streamed Codex output when response.completed.output is empty (#1928 )	2026-04-07 18:21:22 +08:00
test_client_e2e.py	fix(harness): resolve runtime paths from project root (#2642 )	2026-05-01 22:19:50 +08:00
test_client_live.py	[Security] Address critical host-shell escape in LocalSandboxProvider (#1547 )	2026-03-29 21:03:58 +08:00
test_client_message_serialization.py	feat: refine token usage display modes (#2329 )	2026-05-04 09:56:16 +08:00
test_client.py	feat: refine token usage display modes (#2329 )	2026-05-04 09:56:16 +08:00
test_codex_provider.py	chroe(2585): keep polishing the code of codex token usage (#2689 )	2026-05-02 15:04:11 +08:00
test_config_version.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_converters.py	feat(persistence): add unified persistence layer with event store, token tracking, and feedback (#1930 )	2026-04-26 11:05:47 +08:00
test_create_deerflow_agent_live.py	feat: add create_deerflow_agent SDK entry point (Phase 1) (#1203 )	2026-03-29 15:31:18 +08:00
test_create_deerflow_agent.py	feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )	2026-05-07 16:15:15 +08:00
test_credential_loader.py	feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )	2026-05-07 16:15:15 +08:00
test_csrf_middleware.py	feat: static system prompt with DynamicContextMiddleware for prefix-cache optimization (#2801 )	2026-05-09 09:27:02 +08:00
test_custom_agent.py	feat(agent): add custom-agent self-updates with user isolation (#2713 )	2026-05-05 23:17:42 +08:00
test_dangling_tool_call_middleware.py	fix(middleware): Handle invalid tool calls in dangling pairing middleware (#2890 ) (#2891 )	2026-05-12 10:55:13 +08:00
test_detect_uv_extras.py	fix(scripts): preserve uv extras across `make dev` restarts (#2754 ) (#2767 )	2026-05-10 22:28:29 +08:00
test_dev_entrypoint.py	fix(scripts): preserve uv extras across `make dev` restarts (#2754 ) (#2767 )	2026-05-10 22:28:29 +08:00
test_dingtalk_channel.py	feat(channels): add DingTalk channel integration (#2628 )	2026-04-30 11:25:33 +08:00
test_discord_channel.py	feat(channels): add Discord channel integration (#1806 )	2026-04-11 17:48:04 +08:00
test_docker_sandbox_mode_detection.py	fix Windows Docker sandbox path mounting (#1634 )	2026-03-31 22:19:27 +08:00
test_doctor.py	feat(dx): Setup Wizard + doctor command — closes #2030 (#2034 )	2026-04-10 17:43:39 +08:00
test_dynamic_context_middleware.py	fix(harness): preserve dynamic context across summarization (#2823 )	2026-05-09 19:39:36 +08:00
test_ensure_admin.py	refactor: Remove init_token handling from admin initialization logic and related tests	2026-04-26 11:09:56 +08:00
test_exa_tools.py	feat(community): add Exa search as community tool provider (#1357 )	2026-04-08 17:13:39 +08:00
test_feedback.py	feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )	2026-04-26 11:09:55 +08:00
test_feishu_parser.py	Feature/feishu receive file (#1608 )	2026-04-06 22:14:12 +08:00
test_file_conversion.py	[security] fix(uploads): require explicit opt-in for host-side document conversion (#2332 )	2026-04-18 22:47:42 +08:00
test_firecrawl_tools.py	feat(dx): Setup Wizard + doctor command — closes #2030 (#2034 )	2026-04-10 17:43:39 +08:00
test_gateway_deps_config.py	refactor: root release config in gateway runtime (#2611 )	2026-04-28 00:13:04 +08:00
test_gateway_docs_toggle.py	fix(nginx): defer CORS to gateway allowlist (#2861 )	2026-05-11 17:38:37 +08:00
test_gateway_lifespan_shutdown.py	fix(gateway): bound lifespan shutdown hooks to prevent worker hang under uvicorn reload (#2331 )	2026-04-23 19:41:26 +08:00
test_gateway_runtime_cleanup.py	fix(nginx): defer CORS to gateway allowlist (#2861 )	2026-05-11 17:38:37 +08:00
test_gateway_services.py	fix: keep new agent bootstrap in user scope (#2784 )	2026-05-09 19:43:50 +08:00
test_guardrail_middleware.py	feat(guardrails): add pre-tool-call authorization middleware with pluggable providers (#1240 )	2026-03-23 18:07:33 +08:00
test_harness_boundary.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_infoquest_client.py	feat(harness): integration ACP agent tool (#1344 )	2026-03-26 14:20:18 +08:00
test_initialize_admin.py	fix(security): harden auth system and fix run journal logic bug (#2593 )	2026-04-28 11:34:07 +08:00
test_invoke_acp_agent_tool.py	refactor: thread app_config through lead and subagent task path (#2666 )	2026-05-02 06:37:49 +08:00
test_jina_client.py	fix(jina): log transient failures at WARNING without traceback (#2484 ) (#2485 )	2026-04-24 16:00:14 +08:00
test_langgraph_auth.py	fix(security): harden auth system and fix run journal logic bug (#2593 )	2026-04-28 11:34:07 +08:00
test_lead_agent_model_resolution.py	feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )	2026-05-07 16:15:15 +08:00
test_lead_agent_prompt.py	fix(skills): enforce allowed-tools metadata (#2626 )	2026-05-07 08:34:43 +08:00
test_lead_agent_skills.py	fix(skills): enforce allowed-tools metadata (#2626 )	2026-05-07 08:34:43 +08:00
test_llm_error_handling_middleware.py	refactor: thread app_config through middleware factories (#2652 )	2026-04-30 12:41:09 +08:00
test_local_bash_tool_loading.py	fix(sandbox): improve sandbox security and preserve multimodal content (#2114 )	2026-04-11 16:52:10 +08:00
test_local_sandbox_encoding.py	fix(sandbox): disable msys path conversion (#2766 )	2026-05-08 10:13:11 +08:00
test_local_sandbox_provider_mounts.py	fix(harness): reset local sandbox singleton with provider lifecycle (#2834 )	2026-05-11 07:42:15 +08:00
test_local_skill_storage_write.py	refactor(skills): Unified skill storage capability (#2613 )	2026-05-01 13:23:26 +08:00
test_logging_level_from_config.py	fix(config): unify log_level from config.yaml across Gateway and debug entry points (#2601 )	2026-04-30 22:27:14 +08:00
test_loop_detection_config.py	feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )	2026-05-07 16:15:15 +08:00
test_loop_detection_middleware.py	feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )	2026-05-07 16:15:15 +08:00
test_mcp_client_config.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_mcp_custom_interceptors.py	feat(mcp): support custom tool interceptors via extensions_config.json (#2451 )	2026-04-25 09:18:13 +08:00
test_mcp_oauth.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_mcp_sync_wrapper.py	fix(harness): wrap async-only config tools for sync client execution (#2878 )	2026-05-11 22:14:13 +08:00
test_memory_prompt_injection.py	fix: inject longTermBackground into memory prompt (#1734 )	2026-04-03 11:21:58 +08:00
test_memory_queue_user_isolation.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_memory_queue.py	feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )	2026-04-26 11:13:01 +08:00
test_memory_router.py	feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )	2026-04-26 11:13:01 +08:00
test_memory_storage_user_isolation.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_memory_storage.py	fix: Memory update system has cache corruption, data loss, and thread-safety bugs (#2251 )	2026-04-17 12:00:31 +08:00
test_memory_thread_meta_isolation.py	feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )	2026-04-26 11:09:55 +08:00
test_memory_updater_user_isolation.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_memory_updater.py	fix(memory): replace short-lived asyncio.run() with persistent event loop (#2627 )	2026-04-30 17:59:57 +08:00
test_memory_upload_filtering.py	feat: flush memory before summarization (#2176 )	2026-04-14 15:01:06 +08:00
test_migration_user_isolation.py	feat(agent): add custom-agent self-updates with user isolation (#2713 )	2026-05-05 23:17:42 +08:00
test_mindie_provider.py	chore(adpator):Adapt MindIE engine model and improve testing and fixes (#2523 )	2026-04-28 15:09:31 +08:00
test_model_config.py	feat(codex): support explicit OpenAI Responses API config (#1235 )	2026-03-22 20:39:26 +08:00
test_model_factory.py	feat(persistence): add unified persistence layer with event store, token tracking, and feedback (#1930 )	2026-04-26 11:05:47 +08:00
test_owner_isolation.py	feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )	2026-04-26 11:09:55 +08:00
test_patched_deepseek.py	fix: resolve missing serialized kwargs in PatchedChatDeepSeek (#2025 )	2026-04-09 16:07:16 +08:00
test_patched_minimax.py	fix: improve MiniMax code plan integration (#1169 )	2026-03-20 17:18:59 +08:00
test_patched_openai.py	fix(LLM): fixing Gemini thinking + tool calls via OpenAI gateway (#1180 ) (#1205 )	2026-03-26 15:07:05 +08:00
test_paths_user_isolation.py	feat(agent): add custom-agent self-updates with user isolation (#2713 )	2026-05-05 23:17:42 +08:00
test_persistence_scaffold.py	fix(packaging): add postgres extra for store/checkpointer supportFix postgres extra install guidance (#2584 )	2026-05-09 09:49:08 +08:00
test_present_file_tool_core_logic.py	feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )	2026-04-26 11:13:01 +08:00
test_provisioner_kubeconfig.py	feat(provisioner): add optional PVC support for sandbox volumes (#2020 )	2026-04-10 20:40:30 +08:00
test_provisioner_pvc_volumes.py	feat(provisioner): add optional PVC support for sandbox volumes (#2020 )	2026-04-10 20:40:30 +08:00
test_readability.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_reflection_resolvers.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_remote_sandbox_backend.py	fix: Supplement list_running in RemoteSandboxBackend (#2716 )	2026-05-05 18:53:10 +08:00
test_run_event_store_pagination.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_run_event_store.py	fix(events): serialize structured db event content (#2762 )	2026-05-08 10:17:17 +08:00
test_run_journal.py	fix(runtime): persist run message summaries (#2850 )	2026-05-11 19:54:00 +08:00
test_run_manager.py	feat(run): Propagates `model_name` from the gateway request through the runtime and persistence stack to the SQLite database. (#2775 )	2026-05-11 21:45:18 +08:00
test_run_repository.py	feat(run): Propagates `model_name` from the gateway request through the runtime and persistence stack to the SQLite database. (#2775 )	2026-05-11 21:45:18 +08:00
test_run_worker_rollback.py	fix(runtime): make rollback restore checkpoint supersede newer checkpoints (#2582 )	2026-05-02 11:25:45 +08:00
test_runs_api_endpoints.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_runtime_paths.py	fix(harness): restore legacy skills path fallback (#2694 ) (#2696 )	2026-05-03 23:40:59 +08:00
test_sandbox_audit_middleware.py	feat(sandbox): strengthen bash command auditing with compound splitting and expanded patterns (#1881 )	2026-04-07 17:15:24 +08:00
test_sandbox_orphan_reconciliation_e2e.py	fix(sandbox): add startup reconciliation to prevent orphaned container leaks (#1976 )	2026-04-09 17:21:23 +08:00
test_sandbox_orphan_reconciliation.py	fix(sandbox): add startup reconciliation to prevent orphaned container leaks (#1976 )	2026-04-09 17:21:23 +08:00
test_sandbox_search_tools.py	fix(sandbox): add missing path masking in ls_tool output (#2317 )	2026-04-18 08:46:59 +08:00
test_sandbox_tools_security.py	fix(sandbox): block host bash traversal escapes (#2560 )	2026-04-28 12:18:41 +08:00
test_security_scanner.py	feat(trace):Add run_name to the trace info for system agents. (#2492 )	2026-04-24 17:06:55 +08:00
test_serialization.py	feat(gateway): implement LangGraph Platform API in Gateway, replace langgraph-cli (#1403 )	2026-03-30 16:02:23 +08:00
test_serialize_message_content.py	feat(harness): integration ACP agent tool (#1344 )	2026-03-26 14:20:18 +08:00
test_serper_tools.py	feat(community): add Serper web search provider (#2630 )	2026-05-02 16:22:35 +08:00
test_setup_agent_e2e_user_isolation.py	fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )	2026-05-12 23:18:54 +08:00
test_setup_agent_http_e2e_real_server.py	fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )	2026-05-12 23:18:54 +08:00
test_setup_agent_tool.py	fix: keep new agent bootstrap in user scope (#2784 )	2026-05-09 19:43:50 +08:00
test_setup_wizard.py	enable token usage by default (#2841 )	2026-05-10 22:00:57 +08:00
test_skill_manage_tool.py	refactor(skills): Unified skill storage capability (#2613 )	2026-05-01 13:23:26 +08:00
test_skills_archive_root.py	refactor: extract shared skill installer and upload manager to harness (#1202 )	2026-03-25 16:28:33 +08:00
test_skills_bundled.py	fix(skills): validate bundled SKILL.md front-matter in CI (fixes #2443 ) (#2457 )	2026-04-23 14:06:14 +08:00
test_skills_custom_router.py	refactor(skills): Unified skill storage capability (#2613 )	2026-05-01 13:23:26 +08:00
test_skills_installer.py	refactor(skills): Unified skill storage capability (#2613 )	2026-05-01 13:23:26 +08:00
test_skills_loader.py	fix(harness): restore legacy skills path fallback (#2694 ) (#2696 )	2026-05-03 23:40:59 +08:00
test_skills_parser.py	fix(skills): enforce allowed-tools metadata (#2626 )	2026-05-07 08:34:43 +08:00
test_skills_validation.py	fix(skills): enforce allowed-tools metadata (#2626 )	2026-05-07 08:34:43 +08:00
test_sse_format.py	feat(gateway): implement LangGraph Platform API in Gateway, replace langgraph-cli (#1403 )	2026-03-30 16:02:23 +08:00
test_stream_bridge.py	Fix(#1702 ): stream resume run (#1858 )	2026-04-06 14:51:10 +08:00
test_subagent_executor.py	fix(subagents): consolidate system_prompt and skills into single SystemMessage (#2701 )	2026-05-11 09:59:06 +08:00
test_subagent_limit_middleware.py	fix(middleware): sync raw tool call metadata (#2757 )	2026-05-08 10:08:53 +08:00
test_subagent_prompt_security.py	feat(subagents): support per-subagent skill loading and custom subagent types (#2253 )	2026-04-23 23:59:47 +08:00
test_subagent_skills_config.py	refactor: thread app_config through lead and subagent task path (#2666 )	2026-05-02 06:37:49 +08:00
test_subagent_timeout_config.py	feat(subagents): allow model override per subagent in config.yaml (#2064 )	2026-04-12 16:40:21 +08:00
test_subagent_token_collector.py	fix: bucket subagent token usage into parent run totals (#2838 )	2026-05-10 22:47:30 +08:00
test_suggestions_router.py	refactor: thread release config through lead path (#2612 )	2026-04-28 14:53:18 +08:00
test_summarization_middleware.py	fix(harness): preserve dynamic context across summarization (#2823 )	2026-05-09 19:39:36 +08:00
test_task_tool_core_logic.py	fix: bucket subagent token usage into parent run totals (#2838 )	2026-05-10 22:47:30 +08:00
test_thread_data_middleware.py	Fix Windows backend test compatibility (#1384 )	2026-03-26 17:39:16 +08:00
test_thread_meta_repo.py	perf(harness): push thread metadata filters into SQL (#2865 )	2026-05-12 23:21:22 +08:00
test_thread_run_messages_pagination.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_thread_token_usage.py	fix: use backend thread token usage for header total (#2800 )	2026-05-09 19:40:32 +08:00
test_threads_router.py	perf(harness): push thread metadata filters into SQL (#2865 )	2026-05-12 23:21:22 +08:00
test_title_generation.py	refactor: split backend into harness (deerflow.) and app (app.) (#1131 )	2026-03-14 22:55:52 +08:00
test_title_middleware_core_logic.py	fix title generation with dynamic context reminder (#2830 )	2026-05-09 18:22:58 +08:00
test_todo_middleware.py	fix(todo-middleware): prevent premature agent exit with incomplete todos (#2135 )	2026-04-14 11:11:26 +08:00
test_token_usage_config.py	enable token usage by default (#2841 )	2026-05-10 22:00:57 +08:00
test_token_usage_middleware.py	feat: static system prompt with DynamicContextMiddleware for prefix-cache optimization (#2801 )	2026-05-09 09:27:02 +08:00
test_token_usage.py	feat(harness): integration ACP agent tool (#1344 )	2026-03-26 14:20:18 +08:00
test_tool_args_schema_no_pydantic_warning.py	fix(tools): make write_file append discoverable in model-facing schema (#2843 )	2026-05-10 23:09:03 +08:00
test_tool_deduplication.py	fix(harness): wrap async-only config tools for sync client execution (#2878 )	2026-05-11 22:14:13 +08:00
test_tool_error_handling_middleware.py	fix(subagents): use model override for tools and middleware (#2641 )	2026-05-01 22:21:10 +08:00
test_tool_output_truncation.py	fix: add output truncation to ls_tool to prevent context window overflow (#1896 )	2026-04-06 15:09:57 +08:00
test_tool_search.py	fix: gate deferred MCP tool execution (#2513 )	2026-04-24 22:45:41 +08:00
test_tracing_config.py	feat(tracing): add optional Langfuse support (#1717 )	2026-04-02 13:06:10 +08:00
test_tracing_factory.py	feat(tracing): add optional Langfuse support (#1717 )	2026-04-02 13:06:10 +08:00
test_update_agent_e2e_user_isolation.py	fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )	2026-05-12 23:18:54 +08:00
test_update_agent_tool.py	feat(agent): add custom-agent self-updates with user isolation (#2713 )	2026-05-05 23:17:42 +08:00
test_uploads_manager.py	fix(uploads): add Windows support for safe symlink-protected uploads (#2794 )	2026-05-09 18:21:54 +08:00
test_uploads_middleware_core_logic.py	feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )	2026-04-26 11:13:01 +08:00
test_uploads_router.py	Fix duplicate gateway upload filenames (#2789 )	2026-05-09 18:02:40 +08:00
test_user_context.py	fix the lint error in backend	2026-04-26 15:09:25 +08:00
test_utils_time.py	fix(gateway): return ISO 8601 timestamps from threads endpoints (#2599 )	2026-05-02 15:16:16 +08:00
test_view_image_middleware.py	test: add unit tests for ViewImageMiddleware (#2256 )	2026-04-15 23:54:30 +08:00
test_view_image_tool.py	fix(harness): constrain view_image to thread data paths (#2557 )	2026-04-28 11:13:17 +08:00
test_vllm_provider.py	feat(models): add vLLM provider support (#1860 )	2026-04-06 15:18:34 +08:00
test_wechat_channel.py	feat: add WeChat channel integration (#1869 )	2026-04-10 20:49:28 +08:00