hermes-agent

Files

Zhipeng Li 020601d41e fix(compression): drop conflicting 'resume Active Task' directive in summary prefix

SUMMARY_PREFIX previously contained two contradictory directives:

1. "treat it as background reference, NOT as active instructions"
   "Do NOT answer questions or fulfill requests mentioned in this summary"
   "Respond ONLY to the latest user message that appears AFTER this summary"

2. "Your current task is identified in the '## Active Task' section of the
    summary — resume exactly from there."

When the latest user message contradicted Active Task (e.g. 'stop the
i18n refactor', 'never mind, look at grafana instead'), models tended to
follow (2) anyway because 'resume exactly' is a strong, unambiguous
directive — leading to repeated re-surfacing of already-cancelled work
across turns, even after explicit 'stop'/'don't keep bringing that up'
messages from the user.

This change:
- Removes the conflicting 'resume exactly from Active Task' clause.
- Makes the precedence explicit: latest user message is the single source
  of truth; it WINS on conflict; cancelled Active Task / In Progress /
  Pending User Asks / Remaining Work must be discarded entirely (no
  'wrap up the old task first').
- Names canonical reverse signals (stop, undo, roll back, never mind,
  just verify, topic change) so the model recognizes them as cancellation
  triggers, not background context.
- Updates the summarizer template instruction so the LLM doesn't
  mechanically copy a cancelled task into Active Task on the next
  compaction (it's instructed to copy the reverse signal verbatim).
- Preserves: REFERENCE ONLY framing, MEMORY.md/USER.md authority, and
  the 'don't repeat work already reflected in session state' clause.

Adds tests/agent/test_summary_prefix_semantics.py to pin invariants so
the conflict can't regress.

Tested:
- All compaction tests pass: tests/agent/test_context_compressor.py,
  tests/agent/test_context_compressor_summary_continuity.py,
  tests/run_agent/test_413_compression.py,
  tests/run_agent/test_compression_persistence.py,
  tests/run_agent/test_compression_boundary_hook.py,
  tests/cli/test_manual_compress.py — 117/117 passing.
- Tested on macOS.

2026-05-30 07:29:21 -07:00

acp

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

acp_adapter

feat(azure-foundry): add Microsoft Entra ID auth

2026-05-18 10:14:38 -07:00

agent

fix(compression): drop conflicting 'resume Active Task' directive in summary prefix

2026-05-30 07:29:21 -07:00

cli

fix(cli): repaint input area after inline /steer and /model submit (#34839 )

2026-05-29 19:04:40 -07:00

cron

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

docker

fix(dashboard-auth): share /api/* public allowlist between legacy and OAuth gates

2026-05-29 12:17:12 +10:00

e2e

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

fakes

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

gateway

fix(gateway): recover model on post-interrupt turn; gate fallback status (#35381 )

2026-05-30 07:28:06 -07:00

hermes_cli

feat(cli): add hermes prompt-size diagnostic (#35276 )

2026-05-30 02:53:42 -07:00

hermes_state

feat(session_search): single-shape tool with discovery, scroll, browse — no LLM (#27590 )

2026-05-17 23:28:45 -07:00

honcho_plugin

fix(honcho): harden self-hosted setup paths

2026-05-29 22:29:48 -07:00

integration

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

openviking_plugin

fix(openviking): pre-check fs/stat to route file URIs before hitting directory-only endpoints

2026-04-30 02:35:29 -07:00

plugins

fix(honcho): harden self-hosted setup paths

2026-05-29 22:29:48 -07:00

providers

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

run_agent

fix(run_agent): gate concurrent checkpoint preflight on block_result (fixes #34827 )

2026-05-30 02:38:12 -07:00

scripts

feat(acp-registry): switch to uvx distribution, drop npm launcher

2026-05-14 22:27:09 -07:00

skills

fix(google-workspace): handle Gmail header casing case-insensitively

2026-05-30 02:38:18 -07:00

stress

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

tools

test(interrupt): assert no leaked tid instead of no-op block

2026-05-30 07:28:11 -07:00

tui_gateway

perf(tui): stop slow/dead MCP servers from freezing TUI startup

2026-05-30 02:53:37 -07:00

website

docs(skills): explain restoring bundled skills

2026-05-05 13:46:20 -07:00

__init__.py

A bit of restructuring for simplicity and organization

2025-10-01 23:29:25 +00:00

conftest.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672 )

2026-03-17 02:53:33 -07:00

test_account_usage.py

feat(account-usage): add per-provider account limits module

2026-04-21 01:56:35 -07:00

test_atomic_replace_symlinks.py

refactor: consolidate symlink-safe atomic replace into shared helper

2026-04-28 04:58:22 -07:00

test_base_url_hostname.py

security(runtime_provider): close OLLAMA_API_KEY substring-leak sweep miss (#13522 )

2026-04-21 06:06:16 -07:00

test_batch_runner_checkpoint.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_bitwarden_secrets.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_cli_file_drop.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_cli_manual_compress.py

fix(tests): catch up six stale tests after compression/aux/kanban changes (#28465 )

2026-05-18 21:43:59 -07:00

test_cli_skin_integration.py

fix(ci): stabilize main test suite regressions (#17660 )

2026-04-29 23:18:55 -07:00

test_ctx_halving_fix.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_docker_home_override_scripts.py

docker: opt in to dashboard --insecure via env var, never derive from bind host

2026-05-29 09:56:40 +10:00

test_empty_model_fallback.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_env_loader_secret_sources.py

fix(secrets): only apply external secrets once per HERMES_HOME per process (#32271 )

2026-05-25 15:18:55 -07:00

test_evidence_store.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_gateway_streaming_nested_config.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_get_tool_definitions_cache_isolation.py

fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335 )

2026-04-30 04:32:06 -07:00

test_hermes_bootstrap.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_hermes_constants.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_hermes_home_profile_warning.py

fix(constants): warn once when get_hermes_home() falls back under an active profile (#18746 )

2026-05-02 01:49:55 -07:00

test_hermes_logging.py

fix(logging): recover gateway.log handler from external rotation (#34349 )

2026-05-28 22:26:00 -07:00

test_hermes_state_compression_locks.py

fix(compression): prevent session-id fork from concurrent compressions (#34351 )

2026-05-28 21:40:39 -07:00

test_hermes_state_wal_fallback.py

fix(kanban): skip redundant WAL pragma on already-WAL connections

2026-05-27 14:31:55 -07:00

test_hermes_state.py

test(state): cover update_session_model overwrite + getattr-guard text path

2026-05-30 02:35:36 -07:00

test_honcho_client_config.py

fix(honcho): harden self-hosted setup paths

2026-05-29 22:29:48 -07:00

test_honcho_session_context.py

fix(honcho): align user context peer perspective

2026-05-27 10:49:33 -07:00

test_install_sh_browser_install.py

fix(install): support non-sudo service-user installs on apt distros (#25814 )

2026-05-14 09:05:31 -07:00

test_install_sh_pythonpath_sanitization.py

fix: harden install.sh against inherited Python env leakage

2026-05-06 04:02:02 -07:00

test_install_sh_root_fhs_uv_python_path.py

test(install): harden uv-python-path regression test against future drift

2026-05-27 13:55:51 -07:00

test_install_sh_setup_wizard_tty_probe.py

fix(install): widen /dev/tty open-probe to sibling gates (#16746 )

2026-04-28 06:45:55 -07:00

test_install_sh_symlink_stomp.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_install_sh_termux_network_prereqs.py

fix: strengthen termux install network prerequisites

2026-05-07 13:04:08 -07:00

test_ipv4_preference.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_lazy_session_regressions.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_lint_config.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_live_system_guard_self_test.py

chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )

2026-05-17 02:29:41 -07:00

test_mcp_serve.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_mini_swe_runner.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_minimax_model_validation.py

fix(models): validate MiniMax models against static catalog (#12611 , #12460 , #12399 , #12547 )

2026-04-19 22:44:47 -07:00

test_minimax_oauth.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_model_tools_async_bridge.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_model_tools.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_ollama_num_ctx.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_package_json_lazy_deps.py

fix(update): make Camofox lazy-installed instead of eager (#27055 )

2026-05-16 12:15:45 -07:00

test_packaging_metadata.py

security: pin patched Starlette (>=1.0.1) for CVE-2026-48710 BadHost (#35118 )

2026-05-29 23:23:54 -07:00

test_plugin_skills.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_process_loop_event_loop_warning.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_project_metadata.py

fix(stt,tts): restore mistralai — 2.4.8 is clean, ban lifted (#34841 )

2026-05-29 13:24:12 -07:00

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_run_tests_parallel.py

test: use subprocesses for each test file (#29016 )

2026-05-21 16:40:04 +05:30

test_sanitize_tool_error.py

security: sanitize tool error strings before injecting into model context (#26823 )

2026-05-16 00:57:39 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_termux_all_extra_compat.py

fix: add termux-all install profile and safe fallbacks

2026-05-07 13:04:08 -07:00

test_timezone.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_toolset_distributions.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_toolsets.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_trajectory_compressor_async.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_trajectory_compressor.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_transform_llm_output_hook.py

test+docs: cover transform_llm_output hook + release author map

2026-05-07 05:46:05 -07:00

test_transform_tool_result_hook.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_tui_gateway_server.py

test(tui-gateway): isolate completion_queue in poller requeue test

2026-05-29 13:29:24 -07:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00

test_yuanbao_integration.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_yuanbao_markdown.py

yuanbao platform (#16298 )

2026-04-26 18:50:49 -07:00

test_yuanbao_pipeline.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_yuanbao_proto.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00