hermes-agent

Files

Teknium 897f9533ed fix: keep CLI context display in sync with preflight token estimate (#35079 )

* Inspired by Claude Code: /compress here [N] — boundary-aware 'summarize up to here'

Adds a user-chosen compression boundary to the existing /compress command.
/compress here [N] summarizes everything except the most recent N exchanges
(default 2), which are preserved verbatim — letting the user pick the
compression boundary instead of relying on the automatic token-budget heuristic.

Inspired by Claude Code's Rewind 'Summarize up to here' action (v2.1.139,
Week 20, May 2026): https://code.claude.com/docs/en/whats-new/2026-w20

- hermes_cli/partial_compress.py: pure split/parse helpers + seam-alternation
  guard (shared by CLI and gateway).
- cli.py / gateway/run.py: route 'here [N]' / '--keep N' to partial compression;
  compress only the head, re-append the verbatim tail through the seam guard.
- Preserves message-flow role alternation (seam guard merges any illegal
  user->user / assistant->assistant adjacency).
- Reuses the existing _compress_context session-rotation/lock machinery — no
  changes to the compression core.
- Bare /compress (full) and /compress <focus> behavior unchanged.

Tests: 12 helper unit tests + 5 CLI integration tests + E2E (interleaved
tool-call transcript, degenerate/multimodal seams, real handler path).

* fix: keep CLI context display in sync with preflight token estimate

The status bar reads compressor.last_prompt_tokens, which only updates
from a successful API response. When loaded history is oversized but
compression no-ops (e.g. the auxiliary summary model times out), no fresh
usage arrives and the bar stays frozen at the old, smaller value while the
preflight estimate reports a much larger number — looking permanently out
of sync (reported: 74.4K display vs ~144,669 preflight).

Seed last_prompt_tokens with the fresh preflight estimate (upward-only, so
a real usage figure is never clobbered and a successful compression's
downward correction still wins). Display-only; no behavioral change to
compression, caching, or the agent loop.

2026-05-29 19:21:15 -07:00

__init__.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

conftest.py

ci(tests): add pytest-timeout 60s hard cap to break suite-teardown deadlock (#28861 )

2026-05-19 17:27:24 -07:00

test_413_compression.py

fix: keep CLI context display in sync with preflight token estimate (#35079 )

2026-05-29 19:21:15 -07:00

test_860_dedup.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_1630_context_overflow_loop.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_18028_content_policy_blocked.py

fix(agent): fallback immediately on provider content-policy blocks (#33883 )

2026-05-28 07:28:24 -07:00

test_31273_402_not_retried.py

fix(agent): abort on HTTP 402 after pool rotation and fallback fail (#31443 )

2026-05-24 15:14:13 -07:00

test_agent_guardrails.py

fix(agent): include name field on every role:tool message for Gemini compatibility (#16478 )

2026-05-04 05:06:33 -07:00

test_anthropic_prompt_cache_policy.py

fix(cache): kill long-lived prefix layout — system prompt is now byte-static within a session (#24778 )

2026-05-12 20:46:04 -07:00

test_anthropic_third_party_oauth_guard.py

fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 )

2026-04-19 22:43:09 -07:00

test_anthropic_truncation_continuation.py

chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )

2026-05-17 02:29:41 -07:00

test_api_max_retries_config.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_async_httpx_del_neuter.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_background_review_cache_parity.py

chore: trim verbose comments/docstrings, add AUTHOR_MAP entry

2026-05-21 12:49:21 +05:30

test_background_review_summary.py

fix(agent): exclude prior-history tool messages from background review summary

2026-04-24 03:10:19 -07:00

test_background_review_toolset_restriction.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_background_review.py

test: cover ci-unblocker production regressions

2026-05-27 22:14:53 -07:00

test_callable_api_key.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_codex_app_server_integration.py

fix(codex-runtime): retire wedged sessions + post-tool watchdog + OAuth refresh classify (#25769 )

2026-05-14 07:55:09 -07:00

test_codex_multimodal_tool_result.py

feat(vision): vision_analyze returns pixels to vision-capable models, not aux text (#22955 )

2026-05-09 21:06:19 -07:00

test_codex_no_tools_nonetype.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_codex_silent_hang_hint.py

fix(codex): update silent-hang workaround hint

2026-05-27 01:52:34 -07:00

test_codex_xai_oauth_recovery.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_commit_memory_session_context_engine.py

fix(agent): notify context engine on commit_memory_session (#22764 )

2026-05-09 12:28:42 -07:00

test_compress_focus_plugin_fallback.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_compression_boundary_hook.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_compression_boundary.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_compression_feasibility.py

perf(compression): defer feasibility check to first compression attempt (#28957 )

2026-05-19 17:27:17 -07:00

test_compression_persistence.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_compression_trigger_excludes_reasoning.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_compressor_fallback_update.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_concurrent_interrupt.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_context_token_tracking.py

refactor(session-log): delete _save_session_log and all callers

2026-05-20 11:44:10 -07:00

test_copilot_native_vision_headers.py

fix(copilot): mark native image requests as vision

2026-04-27 08:35:50 -07:00

test_create_openai_client_kwargs_isolation.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_create_openai_client_proxy_env.py

test(proxy): regression tests for NO_PROXY bypass on keepalive client

2026-04-24 03:04:42 -07:00

test_create_openai_client_reuse.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_credential_pool_interrupt.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_deepseek_reasoning_content_echo.py

fix(agent): re-pad reasoning_content on cross-provider fallback to require-side providers

2026-05-28 03:21:00 -07:00

test_deepseek_v4_thinking_live.py

fix(deepseek): preserve v4 reasoning_content on replay

2026-04-30 11:18:39 -07:00

test_dict_tool_call_args.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_empty_response_recovery_persistence.py

refactor(session-log): delete _save_session_log and all callers

2026-05-20 11:44:10 -07:00

test_exit_cleanup_interrupt.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_fallback_credential_isolation.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_file_mutation_verifier.py

fix: classify landed file mutations with diagnostics

2026-05-13 06:46:23 -07:00

test_image_rejection_fallback.py

fix(agent): catch ChatGPT-account Codex data-URL rejection so images are stripped instead of cascading to compression (#23602 )

2026-05-11 07:37:22 -07:00

test_image_shrink_recovery.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_init_fallback_on_exhausted_pool.py

fix(agent): try fallback providers at init when primary credential pool is exhausted (#17929 )

2026-05-02 02:09:46 -07:00

test_interactive_interrupt.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_interrupt_propagation.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_invalid_context_length_warning.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_iteration_budget_race.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_jsondecodeerror_retryable.py

fix(agent): classify TypeError('NoneType ... not iterable') as retryable provider shape error

2026-05-27 11:30:55 -07:00

test_last_reasoning_per_turn.py

test: pin per-turn reasoning extraction semantics

2026-05-05 05:00:05 -07:00

test_long_context_tier_429.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_materialize_data_url_cleanup.py

fix(misc): three small defensive fixes from PR #1974

2026-05-10 22:28:01 -07:00

test_memory_nudge_counter_hydration.py

refactor(run_agent): review fixes — keyword-forward __init__, drop dead code, tighten guards

2026-05-16 22:55:49 -07:00

test_memory_provider_init.py

Add Honcho runtime peer mapping

2026-05-27 10:49:33 -07:00

test_memory_sync_interrupted.py

feat: expose completed-turn message context to memory providers

2026-05-29 02:16:43 +05:30

test_message_sequence_repair.py

fix(run_agent): break permanent empty-response loop from orphan tool-tail (#21385 )

2026-05-07 08:35:10 -07:00

test_multimodal_tool_content_recovery.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_openai_client_lifecycle.py

fix(codex): size and propagate timeouts for Responses-API requests; lower stale defaults

2026-05-25 01:47:55 -07:00

test_partial_stream_finish_reason.py

fix(streaming): route mid-tool-call partial-stream-stub through length continuation (#31998 ) (#32012 )

2026-05-25 17:43:10 +05:30

test_percentage_clamp.py

fix: update 6 test files broken by dead code removal

2026-04-10 03:44:43 -07:00

test_plugin_context_engine_init.py

fix: expose context engine tools with saved toolsets

2026-05-28 00:28:42 -07:00

test_primary_runtime_restore.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_provider_attribution_headers.py

remove Vercel AI Gateway and Vercel Sandbox (#33067 )

2026-05-27 00:43:32 -07:00

test_provider_fallback.py

fix(fallback): skip chain entries matching current provider/model/base_url (#22780 )

2026-05-09 12:48:19 -07:00

test_provider_parity.py

test(auth): update nous jwt-only expectations

2026-05-29 02:24:48 -07:00

test_real_interrupt_subagent.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_redirect_stdout_issue.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_repair_tool_call_arguments.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_repair_tool_call_name.py

fix(agent): repair CamelCase + _tool suffix tool-call emissions (#15124 )

2026-04-24 05:32:08 -07:00

test_retry_status_buffer.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_review_prompt_class_first.py

fix(review): tell background reviewer not to capture transient env failures as skills (#23004 )

2026-05-09 22:51:25 -07:00

test_run_agent_codex_responses.py

fix(xai-sanitize): deepcopy tools_for_api before in-place mutation (#27907 )

2026-05-28 23:29:59 -07:00

test_run_agent_multimodal_prologue.py

refactor: unify transport dispatch + collapse normalize shims

2026-04-22 18:34:25 -07:00

test_run_agent.py

refactor(auth): remove vestigial Nous min_key_ttl/inference_auth_mode params

2026-05-29 02:24:48 -07:00

test_sequential_chats_live.py

test: regression guards for the keepalive/transport bug class (#10933 ) (#11266 )

2026-04-16 16:36:33 -07:00

test_session_id_env.py

feat: expose HERMES_SESSION_ID to agent tools via ContextVar + env (#23847 )

2026-05-12 00:16:45 +05:30

test_session_meta_filtering.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_session_reset_fix.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_steer.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_stream_drop_logging.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_stream_interrupt_retry.py

fix: /stop now immediately aborts streaming retry loop

2026-04-25 09:51:39 -07:00

test_streaming_tool_call_repair.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_streaming.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_strict_api_validation.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_strip_reasoning_tags_cli.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_switch_model_context.py

test(ci): stabilize shared optional dependency baselines

2026-05-13 17:32:22 -07:00

test_switch_model_fallback_prune.py

fix(agent): default missing fallback chain on switch

2026-04-24 05:35:43 -07:00

test_switch_model_rollback.py

fix(agent): roll back switch_model() state when client rebuild fails (#33228 )

2026-05-27 05:43:20 -07:00

test_thinking_only_sanitizer.py

fix(agent): drop thinking-only assistant turns before provider call (#16959 )

2026-04-28 03:50:51 -07:00

test_tls_fd_recycle_corruption.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_token_persistence_non_cli.py

fix: make session search initialize session db

2026-05-09 14:36:58 -07:00

test_tool_arg_coercion.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00

test_tool_call_args_sanitizer.py

ci(tests): add pytest-timeout 60s hard cap to break suite-teardown deadlock (#28861 )

2026-05-19 17:27:24 -07:00

test_tool_call_guardrail_runtime.py

test(guardrail): assert halt message reaches stream_delta_callback

2026-05-24 07:38:24 -07:00

test_tool_executor_contextvar_propagation.py

fix(code-exec): propagate agent-turn context into tool worker threads

2026-05-29 03:44:49 -07:00

test_tool_name_db_persistence.py

fix(agent): set tool_name on tool-result messages at construction time

2026-05-19 20:49:11 +01:00

test_unicode_ascii_codec.py

fix: always retry on ASCII codec UnicodeEncodeError — don't gate on per-component sanitization

2026-04-15 15:03:28 -07:00

test_vision_aware_preprocessing.py

chore: prune unused imports and duplicate import redefinitions

2026-05-28 22:26:25 -07:00