hermes-agent

Author	SHA1	Message	Date
teknium1	97ecfa0fc4	fix(session): extend no-FTS5 degradation to the trigram CJK index The salvaged contributor commit guarded only messages_fts. Current main also creates a second virtual table, messages_fts_trigram (CJK substring search), whose CREATE VIRTUAL TABLE ... USING fts5 still raised "no such module: fts5" on builds without FTS5 — re-crashing SessionDB init. Wrap the trigram setup with the same guard, and broaden the test's no-fts5 mock to fail BOTH tables so the regression test actually exercises a faithful no-FTS5 build.	2026-05-29 20:11:07 -07:00
LeonSGP43	5ad2b4c6da	fix(session): degrade gracefully when SQLite lacks FTS5	2026-05-29 20:11:07 -07:00
Teknium	860cf28dab	docs: clarify compression threshold is derived from the main model's context window (#35099 ) The compression threshold is threshold × context_length where context_length is the MAIN agent model's window, not the auxiliary/summary model's. On a 262,144-token model at the default 0.50 the threshold is 131,072 — close to a common 128K figure by coincidence of the percentage, which has led to confusion that the auxiliary model's context limit is the trigger. Add a note preempting that misreading and pointing to the separate summary-model-context constraint.	2026-05-29 19:59:04 -07:00
teknium1	fb0ab27649	fix(agent): register explainer config key + shorten footer prefix Follow-up to the salvaged #34452 turn-completion explainer: - Register display.turn_completion_explainer: True in DEFAULT_CONFIG so the setting is discoverable, matching the file_mutation_verifier precedent. - Shorten the repeated footer prefix from 'Turn ended without a usable reply: ' to 'No reply: ' so the 10 reason variants don't all open with the same 8-word boilerplate. - Update the 7 assertions that referenced the old prefix.	2026-05-29 19:23:05 -07:00
Bartok9	de6d6023d7	test(run_agent): align test_dict_tool_call_args with explainer suffix PR #34470 adds an explainer suffix to abnormal turn endings (e.g. max_iterations_reached) so users see why the response is short instead of receiving a bare/blank reply. test_tool_call_validation_accepts_dict_arguments runs the agent at max_iterations=3 which hits the explainer path; the existing strict-equality assertion (== "done") no longer matches once the suffix is appended. Switch the assertion to .startswith("done") so the test continues to verify that the models actual text survives intact while leaving the explainer suffix wording owned by conversation_loop (where it belongs). Test now passes (1 passed in 0.88s).	2026-05-29 19:23:05 -07:00
Bartok9	59b0ea98c8	fix(agent): explain abnormal turn endings instead of blank/partial reply When a turn ends abnormally after substantive tool calls (empty content after retries, a partial/truncated stream, exhausted retries, or an iteration/budget limit), the CLI/TUI response area was left blank or showed only a fragment (e.g. "The") with no consolidated reason. The internal turn_exit_reason values (empty_response_exhausted, partial_stream_recovery, etc.) were never surfaced to the user. Add a turn-completion explainer that mirrors the existing file-mutation verifier footer: at turn end, map an abnormal turn_exit_reason to a short, actionable message and either replace the bare "(empty)" sentinel or append the reason after a partial fragment. Normal text_response exits (e.g. a terse "Done.") stay quiet. Gated by display.turn_completion_explainer (default on) with HERMES_TURN_COMPLETION_EXPLAINER env override, matching the file-mutation verifier seam. Closes #34452	2026-05-29 19:23:05 -07:00
Teknium	897f9533ed	fix: keep CLI context display in sync with preflight token estimate (#35079 ) * Inspired by Claude Code: /compress here [N] — boundary-aware 'summarize up to here' Adds a user-chosen compression boundary to the existing /compress command. /compress here [N] summarizes everything except the most recent N exchanges (default 2), which are preserved verbatim — letting the user pick the compression boundary instead of relying on the automatic token-budget heuristic. Inspired by Claude Code's Rewind 'Summarize up to here' action (v2.1.139, Week 20, May 2026): https://code.claude.com/docs/en/whats-new/2026-w20 - hermes_cli/partial_compress.py: pure split/parse helpers + seam-alternation guard (shared by CLI and gateway). - cli.py / gateway/run.py: route 'here [N]' / '--keep N' to partial compression; compress only the head, re-append the verbatim tail through the seam guard. - Preserves message-flow role alternation (seam guard merges any illegal user->user / assistant->assistant adjacency). - Reuses the existing _compress_context session-rotation/lock machinery — no changes to the compression core. - Bare /compress (full) and /compress <focus> behavior unchanged. Tests: 12 helper unit tests + 5 CLI integration tests + E2E (interleaved tool-call transcript, degenerate/multimodal seams, real handler path). * fix: keep CLI context display in sync with preflight token estimate The status bar reads compressor.last_prompt_tokens, which only updates from a successful API response. When loaded history is oversized but compression no-ops (e.g. the auxiliary summary model times out), no fresh usage arrives and the bar stays frozen at the old, smaller value while the preflight estimate reports a much larger number — looking permanently out of sync (reported: 74.4K display vs ~144,669 preflight). Seed last_prompt_tokens with the fresh preflight estimate (upward-only, so a real usage figure is never clobbered and a successful compression's downward correction still wins). Display-only; no behavioral change to compression, caching, or the agent loop.	2026-05-29 19:21:15 -07:00
teknium	9d4c81130a	fix(gateway): name what the /status token number actually is Sharpen the label from 'Session usage (cumulative)' to 'Cumulative API tokens (re-sent each call)'. The number is real provider-reported usage summed across every API call in the session — not context size. In an agentic loop the same context is re-sent each iteration, so a one-hour tool-heavy session legitimately reaches tens of millions of tokens. The new label explains the magnitude so users stop reading it as a bug or as a total across all sessions.	2026-05-29 19:14:37 -07:00
helix4u	2259c15e4d	fix(gateway): clarify status session usage label	2026-05-29 19:14:37 -07:00
Bartok9	45bc65abbe	fix(gateway): drop outbound silence-narration messages pre-send Hallucinated 'silence' tokens ((silent), _silent_, the bare '.', '...', 'silent', no response/reply, the mute emoji) are emitted when a persona has nothing actionable to say. In bot-to-bot channels the receiving bot mirrors the token back, creating a tight loop that burns API tokens and can crash a model with 'no content after all retries'. SOUL.md/prompt rules drift across providers and have already failed in practice, so add a substrate-level guard. _deliver_to_platform now drops a message whose finalized content is only a silence-narration token, logs a WARNING with platform/chat_id/truncated content, and returns {success: True, filtered: 'silence_narration', delivered: False} instead of calling the adapter. Single chokepoint covers every platform adapter; the regex is anchored start/end with a 64-char guard so prose like 'Silence is golden — here is the plan...' or 'Silent install completed' is never dropped. Local/file delivery is a separate path and is left untouched. Opt out via gateway.filter_silence_narration: false or the HERMES_FILTER_SILENCE_NARRATION env override (env wins when set). Closes #34616	2026-05-29 19:06:05 -07:00
teknium1	9dbc3722ae	test(compression): fix StopIteration in large-rough-growth preflight test The rough-estimate mock supplied only 2 side_effect values but the conversation loop calls estimate_request_tokens_rough a third time for the post-response real-token estimate, exhausting the iterator. Use a callable side_effect that returns 125k once (to fire preflight) then sub-threshold values, independent of call count.	2026-05-29 19:05:03 -07:00
helix4u	e38b0b55d1	fix(compression): avoid repeat preflight compaction from rough estimates	2026-05-29 19:05:03 -07:00
Teknium	04de307d62	fix(cli): repaint input area after inline /steer and /model submit (#34839 ) handle_enter dispatches /steer and /model inline on the UI thread while the agent is running, calling buffer.reset() then returning. Unlike every other early-return branch in the handler, these two skipped event.app.invalidate(). process_command() prints through patch_stdout (scrolls output above the prompt without redrawing the input line), so the just-cleared input area could keep showing the submitted '/steer <text>' until an unrelated redraw fired — looking unsent and inviting an accidental re-submit. Add event.app.invalidate() after reset in both inline branches to match the sibling branches. AST regression test pins the invariant: every reset-then-return branch in handle_enter must invalidate first. Fixes #34569	2026-05-29 19:04:40 -07:00
Teknium	bcc8301000	Inspired by Claude Code: /compress here [N] — boundary-aware 'summarize up to here' (#35048 ) Adds a user-chosen compression boundary to the existing /compress command. /compress here [N] summarizes everything except the most recent N exchanges (default 2), which are preserved verbatim — letting the user pick the compression boundary instead of relying on the automatic token-budget heuristic. Inspired by Claude Code's Rewind 'Summarize up to here' action (v2.1.139, Week 20, May 2026): https://code.claude.com/docs/en/whats-new/2026-w20 - hermes_cli/partial_compress.py: pure split/parse helpers + seam-alternation guard (shared by CLI and gateway). - cli.py / gateway/run.py: route 'here [N]' / '--keep N' to partial compression; compress only the head, re-append the verbatim tail through the seam guard. - Preserves message-flow role alternation (seam guard merges any illegal user->user / assistant->assistant adjacency). - Reuses the existing _compress_context session-rotation/lock machinery — no changes to the compression core. - Bare /compress (full) and /compress <focus> behavior unchanged. Tests: 12 helper unit tests + 5 CLI integration tests + E2E (interleaved tool-call transcript, degenerate/multimodal seams, real handler path).	2026-05-29 17:49:15 -07:00
Bartok9	54aa4db1de	fix(cli): remove Hermes-managed node/npm/npx symlinks on uninstall The POSIX installer drops node/npm/npx symlinks in ~/.local/bin pointing into $HERMES_HOME/node and prepends ~/.local/bin to PATH, shadowing an existing nvm. Uninstall removed the hermes wrapper but left these behind, so the user's default node/npm/npx stayed redirected after uninstall. Add remove_node_symlinks() and call it from run_uninstall. It removes ~/.local/bin/{node,npm,npx} only when each is a symlink resolving into the current Hermes home's node dir, so a link the user repointed at nvm or a real binary is never touched. Handles dangling links too. Closes #34536	2026-05-29 17:24:38 -07:00
Teknium	2062a84000	fix(auxiliary): stop capping output with max_tokens by default (#34530 ) (#34845 ) * fix(auxiliary): stop capping output with max_tokens by default Auxiliary LLM calls (compression, titles, vision, etc.) no longer send max_tokens on the OpenAI-compatible chat-completions path. Most providers treat an omitted max_tokens as "use the model max", which is what we want; an explicit cap only risks truncation or a wire-format 400. This was surfaced by GitHub Copilot / GPT-5 (#34530): those models reject max_tokens and require max_completion_tokens, so compression 400'd and fell back to a static context marker. Omitting the param sidesteps that quirk (and ZAI vision's error 1210) entirely. The Anthropic Messages wire (MiniMax + /anthropic endpoints) keeps max_tokens because it is a mandatory field there. * test(auxiliary): update temperature-retry assertions for omitted max_tokens The temperature-retry tests asserted retry_kwargs["max_tokens"] == 500 on an api.openai.com endpoint. Now that auxiliary calls omit max_tokens on OpenAI-compatible endpoints (#34530), that key is absent. Assert it's absent in both first and retry kwargs and use model as the survives-the-retry witness.	2026-05-29 17:24:30 -07:00
Teknium	f9daa4a41d	fix(deps): declare setuptools in dev extra for packaging tests (#34851 ) * fix(deps): declare setuptools in dev extra for packaging tests tests/test_packaging_metadata.py imports `from setuptools import find_packages` at module scope to validate package discovery against the live tree. setuptools was being picked up ambiently from the CI runner image, but recent ubuntu-latest images no longer ship it in the test venv, so collection fails with ModuleNotFoundError on every PR. Declare setuptools==82.0.1 in the dev optional-dependencies so `.[all,dev]` installs it explicitly rather than relying on the runner environment. * test(packaging): skip packaging-metadata tests when setuptools absent Belt-and-suspenders alongside declaring setuptools in [dev]: guard the module-level `from setuptools import find_packages` with pytest.importorskip so a runner missing setuptools SKIPS these checks instead of erroring out collection for the entire test shard. * chore(deps): sync uv.lock for setuptools dev dependency	2026-05-29 17:24:23 -07:00
Teknium	689ef5e233	feat(cli): warn on unsupported pip installs + fix stale update-check cache (#34491 ) (#34846 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. feat(cli): warn on unsupported pip installs + fix stale update-check cache after pip upgrade Banner now shows a yellow warning when detect_install_method() == 'pip': 'pip install hermes-agent' isn't the supported install path (it exists on PyPI for internal/CI reasons), so updates and issue support don't behave correctly. Reuses existing install-method detection; warn, never block. Also fixes #34491: check_for_updates() keyed its 6h cache only on ts+rev. On the pip path (no HERMES_REVISION), rev is always None, so a 'pip install --upgrade' changed VERSION but left the cache valid — the stale 'N commits behind' count survived the upgrade. Cache now also keys on the installed VERSION and invalidates on mismatch.	2026-05-29 13:30:28 -07:00
teknium1	bb50825716	chore(release): map annguyenNous to AUTHOR_MAP Clears the check-attribution CI gate on PR #34468 — the contributor's noreply email was unmapped.	2026-05-29 13:29:34 -07:00
annguyenNous	9f5afc7636	fix(mcp): widen isinstance check to BaseException for CancelledError asyncio.gather(return_exceptions=True) captures CancelledError as a BaseException value. The previous isinstance(result, Exception) check missed CancelledError, silently dropping it without logging. Since Python 3.9, CancelledError is a BaseException subclass (not Exception). This one-line change ensures all failure types from MCP server connections are properly logged. Fixes NousResearch/hermes-agent#34443	2026-05-29 13:29:34 -07:00
teknium1	4fd8521e44	test(tui-gateway): isolate completion_queue in poller requeue test test_notification_poller_requeues_when_busy drained and reused the process-global process_registry.completion_queue, so a concurrent test in the same xdist worker could put/get on the shared singleton mid-run and empty the event the poller requeues — flaking 'assert not completion_queue.empty()' under parallel CI load only. Monkeypatch a fresh Queue onto the singleton for the test's duration so nothing external can interleave. The poller reads completion_queue by attribute at runtime, so the isolated queue is what it operates on. monkeypatch restores the original on teardown. Verified immune: 50/50 passes under a background thread hammering the global queue.	2026-05-29 13:29:24 -07:00
Bartok9	edfdc77664	fix(cli): resume the selected chat when a bare number follows /resume A bare `/resume` printed the recent-sessions list but armed no selection state, so typing just `3` on the next line was sent to the agent as chat instead of resuming session #3. `/resume 3` worked, but the natural list-then-pick flow did not. Arm a one-shot pending-resume prompt when bare `/resume` shows the list, and consume the next bare numeric input as the selection (out-of-range is reported, non-numeric/other commands disarm it). Resolves against the same _list_recent_sessions(limit=10) list used everywhere else. Closes #34584.	2026-05-29 13:29:24 -07:00
Teknium	3a2c03061c	fix(stt,tts): restore mistralai — 2.4.8 is clean, ban lifted (#34841 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(stt,tts): restore mistralai — 2.4.8 is clean, ban lifted PyPI quarantined mistralai on 2026-05-12 after the malicious 2.4.6 release (Mini Shai-Hulud worm). 2.4.6 has since been removed from the registry and clean releases resumed (2.4.7 2026-05-25, 2.4.8 2026-05-28). This rolls back the blanket runtime ban so Voxtral STT + TTS work again, following the restoration checklist the repo left in pyproject.toml. Verified against the real SDK: 2.4.8 keeps the import path the code uses (from mistralai.client import Mistral) and the audio.transcriptions.complete / audio.speech.complete surfaces. Changes: - pyproject.toml: re-add mistral extra pinned to mistralai==2.4.8; left OUT of [all] per the 2026-05-12 lazy-install policy (one quarantined release must not break fresh installs). uv.lock regenerated. - tools/lazy_deps.py: add stt.mistral / tts.mistral entries so the SDK lazy-installs on first use (matches edge / elevenlabs). - tools/transcription_tools.py: restore explicit-provider gate (_HAS_MISTRAL + key) and auto-detect entry (local>groq>openai>mistral>xai); _transcribe_mistral lazy-installs before import. - tools/tts_tool.py: dispatcher routes back to _generate_mistral_tts; _import_mistral_client lazy-installs the SDK. - hermes_cli/tools_config.py, hermes_cli/web_server.py: un-hide Mistral from the TTS provider picker and dashboard STT options. - hermes_cli/security_advisories.py: KEEP the shai-hulud-2026-05 advisory (module policy forbids removal) — it is scoped to 2.4.6 only, so it still warns anyone with the poisoned build cached and never fires on 2.4.8. Summary note updated to reflect the un-quarantine. - tests: revert the disabled-behavior assertions added by the ban commit back to routing/positive expectations; add mistral to the lazy-installable-extras-excluded-from-[all] contract. Reported by @SkYNewZ (#34503). Validation: 189 targeted STT/TTS/lazy_deps/metadata tests pass; E2E with the real mistralai 2.4.8 SDK routes both STT and TTS to mistral.	2026-05-29 13:24:12 -07:00
Teknium	781604ce4c	fix(gateway): unify MEDIA: extraction extension set + close the unknown-ext black hole (#34517 ) (#34844 ) MEDIA:<path> tags for .md/.json/.yaml/.xml/.html and other document extensions were silently dropped. extract_media() carried a narrow extension allowlist that omitted them, while extract_local_files() had a broad one. The dispatch sites then ran an unconditional re.sub(r'MEDIA:\\s*\\S+', '') that stripped the tag from the body even when extract_media had not matched it — so extract_local_files (broad list) ran on text where the path was already gone, and the file was delivered by neither path. - Add MEDIA_DELIVERY_EXTS in gateway/platforms/base.py as the single source of truth; extract_media and extract_local_files both derive their extension set from it (no more drift). - Replace the loose MEDIA cleanup at the non-streaming dispatch site (base.py) and the streaming consumer (stream_consumer.py) with the shared, extension-anchored MEDIA_TAG_CLEANUP_RE. A MEDIA: tag with an unknown extension is left in the body so the bare-path detector can still pick it up instead of being black-holed. - Chain cleaned text through extract_media -> extract_images -> extract_local_files in run.py's post-stream media delivery (it was dropping the cleaned text and rescanning raw text with MEDIA: tags). - Regression tests covering both halves: previously-dropped extensions now extract, and unknown-ext paths survive the cleanup. Consolidates the MEDIA extension-allowlist PR cluster. Co-authored-by: Bartok9 <259807879+Bartok9@users.noreply.github.com> Co-authored-by: banditburai <123342691+banditburai@users.noreply.github.com> Co-authored-by: Kyzcreig <9063726+Kyzcreig@users.noreply.github.com>	2026-05-29 13:24:01 -07:00
teknium1	0dc0c5ea6b	chore: add AUTHOR_MAP entry for sweetcornna Maps the cherry-picked commit's noreply email to the GitHub login so the release attribution / CI author check passes.	2026-05-29 13:22:54 -07:00
Bartok9	3845d86b93	fix(cron): restore jobs.json emptied by config migration on update Config-version migrations have been observed to leave cron/jobs.json valid-but-empty after `hermes update`, silently dropping every scheduled job (#34600). The existing malformed-shape guards in cron/jobs.py don't catch this because {"jobs": []} is valid JSON. Add restore_cron_jobs_if_emptied() as a post-migration safety net: if the live cron/jobs.json now has zero jobs while the pre-update snapshot held one or more, restore the snapshot copy in place and warn loudly. The check is conservative — it only restores on unambiguous evidence of loss (snapshot had jobs, live file readable-and-empty), so a user who genuinely cleared their jobs is never second-guessed and an unreadable live file is left untouched so real corruption still surfaces. Wired into _cmd_update_impl after migrate_config(), reusing the existing pre-update quick snapshot (which already captures cron/jobs.json). Closes #34600	2026-05-29 13:22:54 -07:00
Cornna	d473e7c938	fix(cron): exclude jobs.json registry from disk-cleanup pattern Closes #32164	2026-05-29 13:22:54 -07:00
Teknium	91b174038c	fix(feishu): bound _chat_locks with LRU eviction (#34836 ) The Feishu adapter stored one asyncio.Lock per chat_id in a plain dict with no upper bound, so a long-running gateway that saw many distinct chats grew _chat_locks without limit. Port the LRU-eviction pattern already used by the yuanbao adapter: OrderedDict + move_to_end on access, CHAT_LOCK_MAX_SIZE cap (1000), and eviction that skips currently-held locks (falling back to dropping the LRU entry only if all are held).	2026-05-29 13:18:15 -07:00
teknium1	8055d0f092	test(ntfy): cover echo-tag filter; tag standalone send path Adds tests for the echo-loop fix (outgoing X-Tags header, inbound skip on tagged events, genuine tags pass through) and extends the tag to the out-of-process _standalone_send() path so cron / send_message deliveries to a self-subscribed topic are also skipped. Maps both contributors in release.py AUTHOR_MAP. Co-authored-by: liuhao1024 <sunsky.lau@gmail.com>	2026-05-29 13:17:46 -07:00
annguyenNous	9405cdc8dd	fix(ntfy): prevent echo loop by tagging outgoing messages When publish_topic equals the subscribe topic, the agent's own replies are echoed back by ntfy as incoming messages, creating an infinite reply spiral. Fix: tag outgoing messages with X-Tags: hermes-agent header, and skip incoming messages that carry this tag. This is zero-config — works automatically regardless of topic configuration. Fixes NousResearch/hermes-agent#34447	2026-05-29 13:17:46 -07:00
Bartok9	08c0b22417	fix(gateway): scope tool-result MEDIA scan to current turn The post-run scan that appends tool-emitted MEDIA: tags to the final response iterated every tool/function message in the full conversation and relied solely on path-based dedup against paths reconstructed from the replayable transcript. When that reconstruction does not byte-match the in-memory tool content (timestamp stripping, observed-context withholding, compression rewrites), a stale path emitted several turns earlier is absent from the dedup set and leaks onto a later text-only reply (Telegram 'Sending media group of 1 photo(s)' with no MEDIA directive present). Scope the scan to this turn's new messages by slicing result['messages'] at len(agent_history) (agent_history is passed as conversation_history into run_conversation, so the returned list is history + this turn). Retain path-based dedup as a secondary guard and as the sole guard on the compression-shrink fallback, preserving the #160 behaviour. Closes #34608	2026-05-29 13:13:34 -07:00
teknium1	38c4f8c371	test(gateway): update system-unit cwd assertion to HERMES_HOME anchor test_system_unit_has_no_root_paths asserted the system unit's WorkingDirectory was the remapped checkout path (/home/alice/.hermes/hermes-agent). That is the brittle pin this PR fixes — the system unit now anchors cwd at the target user's HERMES_HOME (/home/alice/.hermes). The test's intent (no root-home leak, target-user paths present) is unchanged and still holds.	2026-05-29 12:36:59 -07:00
teknium1	a1cb5fa2c7	fix(gateway): anchor service WorkingDirectory at HERMES_HOME, not the source checkout The systemd unit (and launchd plist) pinned WorkingDirectory to PROJECT_ROOT (the checkout the unit was generated from). When that checkout is transient — a git worktree, or a clone hermes update later relocates/removes — the path rots. systemd then fails the start at the CHDIR step (status=200/CHDIR) BEFORE Python loads, so the on-boot refresh_systemd_unit_if_needed() self-heal never runs and Restart=always crash-loops forever on a dead directory. Observed in the wild: a gateway that crash-looped 153 times overnight, bot offline until a manual 'hermes gateway restart' regenerated the unit. Anchor cwd at HERMES_HOME instead — it never moves, always exists, and the gateway never needed cwd to be the checkout (ExecStart uses an absolute python + -m hermes_cli.main). Existing broken units now differ from the generated unit and self-heal on the next start/restart/update.	2026-05-29 12:36:59 -07:00
Teknium	45b00bb49a	fix(packaging): ship hermes_cli subpackages in wheel (#34811 ) [tool.setuptools.packages.find] listed 'hermes_cli' without the 'hermes_cli.' wildcard, so the wheel shipped hermes_cli/.py but dropped the dashboard_auth and proxy subpackages. The dashboard died on every install with ModuleNotFoundError: No module named 'hermes_cli.dashboard_auth' (#34701); 'hermes proxy' was equally broken. Add the wildcard, and add a regression test that drives setuptools' own find_packages against the live tree so any future subpackage dropped from the include list fails CI instead of a user's container.	2026-05-29 12:36:09 -07:00
teknium1	8836b3a113	fix(cli): widen Windows .bat wrapper fix to custom-name alias path The profile alias --name path in main.py rewrote the wrapper with a hardcoded #!/bin/sh script right after create_wrapper_script(), clobbering the .bat on Windows and reintroducing the exact bug for custom aliases. create_wrapper_script() now takes an optional target so the alias file is named after the alias while the -p content references the profile — one platform-aware code path, no post-hoc rewrite.	2026-05-29 12:32:47 -07:00
liuhao1024	6312dd8c3a	fix(cli): create .bat wrapper on Windows instead of POSIX shell script On Windows, hermes profile create produced a #!/bin/sh script that the shell cannot execute. Now creates a .bat file with @echo off + %* on Windows, and keeps the POSIX shell script on macOS/Linux. Also fixes check_alias_collision to use 'where' instead of 'which' on Windows, and remove_wrapper_script to find .bat files. Fixes #34708	2026-05-29 12:32:47 -07:00
zapabob	30a0d5bc9e	chore(release): map zapabob author email	2026-05-29 12:32:35 -07:00
zapabob	aa283d1e4f	fix(model): isolate custom provider picker credentials	2026-05-29 12:32:35 -07:00
Teknium	2fc2280e63	fix(cli): clarify panel clips choices off-screen on short terminals (#34808 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(cli): clarify panel clips choices off-screen on short terminals The clarify multiple-choice panel is a height-less Window inside a non-full-screen HSplit. When its content exceeds the viewport, prompt_toolkit distributes height per child and clips the panel's tail — where the choices live — so options render invisible/cut off (issue #34645, reported on macOS Terminal.app). Two budget-accounting bugs let the panel overflow: - the compact-chrome decision ignored the question rows, so full chrome (3 blank separators) was kept even with no room - the '… (question truncated)' marker was not counted against the question's row budget, overshooting by one row at a 1-row budget Fix: reserve one question row in the compact decision, count the truncation marker against the budget, and drop the question entirely when the choices alone already exceed the viewport (choices are the must-see content for a selection).	2026-05-29 12:32:31 -07:00
Teknium	27a2c4f36f	fix(mcp): stop reporting false OAuth success when no token was obtained (#34807 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(mcp): stop reporting false OAuth success when no token was obtained `hermes mcp login` reported "Authenticated — N tool(s) available" for servers that serve tools/list without auth (e.g. Google's official Drive MCP server) even when the OAuth flow never completed — dynamic client registration 400'd because the provider doesn't support RFC 7591, so no token was ever acquired. Every real tool call then hung until timeout with no indication of why. Login now verifies a token actually landed on disk after the probe. When it didn't, it warns that authentication didn't complete and shows the config needed to supply a pre-registered client_id/client_secret (the existing, already-supported workaround for DCR-less providers). Adds a docs pitfall for Google Drive / Atlassian-style providers. Fixes #34775	2026-05-29 12:32:19 -07:00
Teknium	1cb850b674	fix(api_server): emit per-turn transcript on run.completed (#34703 ) (#34804 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(api_server): emit per-turn transcript on run.completed (#34703) WebUI clients lost intermediate (pre-tool-call) assistant text after switching session pages mid-stream. The session-chat SSE stream delivers all assistant text as assistant.delta events under one message_id interleaved with tool.* events, then a single assistant.completed carrying only the final reply — so a client accumulating deltas into one buffer cannot reconstruct intermediate text segments that preceded tool calls, and they vanish from the live view (state.db persists them correctly). run.completed now carries the authoritative per-turn transcript (assistant + tool messages for this turn, in client-safe shape) so any SSE consumer can reconcile its live view against ground truth without a separate GET /messages round-trip. Purely additive — clients that ignore the field are unaffected.	2026-05-29 12:27:49 -07:00
Teknium	b6ed3913d2	feat(skills): categorize tap skills from skills.sh.json grouping sidecar A GitHub tap can ship a repo-root skills.sh.json (the published skills.sh schema) declaring category groupings. The Skills Hub now reads it at index time and uses each grouping title as the skill's category label, instead of the tag-derived guess. Generic: any tap that ships the file gets real categorization — NVIDIA's groupings (Inference AI, Decision Optimization, GPU Development, etc.) flow through automatically. - GitHubSource: _get_skillsh_groupings() fetches+caches the sidecar per repo; _parse_skillsh_groupings() flattens it to {skill_name: title}; _list_skills_in_repo() stamps meta.extra['category']; _meta_to_dict now serializes extra so the category survives the index cache round-trip. - extract-skills.py: prefers extra['category'] over the tag heuristic and exempts sidecar categories from the small-category to Other collapse. - Docs + 12 tests.	2026-05-29 12:24:39 -07:00
Teknium	4de8009ce4	feat(skills): integrate NVIDIA/skills as a trusted skills hub tap NVIDIA/skills is now a default trusted tap in the Hermes Skills Hub — discoverable, browsable, searchable, and auto-updating through the same pipeline that already serves OpenAI, Anthropic, and HuggingFace skills. Rebased onto current main.	2026-05-29 12:24:39 -07:00
Teknium	1596bb287e	fix(dashboard): chat tab works in gated (OAuth) mode (#34793 ) The Chat/TUI dashboard tab showed a false "Session token unavailable" error and never rendered the terminal whenever the dashboard ran in gated mode (OAuth auth gate active, --insecure not set), even though the user was fully authenticated and every other tab worked. Two checks in ChatPage.tsx gated purely on window.__HERMES_SESSION_TOKEN__, which the server intentionally omits in gated mode (web_server.py only injects __HERMES_AUTH_REQUIRED__=true there; the SPA is expected to use cookie auth + a single-use WS ticket). buildWsAuthParam() already resolves WS auth correctly for both modes, but the early bail prevented the effect from ever reaching it. Both checks now also honor __HERMES_AUTH_REQUIRED__: the banner no longer fires and the xterm/WS effect no longer bails in gated mode. Reported-by: wbrione <wbrione@users.noreply.github.com> Closes #34755	2026-05-29 12:19:51 -07:00
Teknium	90b3c54de9	fix: drain thread no longer crashes on fd-less stdout streams (#34789 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix: drain thread no longer crashes on fd-less stdout streams The _wait_for_process drain thread called proc.stdout.fileno() unconditionally. ProcessHandle implementations whose stdout is not backed by a real OS fd (iterator-style in-memory streams, mock procs) raised 'list_iterator' object has no attribute 'fileno' (or 'fileno() returned a non-integer' from select.select), killing the daemon thread and silently losing all process output. Resolve the fd defensively at the top of _drain; when stdout has no usable integer fileno, fall back to draining it as an iterable (the legacy 'for line in proc.stdout' contract). The real subprocess / os.pipe-backed select() fast path is unchanged.	2026-05-29 12:16:57 -07:00
teknium1	5641ae6469	chore(release): add AUTHOR_MAP entries for Bucket-1 docs salvage contributors	2026-05-29 12:06:22 -07:00
Twanislas	549a69a925	docs(curator): align 'agent-created' definition with actual provenance semantics The curator docs stated that any skill not bundled/hub-installed was 'agent-created' and subject to curation — including foreground-created skills and hand-written ones. Since PR #19621 (May 2026), the curator requires an explicit marker in .usage.json, which only the background self-improvement review fork sets. Changes: - Rewrite 'What agent-created means' to document the 3-step eligibility check (not bundled + not hub + created_by=agent marker) - Explain that foreground skill_manage(create) does NOT mark skills as agent-created (user-directed by design) - Warn that hand-written skills are NOT curated - Add note in Per-run reports explaining the '(not resolved)' display when no candidates exist (LLM pass skipped, not a config error) - Link to skill_provenance.py for the write-origin ContextVar Ref: PR #19621, tools/skill_provenance.py, tools/skill_manager_tool.py	2026-05-29 12:06:22 -07:00
Aman113114-IITD	3f0d44af8a	docs: replace invalid 'hermes config get <key>' with 'hermes config show' 'hermes config get <key>' is referenced in three guides but is not a valid subcommand. The valid subcommands under 'hermes config' are {show,edit,set,path,env-path,check,migrate}. 'hermes config show' is already used elsewhere in the docs (including 'hermes config show \| grep <pattern>' in the FAQ), so it's the idiomatic replacement. - work-with-skills.md: 'View all skill config' now uses 'hermes config show \| grep ^skills\.config' - migrate-from-openclaw.md: session-policy check now reads the value from 'hermes config show' - configuring-models.md: 'inspect what the CLI will actually use' now uses 'hermes config show \| grep ^model\.' Refs #30195	2026-05-29 12:06:22 -07:00
HKPA	eff4626747	fix(docs): add baseUrl prefix to SVG image paths in sessions and CLI pages Fixes #24809 The docs site uses baseUrl='/docs/' but the <img> tags in sessions.md and cli.md referenced images at /img/docs/... which resolves to a 404. The static files are served at /docs/img/docs/... instead. Before: <img src="/img/docs/session-recap.svg"> → 404 After: <img src="/docs/img/docs/session-recap.svg"> → 200 Also fixes cli-layout.svg which had the same issue.	2026-05-29 12:06:22 -07:00
aqilaziz	175885218e	fix(docs): align fallback provider config examples Use the current top-level fallback_providers list in fallback docs and keep fallback_model documented only as the legacy compatibility shape. Also align cron and delegation fallback coverage with current runtime behavior. Closes #19691 Co-authored-by: Codex <codex@openai.com>	2026-05-29 12:06:22 -07:00

1 2 3 4 5 ...

10008 Commits