Skip to main content

Hermes Agent

Skills Hub

Discover, search, and install from 637 skills across 4 registries

73Built-in
43Optional
521Community
16Categories
๏…น

apple-notes

โœ“ Built-in

Manage Apple Notes via the memo CLI on macOS (create, view, search, edit).

๏ฃฟ macOS
๏…น

apple-reminders

โœ“ Built-in

Manage Apple Reminders via remindctl CLI (list, add, complete, delete).

๏ฃฟ macOS
๏…น

findmy

โœ“ Built-in

Track Apple devices and AirTags via FindMy.app on macOS using AppleScript and screen capture.

๏ฃฟ macOS
๏…น

imessage

โœ“ Built-in

Send and receive iMessages/SMS via the imsg CLI on macOS.

๏ฃฟ macOS
๐Ÿค–

claude-code

โœ“ Built-in

Delegate coding tasks to Claude Code (Anthropic's CLI agent). Use for building features, refactoring, PR reviews, and iterative coding. Requires the claude CLI installed.

๐Ÿค–

codex

โœ“ Built-in

Delegate coding tasks to OpenAI Codex CLI agent. Use for building features, refactoring, PR reviews, and batch issue fixing. Requires the codex CLI and a git repository.

๐Ÿค–

hermes-agent

โœ“ Built-in

Complete guide to using and extending Hermes Agent โ€” CLI usage, setup, configuration, spawning additional agents, gateway platforms, skills, voice, tools, profiles, and a concise contributor reference. Load this skill when helping users configure Hermes, troubleshoot issues, spawn agent instances, or make code contributions.

๐Ÿค–

opencode

โœ“ Built-in

Delegate coding tasks to OpenCode CLI agent for feature implementation, refactoring, PR review, and long-running autonomous sessions. Requires the opencode CLI installed and authenticated.

๐ŸŽจ

ascii-art

โœ“ Built-in

Generate ASCII art using pyfiglet (571 fonts), cowsay, boxes, toilet, image-to-ascii, remote APIs (asciified, ascii.co.uk), and LLM fallback. No API keys required.

๐ŸŽจ

ascii-video

โœ“ Built-in

Production pipeline for ASCII art video โ€” any format. Converts video/audio/images/generative input into colored ASCII character video output (MP4, GIF, image sequence). Covers: video-to-ASCII conversion, audio-reactive music visualizers, generative ASCII art animations, hybrid video+audio reactive, text/lyrics overlays, real-time terminal rendering. Use when users request: ASCII video, text art video, terminal-style video, character art animation, retro text visualization, audio visualizer in ASCII, converting video to ASCII art, matrix-style effects, or any animated ASCII output.

๐ŸŽจ

excalidraw

โœ“ Built-in

Create hand-drawn style diagrams using Excalidraw JSON format. Generate .excalidraw files for architecture diagrams, flowcharts, sequence diagrams, concept maps, and more. Files can be opened at excalidraw.com or uploaded for shareable links.

๐ŸŽจ

songwriting-and-ai-music

โœ“ Built-in

Songwriting craft, AI music generation prompts (Suno focus), parody/adaptation techniques, phonetic tricks, and lessons learned. These are tools and ideas, not rules. Break any of them when the art calls for it.

๐ŸŽฎ

minecraft-modpack-server

โœ“ Built-in

Set up a modded Minecraft server from a CurseForge/Modrinth server pack zip. Covers NeoForge/Forge install, Java version, JVM tuning, firewall, LAN config, backups, and launch scripts.

๐ŸŽฎ

pokemon-player

โœ“ Built-in

Play Pokemon games autonomously via headless emulation. Starts a game server, reads structured game state from RAM, makes strategic decisions, and sends button inputs โ€” all from the terminal.

๐Ÿ’ป

codebase-inspection

โœ“ Built-in

Inspect and analyze codebases using pygount for LOC counting, language breakdown, and code-vs-comment ratios. Use when asked to check lines of code, repo size, language composition, or codebase stats.

๐Ÿ’ป

github-auth

โœ“ Built-in

Set up GitHub authentication for the agent using git (universally available) or the gh CLI. Covers HTTPS tokens, SSH keys, credential helpers, and gh auth โ€” with a detection flow to pick the right method automatically.

๐Ÿ’ป

github-code-review

โœ“ Built-in

Review code changes by analyzing git diffs, leaving inline comments on PRs, and performing thorough pre-push review. Works with gh CLI or falls back to git + GitHub REST API via curl.

๐Ÿ’ป

github-issues

โœ“ Built-in

Create, manage, triage, and close GitHub issues. Search existing issues, add labels, assign people, and link to PRs. Works with gh CLI or falls back to git + GitHub REST API via curl.

๐Ÿ’ป

github-pr-workflow

โœ“ Built-in

Full pull request lifecycle โ€” create branches, commit changes, open PRs, monitor CI status, auto-fix failures, and merge. Works with gh CLI or falls back to git + GitHub REST API via curl.

๐Ÿ’ป

github-repo-management

โœ“ Built-in

Clone, create, fork, configure, and manage GitHub repositories. Manage remotes, secrets, releases, and workflows. Works with gh CLI or falls back to git + GitHub REST API via curl.

๐ŸŽต

gif-search

โœ“ Built-in

Search and download GIFs from Tenor using curl. No dependencies beyond curl and jq. Useful for finding reaction GIFs, creating visual content, and sending GIFs in chat.

๐ŸŽต

heartmula

โœ“ Built-in

Set up and run HeartMuLa, the open-source music generation model family (Suno-like). Generates full songs from lyrics + tags with multilingual support.

๐ŸŽต

songsee

โœ“ Built-in

Generate spectrograms and audio feature visualizations (mel, chroma, MFCC, tempogram, etc.) from audio files via CLI. Useful for audio analysis, music production debugging, and visual documentation.

๐ŸŽต

youtube-content

โœ“ Built-in

Fetch YouTube video transcripts and transform them into structured content (chapters, summaries, threads, blog posts). Use when the user shares a YouTube URL or video link, asks to summarize a video, requests a transcript, or wants to extract and reformat content from any YouTube video.

๐Ÿงช

audiocraft-audio-generation

โœ“ Built-in

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.

๐Ÿงช

axolotl

โœ“ Built-in

Expert guidance for fine-tuning LLMs with Axolotl - YAML configs, 100+ models, LoRA/QLoRA, DPO/KTO/ORPO/GRPO, multimodal support

๐Ÿงช

clip

โœ“ Built-in

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning. Best for general-purpose image understanding.

๐Ÿงช

dspy

โœ“ Built-in

Build complex AI systems with declarative programming, optimize prompts automatically, create modular RAG systems and agents with DSPy - Stanford NLP's framework for systematic LM programming

๐Ÿงช

evaluating-llms-harness

โœ“ Built-in

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking training progress. Industry standard used by EleutherAI, HuggingFace, and major labs. Supports HuggingFace, vLLM, APIs.

๐Ÿงช

fine-tuning-with-trl

โœ“ Built-in

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.

๐Ÿงช

gguf-quantization

โœ“ Built-in

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

๐Ÿงช

grpo-rl-training

โœ“ Built-in

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

๐Ÿงช

guidance

โœ“ Built-in

Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance - Microsoft Research's constrained generation framework

๐Ÿงช

huggingface-hub

โœ“ Built-in

Hugging Face Hub CLI (hf) โ€” search, download, and upload models and datasets, manage repos, query datasets with SQL, deploy inference endpoints, manage Spaces and buckets.

๐Ÿงช

llama-cpp

โœ“ Built-in

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10ร— speedup vs PyTorch on CPU.

๐Ÿงช

modal-serverless-gpu

โœ“ Built-in

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

๐Ÿงช

obliteratus

โœ“ Built-in

Remove refusal behaviors from open-weight LLMs using OBLITERATUS โ€” mechanistic interpretability techniques (diff-in-means, SVD, whitened SVD, LEACE, SAE decomposition, etc.) to excise guardrails while preserving reasoning. 9 CLI methods, 28 analysis modules, 116 model presets across 5 compute tiers, tournament evaluation, and telemetry-driven recommendations. Use when a user wants to uncensor, abliterate, or remove refusal from an LLM.

๐Ÿงช

outlines

โœ“ Built-in

Guarantee valid JSON/XML/code structure during generation, use Pydantic models for type-safe outputs, support local models (Transformers, vLLM), and maximize inference speed with Outlines - dottxt.ai's structured generation library

๐Ÿงช

peft-fine-tuning

โœ“ Built-in

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

๐Ÿงช

pytorch-fsdp

โœ“ Built-in

Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP - parameter sharding, mixed precision, CPU offloading, FSDP2

๐Ÿงช

segment-anything-model

โœ“ Built-in

Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or automatically generate all object masks in an image.

๐Ÿงช

serving-llms-vllm

โœ“ Built-in

Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with limited GPU memory. Supports OpenAI-compatible endpoints, quantization (GPTQ/AWQ/FP8), and tensor parallelism.

๐Ÿงช

stable-diffusion-image-generation

โœ“ Built-in

State-of-the-art text-to-image generation with Stable Diffusion models via HuggingFace Diffusers. Use when generating images from text prompts, performing image-to-image translation, inpainting, or building custom diffusion pipelines.

๐Ÿงช

unsloth

โœ“ Built-in

Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization

๐Ÿงช

weights-and-biases

โœ“ Built-in

Track ML experiments with automatic logging, visualize training in real-time, optimize hyperparameters with sweeps, and manage model registry with W&B - collaborative MLOps platform

๐Ÿงช

whisper

โœ“ Built-in

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

โœ…

google-workspace

โœ“ Built-in

Gmail, Calendar, Drive, Contacts, Sheets, and Docs integration via Python. Uses OAuth2 with automatic token refresh. No external binaries needed โ€” runs entirely with Google's Python client libraries in the Hermes venv.

โœ…

linear

โœ“ Built-in

Manage Linear issues, projects, and teams via the GraphQL API. Create, update, search, and organize issues. Uses API key auth (no OAuth needed). All operations via curl โ€” no dependencies.

โœ…

nano-pdf

โœ“ Built-in

Edit PDFs with natural-language instructions using the nano-pdf CLI. Modify text, fix typos, update titles, and make content changes to specific pages without manual editing.

โœ…

notion

โœ“ Built-in

Notion API for creating and managing pages, databases, and blocks via curl. Search, create, update, and query Notion workspaces directly from the terminal.

โœ…

ocr-and-documents

โœ“ Built-in

Extract text from PDFs and scanned documents. Use web_extract for remote URLs, pymupdf for local text-based PDFs, marker-pdf for OCR/scanned docs. For DOCX use python-docx, for PPTX see the powerpoint skill.

โœ…

powerpoint

โœ“ Built-in

Use this skill any time a .pptx file is involved in any way โ€” as input, output, or both. This includes: creating slide decks, pitch decks, or presentations; reading, parsing, or extracting text from any .pptx file (even if the extracted content will be used elsewhere, like in an email or summary); editing, modifying, or updating existing presentations; combining or splitting slide files; working with templates, layouts, speaker notes, or comments. Trigger whenever the user mentions "deck," "slides," "presentation," or references a .pptx filename, regardless of what they plan to do with the content afterward. If a .pptx file needs to be opened, created, or touched, use this skill.

๐Ÿ”

arxiv

โœ“ Built-in

Search and retrieve academic papers from arXiv using their free REST API. No API key needed. Search by keyword, author, category, or ID. Combine with web_extract or the ocr-and-documents skill to read full paper content.

๐Ÿ”

blogwatcher

โœ“ Built-in

Monitor blogs and RSS/Atom feeds for updates using the blogwatcher CLI. Add blogs, scan for new articles, and track what you've read.

๐Ÿ”

polymarket

โœ“ Built-in

Query Polymarket prediction market data โ€” search markets, get prices, orderbooks, and price history. Read-only via public REST APIs, no API key needed.

๐Ÿ”

research-paper-writing

โœ“ Built-in

End-to-end pipeline for writing ML/AI research papers โ€” from experiment design through analysis, drafting, revision, and submission. Covers NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Integrates automated experiment monitoring, statistical analysis, iterative writing, and citation verification.

๐Ÿง Linux๏ฃฟ macOS
๐Ÿ“ฑ

xitter

โœ“ Built-in

Interact with X/Twitter via the x-cli terminal client using official X API credentials. Use for posting, reading timelines, searching tweets, liking, retweeting, bookmarks, mentions, and user lookups.

๐Ÿง Linux๏ฃฟ macOS
๐Ÿ’ป

plan

โœ“ Built-in

Plan mode for Hermes โ€” inspect context, write a markdown plan into the active workspace's `.hermes/plans/` directory, and do not execute the work.

๐Ÿ’ป

requesting-code-review

โœ“ Built-in

Pre-commit verification pipeline โ€” static security scan, baseline-aware quality gates, independent reviewer subagent, and auto-fix loop. Use after code changes and before committing, pushing, or opening a PR.

๐Ÿ’ป

subagent-driven-development

โœ“ Built-in

Use when executing implementation plans with independent tasks. Dispatches fresh delegate_task per task with two-stage review (spec compliance then code quality).