Can you make Claude reduce your Claude costs?

I just came back from AWS re:Invent, where multiple companies told me they have had success building their own agents, but each call costs $3-$5 because of ballooning token usage and Claude costs.

The answer to driving costs down is open models: open models to specialize a smaller LLM as the brain of the agentic system, and open models to build smarter tools that keep the context window under control.

But what if you could use Claude to reduce your Claude agent costs? Ben Burtenshaw and Shaun Smith put out this brilliant step-by-step guide on how you can do exactly that, or run any other LLM fine-tuning job, using Claude, Gemini or Codex.

tl;dr
- Introduces Hugging Face Skills
- Documents the hf-llm-trainer HF Skill
- Supports SFT, DPO and GRPO out of the box
- Works in Claude Code, OpenAI Codex and Google Gemini CLI
- Uses serverless GPUs via Hugging Face Jobs (pay by the second)
- Runs in the background, with full monitoring using Trackio (OSS)
- Can convert to GGUF for on-device use cases

Article in comments