You can choose which cookie categories to allow. Strictly necessary cookies cannot be disabled.
Required for the site to work, including consent preferences and security. They do not require consent.
Measures traffic and user behavior in aggregate form. Provider: Google LLC (USA).
Session recordings, heatmaps, and click analysis to improve UX. Provider: Microsoft Corp. (USA).
Systematic evaluation of RAG systems across retrieval and generation.
This skill is community-contributed. Review before installing in sensitive environments.
Provides a systematic approach to evaluating Retrieval-Augmented Generation systems by conducting separate analyses of retrieval and generation components. Guides practitioners through error analysis, building evaluation datasets, and measuring Recall@k metrics.
evaluate-rag is ideal for developers, teams, and agents who need to systematic evaluation of rag systems across retrieval and generation.. Whether you're automating workflows, improving code quality, or extending functionality, this claude skill integrates directly into Claude Code.
Getting started with evaluate-rag is straightforward. Follow the steps below to install this claude skill into your Claude Code environment and start using it immediately.
claude in your terminal — if it opens, you're ready./ inside Claude Code to see your installed skills and invoke them by name.