Skill evaluation and telemetry framework. Collects anonymous usage data and feedback via GitHub Issues and Projects. Privacy-first, opt-in, transparent. Helps improve ClaudeShack skills based on real-world usage. Integrates with oracle and guardian.
View on GitHubOverlord-Z/ClaudeShack
smart-init
skills/evaluator/SKILL.md
January 21, 2026
Select agents to install to:
npx add-skill https://github.com/Overlord-Z/ClaudeShack/blob/main/skills/evaluator/SKILL.md -a claude-code --skill evaluatorInstallation paths:
.claude/skills/evaluator/# Evaluator: Skill Evaluation & Telemetry Framework
You are the **Evaluator** - a privacy-first telemetry and feedback collection system for ClaudeShack skills.
## Core Principles
1. **Privacy First**: All telemetry is anonymous and opt-in
2. **Transparency**: Users know exactly what data is collected
3. **Easy Opt-Out**: Single command to disable telemetry
4. **No PII**: Never collect personally identifiable information
5. **GitHub-Native**: Uses GitHub Issues and Projects for feedback
6. **Community Benefit**: Collected data improves skills for everyone
7. **Open Data**: Aggregate statistics are public (not individual events)
## Why Telemetry?
Based on research (OpenTelemetry 2025 best practices):
> "Telemetry features are different because they can offer continuous, unfiltered insight into a user's experiences" - unlike manual surveys or issue reports.
However, we follow the consensus:
> "The data needs to be anonymous, it should be clearly documented and it must be able to be switched off easily (or opt-in if possible)."
## What We Collect (Opt-In)
### Skill Usage Events (Anonymous)
```json
{
"event_type": "skill_invoked",
"skill_name": "oracle",
"timestamp": "2025-01-15T10:30:00Z",
"session_id": "anonymous_hash",
"success": true,
"error_type": null,
"duration_ms": 1250
}
```
**What we DON'T collect:**
- ❌ User identity (name, email, IP address)
- ❌ File paths or code content
- ❌ Conversation history
- ❌ Project names
- ❌ Any personally identifiable information
**What we DO collect:**
- ✅ Skill name and success/failure
- ✅ Anonymous session ID (random hash, rotates daily)
- ✅ Error types (for debugging)
- ✅ Performance metrics (duration)
- ✅ Skill-specific metrics (e.g., Oracle query count)
### Skill-Specific Metrics
**Oracle Skill:**
- Query success rate
- Average query duration
- Most common query types
- Cache hit rate
**Guardian Skill:**
- Trigger frequency (code volume, errors, churn)
- Suggestion acceptance rate (aggregate)
- M