Curate and add documents to the golden dataset with multi-agent validation. Use when adding test data, creating golden datasets, or saving examples.
Repository: yonatangross/orchestkit
ork-workflows-advanced
January 25, 2026
npx add-skill https://github.com/yonatangross/orchestkit/blob/main/plugins/ork-workflows-advanced/skills/add-golden/SKILL.md -a claude-code --skill add-golden
Installation paths:
.claude/skills/add-golden/
# Add to Golden Dataset
Multi-agent curation workflow with quality score explanations, bias detection, and version tracking.
## Quick Start
```bash
/add-golden https://example.com/article
/add-golden https://arxiv.org/abs/2312.xxxxx
```
---
## Task Management (CC 2.1.16)
```python
# Create the main curation task ({url} is a placeholder for the input URL)
TaskCreate(
    subject="Add to golden dataset: {url}",
    description="Multi-agent curation with quality explanation",
    activeForm="Curating document",
)

# Create one subtask per step of the 9-phase process
phases = [
    "Fetch content", "Run quality analysis", "Explain scores",
    "Check bias", "Check diversity", "Validate", "Get approval",
    "Write to dataset", "Update version",
]
for phase in phases:
    # Appending "ing" to the whole phrase yields labels like "Fetch contenting",
    # so the in-progress label is built with a prefix instead.
    TaskCreate(subject=phase, activeForm=f"In progress: {phase}")
```
---
## Workflow Overview
| Phase | Activities | Output |
|-------|------------|--------|
| **1. Input Collection** | Get URL, detect content type | Document metadata |
| **2. Fetch and Extract** | Parse document structure | Structured content |
| **3. Quality Analysis** | 4 parallel agents evaluate | Raw scores |
| **4. Quality Explanation** | Explain WHY each score | Score rationale |
| **5. Bias Detection** | Check for bias in content | Bias report |
| **6. Diversity Check** | Assess dataset balance | Diversity metrics |
| **7. Validation** | Schema, duplicates, gates | Validation status |
| **8. Silver-to-Gold** | Promote or mark as silver | Classification |
| **9. Version Tracking** | Track changes, rollback | Version entry |
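
One way to picture the table above is as an ordered list of phases that pass a shared context along. The sketch below is only an illustration under that assumption: `Phase`, `PHASES`, and `run_pipeline` are hypothetical names, and the skill itself may coordinate its phases differently.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Context = Dict[str, object]  # shared state handed from phase to phase (assumed shape)


@dataclass(frozen=True)
class Phase:
    number: int
    name: str
    output: str  # the "Output" column of the workflow table


# Phase names and outputs copied from the workflow table.
PHASES: List[Phase] = [
    Phase(1, "Input Collection", "Document metadata"),
    Phase(2, "Fetch and Extract", "Structured content"),
    Phase(3, "Quality Analysis", "Raw scores"),
    Phase(4, "Quality Explanation", "Score rationale"),
    Phase(5, "Bias Detection", "Bias report"),
    Phase(6, "Diversity Check", "Diversity metrics"),
    Phase(7, "Validation", "Validation status"),
    Phase(8, "Silver-to-Gold", "Classification"),
    Phase(9, "Version Tracking", "Version entry"),
]


def run_pipeline(handlers: Dict[int, Callable[[Context], Context]], ctx: Context) -> Context:
    """Run registered handlers in table order; each returns the updated context."""
    completed: List[str] = []
    for phase in PHASES:
        handler = handlers.get(phase.number)
        if handler is None:
            continue  # this sketch simply skips phases without a handler
        ctx = handler(ctx)
        completed.append(phase.name)
    ctx["completed"] = completed
    return ctx
```

For example, registering a single handler for phase 1 and calling `run_pipeline(handlers, {"url": "https://example.com/article"})` returns the context with `completed` set to `["Input Collection"]`.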
---
## Phase 1-2: Input and Extraction
Detect content type: article, tutorial, documentation, research_paper.
Extract: title, sections, code blocks, key terms, metadata (author, date).
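
The two steps above could be sketched as follows. This is a minimal illustration, not the skill's actual logic: the URL heuristics in `detect_content_type` and the `ExtractedDocument` container are assumptions; only the four content types and the extracted field names come from the text.

```python
from dataclasses import dataclass, field
from typing import List, Optional
from urllib.parse import urlparse

# The four content types named in the text.
CONTENT_TYPES = ("article", "tutorial", "documentation", "research_paper")


@dataclass
class ExtractedDocument:
    """Hypothetical container for the fields listed under 'Extract'."""
    title: str
    sections: List[str] = field(default_factory=list)
    code_blocks: List[str] = field(default_factory=list)
    key_terms: List[str] = field(default_factory=list)
    author: Optional[str] = None
    date: Optional[str] = None


def detect_content_type(url: str) -> str:
    """Guess a content type from the URL alone (assumed heuristic)."""
    parsed = urlparse(url)
    host = parsed.netloc.lower()
    path = parsed.path.lower()
    if host.endswith("arxiv.org"):
        return "research_paper"
    if host.startswith("docs.") or "/docs/" in path:
        return "documentation"
    if "tutorial" in path:
        return "tutorial"
    return "article"  # fallback when no other rule matches
```

A real implementation would likely also inspect the fetched content, since a URL by itself is a weak signal.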
---
## Phase 3: Parallel Quality Analysis (4 Agents)
Launch ALL agents in ONE message with `run_in_background=True`.
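
The instruction above describes a fan-out pattern: start every agent before waiting on any of them. As a rough analogy in plain Python (this is not the Claude Code mechanism, and `run_agent` is a hypothetical stand-in for an agent's evaluation), the same shape looks like this:

```python
import asyncio
from typing import Dict, List


async def run_agent(name: str, document: str) -> Dict[str, object]:
    """Hypothetical stand-in for one agent's evaluation of the document."""
    await asyncio.sleep(0)  # yield control, as real I/O-bound work would
    return {"agent": name, "chars_reviewed": len(document)}


async def analyze_in_parallel(agent_names: List[str], document: str) -> List[Dict[str, object]]:
    # Fan-out: schedule every agent up front instead of awaiting one at a time.
    tasks = [asyncio.create_task(run_agent(name, document)) for name in agent_names]
    # Fan-in: gather returns results in the same order as agent_names.
    return list(await asyncio.gather(*tasks))
```

Calling `asyncio.run(analyze_in_parallel(["code-quality-reviewer", "workflow-architect"], text))` returns one result per agent, in the order the names were given.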
| Agent | Focus | Output |
|-------|-------|--------|
| code-quality-reviewer | Accuracy, coherence, depth, relevance | Quality scores |
| workflow-architect | Keyword dir