Use when implementing ANY computer vision feature - image analysis, object detection, pose detection, person segmentation, subject lifting, hand/body pose tracking.
View on GitHub.claude-plugin/plugins/axiom/skills/axiom-ios-vision/SKILL.md
February 3, 2026
Select agents to install to:
npx add-skill https://github.com/CharlesWiltgen/Axiom/blob/main/.claude-plugin/plugins/axiom/skills/axiom-ios-vision/SKILL.md -a claude-code --skill axiom-ios-visionInstallation paths:
.claude/skills/axiom-ios-vision/# iOS Computer Vision Router **You MUST use this skill for ANY computer vision work using the Vision framework.** ## When to Use Use this router when: - Analyzing images or video - Detecting objects, faces, or people - Tracking hand or body pose - Segmenting people or subjects - Lifting subjects from backgrounds - Recognizing text in images (OCR) - Detecting barcodes or QR codes - Scanning documents - Using VisionKit or DataScannerViewController ## Routing Logic ### Vision Work **Implementation patterns** → `/skill axiom-vision` - Subject segmentation (VisionKit) - Hand pose detection (21 landmarks) - Body pose detection (2D/3D) - Person segmentation - Face detection - Isolating objects while excluding hands - Text recognition (VNRecognizeTextRequest) - Barcode/QR detection (VNDetectBarcodesRequest) - Document scanning (VNDocumentCameraViewController) - Live scanning (DataScannerViewController) - Structured document extraction (RecognizeDocumentsRequest, iOS 26+) **API reference** → `/skill axiom-vision-ref` - Complete Vision framework API - VNDetectHumanHandPoseRequest - VNDetectHumanBodyPoseRequest - VNGenerateForegroundInstanceMaskRequest - VNRecognizeTextRequest (fast/accurate modes) - VNDetectBarcodesRequest (symbologies) - DataScannerViewController delegates - RecognizeDocumentsRequest (iOS 26+) - Coordinate conversion patterns **Diagnostics** → `/skill axiom-vision-diag` - Subject not detected - Hand pose missing landmarks - Low confidence observations - Performance issues - Coordinate conversion bugs - Text not recognized or wrong characters - Barcodes not detected - DataScanner showing blank or no items - Document edges not detected ## Decision Tree 1. Implementing (pose, segmentation, OCR, barcodes, documents, live scanning)? → vision 2. Need API reference / code examples? → vision-ref 3. Debugging issues (detection failures, confidence, coordinates)? → vision-diag ## Anti-Rationalization | Thought | Reality | |---------|---------| | "Vision fr