Use when implementing ANY computer vision feature - image analysis, object detection, pose detection, person segmentation, subject lifting, hand/body pose tracking.
View on GitHubSelect agents to install to:
npx add-skill https://github.com/CharlesWiltgen/Axiom/blob/main/.claude-plugin/plugins/axiom/skills/axiom-ios-vision/SKILL.md -a claude-code --skill axiom-ios-visionInstallation paths:
.claude/skills/axiom-ios-vision/# iOS Computer Vision Router **You MUST use this skill for ANY computer vision work using the Vision framework.** ## When to Use Use this router when: - Analyzing images or video - Detecting objects, faces, or people - Tracking hand or body pose - Segmenting people or subjects - Lifting subjects from backgrounds - Recognizing text in images (OCR) - Detecting barcodes or QR codes - Scanning documents - Using VisionKit or DataScannerViewController ## Routing Logic ### Vision Work **Implementation patterns** → `/skill axiom-vision` - Subject segmentation (VisionKit) - Hand pose detection (21 landmarks) - Body pose detection (2D/3D) - Person segmentation - Face detection - Isolating objects while excluding hands - Text recognition (VNRecognizeTextRequest) - Barcode/QR detection (VNDetectBarcodesRequest) - Document scanning (VNDocumentCameraViewController) - Live scanning (DataScannerViewController) - Structured document extraction (RecognizeDocumentsRequest, iOS 26+) **API reference** → `/skill axiom-vision-ref` - Complete Vision framework API - VNDetectHumanHandPoseRequest - VNDetectHumanBodyPoseRequest - VNGenerateForegroundInstanceMaskRequest - VNRecognizeTextRequest (fast/accurate modes) - VNDetectBarcodesRequest (symbologies) - DataScannerViewController delegates - RecognizeDocumentsRequest (iOS 26+) - Coordinate conversion patterns **Diagnostics** → `/skill axiom-vision-diag` - Subject not detected - Hand pose missing landmarks - Low confidence observations - Performance issues - Coordinate conversion bugs - Text not recognized or wrong characters - Barcodes not detected - DataScanner showing blank or no items - Document edges not detected ## Decision Tree ``` User asks about computer vision ├─ Implementing? │ ├─ Pose detection (hand/body)? → vision │ ├─ Subject segmentation? → vision │ ├─ Text recognition/OCR? → vision │ ├─ Barcode/QR scanning? → vision │ ├─ Document scanning? → vision │ └─ Live camera scanning? → vision (DataS