Back to Skills

mindwork-transcribe

verified

Transcribe therapy session recordings to formatted text. Converts audio to clean, speaker-labeled transcripts (Me/Therapist format) with grammar correction and English translation. Use when processing therapy recordings, session audio, or any two-person conversation recording.

View on GitHub

Marketplace

mindwork

tolgaio/mindwork

Plugin

mindwork

productivity

Repository

tolgaio/mindwork

skills/mindwork-transcribe/SKILL.md

Last Verified

January 21, 2026

Install Skill

Select agents to install to:

Scope:
npx add-skill https://github.com/tolgaio/mindwork/blob/main/skills/mindwork-transcribe/SKILL.md -a claude-code --skill mindwork-transcribe

Installation paths:

Claude
.claude/skills/mindwork-transcribe/
Powered by add-skill CLI

Instructions

# Therapy Session Transcriber

Part of the **mindwork** suite. Converts therapy session recordings into clean, formatted transcripts.

## What It Does

1. **Chunks** large audio files at natural silence points (sentence boundaries)
2. **Transcribes** using OpenAI Whisper API
3. **Formats** as two-person conversation with **Me:** / **Therapist:** labels
4. **Corrects** grammar and transcription errors
5. **Translates** to English (for non-English sessions)

## Prerequisites

- Docker installed and running
- `OPENAI_API_KEY` environment variable set
- The `mindwork-transcribe` Docker image built (see Setup)

## Setup (One-Time)

Build the transcription Docker image from the plugin's transcribe directory:

```bash
# Find the mindwork plugin location and build the image
docker build -t mindwork-transcribe ~/src/mindwork/transcribe
```

Or if installed as a plugin, find the plugin path first:
```bash
# The transcribe tool is in the 'transcribe/' directory of this plugin
docker build -t mindwork-transcribe /path/to/mindwork/transcribe
```

## Usage

### Full Therapy Session Processing (Recommended)

Transcribe, format as conversation, and translate to English:

```bash
docker run --rm \
  -e OPENAI_API_KEY \
  -v $(pwd):/data \
  mindwork-transcribe /data/session.m4a --format-conversation --output /data/transcript.txt
```

### Raw Transcription Only

Just transcribe without formatting or translation:

```bash
docker run --rm \
  -e OPENAI_API_KEY \
  -v $(pwd):/data \
  mindwork-transcribe /data/session.m4a --output /data/transcript.txt
```

### With Speaker Diarization

For automatic speaker detection (alternative to --format-conversation):

```bash
docker run --rm \
  -e OPENAI_API_KEY \
  -v $(pwd):/data \
  mindwork-transcribe /data/session.m4a --diarize --output /data/transcript.txt
```

### Only Chunk (No Transcription)

Split a large file into chunks for later processing:

```bash
docker run --rm \
  -v $(pwd):/data \
  mindwork-transcribe /data/session.m4a --no-tr

Validation Details

Front Matter
Required Fields
Valid Name Format
Valid Description
Has Sections
Allowed Tools
Instruction Length:
4447 chars