Your meetings already have captions. This app saves them.

CaptionSnap is a macOS application designed to capture and save live meeting captions generated by video conferencing platforms such as Microsoft Teams, Zoom, and Google Meet. It addresses the common limitation of ephemeral on-screen captions—particularly in environments where users require permanent, searchable, and well-structured records of spoken content without compromising privacy. The tool is intended for professionals who participate in frequent remote meetings and need accurate, speaker-attributed transcripts for documentation, follow-up, accessibility, or compliance purposes.
Unlike traditional transcription services, CaptionSnap does not record, process, or transmit audio. Instead, it leverages the caption text already rendered by supported meeting applications, capturing it directly from the screen using macOS Accessibility APIs. All processing occurs locally on the user’s Mac, ensuring full data sovereignty and minimal system resource usage.
CaptionSnap functions by monitoring the visible caption text displayed within active meeting windows. When live captions are enabled in Teams, Zoom, or Google Meet—either via native platform features or macOS Live Captions—the application detects and captures each line of text along with its timing and associated speaker name. Speaker identification is derived directly from metadata provided by the meeting application, eliminating the need for speaker diarization or AI inference.
The workflow requires no user intervention beyond enabling captions in the meeting app and launching CaptionSnap. Once running, it operates silently in the background, even when the meeting window is obscured by other applications. Captured content is saved as plain-text Markdown files (e.g., transcript.md) to a user-specified location on the local Mac. Optional post-meeting summaries are generated using Apple Intelligence models that run exclusively on-device, preserving confidentiality and performance.
CaptionSnap supports a range of professional use cases including meeting documentation for distributed teams, accessibility accommodations for hearing-impaired participants, legal or regulatory recordkeeping, and personal knowledge management. Because transcripts retain precise timestamps and speaker attribution, users can efficiently search, annotate, and cross-reference discussions. The local-first architecture enables integration with existing workflows—such as importing into note-taking apps (Obsidian, Notion), version control systems, or document management tools—without vendor lock-in. Export options (PDF, DOCX) facilitate sharing with stakeholders who do not use Markdown. Its low-resource design ensures compatibility with older Mac hardware and extended battery life during all-day meeting schedules.