Commit Graph

6 Commits

Author SHA1 Message Date
tomatocream c487ba8c08 feat: filter short audio segments (mic bumps) and add debug notebook
Mic bumps produce transient spikes that pass VAD onset detection but
contain no real speech — the model hallucinates "thank you" from them.
Added MIN_SPEECH_SECONDS (0.3s) filter to discard segments where the
actual speech portion is too short.
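
A minimal sketch of how a filter like this might look, assuming each VAD
segment records the sample offsets where speech starts and ends. Only
MIN_SPEECH_SECONDS and its 0.3s value come from the commit message; the
Segment class, the helper names, and the 16 kHz sample rate are hypothetical
and are not taken from the repository.

```python
from dataclasses import dataclass

MIN_SPEECH_SECONDS = 0.3  # threshold named in the commit message


@dataclass
class Segment:
    """Hypothetical VAD output: sample offsets bounding the detected speech."""
    speech_start: int
    speech_end: int


def speech_seconds(seg: Segment, sample_rate: int) -> float:
    """Duration of the speech portion of a segment, in seconds."""
    return (seg.speech_end - seg.speech_start) / sample_rate


def drop_short_segments(segments, sample_rate=16_000,
                        min_seconds=MIN_SPEECH_SECONDS):
    """Keep only segments whose speech portion lasts at least min_seconds."""
    return [s for s in segments
            if speech_seconds(s, sample_rate) >= min_seconds]


# A 0.1s transient (mic bump) is dropped; a 0.5s utterance is kept.
segments = [Segment(0, 1_600), Segment(0, 8_000)]
kept = drop_short_segments(segments)
```

Filtering on the speech portion rather than on the whole segment matters if
segments carry padding around the detected speech: a padded mic bump could
otherwise look long enough to pass.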

Added a Jupyter notebook (notebooks/audio_debug.ipynb) for real-time
audio visualization: streams RMS + peak amplitude into a live Plotly
FigureWidget, then provides post-hoc waveform inspection, segment
playback, and side-by-side segment comparison.
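
For reference, the two per-chunk metrics the notebook streams can be
computed as below. This is a sketch under stated assumptions, not the
notebook's code: the function name is hypothetical, samples are assumed to
be floats in [-1.0, 1.0], and the Plotly plotting side is omitted.

```python
import math


def rms_and_peak(samples):
    """Return (RMS, peak absolute amplitude) for one chunk of audio samples."""
    if not samples:
        return 0.0, 0.0
    rms = math.sqrt(sum(x * x for x in samples) / len(samples))
    peak = max(abs(x) for x in samples)
    return rms, peak


# A lone spike in an otherwise silent chunk: the peak is at full scale
# while the RMS stays well below it.
bump_rms, bump_peak = rms_and_peak([0.0, 0.0, 0.0, 1.0])
```

Plotting both values is useful because a short transient raises the peak
much more than the RMS, while sustained speech raises both.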

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-06-01 16:16:31 +08:00
tomatocream 92d8ba28d0 feat: add Typer CLI with daemon mode and wtype keyboard injection
Replace argparse CLI with Typer-based CLI supporting `cohere on/off/status`
commands. The daemon runs transcription in the background and types into the
focused Wayland window via wtype. Adds wtype to flake.nix and fixes the
hatchling build backend.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-05-30 21:09:32 +08:00
tomatocream 8d517b3ea8 refactor: restructure project into src layout with proper packaging
Split monolithic transcribe.py into focused modules under
src/cohere_transcribe/ (model, vad, stream, cli), move tests into
tests/, add hatchling build system and CLI entry point, remove
unused shell.nix and main.py.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-05-30 00:45:56 +08:00
tomatocream 8b88489a53 Simplify to audio file input (mic requires PortAudio on NixOS)
2026-05-26 01:49:52 +08:00
tomatocream 14abcb89f2 Add accelerate dependency
2026-05-26 01:38:10 +08:00
tomatocream 82fe21fe41 Add Cohere Transcribe demo with uv + Python 3.14
2026-05-26 01:35:10 +08:00