thomas.kopp
c7cad4bb2a
feat: add whisper.cpp ROCm backend support for AMD GPU acceleration
...
- transcription.py: new _transcribe_remote_whispercpp() using /inference endpoint
- transcription.py: backend param routes to openai or whispercpp remote path
- config.py: whisper.backend default 'openai', alt 'whispercpp'
- pipeline.py: passes backend from config to transcribe_file
- settings: backend dropdown (OpenAI-compat / whisper.cpp)
- SETUP.md: whisper.cpp ROCm build and systemd setup instructions
whisper-cpp-server running on beastix :8080 (ROCm0, gfx1030, RX 6800 XT)
2026-04-02 01:33:32 +02:00
thomas.kopp
5f384af6cf
feat: add diarization section to settings page
...
Adds a "Diarisierung" section with an enabled/disabled toggle,
HuggingFace token input, and a help link to pyannote/speaker-diarization-3.1.
loadConfig() and the save handler now persist diarization settings.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-04-02 01:18:26 +02:00
thomas.kopp
0eb85b98f1
feat: add frontend speaker naming card for diarization
...
Shows a card with excerpt navigation and name inputs when the backend
emits speakers_unknown. Submitting posts the mapping to /speakers or
leaves speakers anonymous; handles awaiting_speakers status label.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-04-02 01:17:23 +02:00
thomas.kopp
52ba53bec4
fix: validate Ollama URL protocol before fetching api/tags
2026-04-01 20:51:23 +02:00
thomas.kopp
0bdc0a5e42
feat: settings page — PipeWire audio device + remote Whisper/Ollama config
...
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-04-01 20:48:56 +02:00
thomas.kopp
81fbbfb56e
feat: status includes is_admin, gear icon in header for admins
...
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-04-01 20:45:33 +02:00
thomas.kopp
ccdc75c74c
feat: show date and time in transcript list items
2026-04-01 14:40:01 +02:00
thomas.kopp
b74147967b
feat: tüit logo in header, clean transcript item layout with grouped action buttons
2026-04-01 14:37:03 +02:00
thomas.kopp
2ab6e7d73b
fix: move reprocess button to transcript list item, remove from modal
2026-04-01 14:30:28 +02:00
thomas.kopp
33ae9dc1d8
feat: reprocess existing transcript via Ollama — modal button + POST /transcripts/{filename}/reprocess
2026-04-01 14:27:15 +02:00
thomas.kopp
1ab023c2ac
feat: transcript modal with markdown rendering, delete button, remove preview section
2026-04-01 14:18:04 +02:00
thomas.kopp
629341930e
fix: show recording state on first click, fix error state via WS
2026-04-01 13:57:41 +02:00
thomas.kopp
6e317a9c67
fix: error state resettable via mic click, debug logging, pipeline traceback
2026-04-01 12:41:45 +02:00
thomas.kopp
6574481647
fix: web-based first-run setup — removes terminal input(), works under systemd
2026-04-01 12:26:17 +02:00
thomas.kopp
1466a1529f
feat: multi-user auth — per-user spaces, pbkdf2 passwords, session tokens, login page
2026-04-01 08:39:16 +02:00
thomas.kopp
b62c3d6604
feat: browser UI — tüit CI dark theme, XSS-safe DOM, auth-aware
2026-04-01 02:29:07 +02:00