ai-podcast

Author	SHA1	Message	Date
tcpsyn	3164a70e48	Ep13 publish, MLX whisper, voicemail system, hero redesign, massive topic expansion - Switch whisper transcription from faster-whisper (CPU) to lightning-whisper-mlx (GPU) - Fix word_timestamps hanging, use ffprobe for accurate duration - Add Cloudflare Pages Worker for SignalWire voicemail fallback when server offline - Add voicemail sync on startup, delete tracking, save feature - Add /feed RSS proxy to _worker.js (was broken by worker taking over routing) - Redesign website hero section: ghost buttons, compact phone, plain text links - Rewrite caller prompts for faster point-getting and host-following - Expand TOPIC_CALLIN from ~250 to 547 entries across 34 categories - Add new categories: biology, psychology, engineering, math, geology, animals, work, money, books, movies, relationships, health, language, true crime, drunk/high/unhinged callers - Remove bad Inworld voices (Pixie, Dominus), reduce repeat caller frequency - Add audio monitor device routing, uvicorn --reload-dir fix - Publish episode 13 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-16 01:56:47 -07:00
tcpsyn	95c2d06435	Postprod improvements: denoise, phone EQ, ad muting, ducking, voice mappings - Add host mic noise reduction (afftdn + anlmdn) - Add phone EQ bandpass on caller stem - Mute music during ads with 2s lookahead/tail - Increase ducking release to 3s to reduce pumping - Add Inworld voice mappings for all regular callers - Recording toggle endpoint, stem sync fixes Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 03:59:08 -07:00
tcpsyn	75f15ba2d2	Add persistent caller voices, Discord, REC/on-air linking, SEO fixes, ep9 - Returning callers now keep their voice across sessions (stored in regulars.json) - Backfilled voice assignments for all 11 existing regulars - Discord button on homepage + link in all page footers - REC and On-Air buttons now toggle together (both directions) - Fixed host mic double-stream bug (stem_mic vs host_stream conflict) - SEO: JSON-LD structured data on episode + how-it-works pages - SEO: noscript fallbacks, RSS links, twitter meta tags - Episode 9 transcript and sitemap update Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 00:24:37 -07:00
tcpsyn	cee78b5d88	Add speaker-labeled transcripts, favicon, host stream fix, episode page - Re-label all 8 episode transcripts with LUKE:/CALLER: speaker labels using LLM-based diarization (relabel_transcripts.py) - Add episode.html transcript page with styled speaker labels - Update publish_episode.py to generate speaker-labeled transcripts and copy to website/transcripts/ for Cloudflare Pages - Add SVG favicon with PNG fallbacks - Fix CPU issue: tie host audio stream to on-air toggle, not per-caller - Update how-it-works page with post-production pipeline info - Add transcript links to episode cards in app.js Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 15:19:45 -07:00
tcpsyn	7b7f9b8208	Add BunnyCDN integration, on-air website badge, publish script fixes - On-air toggle uploads status.json to BunnyCDN + purges cache, website polls it every 15s to show live ON AIR / OFF AIR badge - Publish script downloads Castopod's copy of audio for CDN upload (byte-exact match), removes broken slug fallback, syncs all episode media to CDN after publishing - Fix f-string syntax error in publish_episode.py (Python <3.12) - Enable CORS on BunnyCDN pull zone for json files - CDN URLs for website OG images, stem recorder bug fixes, LLM token budget tweaks, session context in CLAUDE.md Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 17:34:18 -07:00
tcpsyn	7d88c76f90	Add post-production pipeline: stem recorder, postprod script, recording UI New stem recording system captures 5 time-aligned WAV files (host, caller, music, sfx, ads) during live shows. Standalone postprod.py processes stems into broadcast-ready MP3 with gap removal, voice compression, music ducking, and EBU R128 loudness normalization. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 17:53:32 -07:00
tcpsyn	356bf145b8	Add show improvement features: crossfade, emotions, returning callers, transcripts, screening - Music crossfade: smooth 3-second blend between tracks instead of hard stop/start - Emotional detection: analyze host mood from recent messages so callers adapt tone - AI caller summaries: generate call summaries with timestamps for show history - Returning callers: persist regular callers across sessions with call history - Session export: generate transcripts with speaker labels and chapter markers - Caller screening: AI pre-screens phone callers to get name and topic while queued Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 02:43:01 -07:00
tcpsyn	bd6c8ccbab	Landing page: testimonials slider, how-it-works page, 25 TTS voices - Add testimonial slider with 8 fake caller reviews - Add how-it-works page with visual architecture diagram - Expand voice pools: Inworld 25 voices (14M/11F), ElevenLabs 22 (14M/8F) - Voice pools auto-switch when TTS provider changes - Add cover art locally, update cache-busted image refs - Add "More from Luke" footer links (MMG, prints, YouTube) - Ad channel configurable in settings UI Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 01:34:30 -07:00
tcpsyn	9452b07c5c	Ads play once on channel 11, separate from music - Add dedicated ad playback system (no loop, own channel) - Ad channel defaults to 11, saved/loaded with audio settings - Separate play_ad/stop_ad methods and API endpoints - Frontend stop button now calls /api/ads/stop instead of stopMusic Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 22:35:07 -07:00
tcpsyn	b0643d6082	Add recording diagnostics and refresh music list on play Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 01:00:41 -07:00
tcpsyn	9c5f7c5cfe	Add debug logging and safety for piggybacked recording - Log chunk count and peak audio level on recording stop - Add null check on _recorded_audio in callback - Small delay after stopping piggybacked recording for callback to finish Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:11:51 -07:00
tcpsyn	d583b48af0	Fix choppy/distorted audio to live caller - Mute host mic forwarding while TTS is streaming to prevent interleaving both audio sources into the same playback buffer - Replace nearest-neighbor downsampling with box-filter averaging on both server (host mic) and browser (caller mic) for anti-aliased resampling Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:01:33 -07:00
tcpsyn	eaedc4214b	Reduce live caller latency and improve reliability - Replace per-callback async task spawning with persistent queue-based sender - Buffer host mic to 60ms chunks (was 21ms) to reduce WebSocket frame rate - Reduce server ring buffer prebuffer from 150ms to 80ms - Reduce browser playback jitter buffer from 150ms to 100ms Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:47:17 -07:00
tcpsyn	af8606b5b7	Fix recording conflict when host stream is active When a live caller is on air, the host stream already has an InputStream open. Opening a second one for push-to-talk recording causes a conflict. Now recording piggybacks on the host stream callback instead. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:42:07 -07:00
tcpsyn	4d97ea9099	Replace queue with ring buffer jitter absorption for live caller audio - Server: 150ms pre-buffer ring buffer eliminates gaps from timing mismatches - Browser playback: 150ms jitter buffer (up from 80ms) for network jitter - Capture chunks: 960 samples/60ms (better network efficiency) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:37:50 -07:00
tcpsyn	7aed4d9c34	Fix live caller audio latency and choppiness - Reduce capture chunk from 4096 to 640 samples (256ms → 40ms) - Replace BufferSource scheduling with AudioWorklet playback ring buffer - Add 80ms jitter buffer with linear interpolation upsampling - Reduce host mic and live caller stream blocksizes from 4096/2048 to 1024 - Replace librosa.resample with numpy interpolation in send_audio_to_caller Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:32:27 -07:00
tcpsyn	ab36ad8d5b	Fix choppy audio and hanging when taking live callers - Use persistent callback-based output stream instead of opening/closing per chunk - Replace librosa.resample with simple decimation in real-time audio callbacks - Move host stream initialization to background thread to avoid blocking - Change live caller channel default to 9 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:24:27 -07:00
tcpsyn	cca8eaad84	Add live caller channel to audio settings Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:03:52 -07:00
tcpsyn	41ddc8ee35	Remove Twilio dependencies and cleanup references Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:54:35 -07:00
tcpsyn	863a81f87b	Add continuous host mic streaming to real callers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:51:17 -07:00
tcpsyn	88d7fd3457	Add Twilio WebSocket media stream handler with real-time transcription Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:36:04 -07:00
tcpsyn	029ce6d689	Initial commit: AI Radio Show web application - FastAPI backend with multiple TTS providers (Inworld, ElevenLabs, Kokoro, F5-TTS, etc.) - Web frontend with caller management, music, and soundboard - Whisper transcription integration - OpenRouter/Ollama LLM support - Castopod podcast publishing script Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-04 23:11:20 -07:00

22 Commits