ai-podcast

Author	SHA1	Message	Date
luke	c516402402	Update model routing with latest OpenRouter models Style-matched defaults: - Grok 4.1 Fast for edgy callers (high_energy, confrontational, comedian etc.) - Claude Sonnet 4.6 for emotional callers (quiet_nervous, sweet_earnest, emotional) - Mistral Large 2512 for deadpan/mysterious/world-weary - DeepSeek R1 Distill for storyteller/oversharer/conspiracy/rambler - Gemini 2.5 Flash for know_it_all - Llama 3.3 70B for first_time/reluctant callers Category routing: Grok 4.1 Fast for dialog/devon/backgrounds, Gemini Flash for monitor/summary Updated OPENROUTER_MODELS and OPENROUTER_PRICING with all new models Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 02:31:33 -06:00
luke	314d5f9452	Caller model routing — cycle, style-matched, mid-show override - Three strategies: single model, cycle through pool, style-matched - 18 communication styles mapped to 7 models (Grok, Sonnet, Mistral, Qwen, DeepSeek, Gemini, Llama) - Per-caller model locked for entire call, overridable mid-show - Model badges on caller buttons and info panel - Settings UI for strategy, pool, style mapping, fallback - Fallback to Sonnet on model failure - 6 new models added to pricing and dropdown - Checkpoint persistence for all model state Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 01:58:03 -06:00
luke	6dcdf20289	Grok 4 routing, guardrails, pricing fix, strip silence improvements - Route caller_dialog, devon_ask, background_gen to x-ai/grok-4 - Add Grok-4 to OPENROUTER_MODELS and OPENROUTER_PRICING - Add Grok-specific banned phrases (I hear you, fair enough, that's wild, etc.) - Add background gen guardrails for Grok (no active violence, no real public figures) - Soften theme prompt hot-take language for organic connections - Tighten Devon flirting guardrail (awkward not crude) - Fix Devon "first day" contradiction on line 36 - Strip silence: preserve music intro, fix ad normalization (direct WAV reading) - Strip silence: loop range starts 0.5s before audible music Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 17:46:29 -06:00
luke	3dd6a83c68	Full app audit: 24 fixes across backend, frontend, infra, content, social Critical fixes: - Fix hangup-during-respond crash (null caller guard) - Fix double-click caller race condition - Stem recorder: non-daemon thread, disk error handling, 30s flush timeout - Frontend startCall() error handling High priority: - Devon: filter tool errors from speech, shorter monitor prompt, 30s interval - TTS ghost message fix (add to history after TTS, not before) - Expand banned phrase list (12 new phrases) - Increase returning callers from 1 to 2 per session - Platform-tailored social posts with staggered scheduling - YouTube dynamic tags from episode content - Social post retry logic (2 attempts, 5s delay) - Frontend: error handling on all raw fetch calls Medium: - stem_recorder null check race (local var capture in audio.py) - Reactive shape directive expanded - REACT TO LUKE moved higher in caller prompt - Devon tenure updated ("few weeks" not "first day") - D shortcut Escape to unfocus - Volume slider debounced (150ms) - Settings modal widened to 550px - Backup script (daily MariaDB dump + data/ rsync to NAS) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 14:57:50 -06:00
luke	164cad456c	Devon own stem/track/channel, per-category LLM routing, settings UI cleanup Audio: - Devon gets own stem, Reaper track (Input 17), and configurable channel - play_caller_audio accepts stem_name + channel_override params - Reaper script checks 4 voice tracks (Host, Devon, Live Caller, AI Caller) - postprod.py includes devon stem in gap detection Cost optimization: - Per-category model routing: Sonnet for caller dialog, Gemini Flash for everything else - Estimated 65% cost reduction ($4.32 → ~$1.50/show) - Category models configurable from settings UI Frontend: - Settings panel: clean routing grid for output channels, model routing grid for LLM categories - Devon channel added to audio routing - Share icon SVG fill fix (currentColor) - Website homepage iterations Publishing: - Revert Castopod API workaround (API re-enabled) - Fix container media path Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-16 17:05:19 -06:00
luke	c70f83d04a	Cost monitoring, PTT fix, Devon tuning, WEIRD pool expansion, YT thumbnails, LLM SEO, publish ep37 - Add real-time LLM/TTS cost tracking with live status bar display and post-show reports - Fix PTT bug where Devon suggestion layout shift stopped recording via mouseleave - Devon: facts-only during calls, full personality between calls - Double WEIRD topic pool (109→203), bump weight to 14-25% - Auto-generate YouTube thumbnails with bold hook text in publish pipeline - LLM SEO: llms.txt, robots.txt for LLM crawlers, structured data, BreadcrumbList schemas - Publish episode 37 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-15 05:33:27 -06:00
luke	6d4e490283	Caller generation overhaul, Devon intern, frontend redesign Caller system: structured JSON backgrounds, voice-personality matching (68 profiles), thematic inter-caller awareness, adaptive call shapes, show pacing, returning caller memory with relationships/arcs, post-call quality signals, 95 comedy writer entries. Devon the Intern: persistent show character with tool-calling LLM (web search, Wikipedia, headlines, webpage fetch), auto-monitoring, 6 API endpoints, full frontend UI. Frontend: wrap-up nudge button, caller info panel with shape/energy/emotion badges, keyboard shortcuts (1-0/H/W/M/D), pinned SFX, visual polish, Devon panel. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 01:54:08 -06:00
luke	f7b75fa72f	Clips page, new episodes, TTS/audio improvements, publish pipeline updates Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 05:38:58 -06:00
luke	6eeab58464	TTS fixes, Inworld improvements, footer redesign, episodes 15-25, invoice script fix - Fix TTS text pipeline: new caps handling (spell out unknown acronyms, lowercase emphasis words), action-word lookahead for parenthetical stripping, abbreviation expansions (US→United States, NM→New Mexico), pronunciation fixes - Inworld TTS: camelCase API fields, speakingRate per-voice overrides, retry logic with exponential backoff (3 attempts) - Footer redesign: SVG icons for social/podcast links across all pages - Stats page: show "Rate us on Spotify" instead of "not public" placeholder - New voices, expanded caller prompts and problem scenarios - Social posting via Postiz, YouTube upload in publish pipeline - Episode transcripts 15-25, terms page, sitemap updates - Fix invoice script: match Timing totals using merged Task+App intervals Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-02 12:38:58 -07:00
luke	28af0723c7	Ep12 publish, caller prompt overhaul, favicon, publish fixes, website updates - Reworked caller prompt: edgy/flirty personality, play along with host bits - Bumped caller token budget (200-550 range, was 150-450) - Added 20 layered/morally ambiguous caller stories - Valentine's Day awareness in seasonal context - Default LLM model: claude-sonnet-4-5 (was claude-3-haiku) - Publish: SCP-based SQL transfer (fixes base64 encoding on NAS) - Favicons: added .ico, 48px, 192px PNGs for Google search results - Website: button layout cleanup, privacy page, ep12 transcript - Control panel: channel defaults match audio_settings.json - Disabled OP3 permanently (YouTube ingest issues on large files) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-14 22:53:34 -07:00
luke	7b7f9b8208	Add BunnyCDN integration, on-air website badge, publish script fixes - On-air toggle uploads status.json to BunnyCDN + purges cache, website polls it every 15s to show live ON AIR / OFF AIR badge - Publish script downloads Castopod's copy of audio for CDN upload (byte-exact match), removes broken slug fallback, syncs all episode media to CDN after publishing - Fix f-string syntax error in publish_episode.py (Python <3.12) - Enable CORS on BunnyCDN pull zone for json files - CDN URLs for website OG images, stem recorder bug fixes, LLM token budget tweaks, session context in CLAUDE.md Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 17:34:18 -07:00
luke	7d88c76f90	Add post-production pipeline: stem recorder, postprod script, recording UI New stem recording system captures 5 time-aligned WAV files (host, caller, music, sfx, ads) during live shows. Standalone postprod.py processes stems into broadcast-ready MP3 with gap removal, voice compression, music ducking, and EBU R128 loudness normalization. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 17:53:32 -07:00
luke	aa3899b1fc	Harden LLM: model fallback chain, reuse client, remove fighting timeouts - Primary model gets 15s, then auto-falls back through gemini-flash, gpt-4o-mini, llama-3.1-8b (10s each) - Always returns a response — canned in-character line as last resort - Reuse httpx client instead of creating new one per request - Remove asyncio.timeout wrappers that were killing requests before the LLM service could try fallbacks Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 22:07:39 -07:00
luke	b3fb3b1127	Fix AI caller hanging on 'thinking...' indefinitely - Add 30s timeout to all frontend fetch calls (safeFetch) - Add 20s asyncio.timeout around lock+LLM in chat, ai-respond, auto-respond - Reduce OpenRouter timeout from 60s to 25s - Reduce Inworld TTS timeout from 60s to 25s - Return graceful fallback responses on timeout instead of hanging Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:16:15 -07:00
luke	7adf1bbcad	Fix LLM model list, Castopod API, and server runner - Remove gpt-4o-realtime (WebSocket-only) from OpenRouter models - Increase OpenRouter timeout to 60s and max_tokens to 150 - Handle empty LLM responses - Fix publish_episode.py for current Castopod API fields - Add port conflict check and graceful shutdown to run.sh Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 01:56:09 -07:00
luke	a1c94a3682	Fix unnatural response cutoffs - Replace aggressive sentence-count limiting with ensure_complete_thought() which only trims if the LLM was actually cut off mid-sentence - Softer prompt guidance for natural brevity instead of rigid sentence count - max_tokens at 100 as natural length cap Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:18:22 -07:00
luke	9d4b8a0d22	Replace token-based truncation with sentence-count limiting - max_tokens back to 150 so LLM can finish thoughts - New limit_sentences() keeps only first 2 complete sentences - Never cuts mid-sentence — always ends at punctuation - Applied to both chat and auto-respond paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:15:04 -07:00
luke	6a56967540	Enforce shorter AI responses and prevent cut-off sentences - Reduce max_tokens from 100 to 75 for shorter output - Add truncate_to_complete_sentence() to trim at last punctuation - Applied to both chat and auto-respond paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:07:41 -07:00
luke	0e65fa5084	Force shorter AI responses — max 1-2 sentences - Much stronger prompt language: "no more than 2 sentences EVER" - Added "DO NOT ramble" instruction - Reduced max_tokens back to 100 as hard limit Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:05:51 -07:00
luke	3192735615	Fix AI responses being cut off - Increase max_tokens from 100 to 150 to avoid mid-sentence truncation - Tighten prompt to 1-2 short sentences with emphasis on completing them Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:04:12 -07:00
luke	029ce6d689	Initial commit: AI Radio Show web application - FastAPI backend with multiple TTS providers (Inworld, ElevenLabs, Kokoro, F5-TTS, etc.) - Web frontend with caller management, music, and soundboard - Whisper transcription integration - OpenRouter/Ollama LLM support - Castopod podcast publishing script Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-04 23:11:20 -07:00

21 Commits