Commit Graph

9 Commits

aa3899b1fc Harden LLM: model fallback chain, reuse client, remove fighting timeouts
- Primary model gets 15s, then falls back automatically through gemini-flash,
  gpt-4o-mini, and llama-3.1-8b (10s each)
- Always return a response: a canned in-character line as the last resort
- Reuse httpx client instead of creating new one per request
- Remove asyncio.timeout wrappers that were killing requests before
  the LLM service could try fallbacks

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-06 22:07:39 -07:00
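
A minimal sketch of the fallback chain this commit describes: a shared httpx client, per-model timeouts, and a canned in-character line as the last resort. MODEL_CHAIN, complete(), the model slugs, and the canned text are assumptions; the actual service code isn't part of this log.

```python
import httpx

# Assumed shape of the chain: primary gets 15s, each fallback gets 10s.
MODEL_CHAIN = [
    ("anthropic/claude-3.5-sonnet", 15.0),   # placeholder primary model
    ("google/gemini-flash-1.5", 10.0),
    ("openai/gpt-4o-mini", 10.0),
    ("meta-llama/llama-3.1-8b-instruct", 10.0),
]
CANNED_LINE = "You're on the air... caller, are you still with me?"

# One client reused across requests instead of a new one per call.
_client = httpx.AsyncClient()

async def complete(messages: list[dict], api_key: str) -> str:
    for model, timeout in MODEL_CHAIN:
        try:
            resp = await _client.post(
                "https://openrouter.ai/api/v1/chat/completions",
                headers={"Authorization": f"Bearer {api_key}"},
                json={"model": model, "messages": messages, "max_tokens": 100},
                timeout=timeout,
            )
            resp.raise_for_status()
            text = (resp.json()["choices"][0]["message"]["content"] or "").strip()
            if text:
                return text
        except (httpx.HTTPError, KeyError, IndexError):
            continue  # try the next model in the chain
    return CANNED_LINE  # never hang: fall back to a canned in-character line
```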
b3fb3b1127 Fix AI caller hanging on 'thinking...' indefinitely
- Add 30s timeout to all frontend fetch calls (safeFetch)
- Add 20s asyncio.timeout around lock+LLM in chat, ai-respond, auto-respond
- Reduce OpenRouter timeout from 60s to 25s
- Reduce Inworld TTS timeout from 60s to 25s
- Return graceful fallback responses on timeout instead of hanging

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-06 21:16:15 -07:00
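
A sketch of the 20s guard this commit added around the lock-plus-LLM section (and which aa3899b1fc above later removed in favor of timeouts inside the LLM service). The endpoint shape, CALL_LOCK, and generate_reply() are assumed names:

```python
import asyncio

CALL_LOCK = asyncio.Lock()  # assumed: serializes access to the live call state

async def generate_reply(caller_id: str, text: str) -> str:
    # Stand-in for the real OpenRouter call (25s client-side timeout).
    await asyncio.sleep(0)
    return "..."

async def chat(caller_id: str, text: str) -> dict:
    try:
        # Python 3.11+: one deadline over both the lock wait and the LLM call.
        async with asyncio.timeout(20):
            async with CALL_LOCK:
                reply = await generate_reply(caller_id, text)
    except TimeoutError:
        # Graceful fallback instead of leaving the caller on "thinking..."
        reply = "Lines are jammed tonight, caller. Give me that one more time?"
    return {"reply": reply}
```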
7adf1bbcad Fix LLM model list, Castopod API, and server runner
- Remove gpt-4o-realtime (WebSocket-only) from OpenRouter models
- Increase OpenRouter timeout to 60s and max_tokens to 150
- Handle empty LLM responses
- Fix publish_episode.py for current Castopod API fields
- Add port conflict check and graceful shutdown to run.sh

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-06 01:56:09 -07:00
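
For the empty-response handling, a guard along these lines is implied; extract_text() and the response shape are assumptions:

```python
def extract_text(data: dict) -> str | None:
    """Pull completion text from an OpenRouter-style response dict,
    treating missing or blank content as a failure rather than ""."""
    try:
        text = data["choices"][0]["message"]["content"]
    except (KeyError, IndexError, TypeError):
        return None
    text = (text or "").strip()
    return text or None  # None lets the caller retry or fall back
```

Returning None instead of an empty string lets the caller retry another model or fall back, rather than speak a blank line on air.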
a1c94a3682 Fix unnatural response cutoffs
- Replace aggressive sentence-count limiting with ensure_complete_thought()
  which only trims if the LLM was actually cut off mid-sentence
- Softer prompt guidance for natural brevity instead of rigid sentence count
- Keep max_tokens at 100 as a natural length cap

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-05 17:18:22 -07:00
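
ensure_complete_thought() plausibly reduces to a sketch like this; the regexes are assumptions, not the repo's actual patterns:

```python
import re

# Terminal punctuation, optionally followed by closing quotes/brackets.
_ENDS_SENTENCE = re.compile(r"[.!?…][\"')\]]*\s*$")
_SENTENCE_END = re.compile(r"[.!?…][\"')\]]*(?=\s|$)")

def ensure_complete_thought(text: str) -> str:
    """Only trim when the model was actually cut off mid-sentence;
    otherwise leave the response untouched."""
    text = text.strip()
    if _ENDS_SENTENCE.search(text):
        return text  # already a complete thought
    ends = list(_SENTENCE_END.finditer(text))
    if ends:
        return text[: ends[-1].end()].rstrip()  # trim back to last full sentence
    return text  # no complete sentence to trim back to; keep as-is
```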
9d4b8a0d22 Replace token-based truncation with sentence-count limiting
- Restore max_tokens to 150 so the LLM can finish its thoughts
- New limit_sentences() keeps only first 2 complete sentences
- Never cuts mid-sentence — always ends at punctuation
- Applied to both chat and auto-respond paths

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-05 17:15:04 -07:00
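
A sketch of what limit_sentences() may have looked like; the split pattern is an assumption:

```python
import re

# Split after terminal punctuation followed by whitespace.
_SPLIT = re.compile(r"(?<=[.!?])\s+")

def limit_sentences(text: str, max_sentences: int = 2) -> str:
    """Keep only the first N complete sentences; never cut mid-sentence."""
    parts = _SPLIT.split(text.strip())
    # A trailing fragment without terminal punctuation was cut off by
    # max_tokens, so drop it rather than end mid-sentence.
    complete = [p for p in parts if p and p[-1] in ".!?"]
    if not complete:
        return text.strip()  # nothing complete to keep; pass through
    return " ".join(complete[:max_sentences])
```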
6a56967540 Enforce shorter AI responses and prevent cut-off sentences
- Reduce max_tokens from 100 to 75 for shorter output
- Add truncate_to_complete_sentence() to trim at last punctuation
- Applied to both chat and auto-respond paths

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-05 17:07:41 -07:00
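
truncate_to_complete_sentence() is likely a single greedy-regex trim along these lines (a sketch; replaced in the very next commit, 9d4b8a0d22 above):

```python
import re

def truncate_to_complete_sentence(text: str) -> str:
    """Trim at the last terminal punctuation mark, dropping any
    cut-off fragment after it."""
    match = re.search(r"^.*[.!?]", text.strip(), flags=re.DOTALL)
    return match.group(0) if match else text.strip()
```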
0e65fa5084 Force shorter AI responses — max 1-2 sentences
- Much stronger prompt language: "no more than 2 sentences EVER"
- Added "DO NOT ramble" instruction
- Reduced max_tokens back to 100 as hard limit

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-05 17:05:51 -07:00
3192735615 Fix AI responses being cut off
- Increase max_tokens from 100 to 150 to avoid mid-sentence truncation
- Tighten prompt to 1-2 short sentences with emphasis on completing them

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-05 17:04:12 -07:00
029ce6d689 Initial commit: AI Radio Show web application
- FastAPI backend with multiple TTS providers (Inworld, ElevenLabs, Kokoro, F5-TTS, etc.)
- Web frontend with caller management, music, and soundboard
- Whisper transcription integration
- OpenRouter/Ollama LLM support
- Castopod podcast publishing script

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-04 23:11:20 -07:00