ai-podcast

Author	SHA1	Message	Date
luke	d5fd89fc9a	Add on-air toggle for phone call routing When off air, callers hear a message and get disconnected. When on air, calls route normally. Toggle button added to frontend header with pulsing red ON AIR indicator. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:03:38 -07:00
luke	7adf1bbcad	Fix LLM model list, Castopod API, and server runner - Remove gpt-4o-realtime (WebSocket-only) from OpenRouter models - Increase OpenRouter timeout to 60s and max_tokens to 150 - Handle empty LLM responses - Fix publish_episode.py for current Castopod API fields - Add port conflict check and graceful shutdown to run.sh Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 01:56:09 -07:00
luke	a94fc92647	Improve SignalWire streaming, randomize caller names, update frontend - Add streamSid tracking and per-caller send locks for SignalWire - Improve TTS streaming with real-time pacing and detailed logging - Block host audio to caller during TTS playback - Randomize caller names between sessions from name pools - Update page title and show phone number in UI Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 01:56:05 -07:00
luke	b0643d6082	Add recording diagnostics and refresh music list on play Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 01:00:41 -07:00
luke	0412f4487f	Enhance caller personality for depth and authenticity Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 00:39:49 -07:00
luke	50e3d3af7d	Include news and research context in caller prompts Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 00:35:55 -07:00
luke	a06d0a22e1	Wire up headline fetch and background research triggers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 00:22:49 -07:00
luke	e46337a05a	Add session news/research fields and helper functions Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 00:21:25 -07:00
luke	e28579f909	Add NewsService for current events awareness Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 00:18:40 -07:00
luke	9016f9734f	Add SignalWire endpoints, update queue/hangup for phone callers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:45:08 -07:00
luke	051790136e	Update CallerService for SignalWire protocol Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:41:27 -07:00
luke	c22818bfec	Add SignalWire configuration Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:38:41 -07:00
luke	a1c94a3682	Fix unnatural response cutoffs - Replace aggressive sentence-count limiting with ensure_complete_thought() which only trims if the LLM was actually cut off mid-sentence - Softer prompt guidance for natural brevity instead of rigid sentence count - max_tokens at 100 as natural length cap Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:18:22 -07:00
luke	9d4b8a0d22	Replace token-based truncation with sentence-count limiting - max_tokens back to 150 so LLM can finish thoughts - New limit_sentences() keeps only first 2 complete sentences - Never cuts mid-sentence — always ends at punctuation - Applied to both chat and auto-respond paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:15:04 -07:00
luke	9c5f7c5cfe	Add debug logging and safety for piggybacked recording - Log chunk count and peak audio level on recording stop - Add null check on _recorded_audio in callback - Small delay after stopping piggybacked recording for callback to finish Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:11:51 -07:00
luke	6a56967540	Enforce shorter AI responses and prevent cut-off sentences - Reduce max_tokens from 100 to 75 for shorter output - Add truncate_to_complete_sentence() to trim at last punctuation - Applied to both chat and auto-respond paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:07:41 -07:00
luke	0e65fa5084	Force shorter AI responses — max 1-2 sentences - Much stronger prompt language: "no more than 2 sentences EVER" - Added "DO NOT ramble" instruction - Reduced max_tokens back to 100 as hard limit Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:05:51 -07:00
luke	3192735615	Fix AI responses being cut off - Increase max_tokens from 100 to 150 to avoid mid-sentence truncation - Tighten prompt to 1-2 short sentences with emphasis on completing them Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:04:12 -07:00
luke	d583b48af0	Fix choppy/distorted audio to live caller - Mute host mic forwarding while TTS is streaming to prevent interleaving both audio sources into the same playback buffer - Replace nearest-neighbor downsampling with box-filter averaging on both server (host mic) and browser (caller mic) for anti-aliased resampling Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 17:01:33 -07:00
luke	d4e25ceb88	Stream TTS audio to caller in real-time chunks TTS audio was sent as a single huge WebSocket frame that overflowed the browser's 3s ring buffer. Now streams in 60ms chunks at real-time rate. Also increased browser ring buffer from 3s to 10s as safety net. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:56:22 -07:00
luke	97d37f3381	Send AI TTS audio to live caller during auto-respond The auto-respond function played AI TTS to the local Loopback channel but didn't send it over WebSocket to the live caller in the browser. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:53:41 -07:00
luke	eaedc4214b	Reduce live caller latency and improve reliability - Replace per-callback async task spawning with persistent queue-based sender - Buffer host mic to 60ms chunks (was 21ms) to reduce WebSocket frame rate - Reduce server ring buffer prebuffer from 150ms to 80ms - Reduce browser playback jitter buffer from 150ms to 100ms Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:47:17 -07:00
luke	af8606b5b7	Fix recording conflict when host stream is active When a live caller is on air, the host stream already has an InputStream open. Opening a second one for push-to-talk recording causes a conflict. Now recording piggybacks on the host stream callback instead. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:42:07 -07:00
luke	4d97ea9099	Replace queue with ring buffer jitter absorption for live caller audio - Server: 150ms pre-buffer ring buffer eliminates gaps from timing mismatches - Browser playback: 150ms jitter buffer (up from 80ms) for network jitter - Capture chunks: 960 samples/60ms (better network efficiency) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:37:50 -07:00
luke	7aed4d9c34	Fix live caller audio latency and choppiness - Reduce capture chunk from 4096 to 640 samples (256ms → 40ms) - Replace BufferSource scheduling with AudioWorklet playback ring buffer - Add 80ms jitter buffer with linear interpolation upsampling - Reduce host mic and live caller stream blocksizes from 4096/2048 to 1024 - Replace librosa.resample with numpy interpolation in send_audio_to_caller Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:32:27 -07:00
luke	ab36ad8d5b	Fix choppy audio and hanging when taking live callers - Use persistent callback-based output stream instead of opening/closing per chunk - Replace librosa.resample with simple decimation in real-time audio callbacks - Move host stream initialization to background thread to avoid blocking - Change live caller channel default to 9 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:24:27 -07:00
luke	bcd0d96185	Fix slow hangup by moving LLM summarization and SFX to background Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:09:10 -07:00
luke	cca8eaad84	Add live caller channel to audio settings Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 16:03:52 -07:00
luke	41ddc8ee35	Remove Twilio dependencies and cleanup references Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:54:35 -07:00
luke	82ad234480	Add browser call-in page and update host dashboard for browser callers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:52:54 -07:00
luke	863a81f87b	Add continuous host mic streaming to real callers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:51:17 -07:00
luke	bf140a77b7	Add browser caller WebSocket handler with PCM audio streaming Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:49:49 -07:00
luke	06f334359e	Remove Twilio endpoints and dependencies Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:48:12 -07:00
luke	3961cfc9d4	Rename TwilioService to CallerService, remove Twilio-specific audio encoding Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 15:45:08 -07:00
luke	141f81232e	Add AI follow-up system with call summarization and show history Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:42:35 -07:00
luke	c82420ddad	Add outbound audio streaming to real callers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:39:02 -07:00
luke	88d7fd3457	Add Twilio WebSocket media stream handler with real-time transcription Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:36:04 -07:00
luke	28ff8c2d16	Add Twilio webhook and queue management endpoints Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:33:03 -07:00
luke	924ddca71a	Add Twilio call queue service with channel allocation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:31:02 -07:00
luke	00c2e8f018	Add Session multi-party call support and show history - Add CallRecord dataclass for tracking call history - Extend Session with call_history, active_real_caller, ai_respond_mode, auto_followup - Add get_show_history() for AI caller context about earlier calls - Update get_conversation_summary() to handle host/real_caller/ai_caller roles - Update reset() to clear new fields - Add tests for all new functionality Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:25:32 -07:00
luke	fad182f7e5	Add Twilio config and dependencies Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 13:20:30 -07:00
luke	029ce6d689	Initial commit: AI Radio Show web application - FastAPI backend with multiple TTS providers (Inworld, ElevenLabs, Kokoro, F5-TTS, etc.) - Web frontend with caller management, music, and soundboard - Whisper transcription integration - OpenRouter/Ollama LLM support - Castopod podcast publishing script Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-04 23:11:20 -07:00

42 Commits