AI Podcast Transcription
Podcast Generation and Transcription System for Truth Network
Podcast Generation and Transcription System for Truth Network
Architected an internal AI content-pipeline platform to drive continuous SEO-rich content growth. The system monitors ~100 podcast feeds in MySQL, detects new episodes, downloads audio, and performs low-cost GPU inference via API to a GPU cloud provider using a custom Docker image — transcribing via NVIDIA Parakeet-tdt with Groq fallback. From each episode, it generates structured enrichment (1–2 paragraph summaries, SEO-specific keyword clusters, sentiment scoring, and spam classification) using Groq + Llama 3.3 70B, returning normalized JSON objects to downstream systems for indexing and publishing.
Results: