handelsblatt logo
News and Media Industry

AI-supported article audio generation

Handelsblatt Media Group

100%

Automation

From text to audio without manual intervention

Near-Human

Voice quality

Natural intonation & stress

24/7

Availability

Scalable audio production

1

Pipeline

Integrated end-to-end solution

Lukas Famula
Lukas FamulaFull Stack Software Developer & AI Engineer

Initial situation

More and more people are consuming content on the go – while commuting, exercising or going about their daily lives. The growing popularity of podcasts clearly shows that demand for audio content has risen dramatically. Editorial teams were faced with the challenge of efficiently converting written articles into this format and making them accessible to a wider audience – especially people who don't have time to read.

Lukas Famula
Lukas FamulaFull Stack Software Developer & AI Engineer

Approach / Idea

As part of the internal project team, I developed a specialised AI system for automatic audio generation from article texts. The focus was on advanced deep learning algorithms optimised for the specific challenges of article audio production – natural intonation, appropriate emphasis and correct pronunciation of technical terminology. Through continuous training, we achieved a quality that is close to that of manual voice recording.

AI & Automation
Solution

AI & Automation

Growth - Key Features

  • Multi-Channel ChatbotsWhatsApp, Telegram & more
  • Voice-AI IntegrationAutomated customer service by phone
  • RAG up to 10,000 documentsNetworked knowledge databases
  • CRM & ERP IntegrationCross-team workflows
  • Simple AI AgentsFirst intelligent automation
2024
October 2024Market launch

Launch

Working closely with the internal project team, we developed a fully automated audio pipeline that was seamlessly integrated into existing editorial systems. The biggest challenge was finding the right balance between processing speed and audio quality. Through iterative optimisation of the deep learning models and intensive training with subject-specific texts, we achieved production-quality speech synthesis that reproduces even complex terminology in a natural and understandable way.

Fully automated audio pipeline with deep learning-based speech synthesis in production quality
Lukas Famula
Lukas FamulaFull Stack Software Developer & AI Engineer

The Result

Editors and content creators can now efficiently convert their content into high-quality audio formats. The system delivers natural-sounding speech output with correct intonation and terminology, opening up new target groups for text-based content without additional manual effort.