Skip to content

Instantly share code, notes, and snippets.

View yousuf-hossain-shanto's full-sized avatar
🎯
Focusing

Yousuf Hossain yousuf-hossain-shanto

🎯
Focusing
View GitHub Profile
@yousuf-hossain-shanto
yousuf-hossain-shanto / readme.md
Created January 31, 2025 05:37 — forked from foyzulkarim/readme.md
# Text-to-Speech Pipeline with Kokoro TTS A Python script that converts text into natural-sounding speech using the Kokoro TTS engine. The script processes a transcript file, generates speech segments, and merges them into a single audio file. ## Features: - Reads text from a transcript file - Generates speech segments with customizable voice an…

Text-to-Speech Pipeline with Kokoro TTS

A Python script that converts text into natural-sounding speech using the Kokoro TTS engine. The script processes a transcript file, generates speech segments, and merges them into a single audio file.

Features:

  • Reads text from a transcript file
  • Generates speech segments with customizable voice and speed settings
  • Saves individual audio segments and their corresponding text
  • Merges all audio segments into a single WAV file using FFmpeg
  • Organizes output in timestamped directories