Audiobook Generation on Jetson Orin Nano

This repository contains scripts and documentation for generating audiobooks on a Jetson Orin Nano using either Piper TTS or Sesame CSM.

Quick Start

The fastest way to get started is to use the quickstart script:

# Make the script executable
chmod +x quickstart.sh

# Run with an EPUB or PDF file
./quickstart.sh --input /path/to/your/book.epub --method piper  # For Piper TTS (faster)
# or
./quickstart.sh --input /path/to/your/book.epub --method sesame # For Sesame CSM (higher quality)

Usage Options

Usage: ./quickstart.sh [options]

Required options:
  --input FILE               Path to input book file (EPUB or PDF)

Optional options:
  --method METHOD            TTS method to use: 'piper' (faster) or 'sesame' (higher quality)
                             Default: piper
  --voice VOICE              Voice preset to use
                             For piper: lessac (default), ryan, jenny, kathleen, alan
                             For sesame: calm (default), excited, authoritative, gentle, narrative
  --chapter-range RANGE      Range of chapters to process (e.g., '1-5')
  --memory-per-chunk SIZE    Memory usage per chunk in MB
  --max-batch-size SIZE      Maximum batch size for processing
  --output-format FORMAT     Output format: mp3 (default), wav, flac
  --help                     Show this help message

Available Scripts

build_container.sh – Build the Docker container image for Sesame TTS.
quickstart.sh – Helper script to set up the environment and start generation.
generate_audiobook_piper.py – Script for generating audiobooks using Piper TTS
generate_audiobook_sesame.py – Script for generating audiobooks using Sesame CSM
extract_chapters.py - Utility script to extract chapters from EPUB/PDF files

Comprehensive Documentation

For complete instructions, options, and troubleshooting, see the full documentation:

Comprehensive Audiobook Generation Plan

Features

Support for both ePub and PDF formats (ePub recommended)
Chapter detection and organization
Progress reporting with time estimates
Memory usage optimization
Resume capability for interrupted processes
Voice model selection
Process specific chapter ranges
Batch processing to manage memory usage
Automatic voice preset discovery

Example Usage

Piper TTS (in jetson-containers)

python generate_audiobook_piper.py \
  --input /books/your_book.epub \
  --output /audiobook_data/audiobook_piper.mp3 \
  --model /opt/piper/voices/en/en_US-lessac-medium.onnx \
  --temp_dir /audiobook_data/temp_audio_piper \
  --max_batch_size 15 \
  --memory_per_chunk 50

Sesame CSM

python generate_audiobook_sesame.py \
  --input ~/audiobook/your_book.epub \
  --output ~/audiobook_data/audiobook_sesame.mp3 \
  --model_path ~/huggingface_models/sesame-csm-1b \
  --voice_preset "calm" \
  --max_batch_size 8 \
  --memory_per_chunk 150 \
  --chapter_range "1-5"

Requirements

Jetson Orin Nano with JetPack/L4T
At least 5GB of available RAM
At least 20GB of free storage space
Internet connection for downloading models
Docker installed for container-based execution

License

This project is open source and available under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 297 Commits
docker/sesame-tts		docker/sesame-tts
docs		docs
scripts		scripts
.dockerignore		.dockerignore
.dockerignore.bak		.dockerignore.bak
.gitignore		.gitignore
BUILD-NOTES.md		BUILD-NOTES.md
README.md		README.md
audiobook-plan.md		audiobook-plan.md
build-validation.md		build-validation.md
build.sh		build.sh
cleanup.sh		cleanup.sh
dependency_graph.dot		dependency_graph.dot
dependency_tree.json		dependency_tree.json
dependency_tree_copy.json		dependency_tree_copy.json
extract_chapters.py		extract_chapters.py
generate_audiobook_piper.py		generate_audiobook_piper.py
generate_audiobook_piper_epub.py		generate_audiobook_piper_epub.py
generate_audiobook_sesame.py		generate_audiobook_sesame.py
generate_audiobook_sesame_epub.py		generate_audiobook_sesame_epub.py
generate_dot_from_pipdeptree.sh		generate_dot_from_pipdeptree.sh
index.md		index.md
json_to_dot.py		json_to_dot.py
json_to_dot_debug.py		json_to_dot_debug.py
json_to_dot_fixed.py		json_to_dot_fixed.py
json_to_dot_new.py		json_to_dot_new.py
quickstart.sh		quickstart.sh
requirements.in		requirements.in
requirements.lock.txt		requirements.lock.txt
test_json_read.py		test_json_read.py
verification-script.sh		verification-script.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audiobook Generation on Jetson Orin Nano

Quick Start

Usage Options

Available Scripts

Comprehensive Documentation

Features

Example Usage

Piper TTS (in jetson-containers)

Sesame CSM

Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Audiobook Generation on Jetson Orin Nano

Quick Start

Usage Options

Available Scripts

Comprehensive Documentation

Features

Example Usage

Piper TTS (in jetson-containers)

Sesame CSM

Requirements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages