Use when you have a written implementation plan to execute in a separate session with review checkpoints
npx skills add koreyba/Claude-Skill-pdf-to-epub --skill "convert-pdf-to-epub"
Install specific skill from multi-skill repository
# Description
>
# SKILL.md
name: convert-pdf-to-epub
description: >
Convert PDF books to EPUB format for e-readers. Use when user asks to convert PDF to EPUB,
create an e-book from PDF, make PDF readable on Kindle/phone/tablet, or extract book content
from PDF. Handles: chapter detection, image extraction with optimization, footnotes/endnotes
with hyperlinks, reading order for multi-column layouts. Validates conversion quality automatically.
allowed-tools: Read, Write, Edit, Bash, Glob, Grep
PDF to EPUB Converter
Convert PDF documents to high-quality EPUB files with automatic chapter detection, image optimization, and footnote hyperlinking.
Quick Start
# Run conversion (from skill directory)
python -m scripts.convert input.pdf output.epub
# Validate result
python -m scripts.validate input.pdf output.epub
Workflow Overview
The conversion follows a 3-phase process with an optional 4th phase for adaptation:
βββββββββββββββββββ βββββββββββββββββββ βββββββββββββββββββ
β 1. ANALYZE β βββΊ β 2. CONVERT β βββΊ β 3. VALIDATE β
β - PDF structureβ β - Apply config β β - Check qualityβ
β - Generate cfg β β - Build EPUB β β - Report issuesβ
βββββββββββββββββββ βββββββββββββββββββ βββββββββββββββββββ
β
βΌ
βββββββββββββββββββ
β 4. ADAPT β
β (if needed) β
β - Tune config β
β - Modify code β
βββββββββββββββββββ
Phase 1: Analyze
Before converting, analyze the PDF to determine the best configuration:
# Open PDF and examine structure
import fitz # pymupdf
doc = fitz.open("input.pdf")
# Check for:
# 1. Number of pages
# 2. Presence of images
# 3. Multi-column layout (compare text block x-coordinates)
# 4. Footnotes/endnotes (numbers in margins or at page bottom)
# 5. Font sizes (for heading detection thresholds)
Generate initial config based on analysis:
- Fiction book: Use default y_sort reading order
- Academic paper: Enable xy_cut for columns
- Magazine: Enable image optimization, use xy_cut
Ask user to confirm the proposed configuration before proceeding.
Phase 2: Convert
Run the conversion with the generated config:
from conversion.converter import Converter
from conversion.models import ConversionConfig
config = ConversionConfig(
page_ranges=PageRanges(skip=[1, 2], content=(3, -3)),
exclude_regions=ExcludeRegions(top=0.05, bottom=0.05),
reading_order_strategy="y_sort", # or "xy_cut" for columns
image_optimization=ImageOptimizationConfig(enabled=True),
)
converter = Converter(strategy="simple")
result = converter.convert(pdf_path, epub_path, config)
# Check confidence
if result.reading_order_confidence < 0.7:
print("Warning: Low confidence in reading order")
Phase 3: Validate
Always validate the conversion result:
from validation.completeness_checker import CompletenessChecker
from validation.order_checker import OrderChecker
# Check text completeness
completeness = CompletenessChecker().check(pdf_text, epub_text)
print(f"Completeness: {completeness.score:.1%}") # Should be > 95%
# Check reading order
order = OrderChecker().check(pdf_chunks, epub_chunks)
print(f"Order score: {order.score:.1%}") # Should be > 80%
Quality gates:
- Completeness < 95%: Text is being lost
- Order score < 80%: Reading order is wrong
Phase 4: Adapt (if validation fails)
See Decision Tree below.
Decision Tree: When Things Go Wrong
Validation failed?
β
βββΊ Text loss > 5%?
β βββΊ Check exclude_regions (headers/footers being cut?)
β β β Try: exclude_regions.top: 0.03 (reduce from 0.05)
β βββΊ Check page_ranges (skipping too many pages?)
β β β Try: page_ranges.skip: [] (don't skip any)
β βββΊ Still failing? β See reference/troubleshooting.md#text-loss
β
βββΊ Wrong reading order?
β βββΊ PDF has columns?
β β β Try: reading_order_strategy: "xy_cut"
β βββΊ Columns detected but wrong?
β β β Try: multi_column.threshold: 0.3 (more sensitive)
β βββΊ Still failing? β See reference/troubleshooting.md#order
β
βββΊ Headings not detected?
β βββΊ Headings only slightly larger than body?
β β β Try: heading_detection.font_size_threshold: 1.1
β βββΊ Custom font patterns?
β β β May need to modify structure_classifier.py (ADAPTABLE)
β
βββΊ Footnotes not linking?
β βββΊ Non-standard format (not [1] or (1))?
β β β Add pattern to FootnoteDetector.PATTERNS
β βββΊ See reference/troubleshooting.md#footnotes
β
βββΊ Other issue?
βββΊ See reference/troubleshooting.md
Three-Layer Architecture
The codebase is organized into three layers with different modification policies:
Layer 1: FROZEN (Do Not Modify)
These files implement fixed specifications or deterministic algorithms:
| File | Reason |
|---|---|
core/epub_builder.py |
EPUB3 spec is fixed |
core/text_segmenter.py |
Validation depends on identical chunking |
validation/* |
Metrics must be reproducible |
Never modify these files unless there's a fundamental bug.
Layer 2: CONFIGURABLE (Try Config First)
Before changing code, try adjusting configuration:
ConversionConfig:
βββ page_ranges # Which pages to process
βββ exclude_regions # Margins to ignore (headers/footers)
βββ multi_column # Column detection settings
βββ reading_order_strategy # "y_sort" or "xy_cut"
βββ heading_detection # Font size thresholds
βββ footnote_processing # Footnote patterns
βββ image_optimization # Compression settings
βββ metadata # Title, author, language
See reference/config-tuning.md for all parameters.
Layer 3: ADAPTABLE (Can Modify If Config Fails)
These files contain heuristics that may need tuning for specific PDFs:
| File | What You Can Modify |
|---|---|
conversion/strategies/* |
Create new strategy subclass |
detectors/structure_classifier.py |
Heading detection heuristics |
detectors/reading_order/* |
Add custom sorter algorithm |
detectors/footnote_detector.py |
Add new footnote patterns |
See reference/code-adaptation.md for guidelines.
Project Structure
<skill-directory>/
βββ SKILL.md # This file
βββ requirements.txt # Python dependencies
βββ core/ # FROZEN: Core algorithms
β βββ epub_builder.py # EPUB3 file creation
β βββ pdf_extractor.py # PDF text/image extraction
β βββ text_segmenter.py # Deterministic chunking
β βββ image_optimizer.py # Image compression
β
βββ conversion/ # Main conversion logic
β βββ converter.py # Orchestrator
β βββ models.py # Data classes & configs
β βββ strategies/ # ADAPTABLE: Conversion strategies
β β βββ base_strategy.py # Template method pattern
β β βββ simple_strategy.py
β βββ detectors/ # ADAPTABLE: Detection heuristics
β βββ structure_classifier.py
β βββ reading_order/
β βββ footnote_detector.py
β βββ endnote_formatter.py
β
βββ validation/ # FROZEN: Quality checking
β βββ completeness_checker.py
β βββ order_checker.py
β
βββ scripts/ # CLI entry points
β βββ analyze.py
β βββ convert.py
β βββ validate.py
β
βββ reference/ # Documentation
β βββ workflow.md
β βββ architecture.md
β βββ troubleshooting.md
β βββ config-tuning.md
β βββ code-adaptation.md
β
βββ examples/ # Example configurations
βββ fiction-simple.json
βββ academic-multicol.json
βββ magazine-images.json
Example Configurations
Fiction Book (simple layout)
{
"page_ranges": {"skip": [1, 2], "content": [3, -3]},
"exclude_regions": {"top": 0.05, "bottom": 0.05},
"reading_order_strategy": "y_sort",
"heading_detection": {"font_size_threshold": 1.2}
}
Academic Paper (2-column)
{
"page_ranges": {"skip": [1], "content": [2, -1]},
"exclude_regions": {"top": 0.08, "bottom": 0.08},
"reading_order_strategy": "xy_cut",
"multi_column": {"enabled": true, "threshold": 0.4}
}
Magazine (images + columns)
{
"reading_order_strategy": "xy_cut",
"multi_column": {"enabled": true, "column_count": 2},
"image_optimization": {
"enabled": true,
"max_width": 800,
"jpeg_quality": 75
}
}
Reference Documentation
For detailed information, see:
- Workflow Details - Complete phase-by-phase guide
- Architecture - Three-layer system explanation
- Troubleshooting - Common problems and solutions
- Config Tuning - All configuration parameters
- Code Adaptation - When and how to modify code
Common Commands
# Full conversion with validation (from skill directory)
python -m scripts.convert input.pdf output.epub && \
python -m scripts.validate input.pdf output.epub
# Analyze PDF structure
python -m scripts.analyze input.pdf
Quality Metrics
After conversion, always check:
| Metric | Good | Warning | Bad |
|---|---|---|---|
| Text completeness | > 98% | 95-98% | < 95% |
| Reading order | > 90% | 80-90% | < 80% |
| Confidence | > 0.8 | 0.6-0.8 | < 0.6 |
If any metric is in "Warning" or "Bad" range, follow the Decision Tree above.
# Supported AI Coding Agents
This skill is compatible with the SKILL.md standard and works with all major AI coding agents:
Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.