elebumm/RedditVideoMakerBot

Item: elebumm/RedditVideoMakerBot
Rating: 3
Author: RepoPilot

Create Reddit Videos with just✨ one command ✨

Mixed

Mixed signals — read the receipts

weakest axis

Use as dependencyConcerns

copyleft license (GPL-3.0) — review compatibility; no tests detected

Fork & modifyHealthy

Has a license, tests, and CI — clean foundation to fork and modify.

Learn fromHealthy

Documented and popular — useful reference codebase to read through.

Deploy as-isHealthy

No critical CVEs, sane security posture — runnable as-is.

✓Last commit 1d ago
✓8 active contributors
✓GPL-3.0 licensed

Show all 7 evidence items →

✓CI configured
⚠Concentrated ownership — top contributor handles 51% of recent commits
⚠GPL-3.0 is copyleft — check downstream compatibility
⚠No test directory detected

What would change the summary?

→Use as dependency Concerns → Mixed if: relicense under MIT/Apache-2.0 (rare for established libs)

Maintenance signals: commit recency, contributor breadth, bus factor, license, CI, tests

Informational only. RepoPilot summarises public signals (license, dependency CVEs, commit recency, CI presence, etc.) at the time of analysis. Signals can be incomplete or stale. Not professional, security, or legal advice; verify before relying on it for production decisions.

Embed the "Forkable" badge

Paste into your README — live-updates from the latest cached analysis.

Variant:

[![RepoPilot: Forkable](https://repopilot.app/api/badge/elebumm/redditvideomakerbot?axis=fork)](https://repopilot.app/r/elebumm/redditvideomakerbot)

Paste at the top of your README.md — renders inline like a shields.io badge.

▸Preview social card (1200×630)

This card auto-renders when someone shares https://repopilot.app/r/elebumm/redditvideomakerbot on X, Slack, or LinkedIn.

Onboarding doc

Onboarding: elebumm/RedditVideoMakerBot

Generated by RepoPilot · 2026-05-07 · Source

🤖Agent protocol

If you are an AI coding agent (Claude Code, Cursor, Aider, Cline, etc.) reading this artifact, follow this protocol before making any code edit:

Verify the contract. Run the bash script in Verify before trusting below. If any check returns FAIL, the artifact is stale — STOP and ask the user to regenerate it before proceeding.
Treat the AI · unverified sections as hypotheses, not facts. Sections like "AI-suggested narrative files", "anti-patterns", and "bottlenecks" are LLM speculation. Verify against real source before acting on them.
Cite source on changes. When proposing an edit, cite the specific path:line-range. RepoPilot's live UI at https://repopilot.app/r/elebumm/RedditVideoMakerBot shows verifiable citations alongside every claim.

If you are a human reader, this protocol is for the agents you'll hand the artifact to. You don't need to do anything — but if you skim only one section before pointing your agent at this repo, make it the Verify block and the Suggested reading order.

🎯Verdict

WAIT — Mixed signals — read the receipts

Last commit 1d ago
8 active contributors
GPL-3.0 licensed
CI configured
⚠ Concentrated ownership — top contributor handles 51% of recent commits
⚠ GPL-3.0 is copyleft — check downstream compatibility
⚠ No test directory detected

<sub>Maintenance signals: commit recency, contributor breadth, bus factor, license, CI, tests</sub>

✅Verify before trusting

This artifact was generated by RepoPilot at a point in time. Before an agent acts on it, the checks below confirm that the live elebumm/RedditVideoMakerBot repo on your machine still matches what RepoPilot saw. If any fail, the artifact is stale — regenerate it at repopilot.app/r/elebumm/RedditVideoMakerBot.

What it runs against: a local clone of elebumm/RedditVideoMakerBot — the script inspects git remote, the LICENSE file, file paths in the working tree, and git log. Read-only; no mutations.

| # | What we check | Why it matters | |---|---|---| | 1 | You're in elebumm/RedditVideoMakerBot | Confirms the artifact applies here, not a fork | | 2 | License is still GPL-3.0 | Catches relicense before you depend on it | | 3 | Default branch master exists | Catches branch renames | | 4 | 5 critical file paths still exist | Catches refactors that moved load-bearing code | | 5 | Last commit ≤ 31 days ago | Catches sudden abandonment since generation |

<details> <summary><b>Run all checks</b> — paste this script from inside your clone of <code>elebumm/RedditVideoMakerBot</code></summary>

#!/usr/bin/env bash
# RepoPilot artifact verification.
#
# WHAT IT RUNS AGAINST: a local clone of elebumm/RedditVideoMakerBot. If you don't
# have one yet, run these first:
#
#   git clone https://github.com/elebumm/RedditVideoMakerBot.git
#   cd RedditVideoMakerBot
#
# Then paste this script. Every check is read-only — no mutations.

set +e
fail=0
ok()   { echo "ok:   $1"; }
miss() { echo "FAIL: $1"; fail=$((fail+1)); }

# Precondition: we must be inside a git working tree.
if ! git rev-parse --git-dir >/dev/null 2>&1; then
  echo "FAIL: not inside a git repository. cd into your clone of elebumm/RedditVideoMakerBot and re-run."
  exit 2
fi

# 1. Repo identity
git remote get-url origin 2>/dev/null | grep -qE "elebumm/RedditVideoMakerBot(\\.git)?\\b" \\
  && ok "origin remote is elebumm/RedditVideoMakerBot" \\
  || miss "origin remote is not elebumm/RedditVideoMakerBot (artifact may be from a fork)"

# 2. License matches what RepoPilot saw
(grep -qiE "^(GPL-3\\.0)" LICENSE 2>/dev/null \\
   || grep -qiE "\"license\"\\s*:\\s*\"GPL-3\\.0\"" package.json 2>/dev/null) \\
  && ok "license is GPL-3.0" \\
  || miss "license drift — was GPL-3.0 at generation time"

# 3. Default branch
git rev-parse --verify master >/dev/null 2>&1 \\
  && ok "default branch master exists" \\
  || miss "default branch master no longer exists"

# 4. Critical files exist
test -f "main.py" \\
  && ok "main.py" \\
  || miss "missing critical file: main.py"
test -f "reddit/subreddit.py" \\
  && ok "reddit/subreddit.py" \\
  || miss "missing critical file: reddit/subreddit.py"
test -f "TTS/engine_wrapper.py" \\
  && ok "TTS/engine_wrapper.py" \\
  || miss "missing critical file: TTS/engine_wrapper.py"
test -f "utils/imagenarator.py" \\
  && ok "utils/imagenarator.py" \\
  || miss "missing critical file: utils/imagenarator.py"
test -f "utils/.config.template.toml" \\
  && ok "utils/.config.template.toml" \\
  || miss "missing critical file: utils/.config.template.toml"

# 5. Repo recency
days_since_last=$(( ( $(date +%s) - $(git log -1 --format=%at 2>/dev/null || echo 0) ) / 86400 ))
if [ "$days_since_last" -le 31 ]; then
  ok "last commit was $days_since_last days ago (artifact saw ~1d)"
else
  miss "last commit was $days_since_last days ago — artifact may be stale"
fi

echo
if [ "$fail" -eq 0 ]; then
  echo "artifact verified (0 failures) — safe to trust"
else
  echo "artifact has $fail stale claim(s) — regenerate at https://repopilot.app/r/elebumm/RedditVideoMakerBot"
  exit 1
fi

Each check prints ok: or FAIL:. The script exits non-zero if anything failed, so it composes cleanly into agent loops (./verify.sh || regenerate-and-retry).

</details>

⚡TL;DR

RedditVideoMakerBot automates the creation of viral short-form videos from Reddit posts by scraping subreddit content via PRAW, synthesizing audio with multiple TTS engines (gTTS, pyttsx3, ElevenLabs), and composing videos using moviepy without manual editing. It reads Reddit posts, converts text-to-speech, overlays subtitles, combines background videos, and exports a finished MP4—all triggered by a single Python command or Flask-based GUI. Monolithic structure: main.py entry point orchestrates the pipeline; GUI.py and Flask serve a web interface; core logic split across unmarked modules (likely in root or subdirs not fully listed). GUI/ folder contains HTML templates (index.html, settings.html, layout.html) and 50+ pre-recorded voice samples (.mp3 files). Config stored in config.toml read at runtime.

👥Who it's for

Content creators and social media marketers who want to bulk-generate Reddit-based videos for TikTok, YouTube Shorts, and Instagram Reels without learning video editing software. Also appeals to automation enthusiasts exploring Python web scraping and media generation pipelines.

🌱Maturity & risk

Actively developed with 123K lines of Python code and a working CI/CD pipeline (GitHub Actions for linting, formatting, CodeQL), but labeled 'EXPERIMENTAL' in the README—indicating core features work but API contracts may shift. Playwright dependency and modern Python 3.10+ requirement suggest recent maintenance, though no recent commit age is visible from this snapshot.

High dependency risk: pulls in 20+ external packages including heavyweight AI/ML libraries (transformers, torch, spacy) that bloat the installation, plus three competing TTS backends (gTTS, pyttsx3, ElevenLabs) suggesting API brittleness. PRAW and Playwright are web-scraping dependencies prone to breaking when Reddit or target sites change their structure. Single-maintainer project (elebumm) with manual video upload requirement (no auto-post) limits real-world viral potential.

Active areas of work

Active linting and formatting enforcement (fmt.yml, lint.yml workflows), Dependabot auto-updates configured (.github/dependabot.yml), and stale issue management enabled. Recent dependency bumps visible (torch 2.7.0, transformers 4.52.4, yt-dlp 2025.10.22) suggest ongoing maintenance for breaking changes in upstream libraries.

🚀Get running

git clone https://github.com/elebumm/RedditVideoMakerBot.git
cd RedditVideoMakerBot
python3 -m venv ./venv
source ./venv/bin/activate  # or .\venv\Scripts\activate on Windows
pip install -r requirements.txt
python -m playwright install
python -m playwright install-deps
python main.py

Daily commands:

python main.py                    # CLI-driven interactive mode
# OR
python GUI.py                     # Flask web server (check code for default port, likely 5000)

🗺️Map of the codebase

main.py — Entry point orchestrating the entire Reddit video generation pipeline; every contributor must understand the main workflow.
reddit/subreddit.py — Handles Reddit API integration via PRAW to fetch posts; core to sourcing content for video generation.
TTS/engine_wrapper.py — Abstraction layer for multiple TTS engines (gTTS, TikTok, AWS Polly, ElevenLabs, OpenAI); critical for voice generation flexibility.
utils/imagenarator.py — Generates image overlays and captions; essential for video composition and visual rendering.
utils/.config.template.toml — Template for configuration including API keys, TTS engine selection, and video parameters; required for setup.
GUI.py — Flask-based GUI for user interaction; alternative to CLI for configuring and launching video generation.
requirements.txt — All Python dependencies including moviepy, playwright, praw, and TTS libraries; essential for environment setup.

🛠️How to make changes

Add a New TTS Engine

Create new TTS provider file (e.g., TTS/my_tts.py) inheriting from abstract interface (TTS/my_tts.py)
Implement required methods: generate_speech(), get_supported_voices(), validate_settings() (TTS/my_tts.py)
Register engine in TTS/engine_wrapper.py switch/factory logic (TTS/engine_wrapper.py)
Add config template entry in utils/.config.template.toml for engine-specific settings (utils/.config.template.toml)
Test integration with main.py audio generation pipeline (main.py)

Add a New Background Video/Audio Source

Add entry to utils/background_videos.json or utils/background_audios.json with URL and metadata (utils/background_videos.json)
Update video/audio selection logic in main.py pipeline to reference new asset (main.py)
Test download and integration via yt-dlp in the rendering pipeline (main.py)

Add a New Subreddit/Content Filter

Extend reddit/subreddit.py with new query method or filter logic (reddit/subreddit.py)
Add configuration parameters in utils/.config.template.toml (subreddit name, sort order, time range) (utils/.config.template.toml)
Integrate new subreddit selector into main.py content-fetching section (main.py)

🔧Why these technologies

PRAW (Python Reddit API Wrapper) — Abstracts Reddit API authentication and post fetching; enables easy subreddit targeting and filtering
moviepy — Programmatic video composition without external tools; combines audio, images, and background video seamlessly
Playwright — Headless browser automation for capturing dynamic web content (Reddit comments, user posts) as screenshots
spaCy + transformers + torch — NLP pipeline for text summarization, entity extraction, and semantic understanding of Reddit posts
Multiple TTS engines (gTTS, TikTok, AWS Polly, ElevenLabs, OpenAI) — Flexibility in voice quality, language support, and cost; allows users to choose based on budget and output preference
Flask — Lightweight web framework for GUI; enables non-technical users to configure and launch video generation without CLI

⚖️Trade-offs already made

Multiple TTS engines supported via abstraction layer
- Why: No single TTS engine is optimal for all use cases (cost, voice quality, languages, speed)
- Consequence: Increased complexity in engine_wrapper.py and config management; slower development when adding new engines
Full video rendering with moviepy instead of shell commands
- Why: Pure Python solution avoids external FFmpeg dependency; enables GUI integration and cross-platform compatibility
- Consequence: Slower rendering than optimized FFmpeg; higher memory usage for large videos
Playwright for dynamic content capture instead of static scraping
- Why: Ensures JavaScript-rendered content (comment threads, dynamic elements) is correctly captured
- Consequence: Slower than static scraping; requires browser automation overhead
AI-powered text processing (spaCy, transformers) optional/modular
- Why: Enables advanced summarization and entity extraction for better storytelling; not mandatory for basic videos
- Consequence: Adds model download overhead and memory cost; users must install torch/transformers if using advanced features

🚫Non-goals (don't propose these)

Direct posting to TikTok, YouTube, or Instagram (only generates MP4; user must post manually or integrate external APIs)
Real-time streaming of video generation progress (CLI is synchronous; GUI may have progress bars)
Licensed music integration (background audio must be royalty-free; users responsible for legal compliance)
Monetization or content rights management (tool is agnostic to revenue sharing)
Mobile app or native desktop application (GUI is web-based Flask)
Multi-user concurrency or cloud deployment (designed for single-user local execution)

🪤Traps & gotchas

Reddit API credentials: must be manually obtained from https://www.reddit.com/prefs/apps (script-type app); bot prompts for them interactively but will fail silently if invalid. Playwright installation: requires system-level dependencies (install-deps subcommand); fails on minimal Docker images without graphical libraries. Three TTS backends with different API contracts: gTTS is free but rate-limited; pyttsx3 requires offline TTS engine; ElevenLabs needs API key and paid credits. Config persistence: editing config.toml while bot is running can cause conflicts; no locking mechanism visible. FFmpeg binary requirement: moviepy wraps FFmpeg; bot fails if ffmpeg not in PATH (not declared as OS-level dependency in Dockerfile snippet). Python 3.10+ only: .python-version file locks to 3.10; earlier/later versions will encounter breaking changes in transformers/torch.

🏗️Architecture

💡Concepts to learn

Reddit API Authentication (OAuth 2.0 + PRAW) — The bot fetches live subreddit posts via PRAW; understanding OAuth credentials (client_id, client_secret, user_agent) is mandatory to configure the bot and debug API rate-limit / auth errors
[Text-to-Speech (TTS) Engine Abstraction](https://gtts.readthedocs.io/ and https://github.com/nateshmbhat/pyttsx3) — Three competing backends (gTTS, pyttsx3, ElevenLabs) are bundled; choosing and configuring the right one trades off cost, quality, latency, and offline availability—a core decision for video output
Video Composition & Frame Sequencing (moviepy) — Moviepy orchestrates layers (audio track, subtitle overlays, background video, text animations); understanding clip concatenation and timeline composition is essential to modify video aesthetics or add new effects
Web Scraping & Playwright Automation — Playwright headless browser is used (alongside PRAW) to handle dynamic Reddit content and JavaScript-rendered elements; essential for debugging when Reddit's DOM structure changes
[TOML Configuration Format](https://toml.io/en/ and https://github.com/sdispater/tomlkit) — All bot settings (Reddit credentials, video dimensions, TTS choice, output path) persist in config.toml; understanding tomlkit parsing and runtime config mutations is needed to add new settings
FFmpeg Video Encoding Pipeline — moviepy delegates codec selection and bitrate tuning to FFmpeg; understanding FFmpeg flags (codec, CRF quality, audio bitrate) is critical for optimizing video file size and quality trade-offs
[NLP Text Preprocessing (spacy, transformers, clean-text)](https://spacy.io/ and https://huggingface.co/docs/transformers) — Reddit posts contain noise (URLs, emojis, slang); the bot cleans text via spacy tokenization and unidecode normalization before TTS; understanding this pipeline prevents garbled audio output

yt-dlp/yt-dlp — Already a dependency (yt-dlp==2025.10.22); handles video download and metadata extraction for background clips in the composition pipeline
OpenAI/whisper — Inverse use case: speech-to-text for transcription; could be integrated to auto-caption videos if audio source is non-text (e.g., podcast clips)
jiaaro/pydub — Audio manipulation alternative to moviepy; lighter-weight for audio-only pipelines (e.g., isolating voice from background music)
instagrapi/instagrapi — Companion repo for auto-posting to Instagram (the README explicitly disclaims no auto-upload; this library could bridge that gap)
tweepy/tweepy — Similar social media API wrapper to PRAW; could extend bot to auto-post generated videos to Twitter/X feeds

🪄PR ideas

To work on one of these in Claude Code or Cursor, paste: Implement the "<title>" PR idea from CLAUDE.md, working through the checklist as the task list.

Add integration tests for TTS pipeline (GTTS, pyttsx3, elevenlabs)

The repo supports multiple TTS providers (GTTS, pyttsx3, elevenlabs) but there are no visible tests in the file structure. Given the complexity of TTS.GTTS.py and multiple voice options in GUI/voices/, adding tests would catch regressions when switching between providers and ensure voice file integrity.

[ ] Create tests/tts/ directory with test files for each TTS provider
[ ] Add unit tests in tests/tts/test_gtts.py to mock gTTS API calls and verify audio output
[ ] Add unit tests in tests/tts/test_elevenlabs.py to verify elevenlabs integration with sample voices
[ ] Add integration test to verify voice selection from GUI/voices/*.mp3 works correctly
[ ] Update .github/workflows/lint.yml or create tests.yml to run pytest on PR submissions

Add GitHub Action to validate Dockerfile and Docker build on PRs

The Dockerfile exists but there's no CI workflow validating it builds successfully. Given the project supports Docker deployment, PRs could break the Docker build without detection. A workflow would catch dependency conflicts early (boto3, transformers, torch versions).

[ ] Create .github/workflows/docker-build.yml workflow file
[ ] Add step to run 'docker build -t reddit-video-maker:test .' on PR branches
[ ] Add optional step to scan Dockerfile with hadolint for best practices
[ ] Document in CONTRIBUTING.md that Docker builds are validated on all PRs

Add schema validation and tests for config/settings files

The repo uses TOML configs (references toml and tomlkit dependencies) and has a GUI/settings.html but no validation schema. Invalid config files could silently break video generation. Add TOML schema validation and tests to prevent user errors and config-related issues.

[ ] Create a config_schema.py file with Pydantic model(s) for TOML config structure
[ ] Add tests/test_config_validation.py with unit tests for valid/invalid config scenarios
[ ] Add validation logic to load config files that checks against schema before use
[ ] Document required config fields in CONTRIBUTING.md or add config.example.toml
[ ] Reference the schema in README.md under configuration section

🌿Good first issues

Add unit tests for TTS synthesis logic—currently no test/ directory visible; create pytest fixtures that mock PRAW and moviepy to verify gTTS/pyttsx3/ElevenLabs payload generation without hitting external APIs.: Enables confident refactoring of the TTS abstraction layer and prevents regressions when swapping between backends.
Document the config.toml schema—currently README stops mid-sentence ('...delete the li...'); create CONFIG.md listing all valid TOML keys, their types, defaults, and side effects (e.g., which TTS backend is active).: New contributors and users waste time reverse-engineering config by running main.py; written spec reduces support burden.
Refactor voice sample management: instead of hardcoding 50+ .mp3 files in GUI/voices/, create a voices.json manifest listing [{ name, locale, filename, preview_url }] and auto-populate the GUI dropdown from it; add a --list-voices CLI flag.: Adding new voices currently requires manual file placement + frontend code changes; a manifest decouples data from presentation and enables dynamic TTS provider switching (switch from gTTS to ElevenLabs without re-recording).

⭐Top contributors

Click to expand

@JasonLovesDoggo — 51 commits
@dependabot[bot] — 19 commits
@cyteon — 18 commits
@github-actions[bot] — 7 commits
@OpenSourceSimon — 2 commits

📝Recent commits