Video animations and tutorial compositions for the ArtiVisi YouTube channel.

```mermaid
flowchart TD
subgraph PRE["1. Pre-Production"]
A1[Create Episode Outline] --> A2[Generate Printable Script]
A2 --> A3[Prepare Code Samples]
end
subgraph REC["2. Recording"]
B1[Record Screen - QuickTime] --> B3[Dump to External Drive]
B2[Record Camera - Nikon ZFC] --> B3
end
subgraph POST["3. Post-Production with AI Assistant"]
direction TB
C1[Remove Silence] --> C2[Generate Transcript]
C2 --> C3[Extract Cursor Positions]
C3 --> C4[Generate Zoom Keyframes]
C2 --> C5[AI Suggests Infographics]
C5 --> C6[Create/Edit Infographics]
C2 --> C7[Configure Camera Switches]
C7 --> C8[Define PIP vs Fullscreen Segments]
end
subgraph COMP["4. Composition"]
D1[Create Episode Composition] --> D2[Add Intro Bumper]
D2 --> D3[Layer Screen + Webcam]
D3 --> D4[Apply Zoom Keyframes]
D4 --> D5[Add Subtitles]
D5 --> D6[Insert Infographics]
D6 --> D7[Add Lower Thirds]
D7 --> D8[Add Outro Bumper]
end
subgraph REVIEW["5. Review & Iterate"]
E1[Preview in Browser] --> E2{Looks Good?}
E2 -->|No| E3[Adjust Timing/Content]
E3 --> E1
E2 -->|Yes| E4[Ready to Render]
end
subgraph PUBLISH["6. Render & Publish"]
F1[Render Final Video] --> F2[AI Generate Thumbnail]
F2 --> F2b[Resize to 1280x720]
F2b --> F3[Generate Metadata]
F3 --> F4[Upload to YouTube]
end
PRE --> REC
REC --> POST
POST --> COMP
COMP --> REVIEW
REVIEW --> PUBLISH
```

| Step | What You Do | What AI Does |
|---|---|---|
| Outline | Provide topic & key points | Generate structured outline with timestamps |
| Recording | Record camera + screen | - |
| Silence Removal | Run script | Detect & remove silent sections |
| Transcription | Run Whisper | Generate timestamped transcript |
| Zoom/Pan | Run cursor extraction | Auto-generate zoom keyframes from cursor movement |
| B-Roll Assets | Review & approve | Create infographics, animations, diagrams, screenshots from transcript; insert at appropriate timestamps |
| Camera Switching | Define preferences | Generate switch points based on content type |
| Subtitles | Review output | Generate from transcript |
| Render | Run command | - |
| Thumbnail | Provide prompt | Generate image, resize to 1280x720 |
| Upload | Provide credentials | Generate title, description, tags from outline |
```
src/
├── animations/                    # Standalone bumper animations
│   ├── vlog-intro/                # "NGOPI DULU" intro bumper
│   ├── lower-third/               # Name/title overlay
│   ├── transition/                # "artivisi" letter animation
│   ├── outro/                     # End screen
│   └── programming-fundamentals/  # PF series intro/outro
│       ├── PFIntro.tsx            # Dark theme, typing animation, language icons
│       └── PFOutro.tsx            # Thank you, next episode preview, subscribe
├── components/                    # Reusable tutorial components
│   ├── VideoLowerThird.tsx
│   ├── WebcamOverlay.tsx
│   ├── ZoomPan.tsx
│   ├── CodeHighlight.tsx
│   ├── Subtitles.tsx
│   └── Infographic.tsx
├── tutorials/                     # Tutorial video series
│   └── programming-fundamentals/  # 31-episode series
│       ├── SERIES_PLAN.md         # Full curriculum overview
│       ├── compositions/          # Full episode compositions
│       │   ├── index.ts
│       │   ├── PF01Composition.tsx
│       │   ├── PF02Composition.tsx
│       │   └── PF03Composition.tsx
│       ├── components/            # Animated diagrams (EP01-, EP02-, etc.)
│       ├── transcript-types.ts    # Transcript & SubtitleCue types
│       ├── pf-01-transcript.ts    # Whisper transcripts
│       ├── pf-02-transcript.ts
│       ├── pf-02-sync-screen1.ts  # Camera/screen sync offsets
│       ├── pf-01-edl.ts           # Silence detection EDLs
│       └── types.ts               # VideoOutline type
└── assets/
    ├── audio/                     # Sound effects (typing, swoosh, static, etc.)
    ├── icons/
    └── logos/

footage/                           # Symlink to external drive
└── programming-fundamentals/
    ├── pf-01-camera.mov
    ├── pf-02-camera-1.mov
    ├── pf-02-screen-1.mov
    └── ...
```

```bash
npm install
npm start # Preview in browser
npm run build    # Bundle for rendering

# General bumpers
npx remotion render VlogIntro out/vlog-intro.mp4
npx remotion render LowerThird out/lower-third.mp4
npx remotion render Transition out/transition.mp4
npx remotion render Outro out/outro.mp4
# Programming Fundamentals series
npx remotion render PFIntro out/pf-intro.mp4
npx remotion render PFOutro out/pf-outro.mp4
```

Keep the project on the laptop and store large video files on an external drive:
```bash
# Create folders on external drive
mkdir -p "/path/to/external/drive/seri-programming-fundamental/rendered"
# Create symlinks in footage/ folder (for transcript processing scripts)
mkdir -p footage/programming-fundamentals
ln -s "/path/to/external/drive/seri-programming-fundamental/ep-01/camera/DSC_8013.MOV" \
  footage/programming-fundamentals/pf-01-camera.mov
```

Video file access for Remotion:
The project uses `Config.setPublicDir()` in `remotion.config.ts` to point directly to the external drive:
```ts
// remotion.config.ts
Config.setPublicDir("/path/to/external/drive/seri-programming-fundamental");
```

This allows compositions to use `staticFile()` with relative paths:
```ts
// src/tutorials/programming-fundamentals/video-paths.ts
import { staticFile } from "remotion";
export const VIDEO_PATHS = {
"pf-01-camera": staticFile("ep-01/camera/DSC_8013.MOV"),
"pf-02-camera-1": staticFile("ep-02/camera/DSC_8014.MOV"),
// ...
} as const;

// Assumed helper used by the compositions below
export const getVideoPath = (key: keyof typeof VIDEO_PATHS) => VIDEO_PATHS[key];
// In composition:
import { Video } from "@remotion/media";
<Video src={getVideoPath("pf-01-camera")} />
```

Note: Ensure the external drive is mounted before starting Remotion Studio or rendering.
```bash
# Copy template
cp src/tutorials/programming-fundamentals/_template.ts \
src/tutorials/programming-fundamentals/pf-01.ts
# Generate printable outline for recording
node scripts/generate-outline.mjs src/tutorials/programming-fundamentals/pf-01.ts
```

Recording setup:
- Camera: Nikon ZFC → records to SD card as `.MOV`
- Screen: QuickTime Player → Screen Recording → saves as `.mov`

Output files:
- Camera recording: `footage/programming-fundamentals/pf-01-camera.mov`
- Screen recording: `footage/programming-fundamentals/pf-01-screen.mov`
For episodes with multiple recordings:
- `pf-02-camera-1.mov`, `pf-02-camera-2.mov` (multiple camera takes)
- `pf-02-screen-1.mov`, `pf-02-screen-2.mov` (multiple screen sessions)
Naming convention: `{series}-{episode}-{type}[-{number}].{ext}`
- series: `pf` for Programming Fundamentals
- episode: `01`, `02`, etc.
- type: `camera` or `screen`
- number: optional, for multiple recordings of the same type
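
Before processing, it can help to sanity-check filenames against this convention. A throwaway snippet (not part of the repo's scripts), assuming footage lives under `footage/programming-fundamentals/`:

```bash
# Flag any footage file that does not match {series}-{episode}-{type}[-{number}].{ext}
for f in footage/programming-fundamentals/*; do
  [[ $(basename "$f") =~ ^pf-[0-9]{2}-(camera|screen)(-[0-9]+)?\.(mov|mp4)$ ]] \
    || echo "non-conforming name: $f"
done
```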
Once footage is dumped to the external drive, start a session with Claude:

"I have recorded footage for Programming Fundamentals Episode 1.
Files are in footage/programming-fundamentals/pf-01-camera.mov and pf-01-screen.mov.
Let's process and create the video."
IMPORTANT: Before processing, list ALL files in the episode folder to ensure nothing is missed.
```bash
# List all footage files
ls -la /path/to/external/drive/seri-programming-fundamental/ep-XX/camera/
ls -la /path/to/external/drive/seri-programming-fundamental/ep-XX/screen/
# Example structure:
# ep-02/
# ├── camera/
# │   ├── DSC_8014.MOV → main recording (part 1)
# │   └── DSC_8015.MOV → continued recording (part 2)
# └── screen/
#     ├── Screen Recording 1.mov → main screen
#     └── Screen Recording 2.mov → continued after restart
```

Why multiple files?
- Camera has FAT filesystem limit: 4GB or 30 minutes max per file
- Camera auto-stops and restarts, creating new files
- Screen recording may restart if software crashes or needs reset
- All parts must be processed and stitched together (one stitching option is sketched below)
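
If you do want a single stitched file before processing (the pipeline here normally keeps parts separate and syncs them by transcript), FFmpeg's concat demuxer joins parts losslessly. A sketch using the example filenames above:

```bash
# Order matters: list camera parts chronologically
cat > parts.txt <<'EOF'
file 'DSC_8014.MOV'
file 'DSC_8015.MOV'
EOF

# -c copy joins the streams without re-encoding
ffmpeg -f concat -safe 0 -i parts.txt -c copy pf-02-camera-full.mov
```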
Checklist for each episode:
- List ALL camera files (FAT limit splits long recordings)
- List ALL screen recordings (may have restarts)
- Create symlinks for EACH file in the `footage/` folder
- Create symlinks for EACH file in the `public/` folder (for Remotion preview)
- Transcribe EACH camera and screen file
- Check ASSETS.md for premade animated diagrams
- Check for screenshots in `src/tutorials/programming-fundamentals/assets/epXX/`
- Note any dead-time sections that need cutting (screen restarts, etc.)
- Sync screen recordings to the camera timeline using `sync-transcripts.mjs`
Common mistakes:
- Missing continuation footage (camera splits due to FAT limit)
- Forgetting to add symlinks to the `public/` folder (needed for Remotion preview)
- Not transcribing all footage files
- Missing animated assets listed in ASSETS.md
- Not checking for screenshots in episode assets folder
The AI assistant will guide you through:
```bash
# Quick: remove silence and output new file
./scripts/remove-silence.sh footage/programming-fundamentals/pf-01-screen.mp4
# Or for more control: detect first, review, then apply
./scripts/detect-silence.sh footage/programming-fundamentals/pf-01-screen.mp4
# Review the -silence.txt file, then:
node scripts/detect-silence.mjs footage/programming-fundamentals/pf-01-screen-silence.txt \
  --output src/tutorials/programming-fundamentals/pf-01-edl.ts
```
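
The resulting `pf-01-edl.ts` holds keep/cut ranges the composition uses to skip silence. The actual shape is defined by `detect-silence.mjs`; a plausible sketch, with field names as assumptions:

```ts
// pf-01-edl.ts (hypothetical shape -- check the generated file)
export interface EdlSegment {
  start: number; // seconds into the source footage
  end: number;
  keep: boolean; // false = detected silence, skipped in the composition
}

export const pf01Edl: EdlSegment[] = [
  { start: 0, end: 12.4, keep: true },
  { start: 12.4, end: 15.1, keep: false }, // silence
  // ...
];
```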
```bash
# Activate Python venv
source .venv/bin/activate
# Transcribe with Whisper (Indonesian)
./scripts/transcribe-with-whisper.sh footage/programming-fundamentals/pf-01-camera.mov \
--model medium --language id
# Convert to Remotion format
node scripts/process-transcript.mjs \
footage/programming-fundamentals/pf-01-camera.json \
  src/tutorials/programming-fundamentals/pf-01-transcript.ts
```

When you have separate camera and screen recordings, sync them using transcript matching:
```bash
# First transcribe both recordings
./scripts/transcribe-with-whisper.sh footage/programming-fundamentals/pf-02-camera-1.mov
./scripts/transcribe-with-whisper.sh footage/programming-fundamentals/pf-02-screen-1.mov
# Sync by matching spoken phrases between recordings
node scripts/sync-transcripts.mjs \
footage/programming-fundamentals/pf-02-camera-1.json \
footage/programming-fundamentals/pf-02-screen-1.json \
  src/tutorials/programming-fundamentals/pf-02-sync-screen1.ts
```

The sync script outputs:
- `offset`: time difference in seconds (add to screen time to get camera time)
- `confidence`: match quality (0-1)
- `screenToCamera()`: helper to convert screen timestamps to the camera timeline (see the sketch below)
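
Putting those three outputs together, the generated sync module looks roughly like this (offset and confidence values are illustrative):

```ts
// pf-02-sync-screen1.ts (sketch of the generated module)
export const pf_02_camera_1_pf_02_screen_1_sync = {
  offset: 42.7, // seconds; cameraTime = screenTime + offset
  confidence: 0.93, // 0-1 match quality
  screenToCamera: (screenTime: number) => screenTime + 42.7,
};
```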
Example usage in composition:
```ts
import { pf_02_camera_1_pf_02_screen_1_sync } from "../pf-02-sync-screen1";
// Screen recording starts at this camera time
const SCREEN1_START_CAMERA_TIME = pf_02_camera_1_pf_02_screen_1_sync.offset;
```

```bash
# Extract cursor positions
python scripts/extract-cursor.py footage/programming-fundamentals/pf-01-screen.mp4
# Generate zoom keyframes
node scripts/generate-zoom-keyframes.mjs \
footage/programming-fundamentals/pf-01-screen.cursor.json \
  --output src/tutorials/programming-fundamentals/pf-01-zoom.ts
```

Ask the AI assistant:
"Based on the transcript, suggest infographics (slides, bullet points, callouts)
that would enhance the video. Focus on key concepts and important tips."
The AI will analyze the transcript and generate infographic cues.
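
Those cues can be captured in the shape the `Infographic` component consumes (see the component reference below); the frame values here are assumptions:

```ts
// Hypothetical cue list distilled from the AI's suggestions
const infographicCues = [
  {
    showAt: 300, // frame where the concept is first mentioned
    hideAt: 600,
    content: {
      type: "bullet-list",
      title: "Key Concepts",
      items: ["Variables", "Data Types", "Functions"],
    },
  },
];
```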
Ask the AI assistant:
"Based on the transcript and outline, determine when to show:
- Webcam fullscreen (introductions, explanations, conclusions)
- Screen with webcam PIP (coding, demos)
- Screen only (complex code, diagrams)"
The AI will generate camera switch cues based on content analysis.
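
A convenient way to store the result is a typed cue list; the field names below are illustrative, not a fixed repo format:

```ts
// Hypothetical camera-switch cue format
type CameraMode = "webcam-full" | "screen-pip" | "screen-only";

interface CameraSwitchCue {
  startFrame: number;
  mode: CameraMode;
}

const cameraSwitches: CameraSwitchCue[] = [
  { startFrame: 0, mode: "webcam-full" },    // introduction
  { startFrame: 1800, mode: "screen-pip" },  // coding demo at 1:00
  { startFrame: 5400, mode: "screen-only" }, // complex diagram at 3:00
];
```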
For talking-head episodes without screen recordings, add visual variety with animated diagrams and infographics:
"Analyze the transcript and create b-roll components to keep audiences engaged.
Identify key topics and timestamps where visuals would help explain concepts."
B-Roll component workflow:
- AI analyzes transcript to identify visual opportunities
- Create components in `src/tutorials/programming-fundamentals/components/`
- Export from `components/index.ts`
- Add to composition with appropriate timing
Example B-Roll types:
- `AIToolsShowcase` - Display tool logos/cards (ChatGPT, Claude, etc.)
- `PromptExampleCard` - Show good/bad prompt examples
- `AILearningCycle` - Flowchart/diagram of the learning process
- `UseEnglishTip` - Two-column tips layout
- `AIDosDonts` - Do/Don't comparison lists
B-Roll timing guidelines:
- Match b-roll start time to when topic is mentioned in transcript
- Duration: 30-60 seconds per visual (enough time to read/absorb)
- Avoid rapid switching - let visuals breathe
- Cover 50-70% of talking-head content with b-roll for engagement
Example composition with b-roll:
```tsx
const FPS = 30;

// B-Roll timing from transcript analysis
const AI_TOOLS_START = Math.round(39 * FPS); // 0:39 - mentions AI tools
const AI_TOOLS_DURATION = Math.round(51 * FPS); // 51 seconds

<Sequence from={AI_TOOLS_START} durationInFrames={AI_TOOLS_DURATION}>
  <AIToolsShowcase />
</Sequence>
```

Compositions are organized in `src/tutorials/programming-fundamentals/compositions/`.
Every episode should include:
- `PFIntro` - Series intro bumper (5 seconds)
- Main content with subtitles
- B-roll/visual enhancements where appropriate
- `PFOutro` - Thank you + next episode preview (6 seconds)
```tsx
import { PFIntro, PFOutro } from "../../../animations/programming-fundamentals";

const INTRO_DURATION = 150; // 5 seconds
const OUTRO_DURATION = 180; // 6 seconds

// ... in composition:
<Sequence durationInFrames={INTRO_DURATION}>
  <PFIntro />
</Sequence>
// ... main content ...
<Sequence from={INTRO_DURATION + mainContentDuration} durationInFrames={OUTRO_DURATION}>
  <PFOutro nextEpisodeTitle="Next Episode Title" />
</Sequence>
```

Webcam-only episode (e.g., PF01, PF03):
```tsx
// src/tutorials/programming-fundamentals/compositions/PF01Composition.tsx
import React from "react";
import { AbsoluteFill, Sequence } from "remotion";
import { Video } from "@remotion/media";
import { PFIntro } from "../../../animations/programming-fundamentals";
import { Subtitles, VideoLowerThird } from "../../../components";
import { pf_01_cameraTranscript, pf_01_cameraSubtitles } from "../pf-01-transcript";
import { getVideoPath } from "../video-paths";

const FPS = 30;
const INTRO_DURATION = 150; // 5 seconds
const mainContentDuration = Math.ceil(
  pf_01_cameraTranscript.segments[pf_01_cameraTranscript.segments.length - 1].end * FPS
);

export const PF01Composition: React.FC = () => {
  return (
    <AbsoluteFill style={{ backgroundColor: "#000" }}>
      {/* Series Intro */}
      <Sequence durationInFrames={INTRO_DURATION}>
        <PFIntro />
      </Sequence>
      {/* Main Content - Webcam */}
      <Sequence from={INTRO_DURATION} durationInFrames={mainContentDuration}>
        <AbsoluteFill>
          <Video
            src={getVideoPath("pf-01-camera")}
            style={{ width: "100%", height: "100%", objectFit: "cover" }}
          />
          <Sequence from={60} durationInFrames={180}>
            <VideoLowerThird
              title="Apa Itu Programming?"
              subtitle="Programming Fundamentals - Episode 1"
            />
          </Sequence>
          <Subtitles cues={pf_01_cameraSubtitles} />
        </AbsoluteFill>
      </Sequence>
    </AbsoluteFill>
  );
};

export const PF01_DURATION = INTRO_DURATION + mainContentDuration;
```

Episode with screen + webcam (e.g., PF02):
```tsx
// src/tutorials/programming-fundamentals/compositions/PF02Composition.tsx
import React from "react";
import { AbsoluteFill, Sequence, useCurrentFrame } from "remotion";
import { Video } from "@remotion/media";
import { PFIntro } from "../../../animations/programming-fundamentals";
import { Subtitles, VideoLowerThird, PipFrame } from "../../../components";
import { pf_02_camera_1Subtitles } from "../pf-02-transcript";
import { pf_02_camera_1_pf_02_screen_1_sync } from "../pf-02-sync-screen1";
import { getVideoPath } from "../video-paths";

const FPS = 30;
const INTRO_DURATION = 150;

// Sync point from transcript matching
const SCREEN1_START_CAMERA_TIME = pf_02_camera_1_pf_02_screen_1_sync.offset;

// Frame ranges on the main-content timeline. Durations below are
// illustrative -- derive them from the sync offset and footage length.
const screen1StartFrame = Math.round(SCREEN1_START_CAMERA_TIME * FPS);
const screen1DurationFrames = 9000; // e.g., 5 minutes of screen capture
const mainContentDuration = 18000; // e.g., 10 minutes of camera footage

export const PF02Composition: React.FC = () => {
  const frame = useCurrentFrame();
  // Screen segment is active between its start and end frames (absolute timeline)
  const isScreenActive =
    frame >= INTRO_DURATION + screen1StartFrame &&
    frame < INTRO_DURATION + screen1StartFrame + screen1DurationFrames;

  return (
    <AbsoluteFill style={{ backgroundColor: "#000" }}>
      <Sequence durationInFrames={INTRO_DURATION}>
        <PFIntro />
      </Sequence>
      <Sequence from={INTRO_DURATION} durationInFrames={mainContentDuration}>
        <AbsoluteFill>
          {/* Webcam fullscreen when no screen */}
          <Video
            src={getVideoPath("pf-02-camera-1")}
            style={{ opacity: isScreenActive ? 0 : 1 }}
          />
          {/* Screen recording */}
          <Sequence from={screen1StartFrame} durationInFrames={screen1DurationFrames}>
            <Video src={getVideoPath("pf-02-screen-1")} />
          </Sequence>
          {/* Webcam PIP when screen is active */}
          {isScreenActive && (
            <div style={{ position: "absolute", bottom: 40, right: 40, width: 320, height: 240 }}>
              <Video src={getVideoPath("pf-02-camera-1")} volume={0} />
              <PipFrame />
            </div>
          )}
          <Subtitles cues={pf_02_camera_1Subtitles} />
        </AbsoluteFill>
      </Sequence>
    </AbsoluteFill>
  );
};
```

```bash
npm start
# Open browser, select composition, review
```

Work with AI to adjust:
- Timing of infographics
- Zoom keyframe positions
- Camera switch points
- Subtitle styling
Recommended: FFmpeg pipeline (renders at 5-6x realtime)
Use the config-based FFmpeg render pipeline for full episodes:
```bash
# Render full episodes (Remotion overlays + FFmpeg compositing)
./scripts/render-from-config.sh ep01 # ~5 min for 30-min episode
./scripts/render-from-config.sh ep02 # Handles screen+PIP automatically
./scripts/render-from-config.sh ep03
# Output: rendered/pf-ep01-apa-itu-programming.mp4
```

How it works:
- Reads timeline from `scripts/episode-config/epXX.json`
- Renders Remotion overlays (intro, outro, b-roll) as ProRes with alpha
- Composites onto camera footage using FFmpeg with GPU acceleration
- Concatenates intro + main + outro
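
Conceptually, the compositing step boils down to an FFmpeg overlay filter. A simplified sketch of the kind of command the script assembles (paths, frame range, and the VideoToolbox encoder are assumptions for macOS):

```bash
# Composite a transparent ProRes overlay onto camera footage for frames 60-240
ffmpeg -i camera.mov -i overlays/lower-third.mov \
  -filter_complex "[0:v][1:v]overlay=enable='between(n,60,240)'" \
  -c:v h264_videotoolbox -b:v 10M -c:a copy main.mp4
```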
Episode config format (`scripts/episode-config/ep01.json`):

```json
{
  "id": "ep01",
  "title": "Apa Itu Programming?",
  "fps": 30,
  "intro": { "composition": "PFIntro", "duration": 150 },
  "outro": { "composition": "PFOutro", "duration": 180 },
  "mainContent": {
    "camera": "ep-01/camera/DSC_8013.MOV",
    "overlays": [
      {
        "composition": "EP01-LowerThird",
        "startFrame": 60,
        "duration": 180,
        "type": "transparent",
        "comment": "0:02 - Episode title"
      },
      {
        "composition": "EP01-CompilationProcess",
        "startFrame": 21600,
        "duration": 750,
        "type": "solid",
        "comment": "12:00 - high-level to binary"
      }
    ]
  }
}
```

Overlay types:
- `transparent`: Overlay on top of camera (lower thirds, charts)
- `solid`: Replace camera entirely (full-screen diagrams)
Render time estimates:
- 30-minute episode: ~5-6 minutes (5-6x realtime)
- 12-minute episode: ~2-3 minutes
Alternative: Pure Remotion render (slower)
For individual components or testing:
```bash
# Individual diagram animations
npx remotion render src/index.ts EP01-TypingSystemsDiagram out/ep01-typing-systems.mp4
# Full composition via Remotion (slower, ~2-3 hours for 30 min)
npx remotion render src/index.ts PF01-Full out/pf-01-final.mp4 --concurrency=1
```

Troubleshooting:
- Files not found: ensure the external drive is mounted and check `publicDir` in `remotion.config.ts`
- Slow render: use the FFmpeg pipeline instead of pure Remotion
- Out of disk space: output to the external drive (the default for the FFmpeg pipeline)
One-time setup:
- Create Google Cloud project at https://console.cloud.google.com/
- Enable YouTube Data API v3
- Create OAuth 2.0 credentials (Desktop app type)
- Download the JSON and save it as `client_secret.json` in the project root
- Add yourself as a test user in the OAuth consent screen
- Authenticate: `node scripts/youtube-auth.mjs`
Generate metadata for all episodes:
```bash
# Generate with scheduling (start date + interval in days)
node scripts/generate-youtube-metadata.mjs all --start-date 2026-02-01 --interval 2
# Single episode
node scripts/generate-youtube-metadata.mjs ep01
```

Metadata includes:
- Title from episode outline
- Description with timeline markers (from outline timestamps)
- Tags (common series tags + episode-specific)
- Playlist name
- Thumbnail path
- Scheduled publish time
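
A generated metadata file might look like the following; the structure is inferred from the list above, so exact keys may differ:

```json
{
  "title": "Apa Itu Programming? - Programming Fundamentals #01",
  "description": "Intro episode...\n\n00:00 Intro\n02:15 Apa itu programming?\n12:00 High-level ke binary",
  "tags": ["programming fundamentals", "belajar programming", "artivisi"],
  "playlist": "Programming Fundamentals",
  "thumbnail": "thumbnails/ep01.png",
  "publishAt": "2026-02-01T09:00:00+07:00"
}
```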
Generate thumbnails:
Use AI image generation (Gemini, DALL-E, Midjourney, etc.) to create thumbnails:
Prompt example:
"Minimalist illustration of three interlocking gears with Python snake,
Java coffee cup, and JavaScript logo, dark gradient background with
subtle binary code, bold text 'PROGRAMMING?' in neon blue, tech aesthetic,
YouTube thumbnail"
AI tools often generate square images. Resize to YouTube's 16:9 format:
```bash
# Resize and pad with dark background (1024x1024 → 1280x720)
ffmpeg -i ~/Downloads/ep01-thumbnail.png \
-vf "scale=720:720,pad=1280:720:(ow-iw)/2:0:color=#0d1117" \
  thumbnails/ep01.png
```

Place final thumbnails in the `thumbnails/` folder:
- `thumbnails/ep01.png`, `ep02.png`, etc.
- Final size: 1280x720 (16:9)
Upload videos:
```bash
# Single episode
node scripts/youtube-upload.mjs scripts/youtube-metadata/ep01.json path/to/rendered/pf-ep01.mp4
# Batch upload (ep01 through ep03)
# Videos are auto-detected from RENDERED_DIR in the script
node scripts/youtube-batch-upload.mjs ep01 ep03
```

Upload features:
- Sets title, description, tags from metadata JSON
- Sets custom thumbnail
- Creates playlist if not exists, adds video to playlist
- Scheduled publishing (uploads as private with publishAt time)
- 5-second delay between batch uploads to avoid rate limiting
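
Scheduled publishing uses the YouTube Data API's `videos.insert` with `privacyStatus: "private"` plus a `publishAt` timestamp; roughly what `youtube-upload.mjs` does, sketched here with the googleapis Node client (token handling and paths simplified):

```js
import fs from "node:fs";
import { google } from "googleapis";

// Assumed token file written by scripts/youtube-auth.mjs
const auth = new google.auth.OAuth2();
auth.setCredentials(JSON.parse(fs.readFileSync("token.json", "utf8")));

const youtube = google.youtube({ version: "v3", auth });
const meta = JSON.parse(fs.readFileSync("scripts/youtube-metadata/ep01.json", "utf8"));

// Uploads as private; YouTube flips it public at publishAt
await youtube.videos.insert({
  part: ["snippet", "status"],
  requestBody: {
    snippet: { title: meta.title, description: meta.description, tags: meta.tags },
    status: { privacyStatus: "private", publishAt: meta.publishAt },
  },
  media: { body: fs.createReadStream("rendered/pf-ep01.mp4") },
});
```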
Animated title/link overlay.
```tsx
<VideoLowerThird
  title="Introduction to Variables"
  subtitle="Programming Fundamentals"
  link="github.com/artivisi/examples"
  position="bottom-left"
  showAt={90}
  hideAt={270}
  accentColor="#22c55e"
/>
```

Picture-in-picture webcam.
```tsx
<WebcamOverlay
  src={webcamVideoSrc}
  position="bottom-right"
  size="medium" // small | medium | large
  shape="rounded" // circle | rounded | rectangle
  borderColor="#22c55e"
  showAt={0}
  hideAt={9000}
/>
```

Keyframe-based zoom and pan for screencast focus.
```tsx
<ZoomPan
  keyframes={[
    { frame: 0, x: 0.5, y: 0.5, scale: 1 },
    { frame: 300, x: 0.3, y: 0.2, scale: 2 },
    { frame: 600, x: 0.5, y: 0.5, scale: 1 },
  ]}
  easing="smooth"
>
  <OffthreadVideo src={screencastSrc} />
</ZoomPan>
```

Syntax-highlighted code overlay.
```tsx
<CodeHighlight
  code={`function hello() {\n  console.log("Hello!");\n}`}
  title="example.js"
  highlightLines={[2]}
  size="large"
  showAt={0}
  hideAt={300}
/>
```

Display subtitles from transcript.
```tsx
<Subtitles
  cues={subtitleCues}
  position="bottom"
  style="default" // default | boxed | outline
  fontSize={48}
/>
```

Display slides, bullet lists, callouts, or images.
```tsx
<Infographic
  content={{
    type: "bullet-list",
    title: "Key Concepts",
    items: ["Variables", "Data Types", "Functions"],
    highlightIndex: 1,
  }}
  position="center"
  showAt={300}
  hideAt={600}
/>
```

| Script | Purpose |
|---|---|
| `render-from-config.sh` | Main render script - FFmpeg pipeline with GPU acceleration |
| `generate-outline.mjs` | Generate printable recording outline |
| `remove-silence.sh` | One-step silence removal |
| `detect-silence.sh` | Detect silence regions |
| `detect-silence.mjs` | Convert silence data to EDL |
| `transcribe-with-whisper.sh` | Run Whisper transcription |
| `process-transcript.mjs` | Convert transcript to Remotion format |
| `extract-cursor.py` | Extract cursor positions from video |
| `generate-zoom-keyframes.mjs` | Generate zoom keyframes from cursor data |
| `sync-transcripts.mjs` | Sync camera and screen recordings by matching transcript phrases |
| `youtube-auth.mjs` | YouTube OAuth authentication setup |
| `youtube-upload.mjs` | Upload single video with metadata, thumbnail, playlist |
| `youtube-batch-upload.mjs` | Batch upload multiple episodes sequentially |
| `generate-youtube-metadata.mjs` | Generate YouTube metadata from episode outlines |
```bash
# Node.js dependencies
npm install
# Python dependencies (use venv)
python3 -m venv .venv
source .venv/bin/activate
pip install openai-whisper opencv-python numpy
# FFmpeg (for silence detection)
brew install ffmpeg  # macOS
```