youtube-knowledge-extractor
Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis)
Setup & Installation
Install command
clawhub install sdrabent/youtube-knowledge-extractor
If the CLI is not installed:
npx clawhub@latest install sdrabent/youtube-knowledge-extractor
Or install with OpenClaw CLI:
openclaw skills install sdrabent/youtube-knowledge-extractor
Or paste the repo link into your assistant's chat:
https://github.com/openclaw/skills/tree/main/skills/sdrabent/youtube-knowledge-extractor
What This Skill Does
Analyzes YouTube videos through both audio and visual channels by extracting timestamped transcripts and key frames from the video. Pairs spoken content with on-screen visuals, so UI actions, code, and diagrams shown but not described in speech are captured accurately.
Standard YouTube transcript tools miss anything shown on screen but not spoken; this skill matches frames to transcript timestamps to close that gap.
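The idea of matching frames to transcript timestamps can be illustrated with a minimal Python sketch. This is not the skill's actual implementation: the `Segment` type, the `pair_frames_with_transcript` function, and the sample data are hypothetical names chosen for illustration. It assumes the transcript is already available as segments sorted by start time and that each extracted frame carries a timestamp in seconds.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcript segment (hypothetical shape, for illustration only)."""
    start: float  # seconds from the start of the video
    text: str


def pair_frames_with_transcript(segments, frame_times):
    """Pair each frame timestamp with the segment being spoken at that time.

    Assumes `segments` is sorted by `start`. A frame that appears before
    the first segment starts is paired with None.
    """
    starts = [s.start for s in segments]
    pairs = []
    for t in frame_times:
        # Index of the last segment that starts at or before time t.
        i = bisect_right(starts, t) - 1
        pairs.append((t, segments[i] if i >= 0 else None))
    return pairs


# Made-up sample data: three transcript segments and three frame timestamps.
segments = [
    Segment(0.0, "Open the settings page."),
    Segment(4.5, "Now enable the feature flag."),
    Segment(9.0, "Save and reload."),
]

for t, seg in pair_frames_with_transcript(segments, [2.0, 5.0, 12.0]):
    print(f"{t:.1f}s: {seg.text if seg else '(no speech yet)'}")
```

Each frame is attached to the most recent segment that started before it, so a screenshot of code or a UI action can be read alongside what was being said at that moment. A real pipeline would also need segment end times or silence detection to avoid pairing a frame with speech that ended long before it.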
When to Use It
- Building step-by-step guides from software tutorial videos
- Extracting code shown on screen during live coding demos
- Summarizing long explainer videos with visual anchors
- Following UI navigation in product walkthrough videos
- Capturing diagram details from architecture or design videos
Security Audits
These signals reflect official OpenClaw status values. A Suspicious status means the skill should be used with extra caution.