Advanced Video Downloader Overview
This skill provides comprehensive video downloading and transcription capabilities from 1000+ platforms including YouTube, Bilibili, TikTok, Twitter, Instagram, and more. It combines:
yt-dlp: Powerful video downloading tool SiliconFlow API: Free AI-powered transcription to convert videos to Markdown When to Use This Skill
Activate this skill when the user:
Explicitly requests to download a video ("download this video", "下载视频") Provides video URLs from any platform Mentions saving videos for offline viewing Wants to extract audio from videos Needs to download multiple videos or playlists Asks about video quality options Requests video transcription ("转录视频", "提取字幕", "视频转文字") Wants to convert video/audio to text or Markdown Asks to download AND transcribe a video in one workflow Core Capabilities 1. Single Video Download
Download individual videos from any supported platform with automatic quality selection.
Example usage:
User: "Download this YouTube video: https://youtube.com/watch?v=abc123" User: "下载这个B站视频: https://bilibili.com/video/BV1xxx"
- Batch & Playlist Download
Download multiple videos or entire playlists at once.
Example usage:
User: "Download all videos from this playlist" User: "Download these 3 videos: [URL1], [URL2], [URL3]"
- Audio Extraction
Extract audio only from videos, saving as MP3 or M4A.
Example usage:
User: "Download only the audio from this video" User: "Convert this video to MP3"
- Quality Selection
Choose specific video quality (4K, 1080p, 720p, etc.).
Example usage:
User: "Download in 4K quality" User: "Get the 720p version to save space"
- Video/Audio Transcription
Convert video or audio files to Markdown text using SiliconFlow's free AI transcription API.
Example usage:
User: "Transcribe this video to text" / "转录这个视频" User: "Download and transcribe this YouTube video" User: "将这个音频转成文字" User: "Extract transcript from this MP4 file"
Supported formats:
Audio: MP3, WAV, M4A, FLAC, AAC, OGG, OPUS, WMA Video: MP4, AVI, MOV, MKV, FLV, WMV, WEBM, M4V Response Pattern
When a user requests video download:
Step 1: Identify the Platform and URL(s)
Extract video URL(s) from user message
Identify platform: YouTube, Bilibili, TikTok, etc.
Step 2: Check Tool Availability
Check if yt-dlp is installed
yt-dlp --version
Step 3: Select Appropriate yt-dlp Command
Based on platform and requirements:
YouTube, Twitter, Instagram, TikTok: Basic command works Bilibili: Basic command works for most videos Quality selection: Use -f with height filter Audio only: Use -x --audio-format mp3 Playlists: Use playlist-specific output template Step 4: Execute Download
Use yt-dlp directly with appropriate options:
Basic download (best quality MP4)
yt-dlp -f "bestvideo[ext=mp4]+bestaudio[ext=m4a]/best[ext=mp4]/best" --merge-output-format mp4 -o "%(title)s.%(ext)s" "VIDEO_URL"
Specific quality (1080p)
yt-dlp -f "bestvideo[height<=1080]+bestaudio/best[height<=1080]" --merge-output-format mp4 -o "%(title)s.%(ext)s" "VIDEO_URL"
Audio only (MP3)
yt-dlp -x --audio-format mp3 -o "%(title)s.%(ext)s" "VIDEO_URL"
With cookies file (for protected content)
yt-dlp --cookies cookies.txt -o "%(title)s.%(ext)s" "VIDEO_URL"
Playlist download
yt-dlp -o "%(playlist)s/%(playlist_index)s - %(title)s.%(ext)s" "PLAYLIST_URL"
Step 5: Report Results
After download completes, report:
✅ Video title and duration ✅ File size and format ✅ Save location ✅ Download speed and time taken ⚠️ Any warnings or quality limitations
Example output:
✅ Downloaded: "Video Title Here" Duration: 15:30 Quality: 1080p MP4 Size: 234 MB Location: ./Video Title Here.mp4 Time: 45 seconds at 5.2 MB/s
Transcription Response Pattern
When a user requests video/audio transcription:
Step 1: Check Prerequisites
Verify SiliconFlow API key is available
echo $SILICONFLOW_API_KEY
Or user must provide via --api-key parameter
API Key Setup:
Get free API key from: https://cloud.siliconflow.cn/account/ak Copy .env.example to .env and add your API key Or set environment variable: SILICONFLOW_API_KEY=sk-xxx Step 2: Validate File
Ensure the file exists and is a supported format (audio or video).
Step 3: Execute Transcription
Use the bundled script scripts/transcribe_siliconflow.py:
Basic transcription
python scripts/transcribe_siliconflow.py --file video.mp4 --api-key sk-xxx
With custom output path
python scripts/transcribe_siliconflow.py --file audio.mp3 --output transcript.md --api-key sk-xxx
Using environment variable for API key
python scripts/transcribe_siliconflow.py --file video.mp4
Step 4: Report Transcription Results ✅ Transcription complete! File: video.mp4 Output: 2025-01-15-video.md Size: 12.5 KB
Preview:
[First 200 characters of transcription...]
Combined Workflow: Download + Transcribe
For requests like "Download and transcribe this video":
Step 1: Download video
yt-dlp -f "bestvideo[ext=mp4]+bestaudio[ext=m4a]/best" --merge-output-format mp4 -o "%(title)s.%(ext)s" "VIDEO_URL"
Step 2: Transcribe the downloaded file
python scripts/transcribe_siliconflow.py --file "Downloaded Video Title.mp4" --api-key sk-xxx
Platform-Specific Notes YouTube Fully supported by yt-dlp No authentication needed for public videos Supports all quality levels including 4K/8K Bilibili Supported by yt-dlp High-quality downloads may require login cookies Use --cookies with cookies.txt for member-only content Other Platforms Most platforms work well with yt-dlp Check references/supported_platforms.md for full list Handling Cookies for Protected Content
For platforms requiring authentication (Bilibili VIP, member-only content, etc.):
Method 1: Export Cookies File (Recommended)
Use browser extension "Get cookies.txt LOCALLY"
Export cookies.txt, then:
yt-dlp --cookies cookies.txt "VIDEO_URL"
Method 2: Manual Cookies File
Create cookies.txt in Netscape format
Use browser extension "Get cookies.txt LOCALLY"
Then use with yt-dlp
yt-dlp --cookies cookies.txt "VIDEO_URL"
Troubleshooting Issue: Video quality lower than expected
Solution:
Check if platform requires login for HD Use --cookies cookies.txt for authenticated access Explicitly specify quality with -f parameter Issue: Download very slow
Solution:
Check internet connection Try different time of day (peak hours affect speed) Use --concurrent-fragments for faster downloads Issue: "Video unavailable" or geo-restricted
Solution:
Video may be region-locked Use proxy/VPN if legally permitted Check if video is still available on platform Issue: Transcription API key error
Solution:
Verify API key starts with sk- Get free key from: https://cloud.siliconflow.cn/account/ak Set environment variable: SILICONFLOW_API_KEY=sk-xxx Issue: Transcription returns empty text
Solution:
Check if audio/video has clear speech Verify file format is supported File may be too short or contain only music Common Commands Quality Presets
4K (2160p)
yt-dlp -f "bestvideo[height<=2160]+bestaudio/best[height<=2160]" --merge-output-format mp4 "VIDEO_URL"
1080p (Full HD)
yt-dlp -f "bestvideo[height<=1080]+bestaudio/best[height<=1080]" --merge-output-format mp4 "VIDEO_URL"
720p (HD)
yt-dlp -f "bestvideo[height<=720]+bestaudio/best[height<=720]" --merge-output-format mp4 "VIDEO_URL"
480p (SD)
yt-dlp -f "bestvideo[height<=480]+bestaudio/best[height<=480]" --merge-output-format mp4 "VIDEO_URL"
Audio Extraction
Extract as MP3
yt-dlp -x --audio-format mp3 -o "%(title)s.%(ext)s" "VIDEO_URL"
Extract as M4A (better quality)
yt-dlp -x --audio-format m4a -o "%(title)s.%(ext)s" "VIDEO_URL"
Batch Downloads
Download multiple URLs from file
yt-dlp -a urls.txt
Download playlist with custom naming
yt-dlp -o "%(playlist)s/%(playlist_index)s - %(title)s.%(ext)s" "PLAYLIST_URL"
Download channel's videos
yt-dlp -o "%(uploader)s/%(title)s.%(ext)s" "CHANNEL_URL"
Bundled Resources Configuration .env.example
Template for environment variables. Copy to .env and add your SiliconFlow API key.
Scripts scripts/transcribe_siliconflow.py
AI-powered transcription script using SiliconFlow's free API.
Usage:
python scripts/transcribe_siliconflow.py --file
Parameters:
--file, -f: Input audio/video file (required) --api-key, -k: SiliconFlow API key (or use SILICONFLOW_API_KEY env var) --output, -o: Output markdown file path (default: YYYY-MM-DD-filename.md) --model, -m: Model to use (default: FunAudioLLM/SenseVoiceSmall) References references/supported_platforms.md
Comprehensive list of 1000+ supported platforms with platform-specific notes and requirements.
references/quality_formats.md
Detailed explanation of video formats, codecs, and quality selection strategies.
Tips for Best Results Always specify quality if user has preference - saves bandwidth and storage Batch downloads save time - use playlist URLs when possible Audio extraction is faster - recommend for podcast/music content Check file size before downloading - warn user for very large files (>1GB) Transcription works best with clear audio - consider extracting audio first for better results Sources yt-dlp Documentation yt-dlp Installation Guide SiliconFlow API Documentation SiliconFlow Free API Key