x-scraper

安装量: 49
排名: #15026

安装

npx skills add https://github.com/aatrooox/blog.zzao.club --skill x-scraper
X.com Post Scraper
Extracts recent posts from X.com users with full engagement data using authenticated cookies.
Quick Start
Basic command
:
cd
.opencode/skills/x-scraper/scripts
python3 scraper.py
<
username
>
[
count
]
Example
:
python3 scraper.py example_user
15
Output
:
/tmp/x_{username}_posts.json
Prerequisites
Before first use, verify environment requirements:
Python 3.11+
Check with
python3 --version
Playwright
Check with
python3 -c "import playwright"
Cookie file
Check with ls /tmp/x_cookies_pw.json If any prerequisite is missing , see references/setup.md for detailed installation and configuration guide. Common Workflows First-time setup See references/setup.md for complete environment configuration. Daily scraping python3 scraper.py < username

[ count ] Custom cookie file python3 scraper.py < username

[ count ] --cookie-file /path/to/cookies.json Troubleshooting If scraper fails, see references/troubleshooting.md for common issues and solutions. Output Format { "index" : 1 , "username" : "example_user" , "postId" : "1234567890123456789" , "publishTime" : "2025-12-03T18:28:32.000Z" , "postLink" : "https://x.com/example_user/status/1234567890123456789" , "textContent" : "Post text content..." , "views" : "471K" , "likes" : "1.1K" , "retweets" : "153" , "replies" : "44" } Key fields : postLink - Direct URL to post publishTime - ISO 8601 timestamp views/likes/retweets/replies - Abbreviated metrics (K, M) When to Use This Skill Trigger when user requests: "整理 @某人 最近的发言" "看看某人在X上说了什么" "Scrape X.com posts from @username" "Get latest tweets from user" "Analyze X user's recent posts" Available Scripts scraper.py - Main scraper python3 scraper.py < username

[ count ] [ --cookie-file < path

] Scrapes user timeline with replies Default count: 10 posts Default cookie: /tmp/x_cookies_pw.json convert_cookies.py - Cookie converter python3 convert_cookies.py < input-file

[ output-file ] Converts Cookie-Editor JSON to Playwright format Required before first scraping Reference Documents setup.md - Complete environment setup guide (Python, Playwright, cookies) troubleshooting.md - Error diagnosis and solutions usage.md - Detailed usage examples and advanced options Limitations Requires X.com authentication cookies Cookies expire (~7 days), need re-export Rate limits may apply Cannot access private/protected accounts

返回排行榜