Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
61 changes: 61 additions & 0 deletions src/content/skills-zh/social-media-scraper.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,61 @@
---
name: social-media-scraper
title: 全网作品数据抓取分析工具(抖音小红书视频号快手B站通用)
description: 免费抓取抖音、小红书、B站、快手、视频号五大平台作品数据(播放/点赞/评论/分享/收藏),无需登录/API Key/付费。支持飞书多维表格同步。
source: community
author: wu1982966308
githubUrl: https://github.com/wu1982966308/social-media-scraper
docsUrl: https://github.com/wu1982966308/social-media-scraper/blob/main/README.md
category: data
tags:
- 数据抓取
- 社交媒体
- 抖音
- 小红书
- B站
- 快手
- 视频号
- 飞书
- 多维表格
roles:
- developer
- marketer
- analyst
featured: false
popular: false
isOfficial: false
installCommand: |
git clone https://github.com/wu1982966308/social-media-scraper.git
cp -r social-media-scraper ~/.qoder/skills/
date: 2026-06-16
---

## 使用场景

- 无需登录或 API Key,提取社交媒体作品的互动数据(播放、点赞、评论、分享、收藏)
- 批量抓取多个作品链接,用于竞品分析或数据报告
- 将抓取的数据同步到飞书多维表格,实现集中化管理
- 监控抖音、小红书、B站、快手、视频号的内容表现

## 示例

```python
import sys
sys.path.insert(0, '~/.qoder/skills/social-media-scraper')
from scraper import scrape_post, format_result

# 单个链接
result = scrape_post('https://www.bilibili.com/video/BV1GJ411x7h7')
print(format_result(result))

# 批量抓取
from scraper import scrape_posts
results = scrape_posts(['链接1', '链接2', '链接3'])
```

## 注意事项

- 需要安装 Python 依赖:`pip install curl_cffi requests beautifulsoup4 lxml brotli`
- 小红书链接必须使用包含 `xsec_token` 参数的完整分享链接
- 视频号需要浏览器自动化降级(DrissionPage 或内置浏览器工具)
- 飞书多维表格同步支持 lark-cli 和 HTTP API 两种模式
61 changes: 61 additions & 0 deletions src/content/skills/social-media-scraper.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,61 @@
---
name: social-media-scraper
title: Social Media Data Scraper (Douyin/XHS/Bilibili/Kuaishou/Channels)
description: A free social media post data scraper supporting 5 major platforms (Douyin, Xiaohongshu, Bilibili, Kuaishou, WeChat Channels). No login, no API key, no payment required. Extracts views, likes, comments, shares, and collections. Optional Feishu Base sync.
source: community
author: wu1982966308
githubUrl: https://github.com/wu1982966308/social-media-scraper
docsUrl: https://github.com/wu1982966308/social-media-scraper/blob/main/README.md
category: data
tags:
- scraper
- social-media
- douyin
- xiaohongshu
- bilibili
- kuaishou
- wechat-channels
- feishu
- lark
roles:
- developer
- marketer
- analyst
featured: false
popular: false
isOfficial: false
installCommand: |
git clone https://github.com/wu1982966308/social-media-scraper.git
cp -r social-media-scraper ~/.qoder/skills/
date: 2026-06-16
---

## Use Cases

- Extract engagement data (views, likes, comments, shares, collections) from social media posts without login or API keys
- Batch scrape multiple post URLs for competitive analysis or reporting
- Sync scraped data to Feishu (Lark) Base for centralized data management
- Monitor content performance across Douyin, Xiaohongshu, Bilibili, Kuaishou, and WeChat Channels

## Example

```python
import sys
sys.path.insert(0, '~/.qoder/skills/social-media-scraper')
from scraper import scrape_post, format_result

# Single URL
result = scrape_post('https://www.bilibili.com/video/BV1GJ411x7h7')
print(format_result(result))

# Batch scrape
from scraper import scrape_posts
results = scrape_posts(['url1', 'url2', 'url3'])
```

## Notes

- Requires Python dependencies: `pip install curl_cffi requests beautifulsoup4 lxml brotli`
- Xiaohongshu links must include the full share URL with `xsec_token` parameter
- WeChat Channels requires browser automation fallback (DrissionPage or built-in browser tools)
- Feishu Base sync supports both lark-cli and HTTP API modes