From zyte-web-data
Adds web-poet page objects to Scrapy projects with auto-detected fields via AST manipulation.
How this skill is triggered — by the user, by Claude, or both
Slash command
/zyte-web-data:scrape-add-page-object [file-path] [class-name] [domain] [base-class] [item-class][file-path] [class-name] [domain] [base-class] [item-class]This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are adding an empty web-poet page object to a Scrapy project.
You are adding an empty web-poet page object to a Scrapy project.
Read ${CLAUDE_SKILL_DIR}/../scrape/references/python-environments.md.
The raw argument string is $ARGUMENTS. Split it into up to 6 whitespace-separated positional arguments:
books_project/pages/books_toscrape_com.py)ProductPage)@handle_urls (e.g. books.toscrape.com)web_poet.WebPage)books_project.items.ProductItem)name,price,rating)Run from the project root (the directory containing pyproject.toml) so the item
class is importable for auto-detecting required fields:
uv run --project . --with libcst ${CLAUDE_SKILL_DIR}/scripts/add_page_object.py \
FILE_PATH CLASS_NAME DOMAIN BASE_CLASS ITEM_CLASS
Required fields (those with no default in the item class) are detected automatically
via itemadapter and get @field stubs. If all fields have defaults the class body is
pass. Pass --fields name,price to override auto-detection.
The script uses libcst for correct AST manipulation:
ProductPage and CategoryPage)Common base classes:
web_poet.WebPage — for pages using HTTP responses (most common)npx claudepluginhub zytedata/claude-skills --plugin zyte-web-dataGenerates web-poet page objects from extraction specs produced by /scrape-spec, including item classes, page objects, and test fixtures.
Builds production-ready web scrapers for any site using Bright Data infrastructure. Guides site analysis, API selection, selector extraction, pagination, and implementation.
Migrates web scraping from Puppeteer, Playwright, Cheerio to Firecrawl API using code examples, comparisons, and lib detection.