Crawling & Extraction
Crawler
A system that discovers and visits URLs at scale, usually before extraction logic runs.
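As a rough illustration of the discovery step, here is a minimal breadth-first sketch. The names crawl and fetch_links and the in-memory SITE map are hypothetical stand-ins for a real HTTP fetch and link extractor; a production crawler would also need politeness delays, robots.txt handling, and retries.

```python
from collections import deque
from urllib.parse import urljoin, urldefrag


def crawl(seed, fetch_links, max_pages=100):
    """Breadth-first discovery; returns URLs in the order they were visited."""
    frontier = deque([seed])   # URLs waiting to be visited
    seen = {seed}              # URLs already queued, to avoid revisiting
    visited = []
    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        visited.append(url)
        for href in fetch_links(url):
            # Resolve relative links and drop #fragments before deduplicating.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute not in seen:
                seen.add(absolute)
                frontier.append(absolute)
    return visited


# Hypothetical in-memory site standing in for real network requests.
SITE = {
    "https://example.com/": ["/a", "/b"],
    "https://example.com/a": ["/b", "/c#top"],
    "https://example.com/b": ["/"],
    "https://example.com/c": [],
}

order = crawl("https://example.com/", lambda url: SITE.get(url, []))
print(order)
```

Passing fetch_links as a parameter keeps discovery separate from fetching, so extraction logic can run later on whatever the crawler has visited.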
Scraper
The extraction layer that turns pages, APIs, or rendered DOM content into structured records.
Parser
Rules or models that map raw HTML or JSON to fields such as title, price, rating, or availability.
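A rule-based parser might look like the sketch below, which uses only Python's standard html.parser module. The class names title, price, and rating in the sample markup, and the helper names ProductParser and parse_product, are assumptions made for illustration; real pages will need their own rules.

```python
from html.parser import HTMLParser


class ProductParser(HTMLParser):
    """Collects the first text found inside elements whose class names a field."""

    FIELDS = {"title", "price", "rating"}  # assumed class names, illustrative only

    def __init__(self):
        super().__init__()
        self.record = {}
        self._current = None  # field whose text we are waiting for

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for name in classes:
            if name in self.FIELDS:
                self._current = name

    def handle_data(self, data):
        if self._current and data.strip():
            self.record[self._current] = data.strip()
            self._current = None


def parse_product(html):
    """Map raw HTML to a typed record; missing fields become None."""
    parser = ProductParser()
    parser.feed(html)
    parser.close()
    raw = parser.record
    return {
        "title": raw.get("title"),
        "price": float(raw["price"].lstrip("$")) if "price" in raw else None,
        "rating": float(raw["rating"]) if "rating" in raw else None,
    }


HTML = (
    '<div><h1 class="title">Blue Mug</h1>'
    '<span class="price">$12.50</span>'
    '<span class="rating">4.6</span></div>'
)
record = parse_product(HTML)
print(record)
```

Returning None for absent fields, rather than raising, makes it easier to measure how often each rule fails to match.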
Selector Drift
A breakage pattern in which CSS or XPath selectors stop matching, or start matching the wrong elements, after a site changes its markup, for example during a redesign.
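One common way to notice drift is to track how often each field is actually populated and flag fields whose fill rate drops. The sketch below assumes records are plain dicts; the function names match_rate and drifted_fields and the 0.9 threshold are hypothetical choices, not a standard.

```python
def match_rate(records, field):
    """Fraction of records in which the field was extracted (not None or empty)."""
    if not records:
        return 0.0
    hits = sum(1 for record in records if record.get(field) not in (None, ""))
    return hits / len(records)


def drifted_fields(records, fields, threshold=0.9):
    """Return the fields whose match rate fell below the threshold."""
    return [field for field in fields if match_rate(records, field) < threshold]


# Illustrative batches: the price selector stops matching in the second one.
before = [{"title": "A", "price": 1.0}, {"title": "B", "price": 2.0}]
after = [{"title": "A", "price": None}, {"title": "B", "price": None}]

print(drifted_fields(before, ["title", "price"]))
print(drifted_fields(after, ["title", "price"]))
```

A check like this only catches selectors that stop matching; selectors that silently match the wrong element need value-level validation as well.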
JavaScript Rendering
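Executing a page's JavaScript in a real browser engine so that client-rendered content, such as pages built with React, Vue, or other single-page application (SPA) frameworks, is present in the DOM before extraction.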
Headless Browser
A browser controlled programmatically without a visible UI, commonly used for dynamic pages.