HTML Parser

An HTML parser reads raw HTML text and converts it into a structured tree of nodes (the Document Object Model, or DOM) that programs can traverse and query.
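The text-to-tree step can be sketched with Python's standard-library html.parser module. This is a minimal illustration, not how any particular parser works internally: Node, TreeBuilder, parse_html, and outline are hypothetical names introduced here, and the error recovery that real HTML parsers perform is reduced to ignoring stray closing tags.

```python
from html.parser import HTMLParser

# Void elements never take a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """One element in the simplified tree (hypothetical helper type)."""

    def __init__(self, tag, attrs=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.children = []
        self.text = ""  # text found directly inside this element


class TreeBuilder(HTMLParser):
    """Builds a Node tree by tracking the stack of currently open elements."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest matching open element; ignore stray end tags.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        self.stack[-1].text += data


def parse_html(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()  # flush any buffered text
    return builder.root


def outline(node, depth=0):
    """Return the tree as indented tag names, one per line."""
    lines = ["  " * depth + node.tag]
    for child in node.children:
        lines.extend(outline(child, depth + 1))
    return lines


if __name__ == "__main__":
    doc = "<html><body><h1>Title</h1><p>Hi <b>there</b><br>bye</p></body></html>"
    print("\n".join(outline(parse_html(doc))))
```

Running the script prints the nesting of the sample document as an indented outline, which is the structure a program would then traverse and query.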

Scrapers use parsing libraries such as BeautifulSoup (Python) or Cheerio (Node.js), or the browser's built-in DOM APIs, to locate specific elements and extract the data they contain.
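As a rough sketch of the extraction step, the example below collects every link's URL and visible text. To stay dependency-free it uses the standard-library html.parser module and not BeautifulSoup; LinkExtractor and extract_links are hypothetical names chosen for this example, and the sample markup is invented.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects (href, text) pairs for every <a> element that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None  # href of the <a> currently open, if any
        self._text = []    # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered text
    return parser.links


if __name__ == "__main__":
    page = '<ul><li><a href="/a">First <b>item</b></a></li><li><a href="/b">Second</a></li></ul>'
    for href, text in extract_links(page):
        print(href, text)
```

A dedicated library typically replaces this hand-written state tracking with selector-based queries, which tends to be more convenient when the target elements are deeply nested.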

Related Terms