HTML Extract Links

Pull every href, src, and action value from HTML, classified by element (a, img, script, link, iframe, video, audio, source, form). An optional base URL resolves relative links. Extraction is regex-based, so links are still found in broken HTML.

About HTML Extract Links

HTML Extract Links pulls every href, src, and action out of an HTML document and classifies each by the element it came from: a, img, script, link, iframe, video, audio, source, or form. Because it's regex-based rather than relying on a strict parser, it still extracts links from broken or partial markup, and an optional base URL resolves relative paths to absolute ones. It runs in your browser, so pages you paste in are never sent anywhere.

Category: text
Input: text/html or text/plain
Output: application/json
Cost: free, runs in your browser
Memory: low
Privacy: HTML Extract Links runs entirely on your device. Files you provide never leave your browser — no uploads, no server, no tracking. The page works offline once loaded.

Common uses

  • Inventory every external script and iframe a saved page loads for a quick security review
  • Pull all image src values from a scraped page to audit which assets it depends on
  • Extract outbound a-tag links from an email's HTML to check for suspicious destinations
  • Resolve relative links to absolute URLs by supplying the page's base URL
  • Recover usable links from broken or truncated HTML that a strict parser would reject
  • List form action targets to see where a page submits data before trusting it

Frequently asked questions

What link attributes does it find?

It collects href, src, and action values, grouped by their source element such as a, img, script, link, iframe, video, audio, source, and form.

Does it handle malformed HTML?

Yes. It's regex-based rather than DOM-based, so even broken or incomplete markup still yields its links.
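To illustrate why a regex approach tolerates damaged markup, here is a minimal Python sketch. It is not the tool's actual implementation: the pattern, the `extract_links` name, and the `element` / `attr` / `url` keys are assumptions made for this example. It matches opening tags one at a time, so an unclosed or truncated tag elsewhere in the input does not stop it.

```python
import re

# Hypothetical pattern (an assumption, not the tool's real regex): one of the
# supported element names, then the first href/src/action attribute in that tag.
LINK_RE = re.compile(
    r"""<(a|img|script|link|iframe|video|audio|source|form)\b   # element name
        [^>]*?\s                                                # any earlier attributes
        (href|src|action)\s*=\s*                                # link attribute
        (?:"([^"]*)"|'([^']*)'|([^\s"'>]+))                     # quoted or bare value
    """,
    re.IGNORECASE | re.VERBOSE,
)

def extract_links(html):
    """Return one record per matched tag, in document order."""
    links = []
    for m in LINK_RE.finditer(html):
        # Exactly one of the three value groups participates in a match.
        value = next(g for g in m.group(3, 4, 5) if g is not None)
        links.append({
            "element": m.group(1).lower(),
            "attr": m.group(2).lower(),
            "url": value,
        })
    return links

# Truncated input: unclosed <div>, unquoted attribute, cut-off closing tag.
broken = '<div><a href="/docs">Docs<img src=logo.png><script src=\'app.js\'></scr'
for link in extract_links(broken):
    print(link)
```

This sketch records only the first link attribute in each tag and does not skip comments or script bodies, which is the usual trade-off of pattern matching compared with a real parser.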

Can it turn relative links into absolute ones?

Yes. Provide a base URL and relative paths are resolved against it in the output.
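As a rough illustration of what resolving against a base URL means, the sketch below uses Python's standard `urllib.parse.urljoin`. The `resolve` helper and the example.com base URL are hypothetical, and the tool itself may differ in edge cases.

```python
from urllib.parse import urljoin

def resolve(url, base_url=None):
    """Resolve url against base_url; leave it untouched when no base is given."""
    return urljoin(base_url, url) if base_url else url

base = "https://example.com/blog/post.html"  # hypothetical page address

examples = [
    "/img/a.png",                    # root-relative
    "style.css",                     # relative to the page's directory
    "../about",                      # parent directory
    "//cdn.example.net/y.js",        # protocol-relative
    "https://cdn.example.net/x.js",  # already absolute, returned unchanged
]
for ref in examples:
    print(ref, "->", resolve(ref, base))
```

Without a base URL, the helper returns each value exactly as it appeared in the markup.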

Is the HTML I paste uploaded?

No. Extraction happens entirely in your browser, so the page content never leaves your device.

What does the output look like?

It returns JSON with links classified by element type, making it easy to filter by category.
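The exact JSON schema is not documented here, so the following Python sketch assumes a hypothetical shape: a flat list of objects with `element`, `attr`, and `url` keys. Under that assumption, filtering by category or by host takes only a few lines; adjust the key names to match the real output.

```python
import json
from urllib.parse import urlparse

# Hypothetical sample output; the real field names may differ.
sample_output = """
[
  {"element": "a",      "attr": "href",   "url": "https://example.com/docs"},
  {"element": "script", "attr": "src",    "url": "https://cdn.example.net/app.js"},
  {"element": "iframe", "attr": "src",    "url": "https://ads.example.org/frame"},
  {"element": "form",   "attr": "action", "url": "https://example.com/submit"}
]
"""
links = json.loads(sample_output)

def by_element(links, *elements):
    """Keep only the links that came from the given element types."""
    return [link for link in links if link["element"] in elements]

def external(links, own_host):
    """Keep only absolute links whose host differs from own_host."""
    return [
        link for link in links
        if urlparse(link["url"]).hostname not in (None, own_host)
    ]

# Scripts and iframes loaded from other hosts, e.g. for a quick security review.
third_party = external(by_element(links, "script", "iframe"), "example.com")
for link in third_party:
    print(link["element"], link["url"])
```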

Keywords

  • html
  • links
  • extract
  • href
  • src
  • urls
  • scrape
  • crawl