url_crawl

Function get_urls

pub fn get_urls(html: &str) -> impl Iterator<Item = &str>

Extracts the links from an HTML string. The returned links can be used to HTTP/2 push the linked resources.

It returns links from:

  • <link> nodes whose rel attribute is not preconnect
  • all nodes with a src attribute
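The two rules above can be illustrated with a minimal, self-contained sketch. It is not the crate's implementation: sketch_get_urls and attr_value are hypothetical names, the sketch assumes that the link of a <link> node is its href attribute, and it only handles double-quoted attributes separated by a single space. It also returns a Vec rather than an iterator to keep the example short.

```rust
/// Hypothetical helper (not part of url_crawl): returns the value of the
/// double-quoted attribute `name` inside the text of one tag, if present.
fn attr_value<'a>(tag: &'a str, name: &str) -> Option<&'a str> {
    // The leading space keeps `src` from matching inside `data-src`.
    let needle = format!(" {}=\"", name);
    let start = tag.find(&needle)? + needle.len();
    let len = tag[start..].find('"')?;
    Some(&tag[start..start + len])
}

/// Hypothetical stand-in for get_urls that applies the two rules listed above
/// with a naive tag scanner instead of a real HTML parser.
fn sketch_get_urls(html: &str) -> Vec<&str> {
    let mut urls = Vec::new();
    let mut rest = html;
    while let Some(open) = rest.find('<') {
        let after = &rest[open + 1..];
        let close = match after.find('>') {
            Some(i) => i,
            None => break, // unterminated tag: stop scanning
        };
        let tag = &after[..close];
        rest = &after[close + 1..];

        if tag.split_whitespace().next() == Some("link") {
            // Rule 1: <link> nodes, skipping rel="preconnect".
            // Assumption: the link itself is taken from href.
            if attr_value(tag, "rel") != Some("preconnect") {
                if let Some(href) = attr_value(tag, "href") {
                    urls.push(href);
                }
            }
        } else if let Some(src) = attr_value(tag, "src") {
            // Rule 2: any other node that carries a src attribute.
            urls.push(src);
        }
    }
    urls
}
```

Under these assumptions, a page containing a stylesheet <link>, a preconnect <link>, an <img> and a <script> would yield the stylesheet, image and script links, and skip the preconnect one.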

TODO: Add detection of background-image and other CSS links.