Skip to content

Instantly share code, notes, and snippets.

@davidhavard
Created April 20, 2018 14:35
Show Gist options
  • Select an option

  • Save davidhavard/10a246aa90f808d5f2fc87134a18a280 to your computer and use it in GitHub Desktop.

Select an option

Save davidhavard/10a246aa90f808d5f2fc87134a18a280 to your computer and use it in GitHub Desktop.
Recursively extract image urls from html files, sort them and remove duplicates
grep -roP '<img.+src="\K[^"]+' * | sed 's/.*://g' | sort | uniq > images.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment