Let's say I have a large website which may have a number of email addresses on it that are getting picked up by spammers. I plan to obfuscate or remove them all.
What's the easiest way to crawl my website to find any email addresses I may be exposing?
Either through on-page text (which Google can pick up, but not very well) or mailto: links (which Google can't).
Something like:
then
Should return a list of all pages (with line numbers) that contain your domains email addresses.
Might need tweaking.
Manuals for wget and grep