VBcoders Browse New Submit Contact Sign In

No account? Register free

Forgot password?

EmailExtractor1

RegX  (15 Submissions)   Internet/HTML   Visual Basic 3.0   Advanced   Wed 3rd February 2021

Crawls through urls extracting unique Email addresses as it goes.
This version only follows href tags, and does
limited relative path to full path conversion
i.e it does not properly folow relative links having ../
I will post a new version soon that extracts all
urls and has better relative path support.
Code is heavily commented and Shows how to use:
regular expressions
inet control
dictionary objects
listboxes

Inputs
Just give it a starting URL and press go

Assumes
Must have reference to Regular expresions and Scripting runtime I recomend regular expresions 5.5 which you will find the download link to in the global declarations

Returns
A list of email addresses that can then be saved/appended to a text file

Side Effects
although I limit cashed URLS to 5000 this script can cunsume quite a bit of memory. It would be much better to use a DB to Cash URLs and Emails (less memory) and this would also allow cach to exist between program invocations, but for this simple demo I used listboxes.

Rate EmailExtractor1 (6(6 Vote))

Download EmailExtractor1

EmailExtractor1 Comments

No comments yet — be the first to post one!

Post a Comment

0/1000 characters