
EmailExtractor1


Crawls through URLs, extracting unique email addresses as it goes.
This version only follows href attributes and does
limited relative-path to full-path conversion,
i.e. it does not properly follow relative links containing ../
I will post a new version soon that extracts all
URLs and has better relative path support.
The code is heavily commented and shows how to use:
regular expressions
the Inet control
dictionary objects
listboxes
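
The crawl step described above can be sketched roughly as follows. This is an illustration in Python, not the author's VB6 code from the attachment; the two regular expressions, the `extract` function, and the sample URL are assumptions made for the example. Sets stand in for the dictionary objects that keep entries unique, and `urljoin` resolves relative links, including the ../ case that this version of the program does not handle.

```python
import re
from urllib.parse import urljoin

# Simplified patterns for illustration only; real-world email and href
# matching has more edge cases than these cover.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
HREF_RE = re.compile(r'href\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)


def extract(page_url, html, seen_emails, seen_urls):
    """Return (new_emails, new_urls) found in one page.

    seen_emails and seen_urls are sets that persist across calls,
    so an address or link is only reported the first time it appears.
    """
    new_emails, new_urls = [], []

    for email in EMAIL_RE.findall(html):
        key = email.lower()  # compare case-insensitively
        if key not in seen_emails:
            seen_emails.add(key)
            new_emails.append(email)

    for href in HREF_RE.findall(html):
        # Resolve relative links (including ../) against the page URL.
        full = urljoin(page_url, href)
        # Only queue web pages; this skips mailto:, javascript:, etc.
        if not full.startswith(("http://", "https://")):
            continue
        if full not in seen_urls:
            seen_urls.add(full)
            new_urls.append(full)

    return new_emails, new_urls
```

Calling `extract` once per downloaded page, with the same two sets each time, gives the queue of pages still to visit and the growing list of unique addresses.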

Original Author: RegX

Inputs

Just give it a starting URL and press Go.

Assumptions

The project must have references to
Regular Expressions and the Scripting Runtime.
I recommend Regular Expressions 5.5; you will find the download link in the global declarations.

Returns

A list of email addresses that can then be saved/appended to a text file
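
As a rough illustration of the save/append step, here is a minimal Python sketch. It is not the author's VB6 routine; the function name `append_emails` and the one-address-per-line layout are assumptions.

```python
def append_emails(path, emails):
    """Append each address on its own line, creating the file if needed."""
    # Mode "a" appends, so addresses saved by earlier runs are kept.
    with open(path, "a", encoding="utf-8") as f:
        for email in emails:
            f.write(email + "\n")
```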

Side Effects

Although I limit cached URLs to 5000, this program can consume quite a bit of memory.
It would be much better to use a DB to cache URLs
and emails (less memory), and this would also allow
the cache to persist between program invocations, but for this simple demo I used listboxes.
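
The database-backed cache suggested above might look something like the following sketch. It is written in Python with SQLite purely as an illustration; the posted program does not do this, and the table names and the `open_cache`, `remember_url`, and `remember_email` helpers are hypothetical.

```python
import sqlite3


def open_cache(path):
    """Open (or create) the cache database and make sure both tables exist."""
    conn = sqlite3.connect(path)
    # PRIMARY KEY makes the database enforce uniqueness for us.
    conn.execute("CREATE TABLE IF NOT EXISTS urls (url TEXT PRIMARY KEY)")
    conn.execute("CREATE TABLE IF NOT EXISTS emails (email TEXT PRIMARY KEY)")
    conn.commit()
    return conn


def remember_url(conn, url):
    """Store a URL; return True only if it had not been seen before."""
    cur = conn.execute("INSERT OR IGNORE INTO urls (url) VALUES (?)", (url,))
    conn.commit()
    return cur.rowcount == 1


def remember_email(conn, email):
    """Store an address (lower-cased); return True only if it is new."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO emails (email) VALUES (?)", (email.lower(),)
    )
    conn.commit()
    return cur.rowcount == 1
```

Opening the cache with a file path rather than `":memory:"` keeps the visited URLs and collected addresses on disk, so a later run can pick up where the previous one stopped.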

About this post

Posted: 2002-06-01
By: ArchiveBot
Viewed: 70 times

Categories

Visual Basic 6

Attachments

EmailExtra25738912001.zip
Posted: 9/3/2020 3:45:00 PM
Size: 5,528 bytes

