Topic: Request: (Email) Extractor With Options

mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 40,896
Request: (Email) Extractor With Options
« on: January 30, 2008, 08:24 AM »
One of the things I do occasionally at DC is take all the bounced email from the last newsletter, "extract" all the email addresses, and then remove them from future mailing lists.

I currently use a freeware email extractor, but I'd like to see one with some more features.

This should be a simple project and I can code it myself in a day, but I thought I'd put it out there in case someone wants to try it, and maybe make it more general.

So here is my idea for a more general extractor:

0) Should be a GUI utility (yes, I know this would be easy to do with a command-line script).
1) Let the user drag and drop or add a number of files to process as input.
2) Show results in a memo with a button to save to file.
3) Options for specifying the regular expressions (multiple) to extract from each line if found (the program should maybe have a helper to let you select common things like the regex for an email).
4) Options for specifying a list of regular expressions to EXCLUDE from matching (i.e. if one of these is found inside a regex match from step 3, ignore that match).
5) Option to reject duplicates.
6) Save all options above on exit.
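
For anyone picking this up, here is a rough sketch of how steps 3 to 5 could fit together, leaving out the GUI and the saved options. It is only an illustration: the function name `extract`, its parameters, the sample lines, and the simple address pattern are all assumptions, not part of the request, and the pattern is nowhere near a full email-address validator.

```python
import re

# Hypothetical default pattern; the request leaves the actual patterns to the user.
EMAIL_PATTERN = r"[\w+-]+(?:\.[\w+-]+)*@\w+(?:\.\w+)+"

def extract(lines, include_patterns, exclude_patterns=(), reject_duplicates=True):
    """Collect matches of any include pattern, skipping excluded and repeated ones."""
    includes = [re.compile(p) for p in include_patterns]
    excludes = [re.compile(p) for p in exclude_patterns]
    seen = set()
    results = []
    for line in lines:
        for inc in includes:
            # Step 3: every include pattern may match several times per line.
            for match in inc.finditer(line):
                text = match.group(0)
                # Step 4: ignore a match if any exclude pattern is found inside it.
                if any(exc.search(text) for exc in excludes):
                    continue
                # Step 5: optionally keep only the first occurrence of each match.
                if reject_duplicates:
                    if text in seen:
                        continue
                    seen.add(text)
                results.append(text)
    return results

# Made-up sample input, standing in for lines read from the dropped files.
sample = [
    "Delivery to alice@example.com failed",
    "From: mailer-daemon@lists.example.org",
    "Retry for alice@example.com and bob.smith@example.net",
]
found = extract(sample, [EMAIL_PATTERN], exclude_patterns=[r"^mailer-daemon@"])
print(found)
```

The exclude patterns are tested against the matched text rather than the whole line, which is how step 4 reads; matching them against the full line instead would be a one-line change. Duplicate detection here is case-sensitive, which may or may not be what a mailing-list cleanup wants.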

Anyone else have any ideas to add to it?  Anyone want to try to code this?

Jibz

  • Developer
  • Joined in 2005
  • ***
  • Posts: 1,187
Re: Request: (Email) Extractor With Options
« Reply #1 on: January 30, 2008, 10:17 AM »
print "$1\n" if /([\w+-]+(?:\.[\w+-]+)*@\w+(?:\.\w+)+)/
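
For readers who don't speak Perl: the line above prints the first address-like match on a line, presumably when run once per input line (for example under `perl -n`, which is an assumption since the post does not show the invocation). A rough Python equivalent, with the helper name `first_email` and the sample strings made up for illustration:

```python
import re

# Same pattern as the Perl one-liner above, compiled once.
EMAIL = re.compile(r"([\w+-]+(?:\.[\w+-]+)*@\w+(?:\.\w+)+)")

def first_email(line):
    """Return the first address-like match in the line, or None if there is none."""
    m = EMAIL.search(line)
    return m.group(1) if m else None

print(first_email("550 user unknown: <jane.doe+news@mail.example.com>"))
```

Note that the domain part uses `\w+`, so an address whose domain contains a hyphen (such as `bob@my-host.example.com`) is not matched by this pattern as written.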


mouser

Re: Request: (Email) Extractor With Options
« Reply #2 on: January 30, 2008, 11:44 AM »
heheheh that smilie really captures your smart alecky nature..  ;D