Topic: RegexCaptor - Simple app to extract email addresses or other patterns from text

mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 38,547
This is a very simple beta release of a program to extract email addresses, or any other text matching a regular expression, from text files.




Motivation:

I needed to extract email addresses from bounced emails in order to remove them from the DonationCoder mailing lists.  This is a fairly simple job for a command-line regular expression extractor tool, but I like to be able to drag and drop and get some visual interaction.

I tried a few "free" tools for doing this and they were ALL adware, shareware, or feature-limited.  Just horrible.  I don't know when we got to a point where people think they can list software unambiguously as "free" and have it be filled with adware or be horribly crippled until you buy the full version.  :down:

So I decided to write my own tool, with hopes of improving it.  The goals are similar to CodeByters Linebyter, which I have used in the past but whose source code was lost.

Again, this is a very simple tool, but it has a few minor features that make it useful for specific tasks:
  • You can create your own list of common regular expression search patterns and select between them easily.
  • You can specify a portion of the expression that should be extracted and listed.
  • You can specify additional patterns to be ignored (in regex or plaintext format).
  • The final list is sorted and duplicates are removed.
  • Easy to search multiple files; remembers file list.
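
The features listed above can be sketched in a few lines of Python. This is only an illustration of the general technique, not RegexCaptor's actual code: the function name `extract_matches`, the loose email pattern, and the sample bounce messages are all assumptions made up for the example. Each string in the list stands in for the contents of one file.

```python
import re

# Hypothetical, deliberately loose email pattern (not RFC-complete).
# Group 1 is the portion of the match that gets extracted and listed.
EMAIL_PATTERN = r"<?([A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,})>?"


def extract_matches(texts, pattern, group=1, ignore_regexes=(), ignore_plain=()):
    """Return the sorted, de-duplicated captures found in all texts.

    Captures that contain any plain-text ignore string, or that match
    any ignore regex, are dropped.
    """
    compiled = re.compile(pattern)
    ignore_compiled = [re.compile(p) for p in ignore_regexes]
    found = set()  # a set removes duplicates as we go
    for text in texts:
        for match in compiled.finditer(text):
            value = match.group(group)
            if any(plain in value for plain in ignore_plain):
                continue
            if any(rx.search(value) for rx in ignore_compiled):
                continue
            found.add(value)
    return sorted(found)


# Made-up sample input: each entry stands in for one bounced-mail file.
bounces = [
    "Delivery failed for <alice@example.com>: mailbox full",
    "Delivery failed for <bob@example.org>; reported by mailer-daemon@example.net",
    "Delivery failed for <alice@example.com>: mailbox full",
]

results = extract_matches(bounces, EMAIL_PATTERN, ignore_regexes=[r"^mailer-daemon@"])
print(results)
```

Collecting captures in a set and sorting once at the end keeps the de-duplication and ordering steps separate from the matching loop.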

Again, this is a very niche tool, but I may add features to it to make it more useful for other tasks.  If you already have a good regular expression "extractor" that you are happy with, this is unlikely to replace it.
« Last Edit: May 05, 2018, 09:33 AM by mouser »

IainB

  • Supporting Member
  • Joined in 2008
  • **
  • Posts: 7,054
  • Slartibartfarst
Oooh! That's rather nifty. Could come in rather handy. Thank you.

IainB

Works rather well, and has helpful suggestions/favourites, etc.    :Thmbsup:

By the way:
  • it shows up in DcUpdater as: v1.01.9, with no web version (yet).
  • the executable file is shown as: v1.1.2.0
  • the GUI "About" says it is: v1.01.02
  • the Help file (Overview) says: v1.01.01 - Apr 23, 2013 <-- !
  • the Help file (Version History) says: v1.01.01 - Apr 23, 2018
« Last Edit: May 04, 2018, 09:47 PM by IainB »

Ath

  • Supporting Member
  • Joined in 2006
  • **
  • Posts: 3,160
Sounds like a useful tool, much more convenient than extracting stuff using Notepad++ :Thmbsup:

Quote from mouser: "The goals are similar to CodeByters Linebyter which I have used in the past but whose source code was lost."
That link isn't accessible for me, but that could be just me...


wraith808

  • Supporting Member
  • Joined in 2006
  • **
  • Posts: 9,439
Nope.  Not just you.

mouser

I have corrected the link to the old Linebyter program.

mouser

As of now, RegexCaptor doesn't provide much, if anything, that other tools don't.

But I am open to feature requests if there are problems it might solve that other tools don't -- it would be fun if we could figure out some features that made the tool genuinely useful over other tools.
« Last Edit: May 05, 2018, 11:17 AM by mouser »

Ath

A useful feature could be to apply some automatic formatting to the results list, maybe similar to regex replace, where you can re-use the found result to create completely new text:
Template:
Code:
<mailout name="\1" type="mailto">mailto://\1</mailout>
Result list:
Code:
<mailout name="mouser@mousing.com" type="mailto">mailto://mouser@mousing.com</mailout>
<mailout name="me@mailinator.com" type="mailto">mailto://me@mailinator.com</mailout>

(OT: The code=xml or code=html forum tags don't seem to work as intended, reverting to 'text' for now)
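
One way the template idea above could work is sketched below in Python. This is a hedged illustration, not how RegexCaptor implements (or will implement) the feature: the helper name `format_results`, the loose email pattern, and the sample input line are assumptions. It relies on the standard `Match.expand()` method, which substitutes backreferences such as `\1` in a template string.

```python
import re

# Template from the post above; \1 refers to the first capture group.
TEMPLATE = r'<mailout name="\1" type="mailto">mailto://\1</mailout>'

# Hypothetical, deliberately loose email pattern with one capture group.
PATTERN = re.compile(r"([A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,})")


def format_results(text, pattern, template):
    """Expand the template once per unique match, in order of first appearance."""
    seen = set()
    lines = []
    for match in pattern.finditer(text):
        if match.group(0) in seen:
            continue  # skip duplicate matches
        seen.add(match.group(0))
        lines.append(match.expand(template))  # replaces \1 with group 1
    return lines


# Made-up sample input using the addresses from the example result list.
text = "Bounced: mouser@mousing.com and me@mailinator.com, again mouser@mousing.com"
formatted = format_results(text, PATTERN, TEMPLATE)
for line in formatted:
    print(line)
```

Note that `Match.expand()` interprets every backslash escape in the template, so a template that needs a literal backslash would have to double it.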

mouser

I think that's a great idea, I will add it.

wraith808
