topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • December 10, 2018, 10:56 PM
  • Proudly celebrating 13 years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: Extract ascii  (Read 811 times)

harryray

  • Participant
  • Joined in 2018
  • *
  • default avatar
  • Posts: 2
    • View Profile
    • Donate to Member
Extract ascii
« on: May 09, 2018, 10:05 AM »
Is there a search string so that I can use regexcaptor to extract just ascii characters from a file?...I currently use a couple of programmes called bintext and extracttext to do this.

I have absolutely no idea how to write a regex!!

Thanks

Ath

  • Supporting Member
  • Joined in 2006
  • **
  • Posts: 3,291
    • View Profile
    • Donate to Member
Re: Extract ascii
« Reply #1 on: May 09, 2018, 02:05 PM »
Well, the simplest regex to extract textual content as 'words' from (binary) files would be: \w+
If you only want alphanumeric characters (no accent- or punctuation- characters), that could be something like [a-zA-Z0-9]+

A great site to learn about and try out Regular Expressions has been https://regex101.com for me, with an extended explanation of what's going on with the regex you're trying, and an index of all expression elements available.