topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Friday December 13, 2024, 10:56 pm
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: Document Analysis Software  (Read 4010 times)

patteo

  • Charter Member
  • Joined in 2005
  • ***
  • Posts: 437
    • View Profile
    • Read more about this member.
    • Donate to Member
Document Analysis Software
« on: October 02, 2006, 02:36 AM »
I would like to be able to analyze a document by extracting all the words and dumping them into a table which sorts the occurrence of words by the order of frequency.

Of course it would be preferable if you could eliminate commonly occurring words and numbers - ie there should be a way to have an exclusion table where you can specify the words and numbers to be excluded (preferably with wildcards)

I was thinking that with information overload that we all faced nowadays, it would be nice if I could cut an article off a webpage or anywhere else for that matter and then paste it into such a software and get a very quick overview of the emphasis.

Of course the next step is to be able to see the frequency occurrence of two or 3 word phrases.

Does anyone know of the existence of such a utility or perhaps write one ?




mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 40,914
    • View Profile
    • Mouser's Software Zone on DonationCoder.com
    • Read more about this member.
    • Donate to Member
Re: Document Analysis Software
« Reply #1 on: October 02, 2006, 12:24 PM »
there are tools for this, i believe the proper term is "concordance tools"

for example:
http://www.concordancesoftware.co.uk/

let us know if you find other software.

Cro-Code

  • Participant
  • Joined in 2006
  • *
  • default avatar
  • Posts: 2
    • View Profile
    • Donate to Member
Re: Document Analysis Software
« Reply #2 on: October 02, 2006, 02:41 PM »
Hello
Let me recommend our product developed exactly for this goal
http://www.cro-code.com/textanz.jsp
Textanz calculates frequencies of single words, phrases or wordforms. There is a possibility to sort results by length, frequency, alphabetically etc. You can also export results to external file (CSV).
Well there is no much to say here - you are welcome to visit a product page, see screenshot and try this tool.
Registered users receives version updates without additional charge. Next version is coming soon with concordance calculator (shows all words in context, including frequency=1).

mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 40,914
    • View Profile
    • Mouser's Software Zone on DonationCoder.com
    • Read more about this member.
    • Donate to Member
Re: Document Analysis Software
« Reply #3 on: October 02, 2006, 02:45 PM »
very nice - and welcome to the site.
i see you also have an ms word addin as well, cool  :Thmbsup:

Cro-Code

  • Participant
  • Joined in 2006
  • *
  • default avatar
  • Posts: 2
    • View Profile
    • Donate to Member
Re: Document Analysis Software
« Reply #4 on: October 03, 2006, 03:12 AM »
very nice - and welcome to the site.
i see you also have an ms word addin as well, cool  :Thmbsup:
Thank you for the warm greeting.
Yes, we offer add-in too. However standalone program has reached much better success, at least for now. It is more powerful, easier to install and not tied to MS Office. Even active Word users choose Textanz, although we have a number of add-in users as well.