topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Friday April 19, 2024, 11:53 am
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: IDEA: File and folder name spelling checker and corrector + folder clusterizer  (Read 3230 times)

tmpusr

  • Member
  • Joined in 2005
  • **
  • Posts: 154
  • Instantiation stuck in meatspace with no backup
    • View Profile
    • Donate to Member
The problem: You have a bunch of misspelled files and folders with no caps, extra words and characters, etc. But you also have many (usually the majority) correctly spelled versions of those names. You want everything to be spelled in the same, correct way.
Solution 1: You do it all by hand, making no use of those GHzs.
Solution 2: You're a coder and let code fix it.

1 List all files and folders. (Perhaps using some existing database like Locate32 or Everything.)
2 Find items in the list that appear multiple times.
3 Do a fuzzy match to items in step 2 with something like http://www.tgries.de/agrep/
4 Rename items found in step 3 to item names in step 2, or if the correct match is uncertain, i.e. if there's no clear majority of one spelling style, let the user select. Optionally let the user also confirm each renaming.

It should probably clean up the names before doing matching by removing certain known-to-be-extraneous words and characters.
A log and undo.

While you're at it, you could optionally gather every folder that has multiple matching words (sequential or appearing at any location) in its name into the folder that already has the greatest number of such folders.

Option: select folders and trees to include.
« Last Edit: April 24, 2010, 01:56 AM by tmpusr »