Post New Requests Here / Re: comparing two big different lists of strings/filenames
« Last post by compn on May 17, 2024, 11:10 AM »
fuzzy matching is the next feature.
Image File: <appdir>icons\Ping.png
Application: ping.exe
Application Arguments: -t <var>
Start in:
Button Caption: Ping -t
Command Window: Skip (Direct Launch)
Ignore: Variable Whitespace
When trying to run that command, I'm met with the following error message:
Launch cancelled. Application file listed bellow is missing.
ping.exe
Alice
  First list:
    Alice [1982]
    Alice [1991]
  Second list:
    Alice (1988).mkv

Alice in Wonderland
  First list:
    Alice in Wonderland [1999]
    Alice in Wonderland [2010]
  Second list:
    Alice in Wonderland (1903).mkv

All Quiet on the Western Front
  First list:
    All Quiet on the Western Front [1979]
  Second list:
    All Quiet on the Western Front (1930).mkv
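For anyone who wants to reproduce a grouping like the one above, here is a minimal Python sketch. It is not the tool discussed in this thread. It assumes the year always comes last, either as `[1982]` or as `(1988).mkv`, and the helper names (`bare_title`, `group_by_title`) are made up for illustration.

```python
import re
from collections import defaultdict

# Assumption: entries end in "[YYYY]" or "(YYYY)" plus an optional file extension.
YEAR_SUFFIX = re.compile(r"\s*[\[(]\d{4}[\])](\.\w+)?$")

def bare_title(entry):
    """Strip the trailing year (and extension) and lowercase the rest."""
    return YEAR_SUFFIX.sub("", entry).strip().lower()

def group_by_title(first, second):
    """Group entries from both lists under their bare title.

    Only titles that occur in BOTH lists are returned, and each group
    remembers which list every entry came from.
    """
    groups = defaultdict(lambda: {"first": [], "second": []})
    for entry in first:
        groups[bare_title(entry)]["first"].append(entry)
    for entry in second:
        groups[bare_title(entry)]["second"].append(entry)
    return {t: g for t, g in groups.items() if g["first"] and g["second"]}

first_list = [
    "Alice [1982]",
    "Alice [1991]",
    "Alice in Wonderland [1999]",
    "Alice in Wonderland [2010]",
    "All Quiet on the Western Front [1979]",
]
second_list = [
    "Alice (1988).mkv",
    "Alice in Wonderland (1903).mkv",
    "All Quiet on the Western Front (1930).mkv",
]

common = group_by_title(first_list, second_list)
for title, group in common.items():
    print(title)
    print("  First list:", group["first"])
    print("  Second list:", group["second"])
```

Unlike plain `uniq -d`, each group keeps track of which list an entry came from.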
this already exists i think. it's just uniq -d
...
but that wouldn't tell me which list has the duplicate [...]
-compn (May 16, 2024, 12:18 AM)
I see Vic is on the task, but I think what you're asking for is usually called fuzzy string matching. Try a Web search; there seems to be plenty of Python work on it. Also take a look at "Comparing Strings Is Easy With FuzzyWuzzy".
-rjbull (May 15, 2024, 05:23 PM)
Output:
print(results)
    sample_name         actual_name             score
0   jtsports            JT Sports LLC           79.0
1   tombaseball         Tom Baseball Inc.       81.0
2   context express     Context Express LLC     95.0
3   zb sicily           ZB Sicily LLC           95.0
4   lightening express  Lightening Express LLC  95.0
5   fire roads          Fire Road Express       86.0
6   NaN                 Earth Treks             NaN
7   NaN                 TS Sports LLC           NaN
8   NaN                 MM Baseball Inc.        NaN
9   NaN                 Contact Express LLC     NaN
10  NaN                 AB Sicily LLC           NaN
11  NaN                 Lightening Roads LLC    NaN
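To try this kind of fuzzy matching without installing anything, here is a rough sketch using `difflib` from the Python standard library as a stand-in for FuzzyWuzzy. The scores will not match the table above, since the scoring method is different. The `best_match` helper and its `cutoff` value are assumptions made for illustration.

```python
from difflib import SequenceMatcher

def similarity(a, b):
    """Return a similarity ratio between 0.0 and 1.0, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def best_match(name, candidates, cutoff=0.6):
    """Return (candidate, score) for the closest candidate.

    If nothing reaches the cutoff, the candidate is None, which plays
    the same role as the NaN rows in a merged results table.
    The cutoff of 0.6 is an arbitrary starting point, not a recommendation.
    """
    if not candidates:
        return None, 0.0
    score, match = max((similarity(name, c), c) for c in candidates)
    return (match, score) if score >= cutoff else (None, score)

second_list = [
    "Alice (1988).mkv",
    "Alice in Wonderland (1903).mkv",
    "All Quiet on the Western Front (1930).mkv",
]

match, score = best_match("Alice in Wonderland [1999]", second_list)
print(match, round(score, 2))
```

The cutoff needs tuning against real data: too low and unrelated titles pair up, too high and near-misses get dropped.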
- First, the MovieList for direct movie comparison.
- Second, Fuzzy-matching for adding more/partial results.
Tokens + ignore list of words
-paradisusvic (May 15, 2024, 07:38 PM)

this already exists i think. it's just uniq -d
Funiq (fuzzy uniq) is a command line tool for performing fuzzy string matching against lists of words.
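
One possible reading of the "tokens + ignore list of words" idea, sketched in Python. This is a guess at the approach, not how the tool in this thread or Funiq actually works. The `IGNORE` set and the choice to drop four-digit years are assumptions made for illustration.

```python
import re

# Hypothetical ignore list: filler words and file extensions.
IGNORE = {"the", "a", "an", "of", "mkv", "avi", "mp4"}

def tokens(name):
    """Split a name into lowercase word tokens, minus ignored words and years."""
    words = re.findall(r"[a-z0-9]+", name.lower())
    return {
        w for w in words
        if w not in IGNORE and not re.fullmatch(r"\d{4}", w)  # assumption: 4 digits = year
    }

def token_overlap(a, b):
    """Jaccard overlap of the two token sets: 1.0 means identical tokens."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)

print(token_overlap("All Quiet on the Western Front [1979]",
                    "All Quiet on the Western Front (1930).mkv"))
print(token_overlap("Alice [1982]", "Alice in Wonderland (1903).mkv"))
```

Working on token sets means word order and punctuation stop mattering, which suits filenames. The downside is that short titles such as "Alice" partially overlap with every longer title that contains the same word.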