Messages - David.P [ switch to compact view ]

Pages: prev1 2 3 4 5 [6] 7 8 9 10 11 ... 42next
26
I can feel your pain.

This also looks interesting, but unfortunately is not yet able to actually show/highlight the fuzzy text matches that it detects in the background:
https://www.tools4noobs.com/online_tools/string_similarity/

27
Aaaarggghhhh, this is 2019 and BeyondCompare still can't wrap text to the width of the respective window  :down: :down: :down: :down: :down: :down: :down: :mad: :mad: :mad: :mad: :mad: :mad:

This unfortunately makes BeyondCompare completely UNUSABLE for text analysis (other than for program code, possibly).

This is also valid similarly for ExamDiff Pro. While ExamDiff Pro can (sort of) wrap text, the output is not practical for text analysis.

Summing up my experience: while practically all off-line tools like the ones mentioned above might be well suitable for comparing program code versions, they are completely and utterly unusable for text analysis.

This is what the comparison output of the typical off-line program looks:
2019-02-10_130557.png

Below is what the comparison output of Similarity Texter looks, and this is what I'm after:
2019-02-10_131038.png

28
Thanks, I see now that at least ExamDiff Pro says that it has some sort of fuzzy comparison.

In the meantime I found this, from the makers of Vroniplag. IMHO this tool blows everything else out of the water, at least when you're looking for text comparison with lots of moved blocks.

I will try ExamDiff Pro  and  BeyondCompare and report back.

Additionally, I am now looking for a tool that can find and highlight repeated blocks of text within a single file. So far, the only solution I found is Textanz:

8N28iVO[1].jpg

Textanz however only highlights repeated blocks in one single color which makes it very tedious to follow up when there are lots of (possibly nested) repetitions.

29
Hi all,

I was just going to report that I tested some online and off-line (*.exe) diff tools again. Thereby, the installed tools, like the latest versions of WinMerge and Meld, proved to have no chance against the best online tools. Both WinMerge and Meld fail completely as soon as the texts become just a little dissimilar.

I checked like a dozen online comparison tools, and the best ones (by far) still seem to be Vroniplag (great multi-color highlighting of identical textblocks including moved block detection), and wikEd diff (highly configurable, including moved block detection). Vroniplag even lets you copy the results including all the color highlighting into Word, however only if the comparison is done with Internet Explorer. Both with Chrome and Firefox, this does not seem to work properly.

Copyleaks.com seems to be even more sophisticated since it at least tries to detect similarity also when there are small differences like for example OCR errors. Also, you can click on text passages and it will highlight the respective passage in the other text. However, it is a) terribly slow as compared to the other two mentioned above, b) has no multi-color coding for quick overview, c) breaks longer texts into multiple pages that you have to click through, and d) you can't quickly edit one or both of your texts and redo the comparison with the edited texts.

I am yet to discover a service or tool that uses some kind of fuzzy logic or artificial intelligence in order to spot and visibly highlight similarities in a "stepless" manner, for example by using some sort of heat map color highlighting of the similarities.

Cheers
David

30
Coming back to this ten years later, and it is still not fixed.

If someone sends me a mail from Outlook and I copy and paste the mail text somewhere else, I usually get like four paragraph marks (one of them followed by a non-breaking space) between every single paragraph

Is that because even the guys at Project FixOutlook.org took a bribe with that silly giant poster that Microsoft sent them (instead of fixing Outlook) -- which apparently froze them with a foolish grin instead of moving on with kicking Microsoft's ass?

--
Edit, JFTR:
I asked if they could make TB! start any editor of your choice, i.e., present the message you were working on as a file for your preferred text editor to work on.  That's the way many systems worked back in the Fidonet DOS days.
http://forums.mozillazine.org/viewtopic.php?p=2934663&sid=7d54a912985816e6bcbe6a780573bee2#p2934663

Pages: prev1 2 3 4 5 [6] 7 8 9 10 11 ... 42next
Go to full version