Welcome Guest.   Make a donation to an author on the site September 18, 2014, 02:45:46 AM  *

Please login or register.
Or did you miss your validation email?


Login with username and password (forgot your password?)
Why not become a lifetime supporting member of the site with a one-time donation of any amount? Your donation entitles you to a ton of additional benefits, including access to exclusive discounts and downloads, the ability to enter monthly free software drawings, and a single non-expiring license key for all of our programs.


You must sign up here before you can post and access some areas of the site. Registration is totally free and confidential.
 
View the new Member Awards and Badges page.
   
   Forum Home   Thread Marks Chat! Downloads Search Login Register  
Pages: [1]   Go Down
  Reply  |  New Topic  |  Print  
Author Topic: Beta new version of TrID file identifier - now with batch scanning & renaming  (Read 9637 times)
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« on: June 01, 2006, 04:20:19 AM »

I just finished a new beta (but seems pretty stable) version of my freeware TrID file identifier, for anyone who want to try it.
It bases it's analysis on a library of definitions, that for each filetype lists a series of patterns and uniques tokens, so that it can guess to with a certain file is more similar, irregardless of the file's name & extension.
Here's a typical results:

[copy or print]
C:\TrID>trid \windows\media\chimes.wav

TrID/32 - File Identifier v1.72b - (C) 2003-06 By M.Pontello
Definitions found:  1956
Analyzing...

Collecting data from file: \windows\media\chimes.wav
 50.0% (.WAV) RIFF/WAVe standard Audio (4008/2)
 49.9% (.) Generic RIFF container (4000/1)

The current/stable version of TrID use a library of XML files for the defs, so it take some time to load & parse them at startup. For this new version, instead, I used a binary container, so that now it's almost instantaneous. Also, now it's possible to scan an entire folder of files:

[copy or print]
C:\TrID>trid \pbcc\bin\*.exe

TrID/32 - File Identifier v1.72b - (C) 2003-06 By M.Pontello
Definitions found:  1956
Analyzing...

File: \pbcc\bin\CCEdit.exe
 33.6% (.EXE) Win32 Executable PowerBASIC/Win 7.x (235131/25/18)

File: \pbcc\bin\PBCC.exe
 84.9% (.EXE) Win16 NE executable (generic) (34068/22/9)

File: \pbcc\bin\PBRes.exe
 61.2% (.EXE) WIN32 Executable PowerBASIC/CC 3.02 (393928/51/44)

File: \pbcc\bin\PBrow.exe
 38.0% (.EXE) Win32 Executable PowerBASIC/Win 7.x (235131/25/18)

File: \pbcc\bin\RC.exe
 72.1% (.EXE) Win32 Executable MS Visual C++ (generic) (37706/45/16)

Eventually TrID can also rename the scanned files adding the guessed filetype extensions.
This come useful, for example, when you have a bunch of files recovered with CHKDSK, and the type of each file isn't immediately clear.
So if you have a folder with files like:

FILE0001.CHK
FILE0002.CHK
FILE0003.CHK
...

Running something like:
[copy or print]
C:\TrID>trid \myfolder\* -ae
will rename them to:

FILE0001.CHK.doc
FILE0002.CHK.xls
FILE0003.CHK.gif
...

Here's the download link: TrID 1.72b (290KB)
That include the TrID's executable and a package with defs for over 1.900 filetypes.
Just unpack in a folder, and run.

I plan to publish a stable version in a couple of days on TrID's page, as long as the updated versions of the companion tools (like the one that take the XML defs and create the single package, etc.).
A Linux port is also almost complete / ready, thanks to the migration of the code base from PowerBASIC to the free / Open Source FreeBASIC.

Hope it will be useful to someone.

P.S.
It's and will be free for personal / non profit use, off course.

Bye!
Logged

rjbull
Charter Member
***
Posts: 2,758

View Profile Give some DonationCredits to this forum member
« Reply #1 on: June 01, 2006, 05:25:09 AM »

Mark0,

you might want to take a quick look at
Eric Phelps' UnCHK, which has a brief discussion of CHK files.  TrID will be a lot more powerful, though; it has so many more file types as standard.

[edit: link fixed, sorry]

« Last Edit: June 01, 2006, 05:57:21 AM by rjbull » Logged
f0dder
Charter Honorary Member
***
Posts: 8,774



[Well, THAT escalated quickly!]

see users location on a map View Profile WWW Read user's biography. Give some DonationCredits to this forum member
« Reply #2 on: June 01, 2006, 05:53:27 AM »

I think you need to fix the link, rjbull smiley
Logged

- carpe noctem
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #3 on: June 01, 2006, 06:20:36 AM »

Thanks rjbull, I have already seen UnCHK (maybe searching for "file identifier" or read about it somewhere).

Bye!
Logged

Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #4 on: June 03, 2006, 07:35:36 AM »

I uploaded a new beta, with some minor additions / fixes.
It now check for the defs package in the current dir first, and eventually on the TrID's exe folder.
Alternatively, it's possible to use a specific one trough a switch.

C:\TrID>trid -?

TrID/32 - File Identifier v1.74b - (C) 2003-06 By M.Pontello

Usage: TrID [path]<filespec(s)...> [-r:nn] [-v] [-p] [-w]
                                   [-d:file] [-?]

Where: <filespec> Files to identify/analyze
       -ae        Add guessed extension to filename
       -ns        Disable unique strings check
       -r:nn      Display the first nn matches (default: 5)
       -v         Verbose mode - display def name, author, etc.
       -d:file    Use the specified defs package
       -w         Wait for a key before exiting
       -?         This help!


Bye!
Logged

Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #5 on: June 04, 2006, 11:49:26 AM »

Finished with the beta, that become the new v2.00.



Link: TrID file identifier

Bye!
Logged

mouser
First Author
Administrator
*****
Posts: 33,356



see users location on a map View Profile WWW Read user's biography. Give some DonationCredits to this forum member
« Reply #6 on: June 04, 2006, 02:58:01 PM »

looks great mark0
Logged
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #7 on: June 06, 2006, 06:36:23 PM »

Thanks mouser!

Here's another news, just added:



cheesy

Bye!
Logged

f0dder
Charter Honorary Member
***
Posts: 8,774



[Well, THAT escalated quickly!]

see users location on a map View Profile WWW Read user's biography. Give some DonationCredits to this forum member
« Reply #8 on: June 07, 2006, 03:15:26 AM »

Cute - linux version wouldn't have happened with PowerBASIC smiley
Logged

- carpe noctem
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #9 on: June 07, 2006, 03:23:15 AM »

Exactly!  Thmbsup

Bye!
Logged

Cavalcader
Charter Member
***
Posts: 192


Live Long & Prosper

View Profile Give some DonationCredits to this forum member
« Reply #10 on: June 16, 2006, 12:16:41 PM »

TrID is a cool program -- I've used the v1 series on occasion over the last few years. I'm looking forward to trying the new one!
Logged

My Linguistic Profile:
  40% General American English
  30% Yankee
  20% Dixie

What Kind of American English Do You Speak?
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #11 on: February 19, 2011, 10:22:17 AM »

I have updated TrID a couple of days ago. The v2.10 mainly fix the problem with filesize > 2GB (a classic!), and add an option switch to change files extensions (in addition to the existing one that added the guessed extension).
As before, it's available both as a Win32 and Linux executable.

Mark0.net - Soft - TrID

P.S. The library of TrID's definition now cover over 4.000 filetypes.
Logged

Cavalcader
Charter Member
***
Posts: 192


Live Long & Prosper

View Profile Give some DonationCredits to this forum member
« Reply #12 on: February 19, 2011, 11:54:47 AM »

Thanks for the news! Is the GUI version 1.80 still good?
Logged

My Linguistic Profile:
  40% General American English
  30% Yankee
  20% Dixie

What Kind of American English Do You Speak?
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #13 on: February 19, 2011, 11:56:28 AM »

Sure. The engine is still the same; just keep it fed with the newest definitions, and it's OK.
Logged

worstje
Honorary Member
**
Posts: 555



The Gent with the White Hat

View Profile Give some DonationCredits to this forum member
« Reply #14 on: February 19, 2011, 01:04:03 PM »

I saw this tool before and wanted to comment, but I think I never did. I love this sort of tool - it's useful, full of byte-juggling and all that stuff. Way more fun than dorky GUI stuff. smiley

I've been meaning to ask: how does TrID compare to builtin linux tools? If I recall properly linux has a file command that does the exact same thing, and there's probably a Windows port for that command too.
Logged
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #15 on: February 19, 2011, 02:25:22 PM »

I think that the main difference is how the library of filetypes is updated.
The idea with TrID was to develop something that had no fixed rules, and basically relied on definitions created by scanning a number of files of a certain type, and automatically detecting recurring patterns.

So, for example, you just have to get some ODT files (the more, the better, usually), run TrIDScan against them, and you endup with a new definition tailored for those files. Then you can edit it, add some info (filetype descriptions, and URL with reference info, etc.), maybe remove the "obviously unimportant patterns & strings" (due to some bytes that just happened to correspond in the small data set analyzed, eventually), and the job is done.

Since it's very easy to create new definitions, I think that probably TrID recognize more filetypes than "file" (the tool).
Anyway, TrID's approach is a simple one that does seems to give some good results. But It's certainly not perfect; for example it definitely not very good with text files, because it need at least some fixed patterns.

BTW, I'm working on something completely different that I believe will results in a much better & complete file identification system, but I haven't anything ready for prime time yet...
Logged

Cavalcader
Charter Member
***
Posts: 192


Live Long & Prosper

View Profile Give some DonationCredits to this forum member
« Reply #16 on: February 20, 2011, 02:08:12 AM »

Sure. The engine is still the same; just keep it fed with the newest definitions, and it's OK.
Are you saying that the GUI version doesn't have an issue with filesize > 2GB? Not that it's something I run into; just curious. smiley

BTW, I'm working on something completely different that I believe will results in a much better & complete file identification system, but I haven't anything ready for prime time yet...
Looking forward to seeing it when it's ready.  Cool
« Last Edit: February 20, 2011, 02:11:47 AM by Cavalcader » Logged

My Linguistic Profile:
  40% General American English
  30% Yankee
  20% Dixie

What Kind of American English Do You Speak?
Mark0
Charter Honorary Member
***
Posts: 593


see users location on a map View Profile WWW Give some DonationCredits to this forum member
« Reply #17 on: February 20, 2011, 05:01:53 AM »

Are you saying that the GUI version doesn't have an issue with filesize > 2GB? Not that it's something I run into; just curious. smiley

Yes, there was/is no problems with big files for TrIDNet.
Logged

Pages: [1]   Go Up
  Reply  |  New Topic  |  Print  
 
Jump to:  
   Forum Home   Thread Marks Chat! Downloads Search Login Register  

DonationCoder.com | About Us
DonationCoder.com Forum | Powered by SMF
[ Page time: 0.044s | Server load: 0 ]