Controlling certain facts in a folder
Contro:
I have a folder containing subfolders and files.
I would like to find software that can:
a) Count the number of pages per file type. For example: the number of pages in the PDF files. Or even by group of files: the number of pages in the PDF and DOC files together.
b) For audio files, report the duration in minutes.
c) For video files, report the duration in minutes, and sum the durations of all video files in the folder and its subfolders.
I know some utilities that show the size of files, but I would like one that handles these special parameters.
Best regards
Any partial solution is good for me too: for example, a program that counts the pages of the PDF files in a folder and its subfolders.
:P
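For the partial case asked about here (pages of PDF files in a folder and its subfolders), a rough sketch in Python is possible with the standard library alone. It is only a heuristic: it counts `/Type /Page` objects visible in the raw bytes, so it will undercount PDFs that store their objects in compressed streams, and a proper PDF parsing library would be more reliable. The function names are made up for illustration.

```python
import re
from pathlib import Path

# Matches "/Type /Page" but not "/Type /Pages" (the page-tree node).
PAGE_OBJECT = re.compile(rb"/Type\s*/Page(?![a-zA-Z])")


def estimate_pdf_pages(data: bytes) -> int:
    """Rough page count: number of page objects visible in the raw bytes.

    Assumption: the page objects are not hidden inside compressed
    object streams. If they are, this returns too low a number.
    """
    return len(PAGE_OBJECT.findall(data))


def count_pages_in_tree(root: str) -> dict[str, int]:
    """Map every PDF under root (recursively) to its estimated page count."""
    counts = {}
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() == ".pdf":
            counts[str(path)] = estimate_pdf_pages(path.read_bytes())
    return counts
```

Summing `count_pages_in_tree(folder).values()` then gives an estimated total for the whole folder; any file reporting 0 pages is a sign that the heuristic did not apply and the file should be checked by hand.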
IainB:
Interesting questions for some. Non-trivial.
What do you intend to use the data (sums and counts) for? Does it matter how accurate it is?
I quite like the idea of being able to count or "stocktake" things like this.
It seems to be a classic accounting problem, but I don't have any experience or knowledge of how it might be done in this specific case. I suppose estimation could be a pragmatic approach - rather than actual physical counting, I mean.
Off the top of my head (so apologies if this seems a bit rushed):
Document files:
Depending on the accuracy required, I think it might be useful - if not necessary - for document files to have some definition.
For example:
* to define what is meant by the unit "page" (e.g., A4, Legal, A2, A3, etc.) - so storage unit size would be defined.
* to establish what languages/alphabets you will have in those documents (different alphabet systems may have different packing densities).
* to define what font and point-size you are assuming is used - so density per page could be a concept.
* to define average word-length.
* to define what max, min and average word density would be estimated for the classification of a page-unit. (e.g., do you want to call something with only 5 words on it "A page"?)
* to establish how to cope with pictures (images) in a document, and whether they cover a part of a page (and how much) or a whole page, have captions, headers, etc.
* to establish how to cope with handwriting in a document.
* to establish how to cope with documents (e.g., .PDF or Word files) which have no actual text but only images of pages with words on (this could imply the need for OCRing the documents).
* do you need right-to-left or left-to-right reading/parsing, or both?
* do you have landscape or portrait oriented pages, or both?
* what to do with a frequency estimate for blank pages?
Then you might need to have (say) a function to define the typical density of words, by page.
Physical paper pages could be various sizes, but I suspect you'd have to define a normative/standard size.
Audio files:
Not really sure about these.
It should be possible to use the standard tags of some formats (e.g., MP3) to get the duration (time). I'm not sure, but that might even be a file property for audio files - if so, then Windows Explorer would presumably be able to display it as a column in Details view, same as file "Comments".
Video files:
Not sure at all about these.
Do they use standard tags for things like duration (time)? (I don't know.)
You might like to ask the question over at Quantified Self, where they have been looking at similarly knotty problems - e.g., Effect of One-Legged Standing on Sleep
Mind you, I reckon some of their theories haven't got a leg to stand on.
skwire:
b) In audios file control the duration in minutes.
c) In video files control the duration in minutes. Able to sum durations of all video files inside the folder and subfolders.-Contro (December 13, 2011, 01:58 AM)
--- End quote ---
Check out PlayTime.
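For anyone who prefers a script, the same total can be sketched in Python. This is not how PlayTime works; it is a separate approach that assumes the `ffprobe` command from FFmpeg is installed and on the PATH. The extension list and the function names are assumptions for illustration.

```python
import subprocess
from pathlib import Path

# Assumption: adjust this set to the media types actually in the folder.
MEDIA_EXTENSIONS = {".mp3", ".wav", ".amr", ".mp4", ".avi", ".mkv"}


def probe_duration(path) -> float:
    """Ask ffprobe for the container duration of one file, in seconds.

    Raises an exception if ffprobe fails or reports no usable duration.
    """
    out = subprocess.run(
        ["ffprobe", "-v", "error",
         "-show_entries", "format=duration",
         "-of", "default=noprint_wrappers=1:nokey=1",
         str(path)],
        capture_output=True, text=True, check=True,
    ).stdout
    return float(out.strip())


def total_minutes(root, probe=probe_duration) -> float:
    """Sum the durations of all media files under root, in minutes."""
    total_seconds = 0.0
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix.lower() in MEDIA_EXTENSIONS:
            total_seconds += probe(path)
    return total_seconds / 60
```

The `probe` parameter exists only so that the folder walk can be tried out without FFmpeg, by passing a stand-in function that returns a fixed number of seconds.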
Contro:
Interesting questions for some. Non-trivial.
What do you intend to use the data (sums and counts) for? Does it matter how accurate it is?
-IainB (December 13, 2011, 03:09 AM)
--- End quote ---
I have seen something like this on Google: tools that count the number of pages of the PDF files contained in a folder. They are usually shareware.
What for?
I have received a request from the judge of my city about a case I opened with a USB key of digitized documentation, telling me I must present it in written form within 48 hours.
With Windows Explorer I have found:
362 pdf
88 word docs
216 eml files
671 jpg
497 png
4 amr files
etc.
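A tally like the one above can also be produced by a short script instead of by hand in Windows Explorer. This is a minimal sketch in Python using only the standard library; the function name and the "(no extension)" label are assumptions for illustration.

```python
from collections import Counter
from pathlib import Path


def count_by_extension(root: str) -> Counter:
    """Count the files under root (including subfolders) per extension."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            # Lower-case so that ".PDF" and ".pdf" land in the same bucket.
            counts[path.suffix.lower() or "(no extension)"] += 1
    return counts
```

Calling `count_by_extension(folder).most_common()` returns the extensions sorted from most to fewest files, which can be printed one per line to get a list in the same shape as the one above.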
Contro:
Check out PlayTime.
-skwire (December 13, 2011, 08:57 AM)
--- End quote ---
I am going