Messages - vevola

Below I provide a mockup of what the UI might look like, and here's a breakdown of what I'd like (ideally).
First I want to a) create subsets (i.e. all rows having a certain value in the cells of a column) and then b) compare subsets.

I provide a CSV file to be processed by your program, which would allow me to create subsets (extracting rows based on conditions I set).
Then the program compares the subsets, matches columns of numbers (timestamps) against two other columns (also timestamps of beginning and end, indicating a range) and then compiles a CSV file.

1) I have timestamps of when people are looking at each other
2) I have timestamps of when these people move
I want to see, when people were looking at one person, if that person was moving.

The data are various labels and things, but more importantly they have timestamps of beginnings and endings of things. I want to see when there are timestamps of things that are
a) contained in other timestamps,
b) overlapping, or
c) near each other (depending on my threshold).
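A minimal Python sketch of the three checks (contained, overlapping, near), assuming each event and each range is an onset/offset pair in milliseconds. The function name `relation` and the `before_ms`/`after_ms` parameters are made up for illustration and simply mirror the threshold idea.

```python
def relation(onset, offset, start, end, before_ms=0, after_ms=0):
    """Classify the event [onset, offset] against the range [start, end].

    Returns "contained", "overlapping", "near" or None.
    """
    # a) the whole event lies inside the range
    if start <= onset and offset <= end:
        return "contained"
    # b) the event and the range share at least one instant
    if onset <= end and offset >= start:
        return "overlapping"
    # c) no overlap, but the gap is within the chosen threshold
    if onset <= end + after_ms and offset >= start - before_ms:
        return "near"
    return None


print(relation(1200, 1500, 1000, 2000))                 # contained
print(relation(1800, 2400, 1000, 2000))                 # overlapping
print(relation(2100, 2300, 1000, 2000, after_ms=200))   # near
```

A single point in time can be passed with the same value for `onset` and `offset`.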

Here is a sample of the data:

The output can be a simple CSV file.

The interface would include the options included below and could be something like this:


Create SUBSETS from CSV file, either based on an entire column or based on the contents of a cell.

1. Subset Name: _______ (*create a name for the subset)
Column: [dropdown list] (*with the column headers or number, or "ALL")
Cell: _______ (*type in text to find or leave blank)

2. Subset Name: _______
Column: [dropdown list]
Cell: _______

3. Subset Name: _______
Column: [dropdown list]
Cell: _______


[Import subset]
[Export subsets] (*as CSV files)

Choose what to match.

TEMPORAL MATCHES: Only columns with numeric data can be compared.
   This [SUBSET_NameX]: [COLUMN_list] (to [COLUMN_list] ) (*if you select just the first then it's a point; if you also select the second then it's understood as a time range)
   within [SUBSET_NameX] from [COLUMN_list] to [COLUMN_list]
   [] AND [] OR within [SUBSET_NameX] from [COLUMN_list] to [COLUMN_list]

[] Containing (*both onset and offset are within the range)
[] Overlapping (*either onset or offset is within the range)   
[] Threshold: ___ms before;  ___ms after


A note on the sample data.

Note that what you have here is already a subset of the original data, which would have been made by

Subset: All gaze_to_p1
Column: label
Cell: gaze_to_p1


Subset: All P1
Column: participant
Cell: p1

A query on this table might be to compare if the onset (just the beginning) of "gaze_to_P1" is within the range of all P1 "head_face" and all "upper_body":

Subset1: All gaze_to_P1
Column: label
Cell: gaze_to_p1

(NB: This could have also been Column:ALL and it would have given the same subset, but in this case maybe selecting the column would make it less processing intensive?)

Subset2: All P1
Column: participant
Cell: p1

The output could be a simple CSV file with an added column providing the match (or NA). Eventually, it might be interesting for this column to have the cells giving something like "gaze_to_P1 | head_face", "gaze_to_P1 | upper_body", i.e. I get to choose how the result is shown, specifically grabbing the cell content of the first and second member of the match.
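The query above could be sketched in Python roughly as follows. This is only an illustration under assumptions: the column names `onset` and `offset` and the four sample rows are invented here (only `label`, `participant`, `gaze_to_p1`, `p1`, `head_face` and `upper_body` come from the description), and `make_subset` and `add_matches` are hypothetical helper names.

```python
import csv
import io


def make_subset(rows, column, cell):
    """Keep rows whose `column` equals `cell`; "ALL" searches every column."""
    if column == "ALL":
        return [r for r in rows if cell in r.values()]
    return [r for r in rows if r[column] == cell]


def add_matches(points, ranges, out_column="match"):
    """Add a column listing every range row that contains the point row's onset."""
    result = []
    for p in points:
        onset = float(p["onset"])
        hits = [f'{p["label"]} | {r["label"]}' for r in ranges
                if float(r["onset"]) <= onset <= float(r["offset"])]
        result.append({**p, out_column: "; ".join(hits) if hits else "NA"})
    return result


# Invented sample rows, only to make the sketch runnable.
SAMPLE = """participant,label,onset,offset
p2,gaze_to_p1,1200,1900
p1,head_face,1000,1500
p1,upper_body,1100,1300
p2,gaze_to_p1,5000,5400
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
gaze = make_subset(rows, "label", "gaze_to_p1")   # Subset1
p1 = make_subset(rows, "participant", "p1")       # Subset2
matched = add_matches(gaze, p1)

# Write the result as CSV text with the extra "match" column.
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=list(matched[0].keys()))
writer.writeheader()
writer.writerows(matched)
print(out.getvalue())
```

Selecting a single column only compares one cell per row, whereas "ALL" compares every cell, so the single-column version does less work.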



Thanks! But I think it's not quite what I needed (although the word "range" probably piqued your attention!).

I want to see if one number or a range "fits" within another range.

But I can see how your app could be useful depending on what you're doing.

I have huge CSV files with numerical values and I want to see if there are any overlaps within a range.

1) I have ONsetX and I want to see if it's within the range of ONsetY and OFFsetY. If yes, let me see the pair.
2) I have ONsetX and OFFsetX, and I want to see if there's any overlap with the range from ONsetY to OFFsetY. If yes, let me see the pair.

I don't even see how to do this within Excel. Can anyone help?
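Outside Excel, the two cases could be sketched in Python like this, assuming each X and Y event has already been read from the CSV as an (onset, offset) tuple. The function names are invented for illustration.

```python
def onset_in_range_pairs(x_events, y_events):
    """Case 1: pairs where the X onset falls inside the Y range."""
    return [(x, y) for x in x_events for y in y_events
            if y[0] <= x[0] <= y[1]]


def overlap_pairs(x_events, y_events):
    """Case 2: pairs where the X range and the Y range overlap at all."""
    return [(x, y) for x in x_events for y in y_events
            if x[0] <= y[1] and y[0] <= x[1]]


X = [(5, 8), (20, 25)]
Y = [(4, 6), (7, 30)]
print(onset_in_range_pairs(X, Y))
print(overlap_pairs(X, Y))
```

This compares every X with every Y, which is simple but slow on huge files; sorting both lists by onset first would let the comparison stop early.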

The quickest thing I could find to connect them is to create a shortcut to the bluetooth device (in my case, BT speakers) and then from there click on "Listen to music" .

To be clear, here's a screenshot of what I get with the shortcut Windows 7 gives me:  or https://www.dropbox....02-13%2011.34.58.png

And from there I have to wait until it polls or something before I see the "Listen to music" link and I'm able to click it.

I'd like a quick way to connect/disconnect to this, eliminating those extra clicks.

Is your operating system on an SSD or HDD?

The output of the powercfg list command
Microsoft Windows [Version 6.1.7601]
Existing Power Schemes (* Active)
Power Scheme GUID: 381b4222-f694-41f0-9685-ff5bb260df2e  (Balanced)
Power Scheme GUID: 49ef8fc0-bb7f-488e-b6a0-f1fc77ec649b  (Dell)
Power Scheme GUID: 8c5e7fda-e8bf-4a96-9a85-a6e23a8c635c  (High performance)
Power Scheme GUID: a1841308-3541-4fab-bc81-f71556f20b4a  (Power saver) *
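For reference, a listing like this can be read programmatically. The Python sketch below only parses the text quoted above and reports which scheme carries the `*` marker; it does not call powercfg itself, and the names `parse_schemes` and `active_scheme` are made up for illustration.

```python
import re

# The listing exactly as quoted above.
LISTING = """\
Existing Power Schemes (* Active)
Power Scheme GUID: 381b4222-f694-41f0-9685-ff5bb260df2e  (Balanced)
Power Scheme GUID: 49ef8fc0-bb7f-488e-b6a0-f1fc77ec649b  (Dell)
Power Scheme GUID: 8c5e7fda-e8bf-4a96-9a85-a6e23a8c635c  (High performance)
Power Scheme GUID: a1841308-3541-4fab-bc81-f71556f20b4a  (Power saver) *
"""

# GUID, then the name in parentheses, then an optional "*" for the active scheme.
SCHEME = re.compile(
    r"Power Scheme GUID: ([0-9a-f-]{36})\s+\(([^)]*)\)[ \t]*(\*?)")


def parse_schemes(text):
    """Return a list of (guid, name, is_active) tuples."""
    return [(m.group(1), m.group(2), m.group(3) == "*")
            for m in SCHEME.finditer(text)]


def active_scheme(text):
    """Return (guid, name) of the scheme marked active, or None."""
    for guid, name, active in parse_schemes(text):
        if active:
            return guid, name
    return None


print(active_scheme(LISTING))
```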

Can you do a screengrab of PMT's output window when you run it both the first and second time?

Can you do a screengrab of both the Balanced and High Performance Advanced settings of your Power Options?

Here's an archive with those screengrabs:

Also, after running it once what is the value of the PMT key stored under HKLM\Software\Microsoft\Windows\CurrentVersion\RunOnce (use regedit.exe) ?

powercfg.exe /setactive a1841308-3541-4fab-bc81-f71556f20b4a

What is it after running PMT a second time, (it shouldn't be there) ?

And it's not.

Thanks for your help!

I'm running Win7 64x with "Balanced" power scheme.

To be honest, I tried using the revert, but now hibernation doesn't work at all. When it goes to sleep, I can't wake it up anymore. I see "resuming", and then a black screen and I have to force shut it down. I then see "delete restoration data" or "resume", and I have to "delete" for Windows to boot. Which also means that if I forget to save whatever I have open and go away for a while... well... I lose everything.

Any tips?

Well, it seems to first hibernate, and then on wake up it works...

Also... should I be running it *twice*? I'm not sure I understand your earlier post with the two screenshots...

cool! thanks!!

Just curious as to whether I should be "seeing" something (some type of visual feedback), because I don't...  :-[

rather than having to manually change the settings, is there a little app that will temporarily disable sleep/hibernation so as to drain all the battery?

Every word would be ok too. I'm not sure which words to exclude as of yet, so all words might be easier.

Are the match words ("red" and "one" in your examples) provided by the user? 

No, that was just as an example! :)

The text files are a lot longer (about 2000 lines).

Post New Requests Here / [Request] Tell me who said what first!
« on: July 26, 2011, 06:59 AM »
After a great experience with DonationCoder, I'm posting another request.

I have a series of transcribed conversations. Each text file has a series of lines which begin with an initial and a colon, which correspond to who says what. I would like to see what words are used by one speaker before the other speaker uses them, as well as other things like frequency and collocation.

So here's an example:

A: So, I really like all those dresses, especially this red and that green thing there.
B: Yeah, the red one is nice.
A: Which one are you gonna buy?
B: I'll get the red one.

Here's what I want to be able to get.

For A:
- [What words said first by A:]
   "red" was said first by A:
- [Collocation first occurrence]
   the first time A: said "red" was in line 1
- [Frequency for A:]
   A: said "red" a total of 1 time
- [Collocation for A:]
   A: said "red" in line 1
- [Frequency for B:]
   B: repeated "red" a total of 2 times
- [Collocation for B:]
   B: said "red" in lines 2, 4

For B:
- [What words said first by B:]
   "one" was said first by B:
- [Collocation first occurrence]
   the first time B: said "one" was in line 2
- [Frequency for B:]
   B: said "one" a total of 2 times
- [Collocation for B:]
   B: said "one" in lines 2, 4
- [Frequency for A:]
   A: repeated "one" a total of 1 time
- [Collocation for A:]
   A: said "one" in line 3

My conversations have 3 speakers though, which might make it trickier.

How I see this happening: If it's possible to isolate all lines which begin with A: or B:, I imagine it's relatively easy to make a word list which includes word frequency and collocation. Then you'd have to compare two of these lists (like A+B, B+C, A+C) and compare the line numbers of the first occurrence in each speaker by seeing which number is smaller (e.g. First occurrence "red": A: line 1; B: line 2 --> 1 is less than 2, hence A: said "red" before B).
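The approach described above could be sketched in Python as follows. It is a rough illustration, not a finished tool: it assumes every line has the form "X: text", treats words as lowercase runs of letters and apostrophes, and the names `index_words` and `first_speaker` are invented. It works for any number of speakers, since it takes the smallest first line number across all of them.

```python
import re
from collections import defaultdict

EXAMPLE = """\
A: So, I really like all those dresses, especially this red and that green thing there.
B: Yeah, the red one is nice.
A: Which one are you gonna buy?
B: I'll get the red one."""


def index_words(transcript):
    """Map word -> speaker -> list of 1-based line numbers where it occurs."""
    index = defaultdict(lambda: defaultdict(list))
    for line_no, line in enumerate(transcript.splitlines(), start=1):
        speaker, sep, text = line.partition(":")
        if not sep:
            continue  # not a "speaker: text" line
        for word in re.findall(r"[a-z']+", text.lower()):
            index[word][speaker.strip()].append(line_no)
    return index


def first_speaker(index, word):
    """Return (speaker, line) for the earliest occurrence of `word`."""
    return min(((spk, lines[0]) for spk, lines in index[word].items()),
               key=lambda pair: pair[1])


idx = index_words(EXAMPLE)
for word in ("red", "one"):
    speaker, line = first_speaker(idx, word)
    print(f'"{word}" was said first by {speaker}: in line {line}')
    for spk, lines in sorted(idx[word].items()):
        print(f'   {spk}: said "{word}" {len(lines)} time(s), in lines {lines}')
```

The frequency is the length of each line list, so a word repeated within one line is counted (and its line number listed) more than once.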

Any suggestions? Volunteers? :)

Yikes! I've been playing around with Qiqqa, but there seem to be a lot of glitches! It's uploading papers even when I asked it not to, there's no way to stop any type of operation, and well... I think I'm sticking to Mendeley and skwire's app!

BTW, @skwire I donated some $$ to you. It's not a lot, just a symbolic gesture. I encourage others to donate to coders as well! Thanks!

@vevola: You might be interested in this.
I see you use Foxit and the "reference management" programme Mendeley Desktop.
I have recently started using another reference management programme called Qiqqa, having tried Zotero and Mendeley - I found the latter two did not meet my requirements.

Thanks! I'll try it out!

You say it's not a polished app, I say it does what I asked for! :)

It would be nice to choose where to save the final results, for example, and maybe even keep a record of which files have already been scanned (maybe by having the app look only at files modified after the most recently modified file from the previous scan, if that makes sense) and just update the results file. Having the possibility to somehow exclude those false-positives from the scan would be nice too, I think.
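The "only look at newer files" idea could be sketched in Python like this. It is a guess at one way to do it, not how the app works: it assumes the previous scan stored a timestamp (seconds since the epoch), and `files_changed_since` is a hypothetical helper name.

```python
from pathlib import Path


def files_changed_since(folder, last_scan_time, pattern="*.pdf"):
    """Return files under `folder` whose modification time is after `last_scan_time`."""
    return sorted(p for p in Path(folder).rglob(pattern)
                  if p.stat().st_mtime > last_scan_time)
```

After each run, the newest modification time among the scanned files would be saved and passed in as `last_scan_time` the next time.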

But like I said, it's usable this way, at least for me and Suntsu!

If you ever come to Germany, look me up!

Would it be possible to choose where to save the final txt files? Even better, would it be possible to open Explorer with those files highlighted? tia!

would it help make the app better if i sent you some examples of false negatives?

@skwire btw, are you going to post it on your software website?

kudos to skwire! I'm happy it wasn't just for me!

you may be better off using something like File Hound to search through your PDFs.
I'm pretty sure Hound uses pdftotext too.  That probably means it ignores image files as being empty of text, which should at least make the search hands-off.

I'm not sure exactly how it works, but what I've noticed in some of my PDFs: it's a scan that the library makes (image) and then they add some type of "copyright" stamp which is text. So some PDFs would be principally images, but with just some text.


EDIT: Actually it is much better than what I wrote. There are false-positives, but probably not 1:3.

I think that unless you have any other tweaks, this may do! Thanks!

Thanks again!

Better: 225 vs. 1205

But I also see many false-positives (about 1/3).

So I temporarily moved the larger PDFs and did a search of only under 60MB and it worked (didn't try it with the very last version.)

However, the lists produced don't seem reliable: non-searchable 817; searchable 616.

(The ratio should be less).

Nope. I still get errors, and indeed eventually it crashes.

The largest of my PDFs are 340MB, 210MB, 140MB, and all the others are under 100MB. Maybe exclude all PDFs over 100MB?
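The size cut-off suggested here could look like the following Python sketch. It illustrates the idea only and is not taken from the app; `pdfs_under_limit` and `max_mb` are invented names.

```python
from pathlib import Path


def pdfs_under_limit(folder, max_mb=100):
    """Split the PDFs under `folder` into (kept, skipped) by file size."""
    limit = max_mb * 1024 * 1024
    kept, skipped = [], []
    for p in sorted(Path(folder).rglob("*.pdf")):
        (kept if p.stat().st_size <= limit else skipped).append(p)
    return kept, skipped
```

Returning the skipped files as well makes it possible to list them in the log, so the large PDFs can be checked by hand.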

Thanks for the update!

I tried running it and after a couple of minutes I get the following error:
Error: Memory limit reached (see #MaxMem in the help file). The current thread will exit.
—> 083: Return,str

BTW, the folder I'm searching has about 4GB of PDFs, dunno if that means anything.

EDIT: I thought the program stopped, but it's still running in the background. I imagine the error was for a particular file, and not for the entire program. Any ideas of what it is? Will it show up in the final log? We'll see!  ;)
Thanks again!
