Messages - vevola [ switch to compact view ]

Pages: prev1 2 3 4 5 [6] 7 8 9 10 11 ... 21next
26
Post New Requests Here / [Request] Tell me who said what first!
« on: July 26, 2011, 06:59 AM »
After a great experience with DonationCoder, I'm posting another request.

I have a series of transcribed conversations. Each text file has a series of lines which begin with an initial and a semicolon which correspond to who says what. I would like to see what words are used by one speaker before the the other speaker uses them, as well as other things like frequency and collocation.

So here's an example:

A: So, I really like all those dresses, especially this red and that green thing there.
B: Yeah, the red one is nice.
A: Which one are you gonna buy?
B: I'll get the red one.

Here's what I want to be able to get.

For A:
- [What words said first by A:]
   "red" was said first by A:
- [Collocation first occurrence]
   the first time A: said "red" was in line 1
- [Frequency for A:]
   A: said "red" a total of 1 times
- [Collocation for A:]
   B: said "red" in lines 2, 4
- [Frequency for B:]
   B: repeated "red" a total of 2 times
- [Collocation for B:]
   B: said "red" in line 2

For B:
- [What words said first by B:]
   "one" was said first by B:
- [Collocation first occurrence]
   the first time B: said "one" was in line 2
- [Frequency for B:]
   B: said "one" a total of 2 times
- [Collocation for B:]
   B: said "one" in lines 2, 4
- [Frequency for A:]
   A: repeated "one" a total of 1 times
- [Collocation for A:]
   A: said "one" in line 3

My conversations have 3 speakers though, which might make it trickier.

How I see this happening: If it's possible to isolate all lines which begin with A: or B:, I imagine it's relatively easy to make a word list which includes word frequency and collocation. Then you'd have to compare two of these lists (like A+B, B+C, A+C) and compare the line numbers of the first occurrence in each speaker by seeing which number is smaller (e.g. First occurrence "red": A: line 1; B: line 2 --> 1 is less than 2, hence A: said "red" before B).

Any suggestions? Volunteers? :)


27
@IainB
Yikes! I've been playing around with Qiqqa, but there seems to be a lot of glitches! It's uploading papers even when I asked not to, there's no way to stop any type of operation, and well... I think I'm sticking to Mendeley and skwire's app!

BTW, @skwire I donated some $$ to you. It's not a lot, just a symbolic gesture. I encourage others to donate to coders as well! Thanks!

28
@vevola: You might be interested in this.
I see you use Foxit and the "reference management" programme Mendeley Desktop.
I have recently started using another reference management programme called Qiqqa, having tried Zotero and Mendelay - I found the latter two did not meet my requirements.

Thanks! I'll try it out!

29
You say it's not a polished app, I say it does what I asked for! :)

It would be nice to choose where to save the final results for example, and maybe even keep a record of which files have already been scanned (maybe by having the app look only at files after the soonest modified file from the previous scan - dunno if that makes sense) and just update the results file. Having the possibility to exclude somehow those false-positives from the scan I think would be nice too.

But like I said, it's usable this way, at least for me and Suntsu!

If you ever come to Germany, look me up!


30
Would it be possible to choose where to save the final txt files? Even better, would it be possible to open Explorer with those files highlighted? tia!

ps
would it help make the app better if i sent you some examples of false negatives?

Pages: prev1 2 3 4 5 [6] 7 8 9 10 11 ... 21next
Go to full version