You are viewing a single comment's thread.

view the rest of the comments →

0
1

[–] oneslyfox 0 points 1 point (+1|-0) ago  (edited ago)

I like this idea of putting the data through some comparative analysis, but ideally the control set would be a close to identical work environment - gonna do some searching, this could be super interesting and even useful for codifying certain terms for this perverse community (not ours <3, the creeps we're investigating lol).

0
1

[–] The_Periodic_Fable [S] 0 points 1 point (+1|-0) ago  (edited ago)

yes! A meta syntax analysis of the wikileaks emails could reveal a lot. Comparing the relative frequency of certain words and phrases to their occurrence in the English language in general. The first data point needed would be the total word count for the entire Wikileaks email set. Then do like this: the word "barbecue" appears once every 1,000,000 words in a wide sampling of English language emails, books, articles, etc. But the same word appears as 1 in 10,000 in the wikileaks emails. (just making up the numbers here)