You are viewing a single comment's thread.

view the rest of the comments →

0
1

[–] The_Periodic_Fable [S] 0 points 1 point (+1|-0) ago 

I intend to keep searching the Wikileaks emails with food related search terms. If you think about it, food is the perfect subject to serve as a cover code for pedophilia (or maybe even crime in general). Perhaps something more will come up. I'd like to do an analysis of how often certain food terms appear in the Wikileaks as opposed to in a similar sized database of work related emails. But I would not know how to get the proper second data set to compare with wikileaks.

0
1

[–] oneslyfox 0 points 1 point (+1|-0) ago  (edited ago)

I like this idea of putting the data through some comparative analysis, but ideally the control set would be a close to identical work environment - gonna do some searching, this could be super interesting and even useful for codifying certain terms for this perverse community (not ours <3, the creeps we're investigating lol).

0
1

[–] The_Periodic_Fable [S] 0 points 1 point (+1|-0) ago  (edited ago)

yes! A meta syntax analysis of the wikileaks emails could reveal a lot. Comparing the relative frequency of certain words and phrases to their occurrence in the English language in general. The first data point needed would be the total word count for the entire Wikileaks email set. Then do like this: the word "barbecue" appears once every 1,000,000 words in a wide sampling of English language emails, books, articles, etc. But the same word appears as 1 in 10,000 in the wikileaks emails. (just making up the numbers here)