[–] zbou 0 points 6 points (+6|-0) ago 

Please don't ever commit suicide

[–] jsprogrammer [S] 0 points 2 points (+2|-0) ago 

I do not plan to. Unfortunately, I have some personal experience with others choosing to :(

[–] AmaleksHairyAss 0 points 4 points (+4|-0) ago 

You may have misunderstood. He meant "Please don't shoot yourself in the back of your head, twice"

[–] WhiteRonin 0 points 4 points (+4|-0) ago 

Excellent idea!

Not your program, but coming here to get ideas on what to do!

This could have started out with a giant database of speeches that are already in text format. Throw a search library at it and bingo.

The next step is to find videos to match the speeches for people who are on tape.

Then, to get fancy, for speeches that are not on tape, find a library that converts text to speech.

Storage? Text can be stored even on a DO droplet or a cheap Amazon account. Going with Amazon would allow you to scale up.

I've had ideas over the years that sat dormant because I did what you did: nothing much on the project.

[–] jsprogrammer [S] 0 points 3 points (+3|-0) ago 

One issue is getting the text of the speeches. Do you know of a good source? (I don't, but I haven't really looked.) Due to the approaching election, I have been feeling that the primary interest would be in the most recent, real-time information.

I don't think the database can be built completely automatically. I think it will need to be built collaboratively by users. I want anyone to be able to contribute (though, there are concerns with data quality/integrity that need to be addressed).

I'd love to hear your ideas.

[–] WhiteRonin 0 points 0 points (+0|-0) ago 

I agree with the crowdsourcing concept, but the crowd needs a place to take their collective dump ;-)

[–] ashekchum 1 points 4 points (+5|-1) ago 

What are you going to do about storage?

What progress have you made in the 3 years you've worked on it? Because it doesn't sound like much.

A feature that would be nice is searching speeches, so you could query John Boehner and the word "boner" to hear about Boehner's boner from Boehner, or something like that.

[–] jsprogrammer [S] 0 points 7 points (+7|-0) ago 

I think the best initial approach will be to leave storage to the current major players (e.g. YouTube). The first iteration would be a database of metadata and links to the content. This would drastically reduce the cost and complexity of the project, while allowing the benefit of using highly available, existing systems.
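
To make the "metadata and links" idea concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The table name, column names, and helper functions are all assumptions made up for illustration, not the project's actual data model; the only point is that the database stores a link to the media and never the media itself.

```python
import sqlite3

# Hypothetical schema: only metadata and a link are stored;
# the media itself stays with the external host (e.g. YouTube).
SCHEMA = """
CREATE TABLE IF NOT EXISTS recordings (
    id          INTEGER PRIMARY KEY,
    speaker     TEXT NOT NULL,
    title       TEXT NOT NULL,
    recorded_on TEXT,              -- ISO 8601 date string
    host        TEXT NOT NULL,     -- name of the service holding the media
    url         TEXT NOT NULL UNIQUE
);
"""


def open_db(path=":memory:"):
    """Open (or create) the metadata database and ensure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def add_recording(conn, speaker, title, recorded_on, host, url):
    """Insert one metadata row and return its id."""
    cur = conn.execute(
        "INSERT INTO recordings (speaker, title, recorded_on, host, url) "
        "VALUES (?, ?, ?, ?, ?)",
        (speaker, title, recorded_on, host, url),
    )
    conn.commit()
    return cur.lastrowid


def links_for_speaker(conn, speaker):
    """Return (title, url) pairs for a speaker, oldest recording first."""
    rows = conn.execute(
        "SELECT title, url FROM recordings WHERE speaker = ? ORDER BY recorded_on",
        (speaker,),
    )
    return rows.fetchall()
```

The `UNIQUE` constraint on `url` is one simple guard against the same recording being submitted twice. It does nothing for the case where a host removes the content, which would still need some kind of backup.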

I have not been working on this specific project for three years. I have prototyped out some of the interface and worked on some data models. I have also been spending time working on other, not necessarily related, projects that I feel are also important. Many of these projects share common problems (mainly, the entire stack; from the back-end services, to network engineering, to front-end development). I have a pretty decent base project for web front-end work built up now, where I can easily begin development on new projects or prototypes in a near state-of-the-art environment. Additionally, I have been working on tools to launch and run everything necessary to host these projects on any of the cloud providers or, really, any collection of networked Linux machines.

> A feature that would be nice is searching speeches, so you could query John Boehner and the word "boner" to hear about Boehner's boner from Boehner, or something like that.

I think searching is a must. C-SPAN already provides some transcripts and I think YouTube does too, but I think human-editable transcripts will also be required. I know that similar systems already exist. Shows like The Daily Show record television channels 24/7 so they can search through the media to find their content. The awesome thing about what we can do on the web now is that anyone with a web browser will effectively be able to run their own Daily Show, incorporating existing media and live streams...all in a browser...broadcast all over the world.
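
For the search part, a rough sketch of the simplest thing that could work: an in-memory inverted index over transcript text, with an optional speaker filter. The class and method names are hypothetical, and a real system would more likely use an existing search library, so treat this only as an illustration of the "speaker plus word" query described above.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


class TranscriptIndex:
    """Toy inverted index: maps each word to the transcripts containing it."""

    def __init__(self):
        self._postings = defaultdict(set)  # word -> set of transcript ids
        self._speakers = {}                # transcript id -> speaker name

    def add(self, transcript_id, speaker, text):
        """Index one transcript under the given id and speaker."""
        self._speakers[transcript_id] = speaker
        for word in tokenize(text):
            self._postings[word].add(transcript_id)

    def search(self, query, speaker=None):
        """Return sorted ids of transcripts containing every query word,
        optionally restricted to a single speaker."""
        words = tokenize(query)
        if not words:
            return []
        hits = set.intersection(*(self._postings.get(w, set()) for w in words))
        if speaker is not None:
            hits = {t for t in hits if self._speakers[t] == speaker}
        return sorted(hits)
```

Since the index is rebuilt from the transcript text, a human edit to a transcript would only require re-indexing that one transcript (this sketch does not handle removing stale words, which a real implementation would need).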

[–] ashekchum 0 points 7 points (+7|-0) ago 

Such a system would require the host to be impartial, and hasn't YouTube proven recently that it isn't? If they removed a video, you would need a backup to re-upload it. That backup doesn't need to be web-linked, but it still means you would need access to some storage. Running a search algorithm for terms would likely require at least the transcripts to be hosted on your site for processing speed.

Really, what you need is funding, maybe Kickstarter/GoFundMe? But you would need a demo system working before then.

[–] WhiteRonin 0 points 2 points (+2|-0) ago 

So what is your stack? And what languages are you using?

I'm a practical type of guy and hate Google-like interviews ;-)

[–] [deleted] 0 points 0 points (+0|-0) ago 

[Deleted]

[–] OneTrueCube 0 points 3 points (+3|-0) ago 

Do you plan on all the files being simply streamable or will they be downloadable? What formats are you looking at?

There'd be a lot of potential to make sick mash-ups of political speeches with something like this.

[–] jsprogrammer [S] 0 points 2 points (+2|-0) ago 

Streaming would be the most common use, I would imagine. But, every file should be downloadable as well.

> What formats are you looking at?

I am primarily considering reusing services that offer an embedded media player, so whatever format they are using (probably H.264 or VP8/VP9). I don't have a strong opinion about a particular format, but I do think they should be in open codecs. However, I do think that the content is perhaps more important than the format of the media file. That said, transcoding or re-encoding offensive formats to something more palatable should be trivial.

> There'd be a lot of potential to make sick mash-ups of political speeches with something like this.

Oh, yeah :) And we will all be able to do it on any device with a browser!

[–] OneTrueCube 0 points 1 points (+1|-0) ago 

Awesome. Having support for .wav would be amazing, but I imagine it would be very space-intensive. Ogg is a pretty good open audio format that a lot of people can use too. Of course anyone worth their salt should be able to convert it to suit their needs, but having a high-quality source is best.

Thanks for sharing!

[–] alexkobold 0 points 2 points (+2|-0) ago 

Will your database be limited to US politicians?

Do you already have any code which is worth open-sourcing?

Will you be working on a feature that tries to determine whether a politician has contradicted themselves on a topic?

[–] jsprogrammer [S] 0 points 1 points (+1|-0) ago  (edited ago)

My initial focus would be US politicians as I am a US citizen and we have a very important election coming up. I do wish for this software to be usable by everyone, for their own local politics.

All of the code I write is open source and almost all of it (I try to push on commit, but sometimes I forget) is available on GitHub. I had begun prototyping a "Where Is Your Congressperson?" UI to explore some ideas. I also started early design and prototype work on a presidential primary fantasy league. Neither of these projects is documented for the public, so I offer my apologies in advance!

My ideal setup would be that anyone could launch such a system with incredible ease. I have spent some time working to be able to launch systems across most cloud providers [Note: You will not be able to launch anything from the web interface, because the proxies required to work around Docker's and CoreOS's lack of CORS-enabled API endpoints are not currently running; also, the project is not feature complete (though I have launched 900+ machine clusters with it)].

[–] ShitsInPringlesCans 0 points 2 points (+2|-0) ago 

I'm a high-end data wrangler / analyst specializing in analytics, and I do some programming on the side. I've got secure storage too. Need any help?

[–] jsprogrammer [S] 0 points 2 points (+2|-0) ago 

Yes. We need all the help we can get!

What do you think you can help with? What kind of storage? The data should be stored indefinitely, so we'd need to look at what kind of guarantees it can make and how easily it could be migrated as needed. It might make sense to build on a distributed storage system. That adds complexity, though it might be worth it for the durability.

[–] ashekchum 0 points 2 points (+2|-0) ago 

What are your political leanings? And will you try to keep the project impartial?

[–] jsprogrammer [S] 0 points 5 points (+5|-0) ago 

I used to call myself conservative/republican (middle -> high school), then libertarian (college), now, I don't use any labels. I believe in non-violence/non-coercion and science (largely of the Popper variety).

> And will you try to keep the project impartial?

I hope to have a design where partiality is not even possible. Any recording of any office-holder or person running to hold an office is what I want to collect, categorize, and make available.

[–] [deleted] 0 points 2 points (+2|-0) ago 

[Deleted]

[–] [deleted] 0 points 2 points (+2|-0) ago 

[Deleted]

[–] jsprogrammer [S] 0 points 2 points (+2|-0) ago  (edited ago)

Oh, definitely. I want to be able to pull from any and every source.

Edit: If you're interested in some drawings you can check this one out:

https://github.com/2016-fantasy/docs/blob/master/architecture/candidate_communications.svg

I had begun prototyping a fantasy league system for the primaries last summer, and that is part of an initial design for sourcing candidate communications. We can and will pull from any source that is digitizable (basically, everything).
