
Howdy, thanks for posting this.

It would be awesome to visualize Wikipedia edits over time. I don't really care about what the text says, just how blocks of it change over time. I am after the aesthetics present in the ever-flowing change of data. I think your script might be a good starting point. Think of the videos of flowers growing that compress months into a few seconds. Do something similar with Wikipedia edits.



I've been doing something similar with my blog, except it is currently for people who do care what the text says. The diff algo is tricky; I should have built up a larger corpus of material before designing it.

It looks at paragraphs, sentences, sub-sentence structures, and words. It even draws little sparkgraph-ish diagrams. It is not really that long (250 lines by wc), but tweaking it has been a huge time sink.
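
To give a rough idea of what a multi-level diff like this can look like, here is a minimal sketch, not the actual blog script. It assumes Python's standard difflib, a naive regex sentence splitter, and only two levels (sentences, then words); the function names are made up for illustration.

```python
import difflib
import re


def split_sentences(paragraph):
    # Naive splitter (an assumption): break after ., ! or ? followed by whitespace.
    return [s for s in re.split(r"(?<=[.!?])\s+", paragraph.strip()) if s]


def word_diff(old, new):
    # Compare two sentences word by word; return only the changed runs.
    a, b = old.split(), new.split()
    sm = difflib.SequenceMatcher(None, a, b, autojunk=False)
    return [(tag, a[i1:i2], b[j1:j2])
            for tag, i1, i2, j1, j2 in sm.get_opcodes() if tag != "equal"]


def sentence_diff(old_par, new_par):
    # Diff at sentence level first, then drill down to words where
    # a run of sentences was replaced by a run of the same length.
    a, b = split_sentences(old_par), split_sentences(new_par)
    sm = difflib.SequenceMatcher(None, a, b, autojunk=False)
    changes = []
    for tag, i1, i2, j1, j2 in sm.get_opcodes():
        if tag == "equal":
            continue
        if tag == "replace" and i2 - i1 == j2 - j1:
            for s_old, s_new in zip(a[i1:i2], b[j1:j2]):
                changes.extend(word_diff(s_old, s_new))
        else:
            # Whole sentences inserted, deleted, or replaced unevenly.
            changes.append((tag, a[i1:i2], b[j1:j2]))
    return changes


if __name__ == "__main__":
    old = "The cat sat on the mat. It was happy."
    new = "The cat sat on the rug. It was happy."
    print(sentence_diff(old, new))
```

A paragraph level would sit on top of sentence_diff in the same way. The hard part is the tuning: deciding when a heavily edited sentence should count as one rewrite rather than a pile of word changes.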

For an example of some heavy editing: http://kmkeen.com/inabow/2009-01-07-11-22-00.html



