What is this?
This shows the top 6 out of 9106 articles from 12 RSS tech news feeds since 2011-08-30 17:04:08 based on highest probability that the articles are dupes.
Digg.com articles are almost always flagged as the dupe since the odds that they contain original content is almost 0%.
Articles are fetched on the hour
TODO: weed out > 2 copies of the same article (nested dupes)
TODO: invert this list to create a 'clean' feed of articles (current try: http://badcheese.com/~steve/rss/rss_old.php)
TODO: check dupes on content to weed out false positives for low title hits
dupe check test
1
[0.66667]
2
[0.46154]
3
[0.46154]
4
[0.37500]
5
[0.36364]
6
[0.36364]
FEED: Amazon.com Gold Box Deals [DEL]
FEED: Boulder & County News [DEL]
FEED: City of Longmont What's New Rss Feed [DEL]
FEED: Digg / Technology [DEL]
FEED: Engadget [DEL]
FEED: Geekdad [DEL]
FEED: Gizmodo [DEL]
FEED: Hack a Day [DEL]
FEED: Hacker News [DEL]
FEED: Lifehacker [DEL]
FEED: Slashdot [DEL]
FEED: Working Dad: An Unauthorized Guide to Parenting [DEL]

Add an RSS feed url (Atom feeds won't work at the moment):