{"id":1245,"date":"2011-08-01T21:10:20","date_gmt":"2011-08-01T18:10:20","guid":{"rendered":"http:\/\/www.gatchev.info\/blog\/?p=1245"},"modified":"2011-08-01T21:10:20","modified_gmt":"2011-08-01T18:10:20","slug":"cleaning-wiki-spam","status":"publish","type":"post","link":"http:\/\/www.gatchev.info\/blog\/?p=1245","title":{"rendered":"Cleaning wiki spam!"},"content":{"rendered":"<p>There is a lot of wiki spam novadays. The wikis I host were close to being rendered useless by the amount of e-trash pumped in by spambots. Naturally, I had to do something.<\/p>\n<p>The first idea was to install a captcha extension. There are plenty for MediaWiki (all my wikis are powered by it). However, some wikis sport too old versions of MW to have a decent captcha, and one or two cannot or should not be upgraded. On some others, captchas are undesirable for various reasons. Also, a captcha will not clean the already present spam. So I needed a different solution.<\/p>\n<p>Happily, one of my hobbies is a <a href=\"http:\/\/apibot.zavinagi.org\">MediaWiki bot software<\/a>. I threw up a spam-cleaning script for it and set it over the wikis. The first attempts missed a lot of spam and had some false positives, but this was quickly fixed. It took me some more time to start reliably catching the spambot &#8220;valdalisms&#8221; (edits with no spam links, made to make the spambot blacklisting harder). Currently the script makes approximately one mistake per 1000 edits, or even less &#8211; that is, almost no cleaning handwork is left after it. I believe this is a good result. \ud83d\ude42<\/p>\n<p>And since good things should not stay idle, I decided to offer its abilities as a service. In short, I am offering cleaning from the spam MediaWiki-based wikis. If you need cleaning this e-muck, just email me (&#8220;grigor&#8221; at this site, that is, &#8220;gatchev.info&#8221;). I&#8217;d be glad to help.<\/p>\n<p>In case you insist on paying for the service, I accept <a href=\"http:\/\/en.wikipedia.org\/wiki\/Bitcoin\">Bitcoins<\/a> on address &#8220;1FvF2Y39HGjXxvhsmtLmt8oRMmicLqR561&#8221; (minus the quotes). An example fee could be 0.01 Bitcoin per 10 000 spams cleaned, but feel free to suggest a different one, if you like. \ud83d\ude42<\/p>\n<p>And&#8230; may never need this service! \ud83d\ude42<\/p>\n","protected":false},"excerpt":{"rendered":"<p>There is a lot of wiki spam novadays. The wikis I host were close to being rendered useless by the amount of e-trash pumped in by spambots. Naturally, I had to do something. The first idea was to install a captcha extension. There are plenty for MediaWiki (all my wikis are powered by it). However, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"hide_page_title":""},"categories":[],"tags":[],"_links":{"self":[{"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=\/wp\/v2\/posts\/1245"}],"collection":[{"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1245"}],"version-history":[{"count":0,"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=\/wp\/v2\/posts\/1245\/revisions"}],"wp:attachment":[{"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1245"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1245"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.gatchev.info\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1245"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}