Archive for September, 2007

Fighting the Robots

Saturday, September 29th, 2007 | Web Thinkering | No Comments

Being a “stats-freak”, I constantly check my log files and stats counter looking out for developing trends or an impending issue with the settings of my VPS. While checking on the stats of my most trafficked site, I noticed that several IP addresses has been suspiciously consuming a lot of my precious bandwidth! The largest of which has racked up a total of 7.12GB and the smallest in the group recorded 1.20Gb in Awstats

I immediately performed a reverse DNS lookup with these IP and I found out that 3 out of 10 belongs to Smartbro’s Wireless Internet Network. This could be acceptable since the system Smartbro is like a WAN or wide area network wherein they use a single gateway for an area and the clients/subscribers will be assigned Local IP’s.

The rest of the “dubious list” came from destiny internet (Philippines), Singapore and Australia. I’m more concerned with the IP from Makati Philippines since it is the one responsible for consuming up 7Gbs data.

I suspect that this particular person is trying to download my site for offline viewing or a spambot harvesting email addresses from my content! If left undetered, a few more activities like these and I’m on to some major headaches and empty pockets.:D

Thus I googled and landed in a thread at webmastersworld.com – A Close to perfect .htaccess ban list

Here is a copy of the code that you can append to your .htaccess file:

RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} ^BlackWidow [OR]
RewriteCond %{HTTP_USER_AGENT} ^Bot\ mailto:craftbot@yahoo.com [OR]
RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [OR]
RewriteCond %{HTTP_USER_AGENT} ^Custo [OR]
RewriteCond %{HTTP_USER_AGENT} ^DISCo [OR]
RewriteCond %{HTTP_USER_AGENT} ^Download\ Demon [OR]
RewriteCond %{HTTP_USER_AGENT} ^eCatch [OR]
RewriteCond %{HTTP_USER_AGENT} ^EirGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailSiphon [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailWolf [OR]
RewriteCond %{HTTP_USER_AGENT} ^Express\ WebPictures [OR]
RewriteCond %{HTTP_USER_AGENT} ^ExtractorPro [OR]
RewriteCond %{HTTP_USER_AGENT} ^EyeNetIE [OR]
RewriteCond %{HTTP_USER_AGENT} ^FlashGet [OR]
RewriteCond %{HTTP_USER_AGENT} ^GetRight [OR]
RewriteCond %{HTTP_USER_AGENT} ^GetWeb! [OR]
RewriteCond %{HTTP_USER_AGENT} ^Go!Zilla [OR]
RewriteCond %{HTTP_USER_AGENT} ^Go-Ahead-Got-It [OR]
RewriteCond %{HTTP_USER_AGENT} ^GrabNet [OR]
RewriteCond %{HTTP_USER_AGENT} ^Grafula [OR]
RewriteCond %{HTTP_USER_AGENT} ^HMView [OR]
RewriteCond %{HTTP_USER_AGENT} HTTrack [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Stripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Sucker [OR]
RewriteCond %{HTTP_USER_AGENT} Indy\ Library [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^InterGET [OR]
RewriteCond %{HTTP_USER_AGENT} ^Internet\ Ninja [OR]
RewriteCond %{HTTP_USER_AGENT} ^JetCar [OR]
RewriteCond %{HTTP_USER_AGENT} ^JOC\ Web\ Spider [OR]
RewriteCond %{HTTP_USER_AGENT} ^larbin [OR]
RewriteCond %{HTTP_USER_AGENT} ^LeechFTP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mass\ Downloader [OR]
RewriteCond %{HTTP_USER_AGENT} ^MIDown\ tool [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mister\ PiX [OR]
RewriteCond %{HTTP_USER_AGENT} ^Navroad [OR]
RewriteCond %{HTTP_USER_AGENT} ^NearSite [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetAnts [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^Net\ Vampire [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Octopus [OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Explorer [OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Navigator [OR]
RewriteCond %{HTTP_USER_AGENT} ^PageGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^Papa\ Foto [OR]
RewriteCond %{HTTP_USER_AGENT} ^pavuk [OR]
RewriteCond %{HTTP_USER_AGENT} ^pcBrowser [OR]
RewriteCond %{HTTP_USER_AGENT} ^RealDownload [OR]
RewriteCond %{HTTP_USER_AGENT} ^ReGet [OR]
RewriteCond %{HTTP_USER_AGENT} ^SiteSnagger [OR]
RewriteCond %{HTTP_USER_AGENT} ^SmartDownload [OR]
RewriteCond %{HTTP_USER_AGENT} ^SuperBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^SuperHTTP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Surfbot [OR]
RewriteCond %{HTTP_USER_AGENT} ^tAkeOut [OR]
RewriteCond %{HTTP_USER_AGENT} ^Teleport\ Pro [OR]
RewriteCond %{HTTP_USER_AGENT} ^VoidEYE [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Image\ Collector [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Sucker [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebAuto [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebCopier [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebFetch [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebGo\ IS [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebLeacher [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebReaper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebSauger [OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ eXtractor [OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ Quester [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebStripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebWhacker [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Wget [OR]
RewriteCond %{HTTP_USER_AGENT} ^Widow [OR]
RewriteCond %{HTTP_USER_AGENT} ^WWWOFFLE [OR]
RewriteCond %{HTTP_USER_AGENT} ^Xaldon\ WebSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^Zeus
RewriteRule ^.* - [F,L]

Basically this piece of code will restrict access to known site-ripppers, spambots and email harvester. Though I think there will always be other ways to get around this restriction it’s better to have something than none at all. Meanwhile I’ll keep a close watch on my logs and stats to see if the method will take effect or else I would ban the IP as a last result

Google Adsense offers Western Union Quick Cash to Philippine Publishers

Friday, September 28th, 2007 | Net Moolah | No Comments

The days of the “snail checks” will soon be over (err… at least for Adsense). At last there is a better alternative to receive our Adsense earnings. Filipino webmasters can now receive their adsense payments through Western Union Quick Cash. All you have to do is got to your Account Setting and edit your Payment Details. I have not yet confirmed if this option is available to all countries but I have heard that this option was first offered in Malaysia.

However, before you select this option, check if you have at least 2 valid IDs with picture. This will be required to collect your money from any Western Union Branch. From experience, Western Union personnel are quite strict with the requirements, a typo – error or misspelling from the identification that you present will cause you some problems in receiving your money.

We have heard of several Philippine Postal “nightmares” in the past months and the infamous tales of lost Adsense checks in which some were actually encashed! I tried to avoid such headaches by selecting the Secured Express Delivery through DHL, but this option will cost you $24 as courier fee.

The Western Union Quick cash option is a welcome relief for small publishers like me :D. $24 is quite a significant amount by Philippine standards as you can already pay for your monthly internet subscription with such amount. I hope Adbrite will soon offer a similar option, since my significant earnings comes from Adbrite and I have already lost an Adbrite check. Sucks…

PowWeb Hosting - Only $3.88 per month

Search