Using SharkbaitSharkbait requires a server with PHP 4 or better. You will also need access to an email service that will let you create new addresses on-the-fly. For this I recommend SpamGourmet.
There are several things to consider when setting up Sharkbait. First, you don't want it to be immediately obvious that you're using Sharkbait. If the spammers catch on, their bots will quickly know to throw out pages that look like they contain sharkbait. To make this possible, you must take several steps:
- Rename each and every file in the Sharkbait distribution. For sharkbait.inc.php you can simply move it above your webroot, but the rest will need to be renamed. If the bots can determine if you're using Sharkbait by the presence of a file, then your setup won't be doing any good.
- By the same token, the content of your sharkbait-producing page should be very different from the default implementation in email.php. In fact, if you have the time I would encourage you to create a page in the style of your website, copying the header, footer, and related things. If the harvester bots can't tell the difference between a good address and a bad address, we've achieved our goal.
When setting up Sharkbait, your first impression will likely be to place hidden sharkbait at the bottom of every page of your site. This is not a good idea. Legitimate bots, such as GoogleBot, will then be generating sharkbait, and storing them in a cache for the rest of the world to use. Then when we got spam, the IP address in the sharkbait would link back to GoogleBot. Not a good situation. Instead, you should put sharkbait on pages specifically written to display sharkbait, and place a rule in your robots.txt file excluding all bots from viewing the page. The good bots will honor this rule, and the really nasty harvester bots will ignore it.
After you set up your sharkbait-generating page, you obviously need to link to it on your site. However, you don't want the link visible to most users. I have several ideas for the link, all of which involve the use of CSS. Harvester bots generally ignore CSS, which makes our job a lot easier. Here are some options:
- Wrap the link in a hidden element. There are two ways to do this:
- In the element directly:
<span style="display:none;"><a href="bait.php">spam me</a></span>. obviously you want to replace "bait.php" and "spam me" with your bait page name, and different link text.
- Indirectly, via your site's normal stylesheet.
- In the element directly:
- Obscure the link by placing it in a <div> that is behind another <div>.
Configuration VariablesIn sharkbait.inc.php there are several variables that you can configure. Here's a list of them and what they do:
TRAP_UNIQUE_KEYSpecifies the encryption key that will be used to scramble and descramble generated sharkbait. Try to pick something at least 10 characters, and try not to make it obvious. For example, Using your website's name is a very bad idea.
It may not be immediately obvious why you need a key, but remember that Sharkbait is open-source, and therefore the format of sharkbait can be found easily. Spammers could use this to generate sharkbait with someone else's IP address, and implicate them as a spammer. But if generating "working" sharkbait requires a key, it will be very difficult to forge sharkbait with a specific IP address and time.
For this reason, you should never, ever give out your site key.
TRAP_ALLOW_ALL_DECODEThis controls whether or not the provided decode.php file will use your site key when sharkbait is decoded without providing a key. Note that your key will not be disclosed to them, but your key will be used to decode the sharkbait they provide.
I would recommend that you leave this on. If you file a spam complaint on some sharkbait that got used, the ISP will likely want to know how the IP address and time of the retrieval are contained in the address. You can provide the URL of your decoder as proof.
However, I would not recommend making your decoder URL public -- it could be used to brute-force your key (although that would take quite a while). This is another good reason to rename the decoder.
TRAP_ALLOW_NONALPHAThis will allow non-alphanumeric characters (specifically, "-" and "_") to appear in sharkbait. While such characters are valid in an email address, harvester bots don't pay attention to RFC's. Occasionally, two non-alphanumeric characters will appear side-by-side, or worse, at the beginning of some sharkbait. Some harvester bots will ignore these addresses, and of course we don't want them to.
TRAP_RECURSEIf this is on, the default email.php script provided with Sharkbait will display a link to itself, with a slightly different URL each time, providing harvester bots with a whole barrel of sharkbait. However, many harvester bots are wise to this trick, and will ignore such pages. At the same time, you'll be burning a lot of bandwidth.
Note that if you write your own version of the front-end script, you can ignore this variable entirely, and then its setting doesn't really matter.
TRAP_PATTERNSpecifies the format that the sharkbait should appear in. This is also used by the sample front-end script, but I would recommend that you use its value instead of hard-coding the pattern that you want. The value should be a string containing exactly one "%s", which will be replaced with the generated sharkbait. So if you use SpamGourmet, you might use "%email@example.com".
- Rename everything.
- Delete email.php and write your own. (Or you can use email.php as a guide to write your own, but then delete it.)
- If you're using SpamGourmet, it'd be a good idea to pick a domain other than spamgourmet.com -- or better yet, set up your own domain. See the SpamGourmet FAQ for details on setting up a new SpamGourmet domain.
- Unless the rest of your site is written in PHP, it'd be a good idea to give
the sharkbait-generating page a different extension. If you're using Apache,
you can rename it to a .html file and add the following to an .htaccess file,
replacing bait.html with the name of the page:
<Files bait.html> SetHandler application/x-httpd-php </Files>
- Make sure your server's clock is accurate so the times embedded in the sharkbait will also be accurate.