Post by Kaligraphic » Sun Aug 29, 2004 5:30 am
They usually just find stuff by having someone else link to it. Google also incorporates the Open Directory Project (dmoz) listings, and will quickly pick up links from there. As for the Google cache, Google checks the contents of pages on a monthly cycle, essentially re-scanning the web each month. If your site ranks low in Google's priorities, a given month's crawl can skip it in favor of other sites, leaving the Google cache several months out of date. As for the threads, Google only caches pages it reaches by following links, so the threads were probably linked somewhere at the time of a previous crawl, and came up because of the text in them. (Google tries to parse any text on a page, and has a sometimes-working relevancy algorithm - I still get too many results that end up being search engines, though.)
Basically, get into dmoz or get a link from somewhere popular, and you'll get into Google's listings.
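If it helps to picture the "found by links" part, here's a rough sketch in Python. It's a toy, not how Google's crawler actually works: the link graph, the example hostnames, and the `discover` function are all made up for illustration. It just walks outward from a starting page and collects whatever it can reach by following links.

```python
from collections import deque

def discover(link_graph, seeds):
    """Return the pages reachable from the seed pages by following links,
    in the order a breadth-first crawl would first see them.
    (Hypothetical helper, not any real crawler's API.)"""
    seen = set(seeds)
    queue = deque(seeds)
    order = []
    while queue:
        page = queue.popleft()
        order.append(page)
        # Follow every outgoing link we have not already queued.
        for target in link_graph.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return order

# Made-up link graph: each page maps to the pages it links to.
link_graph = {
    "dmoz.example/listing": ["popular.example", "mysite.example"],
    "popular.example": ["mysite.example/forum"],
    "mysite.example": ["mysite.example/forum"],
    "mysite.example/forum": ["mysite.example/forum/thread1"],
    "orphan.example": ["mysite.example"],  # links out, but nothing links to it
}

found = discover(link_graph, ["dmoz.example/listing"])
print(found)
print("orphan.example" in found)  # False: no inbound link, so never discovered
```

The forum thread gets picked up because there's a chain of links leading to it from the directory listing, while the orphan page never shows up even though it links out to other sites.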