I see a lot of forum posts from people asking how to write a robots.txt file for their WordPress site because they don’t want to rely on a plugin to generate it for them, even though there are some really good ones out there. So I thought I would share what I use as the base for any WordPress site I build or work on. I wrote it by hand, based on a lot of surfing and reading forum posts as well as the advice given in the WordPress Codex.
User-agent: *
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /trackback
Disallow: /feed
Disallow: /comments
Disallow: /category/*/*
Disallow: */trackback
Disallow: */feed
Disallow: */comments
Disallow: /*?*
Disallow: /*?
Allow: /wp-content/uploads

# Google Image
User-agent: Googlebot-Image
Disallow:
Allow: /*

# Google AdSense
User-agent: Mediapartners-Google*
Disallow:
Allow: /*

# digg mirror
User-agent: duggmirror
Disallow: /

# Sitemap - correct path to your site's sitemap file
Sitemap: http://www.example.com/sitemap.xml
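
If you want to sanity-check a file like this before uploading it, here is a rough sketch using Python's standard-library urllib.robotparser. It only covers a subset of the rules above: as far as I know, that parser does plain prefix matching and does not understand the * wildcards some of the lines rely on, so I left those out. The "SomeBot" user agent, the test paths, and the allowed() helper are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# A prefix-only subset of the rules above. Wildcard lines such as
# "Disallow: */feed" are omitted on purpose (assumption: the stdlib
# parser treats "*" in a path literally, not as a wildcard).
RULES = """\
User-agent: *
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /feed
Allow: /wp-content/uploads

User-agent: duggmirror
Disallow: /
"""

SITE = "http://www.example.com"  # placeholder domain from the sitemap line

# Feed the rules to the parser line by line instead of fetching a URL.
parser = RobotFileParser()
parser.parse(RULES.splitlines())


def allowed(agent, path):
    """Hypothetical helper: may `agent` fetch `path` on SITE?"""
    return parser.can_fetch(agent, SITE + path)


if __name__ == "__main__":
    # "SomeBot" is an invented crawler name that falls under "User-agent: *".
    for agent, path in [
        ("SomeBot", "/wp-admin/options.php"),
        ("SomeBot", "/wp-content/uploads/photo.jpg"),
        ("SomeBot", "/about/"),
        ("duggmirror", "/about/"),
    ]:
        verdict = "allowed" if allowed(agent, path) else "blocked"
        print(agent, path, verdict)
```

Keep in mind this only tells you how one particular parser reads the file. Real crawlers may interpret wildcards and the order of Allow and Disallow lines differently, so treat it as a quick check rather than proof.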