SEO Starter Guide

SEO Crash Course

Note: What follows will occasionally contradict what you'll hear in SEO forums, on SEO blogs and from SEO celebrities. Keep in mind that most who talk about SEO have learned it from the same forums, blogs and celebrities, and haven't the slightest idea of how a semantic indexing algorithm works.

  1. The Basics

    • At the end of the day, the only thing that really counts is end-user experience. Deliver quality content, use the likes of Silo widgets and Nav Menu widgets to enhance your site's usability and navigability, and focus on your marketing and conversion rates.
    • Your page's <title> tag is very important. In addition to being important from a semantic standpoint, it appears as the top line of the entry when your site is listed by the search engines. A descriptive title can make the difference between a click and no click in search results and RSS readers.
    • Meta keyword and description fields are mostly useless from an indexing standpoint. Recall that Google's key innovation was to ignore them. If left empty, the meta keywords field is automatically filled using the page's categories and tags, and you are advised to leave it blank as a result. The meta description deserves a special note, however.
    • Recent research suggests that the meta description field is used by Google to describe your page in search result listings when — and only when — a) it is short (140 characters max?) and b) it contains the search phrase. Do not use it to describe your page. Instead, find a snappy teaser that will irresistibly lure those who read it onto your page.
    • Link anchor text counts, as does its context. Picture Google as a gargantuan tagging engine where tags are the link texts, in the context of their neighboring text. And keep in mind that, nowadays, its algorithms are driven more by the need to eliminate spammy sites than by anything else.
    • Links often get discounted, but in the end, inbound links always count — even if in a negligible manner. It's much better to have a link from an authoritative site, however. (Yahoo's directory is authoritative, by the way.)
    • Your posts' and pages' introductions and conclusions also make a difference; don't neglect them, as they'll also enhance your site's readability. Split your content with h2/h3 tags if it's excessively long.
    • The Semiologic and Semiologic Reloaded themes have heading tags, in sidebars for instance, that SEO forum regulars may (wrongly) find erroneous. This is to semantically split your page into distinct sections, so as to semantically insulate your content from your site's cosmetic and navigation elements.
    • New and/or updated content counts, and that is one of the reasons blogs fare well in search engines. While a small update can give a boost to your ranking, a huge update can harm it — in that a page whose content was entirely rewritten gets treated as an entirely new page. Use at least 5 posts per post page. Better yet, stick to WordPress and Semiologic SEO defaults.
    • Your keywords can be noise words just as much as "in", "the", "a", etc. if you abuse their usage. To perceive a black dot on a white page, it needs to contrast with its surroundings. Much like eye perception (or any signal detection for that matter), meaning comes from derivatives, i.e. difference and contrast, rather than mere presence and amplitude.
    • Pick your fights. Uncompetitive keywords are easier to conquer, and these small victories will ultimately give you an edge when fighting the harder battles.
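
The two conditions mentioned above for the meta description (short, and containing the search phrase) can be turned into a quick check before publishing. This is only a sketch: the function name is made up for illustration, and the 140-character limit is the guide's own tentative figure rather than a documented Google rule.

```python
def description_may_be_shown(description: str, search_phrase: str, max_length: int = 140) -> bool:
    """Check the two conditions from the guide: short enough, and contains the phrase.

    max_length defaults to the guide's tentative 140 characters (an assumption,
    not a documented limit).
    """
    text = " ".join(description.split())      # collapse runs of whitespace
    phrase = " ".join(search_phrase.split())
    if not text or not phrase or len(text) > max_length:
        return False
    return phrase.lower() in text.lower()     # case-insensitive match


teaser = "Learn SEO in ten minutes: what counts, what doesn't, and why."
print(description_may_be_shown(teaser, "learn seo"))      # True
print(description_may_be_shown(teaser, "link building"))  # False
```

Passing this check says nothing about whether the teaser is any good; it only tells you the description has a chance of being displayed for that phrase.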
  2. On Duplicate Content

    • Duplicate content issues on third party sites are very real. Fight content theft by either putting your key content on static pages (which don't show up in feeds), or by serving excerpts in your feeds — or both. And don't plagiarize content from third party sites using RSS aggregators.
    • Duplicate content "issues" on your own site are a fallacy. Similar pages get returned as clusters in search results. In other words, they're grouped together in search results, and the highest ranking page in the group gets returned. To ensure that your individual posts and static pages rank high in a given cluster of pages, output titles or excerpts on archive pages.
    • Steer clear of "SEO" plugins that "deal" with duplicate content issues by unindexing pages on your site. You can be sure that their authors have little idea of how a search engine works. Having archive and section pages with high ranking power will help your posts and child pages rank better and faster. Keeping those pages indexed is thus always preferable.
    • In addition to being useless, adding nofollow attributes to internal links is a sure way to ultimately get your site penalized on grounds that you're trying to game search engines. Nofollow was introduced to mark outbound links (usually in comments) that are irrelevant to the post's or page's contents. Use it as such.
  3. On Links

    • Don't give too much attention to the number of links in your pages' cosmetic and navigation areas (i.e. header, sidebar, footer). It is trivial to algorithmically extract a page's contents from its cosmetic and navigation areas. You compare two or three pages on a site; the difference between them will reveal where the real content is located.
    • The links that really count are those in your content, surrounded by context. Add links to your posts and pages within the contents of your posts and pages.
    • The number of outbound links on your pages can have an impact, because spammers have been abusing it. But unless your page starts to feel like a link directory, you've nothing to worry about.
    • The rate at which you gain inbound links can have an impact, because spammers have been abusing it. If it looks like you're comment spamming the web or equivalent, the new links will get discounted. A regular stream of new links is better than a massive, one-off stream of new links.
    • On the topic of link exchanges, page a on site A linking to page b on site B and reciprocally is easily detected and discounted. Page a on site A linking to page b on site B, while page c on site B links to page d on site A, is much harder to detect.
    • Concluding on the last couple of points, better a single link in the content of an authoritative page than a site-wide link in the footer or sidebar of an unauthoritative site.
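
The page comparison described at the top of this section can be sketched in a few lines. This is a deliberately naive illustration, not how any search engine actually does it: pages are assumed to be already reduced to lines of visible text, and the function name is hypothetical.

```python
def extract_unique_lines(page: list[str], other_pages: list[list[str]]) -> list[str]:
    """Keep the lines of `page` that appear on none of the other pages.

    Lines shared across pages (header, menu, footer) are treated as cosmetic
    or navigation elements; whatever is left is taken to be the content.
    """
    seen_elsewhere = set()
    for other in other_pages:
        seen_elsewhere.update(line.strip() for line in other)
    return [
        line for line in page
        if line.strip() and line.strip() not in seen_elsewhere
    ]


page_a = ["My Site", "Home | About", "How to bake bread", "Mix flour and water.", "All rights reserved"]
page_b = ["My Site", "Home | About", "Choosing a bicycle", "Check the frame size.", "All rights reserved"]

print(extract_unique_lines(page_a, [page_b]))
# ['How to bake bread', 'Mix flour and water.']
```

Two pages are enough here because the header, menu and footer are identical on both; comparing against a third page would help when some sidebar items vary from page to page.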
  4. On Pinging And Performance

    • XML sitemaps are useful to the extent that they'll get your site indexed faster. The XML sitemaps specs say that the (optional) link attributes are indications to search engines. They won't have the slightest impact on how well individual pages on your site will get indexed.
    • Performance counts. A huge ping list can have a severe performance impact on your site, and harm your rankings by degrading your server's response time. Stick to using pingomatic, and perhaps a few specialized ping services that relate to your site or region. Or Feedburner's equivalent service. Don't install plugins that offer to fix ping service notifications — they're already throttled in WordPress.
    • Some permalink structures have a negative impact on your site's performance. The worst offenders in this arena are /category/postname/ and /postname/. Avoid those two, and their siblings, like the plague. Structures that start with a date, i.e. the "Day and name" or "Month and name" structures, are just as optimized; they perform well, and they have the added benefit of being the best for usability.
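
For readers who have never seen one, here is a rough sketch of what an XML sitemap contains, built with Python's standard library. The `build_sitemap` helper and the example.com URLs are made up for illustration; the element names (`urlset`, `url`, `loc`, and the optional `lastmod`, `changefreq` and `priority`) come from the sitemaps.org protocol. In line with the point made above, the optional elements are hints only.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries: list[dict]) -> str:
    """Build a minimal sitemap; each entry needs "loc", the rest is optional."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for entry in entries:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = entry["loc"]
        # Optional hints: emitted only when provided.
        for tag in ("lastmod", "changefreq", "priority"):
            if tag in entry:
                ET.SubElement(url, tag).text = str(entry[tag])
    return ET.tostring(urlset, encoding="unicode")


print(build_sitemap([
    {"loc": "https://example.com/"},
    {"loc": "https://example.com/about/", "changefreq": "monthly"},
]))
```

In practice a WordPress plugin would generate this file for you; the sketch is only meant to show how little there is in it.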
  5. On End-User Feedback

    • Search engines are increasingly taking end-user feedback into account. Consider how these potential feedback loops can give indications to Google on how worthwhile your site and its content might be: Search Results (click-through rates, ignore rates, speed of clicks), Gmail (the same and popularity), Google Bookmarks (the same), Google Analytics (the same and visit duration, visitor loyalty), the Google Bar (the same), Feedburner (the same), AdSense (the same).
    • Until 1995, people would start by asking their contacts when they searched for information. Now, consider how search has evolved in the past years, and where it's heading. It'll give pre-eminence to your contacts' bookmarks, because your contacts' opinions count more, to you, than the opinions of people you don't know. Keep this in mind while you market your site.
  6. Concluding Notes

    • Always keep in mind that there is a conflict of interest in a search engine's business model. Google's best interest is to return "relevant enough" results in a mostly random order, in order to sell more ads to those who depend on a consistent stream of search engine traffic.
    • As a rule, view your search engine traffic as a bonus, and focus on alternative sources of traffic, such as mailing lists, affiliate networks, social networks, eBay, classifieds, word-of-mouth (online and offline forums), etc.
    • The only thing that really counts at the end of the day is end-user experience. Deliver quality content, use the likes of Silo widgets and Nav Menu widgets to enhance your site's usability and navigability, and focus on your marketing and conversion rates.
    • Special Note: If you're seeking to optimize Ad revenue, don't make your site "too" navigable: at the end of your page, you want users to click an ad to another site; not a link to a related page on your site.

Other Resources

Google produces a very good beginner's guide to search engine optimization (SEO). It was originally an online guide but is now distributed as a PDF. The link below will take you straight to the PDF download.

Search Engine Optimization Starter Guide!