How Google penalises sites with too many of the same URL – Tested!
Last week I got an insight into how Google penalties work if you use a URL too many times in a blog entry.
In my recent article, I covered how scammers target Sedo users.
The article was included in the Google index within the hour, as it usually is for my blog, and for the following three days I had 80-100 daily unique users reach it through Google.
Then on the fourth day - all traffic to the page from Google stopped. Nothing. Nada.
After a quick investigation, I found that that particular page was no longer included in the Google index. The rest of my site was unaffected.
I looked at it in more detail and theorised that the cause was the correspondence with the scammer, which I had quoted and which repeatedly included his email address ("murphy@eliteinvestment.net"). Google must have decided that this was a spam message and excluded it from its index, probably because it ignored the "@" sign and treated the eliteinvestment.net part as a URL, thus viewing the same URL as being repeated many times over. The other option is that it doesn’t like too many repeats of the same email address, although I like my first theory better.
I decided to test my theory, and reduced the number of references to the company's URL/email domain (eliteinvestment.net) from ten to only three. I then updated my sitemap and pinged Google to re-crawl my site.
Sure enough, a week later my article was re-indexed, and it is getting traffic again. An insight into the mind of the (fluffy) beast.
It also shows that my pages were first ingested and indexed, and only a few days later the penalty was applied.
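If you want to check a draft for this kind of repetition before you publish, here is a minimal Python sketch. It counts domain-like strings whether they sit in a URL or after the "@" of an email address, which is how I suspect Google read my page. The function name and the limit of three are my own assumptions, taken from my experiment above. They are not a threshold that Google has published.

```python
import re
from collections import Counter

# Matches domain-like strings such as "eliteinvestment.net", whether they
# appear inside a URL or after the "@" of an email address.
DOMAIN_RE = re.compile(r"(?:[a-z0-9-]+\.)+[a-z]{2,}", re.IGNORECASE)


def repeated_domains(text, limit=3):
    """Return the domains that appear more than `limit` times in `text`.

    `limit` is an assumption based on one experiment, not a known Google rule.
    """
    counts = Counter()
    for match in DOMAIN_RE.findall(text):
        domain = match.lower()
        # Treat "www.example.com" and "example.com" as the same domain.
        if domain.startswith("www."):
            domain = domain[4:]
        counts[domain] += 1
    return {domain: n for domain, n in counts.items() if n > limit}
```

Note that the pattern is deliberately loose, so it will also pick up file names such as sitemap.xml. Treat the output as a hint about what to look at, not a verdict.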
How to Write for Google – SEO Article #2

This is a continuation from: How do I get my site into Google? - SEO Article #1
SEO Article #2: “How to Write for Google“ (and some stuff about toads).
I'll start with my usual caveat: you should write content that people would like to read, or as Google puts it: 'Always focus on the users and not on search engines'. Even if you are one of the scum-of-the-earth spammers who create pages just to trap innocent people (who searched for Niagara, and you gave them Viagra instead) - you should entertain the notion that they're not going to buy your overpriced-counterfeit-drugs-that'll-probably-kill-them, unless you actually give them some information that they would like to read, or that is useful to them -- and neither will Google (include your site, that is). Remember also that content is king and the better your content - the better you will rank.
But there are other things you could do to improve your positioning. You will be better noticed on Google if you do two things:
1. Get links from other websites to yours - Google treats every link to your site like a 'vote' of confidence. Not only that, but if the websites that link to you have many other sites voting for them, then they have a higher ranking, and therefore you have a higher ranking. This ranking is referred to by Google as PageRank. And you can see a site's PageRank in the Google Toolbar, if you have one installed (Read more about PageRank here and here).
IMPORTANT NOTE ABOUT PAGERANK: when you first start publicising your website, don't worry too much about PageRank. It can take months until Google calculates your PageRanks, and by then, hopefully, you would have worked on getting lots of links from people who love your excellent site. This site, for example, was launched in May 08, and (at the time of writing) has still not been pageranked. It doesn't stop it from appearing at the top of Google search results many times over, and that all has to do with its content, and how it is presented (See below).
2. Write well for Google. No one outside of Google HQ knows exactly what formulae Google uses to assess whether your content is good. It is safe to assume that it tries to weed out spam, and that it looks for signs that your page grammar indicates a proper language article, as opposed to just a succession of words. But there are things that will help your writing appear higher in Google rankings. I say this from my own experience of getting to the very top of search results, which is possible even if your site is new and your PageRank is zero:
- Go niche - if no other site on the web uses the word combination "What are hulk frogs?" and your site does, then when there is a sudden interest in hulk frogs, your site will appear at the top of search results. It's as simple as that. If, on the other hand, you try to write about "The Movie Hulk", you will be competing with millions of other hulk sites, and are much less likely to reach top position. Once you've digested the consequences of this effect, you will realise that if you cover a niche area, use niche expressions, or tackle niche questions and topics, you are much more likely to get noticed. Of course, if your website is all about Britney Spears (you
- Consider word density - word density refers to the number of times a word or expression appears in an article. If you write about cane toad feeding habits and you repeat the term "cane toad feeding habits" and the expression "cane toad" many times, Google will conclude that your article is about cane toads and their feeding. If your article has more of these words than another article written by someone else, Google may well conclude that yours is more relevant to the search term "cane toad feeding", and place your site above the other. (See what I did there?).
Of course things aren't as straightforward as a count of the number of words, and your search-results position depends on other factors as well, but this is a very useful method, and works well for me. I should probably also warn you (again) not to try and trick Google here. Use a word or expression too much, and you might be under suspicion -- and Google will penalise your site or ban it as spam. My advice thus is: bear in mind the words that people will search for to reach your kind of content. Then use them often, and use them in expressions that are likely to appear in searches.
- Consider word weight and importance - Google gives more weight to elements on your page that are enclosed in heading tags such as h1, h2 etc. By using these tags you are saying: this text carries more weight. In the same way, bold and italics can signal that a word, expression or sentence is important.
- Consider word positioning - The closer to the top the more important. If you start a paragraph, the closer a word is to the start of the paragraph the more weight it has, etc. It is better to say “The feeding habits of the cane toad – is today’s topic” rather than “today’s topic is the feeding habits of the cane toad.”
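To put a number on the word density point above, here is a small Python sketch. It measures what share of an article's words is taken up by a given expression. This is just one common way of defining density and the function name is mine. Nobody outside Google knows how (or whether) Google computes anything like it.

```python
import re


def phrase_density(text, phrase):
    """Return the share of words in `text` that belong to `phrase`.

    A value of 0.05 means the expression accounts for 5% of all words.
    This definition is an assumption, not Google's formula.
    """
    words = re.findall(r"[a-z0-9']+", text.lower())
    target = re.findall(r"[a-z0-9']+", phrase.lower())
    if not words or not target:
        return 0.0
    size = len(target)
    # Slide a window over the article and count exact matches of the phrase.
    hits = sum(
        1
        for i in range(len(words) - size + 1)
        if words[i:i + size] == target
    )
    return hits * size / len(words)
```

Run it over a draft with the expression you want to be found for, for example phrase_density(article_text, "cane toad"), and compare the result between drafts. If the number climbs very high, that is your cue that you may be overdoing it.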
I know it is very difficult to bear all these things in mind when writing, but after a while they become second nature. In some articles you pay them more attention, because you want to hit your niche harder, and in some you don't, because you are writing for volume, or for fun.
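As for the link 'votes' from point 1, the idea that a vote counts for more when it comes from a site that has many votes itself can be sketched in a few lines of Python. This is the simplified, textbook version of PageRank, with the conventional damping value of 0.85. It is certainly not what Google runs in production, and the page names in the usage note are made up.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank: `links` maps each page to the pages it links to."""
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    count = len(pages)
    # Start with every page holding an equal share of the total rank.
    rank = {page: 1.0 / count for page in pages}
    for _ in range(iterations):
        new_rank = {page: (1.0 - damping) / count for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its vote equally between the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its vote over all pages.
                share = damping * rank[page] / count
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank
```

Try pagerank({"a": ["c"], "b": ["c"], "c": ["a"]}). Page "c" comes out on top because two pages vote for it, and "a" beats "b" because its single vote comes from the highly ranked "c".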
It’s worth noting that this article may prove its own point in rather an unfortunate way, by attracting a lot of zoologists in search of the feeding habits of the cane toad. To them, I apologise. It’s just the way search engines work.
There are further writing tips for Google, and I might get back to them in a later article. For now though, thanks for reading.
UPDATE: To prove that I wasn't just talking nonsense, search Google for the keywords 'feeding habits kane toad' by clicking here. Now re-read the article and you will see why this is.
Do come back to ThatDanny.com for the next article in the series (or subscribe to this blog to get notified when it is published).
SEO articles in this series:
How do I get my site into Google? - SEO Article #1
How to Write for Google - SEO Article #2
How do I get my site into Microsoft Live Search? - SEO Article #3
What’s a “NO FOLLOW” tag?
How do I get my site into Google? – SEO Article #1

Getting your site into Google is the easy part; getting it to appear in relevant Google searches is quite another. But let’s start with how you get your site into the world’s most popular search engine.
SEO Article #1: "How do I get my site into Google?".
Step one: create your website before you submit it, and before you promote it in any way. Make sure you follow these important rules:
- a. Your site should include content that people would actually want to read. If people find your site engaging and informative, they will link to it. If they link to it, you stand a better chance to rank high in Google search results (more about rankings in a future article). If you only follow one piece of my advice – the above should be it. As Google itself advises: "Always focus on the users and not on search engines." This is very true when you create content, although there are many ways to get even better exposure. In this series I will cover the things you need to know, to get you ahead of the pack.
- b. When you create your engaging content, make sure it includes a lot of text. Google loves text, and is much more likely to include your site if it is text-rich. But refer back to point 'a' above. Create text-rich content that is interesting, not just text for the sake of text. Content is king. Good content is the emperor.
- c. Make sure you know how to write for Google. This involves using the right keywords in your content, putting them in the right places and in the right way.
- d. Don’t spam or use any dirty tricks. There are a lot of dirty tricks that people use to try and get higher Google rankings for their websites. Sometimes they make it to the top, but more often their site will get penalised by Google or even blocked as spam. And there are legitimate ways of getting ahead of the crowd, so don't sweat it. Do it properly and the rest will follow.
- e. Unless you absolutely know what you're doing, avoid the following: frames, Macromedia Flash, iframes, content inserted by JavaScript and image maps. All the above may be difficult for Google to read when it checks your site. It uses an automated indexing system (known as "Googlebot") to read your site and rate its content, and if it can’t – you’re stuffed.
So you've created your site, it abounds with great text content and is appealing to visitors. Now you are ready to start what is referred to as SEO or SEM.
What is SEO? SEO = Search Engine Optimisation.
What is SEM? SEM = Search Engine Marketing.
---------------------------------------------------------------------------------
Step two - Let Google know that your site exists:
- a. Submit a sitemap - now is the time to finally tell Google about your site. One of the most effective ways of doing this is by submitting a sitemap.
------------------------------------------------------------------------------------------------------------------
What is a Sitemap? A sitemap is a file that tells Google about your website's structure, what pages you have on your site and some further useful information. You can see what a Google sitemap looks like by clicking on the "sitemap" link at the bottom of this page.
------------------------------------------------------------------------------------------------------------------
The easiest way to create a sitemap is by using this free website. You need to save this file to your hosting space using the name "sitemap.xml" (without the quotes).*
*This tutorial assumes that you know how to upload a file to your website. If not, look for instructions from your web hosting company on how to do this.
- b. Got a sitemap? Great stuff! Now go to the Google Webmasters website and sign up. If you already have a Gmail account you don't even need to register, just use the same details. Once registered, follow the prompts to tell Google about your new site (you have to enter your site's URL and follow the verification process, which is explained very clearly there).
- c. Now that you're in, and you've proved to Google that you are the rightful owner of your site, click on the "Sitemap" link on the left-hand navigation bar, and choose to "add a sitemap". Follow the instructions, and voila, you can sit back smugly. Google knows about your site.
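If you would rather build the sitemap from step a yourself than use a generator, here is a minimal Python sketch. It writes the bare minimum that the sitemaps.org format asks for: a urlset element with one url/loc entry per page. The function name and the example.com addresses are placeholders of mine. Optional fields such as lastmod are left out to keep it short.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(page_urls):
    """Return the text of a minimal sitemap listing the given page URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for page_url in page_urls:
        url = ET.SubElement(urlset, "url")
        # <loc> holds the full address of one page on the site.
        ET.SubElement(url, "loc").text = page_url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body
```

Call build_sitemap(["https://example.com/", "https://example.com/about"]) with your own page addresses, save the result as sitemap.xml using UTF-8 encoding, and upload it to your hosting space as described in step a.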
NOW WHAT? If you followed all the advice in this article, and your site has good content as described above, it will be included in Google within a few days. As you'll realise very quickly though, this is just the start of your journey. You've created a website and you want the world to see it, not for it to languish in search results page number 33. You want it to rank high and appear in Google searches. You want it to be visible. We’ll cover the next steps to achieve this in SEO article #2.
Come back to ThatDanny.com for the next article in this series (or subscribe to this blog to get notified when it is published).
suriphobia – fear of mice. If you have it you are a suriphobe (why on earth do people in Washington want to know that?)
Suriphobe - someone suffering from Suriphobia.
Suriphobia - Fear of mice and rats is one of the most common specific phobias. It is sometimes referred to as musophobia (from Latin mus for "mouse") or murophobia (a coinage from the taxonomic adjective "murine" for the Muridae family that encompasses mice and rats). Suriphobia, from the French souris, meaning mouse. (Wikipedia)
And Suriphobia does not, of course, refer to fear of Suri Cruise - daughter of Tom Cruise and Katie Holmes.
Suriphobe and Suriphobia have been among the top searched-for terms on the Internet today. If you are from Washington or Arlington (where most of those searches came from), could you please leave a comment here and explain yourself? I will update as and when I learn more.
The sad truth about Olivia Dukakis, also known as Olivia Ducacus – The Internet Ghost
What is an Internet Ghost? A term that everyone is searching for but that doesn't actually exist. (definition by ThatDanny.com)
This posting is about an Internet Ghost called Olivia Dukakis, aka Olivia Ducacus.
It is likely that you got here because you searched for Olivia Dukakis or Olivia Ducacus. But who you were actually after is Olympia Dukakis, the Oscar-winning actress who starred in many films, including Steel Magnolias, and in many TV productions, including as Anna Madrigal in Armistead Maupin's Tales of the City.

Olympia Dukakis is one of the most misspelled names in Net searches, and as a result many sites have been set up with spam, porn and nasty trojan viruses, intended to trap those searching for her misspelled names.
Consider this a safety net, to warn you before you get there:
Stop now! Go to the Olympia Dukakis IMDB page: here.
PS. and if you happen to be someone who is "really" called Olivia Dukakis, do let me know, and I'll add a side note about you...
Olympia Dukakis (the real one):
Oh dear, Google Trends #1 Hot Search today is “what to do if inside of girl gets wet”
Apologies in advance for this post. It is presented here for scientific reasons only...
And I kid you not. Here is the screenshot:
There seem to have been some problems with Google Trends over the last 24 hours with the content not updating for a while, and then this one rose to the top, circumventing any sanitation filters.
According to the Google Trends "About" page:
"Hot Trends reflects what people are searching for on Google today. Rather than showing the most popular searches overall, which would always be generic terms like "weather," Hot Trends highlights searches that have sudden surges in popularity. Our algorithm analyzes millions of web searches performed on Google and displays those searches that deviate the most from their historic traffic pattern. The algorithm also filters out spam and removes inappropriate material." (The emphasis on the last bit is mine-DD).
You may wonder how this all came about, and like a lot of things it started, innocently enough, when someone who was obviously in the midst of a barbeque, ran the following search: "what to do if inside of GRILL gets wet." Google offered an alternative search, replacing "grill" with "girl":

The guy obviously thought it all hilarious, and hurried to post it on Reddit, where it made the front page. The rest, as they say, is history...
To be fair to Google Trends, the terms "wet" and "girl" do not necessarily stand out as filth on their own. But it is still late afternoon in California, and you would have thought someone would notice. And while we're on the subject, the copyright notice at the bottom of the page is a bit out of date, and still reads "2007".
Update: I dropped Google Trends an email at 22.57 UK time, and within ten minutes I got a thank you back, and they removed the entry. I also noted to them the wrong copyright date. I should get a Google award, or something.
Mozilla brand is strong! Mozilo’s PR is whoops! (Did I really click “Reply All”?)
If you got here through a search engine in the past few hours, you are very likely to have typed "Realty Check: Who sparked Countrywide CEO Mozilla’s “disgusting” e-mail reply?"
Yep, this is one of those geeky entries, but it does suggest that search is a great reflection of social trends. Here's how:
When the chairman of US lender Countrywide Financial, Angelo Mozilo, ignited an online furore in the US on Tuesday by describing a mortgage customer's plea for help as a "disgusting" example of form letters inundating his company, one unexpected effect was a testament to the growing popularity of the Mozilla brand, home to the Firefox browser.
Instead of searching for Mozilo, most US Google searches on this topic included the terms "ceo mozilla disgusting email".
This may be an indication of the growing popularity of the Firefox browser as an alternative to Internet Explorer, and it is clear that brand recognition of Mozilla, even if passive, is fairly high. The confusion is obviously not helped by the spell checker on Microsoft applications suggesting Mozilla for Mozilo, every time...
Good for Mozilla, I say. Maybe also good for Mozilo (they found a browser instead of him).
Geeky entry ends --
------------------------------------------------------------------------------------------
For more on this story:
=> Countrywide CEO Mozilo criticizes customer e-mail - on CNBC.
=> Views on PR and Mozilo in light of this story on Steve Cody's RepMan's Blog.
