Friday 1 April 2011

Link Spamming – A Closer Look

Google Blogger’s Spam definition refers to Link spam and links to a section on Wikipedia's Spamdexing page.

On this Wikipedia page, it says, Spamdexing (also known as search spam, search engine spam or web spam)[1] involves a number of methods, such as repeating unrelated phrases, to manipulate the relevance or prominence of resources indexed by a search engine, in a manner inconsistent with the purpose of the indexing system.”

It goes on further to say – “Common spamdexing techniques can be classified into two broad classes: content spam[4] (or term spam) and link spam.[3]

This post is specifically about Link Spam and Wikipedia states –Link spam is defined as links between pages that are present for reasons other than merit.[6] Link spam takes advantage of link-based ranking algorithms, which gives websites higher rankings the more other highly ranked websites link to it. These techniques also aim at influencing other link-based ranking techniques such as the HITS algorithm.”

There are several different types of Link Spam and most don’t require any explanation, they are;
- Link-building software
- Link farms
- Hidden links
- Sybil attack
- Page hijacking
- Buying expired domains
- Cookie stuffing
- Using world-writable pages
- Mirror websites
- URL redirection
- Cloaking

But it also includes Spam Blogs as a type of Link Spam, and this is what it says - Spam blogs,are blogs created solely for commercial promotion and the passage of link authority to target sites. Often these "splogs" are designed in a misleading manner that will give the effect of a legitimate website but upon close inspection will often be written using spinning software or very poorly written and barely readable content. They are similar in nature to link farms.”

And under the Using world-writable pages section, it discusses Spam in blogs, which is the placing or solicitation of links randomly on other sites, placing a desired keyword into the hyperlinked text of the inbound link. Guest books, forums, blogs, and any site that accepts visitors' comments are particular targets and are often victims of drive-by spamming where automated software creates nonsense posts with links that are usually irrelevant and unwanted.”

There does not appear to be anything odd here that would account for the rash of closures and spam classifications for blogs that do not run ads.  There are a great many risks however that are associated with running ads, unless of course they are Google ad-sense sponsored.

There is no question that spam blogs are a problem, and need to be closed, but it is much much bigger problem when Bloggers fuzzy spam algorithms start flagging blogs solely based on their associations, such as Monster Trucks.  When that is combined with a help forum where guilt is assumed, then there is a real concern. 

References are constantly made in the forums about all the righteous spam decrees and how important it is in a civilized society that judicial process is kept honest.  To even suggest that is the case is insulting and offensive.  There is no judicial process, this a behind closed door subjective determination of guilt or innocence without any reason provided why.

WHY ARE THESE BLOGS DECLARED SPAM?!?!  The only commonality for many in the gay community is that they are gay blogs, because they exhibit none of the characteristics defined above.  But this is not just a gay issue, it happens to thousands of non-gay blogs too.  But if none of them could be defined as Spam blogs under the defined definitions, then there must be un-defined definitions, that Blogger has created and will not share.  You constantly hear on the forums that providing more detail as to why a blog is classified as Spam will only provide Spammers with new ways to innovate their spam techniques. COME ON!!  That is just ludicrous! 

So much for judicial process and keeping things honest…

Next post – Content Spamming – A Closer Look

3 comments:

  1. I hate being guilty until proved innocent. That is so medieval. TCs in the Help Forums state that they cannot give details or spammers will use that information to get around it. Well, BLOGGER, if all these innocent blogs are being closed because of suspicion of spam, it doesn't look like you are doing such a great job. Why not give us bloggers more information so we can protect our own blogs. You sure ain't doing such a good job!

    ReplyDelete
  2. I am still trying to figure out whether in Blogger's eyes, SPAM encompasses all TOS violations, I know spamming is a TOS violation, but so is copyright infringement, and hate, and other things. All of which are quite different from spamming. You'd think that the all encompassing reason would be "TOS violation" rather than something relatively specific like spam.

    ReplyDelete
  3. I did notice your Help Forum post on this issue and decided to add my own 2 cents. I was not as controlled as you were when I made my post...must be the second cup of coffee.

    ReplyDelete