Article Spinning Or Article Sinning

Article Spinning is a process that lets online copywriters create variations of existing articles by using semi automated software to rewrite the original. Words, phrases, sentences or entire paragraphs can be substituted to put a new spin on an article hence the term article spinning. Although the software is automated a huge amount of human input is required if you want the finished article to be readable and pass as unique.

Article Spinning

What Is Article Spinning?

The purpose of spinning articles is to create unique content that can be syndicated through article directories like ezinearticles.com and articlesbase.com and thousands of others. By creating and uploading unique articles to each article directory the advocates and developers of article spinning software claim that each article will escape Google’s duplicate content filter and stand a better chance of ranking.

Article Spinning and Host Crowding

Host crowding is a filter that limits the amount of results returned from any website in response to a search query.  In effect only two pages can be returned, these can be grouped together in the search results with the second result slightly indented or they could be pages apart. The problem host crowding presents to authors is that there may already be tens or even hundreds of pages already targeting the term they want to chase.

You can check how many pages there are on ezinearticles.com targeting Article Spinning by using Google’s advanced site: search

“Article Spinning” site:ezinearticles.com

At the time of writing this search produced 8 results, so before any page targeting ‘Article Spinning’ will appear in the Google’s SERP's you need to outrank these pages locally. 8 is not a threatening number but keep in mind that some of these pages may already have inbound links as part of a sustained SEO campaign. Article spinning gives no advantage in this area.

Article Spinning and Duplicate Content

Although Google have publicly declared there is no duplicate content penalty, they do filter overly similar or duplicate content. Content that is deemed overly similar can only be seen if you navigate to the last page of results and click on the link ‘repeat the search with the omitted results included’. While it’s true that article spinners can create articles that are unique, readable to humans and pass Copyscape’s plagiarism test with flying colours, it all becomes academic unless they can sneak under Google's radar too.

Document Normalization

It may seem that search engines are able to read and understand words and decide what pages are relevant for a given search. Well, search engines are contextual and they do base their rankings based on the words that make up pages and the links that point to those pages but they don’t read words in the conventional sense.  

Document Normalization is an essential step in every search engines algorithm; it allows them to look at pages on a level playing field by removing the noise and concentrating on the words that have true meaning. This means that many words are simply ignored, these words are called function words and account for approximately 40% of any article. Although function words are ignored by search engines but are required by human readers and include words like ‘the’, ‘and’ etc.  Function words are necessary if you want human readers to enjoy and understand your writing; this limits article spinning to effectively working with around 60% of the total words on a page.

Content Words and Function Words

In every language, you have two different kinds of word:

  • content words - e.g. car, phone, liberty, celebrity, etc.
  • function words - e.g. and, but, to, the, etc.

Content words hold some kind of meaning; we can visualize a car is or understand the concept of liberty. Function words don’t hold meaning; ask yourself, what is the meaning of ‘the’?  Search engines strip documents of function words in order to focus on words with meaning. It is useful to know this, as it is what a search engine will be doing to the words in every article you write and syndicate.

Search engines employ a list of stop words in order to strip web pages down to a skeleton of content words. This stop list is a list of commonly used words, function words, verbs, prepositions, etc, which it removes from the page and helps the search engine determine what the page is about. This is all part of the Document Normalization process search engines performs upon web pages in order to determine the relevance of each page objectively.

The complete document normalization process search engines perform upon web pages when indexing a document is as follows:

Linearization and Tokenization

Markup tags (html code), punctuation and capitalization are removed from a page, the search engine moves through the page systematically, working from top to bottom and left to right, removing content from tags as it finds it. This action leaves the page as very basic text file containing one continuous block of words.  
Regardless of whether the above paragraph was original or the result of article spinning it would be reduced to:

markup tags html code punctuation and capitalization are removed from a page the search engine moves through the page systematically working from top to bottom and left to right removing content from tags as it finds it this action leaves the page as very basic text file containing one continuous block of words

Filtration and Stemming

The search engine applies a stop list to remove commonly used words from the document. This leaves us with only content words. The remaining content words are then ‘stemmed’. That is to say that the remaining terms are reduced to common word roots (e.g. ‘techno’ for ‘technology’, ‘technologies’, ‘technological’).
And again whether the above paragraph was original or the result of article spinning it would be reduced to:

markup tag html code punctuat capit remov page search engine move page systemat work top bottom left right remov content tag find action leav page basic text file contain continu block word

Just over 40% of the words used in the original text were stop words which is about the norm for any web page or article. In reality this means that a 300 word article is going to be reduced to around 180 words. There is also the target keyphrase to take into account, say for example the article was targeting ‘Article Spinning Software Review’ and that it was used five times in the article. None of these words are stop words so our 300 word document in reality offers only 160 words in which to make it suitably unique in order to pass Google’s duplicate content filter.

This is where the real headache starts for anyone using article spinning software, you can’t replace words with words that share the same root because it won’t make one bit of difference to the way search engines see the page. As I said earlier, search engines don’t read words in the conventional sense, and while it is possible to spin articles that are grammatically correct, readable and pass Copyscape’s plagiarism test, it is a lot harder to fly under Google’s radar.

Improve the online visibility of your small business - contact SEO Scotland Today.

Imagine How Good You'll Feel When Your Small Business Is On The First Page Of Google Contact SEO Scotland

Thank you for your enquiry it is important to us

We will contact you within the next two working days.

Request Your Free Small Business SEO Quote Today
Name
Phone
Email
Website