I have been doing a little research on Google’s duplicate content filters.
Since I signed up for a private label content site I have been wondering if I am spinning my wheels or not?
If you’re trying to make some money with a blog, or your looking into joining a private label site, then you need to know something about duplicate content.
I have posted several things from the private label site, with some mixed results.
Of course one of the problems in trying to figure out anything about the search engines is the scale you must test something to figure out if it really works or not.
For example, I have seen duplicate web pages that have been indexed thousands of times, some tens of thousands then all of them were eliminated except 50 or 60.
When you see thousands of pages removed from Google then you can safely say that they have been targeted, and it’s a reasonable assumption that they were removed because of the duplicate content filter Google has.
The thing I didn’t know is how much you would have to change a page in order for it to not be included in the duplicate content filter?
I found the answer to this question in the patent that Google filed. Google says that they take many finger prints from each page, then they compare all those finger prints and if two of the finger prints match another page then they consider it to be a near duplicate and apply the filter to it.
So once a page reached some number of results in Google then it starts getting filtered… Here is the bad part for guys like me who don’t have a big site. The pages that get filtered are the pages with the lowest page rank. This means that my pages will always be the first to go since my page rank is lower than almost everyone else’s.
This can be a problem if a popular site even quotes you. If the 2 finger prints are considered to come from the quote then your page will be removed from Google for being duplicate content.
So my previous theory about using private label content and just changing a few of the words it pretty much a waste of time. Just do a straight copy and paste, or rewrite it so much that none of the sentences can be recognized. Because if two of the sentences can be spotted as being the same then it may be filtered.
No comments:
Post a Comment