Well from my angle our search engine when indexing a website checks every 1k of code indexed, against the other entries in our database, often we get warnings of duplicate content found which may be a group of words in a sentence, that are the same as those found within another webpage from within the same site or from a different website.
This is often descriptions of products or services and is quite normal, some are however blatant copying of code or webpages to try and increase a sites scores or levels of content.
We either remove the entire site or overdide the warning depending on the way that it looks when we view it manually.
Some are the same code in use within seven or more domain names, often owned by the same company or person we delete these entries.
Some are just a group of words that are the same and they are not a problem.
|