Is My Website Duplicate Content? Here’s How To Find Out

While working to stay on top of best-practice SEO strategies, we have found that Google is seriously cracking down on duplicate content. When Google catches what it considers duplicate content, it simply filters that content out, acting as if it does not exist. So just what is considered duplicate content?

Well, unfortunately, my name isn’t Matt Cutts, and I don’t work for Google. So until Google releases a strict set of guidelines on how it crawls, what it looks for, and what triggers its algorithm’s duplicate content filters, we are going to have to rely on other methods.

TriMark Solutions has found a very useful website for examining duplicate content, and wouldn’t you know it, it’s called http://duplicatecontent.net.

So how does it work? Simply enter a pair of URLs, and the tool will almost immediately return results for:
• HTML Fingerprinting
• HTML Distribution Value
• Total HTML Similarity
• Standard Text Similarity
• Smart Text Similarity
• Total Text Similarity

All values are returned as percentages, and the higher the percentage, the more similar the content. The site even has a nice little FAQ section to help you understand the results a bit better. It also includes a link for adding the tool to your own website (not too self-promoting, eh?).
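If you are curious how a text similarity percentage might be computed, here is a rough Python sketch. To be clear, this is not how duplicatecontent.net calculates its scores (the site's methods are not described here); it is just one simple approach using Python's standard `difflib` module. The function names `visible_words` and `text_similarity_percent` are made up for this illustration.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def visible_words(html):
    """Return the page's visible text as a list of lowercase words."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks).lower().split()


def text_similarity_percent(html_a, html_b):
    """Hypothetical helper: word-level similarity of two pages, 0 to 100."""
    words_a = visible_words(html_a)
    words_b = visible_words(html_b)
    # ratio() is 2 * matched_words / total_words across both pages.
    ratio = SequenceMatcher(None, words_a, words_b, autojunk=False).ratio()
    return round(ratio * 100, 1)


if __name__ == "__main__":
    page = "<html><body><h1>Welcome</h1><p>We sell widgets.</p></body></html>"
    print(text_similarity_percent(page, page))  # identical pages: 100.0
```

A score like this only looks at the visible words, so two pages with the same copy but different markup would still come back as identical. That is presumably why the tool reports HTML and text similarity separately.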

To test the accuracy, I entered two of our landing pages that still had the same content and layout, and sure enough, every field came back 100%. Our two sites were completely identical except for their domain names.

This tool is useful because it puts a number on duplicate content, and it works in real time, which means you can keep making changes to your website until you’ve lowered your duplicate content value to a percentage you feel comfortable with.

Keep checking in with the TriMark Solutions Blog for more useful SEO tips and tools.
