How would you like to quickly check for duplicate content on your site? If so, then have a look at Siteliner.
Siteliner is a web crawler. Although it’s billed as a tool that checks for duplicate content, it will also find other issues on your website, such as broken links.
Bonus: the tool is brought to you by the same folks who created Copyscape. So it’s pretty good at tracking duplicate content.
In this guide, I’ll go over Siteliner so you can decide if it’s best for your business.
Siteliner runs in the cloud. That means you don’t have to download anything to get it up and running.
Other web crawlers, like Screaming Frog require a software download and install.
All you need to do to get the ball rolling is head over to the Siteliner website and plug your home page URL into the field in the middle of the screen. Then, click Go.
Siteliner will crawl all over your site looking for duplicate content. It will take a few minutes, depending on the number of pages.
Keep in mind: if you’re using the free version of Siteliner, you’re limited to a maximum of 250 pages. If you want to crawl more than that, you’ll have to upgrade to the premium service (read: $$).
Also, you can only crawl a single domain once per month with the free version. With the premium service, you can crawl as often as you want.
When Siteliner is finished crawling your site, you’ll get a report with the following info:
There’s a lot to unpack in each part of the report so let’s break them all down in detail.
Page data not only shows you key info about the pages that Siteliner scanned, it also gives you a metric for each page called Page Power.
Page Power shows you how prominently the page might appear to search engines that crawl your site.
Why would some pages appear more prominent than others? Because they have more links pointing to them than other pages on the site.
In other words, Siteliner looks at internal linking.
You can use the Page Power data to remove pages that aren’t relevant any longer. Alternatively, you might want to update them with more timely info.
The Page Power metric can also give you actionable insight on good internal linking strategies. Remember, you should link to the pages that are most important to your brand.
This is where Siteliner highlights identical content found on two or more pages on your site.
Note: Siteliner distinguishes a URL that ends with a slash (“/”) as different than the exact same URL that doesn’t end with a slash.
So the report might show you that you have 100% duplicate content on two URLs but they’re really the same URL. One ends with a slash and the other one doesn’t.
The solution to that is to set a rel=canonical on one of the pages. Siteliner reads that meta tag and won’t flag the content as duplicate.
Further, the tool lists the URLs it scanned in a table. Just click on any row in the table to see the duplicate content for a specific page.
Oh, by the way: when you look at the duplicate content section of the report, you might see some of the same terminology you’ve seen when using Copyscape.
That’s for a very good reason. Remember, Siteliner is produced by the same people who made Copyscape.
It’s easy to make changes on your site and forget to update links accordingly. Fortunately, Siteliner will find broken links that could hurt your SEO efforts.
If the tool identifies a broken link, either create a page that redirects from that link to the right place or update the broken link on the appropriate page.
Siteliner doesn’t scan everything. That’s for a very good reason.
For example, if you’re using the aforementioned rel=canonical on your site, the tool will just look at the target in the meta tag.
The whole point of the rel=canonical tag, after all, is to ensure that certain pages don’t get flagged for duplicate content. But they also don’t get indexed.
Also, if there’s a meta tag redirection, Siteliner will follow the redirection and skip the underlying page.
The tool also skips pages in the event of a HTTP header redirection, a frame redirection, or if it’s prohibited from crawling the page by robots.txt.
In any case, you’ll find a list of skipped pages on the Siteliner report.
If you’ve got subdomains, Siteliner will find those and identify them.
However, it won’t crawl them.
If you’d like the tool to crawl those domains, just click on the appropriate row. Siteliner will set up a new scan for that subdomain.
Keep in mind: the www prefix is technically considered a subdomain. It should produce the exact same results as the URL without the www prefix, but it’s worth your while to check.
Yes, you can download the Siteliner report. As of now, it’s only available in CSV format.
To download the report, just click on the download link on the left-hand side of the report.
You may also choose to download only a specific table on the screen. Once again, that file is available only in CSV format.
Finally, Siteliner produces an XML Sitemap for you. You can upload that to your site and use it as your sitemap for search engines.
So how much does Siteliner cost if you decide to go premium?
That’s an easy question to answer. Each page only costs 1 penny.
So if you have 1,000 pages on your site and you want them all scanned, that’s going to cost you $10.
You can buy premium credits via credit card or PayPal.
Siteliner is a nifty little tool that will help you identify duplicate content on your site. As a bonus, it will find broken links and give you insights about pages that stand out the most.
If you haven’t yet taken Siteliner for a free test drive, why not do so today?
How would you like to streamline your content marketing? If so, then check out ContentStudio. Simply put, ContentStudio is a comprehensive content discovery and publication
What Does Martech Mean? If you’ve been in digital marketing for any length of time, you’ve probably heard the word “martech.” But what does it