How to Find Pages that Google has Indexed
If you’re a webmaster who is curious to know which of your site’s pages have been indexed by Google, simply use the site operator and perform a search query such as:
site:www.joemajewski.com
The results you see upon executing that Google search will be a list of pages that Google has indexed on your site. Although that may not be a complete list of all the pages that Google “knows” about, it is a list of all pages that Google finds relevant enough to list in their results.
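If you check several sites regularly, it can be handy to build the search link instead of typing the query each time. The snippet below is a minimal sketch of that idea; the helper name `site_query_url` is made up for illustration, and it only builds the URL that you would open in a browser.

```python
from urllib.parse import urlencode


def site_query_url(domain: str) -> str:
    """Return a Google search URL for the query 'site:<domain>'.

    Hypothetical helper for illustration; it builds the link only
    and does not fetch or parse any search results.
    """
    # urlencode percent-escapes the colon in "site:" for use in a query string.
    return "https://www.google.com/search?" + urlencode({"q": f"site:{domain}"})


print(site_query_url("www.joemajewski.com"))
```

Opening the printed link in a browser runs the same site: search described above.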
Many people believe that the more pages Google has indexed, the better, but that is not the case at all. It is much more beneficial to have 50 indexed pages that are all rich with content than 100 indexed pages with low-quality content. This is why it is important to update your robots.txt file so that Google’s crawler ignores certain pages.
For example, I prevent Google from indexing pages within the Traffic Stats tab in the tab list at the top of my blog. The reason is that Google would find dozens, or maybe even hundreds, of pages within that section of the site that are filled with daily statistics, weekly statistics, et cetera, and in general you only want pages with relevant content to be listed on Google.
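As a rough sketch of what such a robots.txt file might contain, the snippet below generates the rules from a list of directories. The function name `build_robots_txt` and the `traffic-stats` path are assumptions for illustration; the actual path of the Traffic Stats section is not given here.

```python
def build_robots_txt(blocked_dirs, user_agent="*"):
    """Build robots.txt text that disallows each directory in blocked_dirs.

    Hypothetical helper; the directory names passed in are placeholders.
    """
    lines = [f"User-agent: {user_agent}"]
    for directory in blocked_dirs:
        # Normalize to a leading and trailing slash, e.g. "/traffic-stats/".
        path = "/" + directory.strip("/") + "/"
        lines.append(f"Disallow: {path}")
    return "\n".join(lines) + "\n"


# "traffic-stats" is an assumed directory name, not the real one.
print(build_robots_txt(["traffic-stats"]))
```

The printed text would be saved as robots.txt at the root of the site, which is where crawlers look for it.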
That’s my tip of the day, but don’t expect to find these on a daily basis.
How exactly do you get the Google search engine to ignore pages? Something about robots.txt?
Yeah, if you want to prevent Google from accessing pages within a directory, you can put lines like these in your robots.txt file:
User-agent: *
Disallow: /directory_name/
That will stop Google from crawling anything within that directory. Disallow rules match by path prefix, so no trailing asterisk is needed.
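If you want to double-check a rule before uploading it, Python's standard-library `urllib.robotparser` can evaluate robots.txt text locally. This is only a sketch: `is_blocked` is a made-up helper, the URLs are placeholders, and the rule is written as a plain `/directory_name/` prefix with no wildcard, because this parser matches by path prefix and does not interpret `*` inside paths.

```python
from urllib.robotparser import RobotFileParser

# Example rules; the directory name is a placeholder.
ROBOTS_TXT = """\
User-agent: *
Disallow: /directory_name/
"""


def is_blocked(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given rules forbid `agent` from fetching `url`.

    Hypothetical helper built on the standard-library parser, which may
    not behave exactly like Google's own robots.txt handling.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, url)


# Placeholder URLs: one inside the blocked directory, one outside it.
print(is_blocked(ROBOTS_TXT, "https://www.joemajewski.com/directory_name/page.html"))
print(is_blocked(ROBOTS_TXT, "https://www.joemajewski.com/another-page/"))
```

The first URL falls under the disallowed prefix and the second does not, so only the first is reported as blocked.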