<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Engine Journal &#187; GoogleBot</title>
	<atom:link href="http://searchenginejournal.org/tag/googlebot/feed" rel="self" type="application/rss+xml" />
	<link>http://searchenginejournal.org</link>
	<description></description>
	<lastBuildDate>Sat, 05 Sep 2009 21:48:07 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Basics About Google</title>
		<link>http://searchenginejournal.org/760.html</link>
		<comments>http://searchenginejournal.org/760.html#comments</comments>
		<pubDate>Tue, 14 Jul 2009 12:59:58 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Crawling]]></category>
		<category><![CDATA[Google index]]></category>
		<category><![CDATA[Google search]]></category>
		<category><![CDATA[GoogleBot]]></category>
		<category><![CDATA[search results]]></category>
		<category><![CDATA[searching the web]]></category>

		<guid isPermaLink="false">http://searchenginejournal.org/?p=760</guid>
		<description><![CDATA[When you sit down at your computer and do a Google search, you&#8217;re almost instantly presented with a list of results from all over the web. How does Google find web pages matching your query, and determine the order of search results?
In the simplest terms, you could think of searching the web as looking in [...]]]></description>
			<content:encoded><![CDATA[<p>When you sit down at your computer and do a Google search, you&#8217;re almost instantly presented with a list of results from all over the web. How does Google find web pages matching your query, and determine the order of search results?</p>
<p>In the simplest terms, you could think of searching the web as looking in a very large book with an impressive index telling you exactly where everything is located. When you perform a Google search, our programs check our index to determine the most relevant search results to be returned ( &#8220;served&#8221;) to you.</p>
<p>Crawling:<br />
Crawling is the process by which Googlebot discovers new and updated pages to be added to the Google index.<br />
We use a huge set of computers to fetch (or &#8220;crawl&#8221;) billions of pages on the web. The program that does the fetching is called Googlebot (also known as a robot, bot, or spider). Googlebot uses an Algorithmic process: computer programs determine which sites to crawl, how often, and how many pages to fetch from each site.</p>
<p>Google&#8217;s crawl process begins with a list of web page URLs, generated from previous crawl processes, and augmented with Sitemap data provided by webmasters. As Googlebot visits of each these websites it detects links on each page and adds them to its list of pages to crawl. New sites, changes to existing sites, and dead links are noted and used to update the Google index.</p>
<p>Google does not accept payment to crawl a site more frequently, and we keep the search side of our business separate from our revenue-generating AdWords service.</p>
<p>Indexing:<br />
Googlebot processes each of the pages it crawls in order to compile a massive index of all the words it sees and their location on each page. In addition, we process information included in key content tags and attributes, such as Title tags and ALT attributes. Googlebot can process many, but not all, content types. For example, we can not process the content of some rich media files or dynamic pages.</p>
<p>Serving results:<br />
When a user enters a query, our machines search the index for matching pages and return the results we believe are the most relevant to the user. Relevancy is determined by over 200 factors, one of which is the PageRank for a given page. PageRank is the measure of the importance of a page based on the incoming links from other pages. In simple terms, each link to a page on your site from another site adds to your site&#8217;s PageRank. Not all links are equal: Google works hard to improve the user experience by identifying spam links and other practices that negatively impact search results. The best types of links are those that are given based on the quality of your content.</p>
<p>In order for your site to rank well in search results pages, it&#8217;s important to make sure that Google can crawl and index your site correctly. Our Webmaster Guidelines outline some best practices that can help you avoid common pitfalls and improve your site&#8217;s ranking.</p>
<p>Google&#8217;s Related Searches, Spelling Suggestions, and Google Suggest features are designed to help users save time by displaying related terms, common misspellings, and people&#8217;s queries. Like our google.com search results, the keywords used by these features are automatically generated by our web crawlers and search algorithms. We only display these suggestions when we think they might save the user time. If a site ranks well for a keyword, it&#8217;s because we&#8217;ve algorithmically determined that its content is more relevant to the user&#8217;s query.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchenginejournal.org/760.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Indexing How-to.</title>
		<link>http://searchenginejournal.org/indexing-how-to.html</link>
		<comments>http://searchenginejournal.org/indexing-how-to.html#comments</comments>
		<pubDate>Sun, 21 Jun 2009 10:44:02 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[GoogleBot]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[SERPs]]></category>

		<guid isPermaLink="false">http://searchenginejournal.org/?p=289</guid>
		<description><![CDATA[Normally, a small site with proper structure, correct linking, clean code and a couple of good backlinks gets indexed in very short time and you start finding it in SERPs. But what about large sites and doorways?
Some webmasters are building sites with over 100k pages with several levels of linking. If you have troubles with [...]]]></description>
			<content:encoded><![CDATA[<p>Normally, a small site with proper structure, correct linking, clean code and a couple of good backlinks gets indexed in very short time and you start finding it in SERPs. But what about large sites and doorways?</p>
<p><span id="more-289"></span>Some webmasters are building sites with over 100k pages with several levels of linking. If you have troubles with indexing of such sites, then use these tips to get your huge site indexed quicker:</p>
<p>- You need some links to force GoogleBot read your site, so buy, or exchange links.<br />
- Don’t point all links to the index page of your site, but put several of them to the index page and put at least one link per each subcategory. This means that if, for instance, you have four levels of linking &#8211; each category of each level must have an inbound link.<br />
- If you haven’t got enough links to feed all your subcategories, put them to second level only and when it will get properly indexed, change links to feed thrid level. However, this way is not recommended.<br />
- In case if your site stays unindexed, refer to some site analyzing software that will spot broken links, inaccessible areas of your site and other mistakes related to linking.<br />
- Check your .htaccess and robots.txt files for consistency with current project.</p>
<p>It is also possible to create site’s categories and subcategories on subdomains of general domain and link there. It’s up to you what strategy to choose, but remember &#8211; there’s nothing impossible in SEO, so even very huge site can be forced to be indexed.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchenginejournal.org/indexing-how-to.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
