Wednesday, September 9, 2009

How Do Search Engines Work?

This is a very interesting question! Do you know that how a “Search Engine” works, because every search engine wants to keep their secret with them. As a SEO you should have the knowledge about How a search engine works? If you know how a Search Engine works then you can make your proper plans accordingly and you can easily optimize your site. In 1994 Wide Web Worm was the first web search engine, who claim that they have more then 110,000 index web pages and web accessible documents.
We have seen several improvements in Search Engines Program. In these days we have Crawler-based search engines. Which includes the following three major elements:-
1. The spider or Web Crawler or Robot,
2. The index, and
3. The software.
Each has its own function and together they produce Search Engine Results Pages


1.) The spider:- The term “Spider“ is also known as web crawler or robot. A search engine spider is an automated program that reads web pages and follows any links to other pages within the website. This is often referred to as a site being "spidered" or "crawled". The popular search engine spiders are Googlebot (Google), Slurp (Yahoo!) and MSNBot (MSN Search). Spiders start crawling web pages when you list your site with their search engine database, with a list of page URLs it visits these pages, crawling the code and copy, it adds new pages (links) that it finds on the page to its index. The spider returns to the sites index on a regular basis, scanning for any changes. How often the spider returns is up to the search engines to decide. Website owners do have some control in how often a spider visits their site by making use of a robot.txt file. Search engines first look for this file before crawling a page.


2.) The Growing Index:- An index is like a giant catalogue or inventory of websites containing a copy of every web page and file that the spider finds. If a web page changes, this catalogue is updated with the new information. The latest figure released by Google is 8 billion indexes pages. Spider sometimes takes a while for new pages to be added to its index. Thus, a web page may have been "spidered" but not yet "indexed." Until a page is indexed - added to the index - pages will not be available to those search engines for search engine Page Result (SERP).


3.) The Performing Search Engine or Search Engine Software: - At the end of the day a search engine is a software program designed to sift through billions of pages recorded in its index to find matches to a search query and rank them in an order that it believes is most relevant. Each search engine has developed a set of rules or mathematical equations, called algorithm. This algorithm differs between engines, that’s why different search engines may produce different results for the same query. Exactly how a particular search engine's algorithm works is a kept secret, but some general rules are clear that are often used to increase a website's ranking performance. After applying this algorithm to their index of sites, a search engine comes up with a list of the most relevant results according to the search conducted. This is referred to as search engine optimization. If we want to get good SERP so we have to follow the particular search engine guidelines. These guidelines are related to On Page and Off Page optimization.

Tips for optimize your web site.

1. Use your primary keywords in domain or sub domain name of your website.
2. Choose your keywords accurately services offered by your web site.
3. Do not use dynamic
4. Get your second or third keywords in your directory name and filename.
5. Keep your webpage free from any syntax error, and validate your HTML and CSS.
6. Use Specify Meta keywords in heading of document. Limit it to 15 to 20 words.
7. Give a short Title in of your page in 60-80 characters maximum length containing your primary keyword.
8. Try to include your most important keyword phrases in heading tags on your page or to reduce size of heading use CSS.
9. Write Your Meta Description tag attractive containing keywords because it will appear on the search engine result pages.
10. Use text for navigation menu instead of using images, frames or Java scripts.
11. Try to include your most important keyword in hyperlinked text.
12. If you are using images then use ?alt? attribute to describe your image with proper keyword.
13. Submit sitemap of your website to different search engines so yor site will be indexed.
14. Keep size of your webpages near than 15KB so it can easily download on internet.
15. Try to avoid your content in Flash, frame, images, java script.
16. Do not try to spam keyword, try to add them with content of existing website.
17. Submit your website only once to every search engine.
18. Use Newsletter and RSS feed so user can subscribe your site content.
19. Publish articles related to yours web and leave your websites link.
20. Get link from other sites related to yours, by offering them oneway, towway or threeway links.