1 of 7

Slide Notes

DownloadGo Live

HOW SEARCH ENGINE WORKS

Published on Jul 02, 2017

No Description

PRESENTATION OUTLINE

HOW SEARCH ENGINE WORKS

                           BY: RUDRA, SHARAD
Photo by DocChewbacca

WHAH IS S.E

  • Search engines are programs that search documents for specified keywords and returns a list of the documents where the keywords were found. A search engine is really a general class of programs, however, the term is often used to specifically describe systems like Google, Bing and Yahoo! Search that enable users to search for documents on the World Wide Web.

Where does it all start

  • Crawling is where it all begins. The acquisition of data about a website. This involves scanning the site and getting a complete list of everything on there – the page title, images, keywords it contains, and any other pages it links to.

How is a website crawled exactly?

  • An automated bot – a spider – visits each page, just like you or I would, only very quickly. Even in the earliest days, Google reported that they were reading a few hundred pages a second.

crawling

  • Any site that is linked to from another site already indexed, or any site that manually asked to be indexed, will eventually be crawled – some sites more frequently than others and some to a greater depth. If the site is huge and content hidden many clicks away from the homepage, the crawler bots may actually give up.

Indexing

  • you’d be forgiven for thinking this is an easy step – indexing is the process of taking all of that data you have from a crawl, and placing it in a big database. Imagine trying to a make a list of all the books you own, their author and the number of pages. Going through each book is the crawl and writing the list is the index.
Photo by LaMenta3

Ranking & Retrieval

Photo by quimby