Googlebot

Original author(s): Google
Type: Web crawler
Website: Googlebot FAQ

Googlebot is the web crawler software used by Google to collect documents from the web and build a searchable index for the Google Search engine. The name refers to two different types of web crawler: a desktop crawler (which simulates a desktop user) and a mobile crawler (which simulates a mobile user).[1]

Behavior

A website will probably be crawled by both Googlebot Desktop and Googlebot Mobile. However, starting in September 2020, all sites were switched to mobile-first indexing, meaning that Google crawls the web using a smartphone Googlebot.[2] The subtype of Googlebot can be identified by looking at the user agent string in the request. However, both crawler types obey the same product token (user agent token) in robots.txt, so a developer cannot selectively target either Googlebot Mobile or Googlebot Desktop using robots.txt.
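As a rough illustration of telling the two subtypes apart from the user agent string, the sketch below uses a simple substring heuristic. The function name `googlebot_subtype` and the two sample strings are illustrative assumptions, as is the rule that the smartphone crawler's string contains the word "Mobile". User agent strings are easy to spoof, so this only classifies what a request claims to be.

```python
from typing import Optional


def googlebot_subtype(user_agent: str) -> Optional[str]:
    """Return "mobile", "desktop", or None if the string does not mention Googlebot."""
    if "Googlebot" not in user_agent:
        return None
    # Assumption: the smartphone crawler presents itself as a mobile browser,
    # so its string contains "Mobile"; the desktop crawler's string does not.
    return "mobile" if "Mobile" in user_agent else "desktop"


# Illustrative sample strings, not an authoritative list.
DESKTOP_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
MOBILE_UA = (
    "Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) "
    "AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.131 Mobile "
    "Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
)

print(googlebot_subtype(DESKTOP_UA))
print(googlebot_subtype(MOBILE_UA))
print(googlebot_subtype("Mozilla/5.0 (X11; Linux x86_64)"))
```

Because both subtypes share one robots.txt token, a classification like this is useful for log analysis but cannot be turned into subtype-specific robots.txt rules.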

Google provides various methods that enable website owners to manage the content displayed in Google's search results. If a webmaster chooses to restrict the information on their site that is available to Googlebot, or to another spider, they can do so with the appropriate directives in a robots.txt file,[3] or by adding the meta tag <meta name="Googlebot" content="nofollow" /> to the web page.[4] Googlebot requests to web servers are identifiable by a user-agent string containing "Googlebot" and a host address containing "googlebot.com".[5]
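The two mechanisms in this paragraph can be sketched with Python's standard library. The robots.txt content, the example.com URLs, and the helper names below are hypothetical. The host check is deliberately stricter than "containing": it accepts only googlebot.com and its subdomains, which is an assumption about how such a check would be hardened. `verify_googlebot_ip` needs network access, handles IPv4 only, and is not called here.

```python
import socket
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used only for illustration.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no HTTP request)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def hostname_matches_googlebot(hostname: str) -> bool:
    """Check that a host name is googlebot.com or one of its subdomains."""
    host = hostname.rstrip(".").lower()
    return host == "googlebot.com" or host.endswith(".googlebot.com")


def verify_googlebot_ip(ip_address: str) -> bool:
    """Reverse-resolve the IP, check the domain, then forward-confirm it.

    Requires network access; returns False if either lookup fails.
    """
    try:
        hostname, _, _ = socket.gethostbyaddr(ip_address)
        if not hostname_matches_googlebot(hostname):
            return False
        _, _, addresses = socket.gethostbyname_ex(hostname)
    except OSError:
        return False
    return ip_address in addresses


parser = build_parser(ROBOTS_TXT)
# The token "Googlebot" covers both the desktop and the mobile crawler.
print(parser.can_fetch("Googlebot", "https://example.com/private/report.html"))
print(parser.can_fetch("Googlebot", "https://example.com/index.html"))
print(hostname_matches_googlebot("crawl-203-0-113-5.googlebot.com"))
print(hostname_matches_googlebot("googlebot.com.example.net"))
```

Note that `can_fetch` expects the short product token rather than a full user agent string.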

Currently, Googlebot follows HREF links and SRC links.[3] There is increasing evidence that Googlebot can also execute JavaScript and parse content generated by Ajax calls.[6] There are many theories about how well Googlebot can process JavaScript, including the view that it has only a minimal ability derived from custom interpreters.[7] Currently, Googlebot uses a web rendering service (WRS) that is based on the Chromium rendering engine (version 74 as of 7 May 2019).[8] Googlebot discovers pages by harvesting every link on every page that it can find. Unless prohibited by a nofollow tag, it then follows these links to other web pages. To be crawled and indexed, new web pages must either be linked to from other known pages on the web or be manually submitted by the webmaster.
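The link-harvesting step can be illustrated with a small parser that collects HREF and SRC targets and skips anchors marked nofollow. This is a minimal sketch of the general technique, not Googlebot's actual implementation. The class name `LinkHarvester`, the helper `harvest_links`, and the sample page are hypothetical, and the sketch ignores page-level nofollow meta tags and JavaScript-generated links.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkHarvester(HTMLParser):
    """Collect absolute URLs from href and src attributes."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        rel_tokens = (attributes.get("rel") or "").lower().split()
        # Skip anchors that the page author has marked rel="nofollow".
        if tag == "a" and "nofollow" in rel_tokens:
            return
        for name in ("href", "src"):
            value = attributes.get(name)
            if value:
                # Resolve relative references against the page URL.
                self.links.append(urljoin(self.base_url, value))


def harvest_links(html, base_url):
    harvester = LinkHarvester(base_url)
    harvester.feed(html)
    harvester.close()
    return harvester.links


SAMPLE_PAGE = (
    '<a href="/a">A</a>'
    '<a rel="nofollow" href="/b">B</a>'
    '<img src="img/x.png">'
    '<script src="https://cdn.example.org/app.js"></script>'
)

for url in harvest_links(SAMPLE_PAGE, "https://example.com/dir/page.html"):
    print(url)
```

A crawler built on this idea would add each harvested URL to a queue of pages to visit, which is why a page that nothing links to is only found if it is submitted manually.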

A problem that webmasters with low-bandwidth web hosting plans[citation needed] have often noted with Googlebot is that it consumes an enormous amount of bandwidth.[citation needed] This can cause websites to exceed their bandwidth limit and be taken down temporarily. This is especially troublesome for mirror sites that host many gigabytes of data. Google provides "Search Console", which allows website owners to throttle the crawl rate.[9]
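One way a site owner might gauge this bandwidth before changing the crawl rate is to total the response sizes served to Googlebot in the server's access log. The sketch below assumes the widely used combined log format. The function name `googlebot_bytes`, the sample lines, and the addresses in them are made up for illustration.

```python
import re

# Assumed layout: host ident user [time] "request" status bytes "referer" "agent"
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)


def googlebot_bytes(log_lines):
    """Sum the response sizes of requests whose user agent mentions Googlebot."""
    total = 0
    for line in log_lines:
        match = LOG_PATTERN.match(line.strip())
        if match is None:
            continue  # Skip lines that do not fit the assumed format.
        if "Googlebot" not in match.group("agent"):
            continue
        size = match.group("size")
        if size != "-":  # "-" means no response body was sent.
            total += int(size)
    return total


GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
SAMPLE_LOG = [
    '192.0.2.10 - - [10/Oct/2023:13:55:36 +0000] "GET /index.html HTTP/1.1" '
    '200 2326 "-" "' + GOOGLEBOT_UA + '"',
    '203.0.113.7 - - [10/Oct/2023:13:56:01 +0000] "GET /about.html HTTP/1.1" '
    '200 5120 "-" "Mozilla/5.0 (X11; Linux x86_64)"',
    '192.0.2.10 - - [10/Oct/2023:13:57:12 +0000] "GET /big.iso HTTP/1.1" '
    '200 1000000 "-" "' + GOOGLEBOT_UA + '"',
]

print(googlebot_bytes(SAMPLE_LOG))
```

The total counts every request that claims to be Googlebot, so impostors using the same user agent string would inflate it.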

How often Googlebot crawls a site depends on the crawl budget. Crawl budget is an estimate of how often a website is updated.[citation needed] Technically, Googlebot's development team (the Crawling and Indexing team) uses several defined terms internally to cover what "crawl budget" stands for.[10] Since May 2019, Googlebot has used the latest Chromium rendering engine, which supports ECMAScript 6 features. This makes the bot more "evergreen" and ensures that it does not rely on a rendering engine that is outdated compared with browser capabilities.[8]

Mediabot

Mediabot is the web crawler that Google uses to analyze page content so that Google AdSense can serve contextually relevant advertising on a web page. Mediabot identifies itself with the user agent string "Mediapartners-Google/2.1".

Unlike other crawlers, Mediabot does not follow links to discover new crawlable URLs; it only visits URLs that include the AdSense code.[11] Where that content resides behind a login, the crawler can be given a login so that it is able to crawl protected content.[12]

Inspection Tool Crawlers

InspectionTool is the crawler used by Search testing tools such as the Rich Result Test and URL inspection in Google Search Console. Apart from the user agent and user agent token, it mimics Googlebot.[13]

A guide to the crawlers was independently published.[14] It details four distinct crawler agents, based on web server directory index data: one non-Chrome crawler and three Chrome crawlers.

References

  1. ^ "Googlebot". Google. 2019-03-11. Retrieved 2019-03-11.
  2. ^ "Announcing mobile first indexing for the whole web". Google Developers. Retrieved 2021-03-17.
  3. ^ a b "Google Search Console". Google.com.
  4. ^ "Google Search Console". search.google.com. Retrieved 2019-03-11.
  5. ^ "What is Googlebot | Google Search Central | Documentation". May 2022.
  6. ^ "Understand the JavaScript SEO basics | Search for Developers". Google Developers. Retrieved 2020-07-26.
  7. ^ Splitt, Martin. "How Google Search indexes JavaScript sites - JavaScript SEO". YouTube. Archived from the original on 2021-12-12.
  8. ^ a b "The new evergreen Googlebot". Official Google Webmaster Central Blog. Retrieved 2019-06-07.
  9. ^ "Google - Webmasters". Retrieved 2012-12-15.
  10. ^ "What Crawl Budget Means for Googlebot". Official Google Webmaster Central Blog. Retrieved 2018-07-04.
  11. ^ "About the AdSense Crawler".
  12. ^ "Display ads on login-protected pages".
  13. ^ "Google Crawler (User Agent) Overview".
  14. ^ "The Ultimate Guide to the New InspectionTool Crawlers".
