
Google Flu Trends

Google Flu Trends data, South Africa

Google Flu Trends (GFT) was a web service operated by Google. It provided estimates of influenza activity for more than 25 countries. By aggregating Google Search queries, it attempted to make accurate predictions about flu activity. This project was first launched in 2008 by Google.org to help predict outbreaks of flu.[1]

Google Flu Trends stopped publishing current estimates on 9 August 2015. Historical estimates are still available for download, and current data are offered for declared research purposes.[2]

History

The idea behind Google Flu Trends was that the large number of Google search queries generated by millions of users' online health tracking behaviors could be analyzed to reveal the presence of flu-like illness in a population. Google Flu Trends compared these findings to a historic baseline level of influenza activity for the corresponding region and then reported the activity level as minimal, low, moderate, high, or intense. These estimates were generally consistent with conventional surveillance data collected by health agencies, both nationally and regionally.

Roni Zeiger helped develop Google Flu Trends.[3]

Methods

Google Flu Trends was described as using the following method to gather information about flu trends.[4][5]

First, a time series is computed for about 50 million common queries entered weekly within the United States from 2003 to 2008. A query's time series is computed separately for each state and normalized into a fraction by dividing the number of each query by the number of all queries in that state. By identifying the IP address associated with each search, the state in which this query was entered can be determined.
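The normalization step above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the log format (one `(week, state, query)` tuple per search), the function name `query_fractions`, and the sample data are all hypothetical and are not taken from Google's implementation.

```python
from collections import defaultdict

def query_fractions(log):
    """Turn a search log into per-state weekly query fractions.

    log: iterable of (week, state, query) tuples, one per search
         (a hypothetical format chosen for this sketch).
    Returns {(state, query): {week: fraction of that state's searches}}.
    """
    totals = defaultdict(int)  # (week, state) -> number of all searches
    counts = defaultdict(int)  # (week, state, query) -> searches for that query
    for week, state, query in log:
        totals[(week, state)] += 1
        counts[(week, state, query)] += 1

    series = defaultdict(dict)
    for (week, state, query), n in counts.items():
        # Normalize by the total number of searches in that state and week.
        series[(state, query)][week] = n / totals[(week, state)]
    return dict(series)

# Made-up sample log for illustration only.
sample_log = [
    (1, "NY", "flu symptoms"), (1, "NY", "weather"),
    (1, "NY", "weather"), (1, "NY", "news"),
    (2, "NY", "flu symptoms"), (2, "NY", "flu symptoms"),
    (2, "NY", "weather"), (2, "NY", "news"),
    (1, "CA", "flu symptoms"), (1, "CA", "news"),
]
fractions = query_fractions(sample_log)
print(fractions[("NY", "flu symptoms")])  # {1: 0.25, 2: 0.5}
```

Dividing by the state's total search volume means a query's series reflects its share of searches rather than raw traffic, so growth in overall search usage does not by itself look like a rise in flu interest.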

A linear model relates the log-odds of an influenza-like illness (ILI) physician visit to the log-odds of an ILI-related search query:

logit(P) = β0 + β1 × logit(Q) + ε

Here logit(p) = ln(p / (1 − p)), P is the percentage of ILI physician visits, and Q is the ILI-related query fraction computed in the previous step. β0 is the intercept, β1 is the coefficient, and ε is the error term.[citation needed]
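As a rough illustration of how such a model could be fitted, the sketch below applies ordinary least squares to logit-transformed values. It assumes P and Q are expressed as fractions between 0 and 1; the helper names (`logit`, `inv_logit`, `fit_line`, `predict_ili`) and the data are invented for this example and are not Google's code.

```python
import math

def logit(p):
    """Log-odds of a fraction p, with 0 < p < 1."""
    return math.log(p / (1.0 - p))

def inv_logit(x):
    """Inverse of logit: maps a log-odds value back to a fraction."""
    return 1.0 / (1.0 + math.exp(-x))

def fit_line(xs, ys):
    """Ordinary least squares for y = b0 + b1 * x; returns (b0, b1)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1

# Made-up weekly values: ILI-related query fraction (Q) and the
# fraction of physician visits that were ILI-related (P).
q = [0.001, 0.002, 0.004, 0.003]
p = [0.010, 0.018, 0.035, 0.026]

# Fit logit(P) = b0 + b1 * logit(Q).
b0, b1 = fit_line([logit(v) for v in q], [logit(v) for v in p])

def predict_ili(query_fraction):
    """Estimate the ILI visit fraction from a query fraction."""
    return inv_logit(b0 + b1 * logit(query_fraction))

print(predict_ili(0.0025))
```

Working in log-odds space keeps every prediction strictly between 0 and 1 once it is mapped back through `inv_logit`, which a straight-line fit on raw fractions would not guarantee.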

Each of the 50 million queries is tested as Q to see whether the result computed from that single query matches the historical ILI data obtained from the U.S. Centers for Disease Control and Prevention (CDC). This process produces a ranked list of the queries that give the most accurate predictions of CDC ILI data when used in the linear model. The top 45 queries are then chosen because, when aggregated, they fit the historical data most accurately. Using the sum of the top 45 ILI-related queries as Q, the linear model is fitted to the weekly ILI data between 2003 and 2007 to estimate the coefficients. Finally, the trained model is used to predict flu outbreaks across all regions in the United States.
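The selection procedure described above can be approximated as follows. This is a simplified sketch, not the published method: it scores each candidate by the Pearson correlation between logit-transformed series, scores the summed top-n queries on the same data instead of on held-out data, and uses invented query names and numbers.

```python
import math

def logit(p):
    return math.log(p / (1.0 - p))

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def score(query_series, ili_series):
    """How well one query-fraction series tracks ILI, in log-odds space."""
    return pearson([logit(q) for q in query_series],
                   [logit(p) for p in ili_series])

def select_queries(candidates, ili_series):
    """Rank queries individually, then keep the top n whose sum fits best.

    candidates: {query name: weekly query fractions}
    Returns (chosen query names, fit of their summed series).
    """
    ranked = sorted(candidates,
                    key=lambda name: score(candidates[name], ili_series),
                    reverse=True)
    best_n, best_fit = 0, -2.0  # correlations are never below -1
    for n in range(1, len(ranked) + 1):
        # Sum the top-n series week by week to form the aggregate Q.
        summed = [sum(week) for week in
                  zip(*(candidates[name] for name in ranked[:n]))]
        fit = score(summed, ili_series)
        if fit > best_fit:
            best_n, best_fit = n, fit
    return ranked[:best_n], best_fit

# Made-up data: five weeks of ILI visit fractions and three candidate queries.
ili = [0.010, 0.020, 0.040, 0.030, 0.015]
candidates = {
    "flu symptoms": [0.0010, 0.0021, 0.0039, 0.0031, 0.0016],
    "fever remedy": [0.0012, 0.0018, 0.0042, 0.0028, 0.0014],
    "basketball scores": [0.0030, 0.0010, 0.0012, 0.0035, 0.0020],
}
chosen, fit = select_queries(candidates, ili)
print(chosen, fit)
```

Because every candidate is scored purely on how well it tracks the ILI series, a query that merely shares the flu's seasonal pattern can rank highly in a procedure like this, which is one reason selecting among millions of candidates in this way risks overfitting.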

This algorithm has been subsequently revised by Google, partially in response to concerns about accuracy, and attempts to replicate its results have suggested that the algorithm developers "felt an unarticulated need to cloak the actual search terms identified".[6]

Privacy concerns

Google Flu Trends tried to avoid privacy violations by aggregating millions of anonymous search queries without identifying the individuals who performed the searches.[1][7] Google's search logs contain the IP address of each user, which can be used to trace a query back to the region where it was originally submitted. Google ran programs on computers to access and process the data, so no human was involved in the process. Google also implemented a policy of anonymizing IP addresses in its search logs after 9 months.[8]

However, Google Flu Trends raised concerns among some privacy groups. In 2008, the Electronic Privacy Information Center and Patient Privacy Rights sent a letter to Eric Schmidt, then the CEO of Google.[9] They conceded that the use of user-generated data could support public health efforts in significant ways, but expressed their worry that "user-specific investigations could be compelled, even over Google's objection, by court order or Presidential authority".

Impact

An initial motivation for GFT was that identifying disease activity early and responding quickly could reduce the impact of seasonal and pandemic influenza. According to one report, Google Flu Trends was able to predict regional outbreaks of flu up to 10 days before they were reported by the CDC (Centers for Disease Control and Prevention).[10]

During the 2009 flu pandemic, Google Flu Trends tracked information about flu in the United States.[11] In February 2010, the CDC identified influenza cases spiking in the mid-Atlantic region of the United States. However, Google's search-query data on flu symptoms had shown the same spike two weeks before the CDC report was released.[citation needed]

“The earlier the warning, the earlier prevention and control measures can be put in place, and this could prevent cases of influenza,” said Dr. Lyn Finelli, lead for surveillance at the influenza division of the CDC. “From 5 to 20 percent of the nation's population contract the flu each year, leading to roughly 36,000 deaths on average.” [10]

Google Flu Trends is an example of collective intelligence that can be used to identify trends and calculate predictions. The data amassed by search engines is significantly insightful because the search queries represent people's unfiltered wants and needs. “This seems like a really clever way of using data that is created unintentionally by the users of Google to see patterns in the world that would otherwise be invisible,” said Thomas W. Malone, a professor at the Sloan School of Management at MIT. “I think we are just scratching the surface of what's possible with collective intelligence.” [10]

Accuracy

The initial Google paper stated that the Google Flu Trends predictions were 97% accurate when compared with CDC data.[4] However, subsequent reports asserted that Google Flu Trends' predictions had been very inaccurate, especially in two high-profile cases. Google Flu Trends failed to predict the 2009 spring pandemic,[12] and over the interval 2011–2013 it consistently overestimated relative flu incidence,[6] predicting twice as many doctors' visits over one interval in the 2012–2013 flu season as the CDC recorded.[6][13] A 2022 study published (with commentaries) in the International Journal of Forecasting[14] found that Google Flu Trends was outperformed by the recency heuristic, an instance of so-called "naive" forecasting, where the predicted flu incidence equals the most recently observed flu incidence. For all weeks from March 18, 2007, to August 9, 2015 (the horizon for which Google Flu Trends predictions are available), the mean absolute error of Google Flu Trends was 0.38 and that of the recency heuristic was 0.20, both in percentage points. Linear regression with a single predictor, the most recently observed flu incidence, also had a mean absolute error of 0.20, and the benchmark of random prediction had a mean absolute error of 1.80.
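The recency heuristic and the mean absolute error used to compare forecasts are simple enough to write out. The sketch below uses invented numbers and, as a simplifying assumption, treats the previous week's value as the most recent observation, ignoring any reporting delay; it does not reproduce the study's data or results.

```python
def mean_absolute_error(predicted, observed):
    """Average absolute difference between predictions and observations."""
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(observed)

def recency_forecast(observed):
    """Recency heuristic: each week's forecast is the previous week's value.

    Returns forecasts for weeks 2..n, so there is one fewer than observations.
    """
    return observed[:-1]

# Made-up weekly ILI incidence, in percent of physician visits.
ili = [1.2, 1.5, 2.1, 2.8, 2.4, 1.9]
targets = ili[1:]  # the weeks that have a forecast

naive_pred = recency_forecast(ili)
# Hypothetical output of some more complex model for the same weeks.
model_pred = [1.9, 2.6, 3.5, 3.1, 2.6]

naive_mae = mean_absolute_error(naive_pred, targets)
model_mae = mean_absolute_error(model_pred, targets)
print(naive_mae, model_mae)
```

A baseline of this kind costs almost nothing to compute, which is why it is a useful benchmark: a more elaborate model has to beat it to justify its complexity.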

One source of problems is that people making flu-related Google searches may know very little about how to diagnose flu; people searching for flu or flu symptoms may well be researching symptoms of diseases that are similar to flu but are not actually flu.[15] Furthermore, analysis of search terms reportedly tracked by Google, such as "fever" and "cough", as well as the effects of changes in Google's search algorithm over time, has raised concerns about the meaning of its predictions.[6] In fall 2013, Google began attempting to compensate for increases in searches due to the prominence of flu in the news, which was found to have previously skewed results.[16] However, one analysis concluded that "by combining GFT and lagged CDC data, as well as dynamically recalibrating GFT, we can substantially improve on the performance of GFT or the CDC alone."[6] A later study also demonstrated that Google search data can be used to improve estimates, reducing the errors seen in a model using CDC data alone by up to 52.7 per cent.[17]
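One simple way to combine a search-based estimate with lagged surveillance data is a regression with both as predictors. The sketch below is only an illustration of that idea, not the method used in the cited analyses: it fits a single static least-squares model with no dynamic recalibration, the two-week lag is an assumption, and all names and numbers are invented.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_combined(gft, cdc, lag=2):
    """Least squares for cdc[t] = c0 + c1 * gft[t] + c2 * cdc[t - lag].

    Returns [c0, c1, c2]. The lag of 2 weeks is an assumed reporting delay.
    """
    rows = [[1.0, gft[t], cdc[t - lag]] for t in range(lag, len(cdc))]
    targets = [cdc[t] for t in range(lag, len(cdc))]
    k = 3
    # Normal equations: (X^T X) c = X^T y.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
           for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(k)]
    return solve(xtx, xty)

# Made-up weekly series, in percent of physician visits.
gft_demo = [1.0, 2.0, 1.5, 3.0, 2.5, 4.0, 1.0, 2.2]
cdc_demo = [1.0, 1.3, 1.2, 2.0, 1.9, 2.9, 1.4, 2.3]
coeffs = fit_combined(gft_demo, cdc_demo)
print(coeffs)
```

In a setup like this the lagged CDC term anchors the estimate to recent confirmed data, while the search-based term supplies information about the weeks for which surveillance reports are not yet available.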

By re-assessing the original GFT model, researchers uncovered that the model was aggregating queries about different health conditions, something that could lead to an over-prediction of ILI rates; in the same work, a series of more advanced linear and nonlinear better-performing approaches to ILI modelling have been proposed.[18]

However, follow-up work was able to substantially improve the accuracy of GFT through the use of a random forest regression model trained on both the incidence of influenza-like illness and the output of the original GFT model.[19]

Similar projects, such as the flu-prediction project[20] by the Institute of Cognitive Science at Osnabrück University, carry the basic idea forward by combining social media data (e.g. Twitter) with CDC data and with structural models that infer the spatial and temporal spread[21] of the disease.

References

  1. ^ a b "Google Flu Trends | How". Archived from the original on 22 October 2012. Retrieved 10 November 2012.
  2. ^ Fred O'Connor (20 August 2015). "Google Flu Trends calls out sick, indefinitely". PCWorld. Archived from the original on 23 August 2015. Retrieved 23 August 2015.
  3. ^ Zeiger, Roni (6 October 2009). "Google Flu Trends Overview". youtube.com. YouTube. Archived from the original on 6 June 2013. Retrieved 6 June 2013.
  4. ^ a b Ginsberg, Jeremy (2009). "Detecting influenza epidemics using search engine query data" (PDF). Nature. 457 (7232): 1012–1014. Bibcode:2009Natur.457.1012G. doi:10.1038/nature07634. PMID 19020500. S2CID 125775. Archived (PDF) from the original on 4 September 2012. Retrieved 10 November 2012.
  5. ^ Ginsberg, Jeremy; Mohebbi, Matthew H.; Patel, Rajan S.; Brammer, Lynnette; Smolinski, Mark S.; Brilliant, Larry (19 February 2009). "Detecting influenza epidemics using search engine query data". Nature. 457 (7232): 1012–1014. Bibcode:2009Natur.457.1012G. doi:10.1038/nature07634. PMID 19020500. S2CID 125775.
  6. ^ a b c d e Lazer, David; Kennedy, Ryan; King, Gary; Vespignani, Alessandro (14 March 2014). "The Parable of Google Flu: Traps in Big Data Analysis" (PDF). Science. 343 (6176): 1203–1205. Bibcode:2014Sci...343.1203L. doi:10.1126/science.1248506. PMID 24626916. S2CID 206553739. Archived (PDF) from the original on 19 November 2019. Retrieved 19 December 2019.
  7. ^ Helft, Miguel (13 November 2008). "Is There a Privacy Risk in Google Flu Trends?". The New York Times. Archived from the original on 6 November 2012. Retrieved 10 November 2012.
  8. ^ "Privacy Policy – Policies & Principles – Google". Archived from the original on 3 November 2012. Retrieved 10 November 2012.
  9. ^ Peel, Deborah. "EPIC's November 12, 2008 Letter to Google Concerning Google Flu Trends" (PDF). Archived (PDF) from the original on 15 November 2012. Retrieved 10 November 2012.
  10. ^ a b c "Google Uses Searches to Track Flu's Spread" (PDF). Archived (PDF) from the original on 15 November 2012. Retrieved 10 November 2012.
  11. ^ Cook, S.; Conrad, C.; Fowlkes, A. L.; Mohebbi, M. H. (2011). Cowling, Benjamin J (ed.). "Assessing Google Flu Trends Performance in the United States during the 2009 Influenza Virus A (H1N1) Pandemic". PLOS ONE. 6 (8): e23610. Bibcode:2011PLoSO...623610C. doi:10.1371/journal.pone.0023610. PMC 3158788. PMID 21886802.
  12. ^ Schmidt, Charles (2019-09-18). "Real-time flu tracking". Nature. 573 (7774): S58–S59. doi:10.1038/d41586-019-02755-6.
  13. ^ Butler, Declan (13 February 2013). "When Google got flu wrong". Nature. 494 (7436): 155–156. Bibcode:2013Natur.494..155B. doi:10.1038/494155a. PMID 23407515.
  14. ^ Katsikopoulos, Konstantinos V.; Şimşek, Özgür; Buckmann, Marcus; Gigerenzer, Gerd (2022-04-01). "Transparent modeling of influenza incidence: Big data or a single data point from psychological theory?". International Journal of Forecasting. 38 (2): 613–619. doi:10.1016/j.ijforecast.2020.12.006. hdl:21.11116/0000-0007-E16A-4. ISSN 0169-2070. S2CID 234053376.
  15. ^ "Google Flu Trends: A case of Big Data gone bad?". SiliconANGLE. 24 March 2014.
  16. ^ Richard Harris (2014-03-13). "Google's Flu Tracker Suffers From Sniffles". NPR. Archived from the original on 2019-03-19. Retrieved 2019-12-19.
  17. ^ Preis, Tobias; Moat, Helen Susannah (29 October 2014). "Adaptive nowcasting of influenza outbreaks using Google searches". Royal Society Open Science. 1 (2): 140095. Bibcode:2014RSOS....140095P. doi:10.1098/rsos.140095. PMC 4448892. PMID 26064532.
  18. ^ Lampos, Vasileios; Miller, Andrew C.; Crossan, Steve; Stefansen, Christian (3 Aug 2015). "Advances in nowcasting influenza-like illness rates using search query logs". Scientific Reports. 5 (12760): 12760. Bibcode:2015NatSR...512760L. doi:10.1038/srep12760. PMC 4522652. PMID 26234783.
  19. ^ Kandula, Sasikiran; Shaman, Jeffrey (2019-08-02). "Reappraising the utility of Google Flu Trends". PLOS Computational Biology. 15 (8): e1007258. Bibcode:2019PLSCB..15E7258K. doi:10.1371/journal.pcbi.1007258. PMC 6693776. PMID 31374088.
  20. ^ "Flu prediction project by the University Osnabrück and IBM WATSON". Archived from the original on 2019-05-27. Retrieved 2019-12-19.
  21. ^ Schumacher, Johannes (2015). "A statistical framework to infer delay and direction of information flow from measurements of complex systems". Neural Computation. 27 (8): 1555–1608. doi:10.1162/NECO_a_00756. PMID 26079751. S2CID 25156368. Archived from the original on 2019-02-12. Retrieved 2016-03-20.
