Google announced Gemini, a large language model (LLM) developed by subsidiary Google DeepMind, during the Google I/O keynote on May 10, 2023. It was positioned as a more powerful successor to PaLM 2, which was also unveiled at the event, with Google CEO Sundar Pichai stating that Gemini was still in its early developmental stages.[1][2] Unlike other LLMs, Gemini was said to be unique in that it was not trained on a text corpus alone and was designed to be multimodal, meaning it could process multiple types of data simultaneously, including text, images, audio, video, and computer code.[3] It had been developed as a collaboration between DeepMind and Google Brain, two branches of Google that had been merged as Google DeepMind the previous month.[4] In an interview with Wired, DeepMind CEO Demis Hassabis touted Gemini's advanced capabilities, which he believed would allow the algorithm to trump OpenAI's ChatGPT, which runs on GPT-4 and whose growing popularity had been aggressively challenged by Google with LaMDA and Bard. Hassabis highlighted the strengths of DeepMind's AlphaGo program, which gained worldwide attention in 2016 when it defeated Go champion Lee Sedol, saying that Gemini would combine the power of AlphaGo and other Google–DeepMind LLMs.[5]
In August 2023, The Information published a report outlining Google's roadmap for Gemini, revealing that the company was targeting a launch date of late 2023. According to the report, Google hoped to surpass OpenAI and other competitors by combining conversational text capabilities present in most LLMs with artificial intelligence–powered image generation, allowing it to create contextual images and be adapted for a wider range of use cases.[6] Like Bard,[7] Google co-founder Sergey Brin was summoned out of retirement to assist in the development of Gemini, along with hundreds of other engineers from Google Brain and DeepMind;[6][8] he was later credited as a "core contributor" to Gemini.[9] Because Gemini was being trained on transcripts of YouTube videos, lawyers were brought in to filter out any potentially copyrighted materials.[6]
With news of Gemini's impending launch, OpenAI hastened its work on integrating GPT-4 with multimodal features similar to those of Gemini.[10]The Information reported in September that several companies had been granted early access to "an early version" of the LLM, which Google intended to make available to clients through Google Cloud's Vertex AI service. The publication also stated that Google was arming Gemini to compete with both GPT-4 and Microsoft's GitHub Copilot.[11][12]
Launch
On December 6, 2023, Pichai and Hassabis announced "Gemini 1.0" at a virtual press conference.[13][14] It comprised three models: Gemini Ultra, designed for "highly complex tasks"; Gemini Pro, designed for "a wide range of tasks"; and Gemini Nano, designed for "on-device tasks". At launch, Gemini Pro and Nano were integrated into Bard and the Pixel 8 Pro smartphone, respectively, while Gemini Ultra was set to power "Bard Advanced" and become available to software developers in early 2024. Other products that Google intended to incorporate Gemini into included Search, Ads, Chrome, Duet AI on Google Workspace, and AlphaCode 2.[15][14] It was made available only in English.[14][16] Touted as Google's "largest and most capable AI model" and designed to emulate human behavior,[17][14][18] the company stated that Gemini would not be made widely available until the following year due to the need for "extensive safety testing".[13] Gemini was trained on and powered by Google's Tensor Processing Units (TPUs),[13][16] and the name is in reference to the DeepMind–Google Brain merger as well as NASA's Project Gemini.[19]
Gemini Ultra was said to have outperformed GPT-4, Anthropic's Claude 2, Inflection AI's Inflection-2, Meta's LLaMA 2, and xAI's Grok 1 on a variety of industry benchmarks,[20][13] while Gemini Pro was said to have outperformed GPT-3.5.[3] Gemini Ultra was also the first language model to outperform human experts on the 57-subject Massive Multitask Language Understanding (MMLU) test, obtaining a score of 90%.[3][19] Gemini Pro was made available to Google Cloud customers on AI Studio and Vertex AI on December 13, while Gemini Nano will be made available to Android developers as well.[21][22][23] Hassabis further revealed that DeepMind was exploring how Gemini could be "combined with robotics to physically interact with the world".[24] In accordance with an executive order signed by U.S. President Joe Biden in October, Google stated that it would share testing results of Gemini Ultra with the federal government of the United States. Similarly, the company was engaged in discussions with the government of the United Kingdom to comply with the principles laid out at the AI Safety Summit at Bletchley Park in November.[3]
Updates
Google partnered with Samsung to integrate Gemini Nano and Gemini Pro into its Galaxy S24 smartphone lineup in January 2024.[25][26] The following month, Bard and Duet AI were unified under the Gemini brand,[27][28] with "Gemini Advanced with Ultra 1.0" debuting via a new "AI Premium" tier of the Google One subscription service.[29] Gemini Pro also received a global launch.[30]
In February, Google launched "Gemini 1.5" in a limited capacity, positioned as a more powerful and capable model than 1.0 Ultra.[31][32][33] This "step change" was achieved through various technical advancements, including a new architecture, a mixture-of-experts approach, and a larger one-million-token context window, which equates to roughly an hour of silent video, 11 hours of audio, 30,000 lines of code, or 700,000 words.[34] The same month, Google debuted Gemma, a family of free and open-source LLMs that serve as a lightweight version of Gemini. They come in two sizes, with a neural network with two and seven billion parameters, respectively. Multiple publications viewed this as a response to Meta and others open-sourcing their AI models, and a stark reversal from Google's longstanding practice of keeping its AI proprietary.[35][36][37] Google announced an additional model, Gemini 1.5 Flash, on May 14th at the 2024 I/O keynote.[38]
Two updated Gemini models, Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, were released on September 24, 2024.[40]
Technical specifications
The first generation of Gemini ("Gemini 1") has three models, with the same software architecture. They are decoder-only transformers, with modifications to allow efficient training and inference on TPUs. They have a context length of 32,768 tokens, with multi-query attention. Two versions of Gemini Nano, Nano-1 (1.8 billion parameters) and Nano-2 (3.25 billion parameters), are distilled from larger Gemini models, designed for use by edge devices such as smartphones. As Gemini is multimodal, each context window can contain multiple forms of input. The different modes can be interleaved and do not have to be presented in a fixed order, allowing for a multimodal conversation. For example, the user might open the conversation with a mix of text, picture, video, and audio, presented in any order, and Gemini might reply with the same free ordering. Input images may be of different resolutions, while video is inputted as a sequence of images. Audio is sampled at 16 kHz and then converted into a sequence of tokens by the Universal Speech Model. Gemini's dataset is multimodal and multilingual, consisting of "web documents, books, and code, and includ[ing] image, audio, and video data".[41]
The second generation of Gemini ("Gemini 1.5") has two models. Gemini 1.5 Pro is a multimodal sparse mixture-of-experts, with a context length in the millions, while Gemini 1.5 Flash is distilled from Gemini 1.5 Pro, with a context length above 2 million.[42]
Gemma 2 27B is trained on web documents, code, science articles. Gemma 2 9B was distilled from 27B. Gemma 2 2B was distilled from a 7B model that remained unreleased.[43]
As of 2024 August, the models released include[44]
Gemma 1 (2B, 7B)
CodeGemma (2B and 7B) - Gemma 1 finetuned for code generation.
Gemma 2 (2B, 9B, 27B) - 27B trained from scratch. 2B and 9B
RecurrentGemma (2B, 9B) - Griffin-based, instead of Transformer-based.
PaliGemma (3B) - A vision-language model that takes text and image inputs, and outputs text. It is based on PaLI.[45]
Gemini's launch was preluded by months of intense speculation and anticipation, which MIT Technology Review described as "peak AI hype".[46][20] In August 2023, Dylan Patel and Daniel Nishball of research firm SemiAnalysis penned a blog post declaring that the release of Gemini would "eat the world" and outclass GPT-4, prompting OpenAI CEO Sam Altman to ridicule the duo on X (formerly Twitter).[47][48] Business magnate Elon Musk, who co-founded OpenAI, weighed in, asking, "Are the numbers wrong?"[49] Hugh Langley of Business Insider remarked that Gemini would be a make-or-break moment for Google, writing: "If Gemini dazzles, it will help Google change the narrative that it was blindsided by Microsoft and OpenAI. If it disappoints, it will embolden critics who say Google has fallen behind."[50]
Reacting to its unveiling in December 2023, University of Washington professor emeritus Oren Etzioni predicted a "tit-for-tat arms race" between Google and OpenAI. Professor Alexei Efros of the University of California, Berkeley praised the potential of Gemini's multimodal approach,[19] while scientist Melanie Mitchell of the Santa Fe Institute called Gemini "very sophisticated". Professor Chirag Shah of the University of Washington was less impressed, likening Gemini's launch to the routineness of Apple's annual introduction of a new iPhone. Similarly, Stanford University's Percy Liang, the University of Washington's Emily Bender, and the University of Galway's Michael Madden cautioned that it was difficult to interpret benchmark scores without insight into the training data used.[46][51] Writing for Fast Company, Mark Sullivan opined that Google had the opportunity to challenge the iPhone's dominant market share, believing that Apple was unlikely to have the capacity to develop functionality similar to Gemini with its Sirivirtual assistant.[52] Google shares spiked by 5.3 percent the day after Gemini's launch.[53][54]
Google faced criticism for a demonstrative video of Gemini, which was not conducted in real time.[55]
See also
Gato, a multimodal neural network developed by DeepMind
У Вікіпедії є статті про інших людей із прізвищем Суарес. Франциско Суарес Народився 5 січня 1548[1][2][3]Гранада, Кастильська Корона[4][2]Помер 25 вересня 1617(1617-09-25)[1][2][…] (69 років)Лісабон, Португалія[2]Країна ІспаніяДіяльність богослов, філосо...
يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (نوفمبر 2019) الجنرال دير بيونيير الدولة ألمانيا تعديل مصدري - تعديل الجنرال دير بيونيير ( جنرال المهندسين ) جنر...
Artikel ini sebatang kara, artinya tidak ada artikel lain yang memiliki pranala balik ke halaman ini.Bantulah menambah pranala ke artikel ini dari artikel yang berhubungan atau coba peralatan pencari pranala.Tag ini diberikan pada Maret 2017. Dalam seri Final Fantasy dari RPG oleh Square Enix, monster biasanya menjadi musuh antagonis untuk pemain, dan biasanya tidak ada hubungannya dengan alur cerita. Sebagian monster berasal dari cerita rakyat dan literatur dari berbagai belahan dunia. Dibaw...
Playground de uma cohousing ao lado da casa comum Cohousing[1] é uma comunidade intencional de casas particulares agrupadas em torno de espaço compartilhado. Cada casa anexa ou unifamiliar possui comodidades tradicionais, incluindo uma cozinha privativa. Os espaços compartilhados geralmente apresentam uma casa comum, que pode incluir uma grande área de cozinha e sala de jantar, lavanderia e espaços de lazer. O espaço ao ar livre compartilhado pode incluir estacionamento, passarelas, esp...
As Testemunhas de Jeová ensinam que a Liga das Nações e as Nações Unidas foram criadas como uma falsificação do Reino de Deus. Joseph F. Rutherford, segundo presidente da Sociedade Torre de Vigia, condenou políticos, líderes empresariais e clérigos em seu apoio à Liga das Nações. As Testemunhas de Jeová acreditam que as Nações Unidas logo destruirão todas as outras religiões e depois se voltarão contra as TJ. História do ensino Desenvolvimento Em uma convenção de Estudan...
British politician This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Horace King, Baron Maybray-King – news · newspapers · books · scholar · JSTOR (March 2009) (Learn how and when to remove this template message) The Right HonourableThe Lord Maybray-KingPCKing in 1966Speaker of the House of Commonsof the Unite...
1759 battle of the Seven Years' War Battle of KunersdorfPart of the Third Silesian WarBattle of Kunersdorf, Alexander KotzebueDate12 August 1759[a]LocationKunersdorf, Margraviate of Brandenburg (now Kunowice, Lubusz Voivodeship, Poland)52°21′11″N 14°36′46″E / 52.35306°N 14.61278°E / 52.35306; 14.61278Result Russo-Austrian victoryBelligerents Russia Austria PrussiaCommanders and leaders Pyotr Saltykov Ernst von Laudon Frederick IIStrength 59,50...
Administrator of the Imperial German occupational forces on the Eastern Front of WWI Supreme Commander of All German Forces in the EastOberbefehlshaber der gesamten Deutschen Streitkräfte im Osten (German)1914–1919 Flag Coat of arms StatusMilitary occupation authority of the German EmpireCapitalKönigsberg (HQ, 1919)[citation needed]Supreme Commander • 1914–1916 Paul von Hindenburg• 1916–1918 Prince Leopold of Bavaria Chief of Staff • 1914–...
Ini adalah nama Batak Toba, marganya adalah Hutapea. Yemima HutapeaNama lahirYemima Putri Alma Lamtiur HutapeaLahir2 September 1996 (umur 27) Jakarta, IndonesiaAsalIndonesiaGenrepopPekerjaanPenyanyiInstrumenVokalTahun aktif2014–sekarangLabelHits Records Yemima Hutapea (lahir 2 September 1995) atau yang akrab dipanggil MIMA merupakan seorang penyanyi berkebangsaan Indonesia. Dia memulai kariernya sebagai 2nd runner up di Miss Indonesia 2013. Sebelumnya telah lebih dulu muncul di berbaga...
يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (فبراير 2023) لمعانٍ أخرى، طالع روبرت كامبل (توضيح). هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا،
Dam in the Western Cape, South Africa Dam in Gouda, Western CapeVoëlvlei DamAn aerial view of Voëlvlei Dam with the Groot Winterhoek Mountains in the foreground.CountrySouth AfricaLocationGouda, Western CapeCoordinates33°20′15″S 19°2′1″E / 33.33750°S 19.03361°E / -33.33750; 19.03361PurposeWater supplyStatusOperationalOpening date1952Owner(s)Department of Water and SanitationDam and spillwaysHeight10 metres (33 ft)Length2,910 metres (9,550 f...
لمعانٍ أخرى، طالع مصباح (توضيح). هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (مايو 2019) مصباح مصباح اللفظ مِصْبَاحْ الجنس مذكر لغة الاسم العربية[1] أصل الاسم الأصل اللغوي العربية الاشتقاق ص ب ح النوع م...
جوزيف بيل معلومات شخصية الميلاد 2 ديسمبر 1837(1837-12-02)إدنبرة الوفاة 4 أكتوبر 1911 (73 سنة)إدنبرة مكان الدفن مقبرة دين مواطنة الإمبراطورية البريطانية الحياة العملية المدرسة الأم جامعة إدنبرة التلامذة المشهورون آرثر كونان دويل المهنة طبيب، وجراح، وأستاذ جامعي، ...
Church building in Riverside, United States of America Universalist Unitarian Church of RiversideReligionAffiliationUnitarian UniversalistLeadershipCongregationStatusActiveLocationLocation3657 Lemon St.,Riverside, CaliforniaShown within CaliforniaShow map of CaliforniaUniversalist Unitarian Church of Riverside (the United States)Show map of the United StatesGeographic coordinates33°58′56″N 117°22′17″W / 33.98222°N 117.37139°W / 33.98222; -117.37139Architect...
23rd episode of the 3rd season of The Simpsons Bart's Friend Falls in LoveThe Simpsons episodeBart with Samantha and MilhouseEpisode no.Season 3Episode 23Directed byJim ReardonWritten byJay KogenWallace WolodarskyProduction code8F22Original air dateMay 7, 1992 (1992-05-07)Guest appearances Kimmy Robertson as Samantha Stanky Phil Hartman as Troy McClure Marcia Wallace as Edna Krabappel Episode featuresChalkboard gagI will not snap bras[1]Couch gagThe couch tips over...
1930 film Dangerous ParadiseDirected byWilliam A. WellmanWritten byGrover Jones William Slavens McNuttBased onVictory by Joseph ConradProduced byB.P. SchulbergStarringNancy Carroll Richard Arlen Warner Oland Gustav von SeyffertitzCinematographyArchie StoutMusic byJohn LeipoldProductioncompanyParamount PicturesDistributed byParamount PicturesRelease dateFebruary 13, 1930Running time58 minutesCountryUnited StatesLanguageEnglish Dangerous Paradise is a 1930 American pre-Code drama film directed ...
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Stolzenfels Castle – news · newspapers · books · scholar · JSTOR (August 2014) (Learn how and when to remove this template message) Schloss in Koblenz, GermanyStolzenfels CastleSchloss StolzenfelsStolzenfels CastleStolzenfels CastleLocation within Rhineland-Pal...
American economist Hollis CheneryChief Economist of the World BankIn office1972–1982PresidentRobert McNamaraAlden W. ClausenPreceded byPosition establishedSucceeded byAnne Osborn Krueger Personal detailsBornHollis Burnley Chenery(1918-01-06)6 January 1918Richmond, Virginia, U.S.Died1 September 1994(1994-09-01) (aged 76)Santa Fe, New Mexico, U.S.Spouses Louise Seamster (m. 1942; div. 1964) Mary Montgomery (m....
Si ce bandeau n'est plus pertinent, retirez-le. Cliquez ici pour en savoir plus. Cet article/section est soupçonné d’enfreindre un droit d’auteur (avril 2023). Si vous êtes l’auteur de ce texte, vous êtes invité à donner votre avis ici. À moins qu’il soit démontré que l’auteur de la page autorise la reproduction, cette page sera supprimée ou purgée au bout d’une semaine. En attendant le retrait de cet avertissement, veuillez ne pas réutiliser ce texte. Si ce bandeau n'...
Kiwi Bus BuildersManaBus Volvo B11RHeadquartersTaurangaKey peopleRichard Drummond (MD)ProductsBus & coach bodiesWebsitekiwibus.co.nz Kiwi Bus Builders[1] is a New Zealand bus bodybuilder based in Tauranga. History Kiwi Bus Builders-bodied Volvo B11R in Rotorua in April 2014, operating a Nakedbus service Kiwi Bus Builders was established in 1993. It has built bus and coach bodies for many operators including Go Bus Transport, NZ Bus and Tranzit Coachlines.[2][3][...