They claim that Gemini is better than any other AI model out there.
Google recently announced Gemini, a multimodal AI model family that aims to take down OpenAI's GPT-4. Gemini outperforms state-of-the-art benchmarks in large language model research. It builds upon PaLM 2, an earlier AI model by Google. The mid-level Gemini model is available in over 170 countries through Google's Bard, except in the EU and the UK due to regulation concerns.
Like GPT-4, Gemini is multimodal and can process text, code, images, and audio, enabling problem-solving and providing advice across various fields. Google plans to integrate Gemini into its products, saying that it will revolutionize computing.
"Gemini 1.0's sophisticated multimodal reasoning capabilities can help make sense of complex written and visual information. This makes it uniquely skilled at uncovering knowledge that can be difficult to discern amid vast amounts of data," writes Google. "Its remarkable ability to extract insights from hundreds of thousands of documents through reading, filtering and understanding information will help deliver new breakthroughs at digital speeds in many fields from science to finance."
Gemini offers three sizes: Ultra for complex tasks, Pro for a wide range of tasks, and Nano for on-device tasks. Each size differs in complexity based on parameter count. Nano is designed for local use on consumer devices, while Ultra requires data center hardware. Currently, only the mid-level model is publicly available. Gemini Pro performs significantly better than the previous Bard version based on PaLM 2, according to informal testing.
Image credit: Google
"This new era of models represents one of the biggest science and engineering efforts we've undertaken as a company. I'm genuinely excited for what's ahead and for the opportunities Gemini will unlock for people everywhere," reads Google CEO Sundar Pichai's statement.
Google claims Gemini is highly scalable and efficient on its custom Tensor Processing Units (TPUs), surpassing earlier models in speed. It excels in coding and includes a specialized version called AlphaCode 2, capable of solving complex programming problems. While Google's enthusiasm in its PR language may seem exaggerated, the capabilities of Gemini are undeniably impressive.
Read more on Google's blog and join our 80 Level Talent platform and our Telegram channel, follow us on Instagram, Twitter, and LinkedIn, where we share breakdowns, the latest news, awesome artworks, and more.