Google Gemini AI, the tech giant’s most powerful multimodal general AI model yet, is now available to users worldwide via Bard, some developer platforms, and even the new Google Pixel 8 Pro devices. The new flexible AI model, which will be available in three sizes — the yet-to-be-released Ultra, Pro, and Nano — is being viewed as Google’s answer to ChatGPT, which has been ahead of the game in terms of GenAI so far.
Demis Hassabis, CEO and Co-Founder of Google DeepMind, describes Gemini as “AI that feels less like a smart piece of software and more like something useful and intuitive — an expert assistant or helper.” Google teams collaborated to build Gemini from the ground up. It is also multimodal, which means it is not limited to the type of information it can process and can work with text, code, audio, image, and video. ChatGPT, on the other hand, does not currently support video, at least not natively.
It is also far more powerful than current models. According to Google, Gemini Ultra “exceeds current state-of-the-art results on 30 of the 32 widely-used academic benchmarks” used in large language model (LLM) research and development. Gemini Ultra is the first model to outperform human experts on massive multitask language understanding (MMLU), which tests both world knowledge and problem-solving abilities by combining 57 subjects such as math, physics, history, law, medicine, and ethics.
Google Gemini AI can also “understand, explain, and generate high-quality code in the world’s most popular programming languages, such as Python, Java, C++, and Go,” according to the company.
Google Gemini AI Availability
Google has currently integrated the new Gemini Pro with its chatbot Bard, which is a direct competitor of ChatGPT. You can interact with Gemini-powered Bard via text, but Google promises support for other modalities “soon.” The new update is available in 170 countries and territories, but only in English.
Will Google Gemini AI have an impact on Google search?
Google stated that Google Gemini AI will be expanded to include more products and services such as Search, Ads, Chrome, and Duet AI. Google stated that it is already experimenting with Gemini in Search, “where it is making Search Generative Experience (SGE) faster for users, with a 40% reduction in latency in English in the United States, alongside quality improvements.”
Why is Google Gemini AI available in three sizes?
Google Gemini AI will be available in various sizes so that it can be scaled as needed. The largest and most capable model, Gemini Ultra, will be designed for highly complex tasks. Because this model is still undergoing trust and safety checks, it is currently only available to a limited number of customers, developers, partners, and safety and responsibility experts for early experimentation and feedback. Early next year, it will be made available to developers and enterprise customers.
Gemini Pro will excel at scaling across a wide range of tasks, and it is now available in Bard for regular users worldwide. On Bard, you’ll find a “specifically tuned version of Gemini Pro in English for more advanced reasoning, planning, understanding and more” . Gemini Pro will be available to developers and enterprise customers via the Gemini API in Google AI Studio or Google Cloud Vertex AI.
Gemini Nano will manage on-device tasks and is already available on the Pixel 8 Pro, powering new features such as Summarise in the Recorder app and Smart Reply via Gboard, which will begin with WhatsApp. Android developers will be able to build with Gemini Nano via AICore, a new system capability available in Android 14, beginning with Pixel 8 Pro devices, beginning on December 13.
How does Google Gemini AI deal with hallucinations and safety concerns?
According to Eli Collins, VP, Product, Google DeepMind, while they have worked hard to improve factuality in Gemini, the LLM is still capable of hallucinating. “When we integrate these models with products like Bard, we have additional techniques to improve the accuracy of responses.”
Concerning security, Google stated that it is implementing “new safeguards to account for Gemini’s multimodal capabilities,” as well as considering potential risks and working to test and mitigate them at each stage of development. The company claims to have conducted the “most comprehensive safety evaluations of any Google AI model to date, including for bias and toxicity” and has conducted research into potential risk areas such as cyber-offense, persuasion, and autonomy. It is also collaborating with a diverse group of external experts and partners to stress-test our models across a variety of issues and identify gaps in Google’s internal evaluation approach
So, is Google Gemini superior to ChatGPT 4?
At the moment, it’s difficult to say, but Gemini appears to be more adaptable than GPT4. Its ability to work with video and on devices that do not have Internet access also gives it an advantage. Another difference is that Gemini is now free to use, whereas ChatGPT4 is only available to paid subscribers.