Google introduces Gemma - new state-of-the-art open AI models

Staff

Google Blog:

At Google, we believe in making AI helpful for everyone. We have a long history of contributing innovations to the open community, such as with Transformers, TensorFlow, BERT, T5, JAX, AlphaFold, and AlphaCode. Today, we’re excited to introduce a new generation of open models from Google to assist developers and researchers in building AI responsibly.

Gemma open models

Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Developed by Google DeepMind and other teams across Google, Gemma is inspired by Gemini, and the name reflects the Latin gemma, meaning “precious stone.” Accompanying our model weights, we’re also releasing tools to support developer innovation, foster collaboration, and guide responsible use of Gemma models.

Gemma is available worldwide, starting today. Here are the key details to know:

We’re releasing model weights in two sizes: Gemma 2B and Gemma 7B. Each size is released with pre-trained and instruction-tuned variants.
A new Responsible Generative AI Toolkit provides guidance and essential tools for creating safer AI applications with Gemma.
We’re providing toolchains for inference and supervised fine-tuning (SFT) across all major frameworks: JAX, PyTorch, and TensorFlow through native Keras 3.0.
Ready-to-use Colab and Kaggle notebooks, alongside integration with popular tools such as Hugging Face, MaxText, NVIDIA NeMo and TensorRT-LLM, make it easy to get started with Gemma.
Pre-trained and instruction-tuned Gemma models can run on your laptop, workstation, or Google Cloud with easy deployment on Vertex AI and Google Kubernetes Engine (GKE).
Optimization across multiple AI hardware platforms ensures industry-leading performance, including NVIDIA GPUs and Google Cloud TPUs.
Terms of use permit responsible commercial usage and distribution for all organizations, regardless of size.

State-of-the-art performance at size

Gemma models share technical and infrastructure components with Gemini, our largest and most capable AI model widely available today. This enables Gemma 2B and 7B to achieve best-in-class performance for their sizes compared to other open models. And Gemma models are capable of running directly on a developer laptop or desktop computer. Notably, Gemma surpasses significantly larger models on key benchmarks while adhering to our rigorous standards for safe and responsible outputs. See the technical report for details on performance, dataset composition, and modeling methodologies.

Benchmark_chart_Updates_19.02_1.width-1000.format-webp.webp

Responsible by design

Gemma is designed with our AI Principles at the forefront. As part of making Gemma pre-trained models safe and reliable, we used automated techniques to filter out certain personal information and other sensitive data from training sets. Additionally, we used extensive fine-tuning and reinforcement learning from human feedback (RLHF) to align our instruction-tuned models with responsible behaviors. To understand and reduce the risk profile for Gemma models, we conducted robust evaluations including manual red-teaming, automated adversarial testing, and assessments of model capabilities for dangerous activities. These evaluations are outlined in our Model Card.1

We’re also releasing a new Responsible Generative AI Toolkit together with Gemma to help developers and researchers prioritize building safe and responsible AI applications. The toolkit includes:

Safety classification: We provide a novel methodology for building robust safety classifiers with minimal examples.
Debugging: A model debugging tool helps you investigate Gemma's behavior and address potential issues.
Guidance: You can access best practices for model builders based on Google’s experience in developing and deploying large language models.

Optimized across frameworks, tools and hardware

You can fine-tune Gemma models on your own data to adapt to specific application needs, such as summarization or retrieval-augmented generation (RAG). Gemma supports a wide variety of tools and systems:

Multi-framework tools: Bring your favorite framework, with reference implementations for inference and fine-tuning across multi-framework Keras 3.0, native PyTorch, JAX, and Hugging Face Transformers.
Cross-device compatibility: Gemma models run across popular device types, including laptop, desktop, IoT, mobile and cloud, enabling broadly accessible AI capabilities.
Cutting-edge hardware platforms: We’ve partnered with NVIDIA to optimize Gemma for NVIDIA GPUs, from data center to the cloud to local RTX AI PCs, ensuring industry-leading performance and integration with cutting-edge technology.
Optimized for Google Cloud: Vertex AI provides a broad MLOps toolset with a range of tuning options and one-click deployment using built-in inference optimizations. Advanced customization is available with fully-managed Vertex AI tools or with self-managed GKE, including deployment to cost-efficient infrastructure across GPU, TPU, and CPU from either platform.

Free credits for research and development

Gemma is built for the open community of developers and researchers powering AI innovation. You can start working with Gemma today using free access in Kaggle, a free tier for Colab notebooks, and $300 in credits for first-time Google Cloud users. Researchers can also apply for Google Cloud credits of up to $500,000 to accelerate their projects.

Getting started

You can explore more about Gemma and access quickstart guides on ai.google.dev/gemma.

As we continue to expand the Gemma model family, we look forward to introducing new variants for diverse applications. Stay tuned for events and opportunities in the coming weeks to connect, learn and build with Gemma.

We’re excited to see what you create!

Source:

Gemma: Introducing new state-of-the-art open models

Gemma is a family of lightweight, state\u002Dof\u002Dthe art open models built from the same research and technology used to create the Gemini models.

blog.google

Click to expand...

Brink

Administrator

Staff member

MVP

Thread Starter

Feb 21, 2024

Staff
#2

Google's Gemma Optimized Across All NVIDIA AI Platforms | NVIDIA Blog

NVIDIA and Google have accelerated the performance of Gemma with NVIDIA TensorRT-LLM when running on NVIDIA GPUs — including RTX AI PCs.

blogs.nvidia.com

My Computers

System One System Two

OS

Windows 11 Pro for Workstations

Computer type

PC/Desktop

Manufacturer/Model

Custom self build

CPU

Intel i7-8700K 5 GHz

Motherboard

ASUS ROG Maximus XI Formula Z390

Memory

64 GB (4x16GB) G.SKILL TridentZ RGB DDR4 3600 MHz (F4-3600C18D-32GTZR)

Graphics Card(s)

ASUS ROG-STRIX-GTX1080TI-O11G-GAMING (11GB GDDR5X)

Sound Card

Integrated Digital Audio (S/PDIF)

Monitor(s) Displays

2 x Samsung Odyssey G75 27"

Screen Resolution

2560x1440

Hard Drives

1TB Samsung 990 PRO M.2,
4TB Samsung 990 PRO M.2,
8TB WD MyCloudEX2Ultra NAS

PSU

Seasonic Prime Titanium 850W

Case

Thermaltake Core P3 wall mounted

Cooling

Corsair Hydro H115i

Keyboard

Logitech wireless K800

Mouse

Logitech MX Master 3

Internet Speed

1 Gbps Download and 35 Mbps Upload

Browser

Google Chrome

Antivirus

Microsoft Defender and Malwarebytes Premium

Other Info

Logitech Z625 speaker system,
Logitech BRIO 4K Pro webcam,
HP Color LaserJet Pro MFP M477fdn,
APC SMART-UPS RT 1000 XL - SURT1000XLI,
Galaxy S23 Plus phone
Operating System

Windows 11 Pro

Computer type

Laptop

Manufacturer/Model

HP Spectre x360 2in1 14-eu0098nr (2024)

CPU

Intel Core Ultra 7 155H 4.8 GHz

Memory

16 GB LPDDR5x-7467 MHz

Graphics card(s)

Integrated Intel Arc

Sound Card

Poly Studio

Monitor(s) Displays

14" 2.8K OLED multitouch

Screen Resolution

2880 x 1800

Hard Drives

2 TB PCIe NVMe M.2 SSD

Internet Speed

Intel Wi-Fi 7 BE200 (2x2) and Bluetooth 5.4

Browser

Chrome and Edge

Antivirus

Windows Defender and Malwarebytes Premium

You must log in or register to reply here.

Similar threads

Article

Google Introduces Gemini Pro 1.5

Replies: 0

Views: 562

Feb 15, 2024

Brink

Article

Google introduces Gemini AI model

Replies: 1

Views: 468

Dec 6, 2023

Ghot

Article

Qualcomm Enables Meta Llama 3 to Run on Snapdragon Devices

Replies: 0

Views: 264

Apr 18, 2024

Brink

Article

How Intel is Refining Its Approach to Responsible AI

Replies: 0

Views: 394

Apr 2, 2024

Brink

Article

Google Bard is now known as Gemini

Replies: 2

Views: 895

Feb 9, 2024

Edwin

Completely Disable and Remove Copilot in Windows 11

This tutorial will show you how to completely disable the Windows Copilot feature and remove Copilot from the taskbar, Windows Search, and Microsoft Edge...
Brink

Mar 7, 2024
Enable or Disable Sudo Command in Windows 11

This tutorial will show you how to enable or disable the Sudo command for all users in Windows 11. Starting with Windows 11 build 26052 (Canary and Dev)...
Brink

Feb 8, 2024
Enable or Disable Feeds on Widgets Board in Windows 11

This tutorial will show you how to enable or disable news feeds on the widgets board for your account in Windows 11. Widgets are small windows that display...
Brink

Dec 4, 2023
Use ViVeTool to Enable or Disable Hidden Features in Windows 11

This tutorial will show you how to use ViVeTool to enable or disable hidden features in Windows 10 and Windows 11. ViVeTool is an open source tool that can...
Brink

Dec 1, 2023
Always or Never Combine Taskbar buttons and Hide Labels in Windows 11

This tutorial will show you how to always, when the taskbar is full, or never combine taskbar buttons and hide labels for your account, specific users, or...
Brink

May 24, 2023
Disable Modern Standby in Windows 10 and Windows 11

This tutorial will show you how to disable Modern Standby (S0 Low Power Idle) to enable S3 support on a Windows 10 and Windows 11 device. In Windows 10 and...
Brink

Jan 11, 2022
Disable "Show more options" context menu in Windows 11

This tutorial will show you how to enable or disable having to click on "Show more options" to see the full context menu for your account or all users in...
Brink

Oct 4, 2021
Download Official Windows 11 ISO file from Microsoft

This tutorial will show you how to download an official Windows 11 ISO file from Microsoft. Microsoft provides ISO files for Windows 11 to download. You...
Brink

Aug 19, 2021
Restore Classic File Explorer with Ribbon in Windows 11

This tutorial will show you how to restore the classic File Explorer with Ribbon for your account or all users in Windows 11. File Explorer in Windows 10...
Brink

Jul 19, 2021
Repair Install Windows 11 with an In-place Upgrade

This tutorial will show you how to do a repair install of Windows 11 by performing an in-place upgrade without losing anything. If you need to repair or...
Brink

Jul 5, 2021
Enable or Disable Windows Sandbox in Windows 11

This tutorial will show you how to enable or disable the Windows Sandbox feature for all users in Windows 11 Pro, Enterprise, or Education. Windows Sandbox...
Brink

Jun 30, 2021
Clean Install Windows 11

This tutorial will show you step by step on how to clean install Windows 11 at boot on your PC with or without an Internet connection and setup with a local...
Brink

Jun 22, 2021