New AI and accessibility updates across Android, Chrome and more



 Google Blog:

Advances in AI continue to make our world more and more accessible. Today, in honor of Global Accessibility Awareness Day, we’re rolling out new updates to our products across Android and Chrome, as well as adding new resources for developers building speech recognition tools.

More AI-powered innovation with Android

We’re building on our work and integrating the best of Google AI and Gemini into core mobile experiences customized for vision and hearing.

Get all the details with Gemini and TalkBack​

Last year we brought Gemini’s capabilities to TalkBack, Android’s screen reader, providing people who are blind or low-vision with AI-generated descriptions for images, even when there’s no alt text. Today, we’re expanding this Gemini integration so people can ask questions and get responses about their images. That means the next time a friend texts you a photo of their new guitar, you can get a description and ask follow-up questions about the make and color, or even what else is in the image. People can now also get descriptions and ask questions about their whole screen. So if you’re shopping for the latest sales on your favorite shopping app, you can ask Gemini about the material of an item or if a discount is available.

2469_ANC_Accessibility-Launches_Gemini-Talkback-Option-1_v03-optimize.gif

Use TalkBack's Gemini powered capabilities to get a description of what’s on your screen.

Understand mooooore of the emotion behind captions​

With Expressive Captions, your phone provides real-time captions for anything with sound across most apps on your phone — using AI to not only capture what someone says, but how they say it. We know one of the ways people express themselves is by dragging out the sound of their words, which is why we developed a new duration feature on Expressive Captions, so you can know when a sports announcing is calling out an “amaaazing shot” or when the video message is not “no” but “nooooo.” You’ll also receive even more labels for sounds, so you know when someone is whistling or clearing their throat. This new version is rolling out in English in the U.S., U.K., Canada and Australia for devices running Android 15 and above.

Pixel_10_-_DARK_MODE_Soccer_Case_V3.gif

With Expressive Captions new duration feature, get even more context of what's being said in the audio and video on your phone.

Improving speech recognition around the world

In 2019, we launched Project Euphonia to find ways to make speech recognition more accessible for people with non-standard speech. Now we’re supporting developers and organizations around the world as they bring that work to even more languages and cultural contexts.

New developer resources​

To improve the ecosystem of tools globally, we’re providing developers with our open-source repositories via Project Euphonia’s GitHub page. They can now develop personalized audio tools for research or train their models for diverse speech patterns.

Support for new projects in Africa​

Earlier this year we partnered with Google.org to provide support to the University College London in their creation of the Centre for Digital Language Inclusion (CDLI). The CDLI is working to improve speech recognition technology for non-English speakers in Africa by creating open-source datasets in 10 African languages, building new speech recognition models and continuing to support the ecosystem of organizations and developers in this space.

Expanding accessibility options for students

Accessibility tools can be particularly helpful for students with disabilities, from using facial gestures to navigate their Chromebooks with Face Control to customizing their reading experience with Reading Mode.

And now when you use your Chromebook with College Board’s Bluebook testing app (which is where students can take the SAT and most Advanced Placement exams) you’ll have access to all of Google’s built-in accessibility features. This includes ChromeVox screen reader and Dictation, along with College Board’s own digital testing tools.

Making Chrome more accessible

With more than 2 billion people using Chrome each day, we’re always striving to make our browser easier to use and more accessible for everyone with features like Live Caption and image descriptions for screen reader users.

Access PDFs more easily on Chrome​

Previously, if you opened a scanned PDF in your desktop Chrome browser, you wouldn’t be able to use your screen reader to interact with it. Now with Optical Character Recognition (OCR), Chrome automatically recognizes these types of PDFs, so you can highlight, copy and search for text like any other page and use your screen reader to read them.

Read with ease with Page Zoom​

Page Zoom now lets you increase the size of the text you see in Chrome on Android without affecting the webpage layout or your browsing experience — just like how it works on Chrome desktop. You can customize how much you want to zoom in and easily apply the preference to all the pages you visit or just specific ones.

GoogleKeyword_GAAD_InLineAsset_GIF.gif

Page Zoom works with Chrome on Android, letting you customize how you see pages

To start using this feature, just tap the three-dot menu in the top right corner on Chrome and set your zoom preferences.


 Source:

 
I do not want stupid expressive live captions - I want accurate live captions unlike the complete joke live captions are at the moment which are virtually unusable.
 

My Computer

System One

  • OS
    Windows 11 Pro + Win11 Canary VM.
    Computer type
    Laptop
    Manufacturer/Model
    ASUS Zenbook 14
    CPU
    I9 13th gen i9-13900H 2.60 GHZ
    Motherboard
    Yep, Laptop has one.
    Memory
    16 GB soldered
    Graphics Card(s)
    Integrated Intel Iris XE
    Sound Card
    Realtek built in
    Monitor(s) Displays
    laptop OLED screen
    Screen Resolution
    2880x1800 touchscreen
    Hard Drives
    1 TB NVME SSD (only weakness is only one slot)
    PSU
    Internal + 65W thunderbolt USB4 charger
    Case
    Yep, got one
    Cooling
    Stella Artois (UK pint cans - 568 ml) - extra cost.
    Keyboard
    Built in UK keybd
    Mouse
    Bluetooth , wireless dongled, wired
    Internet Speed
    900 mbs (ethernet), wifi 6 typical 350-450 mb/s both up and down
    Browser
    Edge
    Antivirus
    Defender
    Other Info
    TPM 2.0, 2xUSB4 thunderbolt, 1xUsb3 (usb a), 1xUsb-c, hdmi out, 3.5 mm audio out/in combo, ASUS backlit trackpad (inc. switchable number pad)

    Macrium Reflect Home V8
    Office 365 Family (6 users each 1TB onedrive space)
    Hyper-V (a vm runs almost as fast as my older laptop)

Latest Support Threads

Back
Top Bottom