Pushing the frontiers of audio generation
AI

Pushing the frontiers of audio generation

Research Published 30 October 2024 Authors Zalán Borsos, Matt Sharifi and Marco Tagliasacchi Our pioneering speech generation technologies are helping people around the world interact with more natural, conversational and intuitive digital assistants and AI tools. Speech is central to human connection. It helps people around the world exchange information and ideas, express emotions and

Generating zero-shot personalized portraits
AI

A return to hand-written notes by learning to read & write

Digital note-taking is gaining popularity, offering a durable, editable, and easily indexable way of storing notes in a vectorized form. However, a substantial gap remains between digital note-taking and traditional pen-and-paper note-taking, a practice still favored by a majority of people. Bridging this gap by converting a note taker’s physical writing into a digital form

New generative AI tools open the doors of music creation
AI

New generative AI tools open the doors of music creation

This work was made possible by core research and engineering efforts from Andrea Agostinelli, Zalán Borsos, George Brower, Antoine Caillon, Cătălina Cangea, Noah Constant, Michael Chang, Chris Deaner, Timo Denk, Chris Donahue, Michael Dooley, Jesse Engel, Christian Frank, Beat Gfeller, Tobenna Peter Igwe, Drew Jaegle, Matej Kastelic, Kazuya Kawakami, Pen Li, Ethan Manilow, Yotam Mann,

Generating zero-shot personalized portraits
AI

Scalable self-improvement for compiler optimization

Most systems we regularly interact with, such as computer operating systems, are faced with the challenge of providing good performance, while managing limited resources like computational time and memory. Since it is challenging to optimally manage these resources, there is increasing interest in the use of machine learning (ML) to make this decision-making data driven

Generating zero-shot personalized portraits
AI

Learning DeepVariant’s hidden powers

Examining DeepVariant To better understand what DeepVariant is learning from its training data, we used a set of simple clustering and visualization methods to summarize the information captured in the model’s high dimensional data. In partnership with collaborators on the Google Genomics team, we first loaded examples into the Integrated Genomics Viewer (IGV), a widely-used

Generating zero-shot personalized portraits
AI

Taking medical imaging embeddings 3D

Over recent years, developers and researchers have made progress in efficiently building AI applications. Google Research has contributed to this effort by providing easy-to-use embedding APIs for radiology, digital pathology and dermatology to help AI developers train models in these domains with less data and compute. However, these applications have been restricted to 2D imaging,

Scroll to Top