Accelerating code migrations with AI
AI

Accelerating code migrations with AI

As Google’s codebase and its products evolve, assumptions made in the past (sometimes over a decade ago) no longer hold. For example, Google Ads has dozens of numerical unique “ID” types used as handles — for users, merchants, campaigns, etc. — and these IDs were originally defined as 32-bit integers. But with the current growth […]

Accelerating code migrations with AI
AI

Using high-performance computing to advance machine learning and wildfire research

The severity and frequency of large wildfires has increased significantly over recent years due to factors ranging from climate and weather pattern changes to increased human activities in wildland-urban interfaces. While wildfires play an important role in some forest’s natural cycle, extreme fires pose serious threats to communities and ecosystems. Frequent wildfires can disrupt, damage,

How to counter people like Terrence Howard? • AI Blog
AI

How to counter people like Terrence Howard? • AI Blog

In a world filled with misinformation and oddball theories, it’s inevitable to come across individuals who hold beliefs that defy basic logic and established facts. One such example is actor Terrence Howard, who famously claimed that 1 x 1 = 2. As baffling as this assertion might be, it presents an opportunity to explore how

Accelerating code migrations with AI
AI

Assessing ASR performance with meaning preservation

Meaning preservation as an alternative metric Our research leveraged the Project Euphonia corpus, a repository of disordered speech encompassing over 1.2 million utterances from approximately 2,000 individuals with diverse speech impairments. To expand data collection to Spanish speakers, Project Euphonia partnered with the International Alliance of ALS/MND Associations, which facilitated the contribution of speech samples

E-Commerce Video Mockups with Hedra • AI Blog
AI

E-Commerce Video Mockups with Hedra • AI Blog

In the ever-evolving landscape of e-commerce, staying ahead of the curve often means adopting the latest technologies to engage and attract customers. One such innovation making waves in the industry is the use of generative video AI models. We’ve had the opportunity to explore Hedra’s generative video AI to create interesting video mockups for an

Accelerating code migrations with AI
AI

Rich human feedback for text-to-image generation

Recent text-to-image generation (T2I) models, such as Stable Diffusion and Imagen, have made significant progress in generating high-resolution images based on text descriptions. However, many generated images still suffer from issues like artifacts (e.g., distorted objects, text and body parts), misalignment with text descriptions, and low aesthetic quality. For example, the prompt in the image

Accelerating code migrations with AI
AI

A use case for meeting transcripts

To evaluate the MISeD data, we compare with a dataset collected using the traditional WOZ approach. A “user” annotator was given the general context for a meeting and asked questions about it, while an ”agent” annotator used the full transcripts to provide answers and supporting attribution. This WOZ test set contains 70 dialogs (700 query-response

Generating audio for video – Google DeepMind
AI

Generating audio for video – Google DeepMind

Acknowledgements This work was made possible by the contributions of: Ankush Gupta, Nick Pezzotti, Pavel Khrushkov, Tobenna Peter Igwe, Kazuya Kawakami, Mateusz Malinowski, Jacob Kelly, Yan Wu, Xinyu Wang, Abhishek Sharma, Ali Razavi, Eric Lau, Serena Zhang, Brendan Shillingford, Yelin Kim, Eleni Shaw, Signe Nørly, Andeep Toor, Irina Blok, Gregory Shaw, Pen Li, Scott Wisdom,

Accelerating code migrations with AI
AI

Pre-translation vs. direct inference in multilingual LLM applications

Large language models (LLMs) are becoming omnipresent tools for solving a wide range of problems. However, their effectiveness in handling diverse languages has been hampered by inherent limitations in training data, which are often skewed towards English. To address this, pre-translation, where inputs are translated to English before feeding them to the LLM, has become

Scroll to Top