Month: June 2024

Rich human feedback for text-to-image generation

Recent text-to-image generation (T2I) models, such as Stable Diffusion and Imagen, have made significant progress in generating high-resolution images based on text descriptions. However, ...

A use case for meeting transcripts

To evaluate the MISeD data, we compare with a dataset collected using the traditional WOZ approach. A “user” annotator was given the general context ...

Dynamics of magnetization at infinite temperature in a Heisenberg spin chain

Would you be surprised to learn that growing wildfires are described by the same dynamical equations as snow falling and clumping together? Many systems ...

Generating audio for video – Google DeepMind

Acknowledgements This work was made possible by the contributions of: Ankush Gupta, Nick Pezzotti, Pavel Khrushkov, Tobenna Peter Igwe, Kazuya Kawakami, Mateusz Malinowski, Jacob ...

Pre-translation vs. direct inference in multilingual LLM applications

Large language models (LLMs) are becoming omnipresent tools for solving a wide range of problems. However, their effectiveness in handling diverse languages has been ...