Month: June 2024
Rich human feedback for text-to-image generation
Recent text-to-image generation (T2I) models, such as Stable Diffusion and Imagen, have made significant progress in generating high-resolution images based on text descriptions. However, ...
A use case for meeting transcripts
To evaluate the MISeD data, we compare with a dataset collected using the traditional WOZ approach. A “user” annotator was given the general context ...
Dynamics of magnetization at infinite temperature in a Heisenberg spin chain
Would you be surprised to learn that growing wildfires are described by the same dynamical equations as snow falling and clumping together? Many systems ...
Generating audio for video – Google DeepMind
Acknowledgements This work was made possible by the contributions of: Ankush Gupta, Nick Pezzotti, Pavel Khrushkov, Tobenna Peter Igwe, Kazuya Kawakami, Mateusz Malinowski, Jacob ...
Pre-translation vs. direct inference in multilingual LLM applications
Large language models (LLMs) are becoming omnipresent tools for solving a wide range of problems. However, their effectiveness in handling diverse languages has been ...