AlphaProteo generates novel proteins for biology and health research
AI

AlphaProteo generates novel proteins for biology and health research

Science Published 5 September 2024 Authors Protein Design and Wet Lab teams New AI system designs proteins that successfully bind to target molecules, with potential for advancing drug design, disease understanding and more. Every biological process in the body, from cell growth to immune responses, depends on interactions between molecules called proteins. Like a key […]

FermiNet: Quantum physics and chemistry from first principles
AI

FermiNet: Quantum physics and chemistry from first principles

Science Published 22 August 2024 Authors David Pfau and James Spencer Note: This blog was first published on 19 October 2020. Following the publication of our breakthrough work on excited states in Science on 22 August 2024, we’ve made minor updates and added a section below about this new phase of work. Using deep learning

A deep dive with Google AI Edge’s MediaPipe
AI

A deep dive with Google AI Edge’s MediaPipe

Large language models (LLMs) are incredible tools that enable new ways for humans to interact with computers and devices. These models are frequently run on specialized server farms, with requests and responses ferried over an internet connection. Running models fully on-device is an appealing alternative, as this can eliminate server costs, ensure a higher degree

Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
AI

Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS

Vocal characteristics contribute significantly to the construction and perception of individual identity. The loss of one’s voice, caused by physical or neurological conditions, can result in a profound sense of loss, striking at the very heart of one’s identity. Speakers with degenerative neural diseases, such as amyotrophic lateral sclerosis (ALS), Parkinson’s, and multiple sclerosis, may

Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
AI

Enhancing retrieval augmented generation through drafting

Speculative RAG consists of two components: (1) a specialist RAG drafter, and (2) a generalist RAG verifier. First, the base model’s knowledge retriever retrieves related documents from the knowledge base. Then, Speculative RAG offloads computational burden to the specialist RAG drafter, a small LM specialized in answering questions using retrieved documents and not expected to

Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
AI

Transformers in music recommendation

Users have more choices for listening to music than ever before. Popular services boast of massive and varied catalogs. The YouTube Music catalog, for example, has over 100M songs globally. It follows that item recommendations are a core part of these products. Recommender systems make sense of the item catalog and are critical for tuning

Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
AI

Hallucination Attenuated Language and Vision Assistant

We use LLaVA-v1.5, a widely used open-sourced MLLM, as our base model and train it using our contrastive tuning framework (HALVA). We then evaluate its performance on object hallucination mitigation and general visual question answering tasks (VQA) against fine-tuning–based approaches, HA-DPO and EOS. We consider LLaVA-v1.5 as the lower bound and GPT-4V as a strong

We Need Positive Visions for AI Grounded in Wellbeing
AI

We Need Positive Visions for AI Grounded in Wellbeing

Introduction Imagine yourself a decade ago, jumping directly into the present shock of conversing naturally with an encyclopedic AI that crafts images, writes code, and debates philosophy. Won’t this technology almost certainly transform society — and hasn’t AI’s impact on us so far been a mixed-bag? Thus it’s no surprise that so many conversations these

Mapping the misuse of generative AI
AI

Mapping the misuse of generative AI

Responsibility & Safety Published 2 August 2024 Authors Nahema Marchal and Rachel Xu New research analyzes the misuse of multimodal generative AI today, in order to help build safer and more responsible technologies Generative artificial intelligence (AI) models that can produce image, text, audio, video and more are enabling a new era of creativity and

Scroll to Top