Language Models Reinforce Dialect Discrimination – The Berkeley Artificial Intelligence Research Blog
AI

Language Models Reinforce Dialect Discrimination – The Berkeley Artificial Intelligence Research Blog

Sample language model responses to different varieties of English and native speaker reactions. ChatGPT does amazingly well at communicating with people in English. But whose English? Only 15% of ChatGPT users are from the US, where Standard American English is the default. But the model is also commonly used in countries and communities where people […]

Open Buildings 2.5D Temporal dataset tracks building changes across the Global South
AI

Open Buildings 2.5D Temporal dataset tracks building changes across the Global South

By the year 2050 the world’s urban population is expected to increase by 2.5 billion, with nearly 90% of that growth occurring in cities across Asia and Africa. To effectively plan for this population growth, respond to crises, and understand urbanization’s impact, governments, humanitarian organizations, and researchers need data about buildings and infrastructure, including how

Open Buildings 2.5D Temporal dataset tracks building changes across the Global South
AI

Recognizing whale vocalizations with AI

In order to protect animals that live in remote environments, researchers must be able to find them to understand the movements of their populations over time. As long-term passive acoustic monitoring capabilities have grown more technologically sophisticated, automatic animal species identification tools built on large datasets from these recorded soundscapes have become an increasingly vital

Empowering YouTube creators with generative AI
AI

Empowering YouTube creators with generative AI

Models Published 18 September 2024 Authors Eli Collins New video generation technology in YouTube Shorts will help millions of people realize their creative vision Artificial intelligence (AI) technologies for generating creative content are improving rapidly, but seamless ways of using them still aren’t widely available. We’re changing that, and making these incredible technologies more easily

Open Buildings 2.5D Temporal dataset tracks building changes across the Global South
AI

The long orbit to benchmarking long video understanding

Pipeline Long video datasets are challenging to build because of the significant manual effort required to select, watch, understand and annotate long videos with free-form natural language. Answering challenging questions about longer videos is often a multimodal task that may involve listening to the audio track in addition to watching the video. It may also

A Geometric Model of Cosmological Redshift via Angular Geometry in a Static Universe • AI Blog
AI

A Geometric Model of Cosmological Redshift via Angular Geometry in a Static Universe • AI Blog

Abstract We propose a novel geometric model to explain the observed redshift of light from distant celestial objects without invoking cosmic expansion or gravitational redshift. By examining the angular geometry between the light source, the observer, and a fixed reference point “above” the observer, we demonstrate how spatial geometry alone can lead to an apparent

Our latest advances in robot dexterity
AI

Our latest advances in robot dexterity

Research Published 12 September 2024 Authors Robotics team Two new AI systems, ALOHA Unleashed and DemoStart, help robots learn to perform complex tasks that require dexterous movement People perform many tasks on a daily basis, like tying shoelaces or tightening a screw. But for robots, learning these highly-dexterous tasks is incredibly difficult to get right.

Open Buildings 2.5D Temporal dataset tracks building changes across the Global South
AI

Grounding AI in reality with a little help from Data Commons

Large Language Models (LLMs) have revolutionized how we interact with information, but grounding their responses in verifiable facts remains a fundamental challenge. This is compounded by the fact that real-world knowledge is often scattered across numerous sources, each with its own data formats, schemas, and APIs, making it difficult to access and integrate. Lack of

What’s Missing From LLM Chatbots: A Sense of Purpose
AI

What’s Missing From LLM Chatbots: A Sense of Purpose

LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (e.g. sonnet 3.5, gpt-4o). However, as these measures get more and more saturated, is user experience increasing in proportion to these scores? If we envision a future of human-AI collaboration rather than AI replacing humans,

Real Faces or AI Creations? • AI Blog
AI

Real Faces or AI Creations? • AI Blog

AI-generated images are not just hyper-realistic; they can also be crafted to embody an infinite variety of features, expressions, and aesthetics. A recent example is a post by Moritz Stellmacher on X that showcases a very intriguing human portrait, leaving viewers puzzled about whether it’s a photograph of a real person or an AI creation. This

Scroll to Top