Howard University and Google Research to advance AI understanding of African American English
Howard University and Google researchers release dataset of over 600 hours of African American English dialects to improve AI speech recognition.
Howard University and Google Research have partnered to release a groundbreaking dataset aimed at improving how artificial intelligence (AI) systems understand African American English (AAE), a historically underrepresented linguistic form in voice-recognition technology.
Collaboration between Howard University and Google Research
As part of their ongoing initiative, Project Elevate Black Voices, the research team collected over 600 hours of audio from speakers representing AAE dialects across 32 U.S. states.
The project was designed to address the challenges that Black users face when engaging with automatic speech recognition (ASR) systems, tools that frequently misinterpret or fail to understand culturally specific speech patterns.
Bridging the Speech Recognition Divide
AAE, also known as Black English, Black Talk, or Ebonics, is a rich, culturally grounded linguistic tradition rooted in history, identity, and resilience. Yet many ASR tools struggle to process this dialect accurately because they were trained on biased or incomplete data.
As a result, Black users often feel pressure to code-switch or alter their natural speech in order to be understood by digital assistants, transcription tools, and voice-enabled applications. This phenomenon diminishes authenticity and underscores systemic inequities in technology development.
Howard and Google Research: Community-Centered Data Collection
To authentically capture AAE speech, the project team hosted curated community events in multiple cities across the country.
These gatherings featured Black panelists—individuals who live and work in the represented communities—leading open discussions on the intersection of Black culture, technology, AI, and innovation.
Following these dialogues, attendees were invited to contribute their voices through a three-week audio data collection effort designed to reflect real-life language use and lived experiences.
Responsible Data Stewardship and Release
The resulting Howard African American English Dataset 1.0 will initially be released exclusively to researchers and institutions affiliated with historically Black colleges and universities (HBCUs).
This phased rollout ensures the dataset is used in ways that prioritize cultural respect, community empowerment, and ethical research practices.
Howard University will retain full ownership and licensing rights to the dataset, serving as a primary steward to ensure responsible usage and alignment with the interests of Black communities.
Google’s Role in Inclusive AI Development
Google, a collaborator on the project, will also use the dataset to enhance the inclusivity of its ASR systems. The tech company routinely trains its voice-recognition models on a wide spectrum of dialects, languages, and accents, with the aim of creating more equitable AI experiences for all users.
Future access to the dataset by non-HBCU institutions will be reviewed with a focus on researchers whose work aligns with values of inclusivity, equity, and community-driven impact.



