WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.

Google joins push to localise AI for African languages with speech database


Google has collaborated with African universities and research institutions to launch WAXAL, an open-source speech database designed to support the development of voice-based artificial intelligence for African languages. 

African institutions, including Makerere University in Uganda, the University of Ghana, Digital Umuganda in Rwanda, and the African Institute for Mathematical Sciences (AIMS), participated in the data collection for this initiative. The dataset provides foundational data for 21 Sub-Saharan African languages, including Hausa, Luganda, Yoruba, and Acholi.

WAXAL is designed to support the development of speech recognition systems, voice assistants, text-to-speech tools, and other voice-enabled applications across sectors such as education, healthcare, agriculture, and public services.

“This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages,” said Aisha Walcott-Bryant, Head of Google Research Africa.

WAXAL’s launch comes amid growing efforts across Africa to develop language technologies that reflect local cultures and realities. 

In September 2025, the Nigerian government unveiled N-ATLAS, an open-source language model capable of recognising and transcribing spoken words and generating text in Yoruba, Hausa, Igbo, and Nigerian-accented English.

Similar initiatives are emerging in the private sector, where startups such as South Africa’s Lelapa AI are building tools like Vulavula, which offers speech recognition, translation, and sentiment analysis.

By making this speech dataset openly accessible, WAXAL provides the fuel for a growing wave of homegrown efforts to bring African languages into the digital age.

Although Sub-Saharan Africa is home to more than 2,000 languages, reports suggest that fewer than 5% of them have the resources needed for Natural Language Processing (NLP), the field that enables computers to process and understand human language. This lack of representation in training datasets limits the effectiveness of speech recognition and text-to-speech systems for African users.

Developed over three years with funding and technical support from Google, WAXAL addresses a major gap in global AI development.

WAXAL provides speech data for 21 Sub-Saharan African languages, including Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Swahili, and Yoruba. The dataset contains more than 11,000 hours of speech drawn from nearly two million individual recordings. 

Under the project’s partnership model, contributing institutions retain ownership of the data they collected, while making it openly available to researchers and developers worldwide.

“For AI to have a real impact in Africa, it must speak our languages and understand our contexts,” Joyce Nakatumba-Nabende, Senior Lecturer at Makerere University’s School of Computing and Information Technology, said. 

“The WAXAL dataset gives our researchers the high-quality data they need to build speech technologies that reflect our unique communities.”
