Skip to content
  • Facebook
  • X
  • Linkedin
  • WhatsApp
  • Associate Journalism
  • About Us
  • Privacy Policy
  • 033-46046046
  • editor@artifex.news
Artifex.News

Artifex.News

Stay Connected. Stay Informed.

  • Breaking News
  • World
  • Nation
  • Sports
  • Business
  • Science
  • Entertainment
  • Lifestyle
  • Toggle search form
  • Israel PM Orders Delay Of Today’s Release Of Palestinians: Office
    Israel PM Orders Delay Of Today’s Release Of Palestinians: Office World
  • Access Denied Business
  • Access Denied World
  • Teen Falls Into Rajasthan Well, Operation To Retrieve Body On For Over 50 Hours
    Teen Falls Into Rajasthan Well, Operation To Retrieve Body On For Over 50 Hours Nation
  • Access Denied
    Access Denied Nation
  • Smart withdrawals for a lasting corpus
    Smart withdrawals for a lasting corpus Business
  • Neeraj Chopra, Avinash Sable To Compete In Brussels Diamond League 2024
    Neeraj Chopra, Avinash Sable To Compete In Brussels Diamond League 2024 Sports
  • Access Denied World
India-made app turns impaired speech into clear speech in near-realtime

India-made app turns impaired speech into clear speech in near-realtime

Posted on March 12, 2026 By admin


A whisper. A few slurred words. For those who suffer from dysarthria, a motor speech disorder, basic communication is a challenge, indelibly affecting both their professional and personal life. But now a new innovation based on artificial intelligence (AI) and developed in India could be life-changing.

Led by associate professor Vineet Gandhi of the International Institute of Information Technology (IIIT), Hyderabad, a team has developed a simple app that can help people talk as an audio translation converts the speaker’s voice almost in real-time. The app can either convert slurred speech into clear, natural-sounding speech or use a camera to analyse lip movements and subtle throat vibrations to generate intelligible speech.

While the current project runs in English, the team’s next aim is to take these technologies to regional languages, including Hindi, Telugu, and Tamil, as many across the country do not have the means to benefit from accessibility-focused AI models. For this work, Mr. Gandhi won the Anusandhan National Research Foundation (ANRF) award in 2026.

Excerpts from an interview:

What inspired you to begin work on this humanitarian AI project?

My research has always been driven by a simple question: what real problem can technology help solve?

While my academic training is primarily in computer vision, about four years ago, I began to see exciting possibilities emerging in speech research and decided to explore the field more deeply. I became increasingly aware of the challenges faced by many individuals who lose their ability to speak due to medical conditions: the impact of this loss extends far beyond communication — it affects independence, identity, and connection.

Recognising this need inspired me to focus my work on accessibility-driven technologies designed to restore or enable speech, with the goal of helping people regain their voice.

Could you describe how the app works for people with speech impairment?

The app is designed to convert impaired or distorted speech into clear, natural-sounding speech with only a few hundred milliseconds of delay. A user simply speaks in their own voice, and the system processes it to produce intelligible speech for the listener.

We are also developing a complementary lip-to-speech capability, where a person can silently move their lips and the system generates the corresponding speech.

A key aspect we are focusing on is personalisation, where users can calibrate and refine the application to their voice by reading few minutes of text on the app.

We aim for these technologies to be integrated into common communication platforms, such as web-based calling applications, making everyday communication easier for people with speech impairments.

You also aim to expand this technology to regional Indian languages. How do you hope to achieve this?

At present, much of the global speech technology ecosystem is predominantly designed for English, and our initial experiments have naturally followed the same trajectory. However, a major goal of our research is to extend these capabilities to regional Indian languages, where accessible speech technologies are equally important.

To achieve this, we plan to collect speech data in Indian languages and develop data-efficient models suited for low-resource scenarios. Our approach includes data augmentation and efficient fine-tuning of pre-trained models.

We have already conducted preliminary experiments in Hindi with promising results, and with support from the Anusandhan National Research Foundation, we aim to further enhance and expand this work to additional Indian languages.

You believe that “accessibility and linguistic diversity” are crucial for AI research in India. Could you elaborate?

Accessibility and linguistic diversity are fundamental considerations for AI research in India. Having spent several years in Europe, I observed that accessibility is far more systematically integrated into public infrastructure and digital services there.

In contrast, India still has significant gaps, even in public spaces such as railway stations, where basic accessibility provisions are often limited. This highlights the broader need to design technologies that consciously include people with disabilities.

At the same time, India’s linguistic diversity presents another important dimension. In many parts of the country, particularly in rural regions, speech remains the most natural and primary mode of interaction. Text-heavy or typing-based interfaces may not always be practical or inclusive in such contexts. Therefore, AI systems designed for India must prioritise speech-based interaction and support multiple regional languages.

Taken together, meaningful accessibility and strong support for linguistic diversity are essential if digital technologies are to be truly inclusive and widely usable across the country.

WHO has said the “future of healthcare is digital”…

The World Health Organization has emphasised that the future of healthcare will be increasingly digital. In a country like India, telemedicine can play a transformative role, particularly when supported by basic diagnostic infrastructure at the local level, which enables more accurate remote consultations.

Another important direction is AI-assisted diagnostics, where machine learning systems analyse medical images, speech, or health records to support early disease detection and prediction.

Practical solutions are already emerging. For example, ‘Shishu Maapan’ developed by Wadhwani AI helps measure newborn weight and size simply from mobile photos and is being adopted by frontline health workers such as ASHA workers.

Digital tools are also enabling assistive healthcare technologies, including speech restoration systems for individuals who have lost their ability to speak, and wearable devices that continuously monitor health parameters and alert doctors to potential anomalies. These developments illustrate how digital innovation can make healthcare more accessible and scalable.

A common criticism of AI-generated speech is that while it’s intelligible, it often fails to capture the unique cadence of the speaker. When restoring a voice to someone with dysarthria, how do you balance the need for clear communication with the need to preserve the user’s individual human essence?

This is an important concern. If recordings of the speaker’s original voice from before the onset of dysarthria are available, modern voice cloning techniques can recreate that voice with as little as 10 seconds of speech. So preserving an individual’s vocal identity is technically feasible today, and there is substantial research demonstrating this capability. Our current app, however, focuses primarily on restoring content intelligibility, ensuring that what the user intends to say is conveyed clearly. For now, the generated speech uses a common voice rather than a personalised one.

That said, text-to-speech systems are becoming increasingly natural, to the point that they are now being integrated into conversational bots replacing many traditional customer service applications. Emotional nuance remains more challenging, as we discussed in our earlier work on empathic speech generation , but progress is rapid.

How does the model differentiate between impaired speech and a noisy background as the user navigates, say, a busy Indian street?

This is indeed a significant challenge in India, where real-world environments can be extremely chaotic. Anyone who has thought about deploying self-driving cars here quickly realizes how unpredictable our roads can be: traffic patterns, honking, pedestrians, and vehicles all interacting in highly dynamic ways. Speech technology faces a similar level of complexity.

In our experiments, we improve robustness using noise augmentation, where we simulate different noisy environments during training so the model learns to handle background sounds. Ultimately, the most effective solution is to collect and train on more real-world data from noisy settings. Even then, some performance degradation is inevitable because separating impaired speech from heavy background noise is fundamentally a difficult problem.

divya.gandhi@thehindu.co.in



Source link

Science

Post navigation

Previous Post: Access Denied
Next Post: Access Denied

Related Posts

  • Massive Greenland landslide sent seismic waves around earth for nine days
    Massive Greenland landslide sent seismic waves around earth for nine days Science
  • Researchers identify key genetic factors causing oral cancer early among Indian tobacco chewers
    Researchers identify key genetic factors causing oral cancer early among Indian tobacco chewers Science
  • The discrepancy between satellite data of farm fires and air pollution | Explained
    The discrepancy between satellite data of farm fires and air pollution | Explained Science
  • What is the technology behind manufacturing a semiconductor chip? | Explained
    What is the technology behind manufacturing a semiconductor chip? | Explained Science
  • Are many Labradors hard-wired for obesity?
    Are many Labradors hard-wired for obesity? Science
  • Printing towards progress: How the printing press changed society
    Printing towards progress: How the printing press changed society Science

More Related Articles

Low-cost flexible biosensor for early heart attack detection developed by BITS–RMIT Univ Low-cost flexible biosensor for early heart attack detection developed by BITS–RMIT Univ Science
Countdown begins for launch of ISRO’s heaviest communication satellite CMS-03 for military on November 2 Countdown begins for launch of ISRO’s heaviest communication satellite CMS-03 for military on November 2 Science
Copper becomes unexpectedly hard under extreme strain rate Copper becomes unexpectedly hard under extreme strain rate Science
Union Budget 2026: push for technology, planetary science in space programme Union Budget 2026: push for technology, planetary science in space programme Science
TB: gene editing could add new power to a 100-year-old vaccine TB: gene editing could add new power to a 100-year-old vaccine Science
Assam scientists probe Sun to rethink gravity Assam scientists probe Sun to rethink gravity Science
SiteLock

Archives

  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • July 2023
  • June 2023
  • May 2023
  • April 2023
  • March 2023
  • February 2023
  • January 2023
  • December 2022
  • November 2022
  • October 2022
  • September 2022
  • August 2022
  • July 2022
  • June 2022
  • May 2022

Categories

  • Business
  • Nation
  • Science
  • Sports
  • World

Recent Posts

  • Access Denied
  • West Asia conflict: Dubai building hit by debris from attack: media office
  • Access Denied
  • Access Denied
  • Access Denied

Recent Comments

  1. KeithCaf on UP Teacher Who Asked Students To Slap Muslim Classmate
  2. DuaneNax on UP Teacher Who Asked Students To Slap Muslim Classmate
  3. JosephNek on UP Teacher Who Asked Students To Slap Muslim Classmate
  4. Georgenub on UP Teacher Who Asked Students To Slap Muslim Classmate
  5. KeithCaf on UP Teacher Who Asked Students To Slap Muslim Classmate
  • Adani infuses ₹8,339 crore more in Ambuja Cements, raises stake to 70.3%
    Adani infuses ₹8,339 crore more in Ambuja Cements, raises stake to 70.3% Business
  • Sensex hits new all-time high in early trade
    Sensex hits new all-time high in early trade Business
  • North Korea calls new sanctions monitoring team ‘unlawful’
    North Korea calls new sanctions monitoring team ‘unlawful’ World
  • India Bowler Deepti Sharma Causes Sensational Surprise Run Out vs New Zealand In Women’s 1st ODI. Watch
    India Bowler Deepti Sharma Causes Sensational Surprise Run Out vs New Zealand In Women’s 1st ODI. Watch Sports
  • Saudi Arabia King Salman Has A Lung Infection, Will Be Treated With Antibiotics
    Saudi Arabia King Salman Has A Lung Infection, Will Be Treated With Antibiotics World
  • China says took ‘control measures’ against Philippine ships near disputed reef
    China says took ‘control measures’ against Philippine ships near disputed reef World
  • Ex-BBC Presenter Huw Edwards Pleads Guilty To Indecent Child Picture Crimes
    Ex-BBC Presenter Huw Edwards Pleads Guilty To Indecent Child Picture Crimes World
  • Access Denied
    Access Denied Nation

Editor-in-Chief:
Mohammad Ariff,
MSW, MAJMC, BSW, DTL, CTS, CNM, CCR, CAL, RSL, ASOC.
editor@artifex.news

Associate Editors:
1. Zenellis R. Tuba,
zenelis@artifex.news
2. Haris Daniyel
daniyel@artifex.news

Photograher:
Rohan Das
rohan@artifex.news

Artifex.News offers Online Paid Internships to college students from India and Abroad. Interns will get a PRESS CARD and other online offers.
Send your CV (Subjectline: Paid Internship) to internship@artifex.news

Links:
Associate Journalism
About Us
Privacy Policy

News Links:
Breaking News
World
Nation
Sports
Business
Entertainment
Lifestyle

Registered Office:
72/A, Elliot Road, Kolkata - 700016
Tel: 033-22277777, 033-22172217
Email: office@artifex.news

Editorial Office / News Desk:
No. 13, Mezzanine Floor, Esplanade Metro Rail Station,
12 J. L. Nehru Road, Kolkata - 700069.
(Entry from Gate No. 5)
Tel: 033-46011099, 033-46046046
Email: editor@artifex.news

Copyright © 2023 Artifex.News Newsportal designed by Artifex Infotech.