Skip to content
  • Facebook
  • X
  • Linkedin
  • WhatsApp
  • YouTube
  • Associate Journalism
  • About Us
  • Privacy Policy
  • 033-46046046
  • editor@artifex.news
Artifex.News

Artifex.News

Stay Connected. Stay Informed.

  • Breaking News
  • World
  • Nation
  • Sports
  • Business
  • Science
  • Entertainment
  • Lifestyle
  • Toggle search form
  • Rocket Fired From Gaza Fell In Sea Off Tel Aviv: Israel Army
    Rocket Fired From Gaza Fell In Sea Off Tel Aviv: Israel Army World
  • Access Denied World
  • ‘Nothing Without His Backing’: Joe Root Opens On Relationship With Late Graham Thorpe
    ‘Nothing Without His Backing’: Joe Root Opens On Relationship With Late Graham Thorpe Sports
  • New Zealand navy sailors rescued from shipwreck off Samoa
    New Zealand navy sailors rescued from shipwreck off Samoa World
  • Access Denied Sports
  • India vs Bangladesh Live Score Updates, 3rd T20I: Suryakumar Yadav-Led India Eye Series Sweep
    India vs Bangladesh Live Score Updates, 3rd T20I: Suryakumar Yadav-Led India Eye Series Sweep Sports
  • Ind vs Pak: Clinical India Romp To 7-Wicket Win Over Pakistan In Women’s Asia Cup T20
    Ind vs Pak: Clinical India Romp To 7-Wicket Win Over Pakistan In Women’s Asia Cup T20 Sports
  • Gizmo, The Dog Who Went Missing In US In 2015, Found Alive After 9 Years
    Gizmo, The Dog Who Went Missing In US In 2015, Found Alive After 9 Years World
India-made app turns impaired speech into clear speech in near-realtime

India-made app turns impaired speech into clear speech in near-realtime

Posted on March 12, 2026 By admin


A whisper. A few slurred words. For those who suffer from dysarthria, a motor speech disorder, basic communication is a challenge, indelibly affecting both their professional and personal life. But now a new innovation based on artificial intelligence (AI) and developed in India could be life-changing.

Led by associate professor Vineet Gandhi of the International Institute of Information Technology (IIIT), Hyderabad, a team has developed a simple app that can help people talk as an audio translation converts the speaker’s voice almost in real-time. The app can either convert slurred speech into clear, natural-sounding speech or use a camera to analyse lip movements and subtle throat vibrations to generate intelligible speech.

While the current project runs in English, the team’s next aim is to take these technologies to regional languages, including Hindi, Telugu, and Tamil, as many across the country do not have the means to benefit from accessibility-focused AI models. For this work, Mr. Gandhi won the Anusandhan National Research Foundation (ANRF) award in 2026.

Excerpts from an interview:

What inspired you to begin work on this humanitarian AI project?

My research has always been driven by a simple question: what real problem can technology help solve?

While my academic training is primarily in computer vision, about four years ago, I began to see exciting possibilities emerging in speech research and decided to explore the field more deeply. I became increasingly aware of the challenges faced by many individuals who lose their ability to speak due to medical conditions: the impact of this loss extends far beyond communication — it affects independence, identity, and connection.

Recognising this need inspired me to focus my work on accessibility-driven technologies designed to restore or enable speech, with the goal of helping people regain their voice.

Could you describe how the app works for people with speech impairment?

The app is designed to convert impaired or distorted speech into clear, natural-sounding speech with only a few hundred milliseconds of delay. A user simply speaks in their own voice, and the system processes it to produce intelligible speech for the listener.

We are also developing a complementary lip-to-speech capability, where a person can silently move their lips and the system generates the corresponding speech.

A key aspect we are focusing on is personalisation, where users can calibrate and refine the application to their voice by reading few minutes of text on the app.

We aim for these technologies to be integrated into common communication platforms, such as web-based calling applications, making everyday communication easier for people with speech impairments.

You also aim to expand this technology to regional Indian languages. How do you hope to achieve this?

At present, much of the global speech technology ecosystem is predominantly designed for English, and our initial experiments have naturally followed the same trajectory. However, a major goal of our research is to extend these capabilities to regional Indian languages, where accessible speech technologies are equally important.

To achieve this, we plan to collect speech data in Indian languages and develop data-efficient models suited for low-resource scenarios. Our approach includes data augmentation and efficient fine-tuning of pre-trained models.

We have already conducted preliminary experiments in Hindi with promising results, and with support from the Anusandhan National Research Foundation, we aim to further enhance and expand this work to additional Indian languages.

You believe that “accessibility and linguistic diversity” are crucial for AI research in India. Could you elaborate?

Accessibility and linguistic diversity are fundamental considerations for AI research in India. Having spent several years in Europe, I observed that accessibility is far more systematically integrated into public infrastructure and digital services there.

In contrast, India still has significant gaps, even in public spaces such as railway stations, where basic accessibility provisions are often limited. This highlights the broader need to design technologies that consciously include people with disabilities.

At the same time, India’s linguistic diversity presents another important dimension. In many parts of the country, particularly in rural regions, speech remains the most natural and primary mode of interaction. Text-heavy or typing-based interfaces may not always be practical or inclusive in such contexts. Therefore, AI systems designed for India must prioritise speech-based interaction and support multiple regional languages.

Taken together, meaningful accessibility and strong support for linguistic diversity are essential if digital technologies are to be truly inclusive and widely usable across the country.

WHO has said the “future of healthcare is digital”…

The World Health Organization has emphasised that the future of healthcare will be increasingly digital. In a country like India, telemedicine can play a transformative role, particularly when supported by basic diagnostic infrastructure at the local level, which enables more accurate remote consultations.

Another important direction is AI-assisted diagnostics, where machine learning systems analyse medical images, speech, or health records to support early disease detection and prediction.

Practical solutions are already emerging. For example, ‘Shishu Maapan’ developed by Wadhwani AI helps measure newborn weight and size simply from mobile photos and is being adopted by frontline health workers such as ASHA workers.

Digital tools are also enabling assistive healthcare technologies, including speech restoration systems for individuals who have lost their ability to speak, and wearable devices that continuously monitor health parameters and alert doctors to potential anomalies. These developments illustrate how digital innovation can make healthcare more accessible and scalable.

A common criticism of AI-generated speech is that while it’s intelligible, it often fails to capture the unique cadence of the speaker. When restoring a voice to someone with dysarthria, how do you balance the need for clear communication with the need to preserve the user’s individual human essence?

This is an important concern. If recordings of the speaker’s original voice from before the onset of dysarthria are available, modern voice cloning techniques can recreate that voice with as little as 10 seconds of speech. So preserving an individual’s vocal identity is technically feasible today, and there is substantial research demonstrating this capability. Our current app, however, focuses primarily on restoring content intelligibility, ensuring that what the user intends to say is conveyed clearly. For now, the generated speech uses a common voice rather than a personalised one.

That said, text-to-speech systems are becoming increasingly natural, to the point that they are now being integrated into conversational bots replacing many traditional customer service applications. Emotional nuance remains more challenging, as we discussed in our earlier work on empathic speech generation , but progress is rapid.

How does the model differentiate between impaired speech and a noisy background as the user navigates, say, a busy Indian street?

This is indeed a significant challenge in India, where real-world environments can be extremely chaotic. Anyone who has thought about deploying self-driving cars here quickly realizes how unpredictable our roads can be: traffic patterns, honking, pedestrians, and vehicles all interacting in highly dynamic ways. Speech technology faces a similar level of complexity.

In our experiments, we improve robustness using noise augmentation, where we simulate different noisy environments during training so the model learns to handle background sounds. Ultimately, the most effective solution is to collect and train on more real-world data from noisy settings. Even then, some performance degradation is inevitable because separating impaired speech from heavy background noise is fundamentally a difficult problem.

divya.gandhi@thehindu.co.in



Source link

Science

Post navigation

Previous Post: Access Denied
Next Post: Access Denied

Related Posts

  • Space Wrap: From Sriharikota to Leh, preparations for Gaganyaan mission in full swing
    Space Wrap: From Sriharikota to Leh, preparations for Gaganyaan mission in full swing Science
  • Europe is the fastest-warming continent, at nearly twice global average: report
    Europe is the fastest-warming continent, at nearly twice global average: report Science
  • How an altered protein and fussy neurons conspire to cause microcephaly
    How an altered protein and fussy neurons conspire to cause microcephaly Science
  • Science Quiz on explorers who undertook ‘impossible’ expeditions
    Science Quiz on explorers who undertook ‘impossible’ expeditions Science
  • On soaps and detergents: how they are made and manufactured
    On soaps and detergents: how they are made and manufactured Science
  • What is aircraft turbulence and how common is it? | Explainer
    What is aircraft turbulence and how common is it? | Explainer Science

More Related Articles

‘It’s like writing a poem’: prize winner Rajula Srivastava on doing maths ‘It’s like writing a poem’: prize winner Rajula Srivastava on doing maths Science
Tetrapod-shaped nanoparticles could make plastics easier to process, finds IIT study Tetrapod-shaped nanoparticles could make plastics easier to process, finds IIT study Science
How Boeing can bring NASA’s Sunita Williams, Barry Wilmore back to Earth How Boeing can bring NASA’s Sunita Williams, Barry Wilmore back to Earth Science
Semaglutide guidelines based on BMI may exclude at-risk Indians Semaglutide guidelines based on BMI may exclude at-risk Indians Science
What separates classical and quantum chaos? What separates classical and quantum chaos? Science
Why did the U.S. FDA decline to review the new mRNA influenza vaccine? Why did the U.S. FDA decline to review the new mRNA influenza vaccine? Science
SiteLock

Archives

  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • July 2023
  • June 2023
  • May 2023
  • April 2023
  • March 2023
  • February 2023
  • January 2023
  • December 2022
  • November 2022
  • October 2022
  • September 2022
  • August 2022
  • July 2022
  • June 2022
  • May 2022

Categories

  • Business
  • Nation
  • Science
  • Sports
  • World

Recent Posts

  • Assam ships first legal agarwood chips to West Asia
  • How the anti-defection law is going to operate in the AIADMK case?
  • ATS questions 57 in Maharashtra over alleged gangster network links
  • Nicobarese oppose proposal for three wildlife sanctuaries
  • Visakhapatnam Collector calls for inter-departmental synergy to boost investments

Recent Comments

  1. Stevemonge on UP Teacher Who Asked Students To Slap Muslim Classmate
  2. RichardClage on UP Teacher Who Asked Students To Slap Muslim Classmate
  3. StevenLek on UP Teacher Who Asked Students To Slap Muslim Classmate
  4. Leonardren on UP Teacher Who Asked Students To Slap Muslim Classmate
  5. NathanQuins on UP Teacher Who Asked Students To Slap Muslim Classmate
  • Hand-held ‘electric labs’ can rapidly identify pathogens
    Hand-held ‘electric labs’ can rapidly identify pathogens Science
  • Access Denied
    Access Denied Nation
  • Access Denied Business
  • Access Denied
    Access Denied Nation
  • Philippines earthquake toll rises to 72 as search winds down
    Philippines earthquake toll rises to 72 as search winds down World
  • Bomb Threat At Bhopal Airport, Police Launch Probe
    Bomb Threat At Bhopal Airport, Police Launch Probe Nation
  • Hyundai Motor India’s IPO sees muted response from retail investors, issue subscribed 2.37 times
    Hyundai Motor India’s IPO sees muted response from retail investors, issue subscribed 2.37 times Business
  • PM Modi To Launch Rs 76,000 Crore Projects, Address Fintech Fest In Maharashtra Today
    PM Modi To Launch Rs 76,000 Crore Projects, Address Fintech Fest In Maharashtra Today Nation

Editor-in-Chief:
Mohammad Ariff,
MSW, MAJMC, BSW, DTL, CTS, CNM, CCR, CAL, RSL, ASOC.
editor@artifex.news

Associate Editors:
1. Zenellis R. Tuba,
zenelis@artifex.news
2. Haris Daniyel
daniyel@artifex.news

Photograher:
Rohan Das
rohan@artifex.news

Artifex.News offers Online Paid Internships to college students from India and Abroad. Interns will get a PRESS CARD and other online offers.
Send your CV (Subjectline: Paid Internship) to internship@artifex.news

Links:
Associate Journalism
About Us
Privacy Policy

News Links:
Breaking News
World
Nation
Sports
Business
Entertainment
Lifestyle

Registered Office:
72/A, Elliot Road, Kolkata - 700016
Tel: 033-22277777, 033-22172217
Email: office@artifex.news

Editorial Office / News Desk:
No. 13, Mezzanine Floor, Esplanade Metro Rail Station,
12 J. L. Nehru Road, Kolkata - 700069.
(Entry from Gate No. 5)
Tel: 033-46011099, 033-46046046
Email: editor@artifex.news

Copyright © 2023 Artifex.News Newsportal designed by Artifex Infotech.