A PAN card is a key document required for financial and banking transactions, while Aadhaar is a 12-digit unique identification number issued by the Unique
Business

A PAN card is a key document required for financial and banking transactions, while Aadhaar is a 12-digit unique identification number issued by the Unique Identification Authority of India. New Delhi: The government has announced changes to the Permanent Account Number (PAN) application process

<h4 class=
Sports

Who will challenge D Gukesh? Vishy Anand names 3 contenders; Magnus Carlsen backs Caruana, Nakamura for Candidates 2026 Published on: Mar 18, 2026 5:34 PM IST Written by Neelav Chakravarti Share via Copy link Vishy Anand and Magnus Carlsen named their top contenders for the Candidates title

<strong>Stock Market Live Updates: </strong>Indian equity markets ended higher on Tuesday, with broader markets outperforming frontline indices as
Business

Stock Market Live Updates: Indian equity markets ended higher on Tuesday, with broader markets outperforming frontline indices as investors showed renewed interest in midcap and technology stocks. The NSE Nifty jumped 0.83 per cent or 196.65 points to close at 23,777.80

The automobile sector in the country may face near-term production disruptions due to constraints in industrial gas supply arising from ongoing disruptions in
Home

The automobile sector in the country may face near-term production disruptions due to constraints in industrial gas supply arising from ongoing disruptions in the energy market amid the West Asia conflict, according to a report. New Delhi: The automobile sector in the country may face near-term

A father in India has shut down the internet with a smooth Michael Jackson-style moonwalk performed in simple rubber slippers. Watch the viral video that has
Latest News

A father in India has shut down the internet with a smooth Michael Jackson-style moonwalk performed in simple rubber slippers. Watch the viral video that has racked up 15 million views and left Gen Z stunned by "Uncleji's" 80s swag. Move over

Tulu, Bodo, Kashmiri: Startups are teaching AI models Indian dialects

Posted By: Preeti Dabar Posted On: Dec 26, 2025Share Article
Tulu, Bodo, Kashmiri
Gaurav Sharma for Rest of World

This article was originally published in Rest of World, which covers technology's impact outside the West.

When Amrith Shenava began experimenting with large language models shortly after the launch of ChatGPT, he quickly realized that Tulu – the language he and some 2 million people spoke in the southern Indian state of Karnataka – had virtually no digital data set. He decided to build one.

Shenava, who has a degree in computer science from Kent State University in Ohio, had earlier launched a translation app, and a language learning app for Tulu. To build the data set for the LLM, he had to collect voice and text data from native speakers including teachers, professionals, homemakers, and members of the Tulu diaspora.

“Most AI systems are built in the US. They don't understand Indian languages or contexts,” Shenava, the 27-year-old founder of TuluAI, told Rest of World. “We need our own models that represent us.”

India has more than 1,600 languages and dialects, but most artificial intelligence systems cater to those that are widely spoken. OpenAI's ChatGPT supports more than a dozen Indian languages including Hindi, Tamil, and Kannada, the dominant language in Karnataka. Google's Gemini can chat with users in nine Indian languages.

Spurred by their success, and keen to be a part of the rapid global transition to AI, a handful of Indian startups are building AI tools for so-called low-resource languages such as Tulu, Bodo, and Kashmiri, which have a limited online presence and few written records. The startups are having to build data sets nearly from scratch.

TuluAI holds storytelling sessions and workshops in rural areas, in which local residents – particularly women and elders – narrate their stories, or are asked to read texts and simulate everyday conversations. Participants are taught to record and label the data. Each workshop of one to two days produces over 150 hours of labeled voice and text data, Shenava said.

The startup also collects WhatsApp voice notes from anyone who wishes to send one, with annotators checking transcripts and labels for accuracy.

“Major translation tools miss the context that gives meaning to words. The only way to fix that is to use authentic, human-recorded data that reflects real-life language use,” Shenava said. “The goal is for the model to talk like a native speaker. We want it to understand humor, idioms, and cultural context. So we're building slowly, verifying every sample.”

Across the country, in the northeastern state of Assam, Kabyanil Talukdar, the 25-year-old co-founder of Aakhor AI, follows a similar process to build data sets in Bodo and Assamese. Talukdar's team conducts community workshops and classes, and holds voice-note drives via WhatsApp groups, with simple daily prompts like “Talk about your morning tea.”

Each submission is tagged with metadata such as dialect, region, and speaker demographics to ensure diversity. The clips, 20-60 seconds long, are processed, transcribed, and anonymised. Each three-month campaign produces over 5,000 voice samples, Talukdar told Rest of World.

“When people see that their voices help preserve their language, they feel ownership,” he said. “They are driven by the shared goal of creating AI that understands and speaks their native language.”

Big tech LLMs such as GPT and Meta's Llama are trained on a wide range of data, including in languages other than English. Yet their performance in low-resource languages can be unpredictable, particularly in dialects and local idioms. Countries keen to support their languages and become self-sufficient in AI are building their own multilingual LLMs, which can support translation, speech recognition, and tools for customer service, education, health care, and other applications.

These include the Chile-led LatamGPT project, Southeast Asia's Sealion, and efforts by Masakhane – a grassroots organisation that aims to build AI data sets and tools in African languages. India's BharatGPT and Sarvam support many major Indian languages, and the government is building open-source models for several languages under the Bhashini project.

It is not easy.

Tulu's ancient script lacks a Unicode standard that would allow computational processing of text. Shenava's team is digitising literature written in the script, and training the model to identify patterns. While more complicated, the process helps capture the cultural nuance that is often lost in translation, he said.

The team avoids AI-generated or machine-translated data, which is often riddled with grammatical errors, made-up words and phrases, and other inaccuracies, he said.

“Even open-source models produce text that doesn't make sense. That's why we decided to build it from scratch,” Shenava said. This also ensures ethical data use, he said. “We don't use any personal data without explicit permission.”

Aakhor AI's models are voice-first, targeting areas with low literacy and weak internet access. The company recruits speakers from underrepresented areas to prevent dominant dialects from overshadowing smaller ones, and ensure “balanced sampling,” Talukdar said.

For Saqlain Yousef, it was the fear that Kashmiri – a language spoken by about 7 million people in India – might disappear that drove him to build the KashmiriGPT app using OpenAI's application programming interface.

The platform accepts input in English as well as Kashmiri written in the Roman script, and generates responses in the Kashmiri script, Roman Kashmiri script, and English.

“Our language is vulnerable and at risk of disappearing. So I took matters into my own hands,” the 25-year-old told Rest of World. “This will help preserve Kashmiri in the AI age.”

Yousef is right to be concerned, C Vanlalawmpuia, an independent researcher in language and AI, told Rest of World.

“These languages are already marginalised, and without proper digital representation, they risk disappearing from online spaces entirely,” he said.

AI makes it easier to preserve a language through translation tools, transcription systems, and data sets that can make a language more visible and accessible, according to Vanlalawmpuia. But the lack of digital resources and funding are a challenge, and community-led efforts are one way to sustain the platforms, he said.

AI platforms from deep-pocketed big tech firms including OpenAI, Google, and Perplexity are also targeting India. The country is already the biggest market for ChatGPT outside the US, and OpenAI this month offered its ChatGPT Go service free for a year to users in India.

Aakhor AI is aware of its challenge. “We don't compete with GPT on scale,” Talukdar said. “We compete on relevance.”

By sourcing data from the ground, the community is involved in preserving linguistic diversity and advancing linguistic inclusion, Shenava said.

“Anyone can contribute. That's how language preservation will happen,” he said. “If AI can help keep it alive, that's worth all the effort.”

For Rita D'Souza, a 32-year-old primary schoolteacher in coastal Karnataka, TuluAI is already making a difference, helping students improve their pronunciation and spelling, she told Rest of World.

Tauseef Ahmad is a freelance journalist based in Delhi.

Sajid Raina is a freelance journalist based in Delhi.

This article was originally published in Rest of World, which covers technology's impact outside the West.

Comment on Post

Leave a comment

If you have a News Orbit 360 user account, your address will be used to display your profile picture.


Illinois Lieutenant Governor Juliana Stratton is the predicted winner of the Democratic primary for US Senate, edging out her opponents after a late surge of
World
Juliana Stratton wins Illinois Democratic primary for US Senate

Illinois Lieutenant Governor Juliana Stratton is the predicted winner of the Democratic primary for US Senate, edging out her opponents after a late surge of support in a highly competitive, expensive race. Stratton was leading her main opponent, congressman Raja Krishnamoorthi

1 days ago

Israel's Defense Minister Katz has announced a significant development. The Israeli military has killed Iranian Intelligence Minister Esmail Khatib
World
Katz says Israel has killed Iranian Intelligence Minister Esmail Khatib

Israel's Defense Minister Katz has announced a significant development. The Israeli military has killed Iranian Intelligence Minister Esmail Khatib. This follows yesterday's reported killings of other top Iranian security officials. More surprises are anticipated today across all fronts

1 days ago

<strong>Happy Chaitra Navratri 2026 wishes: </strong>Celebrate the divine energy of Goddess Durga with heartfelt words and blessings this Chaitra
Life Style
Happy Chaitra Navratri 2026 wishes

Happy Chaitra Navratri 2026 wishes: Celebrate the divine energy of Goddess Durga with heartfelt words and blessings this Chaitra Navratri. Share these beautiful wishes, messages, and quotes to spread positivity, strength, and devotion.Happy Chaitra Navratri 2026 wishes: Celebrate the divine energy

1 days ago

Senator Markwayne Mullin, who has been tapped by US President Donald Trump to lead the Department of Homeland Security, will appear on Wednesday in front of a
World
Trump's homeland security pick Mullin faces senators' questions

Senator Markwayne Mullin, who has been tapped by US President Donald Trump to lead the Department of Homeland Security, will appear on Wednesday in front of a congressional committee. The Republican lawmaker from Oklahoma will testify before the Senate Homeland Security and Governmental Affairs

1 days ago

Given the current fuel shortage in the country and the increasing pressure on imports, the Petroleum Ministry has significantly tightened the rules for
Latest News
Central govt’s new rule for LPG cylinder

Given the current fuel shortage in the country and the increasing pressure on imports, the Petroleum Ministry has significantly tightened the rules for domestic gas use. Now, not only possessing a cylinder, but even having a “double” connection at home will be a crime

1 days ago

Eight members of a family were killed when a fire broke out in a three-storey house after an explosion at an Electric Vehicle (EV) charging point outside the
Politics
Eight killed in Indore house fire after EV charging point blast

Eight members of a family were killed when a fire broke out in a three-storey house after an explosion at an Electric Vehicle (EV) charging point outside the building in Indore early on Wednesday (March 18, 2026), officials said. The deceased included two minor children and three women, they said

1 days ago

Life in Kalghatgi town and nearby villages in Dharwad district was thrown into disarray on Tuesday (March 18) evening after a sudden and intense hailstorm
Latest News
Karnataka Villages Turn ‘Mini Manali’ After Rare Hailstorm Blankets Region In White

Life in Kalghatgi town and nearby villages in Dharwad district was thrown into disarray on Tuesday (March 18) evening after a sudden and intense hailstorm lashed the region. The unexpected weather event left residents panicked, forcing many to seek shelter immediately

1 days ago

Hospitals are meant to heal. Yet, for millions of patients globally, they can also become sites of new illness often preventable, often overlooked
Life Style
When Hospitals Make Patients Sicker

Hospitals are meant to heal. Yet, for millions of patients globally, they can also become sites of new illness often preventable, often overlooked. Hospital-Acquired Infections (HAIs) represent one of the most persistent paradoxes in modern healthcare. Even as medical technology advances

1 days ago

Dilip Ghosh, a prominent figure in the BJP, asserts that the party has successfully put Trinamool Congress leader Mamata Banerjee on the defensive within her
Politics
BJP ‘trapped’ Mamata in Bhabanipur with Suvendu pick

Dilip Ghosh, a prominent figure in the BJP, asserts that the party has successfully put Trinamool Congress leader Mamata Banerjee on the defensive within her Bhabanipur stronghold. He pointed out that Suvendu Adhikari's candidacy is a tactical choice aimed at strengthening BJP's foothold

1 days ago

The Allahabad High Court on Monday said that the state must ensure that security is provided to persons facing threats for congregating to hold prayers at
World
State must provide security to those threatened for holding prayers on private property

The Allahabad High Court on Monday said that the state must ensure that security is provided to persons facing threats for congregating to hold prayers at private properties in Uttar Pradesh, Bar and Bench reported. A bench of Justices Atul Sreedharan and Siddharth Nandan added that Article 25 of

1 days ago

BSEB Class 10th Result 2026: The Bihar School Examination Board will announce the 10th matric exam results on March 20, 2026, in Patna on March 20, 2026
World
Bihar Board 10th Result 2026 to be announced on March 20 at biharboardonline

BSEB Class 10th Result 2026: The Bihar School Examination Board will announce the 10th matric exam results on March 20, 2026, in Patna on March 20, 2026. Students are advised to keep themselves updated with their roll number and other necessary details

1 days ago

The fire and fury that is consuming the MAGA (Make America Great Again) fraternity that had rallied behind Donald J. Trump’s anti-war rhetoric for a decade
World
As fire and fury hit MAGA tent

The fire and fury that is consuming the MAGA (Make America Great Again) fraternity that had rallied behind Donald J. Trump’s anti-war rhetoric for a decade touched the administration on Tuesday (March 17, 2026) when Joe Kent, a long-term acolyte

1 days ago

<h4 class=
Sports
Kylian Mbappe vs Balance

Kylian Mbappe vs Balance: Real Madrid’s rhythm without Frenchman turns heads as Vinicius back to his best Updated on: Mar 18, 2026 5:58 PM IST By Aditya Maheshwari Share via Copy link Real Madrid find rhythm without Kylian Mbappe. (Reuters) Beating Manchester City so convincingly is never easy

1 days ago

In television comedy shows such as The Kapil Sharma Show and Comedy Nights with Kapil, Kiku Sharda has portrayed many popular female characters
Entertainment
Wheel of Fortune show

In television comedy shows such as The Kapil Sharma Show and Comedy Nights with Kapil, Kiku Sharda has portrayed many popular female characters, including Palak, Bumper, and Santosh. Mumbai: Actor-comedian Kiku Sharda is widely loved for slipping into multiple female avatars on-screen

1 days ago

A woman stands on a rooftop listening to the sounds of the city below. There is only the dull hum of traffic tonight. But she knows how easily that can change
Latest News
Total repression and air strikes bring unrelenting dread for Iranians

A woman stands on a rooftop listening to the sounds of the city below. There is only the dull hum of traffic tonight. But she knows how easily that can change. It is usually the dogs who notice the sound first and begin to bark furiously. The noise of aircraft

1 days ago

In fiscal year 2026, Dell Technologies reduced its workforce by approximately 11,000 employees, representing about 10% of its staff
Business
Dell Layoffs 11

In fiscal year 2026, Dell Technologies reduced its workforce by approximately 11,000 employees, representing about 10% of its staff, as detailed in its 10-K filing. The employee count decreased from 108,000 in January 2025 to nearly 97,000 by January 2026.In fiscal year 2026

1 days ago

New Delhi, Mar 18 (PTI) Uranium Corporation of India Ltd (UCIL) is set to establish a uranium mining and processing plant with a capacity of 2
Latest News
2 new uranium mining projects at clearance stage

New Delhi, Mar 18 (PTI) Uranium Corporation of India Ltd (UCIL) is set to establish a uranium mining and processing plant with a capacity of 2,500 tonnes per day (TPD) at Rohil in Rajasthan’s Sikar district and Jajwal, Chhattisgarh, said Union Minister Jitendra Singh on Wednesday.New Delhi

1 days ago

India on Tuesday denied holding negotiations with Iran about releasing three vessels New Delhi had seized in February in return for the safe passage of
World
MEA rejects report saying Iran wants return of three ships seized by India for Hormuz passage

India on Tuesday denied holding negotiations with Iran about releasing three vessels New Delhi had seized in February in return for the safe passage of ​Indian ships through the Strait of Hormuz. The denial by the Ministry of External Affairs came during an inter-ministerial press briefing on the

1 days ago

If you are an EPFO member or are associated with it, this news is for you. This is because everything—from pensions to claims and account transfers—is now
Latest News
Major changes to the pension system

If you are an EPFO member or are associated with it, this news is for you. This is because everything—from pensions to claims and account transfers—is now becoming faster and easier than ever before. In the Lok Sabha, Shobha Karandlaje, the Minister of State for Labour and Employment

1 days ago

According to the NXT Foundation’s India Progress Report 2025-26, India has surpassed Japan with a nominal GDP of $4.8 trillion. It maintains the fastest
Business
India stuns the world

According to the NXT Foundation’s India Progress Report 2025-26, India has surpassed Japan with a nominal GDP of $4.8 trillion. It maintains the fastest growth rate of 8.2% in the world. New Delhi: India has overtaken Japan to become the world’s fourth-largest economy

1 days ago

Ugadi Pachadi, the traditional festive offering prepared during Ugadi, is more than just a dish; it is a symbolic representation of life itself
Life Style
From Sweet To Bitter

Ugadi Pachadi, the traditional festive offering prepared during Ugadi, is more than just a dish; it is a symbolic representation of life itself. Known as a blend of Shadruchulu (six tastes), this special preparation holds deep cultural, spiritual, and health significance.Ugadi Pachadi

1 days ago

Nora Fatehi has addressed the controversy around her and Sanjay Dutt’s viral song Sarke Chunar, saying she was unaware of the Hindi lyrics and never approved
Entertainment
Nora Fatehi Says She Flagged Sarke Chunar’s Lyrics To Director

Nora Fatehi has addressed the controversy around her and Sanjay Dutt’s viral song Sarke Chunar, saying she was unaware of the Hindi lyrics and never approved the final version. The actor’s reaction comes after the track drew backlash for allegedly inappropriate lyrics.Explaining her side

1 days ago

<h4 class=
Latest News
Dad's effortless moonwalk in chappals to Michael Jackson song gets 17M views

Watch: Dad's moonwalk in chappals gets 17 million views, internet says 'MJ is clapping happily in his grave'A father's surprise moonwalk in chappals during his daughter's dance video has gone viral, crossing 17 million views in a day. Published on: Mar 18

1 days ago

<h4 class=
Latest News
Mumbai woman paying ₹1

Mumbai woman paying ₹1.2 lakh rent for Andheri West flat says she's a 'stay at home daughter', internet reactsA self-described “stay-at-home daughter” says she pays ₹1.2 lakh rent. Her viral video has sparked a discussion online. Published on: Mar 18

1 days ago

<h4 class=
Life Style
Fitness coach recommends walking after eating heavy meals

Fitness coach recommends walking after heavy meals, shares key benefits for digestion and overall healthEnsure you walk after every meal, that way your blood sugar stays stable, along with improved weight management. Published on: Mar 18, 2026 5:58 PM IST By Adrija Dey Share via Copy link After

1 days ago

In the video, the father can be seen holding his old phone, its screen damaged, with a rubber band around it, and worn out from years of use
World
Father’s broken phone reveals his struggles

In the video, the father can be seen holding his old phone, its screen damaged, with a rubber band around it, and worn out from years of use. Despite its condition, he appears to have continued using it. The son surprises him with a new smartphone

1 days ago

Uttarakhand resident Deepak Kumar has moved the High Court challenging the case against him and requesting a departmental inquiry against police officers who
World
Police failed to act against Uttarakhand mob despite evidence

Uttarakhand resident Deepak Kumar has moved the High Court challenging the case against him and requesting a departmental inquiry against police officers who allegedly failed to act against hate crimes, The Indian Express reported on Wednesday. Kumar and another person booked in the matter

1 days ago


Sing Up