Bharati Sangraha Launched

Agartala

February 20, 2026

In a significant step toward strengthening India’s multilingual digital ecosystem, the Hon’ble Union Minister of Home Affairs and Cooperation, Shri Amit Shah, launched Bharati Sangraha: A National Digital Repository for India’s Multilingual Future on February 20, 2026, at the Joint Regional Official Language Conference of the Eastern, North-Eastern, and Northern Regions, held at the International Indoor Exhibition Centre, Hapania, Agartala, Tripura.

The landmark event was graced by an esteemed gathering of national and state dignitaries, including Dr Manik Saha, Hon’ble Chief Minister of Tripura; Shri Bandi Sanjay Kumar, Hon’ble Union Minister of State for Home Affairs; Shri Rajeev Bhattacharya, Hon’ble Member of Parliament, Rajya Sabha, Tripura; Shri Biplab Kumar Deb, Hon’ble Member of Parliament, Lok Sabha, Tripura (West); Smt Maharani Kriti Singh Debbarman, Hon’ble Member of Parliament, Lok Sabha, Tripura (East); Smt Anjuli Arya, Secretary, Department of Official Language; and other dignitaries.

A Strategic Initiative for Linguistic Integration

Bharati Sangraha has been developed by C-DAC, Pune, for the Department of Official Language, Ministry of Home Affairs, Government of India. The project is led by Dr Shashi Pal Singh, Scientist F and Project Director, OLP Group, C-DAC Pune.

The initiative is part of the broader Bharati Bahubhashi Anuvada Sarthi programme and focuses on the systematic creation, collection, validation, and management of high-quality multilingual datasets. Supporting English and 15 Indian languages, Bharati Sangraha serves as a centralized and secure national linguistic repository designed to power AI-driven language technologies.

India’s linguistic diversity is one of its greatest strengths. However, advanced digital systems such as Machine Translation, Automatic Speech Recognition (ASR), Text-to-Speech (TTS), and Natural Language Processing (NLP) require structured and validated datasets to function accurately. Bharati Sangraha addresses this gap by providing standardized, model-ready linguistic resources.

Building the Foundation for Multilingual AI

At its core, Bharati Sangraha aims to create a robust digital corpus that can be used to train and refine language models. By establishing structured data acquisition frameworks and multi-stage validation workflows, the platform ensures high-quality outputs suitable for AI and research applications.

The initiative brings together government bodies, public institutions, universities, researchers, and language experts onto a unified digital platform. This collaborative ecosystem enhances data authenticity, linguistic accuracy, and scalability.

The platform will benefit students, researchers, AI developers, and government departments alike. It provides a structured framework for linguistic research, technological innovation, and policy implementation. By strengthening language technologies, the initiative supports seamless communication, documentation, and digital interactions across regions.

Strengthening India’s Position in AI

As nations worldwide invest in AI capabilities, access to high-quality linguistic data has become a strategic necessity. Bharati Sangraha positions India to harness its linguistic diversity as a technological advantage. By systematically developing curated datasets across multiple Indian languages, the platform ensures that AI innovations are not limited to a few widely spoken languages but extend across the country’s rich linguistic landscape.

The platform can be accessed at: https://bharati-sangraha.rajbhasha.co.in/

The launch of Bharati Sangraha represents more than the introduction of a digital platform: it marks the beginning of a structured national effort to integrate language, technology, and governance. By bridging linguistic diversity with advanced digital tools, India moves closer to building an inclusive, AI-powered future rooted in its cultural and linguistic heritage.
