Digital Preservation and Online Portal for Encyclopaedic Sanskrit Dictionary
KoshaSHRI: A Digital Encyclopaedic Dictionary of Sanskrit on Historical Principles
Brief Description
The KoshaSHRI software was developed by the Human-Centred Design & Computing Group (HCDC), C-DAC Pune, in collaboration with Deccan College, Pune, as part of the project “Digital Preservation and Online Portal for Encyclopaedic Sanskrit Dictionary,” funded by the Department of Science & Technology (DST) under the Science and Heritage Research Initiative (SHRI).
The objective of this project is to provide technological solutions to digitally preserve existing dictionary volumes, create new dictionary volumes, build the Sanskrit dictionary database, and make it available through an online search portal. The existing database consists of 15 lakh (1.5 million) vocables with more than 1 crore (10 million) reference slips documented by Deccan College. In terms of data and extent, this is one of the largest ongoing lexicographic works in any language.
The technological solutions developed in this project are as follows:
- KoshaSHRI - Sanskrit Dictionary Article Authoring Tool (AAT), powered by a crowdsourcing framework, supports the collaborative creation of new articles. The Article Authoring Tool is specialized software for preparing the vocables and volumes of the Sanskrit dictionary and publishing them online. Its crowdsourcing facility is unique in the field of Indology, as it enables Sanskrit scholars, experts, and students to contribute collaboratively to the preparation of Sanskrit vocables and volumes. (URL: https://eds.koshashri-dc.ac.in/)
- Automatic Extraction of Lexicographic Elements (Conversion of Published Volumes into a Structured Database) segments Sanskrit dictionary pages into articles and extracts meaning blocks using Optical Character Recognition (OCR) and a Detectron2 model, followed by intelligent article-wise extraction of dictionary (lexical) elements. (IEEE conference paper: https://ieeexplore.ieee.org/document/10544007)
- Sanskrit Dictionary Editor (SDE) presents the contents of dictionary articles in tagged format with in-place editing features.
- A specialized Sanskrit font, named Koshashri, comprising various Vedic symbols and matching Roman diacritical and English characters, is provided to users along with a Sanskrit text input tool.
- The KoshaSHRI Sanskrit Encyclopaedic Dictionary portal contains the vocables and volumes of the Sanskrit dictionary and is useful for Sanskrit scholars, dictionary experts, linguists, Indologists, students, and the general public. Entries can be accessed by references, citations, transliteration, grammatical categories, and a unique historical timeline for words, using search features based on various filters and types. The portal is available for public access at https://koshashri-dc.ac.in/
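The filter-based search described in the last item can be pictured with a minimal Python sketch. It is an illustration only and does not describe the portal's actual implementation or API: the `Entry` fields, the `search` function, its parameter names, and the sample records are all assumptions introduced here.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Entry:
    """One dictionary entry; every field name here is hypothetical."""
    transliteration: str               # Roman transliteration of the vocable
    category: str                      # grammatical category label
    references: Tuple[str, ...] = ()   # labels of cited sources


def search(entries: List[Entry], *,
           prefix: Optional[str] = None,
           category: Optional[str] = None,
           reference: Optional[str] = None) -> List[Entry]:
    """Return the entries that satisfy every filter that was supplied."""
    results = []
    for entry in entries:
        if prefix is not None and not entry.transliteration.startswith(prefix):
            continue
        if category is not None and entry.category != category:
            continue
        if reference is not None and reference not in entry.references:
            continue
        results.append(entry)
    return results


# Placeholder records for illustration only; not taken from the portal.
SAMPLE = [
    Entry("agni", "m.", ("SourceA", "SourceB")),
    Entry("agra", "n.", ("SourceB",)),
    Entry("aṃśa", "m.", ("SourceC",)),
]

if __name__ == "__main__":
    # Combine a transliteration prefix with a reference filter.
    for hit in search(SAMPLE, prefix="ag", reference="SourceB"):
        print(hit.transliteration, hit.category)
```

Each filter is optional and the filters combine with a logical AND, which is one common way to let users narrow results step by step.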
Use Cases
- KoshaSHRI - Sanskrit Dictionary Article Authoring Tool (AAT) is unique in the field of Indology: it enables Sanskrit scholars, experts, and students to contribute collaboratively to the preparation of Sanskrit vocables and volumes. (URL: https://eds.koshashri-dc.ac.in)
- The KoshaSHRI Sanskrit Encyclopaedic Dictionary portal is available for public access at https://koshashri-dc.ac.in/about to benefit Sanskrit scholars, dictionary experts, linguists, Indologists, students, and the general public.
Salient Features
- KoshaSHRI - Sanskrit Dictionary Article Authoring Tool (AAT), powered by a crowdsourcing framework, is unique in the field of Indology
- Enables Sanskrit scholars, experts, and students to contribute collaboratively to the preparation of Sanskrit vocables and volumes
- Koshashri, a specialized Sanskrit font, consists of various Vedic symbols and matching Roman diacritical and English characters
- Sanskrit dictionary vocables can be accessed using search features based on various filters and types, such as references, citations, transliteration, and grammatical categories
- Presents a unique historical timeline for vocables (words)
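As a rough illustration of what a historical timeline for a vocable involves, the Python sketch below orders citations chronologically and picks the earliest citation for each sense. It is not the portal's method: the `Citation` fields, the helper names, and the sample values are assumptions made up for this example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Citation:
    """A dated citation for one sense of a vocable (hypothetical fields)."""
    sense: str
    source: str
    approx_year: int  # approximate date; negative values mean BCE


def timeline(citations: List[Citation]) -> List[Citation]:
    """Return the citations ordered from earliest to latest."""
    return sorted(citations, key=lambda c: c.approx_year)


def earliest_by_sense(citations: List[Citation]) -> Dict[str, Citation]:
    """Map each sense to its earliest citation."""
    earliest: Dict[str, Citation] = {}
    for citation in timeline(citations):
        # setdefault keeps the first (earliest) citation seen for a sense.
        earliest.setdefault(citation.sense, citation)
    return earliest


# Placeholder citations for illustration only; not real dictionary data.
SAMPLE_CITATIONS = [
    Citation("sense 2", "SourceB", 400),
    Citation("sense 1", "SourceA", -800),
    Citation("sense 1", "SourceC", 1100),
]

if __name__ == "__main__":
    for c in timeline(SAMPLE_CITATIONS):
        print(c.approx_year, c.sense, c.source)
```

Sorting once and then taking the first citation per sense keeps the two views (full timeline and first attestation) consistent with each other.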
Chief Investigator Details
Nigod Dayal Dhurke, ndhurke[at]cdac[dot]in