Product Information

Shrutlekhan-Advance

Real-time dictation powered by AI, tailored for Indic languages

Brief Description

Shrutlekhan-Advance is a state-of-the-art, general-purpose continuous automatic speech recognition (ASR) system designed for Indian languages. It accurately transcribes spoken words into text, making it an invaluable tool for a wide range of applications. The system employs an end-to-end AI-driven approach, mapping audio inputs directly to text outputs using deep learning algorithms to ensure precise and reliable transcription.

Use Cases

Voice-Driven Content Creation: Individuals efficiently create content such as emails, blogs, books, and other written materials by voice dictation, enhancing productivity and accessibility in writing tasks.
Judicial and Legal Documentation: Transcribe courtroom proceedings, judgments, and daily orders.
Media and Journalism: Convert interviews, news coverage, and panel discussions into publishable text.
Education and Research: Record and transcribe lectures, seminars, and oral histories.
Corporate Communication: Document business meetings, voice memos, and client calls.
Government and Administrative Workflows: Enable digital transformation with accurate voice-to-text transcription.

Salient Features

Real-Time Online Transcription: High-speed, high-accuracy transcription of live speech.
Seamless Integration: Easy API-based integration with existing systems and workflows.
Instant and Precise Text Creation: Automatically generate and structure text from audio inputs.
Voice-Based Editing & Formatting: Control text formatting and editing using voice commands.
Audio File Transcription Support: Upload and transcribe pre-recorded audio content with ease.

Technical Specifications

Supported Languages: Indian languages (Hindi, Marathi, Kannada, Tamil, Indian English, Gujarati)
Model Architecture: End-to-End Conformer (Encoder-Decoder with Attention/CTC) Architecture
Input Types: Live Microphone Audio, Pre-recorded Audio Files (WAV, MP3, FLAC, etc.)
Output Formats: Editable Transcripts
Deployment Modes: Web-based Interface, REST APIs, On-Premise Option

Contact Details

Mahesh Bhargava

Scientist E

Multilingual Technologies Group,

C-DAC Pune.

Phone: 02025503305

Email: mbhargava[at]cdac[dot]in