MUMBAI, India, Jan. 7 -- Intellectual Property India has published a patent application (202511103994 A) filed by Ajay Kumar Garg Engineering College, Ghaziabad, Uttar Pradesh, on Oct. 29, 2025, for 'multimodal language learning system combining text, audio, and visual inputs for english education.'
Inventor(s) include Dr. Mohit Tiwari; and Bhavya Jain.
The application for the patent was published on Dec. 12, under issue no. 50/2025.
According to the abstract released by the Intellectual Property India: "A multimodal language learning system for English education is disclosed. Textual passages are rendered with synchronized audio narrations and context-relevant visual assets. Learner inputs include typed responses, spoken utterances, and image selections. A capture pipeline normalizes audio and visual data and buffers text edits. A multimodal analysis engine computes linguistic, acoustic, and visual features. A temporal alignment module maps tokens and phonemes to video scene markers on a shared timeline. A fusion model produces a learner state vector with trait scores, uncertainty values, and engagement indicators. A recommendation module selects activities based on expected learning gain and cognitive load constraints. A feedback generator delivers layered corrections in text, audio, and visual forms. A progress tracker updates a mastery graph and supports longitudinal adaptation and institutional reporting."
Disclaimer: Curated by HT Syndication.