• Home
    • About
    • Ongoing Projects
    • Collaboration
  • People
  • Publications
    • Conferences
    • Journals
    • Workshops
  • Resources
  • Affairs
    • Events
    • Friday Talks
  • Achivements
  • Gallery
  • Contact
Latest News


Upcoming Talks

Prajwal Vijay Kajare

PhD Student, IIT Jodhpur
Date: 08-08-2025
Topic: Beyond the trade-off: Self-Supervised Reinforcement Learning for Reasoning Models’ Instruction Following

Abstract – Prajwal Vijay Kajare
The paper presents a method which combines Self-supervised learning and Reinforcement Learning to improve the instruction following ability of the reasoning model.

Previous Talks

Dr. Saloka Sengupta

JRA Language
Date: 25-07-2025
Topic: "Toward cultural interpretability: A linguistic anthropological framework for describing and evaluating large language models" by Graham M Jones, Shai Satran and Arvind Satyanarayan

Rochit Ranjan

IIT Jodhpur

Date: 18-07-2025

Topic: Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation


Dr. Saroj Kumar Jha

SRA Language

Date: 11-07-2025
Topic: Linguistic Resources for Bhojpuri, Magahi, and Maithili: Statistics about Them, Their Similarity Estimates, and Baselines for Three Applications.


Soham Bhattacharjee

JRA Technical
Date: 04-07-2025
Topic: DRT: Deep Reasoning Translation via Long Chain-of-Thought



Dr. Pramod Singh

JRA Language
Date: 27-06-2025
Topic: अनुवाद (translation)



Baban Gain

PhD Student, IIT Patna
Date: 13-06-2025
Topic: Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation

Nitika Gupta

Junior Research Associate – Language
Date: 06-06-2025
Topic: Importance of Case Markers in Bengali Language

Dibyanayan Bandyopadhyay

PhD student
Date: 30-05-2025
Topic: Zero Shot Natural Language Explanation



Bigyan Ranjan Das

JRA Language (Hindi-Odia)
Date: 23-05-2025
Topic: Nuances of Hindi-Odia Translation_Exploring Linguistic, Cultural, and Grammatical Subtleties

Somenath Nag Choudhury

PhD Student, IIT Patna
Date: 02-05-2025
Title: MITER: Medical Image–Text joint adaptive pretraining with multi-level contrastive learning

Ayodhya Murmu
Ayodhya Murmu

JRA (Language: Santali)
Date: 09-05-2025
Title: Facing Issues and Solution in Hindi to Santali Validation

Priyanshu Priya

PhD Student, IIT Patna
Date: 25-04-2025
Topic: AAAI-25 Conference Chronicles: Unveiling the Highlights and Takeaways


Anansa Roy
Anansa Roy

JRA (Language)
Date: 18-04-2025
Title: Postposition and its Usage in Bangla

Dr. Vineet Kumar Lal Das

JRA Language
Date: 28-03-2025
Topic: भाषण का शीर्षक - हिन्दी-मैथिली अनुवाद (मानवीय) करते समय होने वाली त्रुटियां


Abid Hossain

Phd Students
Date: 21-03-2025

Topic: Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting

Souravi Halder

JRA Language
Date: 07-03-2025
Topic: Web Development




Dr. Saroj Kumar Jha

SRA Language
Date: 14-02-2025
Topic: ILCI Corpora Guidelines for Annotation and Validation


Saurav Dudhate
Saurav Dudhate

B.Tech CSE (IIT Patna)
Date: 31-01-2025
Title: Direct Preference Optimization


Somenath Nag Choudhury

PhD Student, IIT Patna
Date: 07-01-2025
Topic: MAIRA-2: Grounded Radiology Report Generation


Abstract – Dr. Saloka Sengupta
In this talk the speaker discusses the article in which a new integration of linguistic anthropology and machine learning (ML) around convergent interests in understanding the underpinnings of language and making language technologies more socially responsible is proposed.
Abstract – Rochit Ranjan
A Systematic Analysis of Large Language Models into Assembly Code Obfuscation
Abstract – Dr. Saroj Kumar Jha
This work focuses on building linguistic resources for low-resource Purvanchal languages—Bhojpuri, Magahi, and Maithili—by collecting, cleaning, and annotating corpora from various domains. Basic statistical analyses were conducted at multiple linguistic levels, and comparisons were made with Hindi to assess similarities and complexities. POS tagging, chunking, and language identification tools were developed using BIS tagsets, along with bilingual dictionaries and synsets. The study also introduces a novel n-gram-based method for measuring language similarity. Overall, it lays foundational resources and baselines for future NLP research on these languages.
Abstract – Soham Bhattacharjee
This paper tackles free translation (conveying the sense but not word for word translation) using deep reasoning LLM and Chain of Thought.



Abstract – Dr. Pramod Singh
Dr. Pramod Singh shared his insights on the multifaceted nature of translation. He emphasized that translation plays a crucial role in the development of language, as it not only bridges linguistic gaps but also addresses inherent language-related challenges. Dr. Singh described translation as a long and evolving journey that requires a deep understanding of the nature of language. He highlighted that a translator must be able to faithfully convey the original author’s emotions and thoughts in another language. Therefore, for translation to be successful, it is essential to grasp the aesthetic and emotional nuances of the language.
Abstract – Baban Gain
This study finds that LLMs perform best at translation evaluation when using reference translations, while source sentences sometimes degrade their performance. Despite the source containing essential meaning for assessing fidelity, LLMs struggle to effectively utilize it—revealing a weakness in their cross-lingual reasoning.
Paper: Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation (Findings of ACL 2024)
Abstract – Nitika Gupta
This talk explores how case markers in Bengali define grammatical relationships, affecting syntax and semantics. It highlights their vital role in accurate linguistic analysis and translation between Indian languages and Bengali.
Abstract – Dibyanayan Bandyopadhyay
Can we generate a faithful Natural Language Explanation for classifiers in zero shot manner?
Can we express in Natural Language why a classifier predicted a class label with only access to its inner representation?
Paper: https://openreview.net/forum?id=X6VVK8pIzZ (ICLR 2025)
Abstract – Bigyan Ranjan Das
Hindi-Odia translation presents unique challenges due to differences in phonology, grammar, vocabulary, and cultural expressions. While both languages stem from the Indo-Aryan family, variations in verb conjugations, levels of politeness, and idiomatic usage often hinder direct translation. Script differences and regional influences further complicate the process. To overcome these nuances, translators must apply contextual understanding, cultural sensitivity, and adaptive language strategies.
Abstract – Somenath Nag Choudhury
MITER presents a two-stage pretraining framework that first leverages uni-modal objectives to independently adapt image and text encoders to the medical domain, then employs a cross-modal contrastive loss to jointly align and fuse their representations—facilitating stronger vision-language learning. It introduces a dynamic hard- negative sampling strategy based on alignment and uniformity principles, improving model robustness and representation discrimination. As a result, MITER significantly outperforms stateoftheart methods across four downstream medical tasks, image-report retrieval, multi- label classification, visual question answering, and report generation.
Abstract – Ayodhya Murmu
The topic discusses the key challenges encountered during Hindi-to-Santali translation validation, focusing on issues like punctuation mismatches, improper transliteration of technical terms, and incorrect handling of named entities such as names of people or organizations. It also addresses problems related to verb tense, case markers, and lexical choices that affect sentence clarity. Additional concerns include errors in singular/plural usage, omissions or additions during translation, and direct translations from English that may not be culturally or linguistically suitable. The talk offers corrective measures to ensure accurate, consistent, and context-appropriate validation.
Abstract – Priyanshu Priya
This talk presents key highlights and insights from the AAAI-25 Conference, capturing breakthrough research trends, impactful sessions, and emerging directions in artificial intelligence.
Abstract – Anansa Roy
Bengali postpositions exhibit a dual nature, functioning both as independent lexical items with case markers and as inflectional elements denoting spatial relations. Their distribution is uneven across grammatical cases, with certain cases supporting a richer set of postpositions than others.
Abstract – Dr. Vineet Kumar Lal Das
भाषा संबंधी त्रुटियाँ,वाक्य संरचना से संबंधित त्रुटियाँ, शब्द छुटने संबंधी त्रुटियाँ, प्रेरणार्थक क्रिया को स्वतः क्रिया में व्यक्त करने संबंधी त्रुटियाँ, विभक्ति लोप संबंधी त्रुटियां, सहायक क्रिया संबंधी त्रुटियां, क्रिया संबंधी त्रुटियां, आदर/अनादर शब्द संबंधी त्रुटियां, टाइपिंग संबंधी त्रुटियां, वर्तनी संबंधी त्रुटियां आदि के बारे में जानकारी के साथ उसके निदान पर विस्तृत चर्चा की गई|
Abstract – Abid Hossain
In this talk, I highlighted how large language models exhibit a strong English bias that hinders their performance on low-resource languages and introduced Cross-Lingual-Thought Prompting (XLT), a language-agnostic prompt template that leverages English chain-of-thought reasoning to boost accuracy across 27 diverse languages. The experiments show that XLT yields substantial gains in reasoning, inference, and generation tasks without updating model parameters.
Abstract – Souravi Halder
Web development is the process of creating and maintaining websites and web applications that run on the internet. It encompasses several disciplines, including front-end development, back-end development, and full-stack development. Front-end development focuses on the user interface and user experience, using technologies such as HTML, CSS, and JavaScript. Back-end development handles server-side logic, database interactions, and application functionality, often using languages like Python, PHP, Java, or Node.js. Full-stack development combines both front-end and back-end skills. Web development also involves understanding protocols like HTTP, managing data through databases (e.g., MySQL, MongoDB), and deploying sites using web servers and cloud platforms. This abstract introduces the foundational tools, concepts, and practices that are essential for building modern, responsive, and interactive websites.
Abstract – Saroj Jha
This guideline outlines the annotation of Hindi text using the ILPOSTS (Indian Language POS Tag Set), a hierarchical and flexible tagset framework developed by Microsoft Research India. ILPOSTS enables cross-linguistic compatibility and allows derivation of language-specific tagsets for diverse Indian languages. It supports morphosyntactic annotation at varying levels of granularity to suit specific linguistic and NLP tasks.
Abstract – Saurav Dudhate
Presented a paper on Direct Preference Optimization (DPO), a novel approach to fine-tuning large language models using human preferences without complex reinforcement learning. Unlike traditional RLHF methods, DPO uses a simple classification loss, making the process more stable, efficient, and easier to implement. It outperforms or matches existing methods in tasks like sentiment control, summarization, and dialogue generation.
Abstract – Somenath Nag Choudhury
The paper presents MAIRA2, a multimodal model that generates radiology reports from chest Xrays with optional spatial grounding, associating each finding with boundingbox annotations to enhance traceability and interpretability. By leveraging comprehensive inputs—including frontal/lateral current images, prior studies and reports, plus clinical context—MAIRA2 outperforms prior models on report quality and reduces hallucinations, setting a new stateoftheart on MIMICCXR. The authors introduce RadFact, an LLMbased evaluation framework that quantifies factual correctness and spatial localization accuracy at the sentence level, enabling rigorous assessment of the new grounded reporting task.
© Copyright AI-NLP-ML Group, Department of CSE, IIT Patna. All Rights Reserved