Student Research Workshop (SRW) Accepted Papers

From Sentences to Proof Trees: Leveraging Language Models for Structured Reasoning
Aayushee Gupta
Understanding Subliminal Learning: Generality, Sensitivity, and Token-Level Explanations
Yagnesh Veeraraghavan, Keanu Lim, Jacob Lipner, Saanvi Ibrahimpatnam, Kevin Zhu, Madhur Panwar
Active Learning for Corpus Refinement: Cost-Effective Preprocessing to Improve Validity of Applied Quantitative Text Analysis
Jakob Steglich, Stephan Poppe
You Didn’t Have to Say It Like That: Subliminal Learning from Faithful Paraphrases
Isaia Gisler, Zhonghao He, Tianyi Alex Qiu
Broken Chains: The Cost of Incomplete Reasoning in LLMs
Ian Su, Gaurav Purushothaman, Jey Narayan, Ruhika Goel, Kevin Zhu, Sunishchal Dev, Yash More, Maheep Chaudhary
Pushing the Boundaries of Multiple Choice Evaluation to One Hundred Options
Nahyun Lee, Guijin Son
Colorism in Large Vision-Language Models: An Empirical Exploration of Socioeconomic Linguistic Bias
Raj Gaurav Maurya, Vaibhav Shukla, Sreedath Panat
Hospitality-VQA: Decision-Oriented Informativeness Evaluation for Vision–Language Models
Jeongwoo Lee, Baek Duhyeong, Eungyeol Han, Soyeon Shin, Gukin Han, Seungduk Kim, Jaehyun Jeon, Taewoo Jeong
TimeRes: A Turkish Benchmark For Evaluating Temporal Understanding of Large Language Models
Habib Yağız Demir, Susan Üsküdarlı, Ümit Atlamaz
What the Router Sees Matters: Funnel Pooling for Fast, Content Driven Expert Routing
Josef Pichlmeier, Sebastian Nicolas Mueller, Jakob Sturm, Josef Dräxl, Andre Luckow
FluffInjector: Diagnosing Logical Consistency Failures in Chain-of-Thought Reward Models
Varshith Vijjapu, Krishiv Ray, Archana Vaidheeswaran
Emergent Misalignment: Tracking the Emergence and Evolution of Misaligned Traits throughout Model Training
Geunwoo Park, Pranay Chauhan, Haihao Liu
Beep boop: Bot Detection as a Preprocessing Step for Polish Reddit
Karmela Matyjaszek
An Evaluation of Classifiers for Mapping Generative LLM Responses to Answer Options of Multiple-choice Questionnaires
Alisea Stroligo, Anna Shamray, Julian Schelb, Andreas Spitz
Bring the Apple, Not the Sofa: Impact of Irrelevant Context in Embodied AI Commands on VLA Models
Andrey Moskalenko, Daria Pugacheva, Denis Shepelev, Andrey Kuznetsov, Vlad Shakhuro, Elena Tutubalina
Thesis Proposal: Comparing Human and Model Perception of Writing Style under Controlled Perturbations
Ewelina Paulina Księżniak
Automatic Generation of a Compositional QA Benchmark for Geospatial Reasoning under Spatial and Entity Constraints
Tetsuhisa Suizu, Shohei Higashiyama, Hiroyuki Shindo, Hiroki Ouchi, Sakriani Sakti
lrnn-lib: A library for Linear RNNs
Karan Bania, Soham Kalburgi, Manit Tanwar, Dhruthi, Aditya Nagarsekar, Harshvardhan Mestha, Naman Chibber, Raj Deshmukh, Anish Sathyanarayanan, Aarush Rathore, Pratham Chheda
Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation
Julia Belikova, Danila Rozhevskii, Dennis Svirin, Konstantin Polev, Alexander Panchenko
Thesis Proposal: Stability-Aware, Evidence-Grounded Knowledge Graphs for Substance Use Disorders and Social Determinants of Health
Gautham Vijay Kumar
Energy Matching based Preference Learning for Diffusion Language Models
Shiv Shankar
Thesis Proposal: Measuring Prejudice at Scale
Zoran Fijavž, Senja Pollak, Veronika Bajt
Evaluating Cost-Efficiency of LLMs in a RAG Setup on Polish Wikipedia: Quality vs. Energy Consumption
Patrycja Smits, Tomasz Walkowiak
How Do Lexical Senses Correspond Between Spoken German and German Sign Language?
Melis Çelikkol, Wei Zhao
From Detection to Explanation: Modeling Fine-Grained Emotional Social Influence Techniques with LLMs and Human Preferences
Maciej Markiewicz, Wiktoria Mieleszczenko-Kowszewicz, Beata Bajcar, Tomasz Adamczyk, Aleksander Szczęsny, Jolanta Babiak, Przemyslaw Kazienko
Beyond Bias Scores: Unmasking Vacuous Neutrality in Small Language Models
Sumanth Manduru, Carlotta Domeniconi
LLMs Exhibit Performative Fairness When Generating Profiles with Complex Geopolitical Identities
Maida Aizaz, Quang Minh Nguyen
Learning Nested Named Entity Recognition from Flat Annotations
Igor Rozhkov, Natalia V Loukachevitch
Efficient Low-Resource Language Model Using Tokenizer Transfer
Gustaf Gren, Murathan Kurfali
DRAGOn: Designing RAG On Periodically Updated Corpus
Fedor Chernogorskii, Sergei Averkiev, Liliya Kudraleeva, Zaven Martirosian, Maria Tikhonova, Valentin Malykh, Alena Fenogenova
Fake News Detection Strategies under Dataset Bias: Using Large-scale Coarse-grained Labels
Yuki Kishi, Yuji Arima, Hitoshi Iyatomi
Thesis Proposal: A Multi-Agent System for Ontology-Based Perspective-Aware Knowledge Extraction
Luiz do Valle Miranda, Grzegorz J. Nalepa
A Computational Forensic Linguistic Analysis of Narrative and Question-Answer Structures in Italian Police Interrogation Transcripts
Romane Werner, Thomas François, Sonja Bitzer
Annotation-Efficient Vision-Language Model Adaptation to the Polish Language Using the LLaVA Framework
Grzegorz Statkiewicz, Alicja Dobrzeniecka, Karolina Seweryn, Aleksandra Krasnodębska, Karolina Piosek, Katarzyna Bogusz, Sebastian Cygert, Wojciech Kusa
Evaluating the Impact of SAE-based Language Steering on LLM Performance
Sebastian Zwirner, Wentao Hu, Koshiro Aoki, Daisuke Kawahara
Towards Singable Lyrics Translation Using Large Language Models
Liu Hanze, Yusuke Sakai, Taro Watanabe
Thesis Proposal: Development of End-to-End Speech Translation Models for Indian Languages
Jamaluddin
Probabilistic Bilingual Subword Segmentation with Latent Subword Alignment
Shoto Nishida, Daiki Matsui, Takashi Ninomiya, Isao Goto, Akihiro Tamura
Text-to-Text Automatic Story Generation: A Survey
Yuan Ma, Hanna Suominen, Patrik Haslum, Richard Susilo
In-Image Machine Translation. A Preliminary Modular Approach
Sergio Gomez Gonzalez, Miguel Domingo, Francisco Casacuberta
Plasticity vs. Rigidity: The Impact of Low-Rank Adapters on Reasoning on a Micro-Budget
Zohaib Khan, Omer Tafveez, Zoha Hayat Bhatti
Scale Is All You Need 🙄: Analyzing Modality Interaction and Speaker Intent Without Fine-Tuning
Animesh Gurjar, Nikhil Krishnaswamy
Token Pruning for Improving Graph-Generating State Space Model Performance
Monish Beegamudre, Jack Zheng, Margaret Capetz
GraphRAG-Rad: Concept-Aware Radiology Report Generation via Latent Visual-Semantic Retrieval
Faezeh Safari, Hang Dong, Zeyu Fu, Aline Villavicencio
When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models
Zafir Shamsi, Nikhil Chekuru, Zachary Guzman, Shivank Garg
Chronocept: Instilling a Sense of Time in Machines
Krish Goel, Sanskar Pandey, Mahadevan KS, Harsh Kumar, Vishesh Khadaria
Acceleration of Backpropagation in Linear Layers of Transformer Models Based on Gradient Structure
Dmitrii Topchii, Alexander Panchenko, Viktoriia A. Chekalina
The Clinical Fingerprint: Comparing the Rhetorical Integrity and Epistemic Safety of Human Physicians and Large Language Models
Bayram Ayadi
Communication as a Complex System: Modeling the Feedback Dynamics of Trust and Credibility
Swaptik Chowdhury, Samuel D. Allen, Jung Hee Hyun
Thesis Proposal: Multimodal Benchmark for Music Understanding in Large Language Models
Tomáš Sourada
Who Plays Which Role? Protagonist Detection and Classification in Moral Discourse
Mirko Sommer, Maria Becker
A Benchmark and Evaluation of Automated Language of Study Extraction from Computational Linguistics Publications
Ashwin Kirubakaran, Henry Gagnier
Kahaani: A Multimodal Co-Creative Storytelling System
Samee Arif, Taimoor Arif, Muhammad Saad Haroon, Aamina Jamal Khan, Agha Ali Raza, Awais Athar
Exploring the Semantic Space of Second Language Learners
Trisha Godara, Rui He, Wolfram Hinzen, Yan Cong
CAPID: Context-Aware PII Detection for Question-Answering Systems
Mariia Ponomarenko, Sepideh Abedini, Masoumeh Shafieinejad, D. B. Emerson, Shubhankar Mohapatra, Xi He
Generalising LLM Routing using Past Performance Retrieval: A Few-Shot Router is Sufficient
Clovis Varangot-Reille, Christophe Bouvard, Antoine Gourru
Call, Reward, Repeat: Advancing Dialog State Tracking with GRPO and Function Calling
Timur Ionov, Anna Marshalova, Valentin Malykh
Different Time, Different Language: Revisiting the Bias Against Non-Native Speakers in GPT Detectors
Adnan Al Ali, Jindřich Helcl, Jindřich Libovický
Trainable, Multiword-aware Tokenization Using Modern Neural Networks
Clara Boesenberg, Kilian Evang
LEMUR: Robust Fine-Tuning for Multilingual Embedding Models for Retrieval
Narges Baba Ahmadi, Jan Strich, Martin Semmann, Chris Biemann
Comprehensive Comparison of RAG Methods Across Multi-Domain Conversational QA
Klejda Alushi, Jan Strich, Chris Biemann, Martin Semmann
Comparing Text Compression Capabilities of Large Language Models with Traditional Compression Algorithms
Mehran Haddadi, William John Teahan
Construction of an Evaluation Dataset for Hallucination Detection in Japanese Summarization Task
Hikari Tanaka, Atsushi Keyaki, Mamoru Komachi
Thesis Proposal: Are We Losing Textual Diversity to Natural Language Processing?
Josef Jon, Ondřej Bojar
Beyond One-Step Distillation: Bridging the Capacity Gap in Small Language Models via Multi-Step Knowledge Transfer
Gaeun Yim, Nayoung Ko, Manasa Bharadwaj
Thesis Proposal: COGNILENS: Analyzing Cognitive Decline in Language Models for Alzheimer’s Monitoring
Jonathan Guerne
Thesis Proposal: Efficient KV Cache Reuse for Multi-Document Retrieval-Augmented Generation
Zhipeng Zhang, Dmitry Ilvovsky
Voice Identification of 1960s Tamil Singers Using Transfer Learning for Preserving Cultural Heritage
Sathiyakugan Balakrishnan, Uthayasanker Thayasivam
What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation
Weiwen SU, Yuhan Zhou, Zihan Wang, Naoki Yoshinaga, Masashi Toyoda
Modality Matching Matters: Calibrating Language Distances for Cross-Lingual Transfer in URIEL+
York Hay Ng, Aditya Khan, Xiang Lu, Matteo Salloum, Michael Zhou, Phuong Hanh Hoang, A. Seza Doğruöz, En-Shiun Annie Lee
Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety
Denis Janiak, Julia Moska, Dawid Motyka, Karolina Seweryn, Paweł Walkowiak, Bartosz Żuk, Arkadiusz Janz
Machine Translation for Low-Resource Languages through Monolingual Data and LLM: A Case Study of English-to-Basque
Nam Luu, Aitor Soroa, German Rigau, Ondřej Bojar
Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer
Sinoué GAD, Maxence Lasbordes
PATCH Dataset: Empowering Traditional Chinese Safety Classifiers for Lightweight LLM
Chi-Wei Chang, Chiung-Jui Chen, Richard Tzong-Han Tsai
Do Multi-Agents Solve Better Than Single? Evaluating Agentic Frameworks for Diagram-Grounded Geometry Problem Solving and Reasoning
Mahbub E Sobhani, Md. Faiyaz Abdullah Sayeedi, Mohammad Nehad Alam, Proma Hossain Progga, Swakkhar Shatabda
Domain Adaptation of Image Encoder for Multimodal Manga Translation
Kota Manabe, Tomoyuki Kajiwara, Takashi Ninomiya, Isao Goto, Shonosuke Ishiwatari, Hiroshi Noji
Mask What Matters: Mitigating Object Hallucinations in Large Vision–Language Models with Object-Aligned Visual Contrastive Decoding
Boqi Chen, Xudong Liu, Jianing Qiu