Industry Track Accepted Papers
Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration
Guangxin Wu, Hao Zhang, Zhang Zhibin, Jiafeng Guo, Xueqi Cheng
SCRIPTMIND: Crime Script Inference and Cognitive Evaluation for LLM-based Social Engineering Scam Detection System
Heedou Kim, changsik Kim, Sanghwa Shin, Jaewoo Kang
From Paper to Structured JSON: An Agentic AI Workflow for Compliant BMR Digital Transformation
Bhavik Agarwal, Nidhi Bendre, Viktoria Rojkova
Compact Multimodal Language Models as Robust OCR Alternatives for Noisy Textual Clinical Reports
Nikita Neveditsin, Pawan Lingras, Salil Patil, Swarup Patil, Vijay Kumar Mago
PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents
Minjia Wang, Yunfeng Wang, Xiao Ma, Dexin Lv, Qifan Guo, Lynn Zheng, Benliang Wang, Lei Wang, Jiannan Li, Yongwei Xing, Junzhe Xu, Zheng Sun
Evaluating the Pre-Consultation Ability of LLMs using Diagnostic Guidelines
Jean Seo, Gibaeg Kim, Kihun Shin, Seungseop Lim, Hyunkyung Lee, Wooseok Han, Jongwon Lee, Eunho Yang
SELENE: Selective and Evidence-Weighted LLM Debating for Efficient and Reliable Reasoning
Akshay Verma, Swapnil Gupta, Deepak Gupta, Prateek Sircar, Siddharth Pillai
SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code
Shima Imani, Seungwhan Moon, Adel Ahmadyan, Lu Zhang, Ahmed Kirmani, Babak Damavandi
KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference
Sai Gokhale, Devleena Das, Rajeev Patwari, Ashish Sirasao, Elliott Delaye
MizanQA: A Benchmark for Multi-Answer Moroccan Legal QA
Adil Bahaj, Mounir Ghogho
Router-Suggest: A Router-based Framework for Auto-Completions in Visually-Grounded Conversations
SANDEEP MISHRA, Devichand Budagam, Anubhab Mandal, Bishal Santra, Pawan Goyal, Manish Gupta
Beyond Unified Models: A Service-Oriented Approach to Low Latency, Context Aware Phonemization for Real Time TTS
Mahta Fetrat Qharabagh, Donya Navabi, Zahra Dehghanian, Morteza Abolghasemi, Hamid R. Rabiee
Retrieval Enhancements for RAG: Insights from a Deployed Customer Support Chatbot
Daniel González Juclà, Mohit Tuteja, Marcos Esteve Casademunt, Keshav Unnikrishnan, Yasir Usmani, Arvind Roshaan
Scaling Intent Understanding: A Framework for Classification with Clarification using Lightweight LLMs
Subhadip Nandi, Tanishka Agarwal, Anshika Singh, Priyanka Bhatt
Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence
Sumanth Balaji, Piyush Mishra, Aashraya Sachdeva, Suraj Agrawal
HotelQuEST: Balancing Quality and Efficiency in Agentic Search
Guy Hadad, Shadi Iskander, Sofia Tolmach, Oren Kalinsky, Haggai Roitman, Ran Levy
TASER: Table Agents for Schema-guided Extraction and Recommendation
Nicole Cho, Kirsty Fielding, William Watson, Sumitra Ganesh, Manuela Veloso
TAGQuant: Token-Aware Clustering for Group-Wise Quantization
Jaeseong Lee, seung-won hwang, Aurick Qiao, Zhewei Yao, Yuxiong He
Beyond Grid Search: Leveraging Bayesian Optimization for Accelerating RAG Pipeline Optimization
Anum Afzal, Xueru Zheng, Florian Matthes
BornoDrishti: Leveraging Vision Encoders and Domain-Adaptive Learning for Bangla OCR on Diverse Documents
S M Jishanul Islam, Md Mehedi Hasan, Masbul Haider Ovi, AKM SHAHARIAR AZAD RABBY, Fuad Rahman
MobileCity: An Efficient Framework for Large-Scale Urban Behavior Simulation
Xiaotong Ye, Nicolas Bougie, Toshihiko Yamasaki, Narimawa Watanabe
Is Micro Domain-Adaptive Pre-Training Effective for Real-World Operations? Multi-Step Evaluation Reveals Potential and Bottlenecks
Masaya Tsunokake, Yuta Koreeda, Terufumi Morishita, Koichi Nagatsuka, Hikaru Tomonari, Yasuhiro Sogawa
A Compliance-Preserving Retrieval System for Aircraft MRO Task Search
Byungho Jo
No Label? No Problem: Unsupervised Continual Learning for Adaptive Medical ASR
Meizhu Liu, Tao Sheng
EduPulse: A Practical LLM-Enhanced Opinion Mining System for Vietnamese Student Feedback in Educational Platforms
Nguyen Xuan Phuc, Phi Nguyen Xuan, Vinh-Tiep Nguyen, Thìn Đặng Văn, Ngan Luu-Thuy Nguyen
When Speed Meets Intelligence: Scalable Conversational NER in an Ever-evolving World
Karim Ghonim, Antonio Roberto, Davide Bernardi
ReflectiveRAG: Rethinking Adaptivity in Retrieval-Augmented Generation
Akshay Verma, Swapnil Gupta, Siddharth Pillai, Prateek Sircar, Deepak Gupta
OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets
Jiyuan SHEN, Yuan Peiyue, Atin Ghosh, Yifan Mai, Daniel Dahlmeier
PatentVision: A multimodal method for drafting patent applications
Ruo Yang, Sai Krishna Reddy Mudhiganti, Manali Sharma
VideoMind: Thinking in Steps for Long Video Understanding
Shubhang Bhatnagar, Renxiong Wang, Kapil Krishnakumar, Adel Ahmadyan, Zhaojiang Lin, Lambert Mathias, Xin Luna Dong, Babak Damavandi, Narendra Ahuja, Seungwhan Moon
RegNLI: Detecting Online Product Misbranding through Legal and Linguistic Alignment
Diya Saha, Abhishek Bharadwaj Varanasi, Tirthankar Dasgupta, Manjira Sinha
CASPER: Bridging Discrete and Continuous Prompt Optimization through Feedback-Guided Gradient Descent
Aryan Jain, Pushpendu Ghosh, Promod Yenigalla
Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement
Aaditya Shukla, Sidney Knowles, Meenakshi Madugula, David Farris, Ryan Angilly, Santiago Pombo, Lu An, Anbang Xu, Abhinav Balasubramanian, Tan Yu, Jiaxiang Ren, Rama Akkiraju
Medical Summarization in Practice: Design, Deployment, and Analysis of a Clinical Summarization System for a German Hospital
Moiz Rauf, Sean Papay
Feedback-Aware Prompt Optimization Framework for Generating Job Postings
Suraj Maharjan, Ainur Yessenalina, Srinivasan H. Sengamedu
Enhancing User Safety: Context-Aware Detection of Offensive Query-Ad Pairs in Multimodal Search Advertising
Gaurav Kumar, Qiangjian Xi, Tanmaya Shekhar Dabral, Hooshang Ghasemi, Abishek Krishnamoorthy, Danqing Fu, Rui Min, Emilio Antunez, Zhongli Ding, Pradyumna Narayana
SAGE: An Agentic Explainer Framework for Interpreting SAE Features in Language Models
Jiaojiao Han, Wujiang Xu, Mingyu Jin, Mengnan Du
Adapting Vision-Language Models for E-commerce Understanding at Scale
Matteo Nulli, Orshulevich Vladimir, Tala Bazazo, Christian Herold, Michael Kozielski, Marcin Mazur, Szymon Tuzel, Cees G. M. Snoek, Seyyed Hadi Hashemi, Omar Javed, Yannick Versley, Shahram Khadivi
MedRiskEval: Medical Risk Evaluation Benchmark of Language Models, On the Importance of User Perspectives in Healthcare Settings
Jean-Philippe Corbeil, Minseon Kim, Maxime Griot, Sheela Agarwal, Alessandro Sordoni, Francois Beaulieu, Paul Vozila
Synthetic Doctor-Patient Dialogue Generation for Robust Medical ASR: A Scalable Pipeline for Vocabulary Expansion and Privacy Preservation
Kefei Liu, Meizhu Liu
Lessons from the Field: An Adaptable Lifecycle Approach to Applied Dialogue Summarization
Kushal Chawla, Chenyang Zhu, Pengshan Cai, Sangwoo Cho, Scott Novotney, Ayushman Singh, Jonah Lewis, Keasha Safewright, Alfy Samuel, Erin Babinsky, Shi-Xiong Zhang, Sambit Sahu
LingVarBench: Benchmarking LLMs on Entity Recognitions and Linguistic Verbalization Patterns in Phone-Call Transcripts
Seyedali Mohammadi, Manas Paldhe, Amit Chhabra, Youngseo Son, Vishal Seshagiri
Improving Training Efficiency and Reducing Maintenance Costs via Language Specific Model Merging
Alphaeus Dmonte, Vidhi Gupta, Daniel J Perry, Mark Arehart
The Subtle Art of Defection: Understanding Uncooperative Behaviors in LLM based Multi-Agent Systems
Devang Kulshreshtha, Wanyu Du, Raghav Jain, Srikanth Doss, Hang Su, Sandesh Swamy, Yanjun Qi
Tailoring Rumor Debunking to You: Diversifying Chinese Rumor-Debunking Passages with an LLM-Driven Simulated Feedback-Enhanced Framework
Xinle Pang, Danding Wang, Qiang Sheng, Yifan Sun, Beizhe Hu, Juan Cao
Synthetic Data Fine-Tuning for Effective Team Formation in Enterprises
Guilherme Drummond Lima, Adriano Veloso
Assertion-Conditioned Compliance: A Provenance-Aware Vulnerability in Multi-Turn Tool-Calling Agents
Daud Waqas, Aaryamaan Golthi, Erika Hayashida, Huanzhi Mao
PROBES : Performance and Relevance Observation for BEtter Search
Sejal Jain, Cyrus Andre DSouza, Jitenkumar Babubhai Rana, Aniket Joshi, Promod Yenigalla
Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning
Minseok Kim, Jingxiang Chen, Seong-Gyun Leem, Yin Huang, Rashi Rungta, Zhicheng Ouyang, Haibin Wu, Surya Teja Appini, Ankur Bansal, Yang Bai, Yue Liu, Florian Metze, Ahmed A Aly, Anuj Kumar, Ariya Rastrow, Zhaojiang Lin
IndicJR: A Judge-Free Benchmark of Jailbreak Robustness in South Asian Languages
Priyaranjan Pattnayak, Sanchari Chowdhuri
Synthesizing question answering data from financial documents: An End-to-End Multi-Agent Approach
Chetan Harsha, Karmvir Singh Phogat, Sridhar Dasaratha, Shashishekar Ramakrishna
Toward Automatic Delegation Extraction in Japanese Law
Tsuyoshi Fujita, Yuya Sawada, Yusuke Sakai, Taro Watanabe
DIALECTIC: A Multi-Agent System for Startup Evaluation
Jae Yoon Bae, Simon Malberg, Joyce Ann Clarize Galang, Andre Retterath, Georg Groh
Long-Context Long-Form Question Answering for Legal Domain
Anagha Kulkarni, Parin Rajesh Jhaveri, Prasha Shrestha, Yu Tong Han, Reza Amini, Behrouz Madahian
ELO: Efficient Layer-Specific Optimization for Continual Pretraining of Multilingual LLMs
HanGyeol Yoo, ChangSu Choi, Minjun Kim, Seohyun Song, SeungWoo Song, Inho Won, Jongyoul Park, Cheoneum Park, KyungTae Lim
MIRAGE: Metadata-guided Image Retrieval and Answer Generation for E-commerce Troubleshooting
Rishav Sahay, Lavanya Sita Tekumalla, Anoop Saladi
CODMAS: A Dialectic Multi-Agent Collaborative Framework for Structured RTL Optimization
Che-Ming Chang, Prashanth Vijayaraghavan, Ashutosh Jadhav, Charles Mackin, Hsinyu Tsai, Vandana Mukherjee, Ehsan Degan
D3: Dynamic Docid Decoding for Multi-Intent Generative Retrieval
Jaeyoung Kim, Dohyeon Lee, Soona Hong, seung-won hwang
DisGraph-RP: Graph-Augmented Temporal Modeling with Aspect-Based Contrastive Encoding of Discharge Summary for Readmission Prediction
Sudeshna Jana, Tirthankar Dasgupta, Manjira Sinha, Pabitra Mitra
CareerPathKG: Knowledge Graph Integrated Framework for Career Intelligence
Ngoc-Quang Le, Duc Duong Hoang, Mai Vu Tran, Thi-Hai-Yen Vuong
A Hybrid Supervised-LLM Pipeline for Actionable Suggestion Mining in Unstructured Customer Reviews
Aakash Trivedi, Aniket Upadhyay, Pratik Narang, Dhruv Kumar, Praveen Kumar
ShopperBench: A Benchmark for Personalized Shopping with Persona-Guided Simulation
Yuan Ling, Chunqing Yuan, Shujing Dong, Yongjian Yang, Nataraj Mocherla, Ayush Goyal
ARQA: A Benchmark for Grounded Table–Text QA in Enterprise Annual Reports
Ruilong Wang, Simone Balloccu
Do Clinical Question Answering Systems Really Need Specialised Medical Fine Tuning?
Sushant Kumar Ray, Gautam Siddharth Kashyap, Sahil Tripathi, Nipun Joshi, Vijay Govindarajan, Rafiq Ali, Jiechao Gao, Usman Naseem
SkiLLens: Recognising and Mapping Novel Skills from Millions of Job Ads Across Europe Using Language Models
Alessia De Santo, Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, Navid Nobani
SYMDIREC: A Neuro-Symbolic Divide-Retrieve-Conquer Framework for Enhanced RTL Synthesis and Summarization
Prashanth Vijayaraghavan, Apoorva Nitsure, Luyao Shi, Charles Mackin, Ashutosh Jadhav, David Beymer, Ehsan Degan, Vandana Mukherjee
Real-time Meme Token Narrative Generation
Han Qiu
Benchmarking and Mitigating the Impact of Noisy User Prompts in Medical VLMs via Cross-Modal Reflection
Zhiyu Xue, Reza Abbasi-Asl, Ramtin Pedarsani
Lightweight Domain-Specific Language Model for Real-Time Structuring of Medical Prescriptions
Jonathan Pattin Cottet, Véronique Eglin, Alex Aussem
Balanced Accuracy: The Right Metric for Evaluating LLM Judges - Explained through Youden’s J statistic
Stephane Collot, Colin Fraser, Justin Zhao, William F. Shen, Timon Willi, Ilias Leontiadis
PharmaQA.IT: an Italian dataset for Q&A in the pharmaceutical domain
Kamyar Zeinalipour, Andrea Zugarini, Asya Zanollo, Leonardo Rigutini
DIRECT: Directional Relevance in Conversational Trajectories
Rajdeep Mukherjee, Anshuman Mourya, Prerna Jolly, Vinayak S Puranik, Sivaramakrishnan R Kaveri