Menu Close

Accepted Full Papers (Main Technical Track)

IDTitleAuthors
4Logic of Knowledge and Cognitive AbilityJia Tao, Xinran Zhang
6Enhancing Offline Reinforcement Learning with Curriculum Learning-Based Trajectory ValuationAmir Abolfazli, Zekun Song, Avishek Anand, Wolfgang Nejdl
14Automatic Verification of Linear Integer Planning Programs via Forgetting in LIAUPFLiangda Fang, Shikang Chen, Xiaoman Wang, Xiaoyou Lin, Chenyi Zhang, Qingliang Chen, Quanlong Guan, Kaile Su
24Order Symmetry: A New Fairness Criterion for Assignment MechanismsRupert Freeman, Geoffrey Pritchard, Mark C. Wilson
27The Strong Core of Housing Markets with Partial Order PreferencesIldikó Schlotter, Lydia Mirabel Mendoza-Cadena
28Game-Theoretically Secure Distributed Protocols for Fair Allocation in Coalitional GamesT-H. Hubert Chan, Qipeng Kuang, Quan Xue
29Enhancing Graph-based Coordination with Evolutionary Algorithms for Episodic Multi-agent Reinforcement LearningKexing Peng, Pengyi Li, Jianye Hao
31Approximating One-Sided and Two-Sided Nash Social Welfare With CapacitiesSalil Gokhale, Harshul Sagar, Rohit Vaish, Jatin Yadav
37Impact Measures for Gradual Argumentation SemanticsCaren Al Anaissy, Jérôme Delobelle, Srdjan Vesic, Bruno Yun
43ReSCOM: Reward-Shaped Curriculum for Efficient Multi-Agent Communication LearningXinghai Wei, Tingting Yuan, Jie Yuan, Dongxiao Liu, Xiaoming Fu
45Game Theory with Simulation in the Presence of Unpredictable RandomisationVojtech Kovarik, Nathaniel Sauerberg, Lewis Hammond, Vincent Conitzer
47Azorus: Commitments over Protocols for BDI AgentsAmit K. Chopra, Matteo Baldoni, Samuel H. Christie V, Munindar P. Singh
63Multi-agent reinforcement learning in the all-or-nothing public goods game on networksBenedikt Valentin Meylahn
67Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language ModelSongjun Tu, Jingbo Sun, Qichao Zhang, Xiangyuan Lan, Dongbin Zhao
73To Spend or to Gain: Online Learning in Repeated Karma AuctionsDamien Berriaud, Ezzat Elokda, Devansh Jalota, Emilio Frazzoli, Marco Pavone, Florian Dorfler
74Offline Goal-Conditioned Reinforcement Learning with Elastic-Subgoal Diffused Policy LearningYaocheng Zhang, Yuanheng Zhu, Yuqian Fu, Songjun Tu, Dongbin Zhao
89Learning in Games with Progressive HidingBenjamin Heymann, Marc Lanctot
91On the Power of Temporal Locality on Online Routing ProblemsSwapnil Guragain, Gokarna Sharma
98MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal ControlSunbowen Lee, Hongqin Lyu, Yicheng Gong, Sun Yingying, Chao Deng
108Gricean Norms as a Basis for Effective CollaborationFardin Saad, Pradeep K. Murukannaiah, Munindar P. Singh
111AdaCred: Adaptive Causal Decision Transformers with Feature CreditingHemant Kumawat, Saibal Mukhopadhyay
112Candidate nomination for Condorcet-consistent voting rulesIldikó Schlotter, Katarína Cechlárová
116Adaptive Episode Length Adjustment for Multi-agent Reinforcement LearningByunghyun Yoo, Younghwan Shin, Hyunwoo Kim, Euisok Chung, Jeongmin Yang
119On the limits of agency in agent-based modelsAyush Chopra, Shashank Kumar, Nurullah Giray Kuru, Ramesh Raskar, Arnau Quera-Bofarull
120Safe Pareto Improvements for Expected Utility Maximizers in Program GamesAnthony Digiovanni, Jesse Clifton, Nicolas Macé
129FLIGHT: Facility Location Integrating Generalized, Holistic Theory of WelfareAvyukta Manjunatha Vummintala, Shivam Gupta, Shweta Jain, Sujit Gujar
146On the Fairness of Additive Welfarist RulesKaren Frilya Celine, Warut Suksompong, Sheung Man Yuen
149Goal Recognition via Variational CausalityJiaqi Wen, Leonardo Rosa Amado
154Boosting Sortition via Proportional RepresentationSoroush Ebadian, Evi Micha
159Dual Ensembled Multiagent Q-Learning with Hypernet RegularizerYaodong Yang, Guangyong Chen, Hongyao Tang, Furui Liu, Danruo Deng, Pheng-Ann Heng
160EconoJax: A Fast & Scalable Economic Simulation in JAXKoen Ponse, Aske Plaat, Niki Van Stein, Thomas M. Moerland
162Tighter Value-Function Approximations for POMDPsMerlijn Krale, Wietze Koops, Sebastian Junges, Thiago D. Simão, Nils Jansen
165Incentivizing Truth Exploration and Honest Reporting: A Contract Design ApproachYuming Shao, Zhixuan Fang
167Personality-Driven Decision Making in LLM-Based Autonomous AgentsLewis Newsham, Daniel Prince
169Causes and Strategies in Multiagent SystemsSylvia S. Kerkhove, Natasha Alechina, Mehdi Dastani
170Learning Graph Representation of Agent DiffusersYoucef Djenouri, Nassim Belmecheri, Tomasz Pawel Michalak, Jan Dubiński, Ahmed Nabil Belbachir, Anis Yazidi
174Probably Correct Optimal Stable Matching for Two-Sided Market Under UncertaintyAndreas Athanasopoulos, Anne-Marie George, Christos Dimitrakakis
187ACORN: Acyclic Coordination with Reachability Network to Reduce Communication Redundancy in Multi-Agent SystemsXie Yi, Ziqing Zhou, Chun Ouyang, Siao Liu, Linqiang Hu, Zhongxue Gan
200Anytime Fairness Guarantees in Stochastic Combinatorial MABs: A Novel Learning FrameworkSubham Pokhriyal, Shweta Jain, Ganesh Ghalme, Vaneet Aggarwal
201Multi-agent Multi-armed Bandits with Minimum Reward Guarantee FairnessPiyushi Manupriya, Himanshu, Sakethanath Jagarlapudi, Ganesh Ghalme
207Nucleolus Credit Assignment for Effective Coalitions in Multi-agent Reinforcement LearningYugu Li, Zehong Cao, Jianglin Qiao, Siyi Hu
212Teamwork Makes the Defense Work: Comprehensive Vulnerability Defense Resource AllocationSiyu Liu, Rida Bazzi, Fei Fang, Tiffany Bao
213Hypothesis-Driven Explainable Goal RecognitionAbeer Alshehri, Hissah Alotaibi, Tim Miller, Mor Vered
224Factorised Active Inference for Strategic Multi-Agent InteractionsJaime Ruiz-Serra, Patrick Sweeney, Michael Harre
239Self-Supervised Multi-Agent Diversity with Nonparametric Entropy MaximizationTianxu Li, Kun Zhu
243Human-Agent Coordination in Games under Incomplete Information via Multi-Step IntentShenghui Chen, Ruihan Zhao, Sandeep P. Chinchali, Ufuk Topcu
244MAGNET: A Multi-Agent Graph Neural Network for Efficient Bipartite Task AssignmentDonald Loveland, James Usevitch, Zachary Serlin, Danai Koutra, Rajmonda S. Caceres
245EduQate: Generating Adaptive Curricula through RMABs in Education SettingsSidney Tio, Dexun Li, Pradeep Varakantham
262Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive ExplorationXingrui Yu, Zhenglin Wan, David Mark Bossens, Yueming Lyu, Qing Guo, Ivor Tsang
263On the Effective Horizon of Inverse Reinforcement LearningYiqing Xu, Finale Doshi-Velez, David Hsu
265Algorithmically Fair Maximization of Multiple Submodular Objective FunctionsGeorgios Amanatidis, Georgios Birmpas, Philip Lazos, Stefano Leonardi, Rebecca Reiffenhäuser
272Real-World Testing Matters in Reinforcement Learning for EducationAnna Riedmann, Carlo D’Eramo, Birgit Lugrin
277FORM: Learning Expressive and Transferable First-Order Logic Reward MachinesLeo Ardon, Daniel Furelos-Blanco, Roko Parać, Alessandra Russo
279Higher-Order Belief in Incomplete Information MAIDsFrancis Rhys Ward, Jack Foxabbott, Rohan Subramani
282Robin Hood Reachability Bidding GamesShaull Almagor, Guy Avni, Neta Dafni
284Approximation Algorithms for Connected Maximum CoverageGianlorenzo D’Angelo, Esmaeil Delfaraz
297Simulating and Evaluating Generative Modeling and Collaborative Filtering in Complex Social NetworksWen Dong, Fairul Mohd-Zaid
301Near-Linear Time Leader Election in Multiagent NetworksAjay Kshemkalyani, Manish Kumar, Anisur Rahaman Molla, Gokarna Sharma
302Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement LearningWillem Röpke, Mathieu Reymond, Patrick Mannion, Diederik M Roijers, Ann Nowé, Roxana Rădulescu
303Curiosity-Driven Partner Selection Accelerates Convention Emergence in Language GamesChin-Wing Leung, Paolo Turrini, Ann Nowe
307Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noiseYuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Pudovikov Andrey
308Investigating the Perspective of Non-Native Speakers on Foreigner-Directed Speech using Virtual Agents: The Role of Racial Ingroup Affiliation and Language Proficiency on Perception and ComprehensionOhenewa Bediako Akuffo, Birgit Lugrin
314Soft Condorcet Optimization for Ranking of General AgentsMarc Lanctot, Kate Larson, Michael Kaisers, Quentin Berthet, Ian Gemp, Manfred Diaz, Roberto-Rafael Maura-Rivero, Yoram Bachrach, Anna Koop, Doina Precup
330Computing Efficient Envy-Free Partial Allocations of Indivisible GoodsRobert Bredereck, Andrzej Kaczmarczyk, Junjie Luo, Bin Sun
331Training Language Models for Social Deduction with Multi-Agent Reinforcement LearningBidipta Sarkar, Warren Xia, Karen Liu, Dorsa Sadigh
334Welfare Approximation in Additively Separable Hedonic GamesMartin Bullinger, Vaggos Chatziafratis, Parnian Shahkar
335Fairly Allocating Goods in ParallelRohan Garg, Alexandros Psomas
337Towards Fair and Efficient Public Transportation: A Bus Stop ModelMartin Bullinger, Edith Elkind, Mohamad Latifian
338Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination DynamicsSomnath Hazra, Pallab Dasgupta, Soumyajit Dey
357Learning Symbolic Task Decompositions for Multi-Agent TeamsAmeesh Shah, Niklas Lauffer, Thomas Chen, Nikhil Pitta, Sanjit A. Seshia
360The Metric Distortion of Randomized Social Choice Functions: C1 Maximal Lottery Rules and SimulationsFabian Frank, Patrick Lederer
363On the Complexity of Learning to Cooperate in Populations of Socially Rational AgentsSaptarashmi Bandyopadhyay, Mustafa Mert Çelikok, Robert Loftin
367A Scoresheet for Explainable AIMichael Winikoff, John Thangarajah, Sebastian Rodriguez
375FGLight: Learning neighbor-level information for Traffic Signal ControlHang Xiao, Huale Li, Shuhan Qi, Jiajia Zhang, Dingzhong Cai
389Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential ExplorationHai Zhong, Xun Wang, Zhuoran Li, Longbo Huang
391Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal HealthArpan Dasgupta, Gagan Jain, Arun Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja
393Truthful and Welfare-maximizing Resource Scheduling with Application to Electric VehiclesRamsundar Anandanarayanan, Swaprava Nath, Prasant Misra
394Harmonious Balanced Partitioning of a Network of AgentsPulkit Agarwal, Harshvardhan Agarwal, Vaibhav Raj, Swaprava Nath
398Incentives for Early Arrival in Cost SharingJunyu Zhang, Yao Zhang, Yaoxin Ge, Dengji Zhao, Hu Fu, Zhihao Gavin Tang, Pinyan Lu
418Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature SelectionJiafan Zhuang, Gaofei Han, Zihaoxia, Che Lin, Boxi Wang, Wenji Li, Wangdongliang, Zhun Fan, Ruichu Cai, Zhifeng Hao
420Fairness and Optimality in RoutingSreenivas Gollapudi, Kostas Kollias, Alkmini Sgouritsa, Ali Kemal Sinop
424Insights Regarding the Success of Damping in Improving Belief PropagationUriel Zaed, Roie Zivan, Omer Lev
427Translating Multi-Agent Modal Logics of Knowledge and Belief into Decidable First-Order FragmentsQihui Feng, Hannah Wilk, Shakil M Khan, Gerhard Lakemeyer
434Truthful mechanisms for linear bandit games with private contextsYiting Hu, Lingjie Duan
436Model and Mechanisms of Consent for Responsible AutonomyAnastasia S. Apeiron, Davide Dell’Anna, Pradeep K. Murukannaiah, Pinar Yolum
441A Minimax-Bayes Approach to Ad Hoc TeamworkVictor Villin, Thomas Kleine Buening, Christos Dimitrakakis
442Networked Agents in the Dark: Team Value Learning under Partial ObservabilityGuilherme S. Varela, Alberto Sardinha, Francisco S. Melo
455Game-Theoretic Goal Recognition in Time-Sensitive ApplicationsSara Bernardini, Fabio Fagnani, Santiago Franco
457Artificial Agents Mitigate The Punishment Dilemma Of Indirect ReciprocityAlexandre S. Pires, Fernando P. Santos
461Leveraging Score-based Models for Generating Penalization in Model-based Offline Reinforcement LearningZeyuan Liu, Zhirui Fang, Jiafei Lyu, Xiu Li
462Opinion Dynamics with Median AggregationPetra Berenbrink, Martin Hoefer, Dominik Kaaser, Marten Maack, Malin Rau, Lisa Wilhelmi
468Learning with Limited Shared Information in Multi-agent Multi-armed BanditJunning Shao, Siwei Wang, Zhixuan Fang
470The Bakers and Millers Game with Restricted LocationsSimon Krogmann, Pascal Lenzner, Alexander Skopalik
472Fair Division in a Variable SettingHarish Chandramouleeswaran, Prajakta Nimbhorkar, Nidhi Rathi
481Compositional Shielding and Reinforcement Learning for Multi-Agent SystemsAsger Horn Brorholt, Kim Guldstrand Larsen, Christian Schilling
495On the Hardness of Fair Allocation under Ternary ValuationsZack Fitzsimmons, Vignesh Viswanathan, Yair Zick
499The Many Challenges of Human-Like Agents in Virtual Game EnvironmentsMaciej Świechowski, Dominik Slezak
500Tackling Temporal Deontic Challenges with Equilibrium LogicDavide Soldà, Pedro Cabalar, Agata Ciabattoni, Emery A. Neufeld
511Simplifying imperfect recall gamesHugo Gimbert, Soumyajit Paul, B. Srivathsan
513Non-obvious Manipulability in Hedonic Games with Friends Appreciation PreferencesMichele Flammini, Maria Fomenko, Giovanna Varricchio
517Minimizing Rosenthal’s Potential in Monotone Congestion GamesVittorio Bilò, Angelo Fanelli, Laurent Gourvès, Christos Tsoufis, Cosimo Vinci
518Alternating-time Temporal Logic with Stochastic AbilitiesGabriel Ballot, Vadim Malvone, Jean Leneutre, Jingxuan Ma, Mourad Leslous
521Timed Obstruction Logic: A Timed Approach to Dynamic Game ReasoningJames Ortiz, Vadim Malvone, Jean Leneutre
523Temporal Fair Division of Indivisible ItemsEdith Elkind, Alexander Lam, Mohamad Latifian, Tzeh Yuan Neoh, Nicholas Teh
526Unveiling Decision Intention for Cooperative Multi-Agent Reinforcement LearningZeren Zhang, Zhiwei Xu, Guangchong Zhou, Dapeng Li, Bin Zhang, Guoliang Fan
530Rational Capability in Concurrent GamesYinfeng Li, Emiliano Lorini, Munyque Mittelmann
533Dynamic Sight Range Selection in Multi-Agent Reinforcement LearningWeichen Liao, Ti-Rong Wu, I-Chen Wu
534Synergistic Traffic AssignmentThomas Bläsius, Adrian Feilhauer, Markus Jung, Moritz Laupichler, Peter Sanders, Michael Zündorf
544Beyond Words: Integrating Personality Traits and Context-Driven Gestures in Human-Robot InteractionsTahsin Tariq Banna, Dr. Sejuti Rahman, Dr. Mohammad Tareq
550Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement LearningLukas Schäfer, Oliver Slumbers, Stephen Marcus Mcaleer, Yali Du, Stefano V Albrecht, David Henry Mguni
564Maximizing Truth Learning in a Social Network is NP-hardFilip Úradník, Amanda Wang, Jie Gao
566Efficient and Optimal Policy Gradient Algorithm for Corrupted Multi-armed BanditsJiayuan Liu, Siwei Wang, Zhixuan Fang
567Adaptive Bi-Level Multi-Robot Task Allocation and Learning under Uncertainty with Temporal Logic ConstraintsXiaoshan Lin, Roberto Tron
571Socratic: Enhancing Human Teamwork via AI-enabled CoachingSangwon Seo, Bing Han, Rayan Ebnali Harari, Roger Daglius Dias, Marco A. Zenati, Eduardo Salas, Vaibhav V. Unhelkar
579Condorcet Winners and Anscombe’s Paradox Under Weighted Binary VotingCarmel Baharav, Andrei Constantinescu, Roger Wattenhofer
580Planning, scheduling, and execution on the Moon: the CADRE technology demonstration missionGregg Rabideau, Joseph A. Russino, Andrew Branch, Nihal N. Dhamani, Tiago Vaquero, Steve Chien, Jean-Pierre De La Croix, Federico Rossi
587ApproxED: Approximate Exploitability Descent via Learned Best ResponsesCarlos Martin, Tuomas Sandholm
592SCMRAG: Self-Corrective Multihop Retrieval Augmented Generation System for LLM AgentsRishabh Agrawal, Murtaza Asrani, Hadi Youssef, Apurva Narayan
602k-Approval Veto: A Spectrum of Voting Rules Balancing Metric Distortion and Minority ProtectionFatih Erdem Kizilkaya, David Kempe
605An Organizationally-Oriented Approach to Enhancing Explainability and Control in Multi-Agent Reinforcement LearningJulien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron
607Bottom-Up Reputation Promotes Cooperation with Multi-Agent Reinforcement LearningTianyu Ren, Xuan Yao, Yang Li, Xiao-Jun Zeng
609Combining Planning and Reinforcement Learning for Solving Relational Multiagent DomainsNikhilesh Prabhakar, Ranveer Singh, Harsha Kokel, Sriraam Natarajan, Prasad Tadepalli
612In-context Learning from Language Models can Improve Embodied Instruction-followingPengyuan Wang, Jing-Cheng Pang, Wang Chenyang, Xu-Hui Liu, Tian-Shuo Liu, Si-Hang Yang, Yang Yu, Hong Qian
614Approximation Ratio for Preference Aggregation Using Tree CP-NetsAbu Mohammad Hammad Ali, Daniel Ogundare, Boting Yang, Sandra Zilles
621Who Am I Dealing With? Explaining the Designer’s Hidden IntentionsTurgay Caglar, Sarath Sreedharan, Mor Vered
622Scalable Offline Reinforcement Learning for Mean Field GamesAxel Brunnbauer, Julian Lemmel, Zahra Babaiee, Sophie A. Neubauer, Radu Grosu
628Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis PerspectiveChuxiong Sun, Peng He, Rui Wang, Changwen Zheng
630TACTIC: Task-Agnostic Contrastive pre-Training for Inter-Agent CommunicationPeihong Yu, Manav Mishra, Syed Zaidi, Pratap Tokekar
643Value Iteration for Learning Concurrently Executable Robotic Control TasksSheikh A. Tahmid, Gennaro Notomista
646Loss of Plasticity: A New Perspective on Solving Multi-Agent Exploration for Sparse Reward TasksZehua Zang, Chuxiong Sun, Lixiang Liu, Fuchun Sun, Changwen Zheng
657More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack RequestsAli Safarpoor Dehkordi, Ahad N. Zehmakan
662Global Behavior of Learning Dynamics in Zero-Sum Games with Memory AsymmetryYuma Fujimoto, Kaito Ariu, Kenshi Abe
664EFX Allocations and Orientations on Bipartite Multi-graphs: A Complete PictureMahyar Afshinmehr, Alireza Danaei, Mehrafarin Kazemi, Kurt Mehlhorn, Nidhi Rathi
667Human-Aligned Skill Discovery: Balancing Behaviour Exploration and AlignmentMaxence Hussonnois, Thommen George Karimpanal, Santu Rana
671Fair Allocation of Divisible Goods under Non-Linear ValuationsHaris Aziz, Zixu He, Xinhang Lu, Kaiyang Zhou
683Data Pricing for Graph Neural Networks without Pre-purchased InspectionYiping Liu, Mengxiao Zhang, Jiamou Liu, Song Yang
696Housing Market on NetworksXinwei Song, Tianyi Yang, Dengji Zhao
699Explaining Facial Expression RecognitionSanjeev Nahulanthran, Leimin Tian, Dana Kulic, Mor Vered
704Consistency Policy with Categorical Critic for Autonomous DrivingXing Fang, Qichao Zhang, Haoran Li, Dongbin Zhao
709FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHFFlint Xiaofeng Fan, Cheston Tan, Yew-Soon Ong, Roger Wattenhofer, Wei Tsang Ooi
712Tackling Sparsity in Designated Driver Dispatch with Multi-Agent Reinforcement LearningJiaxuan Jiang, Ling Pan, Lin Zhou, Longbo Huang, Zhixuan Fang
714Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based PlannersFengming Zhu, Fangzhen Lin
727Asymptotic Existence of Class Envy-free MatchingsTomohiko Yokoyama, Ayumi Igarashi
729Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement LearningJingbo Sun, Songjun Tu, Qichao Zhang, Ke Chen, Dongbin Zhao
730Equilibrium Analysis in Markets with Asymmetric Utility FunctionsMartin Bichler, Markus Ewert, Axel Ockenfels
738Monte Carlo Tree Search with Velocity Obstacles for safe and efficient motion planning in dynamic environmentsLorenzo Bonanni, Daniele Meli, Alberto Castellini, Alessandro Farinelli
739Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase transitions in the Perturbed CultureFrançois Durand
756An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative TasksGeorge Papadopoulos, Andreas Kontogiannis, Foteini Papadopoulou, Chaido Poulianou, Ioannis Koumentis, George Vouros
762Mitigating Value Conflicts with Computational Theory of MindEmre Erdogan, Hüseyin Aydın, Frank Dignum, Rineke Verbrugge, Pinar Yolum
765Free Argumentative Exchanges for Explaining Image ClassifiersAvinash Kori, Antonio Rago, Francesca Toni
771A View of the Certainty-Equivalence Method for PAC RL as an Application of the Trajectory Tree MethodShivaram Kalyanakrishnan, Sheel Shah, Santhosh Kumar Guguloth
778GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-ThoughtSungsik Kim, Baek Janghyun, Jinkyu Kim, Jaekoo Lee
787PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement LearningKun Hu, Muning Wen, Xihuai Wang, Shao Zhang, Yiwei Shi, Minne Li, Minglong Li, Ying Wen
790Learning Real-Life Approval ElectionsPiotr Faliszewski, Łukasz Janeczko, Andrzej Kaczmarczyk, Marcin Kurdziel, Grzegorz Pierczyński, Stanisław Szufa
795Decentralized Planning Using Probabilistic HyperpropertiesFrancesco Pontiggia, Filip Macák, Roman Andriushchenko, Michele Chiari, Milan Ceska
797Games in Public Announcement: How to Reduce System Losses in Optimistic Blockchain MechanismsSiyuan Liu, Yulong Zeng
798Predictability Awareness for Efficient and Robust Multi-Agent CoordinationRomán Chiva Gil, Daniel Jarne Ornia, Khaled A. Mustafa, Javier Alonso-Mora
803Improving Policy Optimization via 𝜺-RetrainLuca Marzari, Priya L. Donti, Changliu Liu, Enrico Marchesini
805On Stateful Value Factorization in Multi-Agent Reinforcement LearningEnrico Marchesini, Andrea Baisero, Rupali Bhati, Christopher Amato
813Temporal Network Creation Games: The Impact of Non-Locality and TerminalsDavide Bilò, Sarel Cohen, Tobias Friedrich, Hans Gawendowicz, Nicolas Klodt, Pascal Lenzner, George Skretas
819Multi-Ship Future Interaction Trajectory Prediction via Pre-Initializer Diffusion ModelKun Ma, Qilong Han, Jingzheng Yao
827Resource Task GamesJessica L. Newman, Enrico Gerding, Enrico Marchioni, Baharak Rastegari
837From Natural Language to Extensive-Form Game RepresentationsShilong Deng, Yongzhao Wang, Rahul Savani
845Ranking Joint Policies in Dynamic Games using Evolutionary DynamicsNatalia Koliou, George Vouros
865Greedy ABA Learning for Case-Based ReasoningEmanuele De Angelis, Maurizio Proietti, Francesca Toni
867OGS-SLAM: Hybrid ORB-Gaussian Splatting SLAMXiaohan Li, Wenxiang Shen, Dong Liu, Jun Wu
870Selfish Behavior and Resource Competition in Multi-Agent SystemsCostas Courcoubetis, Antonis Dimakis
874Formalising Overdetermination in a Labelled Transition SystemGauvain Bourgne, Camilo Sarmiento, Jean Gabriel Gustave Ganascia
887A Simple Integration of Epistemic Logic and Reinforcement LearningThorsten Engesser, Thibaut Le Marre, Emiliano Lorini, François Schwarzentruber, Bruno Zanuttini
900Agent-Based Analysis of Green Disclosure Policies and Their Market-Wide Impact on Firm BehaviorLingxiao Zhao, Maria Polukarov, Carmine Ventre
905An Improved Mechanism for Pricing Ride-Hailing FaresMarek Adamczyk, Maurycy Borkowski, Michał Pawłowski
907Computing Efficient and Envy-Free Allocations under Dichotomous Preferences using SATAri Conati, Andreas Niskanen, Ronald De Haan, Matti Järvisalo
908Conditional Max-Sum for Asynchronous Multiagent Decision MakingDimitrios Troullinos, Georgios Chalkiadakis, Ioannis Papamichail, Markos Papageorgiou
909Probabilistic Timed ATLWojciech Jamroga, Marta Kwiatkowska, Wojciech Penczek, Laure Petrucci, Teofil Sidoruk
915On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite FlowTonghan Wang, Heng Dong, Yanchen Jiang, David C. Parkes, Milind Tambe
917Policy Abstraction and Nash Refinement in Tree-Exploiting PSROChristine Konicki, Mithun Chakraborty, Michael P. Wellman
929Practical Abstractions for Model Checking Continuous-Time Multi-Agent SystemsYan Kim, Wojciech Jamroga, Wojciech Penczek, Laure Petrucci
930Taming Multi-Agent Reinforcement Learning with Estimator Variance ReductionTaher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew E. Taylor, Kun Shao, Jun Wang, David Henry Mguni,
932Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied AgentsSabit Hassan, Hye-Young Chung, Xiang Zhi Tan, Malihe Alikhani
933Selecting Interlacing CommitteesChris Dong, Martin Bullinger, Tomasz Wƒös, Larry Birnbaum, Edith Elkind
937Bidding Games on Markov Decision Processes with Quantitative Reachability ObjectivesGuy Avni, Martin Kureƒçka, Kaushik Mallik, Petr Novotný, Suman Sadhukhan
939Eliminating Majority IllusionFoivos Fioravantes, Abhiruk Lahiri, Antonio Lauerbach, Lluís Sabater, Marie Diana Sieper, Samuel Wolf
941EnEnv 1.0: Energy Grid Environment for Multi-Agent Reinforcement Learning BenchmarkingDominik Jacek Bogucki, Łukasz Eugeniusz Lepak, Sonam Parashar, Bart Blachowski, Paweł Wawrzyński
942Emergence of Recursive Language through Bootstrapping and Iterated LearningVikas Kumar, Ajin George Joseph
943Voter Model Meets Rumour Spreading: A Study of Consensus Protocols on Graphs with Agnostic NodesMarcelo Matheus Gauy, Anna Abramishvili, Eduardo Colli, Tiago Madeira, Frederik Mallmann-Trenn, Vinícius Franco Vasconcelos, David Kohan Marzagao
944Neural DNF-MT: A Neuro-symbolic Approach for Learning Interpretable and Editable PoliciesKexin Gu Baugh, Luke Dickens, Alessandra Russo
946Multi-objective Reinforcement Learning with Nonlinear Preferences: Provable Approximation for Maximizing Expected Scalarized ReturnNianli Peng, Muhang Tian, Brandon Fain
950Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian NetworkVincent Hsiao, Mark Roberts, Laura M. Hiatt, George Konidaris, Dana S. Nau
963Local Topological Information as a Powerful Enhancer for Generalizable Neural Method in Travelling Salesman ProblemXiaoxin Bai, Junyang Yang, Shengchao Yuan, Yinghao Zhang, Hanqian Wu
966Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term RewardsFangqi Liu, Rishav Sen, Jose Paolo Talusan, Ava Pettet, Aaron Kandel, Yoshinori Suzue, Ayan Mukhopadhyay, Abhishek Dubey
968Certified Guidance for Planning with Deep Generative ModelsFrancesca Cairoli, Francesco Giacomarra, Mehran Hosseini, Nicola Paoletti
973Hierarchical Imitation Learning of Team Behavior from Heterogeneous DemonstrationsSangwon Seo, Vaibhav V. Unhelkar
977LTL Verification of Memoryful Neural AgentsMehran Hosseini, Alessio Lomuscio, Nicola Paoletti
992Large Language Models for Virtual Human Gesture SelectionParisa Ghanad Torshizi, Laura B. Hensel, Ari Shapiro, Stacy Marsella
999Policy Graphs and Intention: answering ‘why’ and ‘how’ from a telic perspectiveVictor Gimenez-Abalos, Sergio Alvarez-Napagao, Adrián Tormos, Ulises Cortés, Javier Vazquez-Salceda
1005HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement LearningKryspin Varys, Federico Cerutti, Adam Sobey, Timothy J. Norman
1008Extending Consensus-based Task Allocation Algorithms with Bid Intercession to Foster Mixed-InitiativeVictor Guillet, Charles Lesire, Gauthier Picard, Christophe Grand
1009Agent-based Modeling and Simulation of Ambiguity in Catastrophe Insurance MarketsYu Bi, Lingxiao Zhao, Jinyun Tong, Zhe Feng, Carmine Ventre
1010Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource AllocationGuojun Xiong, Haichuan Wang, Yuqi Pan, Saptarshi Mandal, Sanket Shah, Niclas Boehmer, Milind Tambe
1020Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML MonitoringGusseppe Bravo-Rocca, Peini Liu, Jordi Guitart, Rodrigo M Carrillo-Larco, Ajay Dholakia, David Ellison
1022Generalised BDI PlanningFelipe Meneguzzi, Ramon Fraga Pereira, Nir Oren
1026Uncertainty Expression for Human-Robot Task CommunicationDavid Porfirio, Mark Roberts, Laura M. Hiatt
1029Smooth Information Gathering in Two-Player Noncooperative GamesFernando Palafox, Jesse Milzman, David Fridovich-Keil, Dong Ho Lee, Ryan Park
1034Uncertain Machine Ethics PlanningSimon Kolker, Louise A. Dennis, Ramon Fraga Pereira, Mengwei Xu
1036xSRL: Safety-Aware Explainable RL – Safety as a Product of ExplainabilityRisal Shahriar Shefin, Md Asifur Rahman, Thai Le, Sarra Alqahtani
1038Robustness of Epistemic Gossip Protocols Against Data LossYoshikatsu Kobayashi, Koji Hasebe
1049Sea-cret Agents: Maritime Abduction for Region Generation to Expose Dark Vessel TrajectoriesDivyagna Bavikadi, Nathaniel Lee, Chad Parvis, Paulo Shakarian
1062On the Gale-Shapley Algorithm for Stable Matchings with a Partial Honesty Nash RefinementJames Patrick Bailey, Craig Tovey
1064The Price of Anarchy in Spatial Social ChoiceJames Patrick Bailey, Craig Tovey
1068An AI-Driven Card Playing Robot: An Empirical Study on Communicative Style and Embodiment with Elderly AdultsMichael Banck, Elisabeth Ganal, Hanna-Finja Weichert, Frank Puppe, Birgit Lugrin
1102Indifferential Privacy: A New Paradigm and Its Applications to Optimal Matching in Dark Pool AuctionsAntigoni Polychroniadou, T-H. Hubert Chan, Adya Agrawal
1103Dynamic Coalition Structure Detection in Natural-Language-based InteractionsAbhishek Ninad Kulkarni, Andy Liu, Jean-Raphaël Gaglione, Daniel Fried, Ufuk Topcu
1107Reputation-Filtered Reward Reshaping: Encouraging Cooperation in High Dimensional Semi-Cooperative Multi-agent SettingsHassan Raissouni, Wissal Bekhti, Btissam El Khamlichi, Amal Seghrouchni
1109Game of Thoughts: Iterative Reasoning in Game-Theoretic Domains with Large Language ModelsBenjamin Kempinski, Ian Gemp, Kate Larson, Yoram Bachrach, Marc Lanctot, Tal Kachman
1113Multi-Objective Planning with Contextual Lexicographic Reward PreferencesPulkit Rustagi, Yashwanthi Anand, Sandhya Saisubramanian
1139Discovery and Deployment of Emergent Robot Swarm Behaviors via Representation Learning and Real2Sim2Real TransferConnor Mattson, Varun Raveendra, Ricardo Vega, Cameron Nowzari, Daniel S. Drew, Daniel S. Brown
1140Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit AssignmentKartik Nagpal, Dayi Ethan Dong, Negar Mehr
1142Who Reviews The Reviewers? A Multi-Level Jury ProblemBen Abramowitz, Omer Lev, Nicholas Mattei
1153On Some Fundamental Problems for Multi-Agent Systems Over Multilayer NetworksDaniel Rosenkrantz, Madhav Marathe, Zirou Qiu, S. S. Ravi, Richard Stearns
1176Hitchhiker’s Guide to Patrolling: Path-Finding for Energy-Sharing Drone-UGV TeamsJonathan Diller, Qi Han, Robert Byers, James Dotterweich, James Humann
1188Learning Collusion in Episodic, Inventory-Constrained MarketsPaul Friedrich, Barna Pásztor, Giorgia Ramponi
1189Counterfactual Explanations for Model Ensembles Using Entropic Risk MeasuresErfaun Noorani, Pasan Dissanayake, Faisal Hamman, Sanghamitra Dutta
1207Self-Interpretable Reinforcement Learning via Rule EnsemblesYue Yang, Fan Yang, Yu Bai, Hao Wang
1208𝛽-DQN: Improving Deep Q-Learning By Evolving the BehaviorHongming Zhang, Fengshuo Bai, Chenjun Xiao, Chao Gao, Bo Xu, Martin Müller
1209The Degree of (Extended) Justified Representation and Its OptimizationBiaoshuai Tao, Chengkai Zhang, Houyu Zhou
1218Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing ProblemsYuxin Pan, Ruohong Liu, Yize Chen, Zhiguang Cao, Fangzhen Lin
1226Full Proportional Justified RepresentationYusuf Hakan Kalayci, Jiasen Liu, David Kempe
1233Reinforcement Learning Based Simulated AnnealingNathan Qiu, Daniel Dali Liang
1250ShipNaviSim: Data-Driven Simulation for Real-World Maritime NavigationQuang Anh Pham, Janaka Chathuranga Brahmanage, Akshat Kumar
1255Mean Field Correlated Imitation LearningZhiyu Zhao, Chengdong Ma, Qirui Mi, Ning Yang, Xue Yan, Mengyue Yang, Haifeng Zhang, Jun Wang, Yaodong Yang,
1265Conformal Set-based Human-AI Complementarity with Multiple ExpertsHelbert Paat, Guohao Shen
1269Geometric Freeze-Tag ProblemSharareh Alipour, Kajal Baghestani, Mahdis Mirzaei, Soroush Sahraei
1273On Learning Informative Trajectory Embeddings for Imitation, Classification and RegressionZichang Ge, Changyu Chen, Arunesh Sinha, Pradeep Varakantham
1283Emit As You Go: Enumerating Edges of a Spanning TreeKatrin Casel, Stefan Neubert
1289Optimising expectation with guarantees for window mean payoff in Markov decision processesPranshu Gaba, Shibashis Guha
1293Maximizing Value in Challenge the Champ TournamentsUmang Bhaskar, Juhi Chaudhary, Palash Dey
1303DUPRE: Data Utility Prediction for Efficient Data ValuationPham Kieu Thao Nguyen, Rachael Hwee Ling Sim, Quoc Phong Nguyen, See-Kiong Ng, Bryan Kian Hsiang Low
1304Surprise! Surprise! Learn and AdaptHuma Samin, Dylan J. Walton, Nelly Bencomo
1307Preventing Misinformation with Redundancy in Emergent CommunicationFábio Vital, Alberto Sardinha, Francisco S. Melo
1317Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage ConstraintsMohamed S. Talamali, Genki Miyauchi, Thomas Watteyne, Micael Santos Couceiro, Roderich Gross
1319Beyond Goal Recognition: A Reinforcement Learning-based Approach to Inferring Agent BehaviourSheryl Mantik, Michael Dann, Minyi Li, Huong Ha, Julie Porteous
1338Towards Efficient Online Goal Recognition through Deep LearningLorenzo Serina, Mattia Chiari, Alfonso Gerevini, Luca Putelli, Ivan Serina
1346Parameterized Algorithms for Multiagent Pathfinding on TreesArgyrios Deligkas, Eduard Eiben, Robert Ganian, Iyad A. Kanj, Ramanujan Sridharan
1347Enhancing Sub-Optimal Trajectory Stitching: Spatial Composition RvS for Offline RLSheng Zang, Zhiguang Cao, Bo An, Senthilnath Jayavelu, Xiaoli Li
1351The effect of agent-based feedback on prosociality in social dilemmasJennifer Renoux, Filipa Correia, Joana Campos, Lucas Morillo-Mendez, Neziha Akalin, Fernando P. Santos, Ana Paiva
1370Offline Multi-Agent Preference-based Reinforcement Learning with Agent-aware Direct Preference OptimizationQian Kou, Mingyang Li, Zeyang Liu, Long Qian, Zhuoran Chen, Lipeng Wan, Xingyu Chen, Xuguang Lan
1374MOSMAC: A Multi-agent Reinforcement Learning Benchmark on Sequential Multi-Objective TasksMinghong Geng, Shubham Pateria, Budhitama Subagdja, Ah-Hwee Tan
1377Byzantine Game Theory: Sun Tzu’s BoxesAndrei Constantinescu, Roger Wattenhofer
1381Modeling the Centaur: Human-Machine Synergy in Sequential Decision MakingDavid Shoresh, Yonatan Loewenstein
1385Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous PlaysAritra Pal, Anandsingh Chauhan, Mayank Baranwal
1386Uncertainty-Aware Opponent Modeling for Deep Reinforcement LearningLikun Yang, Pei Xu, Shiyue Cao, Yongjian Ren, Xiaotang Chen, Kaiqi Huang
1387Contrastive Explainable Clustering with Differential PrivacyAriel Vetzler, Dung Nguyen, Sarit Kraus, Anil Vullikanti
1388Evaluation-Time Policy Switching for Offline Reinforcement LearningNatinael Solomon Neggatu, Jeremie Houssineau, Giovanni Montana
1389CAMP: Collaborative Attention Model with Profiles for Vehicle Routing ProblemsChuanbo Hua, Federico Berto, Jiwoo Son, Seunghyun Kang, Changhyun Kwon, Jinkyoo Park
1390Responsible Uplift ModelingLihi Idan, Ming Li
1391Composing Reinforcement Learning Policies, with Formal GuaranteesFlorent Delgrange, Guy Avni, Anna Lukina, Christian Schilling, Ann Nowé, Guillermo Perez
1392Minimizing Makespan with Conflict-Based Search for Optimal Multi-Agent Path FindingAmir Maliah, Dor Atzmon, Ariel Felner
1393Towards Envy-Freeness Relaxations for General Nonmonotone ValuationsUmang Bhaskar, Gunjan Kumar, Yeshwant Pandit, Rakshitha
1394On the Structure of EFX Orientations on GraphsJinghan Zeng, Ruta Mehta
1396Changing the Rules of the Game: Reasoning About Dynamic Phenomena in Multi-Agent SystemsRustam Galimullin, Maksim Gladyshev, Munyque Mittelmann, Nima Motamed