Menu Close

Accepted Full Papers (Main Technical Track)

IDTitleAuthors
461Leveraging Score-based Models for Generating Penalization in Model-based Offline Reinforcement LearningZeyuan Liu, Zhirui Fang, Jiafei Lyu, Xiu Li
517Minimizing Rosenthal’s Potential in Monotone Congestion GamesVittorio Bilò, Angelo Fanelli, Laurent Gourvès, Christos Tsoufis, Cosimo Vinci
284Approximation Algorithms for Connected Maximum CoverageGianlorenzo D’Angelo, Esmaeil Delfaraz
973Hierarchical Imitation Learning of Team Behavior from Heterogeneous DemonstrationsSangwon Seo, Vaibhav V. Unhelkar
120Safe Pareto Improvements for Expected Utility Maximizers in Program GamesAnthony Digiovanni, Jesse Clifton, Nicolas Macé
187ACORN: Acyclic Coordination with Reachability Network to Reduce Communication Redundancy in Multi-Agent SystemsXie Yi, Ziqing Zhou, Chun Ouyang, Siao Liu, Linqiang Hu, Zhongxue Gan
213Hypothesis-Driven Explainable Goal RecognitionAbeer Alshehri, Hissah Alotaibi, Tim Miller, Mor Vered
932Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied AgentsSabit Hassan, Hye-Young Chung, Xiang Zhi Tan, Malihe Alikhani
727Asymptotic Existence of Class Envy-free MatchingsTomohiko Yokoyama, Ayumi Igarashi
790Learning Real-Life Approval ElectionsPiotr Faliszewski, Łukasz Janeczko, Andrzej Kaczmarczyk, Marcin Kurdziel, Grzegorz Pierczyński, Stanisław Szufa
174Probably Correct Optimal Stable Matching for Two-Sided Market Under UncertaintyAndreas Athanasopoulos, Anne-Marie George, Christos Dimitrakakis
1107Reputation-Filtered Reward Reshaping: Encouraging Cooperation in High Dimensional Semi-Cooperative Multi-agent SettingsHassan Raissouni, Wissal Bekhti, Btissam El Khamlichi, Amal Seghrouchni
98MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal ControlSunbowen Lee, Hongqin Lyu, Yicheng Gong, Sun Yingying, Chao Deng
367A Scoresheet for Explainable AIMichael Winikoff, John Thangarajah, Sebastian Rodriguez
1176Hitchhiker’s Guide to Patrolling: Path-Finding for Energy-Sharing Drone-UGV TeamsJonathan Diller, Qi Han, Robert Byers, James Dotterweich, James Humann
47Azorus: Commitments over Protocols for BDI AgentsAmit K. Chopra, Matteo Baldoni, Samuel H. Christie V, Munindar P. Singh
314Soft Condorcet Optimization for Ranking of General AgentsMarc Lanctot, Kate Larson, Michael Kaisers, Quentin Berthet, Ian Gemp, Manfred Diaz, Roberto-Rafael Maura-Rivero, Yoram Bachrach, Anna Koop, Doina Precup
363On the Complexity of Learning to Cooperate in Populations of Socially Rational AgentsSaptarashmi Bandyopadhyay, Mustafa Mert Çelikok, Robert Loftin
1209The Degree of (Extended) Justified Representation and Its OptimizationBiaoshuai Tao, Chengkai Zhang, Houyu Zhou
1010Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource AllocationGuojun Xiong, Haichuan Wang, Yuqi Pan, Saptarshi Mandal, Sanket Shah, Niclas Boehmer, Milind Tambe
699Explaining Facial Expression RecognitionSanjeev Nahulanthran, Leimin Tian, Dana Kulic, Mor Vered
1351The effect of agent-based feedback on prosociality in social dilemmasJennifer Renoux, Filipa Correia, Joana Campos, Lucas Morillo-Mendez, Neziha Akalin, Fernando P. Santos, Ana Paiva
614Approximation Ratio for Preference Aggregation Using Tree CP-NetsAbu Mohammad Hammad Ali, Daniel Ogundare, Boting Yang, Sandra Zilles
28Game-Theoretically Secure Distributed Protocols for Fair Allocation in Coalitional GamesT-H. Hubert Chan, Qipeng Kuang, Quan Xue
992Large Language Models for Virtual Human Gesture SelectionParisa Ghanad Torshizi, Laura B. Hensel, Ari Shapiro, Stacy Marsella
239Self-Supervised Multi-Agent Diversity with Nonparametric Entropy MaximizationTianxu Li, Kun Zhu
1142Who Reviews The Reviewers? A Multi-Level Jury ProblemBen Abramowitz, Omer Lev, Nicholas Mattei
302Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement LearningWillem Röpke, Mathieu Reymond, Patrick Mannion, Diederik M Roijers, Ann Nowé, Roxana Rădulescu
683Data Pricing for Graph Neural Networks without Pre-purchased InspectionYiping Liu, Mengxiao Zhang, Jiamou Liu, Song Yang
149Goal Recognition via Variational CausalityJiaqi Wen, Leonardo Rosa Amado
521Timed Obstruction Logic: A Timed Approach to Dynamic Game ReasoningJames Ortiz, Vadim Malvone, Jean Leneutre
499The Many Challenges of Human-Like Agents in Virtual Game EnvironmentsMaciej Świechowski, Dominik Slezak
622Scalable Offline Reinforcement Learning for Mean Field GamesAxel Brunnbauer, Julian Lemmel, Zahra Babaiee, Sophie A. Neubauer, Radu Grosu
212Teamwork Makes the Defense Work: Comprehensive Vulnerability Defense Resource AllocationSiyu Liu, Rida Bazzi, Fei Fang, Tiffany Bao
45Game Theory with Simulation in the Presence of Unpredictable RandomisationVojtech Kovarik, Nathaniel Sauerberg, Lewis Hammond, Vincent Conitzer
612In-context Learning from Language Models can Improve Embodied Instruction-followingPengyuan Wang, Jing-Cheng Pang, Wang Chenyang, Xu-Hui Liu, Tian-Shuo Liu, Si-Hang Yang, Yang Yu, Hong Qian
771A View of the Certainty-Equivalence Method for PAC RL as an Application of the Trajectory Tree MethodShivaram Kalyanakrishnan, Sheel Shah, Santhosh Kumar Guguloth
550Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement LearningLukas Schäfer, Oliver Slumbers, Stephen Marcus Mcaleer, Yali Du, Stefano V Albrecht, David Henry Mguni
1269Geometric Freeze-Tag ProblemSharareh Alipour, Kajal Baghestani, Mahdis Mirzaei, Soroush Sahraei
966Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term RewardsFangqi Liu, Rishav Sen, Jose Paolo Talusan, Ava Pettet, Aaron Kandel, Yoshinori Suzue, Ayan Mukhopadhyay, Abhishek Dubey
108Gricean Norms as a Basis for Effective CollaborationFardin Saad, Pradeep K. Murukannaiah, Munindar P. Singh
43ReSCOM: Reward-Shaped Curriculum for Efficient Multi-Agent Communication LearningXinghai Wei, Tingting Yuan, Jie Yuan, Dongxiao Liu, Xiaoming Fu
1381Modeling the Centaur: Human-Machine Synergy in Sequential Decision MakingDavid Shoresh, Yonatan Loewenstein
605An Organizationally-Oriented Approach to Enhancing Explainability and Control in Multi-Agent Reinforcement LearningJulien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron
31Approximating One-Sided and Two-Sided Nash Social Welfare With CapacitiesSalil Gokhale, Harshul Sagar, Rohit Vaish, Jatin Yadav
1109Game of Thoughts: Iterative Reasoning in Game-Theoretic Domains with Large Language ModelsBenjamin Kempinski, Ian Gemp, Kate Larson, Yoram Bachrach, Marc Lanctot, Tal Kachman
331Training Language Models for Social Deduction with Multi-Agent Reinforcement LearningBidipta Sarkar, Warren Xia, Karen Liu, Dorsa Sadigh
630TACTIC: Task-Agnostic Contrastive pre-Training for Inter-Agent CommunicationPeihong Yu, Manav Mishra, Syed Zaidi, Pratap Tokekar
1374MOSMAC: A Multi-agent Reinforcement Learning Benchmark on Sequential Multi-Objective TasksMinghong Geng, Shubham Pateria, Budhitama Subagdja, Ah-Hwee Tan
481Compositional Shielding and Reinforcement Learning for Multi-Agent SystemsAsger Horn Brorholt, Kim Guldstrand Larsen, Christian Schilling
795Decentralized Planning Using Probabilistic HyperpropertiesFrancesco Pontiggia, Filip Macák, Roman Andriushchenko, Michele Chiari, Milan Ceska
1338Towards Efficient Online Goal Recognition through Deep LearningLorenzo Serina, Mattia Chiari, Alfonso Gerevini, Luca Putelli, Ivan Serina
427Translating Multi-Agent Modal Logics of Knowledge and Belief into Decidable First-Order FragmentsQihui Feng, Hannah Wilk, Shakil M Khan, Gerhard Lakemeyer
729Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement LearningJingbo Sun, Songjun Tu, Qichao Zhang, Ke Chen, Dongbin Zhao
1038Robustness of Epistemic Gossip Protocols Against Data LossYoshikatsu Kobayashi, Koji Hasebe
37Impact Measures for Gradual Argumentation SemanticsCaren Al Anaissy, Jérôme Delobelle, Srdjan Vesic, Bruno Yun
308Investigating the Perspective of Non-Native Speakers on Foreigner-Directed Speech using Virtual Agents: The Role of Racial Ingroup Affiliation and Language Proficiency on Perception and ComprehensionOhenewa Bediako Akuffo, Birgit Lugrin
544Beyond Words: Integrating Personality Traits and Context-Driven Gestures in Human-Robot InteractionsTahsin Tariq Banna, Dr. Sejuti Rahman, Dr. Mohammad Tareq
301Near-Linear Time Leader Election in Multiagent NetworksAjay Kshemkalyani, Manish Kumar, Anisur Rahaman Molla, Gokarna Sharma
696Housing Market on NetworksXinwei Song, Tianyi Yang, Dengji Zhao
303Curiosity-Driven Partner Selection Accelerates Convention Emergence in Language GamesChin-Wing Leung, Paolo Turrini, Ann Nowe
1218Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing ProblemsYuxin Pan, Ruohong Liu, Yize Chen, Zhiguang Cao, Fangzhen Lin
244MAGNET: A Multi-Agent Graph Neural Network for Efficient Bipartite Task AssignmentDonald Loveland, James Usevitch, Zachary Serlin, Danai Koutra, Rajmonda S. Caceres
398Incentives for Early Arrival in Cost SharingJunyu Zhang, Yao Zhang, Yaoxin Ge, Dengji Zhao, Hu Fu, Zhihao Gavin Tang, Pinyan Lu
1265Conformal Set-based Human-AI Complementarity with Multiple ExpertsHelbert Paat, Guohao Shen
1026Uncertainty Expression for Human-Robot Task CommunicationDavid Porfirio, Mark Roberts, Laura M. Hiatt
162Tighter Value-Function Approximations for POMDPsMerlijn Krale, Wietze Koops, Sebastian Junges, Thiago D. Simão, Nils Jansen
534Synergistic Traffic AssignmentThomas Bläsius, Adrian Feilhauer, Markus Jung, Moritz Laupichler, Peter Sanders, Michael Zündorf
1389CAMP: Collaborative Attention Model with Profiles for Vehicle Routing ProblemsChuanbo Hua, Federico Berto, Jiwoo Son, Seunghyun Kang, Changhyun Kwon, Jinkyoo Park
999Policy Graphs and Intention: answering ‘why’ and ‘how’ from a telic perspectiveVictor Gimenez-Abalos, Sergio Alvarez-Napagao, Adrián Tormos, Ulises Cortés, Javier Vazquez-Salceda
1273On Learning Informative Trajectory Embeddings for Imitation, Classification and RegressionZichang Ge, Changyu Chen, Arunesh Sinha, Pradeep Varakantham
67Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language ModelSongjun Tu, Jingbo Sun, Qichao Zhang, Xiangyuan Lan, Dongbin Zhao
865Greedy ABA Learning for Case-Based ReasoningEmanuele De Angelis, Maurizio Proietti, Francesca Toni
391Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal HealthArpan Dasgupta, Gagan Jain, Arun Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja
1207Self-Interpretable Reinforcement Learning via Rule EnsemblesYue Yang, Fan Yang, Yu Bai, Hao Wang
905An Improved Mechanism for Pricing Ride-Hailing FaresMarek Adamczyk, Maurycy Borkowski, Michał Pawłowski
73To Spend or to Gain: Online Learning in Repeated Karma AuctionsDamien Berriaud, Ezzat Elokda, Devansh Jalota, Emilio Frazzoli, Marco Pavone, Florian Dorfler
455Game-Theoretic Goal Recognition in Time-Sensitive ApplicationsSara Bernardini, Fabio Fagnani, Santiago Franco
518Alternating-time Temporal Logic with Stochastic AbilitiesGabriel Ballot, Vadim Malvone, Jean Leneutre, Jingxuan Ma, Mourad Leslous
442Networked Agents in the Dark: Team Value Learning under Partial ObservabilityGuilherme S. Varela, Alberto Sardinha, Francisco S. Melo
207Nucleolus Credit Assignment for Effective Coalitions in Multi-agent Reinforcement LearningYugu Li, Zehong Cao, Jianglin Qiao, Siyi Hu
338Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination DynamicsSomnath Hazra, Pallab Dasgupta, Soumyajit Dey
1377Byzantine Game Theory: Sun Tzu’s BoxesAndrei Constantinescu, Roger Wattenhofer
567Adaptive Bi-Level Multi-Robot Task Allocation and Learning under Uncertainty with Temporal Logic ConstraintsXiaoshan Lin, Roberto Tron
837From Natural Language to Extensive-Form Game RepresentationsShilong Deng, Yongzhao Wang, Rahul Savani
827Resource Task GamesJessica L. Newman, Enrico Gerding, Enrico Marchioni, Baharak Rastegari
1064The Price of Anarchy in Spatial Social ChoiceJames Patrick Bailey, Craig Tovey
646Loss of Plasticity: A New Perspective on Solving Multi-Agent Exploration for Sparse Reward TasksZehua Zang, Chuxiong Sun, Lixiang Liu, Fuchun Sun, Changwen Zheng
468Learning with Limited Shared Information in Multi-agent Multi-armed BanditJunning Shao, Siwei Wang, Zhixuan Fang
1113Multi-Objective Planning with Contextual Lexicographic Reward PreferencesPulkit Rustagi, Yashwanthi Anand, Sandhya Saisubramanian
335Fairly Allocating Goods in ParallelRohan Garg, Alexandros Psomas
394Harmonious Balanced Partitioning of a Network of AgentsPulkit Agarwal, Harshvardhan Agarwal, Vaibhav Raj, Swaprava Nath
1319Beyond Goal Recognition: A Reinforcement Learning-based Approach to Inferring Agent BehaviourSheryl Mantik, Michael Dann, Minyi Li, Huong Ha, Julie Porteous
609Combining Planning and Reinforcement Learning for Solving Relational Multiagent DomainsNikhilesh Prabhakar, Ranveer Singh, Harsha Kokel, Sriraam Natarajan, Prasad Tadepalli
944Neural DNF-MT: A Neuro-symbolic Approach for Learning Interpretable and Editable PoliciesKexin Gu Baugh, Luke Dickens, Alessandra Russo
845Ranking Joint Policies in Dynamic Games using Evolutionary DynamicsNatalia Koliou, George Vouros
1346Parameterized Algorithms for Multiagent Pathfinding on TreesArgyrios Deligkas, Eduard Eiben, Robert Ganian, Iyad A. Kanj, Ramanujan Sridharan
867OGS-SLAM: Hybrid ORB-Gaussian Splatting SLAMXiaohan Li, Wenxiang Shen, Dong Liu, Jun Wu
738Monte Carlo Tree Search with Velocity Obstacles for safe and efficient motion planning in dynamic environmentsLorenzo Bonanni, Daniele Meli, Alberto Castellini, Alessandro Farinelli
942Emergence of Recursive Language through Bootstrapping and Iterated LearningVikas Kumar, Ajin George Joseph
111AdaCred: Adaptive Causal Decision Transformers with Feature CreditingHemant Kumawat, Saibal Mukhopadhyay
907Computing Efficient and Envy-Free Allocations under Dichotomous Preferences using SATAri Conati, Andreas Niskanen, Ronald De Haan, Matti Järvisalo
1034Uncertain Machine Ethics PlanningSimon Kolker, Louise A. Dennis, Ramon Fraga Pereira, Mengwei Xu
874Formalising Overdetermination in a Labelled Transition SystemGauvain Bourgne, Camilo Sarmiento, Jean Gabriel Gustave Ganascia
63Multi-agent reinforcement learning in the all-or-nothing public goods game on networksBenedikt Valentin Meylahn
1068An AI-Driven Card Playing Robot: An Empirical Study on Communicative Style and Embodiment with Elderly AdultsMichael Banck, Elisabeth Ganal, Hanna-Finja Weichert, Frank Puppe, Birgit Lugrin
129FLIGHT: Facility Location Integrating Generalized, Holistic Theory of WelfareAvyukta Manjunatha Vummintala, Shivam Gupta, Shweta Jain, Sujit Gujar
29Enhancing Graph-based Coordination with Evolutionary Algorithms for Episodic Multi-agent Reinforcement LearningKexing Peng, Pengyi Li, Jianye Hao
533Dynamic Sight Range Selection in Multi-Agent Reinforcement LearningWeichen Liao, Ti-Rong Wu, I-Chen Wu
1102Indifferential Privacy: A New Paradigm and Its Applications to Optimal Matching in Dark Pool AuctionsAntigoni Polychroniadou, T-H. Hubert Chan, Adya Agrawal
1226Full Proportional Justified RepresentationYusuf Hakan Kalayci, Jiasen Liu, David Kempe
977LTL Verification of Memoryful Neural AgentsMehran Hosseini, Alessio Lomuscio, Nicola Paoletti
165Incentivizing Truth Exploration and Honest Reporting: A Contract Design ApproachYuming Shao, Zhixuan Fang
937Bidding Games on Markov Decision Processes with Quantitative Reachability ObjectivesGuy Avni, Martin Kureƒçka, Kaushik Mallik, Petr Novotný, Suman Sadhukhan
1009Agent-based Modeling and Simulation of Ambiguity in Catastrophe Insurance MarketsYu Bi, Lingxiao Zhao, Jinyun Tong, Zhe Feng, Carmine Ventre
1307Preventing Misinformation with Redundancy in Emergent CommunicationFábio Vital, Alberto Sardinha, Francisco S. Melo
1049Sea-cret Agents: Maritime Abduction for Region Generation to Expose Dark Vessel TrajectoriesDivyagna Bavikadi, Nathaniel Lee, Chad Parvis, Paulo Shakarian
1005HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement LearningKryspin Varys, Federico Cerutti, Adam Sobey, Timothy J. Norman
662Global Behavior of Learning Dynamics in Zero-Sum Games with Memory AsymmetryYuma Fujimoto, Kaito Ariu, Kenshi Abe
908Conditional Max-Sum for Asynchronous Multiagent Decision MakingDimitrios Troullinos, Georgios Chalkiadakis, Ioannis Papamichail, Markos Papageorgiou
462Opinion Dynamics with Median AggregationPetra Berenbrink, Martin Hoefer, Dominik Kaaser, Marten Maack, Malin Rau, Lisa Wilhelmi
671Fair Allocation of Divisible Goods under Non-Linear ValuationsHaris Aziz, Zixu He, Xinhang Lu, Kaiyang Zhou
797Games in Public Announcement: How to Reduce System Losses in Optimistic Blockchain MechanismsSiyuan Liu, Yulong Zeng
224Factorised Active Inference for Strategic Multi-Agent InteractionsJaime Ruiz-Serra, Patrick Sweeney, Michael Harre
27The Strong Core of Housing Markets with Partial Order PreferencesIldikó Schlotter, Lydia Mirabel Mendoza-Cadena
511Simplifying imperfect recall gamesHugo Gimbert, Soumyajit Paul, B. Srivathsan
628Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis PerspectiveChuxiong Sun, Peng He, Rui Wang, Changwen Zheng
933Selecting Interlacing CommitteesChris Dong, Martin Bullinger, Tomasz Wƒös, Larry Birnbaum, Edith Elkind
1250ShipNaviSim: Data-Driven Simulation for Real-World Maritime NavigationQuang Anh Pham, Janaka Chathuranga Brahmanage, Akshat Kumar
571Socratic: Enhancing Human Teamwork via AI-enabled CoachingSangwon Seo, Bing Han, Rayan Ebnali Harari, Roger Daglius Dias, Marco A. Zenati, Eduardo Salas, Vaibhav V. Unhelkar
418Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature SelectionJiafan Zhuang, Gaofei Han, Zihaoxia, Che Lin, Boxi Wang, Wenji Li, Wangdongliang, Zhun Fan, Ruichu Cai, Zhifeng Hao
1293Maximizing Value in Challenge the Champ TournamentsUmang Bhaskar, Juhi Chaudhary, Palash Dey
470The Bakers and Millers Game with Restricted LocationsSimon Krogmann, Pascal Lenzner, Alexander Skopalik
756An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative TasksGeorge Papadopoulos, Andreas Kontogiannis, Foteini Papadopoulou, Chaido Poulianou, Ioannis Koumentis, George Vouros
709FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHFFlint Xiaofeng Fan, Cheston Tan, Yew-Soon Ong, Roger Wattenhofer, Wei Tsang Ooi
245EduQate: Generating Adaptive Curricula through RMABs in Education SettingsSidney Tio, Dexun Li, Pradeep Varakantham
360The Metric Distortion of Randomized Social Choice Functions: C1 Maximal Lottery Rules and SimulationsFabian Frank, Patrick Lederer
472Fair Division in a Variable SettingHarish Chandramouleeswaran, Prajakta Nimbhorkar, Nidhi Rathi
24Order Symmetry: A New Fairness Criterion for Assignment MechanismsRupert Freeman, Geoffrey Pritchard, Mark C. Wilson
580Planning, scheduling, and execution on the Moon: the CADRE technology demonstration missionGregg Rabideau, Joseph A. Russino, Andrew Branch, Nihal N. Dhamani, Tiago Vaquero, Steve Chien, Jean-Pierre De La Croix, Federico Rossi
1289Optimising expectation with guarantees for window mean payoff in Markov decision processesPranshu Gaba, Shibashis Guha
441A Minimax-Bayes Approach to Ad Hoc TeamworkVictor Villin, Thomas Kleine Buening, Christos Dimitrakakis
714Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based PlannersFengming Zhu, Fangzhen Lin
1103Dynamic Coalition Structure Detection in Natural-Language-based InteractionsAbhishek Ninad Kulkarni, Andy Liu, Jean-Raphaël Gaglione, Daniel Fried, Ufuk Topcu
530Rational Capability in Concurrent GamesYinfeng Li, Emiliano Lorini, Munyque Mittelmann
375FGLight: Learning neighbor-level information for Traffic Signal ControlHang Xiao, Huale Li, Shuhan Qi, Jiajia Zhang, Dingzhong Cai
170Learning Graph Representation of Agent DiffusersYoucef Djenouri, Nassim Belmecheri, Tomasz Pawel Michalak, Jan Dubiński, Ahmed Nabil Belbachir, Anis Yazidi
1283Emit As You Go: Enumerating Edges of a Spanning TreeKatrin Casel, Stefan Neubert
526Unveiling Decision Intention for Cooperative Multi-Agent Reinforcement LearningZeren Zhang, Zhiwei Xu, Guangchong Zhou, Dapeng Li, Bin Zhang, Guoliang Fan
1140Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit AssignmentKartik Nagpal, Dayi Ethan Dong, Negar Mehr
265Algorithmically Fair Maximization of Multiple Submodular Objective FunctionsGeorgios Amanatidis, Georgios Birmpas, Philip Lazos, Stefano Leonardi, Rebecca Reiffenhäuser
704Consistency Policy with Categorical Critic for Autonomous DrivingXing Fang, Qichao Zhang, Haoran Li, Dongbin Zhao
587ApproxED: Approximate Exploitability Descent via Learned Best ResponsesCarlos Martin, Tuomas Sandholm
787PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement LearningKun Hu, Muning Wen, Xihuai Wang, Shao Zhang, Yiwei Shi, Minne Li, Minglong Li, Ying Wen
762Mitigating Value Conflicts with Computational Theory of MindEmre Erdogan, Hüseyin Aydın, Frank Dignum, Rineke Verbrugge, Pinar Yolum
657More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack RequestsAli Safarpoor Dehkordi, Ahad N. Zehmakan
813Temporal Network Creation Games: The Impact of Non-Locality and TerminalsDavide Bilò, Sarel Cohen, Tobias Friedrich, Hans Gawendowicz, Nicolas Klodt, Pascal Lenzner, George Skretas
279Higher-Order Belief in Incomplete Information MAIDsFrancis Rhys Ward, Jack Foxabbott, Rohan Subramani
950Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian NetworkVincent Hsiao, Mark Roberts, Laura M. Hiatt, George Konidaris, Dana S. Nau
1189Counterfactual Explanations for Model Ensembles Using Entropic Risk MeasuresErfaun Noorani, Pasan Dissanayake, Faisal Hamman, Sanghamitra Dutta
457Artificial Agents Mitigate The Punishment Dilemma Of Indirect ReciprocityAlexandre S. Pires, Fernando P. Santos
495On the Hardness of Fair Allocation under Ternary ValuationsZack Fitzsimmons, Vignesh Viswanathan, Yair Zick
500Tackling Temporal Deontic Challenges with Equilibrium LogicDavide Soldà, Pedro Cabalar, Agata Ciabattoni, Emery A. Neufeld
277FORM: Learning Expressive and Transferable First-Order Logic Reward MachinesLeo Ardon, Daniel Furelos-Blanco, Roko Parać, Alessandra Russo
730Equilibrium Analysis in Markets with Asymmetric Utility FunctionsMartin Bichler, Markus Ewert, Axel Ockenfels
1008Extending Consensus-based Task Allocation Algorithms with Bid Intercession to Foster Mixed-InitiativeVictor Guillet, Charles Lesire, Gauthier Picard, Christophe Grand
420Fairness and Optimality in RoutingSreenivas Gollapudi, Kostas Kollias, Alkmini Sgouritsa, Ali Kemal Sinop
297Simulating and Evaluating Generative Modeling and Collaborative Filtering in Complex Social NetworksWen Dong, Fairul Mohd-Zaid
765Free Argumentative Exchanges for Explaining Image ClassifiersAvinash Kori, Antonio Rago, Francesca Toni
712Tackling Sparsity in Designated Driver Dispatch with Multi-Agent Reinforcement LearningJiaxuan Jiang, Ling Pan, Lin Zhou, Longbo Huang, Zhixuan Fang
579Condorcet Winners and Anscombe’s Paradox Under Weighted Binary VotingCarmel Baharav, Andrei Constantinescu, Roger Wattenhofer
146On the Fairness of Additive Welfarist RulesKaren Frilya Celine, Warut Suksompong, Sheung Man Yuen
1233Reinforcement Learning Based Simulated AnnealingNathan Qiu, Daniel Dali Liang
1062On the Gale-Shapley Algorithm for Stable Matchings with a Partial Honesty Nash RefinementJames Patrick Bailey, Craig Tovey
89Learning in Games with Progressive HidingBenjamin Heymann, Marc Lanctot
607Bottom-Up Reputation Promotes Cooperation with Multi-Agent Reinforcement LearningTianyu Ren, Xuan Yao, Yang Li, Xiao-Jun Zeng
167Personality-Driven Decision Making in LLM-Based Autonomous AgentsLewis Newsham, Daniel Prince
243Human-Agent Coordination in Games under Incomplete Information via Multi-Step IntentShenghui Chen, Ruihan Zhao, Sandeep P. Chinchali, Ufuk Topcu
664EFX Allocations and Orientations on Bipartite Multi-graphs: A Complete PictureMahyar Afshinmehr, Alireza Danaei, Mehrafarin Kazemi, Kurt Mehlhorn, Nidhi Rathi
4Logic of Knowledge and Cognitive AbilityJia Tao, Xinran Zhang
968Certified Guidance for Planning with Deep Generative ModelsFrancesca Cairoli, Francesco Giacomarra, Mehran Hosseini, Nicola Paoletti
778GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-ThoughtSungsik Kim, Baek Janghyun, Jinkyu Kim, Jaekoo Lee
798Predictability Awareness for Efficient and Robust Multi-Agent CoordinationRomán Chiva Gil, Daniel Jarne Ornia, Khaled A. Mustafa, Javier Alonso-Mora
119On the limits of agency in agent-based modelsAyush Chopra, Shashank Kumar, Nurullah Giray Kuru, Ramesh Raskar, Arnau Quera-Bofarull
6Enhancing Offline Reinforcement Learning with Curriculum Learning-Based Trajectory ValuationAmir Abolfazli, Zekun Song, Avishek Anand, Wolfgang Nejdl
819Multi-Ship Future Interaction Trajectory Prediction via Pre-Initializer Diffusion ModelKun Ma, Qilong Han, Jingzheng Yao
74Offline Goal-Conditioned Reinforcement Learning with Elastic-Subgoal Diffused Policy LearningYaocheng Zhang, Yuanheng Zhu, Yuqian Fu, Songjun Tu, Dongbin Zhao
621Who Am I Dealing With? Explaining the Designer’s Hidden IntentionsTurgay Caglar, Sarath Sreedharan, Mor Vered
1255Mean Field Correlated Imitation LearningZhiyu Zhao, Chengdong Ma, Qirui Mi, Ning Yang, Xue Yan, Mengyue Yang, Haifeng Zhang, Jun Wang, Yaodong Yang,
915On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite FlowTonghan Wang, Heng Dong, Yanchen Jiang, David C. Parkes, Milind Tambe
154Boosting Sortition via Proportional RepresentationSoroush Ebadian, Evi Micha
930Taming Multi-Agent Reinforcement Learning with Estimator Variance ReductionTaher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew E. Taylor, Kun Shao, Jun Wang, David Henry Mguni,
592SCMRAG: Self-Corrective Multihop Retrieval Augmented Generation System for LLM AgentsRishabh Agrawal, Murtaza Asrani, Hadi Youssef, Apurva Narayan
564Maximizing Truth Learning in a Social Network is NP-hardFilip Úradník, Amanda Wang, Jie Gao
643Value Iteration for Learning Concurrently Executable Robotic Control TasksSheikh A. Tahmid, Gennaro Notomista
357Learning Symbolic Task Decompositions for Multi-Agent TeamsAmeesh Shah, Niklas Lauffer, Thomas Chen, Nikhil Pitta, Sanjit A. Seshia
523Temporal Fair Division of Indivisible ItemsEdith Elkind, Alexander Lam, Mohamad Latifian, Tzeh Yuan Neoh, Nicholas Teh
667Human-Aligned Skill Discovery: Balancing Behaviour Exploration and AlignmentMaxence Hussonnois, Thommen George Karimpanal, Santu Rana
337Towards Fair and Efficient Public Transportation: A Bus Stop ModelMartin Bullinger, Edith Elkind, Mohamad Latifian
900Agent-Based Analysis of Green Disclosure Policies and Their Market-Wide Impact on Firm BehaviorLingxiao Zhao, Maria Polukarov, Carmine Ventre
1022Generalised BDI PlanningFelipe Meneguzzi, Ramon Fraga Pereira, Nir Oren
887A Simple Integration of Epistemic Logic and Reinforcement LearningThorsten Engesser, Thibaut Le Marre, Emiliano Lorini, François Schwarzentruber, Bruno Zanuttini
1304Surprise! Surprise! Learn and AdaptHuma Samin, Dylan J. Walton, Nelly Bencomo
1036xSRL: Safety-Aware Explainable RL – Safety as a Product of ExplainabilityRisal Shahriar Shefin, Md Asifur Rahman, Thai Le, Sarra Alqahtani
963Local Topological Information as a Powerful Enhancer for Generalizable Neural Method in Travelling Salesman ProblemXiaoxin Bai, Junyang Yang, Shengchao Yuan, Yinghao Zhang, Hanqian Wu
917Policy Abstraction and Nash Refinement in Tree-Exploiting PSROChristine Konicki, Mithun Chakraborty, Michael P. Wellman
1303DUPRE: Data Utility Prediction for Efficient Data ValuationPham Kieu Thao Nguyen, Rachael Hwee Ling Sim, Quoc Phong Nguyen, See-Kiong Ng, Bryan Kian Hsiang Low
803Improving Policy Optimization via 𝜺-RetrainLuca Marzari, Priya L. Donti, Changliu Liu, Enrico Marchesini
943Voter Model Meets Rumour Spreading: A Study of Consensus Protocols on Graphs with Agnostic NodesMarcelo Matheus Gauy, Anna Abramishvili, Eduardo Colli, Tiago Madeira, Frederik Mallmann-Trenn, Vinícius Franco Vasconcelos, David Kohan Marzagao
1153On Some Fundamental Problems for Multi-Agent Systems Over Multilayer NetworksDaniel Rosenkrantz, Madhav Marathe, Zirou Qiu, S. S. Ravi, Richard Stearns
112Candidate nomination for Condorcet-consistent voting rulesIldikó Schlotter, Katarína Cechlárová
739Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase transitions in the Perturbed CultureFrançois Durand
14Automatic Verification of Linear Integer Planning Programs via Forgetting in LIAUPFLiangda Fang, Shikang Chen, Xiaoman Wang, Xiaoyou Lin, Chenyi Zhang, Qingliang Chen, Quanlong Guan, Kaile Su
436Model and Mechanisms of Consent for Responsible AutonomyAnastasia S. Apeiron, Davide Dell’Anna, Pradeep K. Murukannaiah, Pinar Yolum
1020Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML MonitoringGusseppe Bravo-Rocca, Peini Liu, Jordi Guitart, Rodrigo M Carrillo-Larco, Ajay Dholakia, David Ellison
870Selfish Behavior and Resource Competition in Multi-Agent SystemsCostas Courcoubetis, Antonis Dimakis
1208𝛽-DQN: Improving Deep Q-Learning By Evolving the BehaviorHongming Zhang, Fengshuo Bai, Chenjun Xiao, Chao Gao, Bo Xu, Martin Müller
160EconoJax: A Fast & Scalable Economic Simulation in JAXKoen Ponse, Aske Plaat, Niki Van Stein, Thomas M. Moerland
263On the Effective Horizon of Inverse Reinforcement LearningYiqing Xu, Finale Doshi-Velez, David Hsu
389Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential ExplorationHai Zhong, Xun Wang, Zhuoran Li, Longbo Huang
272Real-World Testing Matters in Reinforcement Learning for EducationAnna Riedmann, Carlo D’Eramo, Birgit Lugrin
805On Stateful Value Factorization in Multi-Agent Reinforcement LearningEnrico Marchesini, Andrea Baisero, Rupali Bhati, Christopher Amato
1317Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage ConstraintsMohamed S. Talamali, Genki Miyauchi, Thomas Watteyne, Micael Santos Couceiro, Roderich Gross
282Robin Hood Reachability Bidding GamesShaull Almagor, Guy Avni, Neta Dafni
307Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noiseYuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Pudovikov Andrey
434Truthful mechanisms for linear bandit games with private contextsYiting Hu, Lingjie Duan
159Dual Ensembled Multiagent Q-Learning with Hypernet RegularizerYaodong Yang, Guangyong Chen, Hongyao Tang, Furui Liu, Danruo Deng, Pheng-Ann Heng
330Computing Efficient Envy-Free Partial Allocations of Indivisible GoodsRobert Bredereck, Andrzej Kaczmarczyk, Junjie Luo, Bin Sun
169Causes and Strategies in Multiagent SystemsSylvia S. Kerkhove, Natasha Alechina, Mehdi Dastani
1139Discovery and Deployment of Emergent Robot Swarm Behaviors via Representation Learning and Real2Sim2Real TransferConnor Mattson, Varun Raveendra, Ricardo Vega, Cameron Nowzari, Daniel S. Drew, Daniel S. Brown
941EnEnv 1.0: Energy Grid Environment for Multi-Agent Reinforcement Learning BenchmarkingDominik Jacek Bogucki, Łukasz Eugeniusz Lepak, Sonam Parashar, Bart Blachowski, Paweł Wawrzyński
116Adaptive Episode Length Adjustment for Multi-agent Reinforcement LearningByunghyun Yoo, Younghwan Shin, Hyunwoo Kim, Euisok Chung, Jeongmin Yang
909Probabilistic Timed ATLWojciech Jamroga, Marta Kwiatkowska, Wojciech Penczek, Laure Petrucci, Teofil Sidoruk
602k-Approval Veto: A Spectrum of Voting Rules Balancing Metric Distortion and Minority ProtectionFatih Erdem Kizilkaya, David Kempe
929Practical Abstractions for Model Checking Continuous-Time Multi-Agent SystemsYan Kim, Wojciech Jamroga, Wojciech Penczek, Laure Petrucci
566Efficient and Optimal Policy Gradient Algorithm for Corrupted Multi-armed BanditsJiayuan Liu, Siwei Wang, Zhixuan Fang
201Multi-agent Multi-armed Bandits with Minimum Reward Guarantee FairnessPiyushi Manupriya, Himanshu, Sakethanath Jagarlapudi, Ganesh Ghalme
91On the Power of Temporal Locality on Online Routing ProblemsSwapnil Guragain, Gokarna Sharma
424Insights Regarding the Success of Damping in Improving Belief PropagationUriel Zaed, Roie Zivan, Omer Lev
1370Offline Multi-Agent Preference-based Reinforcement Learning with Agent-aware Direct Preference OptimizationQian Kou, Mingyang Li, Zeyang Liu, Long Qian, Zhuoran Chen, Lipeng Wan, Xingyu Chen, Xuguang Lan
946Multi-objective Reinforcement Learning with Nonlinear Preferences: Provable Approximation for Maximizing Expected Scalarized ReturnNianli Peng, Muhang Tian, Brandon Fain
513Non-obvious Manipulability in Hedonic Games with Friends Appreciation PreferencesMichele Flammini, Maria Fomenko, Giovanna Varricchio
1188Learning Collusion in Episodic, Inventory-Constrained MarketsPaul Friedrich, Barna Pásztor, Giorgia Ramponi
334Welfare Approximation in Additively Separable Hedonic GamesMartin Bullinger, Vaggos Chatziafratis, Parnian Shahkar
200Anytime Fairness Guarantees in Stochastic Combinatorial MABs: A Novel Learning FrameworkSubham Pokhriyal, Shweta Jain, Ganesh Ghalme, Vaneet Aggarwal
262Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive ExplorationXingrui Yu, Zhenglin Wan, David Mark Bossens, Yueming Lyu, Qing Guo, Ivor Tsang
1029Smooth Information Gathering in Two-Player Noncooperative GamesFernando Palafox, Jesse Milzman, David Fridovich-Keil, Dong Ho Lee, Ryan Park
939Eliminating Majority IllusionFoivos Fioravantes, Abhiruk Lahiri, Antonio Lauerbach, Lluís Sabater, Marie Diana Sieper, Samuel Wolf
1347Enhancing Sub-Optimal Trajectory Stitching: Spatial Composition RvS for Offline RLSheng Zang, Zhiguang Cao, Bo An, Senthilnath Jayavelu, Xiaoli Li
393Truthful and Welfare-maximizing Resource Scheduling with Application to Electric VehiclesRamsundar Anandanarayanan, Swaprava Nath, Prasant Misra