Accepted Extended Abstracts (Main Technical Track)
Some extended abstracts have links to author-made short video presentations (typically about 5 minutes long) in the last column.
ID | Title | Authors | Video |
---|---|---|---|
5 | PANDA: Priority-Based Collision Avoidance Framework for Heterogeneous UAVs Navigating in Dense Airspace | Agamdeep Singh, Jaskirat Singh, Sujit Pb | |
7 | Nash Equilibrium and Learning Dynamics in Three-Player Matching m-Action Games | Yuma Fujimoto, Kaito Ariu, Kenshi Abe | |
16 | Adapting Beyond the Depth Limit: Counter Strategies in Large Imperfect Information Games | David Milec, Vojtech Kovarik, Viliam Lisý | Link |
17 | Dynamic Option Creation in Option-Critic Reinforcement Learning | Mateus Begnini Melchiades, Gabriel De Oliveira Ramos, Bruno Castro Da Silva | |
26 | Offline Meta Reinforcement Learning with Weighted Policy Constraints and Proximal Context Collection | Haorui Li, Jiaqi Liang, Linjing Li, Daniel Dajun Zeng | |
75 | Efficient Model Checking with Semantically-Equivalent Models for vGOAL | Yi Yang, Tom Holvoet | |
88 | Empowering Generalization for Deep Reinforcement Learning via Symbolic Planning | Tianpei Yang, Srijita Das, Christabel Wayllace, Matthew E. Taylor | |
102 | ADAGE: A generic two-layer framework for adaptive agent based modelling | Benjamin Patrick Evans, Sihan Zeng, Sumitra Ganesh, Leo Ardon | |
103 | Truman: A Large Language Model-based Multi-agent Simulator for Synthetic Money Laundering Data Generation | Dattatray Vishnu Kute, Zihao Xu, Yuekang Li, Fethi A Rabhi | |
109 | Predicting Team Performance from Communications in Simulated Search-and-Rescue | Ali Jalal-Kamali, Nikolos M Gurney, David V. Pynadath | |
113 | Local Anomaly Detection with Partial Observation in Multi-agent Systems as a Data Matching Game | Zixin Ye, Christopher Leckie, Tansu Alpcan | |
117 | Resource Allocation under the Latin Square Constraint | Yasushi Kawase, Bodhayan Roy, Mohammad Azharuddin Sanpui | |
123 | Group Fairness in Multi-period Mobile Facility Location Problems | Haris Aziz, Hau Chan, Xingchen Sha, Toby Walsh, Lirong Xia | |
133 | Shapley Value-based Approach for Distributing Revenue of Matchmaking of Private Transactions in Blockchains | Rasheed, Parth Desai, Yash Chaurasia, Sujit Gujar | |
134 | Neighborhood Stability in Assignments on Graphs | Haris Aziz, Grzegorz Lisowski, Mashbat Suzuki, Jeremy Vollen | |
155 | Transformer Guided Coevolution: Improved Team Formation in Multiagent Adversarial Games | Pranav Rajbhandari, Prithviraj Dasgupta, Donald Sofge | |
172 | Robust Strategies for Stochastic Multi-Agent Systems | Raphaël Berthon, Joost-Pieter Katoen, Munyque Mittelmann, Aniello Murano | |
173 | Shifting Power: Leveraging LLMs to Simulate Human Aversion in ABMs of Bilateral Financial Exchanges | A Bond Market Study., Alicia Vidler, Toby Walsh | |
175 | Observer-Aware Probabilistic Planning under Partial Observability | Salomé Lepers, Vincent Thomas, Olivier Buffet | |
181 | Multi-Agent Systems for Bullying Intervention | Luis Zhinin-Vera, José J González-García, Víctor López-Jaquero, Elena Navarro, Pascual González | |
188 | Heuristics-Assisted Experience Replay Strategy for Cooperative Multi-Agent Reinforcement Learning | Xie Yi, Ziqing Zhou, Chun Ouyang, Siao Liu, Linqiang Hu, Zhongxue Gan | |
190 | Fair Assignment on Multi-Stage Graphs | Vibulan J, Swapnil Dhamal, Shweta Jain, Ojassvi Kumar, Aman Kumar, Harpreet Singh | |
192 | Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization | Shengchao Hu, Wanru Zhao, Weixiong Lin, Li Shen, Ya Zhang, Dacheng Tao | |
202 | Regret Guarantees for a UCB-based Algorithm for Volatile Combinatorial Bandits | Andra Siva Sai Teja, Kumar Abhishek, Sujit Gujar, Yadati Narahari, Ganesh Ghalme | |
205 | Voter Participation Control in Online Polls | Koustav De, Palash Dey, Swagato Sanyal | Link |
217 | Learning Heterogeneous Agent Collaboration in Decentralized Multi-Agent Systems via Intrinsic Motivation | Jahir Sadik Monon, Deeparghya Dutta Barua, Md Mosaddek Khan | |
222 | Leveraging Fully-Observable Solutions for Improved Partially-Observable Offline Reinforcement Learning | Chulabhaya Wijesundara, Andrea Baisero, Gregory David Castanon, Alan Carlin, Robert Platt, Christopher Amato | |
223 | Fast Adaption by Policy Deviation Integral Meta-reinforcement Learning with Applications to High-speed Trains Operation | Haotong Zhang, Wanyuan Wang | |
235 | RallyDiffuser: A Representation-Guided Diffusion Model Framework for Strategic Planning in Badminton | Bing-Zhi Ke, Kuang-Da Wang, Wen-Chih Peng | |
253 | Negotiated Reasoning: On Provably Addressing Relative Over-Generalization | Junjie Sheng, Wenhao Li, Bo Jin, Hongyuan Zha, Jun Wang, Xiangfeng Wang | |
260 | Dynamic Conservative Degree Allocation for Offline Multi-Agent Reinforcement Learning | Haosheng Chen, Yun Hua, Junjie Sheng, Wenhao Li, Bo Jin, Xiangfeng Wang | |
267 | Stochastic k-Submodular Bandits with Full Bandit Feedback | Guanyu Nie, Vaneet Aggarwal, Christopher John Quinn | |
274 | Learning Fair and Preferable Allocations through Neural Network | Ryota Maruo, Koh Takeuchi, Hisashi Kashima | |
275 | MORL4Water: A Modular Multi-Objective Reinforcement Learning Toolkit for Water Resource Management | Zuzanna Osika, Roxana Rădulescu, Jazmin Zatarain Salazar, Frans A Oliehoek, Pradeep K. Murukannaiah | |
278 | Towards Automating the Design of Value-Aligned Clinical Protocols | Manel Rodriguez-Soto, Nardine Osman, Carles Sierra, Rocio Cintas-Garcia, Cristina Farriols-Danes, Montserrat Garcia-Retortillo, Silvia Minguez-Maso, Jordi Martinez-Roldan | |
296 | Adaptive Offline Data Replay in Offline-to-Online Reinforcement Learning | Xu Liu, Tong Yu, Shuai Li | |
304 | Enhancing Offline Safe Reinforcement Learning with Trajectory-Constrained Diffusion Planning | Hengrui Zhang, Youfang Lin, Shuo Shen, Hanfeng Lin, Peng Cheng, Sheng Han, Kai Lv | |
311 | IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems | Yihuan Mao, Yipeng Kang, Peilun Li, Ning Zhang, Wei Xu, Chongjie Zhang | |
317 | CADP: Towards Better Centralized Learning for Decentralized Execution in MARL | Yihe Zhou, Shunyu Liu, Yunpeng Qing, Tongya Zheng, Kaixuan Chen, Jie Song, Mingli Song | |
347 | On the Distortion of Multi-Winner Elections on the Line Metric | Negar Babashah, Hasti Karimi, Masoud Seddighin, Golnoosh Shahkarami | |
348 | Enhancing Lifelong Multi-Agent Path-finding by Using Artificial Potential Fields | Arseniy Pertzovsky, Roni Stern, Roie Zivan, Ariel Felner | Link |
350 | On-Policy Reinforcement Learning From Failure via Sparse Reward Densification | Mingkang Wu, Yongcan Cao | |
362 | Enhancing Robot Navigation Policies with Task-Specific Uncertainty Management | Gokul Puthumanaillam, Paulo Padrao, Jose Fuentes, Leonardo Bobadilla, Melkior Ornik | |
364 | DyLam: A Dynamic Reward Weighting Framework for Reinforcement Learning Algorithms | Mateus Gonçalves Machado, Hansenclever Bassani | Link |
365 | Requirements-based Explainability for Multi Agent Systems | Sebastian Rodriguez, John Thangarajah, Michael Winikoff | |
366 | The Effectiveness of Best-Response Dynamics in Reducing Price of Anarchy for Markov Potential Games | Dingyang Chen, Xiaoling Zeng, Thinh T. Doan, Qi Zhang | |
380 | Learning Bayesian Game Families, with Application To Mechanism Design | Madelyn Gatchel, Michael P. Wellman | |
386 | Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach | Xinjie Liu, Jingqi Li, Filippos Fotiadis, Mustafa O. Karabag, Jesse Milzman, David Fridovich-Keil, Ufuk Topcu | Link |
396 | Optimal Mechanism Design for Crowdfunding of Public Goods | Yukun Cheng, Xiaotie Deng, Baqiao Quan | Link |
426 | Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial Bandits | Briti Gangopadhyay, Zhao Wang, Alberto Silvio Chiappa, Shingo Takamatsu | Link |
429 | Parameterized Complexity of Hedonic Games with Enemy-Oriented Preferences | Martin Durand, Laurin Erlacher, Johanne Müller Vistisen, Sofia Simola | |
433 | Egalitarianism in Online Coalition Formation | Saar Cohen, Noa Agmon | |
444 | Distributed Value Decomposition Networks with Networked Agents | Guilherme S. Varela, Alberto Sardinha, Francisco S. Melo | |
448 | Interaction Protocols in an Imperative Agent-Oriented Programming Language: the case of BSPL and SARL | Matteo Baldoni, Cristina Baroglio, Stéphane Galland, Roberto Micalizio, Fatma Outay, Stefano Tedeschi | |
449 | Managing an Agent’s Changing Intentions Using LTL𝑓 Synthesis | Giuseppe De Giacomo, Yves Lesperance, Gianmarco Parretti, Fabio Patrizi, Renzo Schram | |
452 | Compensating latent nonlinear dynamics for practical consensus control | Krzysztof Kowalczyk, Dominik Baumann, Cristian R. Rojas, Paweł Wachel | |
463 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu, Yang Kai, Jiafei Lyu, Xiu Li | |
464 | Social Ranking for Feature Selection | Laurent Gourvès, Stefano Moretti, Satya Tamby | |
483 | Learning Pre-Trained Tacit Behavior for Efficient Multi-Agent Adversarial Coordination | Shiqing Yao, Jiajun Chai, Haixin Yu, Yongzhe Chang, Yuanheng Zhu, Xueqian Wang | |
505 | Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning | Changxi Zhu, Mehdi Dastani, Shihan Wang | |
507 | Making Universal Policies Universal | Niklas Hoepner, David Kuric, Herke Van Hoof | |
520 | Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity | Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor | |
525 | LogiEx: Integrating Formal Logic and Large Language Model for Explainable Planning | Ziyan An, Xia Wang, Hendrik Baier, Zirong Chen, Abhishek Dubey, Taylor T Johnson, Jonathan Sprinkle, Ayan Mukhopadhyay, Meiyi Ma | |
541 | ChatBDI: Think BDI | Talk Llm, Andrea Gatti, Viviana Mascardi, Angelo Ferrando | |
546 | Modeling the Collaborative Edge Data Caching Problem via a Dynamic DCOP | Ziyang Song, Ziyu Chen, Jinhui Huang, Cheng Zhang, Jingyuan He | |
547 | Reasoning and Planning with Dynamic Social Norms | Taylor Olson, Roberto Salas-Damian, Kenneth Forbus | |
557 | Predictive Improvement through Latent Space Optimisation | Alexander McCaffrey, Eduardo Alonso, Esther Mondragon | |
565 | Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning | Dmytro Kuzmenko, Nadiya Shvai | |
584 | Hierarchical Multi-agent Reinforcement Learning for Cyber Network Defense | Aditya Vikram Singh, Ethan Rathbun, Emma Graham, Lisa Oakley, Simona Boboila, Alina Oprea, Peter Chin | |
586 | Practical Comparisons of Reservoir Topology Performance and Input Distribution in Digital Reservoir Computers | Lewis Thelen, Vikram Ravindra | |
588 | Dynamic Reward Sharing to Enhance Learning in the Context of Multiagent Teams | Kyle Tilbury, David Radke | |
590 | AlphaZeroES: Direct Score Maximization Outperforms Planning Loss Minimization | Carlos Martin, Tuomas Sandholm | |
626 | Rethinking Explainable AI: Explanations can be Deceiving | Peta Masters, Daniel Gallagher, Luc Moreau, Mor Vered | |
644 | FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation | Wenzheng Jiang, Ji Wang, Xiongtao Zhang, Weidong Bao, Cheston Tan, Flint Xiaofeng Fan | |
656 | Navigating Social Dilemmas with LLM-based Agents via Consideration of Future Consequences | Dung Nguyen, Hung Le, Kien Do, Sunil Gupta, Svetha Venkatesh, Truyen Tran | |
673 | The Costly Bargain: Economic Impacts of Price-Seeking Behavior in Aging Populations | Fuguang Chen, Alan Tsang | |
677 | Traffic Anomaly Detection through Generative Modeling of Multi-Agent Interactions in Traffic Flow | Zhuojun Chen, Tacitus Hui, Xinghua Zhu, Dongzhe Su | |
682 | Satisfactory Budget Division | Laurent Gourvès, Michael Lampis, Nikolaos Melissinos, Aris Pagourtzis | |
689 | Decentralized Deep Reinforcement Learning for Cooperative Multi-Agent Flight Trajectory Planning in Adverse Weather | Bizhao Pang, Mingcheng Zhang, Xinting Hu, Sameer Alam, Guglielmo Lulli | |
705 | Mitigating Non-Stationarity in Deep Reinforcement Learning with Clustering Orthogonal Weight Modification | Guoqing Ma, Yuhan Zhang, Yuming Dai, Guangfu Hao, Yang Chen, Shan Yu | |
722 | SFedRec: A Federated Learning Framework for Dynamic Session-based Recommendation | Hexiao Zhang, Yanni Tang, Jiamou Liu, Wu Chen | |
723 | Environmental Policies within Cournot Oligopoly | Liang Shan, Zhengyang Liu, Haoqiang Huang, Zihe Wang | |
732 | Efficient Training of Generalizable Visuomotor Policies via Control-Aware Augmentation | Yinuo Zhao, Kun Wu, Tianjiao Yi, Zhiyuan Xu, Zhengping Che, Chi Harold Liu, Jian Tang | |
735 | Multiplayer Games With Incomplete Information for Hyperproperty Verification | Raven Beutner, Bernd Finkbeiner | |
736 | RainbowArena: A Multi-Agent Toolkit for Reinforcement Learning and Large Language Models in Competitive Tabletop Games | Yingzhuo Liu, Shuodi Liu, Hongsong Tang, Yubing Ma, Zikang Li, Junge Zhang, Liuyu Xiang, Zhaofeng He | |
747 | Multi-Agent Pickup and Delivery with Batteries | Marcello Bavaro, Francesco Amigoni | |
755 | Model of the influence of external signals on the trust of the agent in Multi Agent System | Frederique Lalieu, Tomasz Zurek, Tom Van Engers | |
758 | What Is a Counterfactual Cause in Action Theories? | Daxin Liu, Vaishak Belle | |
768 | Quantitative Operational Monitoring for BDI Agents | Marie Farrell, Angelo Ferrando, Mengwei Xu | |
782 | Integrating Large Language Models with Reinforcement Learning for Generalization in Strategic Card Games | Wannian Xia, Meng Fang, Zihao Guo, Yali Du, Bo Xu | |
814 | Lite-DIO Is Actually What You Need for Efficient Inertial Localization | Yan Li, Meng Liu, Zhongchen Shi, Yanqing Hou, Liang Xie, Hongbo Chen, Erwei Yin | |
824 | Equilibrium selection via communication partition | Wei-Chen Lee, Alessandro Abate, Michael J. Wooldridge | |
826 | Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors | Lang Feng, Jiahao Lin, Dong Xing, Li Zhang, De Ma, Gang Pan | |
842 | Experience-replay Innovative Dynamics | Tuo Zhang, Leonardo Stella, Julian Barreiro-Gomez | |
844 | Multi-Agent Reinforcement Learning with Selective State-Space Models | Jemma Daniel, Ruan John De Kock, Louay Ben Nessir, Sasha Abramowitz, Omayma Mahjoub, Wiem Khlifi, Juan Claude Formanek, Arnu Pretorius | |
860 | Entropic Exploration for Constrained Multiagent Reinforcement Learning | Ayhan Alp Aydeniz, Enrico Marchesini, Robert Loftin, Christopher Amato, Kagan Tumer | |
862 | Decision-Making in Evolving Environments: A Bayesian Multi-Agent Bandit Framework | Mohammad Essa Alsomali, Leandro Soriano Marcolino, Barry Porter, Roberto Rodrigues-Filho | |
863 | CPE: A New Paradigm for Policy Extraction in Offline Reinforcement Learning | Zhaohui Yang, Xiaoxuan Wang, Linjing Li | |
869 | Agential AI for Integrated Continual Learning | Deliberative Behavior, And Comprehensible Models, Zeki Doruk Erden, Boi Faltings | Link |
876 | Runtime Verification with Rational Multi-Monitors | Davide Catta, Angelo Ferrando, Vadim Malvone | |
877 | Where is the nearest EV charging station? Evolutionary optimization of the gas/charging stations topology | Enrique Mateos-Melero, Javier Moralejo-Piñas, Ángela Durán-Pinto, Francisco Martinez-Gil, María Soriano, Fernando Fernández | |
890 | Improving the effectiveness of potential-based reward shaping in reinforcement learning | Henrik Müller, Daniel Kudenko | |
897 | Can you see how I learn? Human observers’ inferences about Reinforcement Learning agents’ learning processes | Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens | |
906 | Distributed Adaptive Macroscopic Ensemble Task Allocation of Heterogeneous Robot Teams in Dynamic Environments | Victoria Edwards, M. Ani Hsieh | |
949 | Trading-off Accuracy and Communication Cost in Federated Learning | Mattia Jacopo Villani, Emanuele Natale, Frederik Mallmann-Trenn | |
951 | When to Stop Getting Tested: The Theory of Diagnostic Tests | Anson Kahng, Joseph Saber | |
952 | Cultural Evolution of Cooperation among LLM Agents | Aron Vallinder, Edward Hughes | |
957 | Is an exponentially growing action space really that bad? Validating a Core Assumption for using Multi-Agent RL | Ruan John De Kock, Arnu Pretorius, Jonathan P. Shock | |
962 | Adaptive Multi-Round Influence Maximization with Limited Information | Diodato Ferraioli, Vincenzo Auletta, Cosimo Vinci, Francesco Carbone | |
970 | Weighted Envy Freeness With Bounded Subsidies | Noga Klein Elmalem, Rica Gonen, Erel Segal-Halevi | |
971 | Combining Normative Ethics Principles to Learn Prosocial Behaviour | Jessica Woodgate, Nirav Ajmeri | Link |
972 | Participatory Budgeting Project Strength via Candidate Control | Piotr Faliszewski, Łukasz Janeczko, Dušan Knop, Jan Pokorný, Šimon Schierreich, Mateusz Słuszniak, Krzysztof Sornat | |
975 | Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods | Tyler Becker, Zachary N Sunberg | |
978 | Towards Fair and Efficient Policy Learning in Cooperative Multi-Agent Reinforcement Learning | Umer Siddique, Peilang Li, Yongcan Cao | |
991 | Diversity-seeking swap games in networks | Yaqiao Li, Lata Narayanan, Jaroslav Opatrny | |
993 | (Submodular) Hedonic Games with Common Ranking Property | Bugra Caskurlu, Ali Eser | |
1002 | Action-Dependent Optimality-Preserving Reward Shaping | Grant Collier Forbes, Jianxun Wang, Leonardo Villalobos-Arias, Arnav Jhala, David Roberts | Link |
1018 | Planning for Temporally Extended Goals based on alpha-CTL | Viviane Bonadia Dos Santos, Leliane N. De Barros, Maria Viviane De Menezes, Silvio Do Lago Pereira | |
1021 | Efficient Multi-Agent Delegated Search | Curtis Bechtel, Shaddin Dughmi | |
1025 | Fairness in Cooperative Multiagent Multiobjective Reinforcement Learning using the Expected Scalarized Reward Criterion | Fares Chouaki, Aurélie Beynier, Nicolas Maudet, Paolo Viappiani | |
1040 | Formal Verification of Manipulation Dialogues | Andreas Brännström, Chiaki Sakama, Juan Carlos Nieves | |
1042 | Learning to explore when mistakes are not allowed | Charly Pecqueux-Guézénec, Stephane Doncieux, Nicolas Perrin-Gilbert | |
1053 | Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools | Panagiotis Lymperopoulos, Vasanth Sarathy | |
1070 | DECAF: Learning to be Fair in Multi-agent Resource Allocation | Ashwin Kumar, William Yeoh | |
1073 | Online Competitive Information Gathering for Partially Observable Trajectory Games | Mel Krusniak, Hang Xu, Parker Palermo, Forrest John Laine | |
1090 | Diverse Heterogeneous Graph Conditioned Diffusion for Multi-Agent Teaming | Luis Manuel Pimentel, Sean Charles Ye, James Ellis Grant Pagan, Matthew Gombolay | |
1104 | Symplex: Learning social norm hierarchies by combining autonomous exploration and expert imitation | Oliver Deane, Oliver Ray | |
1138 | Weighted Envy-free Allocation with Subsidy | Haris Aziz, Xin Huang, Kei Kimura, Indrajit Saha, Zhaohong Sun, Mashbat Suzuki, Makoto Yokoo | |
1141 | Learning Flexible Heterogeneous Coordination With Capability-Aware Shared Hypernetworks | Kevin Fu, Pierce Howell, Shalin Jain, Harish Ravichandar | |
1154 | Evaluating and Improving Graph-based Explanation Methods for Multi-Agent Coordination | Siva Kailas, Shalin Jain, Harish Ravichandar | |
1155 | Liquid Welfare and Revenue Monotonicity in Adaptive Clinching Auctions | Ryosuke Sato | |
1163 | Open-World Classification with Bayesian Gaussian Mixture Models | Justin Clarke, Przemyslaw A. Grabowicz, David Jensen | |
1194 | Asynchronous Cooperative Multi-Agent Reinforcement Learning with Limited Communication | Sydney Dolan, Siddharth Nayak, Jasmine Jerry Aloor, Hamsa Balakrishnan | |
1204 | Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs | Ofer Dagan, Tyler Becker, Zachary N Sunberg | |
1210 | Matching Markets with Chores | Thorben Tröbst, Jugal Garg, Vijay Vazirani | |
1216 | EconTwo: A Two-Level Multi-Agent Framework for Dynamic Macroeconomic Modeling with Shock Resilience | Zhixun Chen, Zijing Shi, Yaodong Yang, Meng Fang, Yali Du | |
1237 | Group-fair Facility Location Games with Externalities | Minming Li, Cheng Peng, Ying Wang, Houyu Zhou | |
1254 | Using Assistance Rewards Without Introducing Bias: Overcoming Sparse Rewards in Multi-Agent Reinforcement Learning | Yue Yang, Bernd Meyer, Frits De Nijs | |
1262 | Pure Nash Equilibrium and Strong Nash Equilibrum Computation in Aggregate Games | Jared Soundy, Mohammad T. Irfan, Hau Chan | |
1263 | On the existence of EFX allocations in multigraphs | Alkmini Sgouritsa, Minas Marios Sotiriou | Link |
1291 | Decoding Negotiation Dynamics: The Impact of Opponent Identity and Privacy on Strategy | Deception, And Emotional Transparency In Human-Agent Interaction, Nusrath Jahan, Johnathan Mell | |
1297 | Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning | Lunjun Liu, Weilai Jiang, Yaonan Wang | |
1308 | Will Systems of LLM Agents Lead to Cooperation: An Investigation into a Social Dilemma | Richard Willis, Yali Du, Joel Z Leibo | Link |
1335 | To Stand on the Shoulders of Giants: Should We Protect Initial Discoveries in Multi-Agent Exploration? | Hodaya Lampert, Reshef Meir, Kinneret Teodorescu | |
1348 | Coordinating Competing Electric Vehicle Fleets: An Agent-Based Charging Capacity Market | Lennard Sund, Janik Muires, Ramin Ahadi, Konstantina Valogianni, Wolfgang Ketter | Link |
1359 | Fusing Physical and Cognitive Stimuli: An Eye Movement Emotion Recognition Framework Based on Hierarchical Attention Mechanism | Zhi Lin Li, Xiaomei Tao | |
1366 | Adaptive Microtolling in Competitive Online Congestion Games via Multiagent Reinforcement Learning | Behrad Koohy, Sebastian Stein, Enrico Gerding | |
1384 | Context Adaptive Memory-Efficient LLM Inference for Edge Multi-Agent Systems | Hamza Mohammed, Sai Chand Boyapati, Hang Yin | |
1395 | A Minimalist Approach to Augmentation-based Self-supervised Representation Learning for On-policy Reinforcement Learning | Nasik Muhammad Nafi, William Hsu |