| | |
184 | Human-Readable Neuro-Fuzzy Networks from Frequent Yet Discernible Patterns in Reward-Based Environments | John Wesley Hostetter, Adittya Soukarjya Saha, Md Mirajul Islam, Tiffany Barnes, Min Chi |
16 | Adapting Beyond the Depth Limit: Counter Strategies in Large Imperfect Information Games | David Milec, Vojtech Kovarik, Viliam Lisý |
5 | PANDA: Priority-Based Collision Avoidance Framework for Heterogeneous UAVs Navigating in Dense Airspace | Agamdeep Singh, Jaskirat Singh, Sujit Pb |
590 | AlphaZeroES: Direct Score Maximization Outperforms Planning Loss Minimization | Carlos Martin, Tuomas Sandholm |
824 | Equilibrium selection via communication partition | Wei-Chen Lee, Alessandro Abate, Michael J. Wooldridge |
1042 | Learning to explore when mistakes are not allowed | Charly Pecqueux-Guézénec, Stephane Doncieux, Nicolas Perrin-Gilbert |
386 | Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach | Xinjie Liu, Jingqi Li, Filippos Fotiadis, Mustafa O. Karabag, Jesse Milzman, David Fridovich-Keil, Ufuk Topcu |
507 | Making Universal Policies Universal | Niklas Hoepner, David Kuric, Herke Van Hoof |
1262 | Pure Nash Equilibrium and Strong Nash Equilibrum Computation in Aggregate Games | Jared Soundy, Mohammad T. Irfan, Hau Chan |
155 | Transformer Guided Coevolution: Improved Team Formation in Multiagent Adversarial Games | Pranav Rajbhandari, Prithviraj Dasgupta, Donald Sofge |
1141 | Learning Flexible Heterogeneous Coordination With Capability-Aware Shared Hypernetworks | Kevin Fu, Pierce Howell, Shalin Jain, Harish Ravichandar |
1155 | Liquid Welfare and Revenue Monotonicity in Adaptive Clinching Auctions | Ryosuke Sato |
1021 | Efficient Multi-Agent Delegated Search | Curtis Bechtel, Shaddin Dughmi |
897 | Can you see how I learn? Human observers’ inferences about Reinforcement Learning agents’ learning processes | Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens |
103 | Truman: A Large Language Model-based Multi-agent Simulator for Synthetic Money Laundering Data Generation | Dattatray Vishnu Kute, Zihao Xu, Yuekang Li, Fethi A Rabhi |
1194 | Asynchronous Cooperative Multi-Agent Reinforcement Learning with Limited Communication | Sydney Dolan, Siddharth Nayak, Jasmine Jerry Aloor, Hamsa Balakrishnan |
1040 | Formal Verification of Manipulation Dialogues | Andreas Brännström, Chiaki Sakama, Juan Carlos Nieves |
173 | Shifting Power: Leveraging LLMs to Simulate Human Aversion in ABMs of Bilateral Financial Exchanges | A Bond Market Study., Alicia Vidler, Toby Walsh |
755 | Model of the influence of external signals on the trust of the agent in Multi Agent System | Frederique Lalieu, Tomasz Zurek, Tom Van Engers |
842 | Experience-replay Innovative Dynamics | Tuo Zhang, Leonardo Stella, Julian Barreiro-Gomez |
682 | Satisfactory Budget Division | Laurent Gourvès, Michael Lampis, Nikolaos Melissinos, Aris Pagourtzis |
267 | Stochastic k-Submodular Bandits with Full Bandit Feedback | Guanyu Nie, Vaneet Aggarwal, Christopher John Quinn |
586 | Practical Comparisons of Reservoir Topology Performance and Input Distribution in Digital Reservoir Computers | Lewis Thelen, Vikram Ravindra |
181 | Multi-Agent Systems for Bullying Intervention | Luis Zhinin-Vera, José J González-García, Víctor López-Jaquero, Elena Navarro, Pascual González |
520 | Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity | Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor |
673 | The Costly Bargain: Economic Impacts of Price-Seeking Behavior in Aging Populations | Fuguang Chen, Alan Tsang |
869 | Agential AI for Integrated Continual Learning | Deliberative Behavior, And Comprehensible Models, Zeki Doruk Erden, Boi Faltings |
429 | Parameterized Complexity of Hedonic Games with Enemy-Oriented Preferences | Martin Durand, Laurin Erlacher, Johanne Müller Vistisen, Sofia Simola |
1025 | Fairness in Cooperative Multiagent Multiobjective Reinforcement Learning using the Expected Scalarized Reward Criterion | Fares Chouaki, Aurélie Beynier, Nicolas Maudet, Paolo Viappiani |
557 | Predictive Improvement through Latent Space Optimisation | Alexander McCaffrey, Eduardo Alonso, Esther Mondragon |
644 | FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation | Wenzheng Jiang, Ji Wang, Xiongtao Zhang, Weidong Bao, Cheston Tan, Flint Xiaofeng Fan |
890 | Improving the effectiveness of potential-based reward shaping in reinforcement learning | Henrik Müller, Daniel Kudenko |
906 | Distributed Adaptive Macroscopic Ensemble Task Allocation of Heterogeneous Robot Teams in Dynamic Environments | Victoria Edwards, M. Ani Hsieh |
310 | Dynamic Agent Replacement: Optimizing Team Performance within Adversarial Environments | Gregory Everett, Ryan Beal, Tim Matthews, Timothy J. Norman, Sarvapali Ramchurn |
541 | ChatBDI: Think BDI | Talk Llm, Andrea Gatti, Viviana Mascardi, Angelo Ferrando |
311 | IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems | Yihuan Mao, Yipeng Kang, Peilun Li, Ning Zhang, Wei Xu, Chongjie Zhang |
464 | Social Ranking for Feature Selection | Laurent Gourvès, Stefano Moretti, Satya Tamby |
1125 | Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games | Zixuan Wu, Sean Charles Ye, Manisha Natarajan, Matthew Gombolay |
117 | Resource Allocation under the Latin Square Constraint | Yasushi Kawase, Bodhayan Roy, Mohammad Azharuddin Sanpui |
113 | Local Anomaly Detection with Partial Observation in Multi-agent Systems as a Data Matching Game | Zixin Ye, Christopher Leckie, Tansu Alpcan |
863 | CPE: A New Paradigm for Policy Extraction in Offline Reinforcement Learning | Zhaohui Yang, Xiaoxuan Wang, Linjing Li |
1002 | Action-Dependent Optimality-Preserving Reward Shaping | Grant Collier Forbes, Jianxun Wang, Leonardo Villalobos-Arias, Arnav Jhala, David Roberts |
525 | LogiEx: Integrating Formal Logic and Large Language Model for Explainable Planning | Ziyan An, Xia Wang, Hendrik Baier, Zirong Chen, Abhishek Dubey, Taylor T Johnson, Jonathan Sprinkle, Ayan Mukhopadhyay, Meiyi Ma |
971 | Combining Normative Ethics Principles to Learn Prosocial Behaviour | Jessica Woodgate, Nirav Ajmeri |
274 | Learning Fair and Preferable Allocations through Neural Network | Ryota Maruo, Koh Takeuchi, Hisashi Kashima |
975 | Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods | Tyler Becker, Zachary N Sunberg |
304 | Enhancing Offline Safe Reinforcement Learning with Trajectory-Constrained Diffusion Planning | Hengrui Zhang, Youfang Lin, Shuo Shen, Hanfeng Lin, Peng Cheng, Sheng Han, Kai Lv |
172 | Robust Strategies for Stochastic Multi-Agent Systems | Raphaël Berthon, Joost-Pieter Katoen, Munyque Mittelmann, Aniello Murano |
970 | Weighted Envy Freeness With Bounded Subsidies | Noga Klein Elmalem, Rica Gonen, Erel Segal-Halevi |
747 | Multi-Agent Pickup and Delivery with Batteries | Marcello Bavaro, Francesco Amigoni |
365 | Requirements-based Explainability for Multi Agent Systems | Sebastian Rodriguez, John Thangarajah, Michael Winikoff |
448 | Interaction Protocols in an Imperative Agent-Oriented Programming Language: the case of BSPL and SARL | Matteo Baldoni, Cristina Baroglio, Stéphane Galland, Roberto Micalizio, Fatma Outay, Stefano Tedeschi |
949 | Trading-off Accuracy and Communication Cost in Federated Learning | Mattia Jacopo Villani, Emanuele Natale, Frederik Mallmann-Trenn |
1297 | Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning | Lunjun Liu, Weilai Jiang, Yaonan Wang |
75 | Efficient Model Checking with Semantically-Equivalent Models for vGOAL | Yi Yang, Tom Holvoet |
134 | Neighborhood Stability in Assignments on Graphs | Haris Aziz, Grzegorz Lisowski, Mashbat Suzuki, Jeremy Vollen |
217 | Learning Heterogeneous Agent Collaboration in Decentralized Multi-Agent Systems via Intrinsic Motivation | Jahir Sadik Monon, Deeparghya Dutta Barua, Md Mosaddek Khan |
433 | Egalitarianism in Online Coalition Formation | Saar Cohen, Noa Agmon |
223 | Fast Adaption by Policy Deviation Integral Meta-reinforcement Learning with Applications to High-speed Trains Operation | Haotong Zhang, Wanyuan Wang |
1090 | Diverse Heterogeneous Graph Conditioned Diffusion for Multi-Agent Teaming | Luis Manuel Pimentel, Sean Charles Ye, James Ellis Grant Pagan, Matthew Gombolay |
109 | Predicting Team Performance from Communications in Simulated Search-and-Rescue | Ali Jalal-Kamali, Nikolos M Gurney, David V. Pynadath |
444 | Distributed Value Decomposition Networks with Networked Agents | Guilherme S. Varela, Alberto Sardinha, Francisco S. Melo |
656 | Navigating Social Dilemmas with LLM-based Agents via Consideration of Future Consequences | Dung Nguyen, Hung Le, Kien Do, Sunil Gupta, Svetha Venkatesh, Truyen Tran |
547 | Reasoning and Planning with Dynamic Social Norms | Taylor Olson, Roberto Salas-Damian, Kenneth Forbus |
1210 | Matching Markets with Chores | Thorben Tröbst, Jugal Garg, Vijay Vazirani |
768 | Quantitative Operational Monitoring for BDI Agents | Marie Farrell, Angelo Ferrando, Mengwei Xu |
463 | CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Zeyuan Liu, Yang Kai, Jiafei Lyu, Xiu Li |
972 | Participatory Budgeting Project Strength via Candidate Control | Piotr Faliszewski, Łukasz Janeczko, Dušan Knop, Jan Pokorný, Šimon Schierreich, Mateusz Słuszniak, Krzysztof Sornat |
1335 | To Stand on the Shoulders of Giants: Should We Protect Initial Discoveries in Multi-Agent Exploration? | Hodaya Lampert, Reshef Meir, Kinneret Teodorescu |
123 | Group Fairness in Multi-period Mobile Facility Location Problems | Haris Aziz, Hau Chan, Xingchen Sha, Toby Walsh, Lirong Xia |
960 | Less is More: Simple and Scalable Out-of-Distribution Dynamics Detection Using RBF Kernel-Based Descriptors | Tala Jafari, Philip Torr, Varun Kanade, Christian Schroeder De Witt |
677 | Traffic Anomaly Detection through Generative Modeling of Multi-Agent Interactions in Traffic Flow | Zhuojun Chen, Tacitus Hui, Xinghua Zhu, Dongzhe Su |
364 | DyLam: A Dynamic Reward Weighting Framework for Reinforcement Learning Algorithms | Mateus Gonçalves Machado, Hansenclever Bassani |
348 | Enhancing Lifelong Multi-Agent Path-finding by Using Artificial Potential Fields | Arseniy Pertzovsky, Roni Stern, Roie Zivan, Ariel Felner |
26 | Offline Meta Reinforcement Learning with Weighted Policy Constraints and Proximal Context Collection | Haorui Li, Jiaqi Liang, Linjing Li, Daniel Dajun Zeng |
588 | Dynamic Reward Sharing to Enhance Learning in the Context of Multiagent Teams | Kyle Tilbury, David Radke |
705 | Mitigating Non-Stationarity in Deep Reinforcement Learning with Clustering Orthogonal Weight Modification | Guoqing Ma, Yuhan Zhang, Yuming Dai, Guangfu Hao, Yang Chen, Shan Yu |
1366 | Adaptive Microtolling in Competitive Online Congestion Games via Multiagent Reinforcement Learning | Behrad Koohy, Sebastian Stein, Enrico Gerding |
1263 | On the existence of EFX allocations in multigraphs | Alkmini Sgouritsa, Minas Marios Sotiriou |
860 | Entropic Exploration for Constrained Multiagent Reinforcement Learning | Ayhan Alp Aydeniz, Enrico Marchesini, Robert Loftin, Christopher Amato, Kagan Tumer |
296 | Adaptive Offline Data Replay in Offline-to-Online Reinforcement Learning | Xu Liu, Tong Yu, Shuai Li |
876 | Runtime Verification with Rational Multi-Monitors | Davide Catta, Angelo Ferrando, Vadim Malvone |
689 | Decentralized Deep Reinforcement Learning for Cooperative Multi-Agent Flight Trajectory Planning in Adverse Weather | Bizhao Pang, Mingcheng Zhang, Xinting Hu, Sameer Alam, Guglielmo Lulli |
366 | The Effectiveness of Best-Response Dynamics in Reducing Price of Anarchy for Markov Potential Games | Dingyang Chen, Xiaoling Zeng, Thinh T. Doan, Qi Zhang |
7 | Nash Equilibrium and Learning Dynamics in Three-Player Matching m-Action Games | Yuma Fujimoto, Kaito Ariu, Kenshi Abe |
452 | Compensating latent nonlinear dynamics for practical consensus control | Krzysztof Kowalczyk, Dominik Baumann, Cristian R. Rojas, Paweł Wachel |
260 | Dynamic Conservative Degree Allocation for Offline Multi-Agent Reinforcement Learning | Haosheng Chen, Yun Hua, Junjie Sheng, Wenhao Li, Bo Jin, Xiangfeng Wang |
1053 | Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools | Panagiotis Lymperopoulos, Vasanth Sarathy |
396 | Optimal Mechanism Design for Crowdfunding of Public Goods | Yukun Cheng, Xiaotie Deng, Baqiao Quan |
350 | On-Policy Reinforcement Learning From Failure via Sparse Reward Densification | Mingkang Wu, Yongcan Cao |
1216 | EconTwo: A Two-Level Multi-Agent Framework for Dynamic Macroeconomic Modeling with Shock Resilience | Zhixun Chen, Zijing Shi, Yaodong Yang, Meng Fang, Yali Du |
1138 | Weighted Envy-free Allocation with Subsidy | Haris Aziz, Xin Huang, Kei Kimura, Indrajit Saha, Zhaohong Sun, Mashbat Suzuki, Makoto Yokoo |
1359 | Fusing Physical and Cognitive Stimuli: An Eye Movement Emotion Recognition Framework Based on Hierarchical Attention Mechanism | Zhi Lin Li, Xiaomei Tao |
1104 | Symplex: Learning social norm hierarchies by combining autonomous exploration and expert imitation | Oliver Deane, Oliver Ray |
188 | Heuristics-Assisted Experience Replay Strategy for Cooperative Multi-Agent Reinforcement Learning | Xie Yi, Ziqing Zhou, Chun Ouyang, Siao Liu, Linqiang Hu, Zhongxue Gan |
862 | Decision-Making in Evolving Environments: A Bayesian Multi-Agent Bandit Framework | Mohammad Essa Alsomali, Leandro Soriano Marcolino, Barry Porter, Roberto Rodrigues-Filho |
957 | Is an exponentially growing action space really that bad? Validating a Core Assumption for using Multi-Agent RL | Ruan John De Kock, Arnu Pretorius, Jonathan P. Shock |
735 | Multiplayer Games With Incomplete Information for Hyperproperty Verification | Raven Beutner, Bernd Finkbeiner |
253 | Negotiated Reasoning: On Provably Addressing Relative Over-Generalization | Junjie Sheng, Wenhao Li, Bo Jin, Hongyuan Zha, Jun Wang, Xiangfeng Wang |
505 | Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning | Changxi Zhu, Mehdi Dastani, Shihan Wang |
844 | Multi-Agent Reinforcement Learning with Selective State-Space Models | Jemma Daniel, Ruan John De Kock, Louay Ben Nessir, Sasha Abramowitz, Omayma Mahjoub, Wiem Khlifi, Juan Claude Formanek, Arnu Pretorius |
584 | Hierarchical Multi-agent Reinforcement Learning for Cyber Network Defense | Aditya Vikram Singh, Ethan Rathbun, Emma Graham, Lisa Oakley, Simona Boboila, Alina Oprea, Peter Chin |
626 | Rethinking Explainable AI: Explanations can be Deceiving | Peta Masters, Daniel Gallagher, Luc Moreau, Mor Vered |
1348 | Coordinating Competing Electric Vehicle Fleets: An Agent-Based Charging Capacity Market | Lennard Sund, Janik Muires, Ramin Ahadi, Konstantina Valogianni, Wolfgang Ketter |
236 | Learning Robust Representations for Visual Reinforcement Learning via Task-Relevant Mask Sampling | Vedant Dave, Ozan Ozdenizci, Elmar Rueckert |
278 | Towards Automating the Design of Value-Aligned Clinical Protocols | Manel Rodriguez-Soto, Nardine Osman, Carles Sierra, Rocio Cintas-Garcia, Cristina Farriols-Danes, Montserrat Garcia-Retortillo, Silvia Minguez-Maso, Jordi Martinez-Roldan |
723 | Environmental Policies within Cournot Oligopoly | Liang Shan, Zhengyang Liu, Haoqiang Huang, Zihe Wang |
1254 | Using Assistance Rewards Without Introducing Bias: Overcoming Sparse Rewards in Multi-Agent Reinforcement Learning | Yue Yang, Bernd Meyer, Frits De Nijs |
736 | RainbowArena: A Multi-Agent Toolkit for Reinforcement Learning and Large Language Models in Competitive Tabletop Games | Yingzhuo Liu, Shuodi Liu, Hongsong Tang, Yubing Ma, Zikang Li, Junge Zhang, Liuyu Xiang, Zhaofeng He |
1154 | Evaluating and Improving Graph-based Explanation Methods for Multi-Agent Coordination | Siva Kailas, Shalin Jain, Harish Ravichandar |
1395 | A Minimalist Approach to Augmentation-based Self-supervised Representation Learning for On-policy Reinforcement Learning | Nasik Muhammad Nafi, William Hsu |
175 | Observer-Aware Probabilistic Planning under Partial Observability | Salomé Lepers, Vincent Thomas, Olivier Buffet |
978 | Towards Fair and Efficient Policy Learning in Cooperative Multi-Agent Reinforcement Learning | Umer Siddique, Peilang Li, Yongcan Cao |
161 | Online housing market | Julien Lesca |
901 | Memory Assignment for Finite-Memory Strategies in Adversarial Patrolling Games | Vojtěch Kůr, Vít Musil, Vojtěch Řehák |
202 | Regret Guarantees for a UCB-based Algorithm for Volatile Combinatorial Bandits | Andra Siva Sai Teja, Kumar Abhishek, Sujit Gujar, Yadati Narahari, Ganesh Ghalme |
722 | SFedRec: A Federated Learning Framework for Dynamic Session-based Recommendation | Hexiao Zhang, Yanni Tang, Jiamou Liu, Wu Chen |
546 | Modeling the Collaborative Edge Data Caching Problem via a Dynamic DCOP | Ziyang Song, Ziyu Chen, Jinhui Huang, Cheng Zhang, Jingyuan He |
347 | On the Distortion of Multi-Winner Elections on the Line Metric | Negar Babashah, Hasti Karimi, Masoud Seddighin, Golnoosh Shahkarami |
993 | (Submodular) Hedonic Games with Common Ranking Property | Bugra Caskurlu, Ali Eser |
732 | Efficient Training of Generalizable Visuomotor Policies via Control-Aware Augmentation | Yinuo Zhao, Kun Wu, Tianjiao Yi, Zhiyuan Xu, Zhengping Che, Qinru Qiu, Chi Harold Liu, Jian Tang |
235 | RallyDiffuser: A Representation-Guided Diffusion Model Framework for Strategic Planning in Badminton | Bing-Zhi Ke, Kuang-Da Wang, Wen-Chih Peng |
275 | MORL4Water: A Modular Multi-Objective Reinforcement Learning Toolkit for Water Resource Management | Zuzanna Osika, Roxana Rădulescu, Jazmin Zatarain Salazar, Frans A Oliehoek, Pradeep K. Murukannaiah |
565 | Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning | Dmytro Kuzmenko, Nadiya Shvai |
449 | Managing an Agent’s Changing Intentions Using LTL𝑓 Synthesis | Giuseppe De Giacomo, Yves Lesperance, Gianmarco Parretti, Fabio Patrizi, Renzo Schram |
17 | Dynamic Option Creation in Option-Critic Reinforcement Learning | Mateus Begnini Melchiades, Gabriel De Oliveira Ramos, Bruno Castro Da Silva |
362 | Enhancing Robot Navigation Policies with Task-Specific Uncertainty Management | Gokul Puthumanaillam, Paulo Padrao, Jose Fuentes, Leonardo Bobadilla, Melkior Ornik |
1073 | Online Competitive Information Gathering for Partially Observable Trajectory Games | Mel Krusniak, Hang Xu, Parker Palermo, Forrest John Laine |
205 | Voter Participation Control in Online Polls | Koustav De, Palash Dey, Swagato Sanyal |
1204 | Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs | Ofer Dagan, Tyler Becker, Zachary N Sunberg |
88 | Empowering Generalization for Deep Reinforcement Learning via Symbolic Planning | Tianpei Yang, Srijita Das, Christabel Wayllace, Matthew E. Taylor |
1308 | Will Systems of LLM Agents Lead to Cooperation: An Investigation into a Social Dilemma | Richard Willis, Yali Du, Joel Z Leibo |
1237 | Group-fair Facility Location Games with Externalities | Minming Li, Cheng Peng, Ying Wang, Houyu Zhou |
426 | Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial Bandits | Briti Gangopadhyay, Zhao Wang, Alberto Silvio Chiappa, Shingo Takamatsu |
133 | Shapley Value-based Approach for Distributing Revenue of Matchmaking of Private Transactions in Blockchains | Rasheed, Parth Desai, Yash Chaurasia, Sujit Gujar |
317 | CADP: Towards Better Centralized Learning for Decentralized Execution in MARL | Yihe Zhou, Shunyu Liu, Yunpeng Qing, Tongya Zheng, Kaixuan Chen, Jie Song, Mingli Song |
190 | Fair Assignment on Multi-Stage Graphs | Vibulan J, Swapnil Dhamal, Shweta Jain, Ojassvi Kumar, Aman Kumar, Harpreet Singh |
1018 | Planning for Temporally Extended Goals based on alpha-CTL | Viviane Bonadia Dos Santos, Leliane N. De Barros, Maria Viviane De Menezes, Silvio Do Lago Pereira |
951 | When to Stop Getting Tested: The Theory of Diagnostic Tests | Anson Kahng, Joseph Saber |
222 | Leveraging Fully-Observable Solutions for Improved Partially-Observable Offline Reinforcement Learning | Chulabhaya Wijesundara, Andrea Baisero, Gregory David Castanon, Alan Carlin, Robert Platt, Christopher Amato |
1291 | Decoding Negotiation Dynamics: The Impact of Opponent Identity and Privacy on Strategy | Deception, And Emotional Transparency In Human-Agent Interaction, Nusrath Jahan, Johnathan Mell |
192 | Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization | Shengchao Hu, Wanru Zhao, Weixiong Lin, Li Shen, Ya Zhang, Dacheng Tao |
483 | Learning Pre-Trained Tacit Behavior for Efficient Multi-Agent Adversarial Coordination | Shiqing Yao, Jiajun Chai, Haixin Yu, Yongzhe Chang, Yuanheng Zhu, Xueqian Wang |
1070 | DECAF: Learning to be Fair in Multi-agent Resource Allocation | Ashwin Kumar, William Yeoh |
962 | Adaptive Multi-Round Influence Maximization with Limited Information | Diodato Ferraioli, Vincenzo Auletta, Cosimo Vinci, Francesco Carbone |
814 | Lite-DIO Is Actually What You Need for Efficient Inertial Localization | Yan Li, Meng Liu, Zhongchen Shi, Yanqing Hou, Liang Xie, Hongbo Chen, Erwei Yin |
991 | Diversity-seeking swap games in networks | Yaqiao Li, Lata Narayanan, Jaroslav Opatrny |
758 | What Is a Counterfactual Cause in Action Theories? | Daxin Liu, Vaishak Belle |
1163 | Open-World Classification with Bayesian Gaussian Mixture Models | Justin Clarke, Przemyslaw A. Grabowicz, David Jensen |
380 | Learning Bayesian Game Families, with Application To Mechanism Design | Madelyn Gatchel, Michael P. Wellman |
459 | SocialMP: Learning Social-awared Motion Patterns via Addictive Fusion for Pedestrian Trajectory Prediction | Tianci Gao, Yuzhen Zhang, Hang Guo, Pei Lv |
1384 | Context Adaptive Memory-Efficient LLM Inference for Edge Multi-Agent Systems | Hamza Mohammed, Sai Chand Boyapati, Hang Yin |
782 | Integrating Large Language Models with Reinforcement Learning for Generalization in Strategic Card Games | Wannian Xia, Meng Fang, Zihao Guo, Yali Du, Bo Xu |
826 | Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors | Lang Feng, Jiahao Lin, Dong Xing, Li Zhang, De Ma, Gang Pan |
877 | Where is the nearest EV charging station? Evolutionary optimization of the gas/charging stations topology | Enrique Mateos-Melero, Javier Moralejo-Piñas, Ángela Durán-Pinto, Francisco Martinez-Gil, María Soriano, Fernando Fernández |
952 | Cultural Evolution of Cooperation among LLM Agents | Aron Vallinder, Edward Hughes |
102 | ADAGE: A generic two-layer framework for adaptive agent based modelling | Benjamin Patrick Evans, Sihan Zeng, Sumitra Ganesh, Leo Ardon |