Theses

Publications

SACHA: Soft Actor-Critic with Heuristic-Based Attention for Partially Observable Multi-Agent Path Finding
Qiushi Lin, Hang Ma
IEEE Robotics and Automation Letters (RA-L) 2023 [pdf] [code]
TL;DR: We designed a novel multi-agent actor-critic reinforcement learning framework for partially observable multi-agent path finding. We integrated heuristic-based attention mechanisms to enable the learned model to generalize across multiple instances at large scale.

MFC-EQ: Mean Field Control with Envelope Q-learning for Moving Decentralized Agents in Formation
Qiushi Lin, Hang Ma
Preprint (In Submission) [pdf] [code]
TL;DR: We proposed an adaptable multi-objective multi-agent reinforcement learning algorithm that combines mean field control and envelope Q-learning for moving agents in formation, and provided both theoretical analysis and empirical evaluation.

(* = equal contribution)

On the Convergence Rates of Log-Linear Policy Gradient Methods [pdf] [code]
Qiushi Lin*, Matin Aghaei*, Anderson de Andrade*, Sharan Vaswani
TL;DR: We provided a general framework for deriving convergence rates of policy gradient methods for the log-linear policy class by reducing the problem to the tabular softmax setting. Building on this, we extended the theoretical guarantees of softmax policy gradient methods to obtain algorithms with convergence guarantees for log-linear policies under both exact and inexact policy evaluation.

A Survey of Apprenticeship Learning [pdf]
Qiushi Lin*, Ziqian Bai*, Minh Bui*, Jiaqi Tan*
TL;DR: We surveyed the literature on a few widely used apprenticeship learning algorithms and empirically evaluated them on a shared benchmark.