Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Conservative Q learning for Offline Reinforcement Learning

We develop a conservative Q-learning (CQL) algorithm, such that the expected value of a policy under the learned Q-function lower-bounds its true value. A lower bound on the Q-value prevents the over-estimation that is common in offline RL settings due to OOD actions and function approximation error. We start by focusing on policy evaluation step in CQL, which could be used by itself as an off-policy evaluation procedure, or integrated into a complete offline RL algorithm.

Building an Academic Website

If you’re an academic, you need a website. Obviously I agree with this since you’re reading this on my website, but if you don’t have one, you should get one. Most universities these days provide a free option, usually powered by WordPress (both WashU and UNC use WordPress for their respective offerings). While these sites are quick to set up and come with the prestige of a .edu URL, they have several drawbacks that have been extensively written on.

projects

publications

Coupling Deep Textural and Shape Features for Sketch Recognition

Qi Jia, Xin Fan, Meiyu Yu, Yuqing Liu, Dingrong Wang, and Longin Jan Latecki. 2020. Coupling Deep Textural and Shape Features for Sketch Recognition.

Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval

Wang, D., Sapkota, H., Liu, X., & Yu, Q. (2021, December). Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval. In 2021 IEEE International Conference on Data Mining (ICDM) (pp. 669-678). IEEE.

Deep Temporal Sets with Evidential Reinforced Attentions for Unique Behavioral Pattern Discovery

ICML 2023 (In Press)

Reinforcement learning guided Adaptive Region Selection

Working paper.

Conservative Evidential Learning of Long-Term User Preferences

Working paper.

Distributionally Robust Ensemble of Lottery Tickets Towards Calibrated Sparse Network Training

NIPS 2023 (In Press)

LIBR+: Improving Intraoperative Liver Registration by Learning the Residual of Biomechanics-Based Deformable Registration..

MICCAI 2024 (In Press)

Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness.

KDD 2024 (In Press)

research

talks