Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

Jupyter notebook markdown generator

Posts

Conservative Q-Learning for Offline Reinforcement Learning

March 07, 2023

We develop a conservative Q-learning (CQL) algorithm such that the expected value of a policy under the learned Q-function lower-bounds its true value. A lower bound on the Q-value prevents the over-estimation that is common in offline RL settings due to out-of-distribution (OOD) actions and function approximation error. We start by focusing on the policy evaluation step in CQL, which can be used by itself as an off-policy evaluation procedure or integrated into a complete offline RL algorithm.
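
As a rough illustration of the conservative idea in the excerpt above, the sketch below computes one commonly used form of the CQL regularizer for discrete actions: the log-sum-exp of Q over all actions minus the Q-value of the action actually present in the dataset. The function name `cql_penalty`, the `alpha` weight, and the list-of-lists input format are assumptions made for illustration, and this is not necessarily the exact variant the post goes on to discuss.

```python
import math

def cql_penalty(q_values, data_actions, alpha=1.0):
    """Hypothetical helper: average conservative penalty over a batch.

    q_values     -- list of rows, one per state, each row holding Q(s, a) for every action
    data_actions -- list of action indices taken in the offline dataset, one per state
    alpha        -- assumed trade-off weight on the penalty
    """
    total = 0.0
    for q_row, action in zip(q_values, data_actions):
        # Numerically stable log-sum-exp over all actions (a soft maximum of Q).
        q_max = max(q_row)
        log_sum_exp = q_max + math.log(sum(math.exp(q - q_max) for q in q_row))
        # Push down the soft maximum, push up the Q-value of the dataset action.
        total += log_sum_exp - q_row[action]
    return alpha * total / len(data_actions)

# The penalty is larger when a high Q-value sits on an action absent from the data.
q_batch = [[10.0, 0.0]]
penalty_seen_high = cql_penalty(q_batch, [0])    # dataset took the high-Q action
penalty_unseen_high = cql_penalty(q_batch, [1])  # high-Q action is out of distribution
```

Under these assumptions, adding the term to a standard Bellman error lowers Q-values on actions the dataset does not support, which is the source of the lower-bound behavior described in the excerpt.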

Building an Academic Website

June 30, 2020

If you’re an academic, you need a website. Obviously, I agree with this since you’re reading this on my website, but if you don’t have one, you should get one. Most universities these days provide a free option, usually powered by WordPress (both WashU and UNC use WordPress for their respective offerings). While these sites are quick to set up and come with the prestige of a .edu URL, they have several drawbacks that have been written about extensively.

Projects

Chinese OCR Web Search System

Underwater object trajectory tracking with biological whiskers’ matrix

Publications

Coupling Deep Textural and Shape Features for Sketch Recognition

Qi Jia, Xin Fan, Meiyu Yu, Yuqing Liu, Dingrong Wang, and Longin Jan Latecki. 2020. Coupling Deep Textural and Shape Features for Sketch Recognition.

Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval

Wang, D., Sapkota, H., Liu, X., & Yu, Q. (2021, December). Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval. In 2021 IEEE International Conference on Data Mining (ICDM) (pp. 669-678). IEEE.

Deep Temporal Sets with Evidential Reinforced Attentions for Unique Behavioral Pattern Discovery

ICML 2023 (In Press)

Distributionally Robust Ensemble of Lottery Tickets Towards Calibrated Sparse Network Training

NeurIPS 2023 (In Press)

LIBR+: Improving Intraoperative Liver Registration by Learning the Residual of Biomechanics-Based Deformable Registration

MICCAI 2024 (In Press)

Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness

KDD 2024

Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection

NeurIPS 2024

Conservative Evidential Learning of Long-Term User Preferences

ICLR 2025 (In Press)

Dingrong Wang