Research

My academic research falls into two main areas: Reinforcement learning research and applications, where I develop novel algorithms to handle well-known difficult problems in RL training, such as offline learning, uncertainty-aware RL, semi-supervised RL to handle data limitation and sub-optimality problems.

My other main research agenda uses advanced RL methods combining with uncertainty theory to help predict user’s preference on movies (recommender system, multi-modal fusion), predict ASD patients from time series data recording from their reactions when playing a game (medicine, time sequence prediction), and CV related image based sketch retrieval, object detection, etc.

Now, my focus is on applying RL to language models for text summarization and traditional backbone networks for architecture search to solve Lottery Ticket Hypothesis problem.