S. Yang

First name
S.
Last name
Yang
Yang, S. ., Feng, Y. ., Zhang, S. ., & Zhou, M. . (2022). Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning. In International conference on machine learning. Retrieved from https://par.nsf.gov/biblio/10340487