[LG]《Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF》S Belakaria, J Kazdan, C Marx, C Cundy... [Stanford University] (2025)
网页链接
#机器学习#
#人工智能#
#论文#
#AI创造营#