tongjingqi/AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.

404 stars · 11 forks · Apache-2.0

agent · ai-innovator · ai-scientists · rl

Rankings

Daily: #5566
Weekly: #8888
Monthly: #10466

Get badge

This repository is tracked by Trending Repos. The badge upgrades automatically if it ever cracks the top 100.

Markdown

![Trending Repos](https://trending-repos.com/badge/tongjingqi/AI-Can-Learn-Scientific-Taste.svg)

HTML

<img src="https://trending-repos.com/badge/tongjingqi/AI-Can-Learn-Scientific-Taste.svg" alt="Trending Repos" />

Raw URL

https://trending-repos.com/badge/tongjingqi/AI-Can-Learn-Scientific-Taste.svg
