![](https://cdn-avatars.huggingface.co/v1/production/uploads/60f1abe7544c2adfd699860c/QssJRzc60u8flpjsXWSBF.png)
ICML2023
AI & ML interests
None defined yet.
Recent Activity
View all activity
ICML2023's activity
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6266513d539521e602b5dc3a/qg0fmVTGNKEFL7feyvQNh.png)
ameerazam08
posted
an
update
12 days ago
Post
1601
R1 is out! And with a lot of other R1 releated models...
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643012094339-61914f536d34e827404ceb99.jpeg)
hysts
updated
a
Space
about 1 month ago
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62bfc80b54db40ca617ce53a/IZxafMqoYJNHwTuFt8AqF.png)
vwxyzjn
authored
5
papers
about 1 month ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Paper
•
2403.17031
•
Published
•
6
A2C is a special case of PPO
Paper
•
2205.09123
•
Published
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Paper
•
2410.18252
•
Published
•
5
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
•
2411.15124
•
Published
•
59
2 OLMo 2 Furious
Paper
•
2501.00656
•
Published
•
16
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62fa1d95e8c9c532aa75331c/WFfk_n8gOj845pSkfdazA.jpeg)
mbrack
authored
a
paper
about 2 months ago
Post
9166
Google drops Gemini 2.0 Flash Thinking
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
![](https://cdn-avatars.huggingface.co/v1/production/uploads/66347c0bc269602beb44b6fa/QAOAMszD3-b75ahSIoyh_.png)
Kameshr
authored
a
paper
2 months ago
Post
10041
QwQ-32B-Preview is now available in anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
Post
3974
Post
2926
anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1669164541734-61919e59d221f4281b3833d5.jpeg)
xzyao
authored
a
paper
3 months ago
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678735253848-640f7083208821a59b74c757.jpeg)
Lupin1998
authored
2
papers
4 months ago