Junlin Zhou's picture

Junlin Zhou

jlzhou

·

edwardzjl

AI & ML interests

None yet

Recent Activity

commented on a paper about 12 hours ago

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

reacted to schuler's post with 👍 about 13 hours ago

📢 New Research Alert: Making Language Models Smaller & Smarter! Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance. The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena. 🔑 Key Findings: • 77% parameter reduction. • Maintained model capabilities. • Improved generalization. Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT Code: https://github.com/joaopauloschuler/less-parameters-llm

upvoted an article about 14 hours ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all activity

Organizations

Articles 2

Article

1

Distributed SFT with trl and DeepSpeed Part 2: Scaling Locally

Article

1

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

View all Articles

Papers 1

arxiv:2307.08674

models 2

jlzhou/Qwen2.5-3B-Infinity-Instruct-0625

Text Generation • Updated 3 days ago • 27

jlzhou/ppo-LunarLander-v2

Reinforcement Learning • Updated 6 days ago • 6

datasets

None public yet