Yoann Poupart
Xmaster6y
AI & ML interests
AI Safety | Interpretability | LLM | RL
Recent Activity
updated
a dataset
1 day ago
LuxWorld/trajectories
upvoted
a
paper
2 days ago
Analyze Feature Flow to Enhance Interpretation and Steering in Language
Models
upvoted
a
paper
2 days ago
Mechanistic Permutability: Match Features Across Layers