taesiri's picture

taesiri PRO

taesiri

·

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

updated a dataset 39 minutes ago

taesiri/HumanHandsDatasetFingerCounts

upvoted a paper about 4 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

upvoted a paper about 4 hours ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

View all activity

Organizations

taesiri's activity

upvoted 3 papers about 4 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published about 24 hours ago • 34

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 1 day ago • 53

MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents

Paper • 2502.05957 • Published 2 days ago • 5

upvoted a paper about 16 hours ago

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Paper • 2502.03628 • Published 6 days ago • 9

upvoted 8 papers 1 day ago

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Paper • 2502.03738 • Published 6 days ago • 8

No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

Paper • 2502.04959 • Published 4 days ago • 9

Generating Symbolic World Models via Test-time Scaling of Large Language Models

Paper • 2502.04728 • Published 4 days ago • 15

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published 4 days ago • 18

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published 5 days ago • 42

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Paper • 2502.05176 • Published 4 days ago • 24

Agency Is Frame-Dependent

Paper • 2502.04403 • Published 5 days ago • 17

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published 4 days ago • 57

upvoted 4 papers 5 days ago

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Paper • 2502.04306 • Published 5 days ago • 16

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 6 days ago • 37

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm

Paper • 2502.02358 • Published 7 days ago • 15

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Paper • 2502.01105 • Published 8 days ago • 16

upvoted a collection 5 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 5 days ago • 227

upvoted 3 papers 6 days ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 6 days ago • 44

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 153

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published 8 days ago • 31