Methods - a rivasmig Collection

rivasmig 's Collections

VLMs

Methods

Utility

Methods

updated 7 days ago

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

Paper • 2411.04952 • Published Nov 7, 2024 • 28
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Paper • 2411.05005 • Published Nov 7, 2024 • 13
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models

Paper • 2411.04075 • Published Nov 6, 2024 • 16
Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 17
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published Nov 5, 2024 • 67
The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 29 days ago • 90
Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53
Evaluating Sample Utility for Data Selection by Mimicking Model Weights

Paper • 2501.06708 • Published about 1 month ago • 5
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 28 days ago • 273
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering

Paper • 2501.05131 • Published Jan 9 • 34
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Paper • 2501.08326 • Published 28 days ago • 31
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Paper • 2501.08292 • Published 28 days ago • 17
FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation

Paper • 2502.01068 • Published 9 days ago • 14
Improving Transformer World Models for Data-Efficient RL

Paper • 2502.01591 • Published 8 days ago • 9
Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 11 days ago • 34
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

Paper • 2501.17433 • Published 14 days ago • 8