mayank1729
's Collections
Papers
updated
How Do Large Language Models Acquire Factual Knowledge During
Pretraining?
Paper
•
2406.11813
•
Published
•
31
From RAGs to rich parameters: Probing how language models utilize
external knowledge over parametric information for factual queries
Paper
•
2406.12824
•
Published
•
21
Tokenization Falling Short: The Curse of Tokenization
Paper
•
2406.11687
•
Published
•
16
Iterative Length-Regularized Direct Preference Optimization: A Case
Study on Improving 7B Language Models to GPT-4 Level
Paper
•
2406.11817
•
Published
•
13
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens
Grounding
Paper
•
2406.19263
•
Published
•
10
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation
Paper
•
2406.19215
•
Published
•
30
LiteSearch: Efficacious Tree Search for LLM
Paper
•
2407.00320
•
Published
•
38
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
•
2406.20094
•
Published
•
97
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via
Dynamic Sparse Attention
Paper
•
2407.02490
•
Published
•
23
Is It Really Long Context if All You Need Is Retrieval? Towards
Genuinely Difficult Long Context NLP
Paper
•
2407.00402
•
Published
•
22
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
•
2407.00653
•
Published
•
11
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on
Edge
Paper
•
2407.00088
•
Published
•
10
Show Less, Instruct More: Enriching Prompts with Definitions and
Guidelines for Zero-Shot NER
Paper
•
2407.01272
•
Published
•
8
To Forget or Not? Towards Practical Knowledge Unlearning for Large
Language Models
Paper
•
2407.01920
•
Published
•
14
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
•
2407.01489
•
Published
•
59
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper
•
2407.01370
•
Published
•
86
How do you know that? Teaching Generative Language Models to Reference
Answers to Biomedical Questions
Paper
•
2407.05015
•
Published
•
4
SEED-Story: Multimodal Long Story Generation with Large Language Model
Paper
•
2407.08683
•
Published
•
22
Inference Performance Optimization for Large Language Models on CPUs
Paper
•
2407.07304
•
Published
•
52
Case2Code: Learning Inductive Reasoning with Synthetic Data
Paper
•
2407.12504
•
Published
•
8
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
76
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper
•
2408.15545
•
Published
•
35