Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings Paper • 1904.10635 • Published Apr 24, 2019
The Woman Worked as a Babysitter: On Biases in Language Generation Paper • 1909.01326 • Published Sep 3, 2019
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems Paper • 2310.05280 • Published Oct 8, 2023 • 1
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems Paper • 2305.07797 • Published May 12, 2023
Mitigating Bias for Question Answering Models by Tracking Bias Influence Paper • 2310.08795 • Published Oct 13, 2023
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters Paper • 2310.09219 • Published Oct 13, 2023
Evaluating Large Language Models on Controlled Generation Tasks Paper • 2310.14542 • Published Oct 23, 2023
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos Paper • 2311.01620 • Published Nov 2, 2023
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model Paper • 2305.16734 • Published May 26, 2023
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks Paper • 2311.00288 • Published Nov 1, 2023
DesCo: Learning Object Recognition with Rich Language Descriptions Paper • 2306.14060 • Published Jun 24, 2023 • 1
Next Steps for Human-Centered Generative AI: A Technical Perspective Paper • 2306.15774 • Published Jun 27, 2023
Model Editing Can Hurt General Abilities of Large Language Models Paper • 2401.04700 • Published Jan 9, 2024 • 3
DEGREE: A Data-Efficient Generation-Based Event Extraction Model Paper • 2108.12724 • Published Aug 29, 2021
Socially Aware Bias Measurements for Hindi Language Representations Paper • 2110.07871 • Published Oct 15, 2021
On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark Paper • 2110.08466 • Published Oct 16, 2021