GenBench

non-profit

https://genbench.github.io

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

koustuvs authored a paper 12 days ago

CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text

koustuvs authored a paper 12 days ago

Learning an Unreferenced Metric for Online Dialogue Evaluation

koustuvs authored a paper 12 days ago

Evaluating Gender Bias in Natural Language Inference

View all activity

genbench's activity

koustuvs

authored 5 papers 12 days ago

How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts

Paper • 2205.10762 • Published May 22, 2022

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Paper • 2412.14164 • Published Dec 18, 2024 • 4

yanaiela

authored a paper 4 months ago

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Paper • 2410.19133 • Published Oct 24, 2024 • 11

kazemnejad

authored 3 papers 4 months ago

The Impact of Positional Encoding on Length Generalization in Transformers

Paper • 2305.19466 • Published May 31, 2023 • 2

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 24

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

Paper • 2305.14775 • Published May 24, 2023

yanaiela

authored 5 papers 5 months ago

Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection

Paper • 2004.07667 • Published Apr 16, 2020

Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation

Paper • 2305.16938 • Published May 26, 2023

Text-based NP Enrichment

Paper • 2109.12085 • Published Sep 24, 2021

A Survey on Data Selection for Language Models

Paper • 2402.16827 • Published Feb 26, 2024 • 4

Lexical Generalization Improves with Larger Models and Longer Training

Paper • 2210.12673 • Published Oct 23, 2022

yanaiela

authored a paper 6 months ago

Data Contamination Report from the 2024 CONDA Shared Task

Paper • 2407.21530 • Published Jul 31, 2024 • 10

kaleidophon

authored a paper 10 months ago

Recoding latent sentence representations -- Dynamic gradient-based activation modification in RNNs

Paper • 2101.00674 • Published Jan 3, 2021 • 1

kaleidophon

authored 4 papers 11 months ago

TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

Paper • 2402.12991 • Published Feb 20, 2024

Trust Issues: Uncertainty Estimation Does Not Enable Reliable OOD Detection On Medical Tabular Data

Paper • 2011.03274 • Published Nov 6, 2020

Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at Reliable OOD Detection

Paper • 2012.05329 • Published Dec 9, 2020

deep-significance - Easy and Meaningful Statistical Significance Testing in the Age of Neural Networks

Paper • 2204.06815 • Published Apr 14, 2022

AI & ML interests

Recent Activity

Team members 5

genbench's activity