ICML RN-F: A Novel Approach for Mitigating Contaminated Data in Large Language Models

Poster
in
Workshop: DIG-BUGS: Data in Generative Models (The Bad, the Ugly, and the Greats)

RN-F: A Novel Approach for Mitigating Contaminated Data in Large Language Models

Vu Anh Le · Dinh Nguyen · Phi Nguyen · Keshav Sood

Keywords: [ quantization residuals ] [ anomaly detection ] [ data poisoning ] [ quantization ]

[ Abstract ] [ Project Page ]

[ OpenReview]

Sat 19 Jul 3 p.m. PDT — 3:45 p.m. PDT

Abstract:

Large Language Models (LLMs) have become foundational in modern artificial intelligence, powering a wide range of applications from code generation and virtual assistants to scientific research and enterprise automation. However, concerns about data contamination, where test data overlaps with training data, have raised serious questions about the reliability of these applications. Despite awareness of this issue, existing methods fall short in effectively identifying or mitigating contamination. In this paper, we propose Residual-Noise Fingerprinting (RN-F), a novel framework for detecting contaminated data in LLMs. RN-F is a single-pass, gradient-free detection method that leverages residual signal patterns without introducing additional floating-point operations. Our approach is lightweight, model-agnostic, and efficient. We evaluate RN-F on multiple LLMs across various contaminated datasets and show that it consistently outperforms existing state-of-the-art methods, achieving performance improvements of up to 11.1% in contamination detection metrics.

Chat is not available.

Poster in Workshop: DIG-BUGS: Data in Generative Models (The Bad, the Ugly, and the Greats)

RN-F: A Novel Approach for Mitigating Contaminated Data in Large Language Models

Vu Anh Le · Dinh Nguyen · Phi Nguyen · Keshav Sood

Poster
in
Workshop: DIG-BUGS: Data in Generative Models (The Bad, the Ugly, and the Greats)