Search Results for author: Anush Kini

Found 2 papers, 0 papers with code

Provably Robust DPO: Aligning Language Models with Noisy Feedback

no code implementations1 Mar 2024 Sayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan

Our experiments on IMDb sentiment generation and Anthropic's helpful-harmless dataset show that rDPO is robust to noise in preference labels compared to vanilla DPO and other heuristics proposed by practitioners.

GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval

no code implementations31 Oct 2023 Daman Arora, Anush Kini, Sayak Ray Chowdhury, Nagarajan Natarajan, Gaurav Sinha, Amit Sharma

Given a query and a document corpus, the information retrieval (IR) task is to output a ranked list of relevant documents.

Passage Retrieval Re-Ranking +1

Cannot find the paper you are looking for? You can Submit a new open access paper.