LLMs

AcuRank: Uncertainty-Aware Adaptive Computation for Listwise Reranking

Listwise reranking with uncertainty-aware adaptive computation via Bayesian modeling of documents' relevance with TrueSkill

ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts

A bilingual benchmark for abstraction, comprehension, and reasoning evaluation in academic contexts

MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG

Multi-scale adaptive context RAG for long-context large language models