Listwise reranking with uncertainty-aware adaptive computation via Bayesian modeling of documents' relevance with TrueSkill
A bilingual benchmark for abstraction, comprehension, and reasoning evaluation in academic contexts - ___[Findings of EMNLP 2025](https://2025.emnlp.org/)___
Multi-scale adaptive context RAG for long-context large language models