Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models Paper • 2310.16570 • Published Oct 25, 2023
Less Finetuning, Better Retrieval: Rethinking LLM Adaptation for Biomedical Retrievers via Synthetic Data and Model Merging Paper • 2602.04731 • Published Feb 4
Automatic Fine-grained Segmentation-assisted Report Generation Paper • 2507.16623 • Published Jul 22, 2025
Towards Conditioning Clinical Text Generation for User Control Paper • 2502.17571 • Published Feb 24, 2025
MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters Paper • 2502.03298 • Published Feb 5, 2025
A Second Look on BASS -- Boosting Abstractive Summarization with Unified Semantic Graphs -- A Replication Study Paper • 2403.02930 • Published Mar 25, 2024
Investigating the Impact of Randomness on Reproducibility in Computer Vision: A Study on Applications in Civil Engineering and Medicine Paper • 2410.02806 • Published Sep 19, 2024
Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why Paper • 2606.19602 • Published 6 days ago • 5
Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why Paper • 2606.19602 • Published 6 days ago • 5
Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why Paper • 2606.19602 • Published 6 days ago • 5
CLUE: A Clinical Language Understanding Evaluation for LLMs Paper • 2404.04067 • Published Apr 5, 2024
WideSearch: Benchmarking Agentic Broad Info-Seeking Paper • 2508.07999 • Published Aug 11, 2025 • 113
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts Paper • 2508.09848 • Published Aug 13, 2025 • 71
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds Paper • 2508.12782 • Published Aug 18, 2025 • 25
MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers Paper • 2508.14704 • Published Aug 20, 2025 • 43
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction Paper • 2508.11987 • Published Aug 16, 2025 • 73
ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16, 2025 • 1.72k • 88