Large Language Models Do NOT Really Know What They Don't Know Paper • 2510.09033 • Published Oct 10 • 16
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13 • 100
Medical Reasoning in the Era of LLMs: A Systematic Review of Enhancement Techniques and Applications Paper • 2508.00669 • Published Aug 1
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation Paper • 2405.06948 • Published May 11, 2024
Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion Paper • 2501.16679 • Published Jan 28
EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis Paper • 2505.23601 • Published May 29
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper • 2507.22607 • Published Jul 30 • 46
SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning Paper • 2506.21355 • Published Jun 26 • 10
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1 • 240
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper • 2504.13816 • Published Apr 18 • 18
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published Jun 8 • 114
VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning Paper • 2410.22995 • Published Oct 30, 2024 • 2
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2 • 64
NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization Paper • 2502.14638 • Published Feb 20 • 11
A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation Paper • 2502.00314 • Published Feb 1 • 3
UNet++: A Nested U-Net Architecture for Medical Image Segmentation Paper • 1807.10165 • Published Jul 18, 2018
SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection Paper • 2111.13495 • Published Nov 26, 2021
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification Paper • 2210.12843 • Published Oct 23, 2022
Making Your First Choice: To Address Cold Start Problem in Vision Active Learning Paper • 2210.02442 • Published Oct 5, 2022 • 1