mkurman's Collections
Medical Pre-Training Datasets
Updated Aug 23, 2025
A collection of medical datasets suitable for LLM pretraining.
Upvotes: 1
openmed-community/TheBlueScrubs-v1-fixed • Updated Aug 29, 2025 • 11.1M • 438 • 13
mkurman/hindawi-journals-2007-2023 • Updated Jun 9, 2025 • 298k • 640 • 5
epfl-llm/guidelines • Updated Mar 7, 2024 • 38k • 670 • 146
ncbi/Open-Patients • Updated May 11, 2025 • 180k • 527 • 27
AGBonnet/augmented-clinical-notes • Updated Jan 24, 2024 • 30k • 1.09k • 66
harishnair04/mtsamples • Updated Nov 7, 2024 • 5k • 138 • 1
Tonic/Health-Bench-Eval-OSS-2025-07 • Updated May 17, 2025 • 9.67k • 367 • 3
zeroshot/arxiv-biology • Updated Jan 5, 2023 • 1.28k • 43 • 14