Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
AI-Insight 's Collections
💡HF Papers Live 1: Reinforcement Learning
💡HF Papers Live 2: Code Bench
💡HF Papers Live 3: AI for Science
💡HF Papers Live 4: Multi Modal models
💡HF Papers Live 5: Omni-Modal models
💡HF Papers Live 6: OCR

💡HF Papers Live 6: OCR

updated Dec 3, 2025
Upvote
-

  • tencent/HunyuanOCR

    Image-Text-to-Text • 1.0B • Updated 7 days ago • 1.13M • 676

  • HunyuanOCR Technical Report

    Paper • 2511.19575 • Published Nov 24, 2025 • 22

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B • Updated Dec 11, 2025 • 12.9k • 1.5k

  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published Oct 16, 2025 • 111

  • Running on L40S
    521

    MinerU OCR

    📚
    521

    A data extraction tool to convert PDF to Markdown and JSON


  • MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

    Paper • 2509.22186 • Published Sep 26, 2025 • 139

  • opendatalab/MinerU2.5-2509-1.2B

    Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 1.03M • 314
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs