The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models Paper • 2507.23313 • Published Jul 31, 2025 • 1
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering Paper • 2508.03448 • Published Aug 5, 2025 • 6
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor Paper • 2508.01311 • Published Aug 2, 2025 • 2
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model Paper • 2505.21179 • Published May 27, 2025 • 13
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt Paper • 2505.09264 • Published May 14, 2025 • 5
Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection Paper • 2504.14221 • Published Apr 19, 2025
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection Paper • 2505.09926 • Published May 15, 2025 • 6
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning Paper • 2505.09265 • Published May 14, 2025 • 5
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards Paper • 2508.04632 • Published Aug 6, 2025 • 2
Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks Paper • 2507.21974 • Published Jul 29, 2025 • 5
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Paper • 2508.01197 • Published Aug 2, 2025 • 5
Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management Paper • 2508.04664 • Published Aug 6, 2025 • 13
Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference Paper • 2508.04586 • Published Aug 6, 2025 • 12
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published Aug 4, 2025 • 12
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Paper • 2507.23785 • Published Jul 31, 2025 • 18
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought Paper • 2508.03560 • Published Aug 5, 2025 • 24
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents Paper • 2508.01858 • Published Aug 3, 2025 • 20
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 137
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published Aug 2, 2025 • 238
Attention Basin: Why Contextual Position Matters in Large Language Models Paper • 2508.05128 • Published Aug 7, 2025 • 4
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode Paper • 2508.04107 • Published Aug 6, 2025 • 4
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis Paper • 2508.04699 • Published Aug 6, 2025 • 2
RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation Paper • 2508.04190 • Published Aug 6, 2025 • 1
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations Paper • 2508.04939 • Published Aug 6, 2025 • 2
REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation Paper • 2508.04946 • Published Aug 7, 2025 • 1
I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal Entity Linking Paper • 2508.02243 • Published Aug 4, 2025 • 2
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression Paper • 2508.04979 • Published Aug 7, 2025 • 5
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Paper • 2508.01650 • Published Aug 3, 2025 • 6
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes Paper • 2508.05630 • Published Aug 7, 2025 • 9
Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability Paper • 2508.04017 • Published Aug 6, 2025 • 11
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? Paper • 2508.03644 • Published Aug 5, 2025 • 25
A Practical Guide to Fine-tuning Language Models with Limited Data Paper • 2411.09539 • Published Nov 14, 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks Paper • 2412.12499 • Published Dec 17, 2024 • 2
Development of Pre-Trained Transformer-based Models for the Nepali Language Paper • 2411.15734 • Published Nov 24, 2024
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation Paper • 2412.13375 • Published Dec 17, 2024
Facilitating large language model Russian adaptation with Learned Embedding Propagation Paper • 2412.21140 • Published Dec 30, 2024 • 18
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment Paper • 2411.16300 • Published Nov 25, 2024
Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers Paper • 2601.22139 • Published Jan 29
Mirroring the Mind: Distilling Human-Like Metacognitive Strategies into Large Language Models Paper • 2602.22508 • Published 17 days ago
Knowledge Integration Decay in Search-Augmented Reasoning of Large Language Models Paper • 2602.09517 • Published Feb 10 • 1
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 3 days ago • 49
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 3 days ago • 35
Language of Thought Shapes Output Diversity in Large Language Models Paper • 2601.11227 • Published Jan 16 • 9
What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance Paper • 2602.20300 • Published 19 days ago • 4
No One Size Fits All: QueryBandits for Hallucination Mitigation Paper • 2602.20332 • Published 19 days ago • 2
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents Paper • 2602.14234 • Published 28 days ago • 26
Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents Paper • 2602.22523 • Published 17 days ago • 1
Agentic Artificial Intelligence (AI): Architectures, Taxonomies, and Evaluation of Large Language Model Agents Paper • 2601.12560 • Published Jan 18
Shared Nature, Unique Nurture: PRISM for Pluralistic Reasoning via In-context Structure Modeling Paper • 2602.21317 • Published 18 days ago • 4
DIVERGE: Diversity-Enhanced RAG for Open-Ended Information Seeking Paper • 2602.00238 • Published Jan 30
CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models Paper • 2601.02236 • Published Jan 5
Autoregressive Models Rival Diffusion Models at ANY-ORDER Generation Paper • 2601.13228 • Published Jan 19
Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding? Paper • 2602.23225 • Published 17 days ago
Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training Paper • 2603.02208 • Published 12 days ago • 4
Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning Paper • 2601.15160 • Published Jan 21 • 1
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification Paper • 2601.22642 • Published Jan 30 • 9
Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward Paper • 2601.05073 • Published Jan 8
P2S: Probabilistic Process Supervision for General-Domain Reasoning Question Answering Paper • 2601.20649 • Published Jan 28
VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning Paper • 2601.20055 • Published Jan 27 • 7
Decompose-and-Formalise: Recursively Verifiable Natural Language Inference Paper • 2601.19605 • Published Jan 27
Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis Paper • 2602.03279 • Published Feb 3
LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval Paper • 2603.01425 • Published 13 days ago • 5
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization Paper • 2601.21358 • Published Jan 29 • 7
Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens Paper • 2602.10229 • Published Feb 10 • 5
Beyond Dense States: Elevating Sparse Transcoders to Active Operators for Latent Reasoning Paper • 2602.01695 • Published Feb 2
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 12 days ago • 138
PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference Paper • 2603.02479 • Published 12 days ago • 18
MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning Paper • 2603.03379 • Published 12 days ago • 28
Large Multimodal Models as General In-Context Classifiers Paper • 2602.23229 • Published 17 days ago • 22
nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space Paper • 2603.04948 • Published 10 days ago • 1
Mario: Multimodal Graph Reasoning with Large Language Models Paper • 2603.05181 • Published 10 days ago • 7
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 9 days ago • 26
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model Paper • 2603.05438 • Published 9 days ago • 35
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer Paper • 2603.03583 • Published 11 days ago • 2
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training Paper • 2603.07223 • Published 8 days ago • 13
Believe Your Model: Distribution-Guided Confidence Calibration Paper • 2603.03872 • Published 11 days ago • 37
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control Paper • 2603.09221 • Published 5 days ago
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning Paper • 2603.10377 • Published 4 days ago • 3
Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published 4 days ago • 6
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published Feb 2 • 32
SimpleGPT: Improving GPT via A Simple Normalization Strategy Paper • 2602.01212 • Published Feb 1 • 3
Prism-Δ: Differential Subspace Steering for Prompt Highlighting in Large Language Models Paper • 2603.10705 • Published 4 days ago • 10
YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation Paper • 2601.08441 • Published Jan 13 • 8
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Paper • 2603.10160 • Published 4 days ago • 20
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 4 days ago • 31
How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback Paper • 2501.15378 • Published Jan 26, 2025
Millions of GeAR-s: Extending GraphRAG to Millions of Documents Paper • 2507.17399 • Published Jul 23, 2025
RAG vs. GraphRAG: A Systematic Evaluation and Key Insights Paper • 2502.11371 • Published Feb 17, 2025
PROPEX-RAG: Enhanced GraphRAG using Prompt-Driven Prompt Execution Paper • 2511.01802 • Published Nov 3, 2025 • 1
HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG Paper • 2602.20926 • Published 19 days ago • 3
PolyG: Effective and Efficient GraphRAG with Adaptive Graph Traversal Paper • 2504.02112 • Published Apr 2, 2025 • 2
GraphRAG-R1: Graph Retrieval-Augmented Generation with Process-Constrained Reinforcement Learning Paper • 2507.23581 • Published Jul 31, 2025
Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration Paper • 2601.11144 • Published Jan 16 • 3
Enhancing Startup Success Predictions in Venture Capital: A GraphRAG Augmented Multivariate Time Series Method Paper • 2408.09420 • Published Aug 18, 2024
NerVE: Nonlinear Eigenspectrum Dynamics in LLM Feed-Forward Networks Paper • 2603.06922 • Published 8 days ago • 2
Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation Paper • 2512.23601 • Published Dec 29, 2025
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use Paper • 2603.11076 • Published 4 days ago • 4
Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks Paper • 2603.11487 • Published 3 days ago • 2
WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora Paper • 2602.02053 • Published Feb 2 • 41