Luke Stanley

lukestanley

AI & ML interests

None yet

Recent Activity

liked a Space 6 days ago

burtenshaw/karpathy-llm-council

liked a model 12 days ago

unsloth/Olmo-3-32B-Think-unsloth-bnb-4bit

liked a dataset 14 days ago

allenai/dolma3_mix-6T-1025

Organizations

None yet

upvoted 4 papers 4 months ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20 • 43

Emergence of Episodic Memory in Transformers: Characterizing Changes in Temporal Structure of Attention Scores During Training

Paper • 2502.06902 • Published Feb 9 • 1

Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis

Paper • 2505.11581 • Published May 16 • 3

Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs

Paper • 2508.06601 • Published Aug 8 • 6

upvoted a paper 6 months ago

Ming-Omni: A Unified Multimodal Model for Perception and Generation

Paper • 2506.09344 • Published Jun 11 • 28

upvoted a paper 11 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 53

upvoted a collection 12 months ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 4 days ago • 24

upvoted a paper over 1 year ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 259