Shannon Sands's picture

20 381

Shannon Sands

ssands1979

·

AI & ML interests

None yet

Recent Activity

updated a model 11 days ago

NousResearch/Hermes-4.3-36B-GGUF

liked a model 13 days ago

PleIAs/Monad

liked a dataset 13 days ago

xieyuquan/google_apps_step3000_historyimageFalse_uitars_actionspace

View all activity

Organizations

upvoted a collection 23 days ago

H-Net

The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11 • 20

upvoted a collection about 1 month ago

Pre-training Dataset

7 items • Updated Jun 19 • 4

upvoted a collection 2 months ago

Encoders vs Decoders: the Ettin Suite

A collection of SOTA, open-data, paired encoder-only and decoder only models ranging from 17M params to 1B. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16 • 25

upvoted a collection 4 months ago

T5Gemma

32 items • Updated Jul 10 • 77

upvoted an article 4 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Aug 11

•

75

upvoted a paper 6 months ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published Jun 17 • 10

upvoted an article 7 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

upvoted an article 8 months ago

Article

Cohere on Hugging Face Inference Providers 🔥

+5

Apr 16

•

129

upvoted a collection 8 months ago

Delta_CLIP

3 items • Updated Mar 17 • 2

upvoted a paper 8 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 299

upvoted a collection 8 months ago

RLVR

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated Mar 31 • 13

upvoted a collection 9 months ago

🏆 IOI

Resources related to International Olympiad in Informatics (IOI) problems • 5 items • Updated May 13 • 7

upvoted a collection 11 months ago

DeepSeek-R1

10 items • Updated 10 days ago • 820

upvoted an article over 1 year ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

303

upvoted 2 papers over 1 year ago

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

Paper • 2404.07544 • Published Apr 11, 2024 • 20

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 66

upvoted 2 papers almost 2 years ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 626

User-LLM: Efficient LLM Contextualization with User Embeddings

Paper • 2402.13598 • Published Feb 21, 2024 • 20

upvoted 2 papers over 2 years ago

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

Paper • 2306.00937 • Published Jun 1, 2023 • 9

Copy Is All You Need

Paper • 2307.06962 • Published Jul 13, 2023 • 35