Open to Work

Amir Hossein Kargaran

kargaranamir

https://kargaranamir.github.io

AI & ML interests

#NLP, check out https://huggingface.co/cis-lmu

Recent Activity

liked a dataset 6 days ago

openlanguagedata/flores_plus

upvoted a collection 6 days ago

OLDI and friends

This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task. • 4 items • Updated Oct 6 • 4

upvoted an article 12 days ago

Continuous batching from first principles

Article • 13 days ago • 243

upvoted a paper 13 days ago

Insights from the ICLR Peer Review and Rebuttal Process

Paper • 2511.15462 • Published 18 days ago • 6

upvoted a collection about 1 month ago

mmBERT: a modern multilingual encoder

mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9 • 48

upvoted a paper about 2 months ago

CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs

Paper • 2510.09871 • Published Oct 10 • 2

upvoted a paper 3 months ago

Multi-Turn Puzzles: Evaluating Interactive Reasoning and Strategic Dialogue in LLMs

Paper • 2508.10142 • Published Aug 13 • 3

upvoted a changelog 4 months ago

Connect Your MCP Client to the Hugging Face Hub

Changelog • Jun 6 • 111

upvoted a collection 4 months ago

llm-urls-neurips

57 items • Updated May 15 • 2

upvoted an article 5 months ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Article • Jul 9 • 722

upvoted a collection 5 months ago

🥂 FineWeb2

3 items • Updated Jun 27 • 21

upvoted a paper 5 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 75

upvoted 2 articles 6 months ago

Transformers backend integration in SGLang

Article • Jun 23 • 54

Tiny Agents: an MCP-powered agent in 50 lines of code

Article • Apr 25 • 303

upvoted 2 papers 6 months ago

How Programming Concepts and Neurons Are Shared in Code Language Models

Paper • 2506.01074 • Published Jun 1 • 3

Tracing Multilingual Factual Knowledge Acquisition in Pretraining

Paper • 2505.14824 • Published May 20 • 4

upvoted a paper 7 months ago

Multilingual k-Nearest-Neighbor Machine Translation

Paper • 2310.14644 • Published Oct 23, 2023 • 2

upvoted a collection 8 months ago

Qwen2.5

Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 46 items • Updated Jul 21 • 666

upvoted 2 papers 8 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 249

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 299

upvoted a collection 8 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 667