Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
7
5
2
Junxiong Wang
PRO
JunxiongWang
Follow
blurLake's profile picture
dark-pen's profile picture
emircanerol's profile picture
16 followers
·
3 following
https://www.cs.cornell.edu/~junxiong/
jxiw
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
upvoted
an
article
16 days ago
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
updated
a model
3 months ago
JunxiongWang/M1-3B
updated
a model
4 months ago
togethercomputer/M1-3B
View all activity
Organizations
JunxiongWang
's models
51
Sort: Recently updated
JunxiongWang/M1-3B
Text Generation
•
3B
•
Updated
Sep 2
•
34
•
2
JunxiongWang/M1-3B-SFT
Text Generation
•
3B
•
Updated
Apr 16
•
921
•
1
JunxiongWang/MambaInLlama1B_SFT_MATH
1B
•
Updated
Feb 11
•
6
JunxiongWang/MambaInLlama3B_SFT_MATH
3B
•
Updated
Feb 7
•
22
JunxiongWang/MambaInLlama3B_DPO2
3B
•
Updated
Feb 5
•
2
JunxiongWang/MambaInLlama3B_DPO1
3B
•
Updated
Feb 5
•
4
JunxiongWang/MambaInLlama3B_Distill_MATH
3B
•
Updated
Jan 27
•
8
JunxiongWang/MambaInLlama3B_v3
3B
•
Updated
Jan 25
•
6
JunxiongWang/MambaInLlama1B_Distill_MATH
1B
•
Updated
Jan 23
•
4
JunxiongWang/mamba_0_5_distill
Updated
Dec 25, 2024
•
4
JunxiongWang/Llama3.2-Mamba-3B-dpo
Updated
Nov 17, 2024
•
4
JunxiongWang/Llama3.2-Mamba-3B-distill
Updated
Nov 17, 2024
•
6
JunxiongWang/Llama3.2-Mamba2-3B-distill
Updated
Nov 17, 2024
•
10
JunxiongWang/Llama3.2-Mamba2-3B-dpo
Updated
Nov 17, 2024
•
10
JunxiongWang/Llama3.1-Mamba2-8B-dpo
Updated
Nov 17, 2024
•
4
JunxiongWang/Llama3.1-Mamba-8B-dpo
Updated
Nov 17, 2024
•
4
JunxiongWang/Llama3.1-Mamba2-8B-distill
Updated
Nov 17, 2024
•
5
JunxiongWang/Llama3.1-Mamba-8B-distill
Updated
Nov 17, 2024
•
6
JunxiongWang/MambaByte_Stories
Text Generation
•
Updated
Sep 9, 2024
•
14
•
1
JunxiongWang/MambaByte_Arxiv
Text Generation
•
Updated
Sep 9, 2024
•
12
•
3
JunxiongWang/MambaByte_PG19_353M
Text Generation
•
Updated
Sep 9, 2024
•
7
JunxiongWang/MambaByte_Books
Text Generation
•
Updated
Sep 9, 2024
•
10
•
2
JunxiongWang/MambaByte_Code
Text Generation
•
Updated
Sep 9, 2024
•
13
•
2
JunxiongWang/MambaByte_PG19_972M
Text Generation
•
Updated
Sep 9, 2024
•
14
JunxiongWang/Mamba2InLlama_1
Updated
Sep 2, 2024
•
3
•
1
JunxiongWang/Mamba2InLlama_0_50
Updated
Sep 2, 2024
•
26
JunxiongWang/Mamba2InLlama_0_75
Updated
Sep 2, 2024
•
6
JunxiongWang/MambaInLlama_0_50
Updated
Sep 2, 2024
•
6
JunxiongWang/MambaInLlama_0_75
Updated
Sep 2, 2024
•
3
JunxiongWang/MambaInLlama_0_875
Updated
Sep 2, 2024
•
7
Previous
1
2
Next