Ashish Soni
ashish-soni08
AI & ML interests
None yet
Recent Activity
liked a model about 1 month ago
zai-org/GLM-OCR updated a dataset about 1 month ago
ashish-soni08/customer-spending-clustering published a dataset about 1 month ago
ashish-soni08/customer-spending-clusteringOrganizations
How_LLMS_Think _and_Reason_Papers
-
Evolving Deeper LLM Thinking
Paper β’ 2501.09891 β’ Published β’ 115 -
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
Paper β’ 2502.08127 β’ Published β’ 59 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper β’ 2501.12948 β’ Published β’ 448
Embedding_Models
MoE_Models
Pre-Training-Data-for-LLMs
Open-Source Datasets that have been employed for pre-training Large Language Models
Reasoning Datasets
-
open-thoughts/OpenThoughts-114k
Viewer β’ Updated β’ 228k β’ 153k β’ 832 -
nvidia/OpenCodeReasoning
Viewer β’ Updated β’ 753k β’ 5.32k β’ 536 -
open-r1/codeforces-cots
Viewer β’ Updated β’ 254k β’ 4.98k β’ 214 -
FreedomIntelligence/medical-o1-reasoning-SFT
Viewer β’ Updated β’ 90.1k β’ 7.64k β’ 1.09k
Microsoft Models
Leaderboards
Meta AI
Models released by Meta
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation β’ 8B β’ Updated β’ 9.19M β’ β’ 5.75k -
meta-llama/Llama-3.1-405B
Text Generation β’ 406B β’ Updated β’ 178k β’ 970 -
meta-llama/Llama-3.1-405B-Instruct
Text Generation β’ 406B β’ Updated β’ 236k β’ 595 -
meta-llama/Llama-3.1-8B
Text Generation β’ 8B β’ Updated β’ 1.44M β’ β’ 2.17k
Privacy_Masking_for_LLMs
Function Calling
Reasoning Datasets
-
open-thoughts/OpenThoughts-114k
Viewer β’ Updated β’ 228k β’ 153k β’ 832 -
nvidia/OpenCodeReasoning
Viewer β’ Updated β’ 753k β’ 5.32k β’ 536 -
open-r1/codeforces-cots
Viewer β’ Updated β’ 254k β’ 4.98k β’ 214 -
FreedomIntelligence/medical-o1-reasoning-SFT
Viewer β’ Updated β’ 90.1k β’ 7.64k β’ 1.09k
How_LLMS_Think _and_Reason_Papers
-
Evolving Deeper LLM Thinking
Paper β’ 2501.09891 β’ Published β’ 115 -
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
Paper β’ 2502.08127 β’ Published β’ 59 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper β’ 2501.12948 β’ Published β’ 448
Microsoft Models
Embedding_Models
Leaderboards
MoE_Models
Meta AI
Models released by Meta
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation β’ 8B β’ Updated β’ 9.19M β’ β’ 5.75k -
meta-llama/Llama-3.1-405B
Text Generation β’ 406B β’ Updated β’ 178k β’ 970 -
meta-llama/Llama-3.1-405B-Instruct
Text Generation β’ 406B β’ Updated β’ 236k β’ 595 -
meta-llama/Llama-3.1-8B
Text Generation β’ 8B β’ Updated β’ 1.44M β’ β’ 2.17k
Pre-Training-Data-for-LLMs
Open-Source Datasets that have been employed for pre-training Large Language Models
Privacy_Masking_for_LLMs