Yiming Wang's picture

1 6

Yiming Wang

Rubywong123

·

https://rubywong123.github.io/

AI & ML interests

Code generation, LLM

Organizations

Rubywong123 's models 62

Rubywong123/web_rag_final_0_instr_similar_cus_1

8B • Updated Jul 30, 2025 • 2

Rubywong123/web_rag_final_0_instr_support_cus_1

8B • Updated Jul 30, 2025 • 6

Rubywong123/Qwen_android_rag_0_5_bs_48_LR_1e-5_epoch_1

8B • Updated Jul 27, 2025 • 6

Rubywong123/Qwen_android_rag_0_25_bs_96_LR_1e-5_epoch_1

8B • Updated Jul 27, 2025 • 4

Rubywong123/Qwen_android_rag_0_25_bs_48_LR_1e-5_epoch_1

8B • Updated Jul 23, 2025 • 6

Rubywong123/web_rag_final_first-split_customized

8B • Updated Jul 23, 2025 • 5

Rubywong123/web_rag_final_first-split

8B • Updated Jul 22, 2025 • 5

Rubywong123/web_rag_custom_scale_0_1_0_4_48_LR_1e-5

8B • Updated Jul 20, 2025 • 5

Rubywong123/web_rag_custom_scale_0_1_0_2_48_LR_1e-5

8B • Updated Jul 20, 2025 • 5

Rubywong123/web_os_genesis_0_2_48_LR_1e-5

8B • Updated Jul 19, 2025 • 5

Rubywong123/Qwen_android_rag_48_LR_1e-5_epoch_1

8B • Updated Jul 19, 2025 • 7

Rubywong123/web_rag_custom_scale_48_LR_1e-5

8B • Updated Jul 17, 2025 • 4

Rubywong123/android_rag_48_LR_1e-5_epoch_1

8B • Updated Jul 14, 2025 • 6

Rubywong123/web_rag_0_1_48_LR_1e-5

8B • Updated Jul 9, 2025 • 3

Rubywong123/web_rag_0_8_48_LR_1e-5

8B • Updated Jul 7, 2025 • 4

Rubywong123/web_rag_ablation2_48_LR_1e-5

8B • Updated May 16, 2025 • 3

Rubywong123/web_rag_ablation1_48_LR_1e-5

8B • Updated May 15, 2025 • 4

Rubywong123/Qwen_android_os_genesis_48_LR_1e-5_epoch_1

8B • Updated May 13, 2025 • 4

Rubywong123/Qwen_android_sim_48_LR_1e-5_epoch_1

8B • Updated May 4, 2025 • 4

Rubywong123/android_sim_48_LR_1e-5_epoch_1

8B • Updated Apr 28, 2025 • 3

Rubywong123/web_rag_free_48_LR_1e-5

8B • Updated Apr 12, 2025 • 3

Rubywong123/web_rag_all_48_LR_1e-5

8B • Updated Apr 10, 2025 • 7

Rubywong123/web_general_LR_1e-5_bs_48_epoch_2

8B • Updated Apr 4, 2025 • 4

Rubywong123/AgentGrow-shopping

8B • Updated Nov 10, 2024 • 10 • 1

Rubywong123/web_sim_traj

Updated Nov 7, 2024

Rubywong123/Reinforce-Pixelcopter-PLE-v0

Reinforcement Learning • Updated Mar 5, 2023

Rubywong123/Reinforce-CartPole-v1

Reinforcement Learning • Updated Mar 5, 2023

Rubywong123/dqn-SpaceInvadersNoFrameskip

Reinforcement Learning • Updated Feb 15, 2023 • 2

Rubywong123/q-Taxi-v3

Reinforcement Learning • Updated Feb 11, 2023

Rubywong123/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Feb 11, 2023