·
AI & ML interests
Code generation, LLM
Organizations
Rubywong123/web_rag_final_0_instr_similar_cus_1
8B
•
Updated
•
2
Rubywong123/web_rag_final_0_instr_support_cus_1
8B
•
Updated
•
6
Rubywong123/Qwen_android_rag_0_5_bs_48_LR_1e-5_epoch_1
8B
•
Updated
•
6
Rubywong123/Qwen_android_rag_0_25_bs_96_LR_1e-5_epoch_1
8B
•
Updated
•
4
Rubywong123/Qwen_android_rag_0_25_bs_48_LR_1e-5_epoch_1
8B
•
Updated
•
6
Rubywong123/web_rag_final_first-split_customized
8B
•
Updated
•
5
Rubywong123/web_rag_final_first-split
8B
•
Updated
•
5
Rubywong123/web_rag_custom_scale_0_1_0_4_48_LR_1e-5
8B
•
Updated
•
5
Rubywong123/web_rag_custom_scale_0_1_0_2_48_LR_1e-5
8B
•
Updated
•
5
Rubywong123/web_os_genesis_0_2_48_LR_1e-5
8B
•
Updated
•
5
Rubywong123/Qwen_android_rag_48_LR_1e-5_epoch_1
8B
•
Updated
•
7
Rubywong123/web_rag_custom_scale_48_LR_1e-5
8B
•
Updated
•
4
Rubywong123/android_rag_48_LR_1e-5_epoch_1
8B
•
Updated
•
6
Rubywong123/web_rag_0_1_48_LR_1e-5
8B
•
Updated
•
3
Rubywong123/web_rag_0_8_48_LR_1e-5
8B
•
Updated
•
4
Rubywong123/web_rag_ablation2_48_LR_1e-5
8B
•
Updated
•
3
Rubywong123/web_rag_ablation1_48_LR_1e-5
8B
•
Updated
•
4
Rubywong123/Qwen_android_os_genesis_48_LR_1e-5_epoch_1
8B
•
Updated
•
4
Rubywong123/Qwen_android_sim_48_LR_1e-5_epoch_1
8B
•
Updated
•
4
Rubywong123/android_sim_48_LR_1e-5_epoch_1
8B
•
Updated
•
3
Rubywong123/web_rag_free_48_LR_1e-5
8B
•
Updated
•
3
Rubywong123/web_rag_all_48_LR_1e-5
8B
•
Updated
•
7
Rubywong123/web_general_LR_1e-5_bs_48_epoch_2
8B
•
Updated
•
4
Rubywong123/AgentGrow-shopping
8B
•
Updated
•
10
•
1
Rubywong123/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Rubywong123/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Rubywong123/dqn-SpaceInvadersNoFrameskip
Reinforcement Learning
•
Updated
•
2
Reinforcement Learning
•
Updated
Rubywong123/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated