imflash217/proximal_policy_optimization_lunar_lander_v2 Reinforcement Learning • Updated Jan 13, 2023 • 11 • 1
mradermacher/CscSQL-Merge-Qwen2.5-Coder-1.5B-Instruct-GGUF Reinforcement Learning • 2B • Updated Jul 31 • 101 • 1
emiliodavola/french-solitaire-dqn-single-solution Reinforcement Learning • Updated 1 day ago • 56 • 1
edbeeching/decision-transformer-gym-halfcheetah-expert Reinforcement Learning • Updated Jun 29, 2022 • 2 • 1