No Result

View All Result

No Result

View All Result

No Result

View All Result

Reinforcement Learning

super-large-language-model

by Tech Trends Watcher

4 July 2024

Reinforcement Learning

Humanoid-v4-ppo_continuous_action-seed1

2 July 2024

Reinforcement Learning

ppo-Huggy

1 July 2024

Reinforcement Learning

ppo-LunarLander-v2

27 June 2024

Home Category Reinforcement Learning

Reinforcement Learning

ppo-LunarLander-v2

by Tech Trends Watcher

7 June 2024

Edit model card PPO Agent playing LunarLander-v2 Usage (with Stable-baselines3) PPO Agent playing LunarLander-v2 This is a trained model of...

Reinforcement Learning

ppo-LunarLander-v2

by Tech Trends Watcher

7 June 2024

Edit model card PPO Agent playing LunarLander-v2 Usage (with Stable-baselines3) PPO Agent playing LunarLander-v2 This is a trained model of...

Reinforcement Learning

tqc-PandaPickAndPlace-v1

by Tech Trends Watcher

5 June 2024

Edit model card TQC Agent playing PandaPickAndPlace-v1 Usage (with SB3 RL Zoo) Training (with the RL Zoo) Hyperparameters TQC Agent...

Reinforcement Learning

H3lt3r-Sk3lt3r/ppo-Huggy1 Reinforcement Learning • Updated about 10 hours ago • 1

by Tech Trends Watcher

4 June 2024

H3lt3r-Sk3lt3r/ppo-Huggy1 Reinforcement Learning • Updated about 10 hours ago • 1 Source link

Reinforcement Learning

RL_LunarLander_PPO

by Tech Trends Watcher

4 June 2024

Edit model card PPO_MLP Agent playing LunarLander-v2 Usage (with Stable-baselines3) PPO_MLP Agent playing LunarLander-v2 This is a trained model of...

Reinforcement Learning

ppo-Huggy

by Tech Trends Watcher

4 June 2024

Edit model card ppo Agent playing Huggy Usage (with ML-Agents) Resume the training Watch your Agent play ppo Agent playing...

Reinforcement Learning

ppo-Huggy

by Tech Trends Watcher

1 June 2024

Edit model card ppo Agent playing Huggy Usage (with ML-Agents) Resume the training Watch your Agent play ppo Agent playing...

Reinforcement Learning

ppo-PandaReachDense-v3

by Tech Trends Watcher

1 June 2024

Edit model card PPO Agent playing PandaReachDense-v3 Usage (with Stable-baselines3) PPO Agent playing PandaReachDense-v3 This is a trained model of...