Reinforcement learning

June 30, 2021

Here is a short summary of a project I recently completed during my study of reinforcement learning algorithms.

I believe that if you really want to understand an algorithm, you should implement it yourself.

That is why I implemented several algorithms from scratch and verified that they work as intended:

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)
Deep Q-Network (DQN), with options: Dueling DQN, …
Asynchronous Advantage Actor-Critic (A3C)
Advantage Actor-Critic (A2C), the synchronous version of A3C, which still uses multiple worker threads
Proximal Policy Optimization (PPO)

Of these algorithms, PPO produced the best results.
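For readers who have not met PPO before, its central idea is the clipped surrogate objective, which limits how far one update can move the policy away from the one that collected the data. The snippet below is only a minimal sketch of that objective in plain Python, not code from the project: the function name `ppo_clipped_objective`, its argument names, and the default `clip_eps=0.2` are assumptions chosen for illustration.

```python
import math


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch (a quantity to maximize).

    All names and the default clip_eps are illustrative assumptions.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated policy and the data-collecting policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # A single sample whose action became twice as likely, with positive advantage:
    # the ratio of 2.0 is clipped to 1.2, so the objective is about 1.2.
    print(ppo_clipped_objective([math.log(2.0)], [0.0], [1.0]))
```

In a real training loop this value would typically be negated and minimized with a gradient-based optimizer, usually alongside a value-function loss and an entropy bonus.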

For more details:

View Project on GitHub


