
Ref: Google
Proxy Pattern from Wikipedia

Class Diagram from Wikipedia

If you have access to Linkedin Courses, then the following Videos will be useful
Jan 02

Ref: Google
Proxy Pattern from Wikipedia

Class Diagram from Wikipedia

If you have access to Linkedin Courses, then the following Videos will be useful
Jan 01
Interesting Data Visualizations:
Building Interactive Dashboards with Jupyter
https://blog.dominodatalab.com/interactive-dashboards-in-jupyter
http://introtopython.org/visualization_earthquakes.html
http://wrobstory.github.io/2013/10/mapping-data-python.html
Maps in R: Plotting data points on a map
http://www.milanor.net/blog/maps-in-r-plotting-data-points-on-a-map/
https://stackoverflow.com/questions/42299352/installing-basemap-on-mac-python
https://stackoverflow.com/questions/21956683/enable-access-control-on-simple-http-server
Vincent Repo:
Jan 01
Dec 29





Ref and Read: https://medium.com/@gridflowai/multi-armed-bandits-an-overview-on-classical-rl-algorithms-9a1e047cd98e

https://www.kdnuggets.com/2023/01/introduction-multiarmed-bandit-problems.html

Epsilon Greedy:

https://medium.com/opex-analytics/multi-armed-bandits-101-6f4ac62b6bd6


https://cxl.com/blog/bandit-tests

https://www.geeksforgeeks.org/epsilon-greedy-algorithm-in-reinforcement-learning

Dec 27
Ref: https://www.cs.toronto.edu/~jlucas/teaching/csc411/lectures/lec21_22_handout.pdf

Formulate:

Read more from: https://www.cs.toronto.edu/~jlucas/teaching/csc411/lectures/lec21_22_handout.pdf
What is a Policy (Deterministic Policy, Stochastic Policy)
What is a Value Function
What is a Model? What is Model Free. Markov Property for Model
MDP Problems
Exploration and Exploitation
Bellman Equations
Q-Learning
Function Approximation for Large State Spaces
Dec 27
What is: Reinforcement Learning (Interactive Learning in Decision Processes)?
— Is there a way to learn by interacting
— i.e. interact have experience and use the experience to learn (predict the future)
— Interact to explore and utilize what makes learning (goal/outcome) enhanced
— The computation approach of this method is Reinforcement Learning (Interactive Learning in Decision Processes)?
— it is a goal oriented learning from interactions
— it has it’s root in Markov decision process (MDP)
— Markov decision process (MDP) is a model for sequential decision making when outcomes are uncertain.[1] [Wikipedia]

Reinforcement Learning (Interactive Learning in Decision Processes) Involves:
Markov decision processes
Dynamic Programming
Monte Carlo methods
Temporal-difference learning
Function approximation methods


Monte Carlo methods
Solves problems with repeated random sampling.

Temporal-difference learning: combination of the Monte Carlo (MC) method and the Dynamic Programming (DP) method.


Function approximation methods

Dec 25
From Google (Internet)
https://www.oracle.com/ca-en/database/sqldeveloper

pen-source tools for your Oracle Database setup?
https://www.simplyblock.io/blog/best-open-source-tools-for-oracle-database
