********************************* Welcome to CSCI 531 --- Fall 2023 ********************************* An introduction to reinforcement learning. Proficieny in python is required. This course will introduce the fundamentals of Reinforcement learning (RL) and Deep learning techniques. The course will cover the Tabular solution methods, such as the finite Markov Decision Processes and Temporal-Difference learning. It will also cover approximation solution methods, as on-policy and off-policy approximations. By the end of the course, new deep-learning techniques will be introduced. Professors ========== Dr. Jean-Alexis Delamer ----------------------- * jdelamer at stfx.ca * Annex 9C Lecture Section ^^^^^^^^^^^^^^^ * Tue: 12:30pm - 1:20pm (MULH4024) * Thu: 11:30am - 12:20pm (MULH4024) * Fri: 1:30pm - 2:20pm (MULH4024) Office Hours ^^^^^^^^^^^^ * Tue: 1:30pm - 3:30pm (Annex 9C) * Thu: 1:30pm - 2:30pm (Annex 9C) * Fri: 9:00am - 10:00am (Annex 9C) .. Getting Started .. =============== .. .. toctree:: .. :maxdepth: 1 .. getting-set/getting-set .. toctree:: :caption: Notes :maxdepth: 1 :numbered: topics/intro/intro topics/multi-armed-bandit/the-problem topics/multi-armed-bandit/more_about_multi-armed_bandit topics/markov-decison-process/markov_decision_process topics/markov-decison-process/mdp-learning topics/dynamic-programming/dynamic-programming topics/monte-carlo/monte-carlo-methods topics/Temporal-difference learning/td-learning topics/approximate-solutions/on-policy-prediction topics/eligibility-traces/eligibility-traces topics/policy-gradient/policy-gradient-methods .. toctree:: :caption: Labs :maxdepth: 1 :numbered: labs/multi-armed-bandit/multi-armed-bandit labs/multi-armed-bandit/more-about-multi-armed labs/mdp/mdp labs/DP/dp labs/MC/mc labs/TD-learning/td labs/on-policy/on-policy labs/eligibility-traces/eligibility-traces labs/actor-critic/actor-critic .. toctree:: :caption: Assignments :maxdepth: 1 :numbered: asn/mdp/mdp-asn asn/semi-gradient/semi-gradient-asn .. toctree:: :caption: Project :maxdepth: 1 project/project .. toctree:: :caption: Outline :maxdepth: 1 /outline/outline