Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
ETHEM ALPAYDIN The MIT Press, 2010
 

Summary: ETHEM ALPAYDIN
© The MIT Press, 2010
alpaydin@boun.edu.tr
http://www.cmpe.boun.edu.tr/~ethem/i2ml2e
Lecture Slides for
Introduction
Game-playing: Sequence of moves to win a game
Robot in a maze: Sequence of actions to find a goal
Agent has a state in an environment, takes an action and
sometimes receives reward and the state changes
Credit-assignment
Learn a policy
3Lecture Notes for E Alpaydin 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)
Single State: K-armed Bandit
aQaraQaQ tttt 11
4
Among K levers, choose
the one that pays best
Q(a): value of action a
Reward is ra

  

Source: Alpaydın, Ethem - Department of Computer Engineering, Bogaziçi University

 

Collections: Computer Technologies and Information Sciences