Kód: 06821373

Combined Use of Reinforcement Learning and Simulated Annealing

Name: Combined Use of Reinforcement Learning and Simulated Annealing
Brand: VDM Verlag Dr. Müller
SKU: 06821373
Price: 1490.00 CZK
Availability: BackOrder

Autor Peter Stefan

Jazyk: Angličtina
Vazba: Brožovaná
Počet stran: 128
Nakladatelství: VDM Verlag Dr. Müller, 2009
Více informací o knize

1490 Kč

U nakladatele na objednávku
Odesíláme za 3-5 dnů

Přidat mezi přání

Mohlo by se vám také líbit

Religion, Spirituality and Everyday Practice
3313 Kč
Koupit
After Taste: Cultural Value and the Moving Image
4990 Kč
Koupit
Barbed Wire
5172 Kč
Koupit
Abstracts of the Deaths and Marriages in the Hightstown Gazette, 5 January 1882-31 December 1885
677 Kč
Koupit

Darujte tuto knihu ještě dnes

Objednejte knihu a zvolte Zaslat jako dárek.
Obratem obdržíte darovací poukaz na knihu, který můžete ihned předat obdarovanému.
Knihu zašleme na adresu obdarovaného, o nic se nestaráte.

Více informací

Více informací o knize Combined Use of Reinforcement Learning and Simulated Annealing

Parametry knihy
Anotace
Oblíbené z jiného soudku

Nákupem získáte 149 bodů

Anotace knihy

In the dissertation combined reinforcement learning§(RL) and §simulated annealing (SA) concepts, problems, proposed§solutions, §algorithms and application examples are shown.§RL models a decision maker as a goal-driven agent§aiming to reach §goal states in the problem representation state§space. The agent §takes different choices among the numerous§possibilities, but each §choice can make different impact in the environment.§Each decision §has some effect being expressed in the form of§numeric honor or §dishonor, in a reward value. The agent utilizes the§feedback to §recognize which actions are honored and which are§not. The agent §then tries to govern its decision sequence into the§direction that §maximizes the environment s satisfaction .§The concept of SA is based on the analogy of how§liquids freeze. §There an initially high temperature and disordered§melt is slowly §cooled down and reaches thermal equilibrium.§While in annealing the temperature parameter bounds are §straightforward, in SA they might be dependent on the§problem and §its numeric representation.§This dissertation gives a method which can be used§for defining §temperature bounds in RL environment.