http://www.cnr.it/ontology/cnr/individuo/prodotto/ID242175
Dynamic programming and value-function approximation in sequential decision problems: error analysis and numerical results (Articolo in rivista)
- Type
- Label
- Dynamic programming and value-function approximation in sequential decision problems: error analysis and numerical results (Articolo in rivista) (literal)
- Anno
- 2013-01-01T00:00:00+01:00 (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#doi
- 10.1007/s10957-012-0118-2 (literal)
- Alternative label
M. Gaggero; G. Gnecco; M. Sanguineti (2013)
Dynamic programming and value-function approximation in sequential decision problems: error analysis and numerical results
in Journal of optimization theory and applications; SPRINGER/PLENUM PUBLISHERS, NEW YORK, NY 10013 USA (Stati Uniti d'America)
(literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#autori
- M. Gaggero; G. Gnecco; M. Sanguineti (literal)
- Pagina inizio
- Pagina fine
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#altreInformazioni
- Journal Q1 in Control and Optimization (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroVolume
- Rivista
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#numeroFascicolo
- Note
- ISI Web of Science (WOS) (literal)
- Scopu (literal)
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#affiliazioni
- 1. Institute of Intelligent Systems for Automation, National Research Council of Italy, Genova, Italy
2. DIBRIS, University of Genova, Genova, Italy
3. DIBRIS, University of Genova, Genova, Italy (literal)
- Titolo
- Dynamic programming and value-function approximation in sequential decision problems: error analysis and numerical results (literal)
- Abstract
- Value-function approximation is investigated for the solution via Dynamic Programming (DP) of continuous-state sequential N-stage decision problems, in which the reward to be maximized has an additive structure over a finite number of stages. Conditions that guarantee smoothness properties of the value function at each stage are derived. These properties are exploited to approximate such functions by means of certain nonlinear approximation schemes, which include splines of suitable order and Gaussian radial-basis networks with variable centers and widths. The accuracies of suboptimal solutions obtained by combining DP with these approximation tools are estimated. The results provide insights into the successful performances appeared in the literature about the use of value-function approximators in DP. The theoretical analysis is applied to a problem of optimal consumption, with simulation results illustrating the use of the proposed solution methodology. Numerical comparisons with classical linear approximators are presented. (literal)
- Editore
- Prodotto di
- Autore CNR
Incoming links:
- Prodotto
- Autore CNR di
- Http://www.cnr.it/ontology/cnr/pubblicazioni.owl#rivistaDi
- Editore di