Foundations of Non-stationary Dynamic Programming with Discrete Time Parameter
General Material Designation
[Book]
First Statement of Responsibility
by K. Hinderer.
.PUBLICATION, DISTRIBUTION, ETC
Place of Publication, Distribution, etc.
Berlin, Heidelberg
Name of Publisher, Distributor, etc.
Springer Berlin Heidelberg
Date of Publication, Distribution, etc.
1970
SERIES
Series Title
Lecture Notes in Operations Research and Mathematical Systems, Economics, Computer Science, Information and Control, 33.
CONTENTS NOTE
Text of Note
1. Introduction and summary --; I. Countable state space --; 2. Decision models and definition of the problem --; 3. The principle of optimality and the optimality equation --; 4. Value iteration --; 5. Criteria of optimality and existence of
SUMMARY OR ABSTRACT
Text of Note
The present work is an extended version of a manuscript of a course which the author taught at the University of Hamburg during summer 1969. The main purpose has been to give a rigorous foundation of stochastic dynamic programming in a manner which makes the theory easily applicable to many different practical problems. We mention the following features which should serve our purpose. a) The theory is built up for non-stationary models, thus making it possible to treat e.g. dynamic programming under risk, dynamic programming under uncertainty, Markovian models, stationary models, and models with finite horizon from a unified point of view. b) We use that notion of optimality (p-optimality) which seems to be most appropriate for practical purposes. c) Since we restrict ourselves to the foundations, we did not include practical problems and ways to their numerical solution, but we give (cf.section 8) a number of problems which show the diversity of structures accessible to non stationary dynamic programming. The main sources were the papers of Blackwell (65), Strauch (66) and Maitra (68) on stationary models with general state and action spaces and the papers of Dynkin (65), Hinderer (67) and Sirjaev (67) on non-stationary models. A number of results should be new, whereas most theorems constitute extensions (usually from stationary models to non-stationary models) or analogues to known results.