|
|
səhifə | 62/116 | tarix | 04.01.2022 | ölçüsü | 10,52 Mb. | | #58520 |
| 4.4.5Reward
The reward in the hypertension SDDP model is the total net benefit, which is the health benefit expressed in the monetary units, minus the total costs associated with sequential treatment. Each drug selected for s at time t leads to an expected cost or treatment benefit, which is estimated in terms of QALYs. The costs and QALYs assigned to each transition, r(st+1|st,at,) can be interpreted as the one-step (or immediate) reward. The global reward for the given policy, π=(d1,d2,d3,d4), is the weighted sum of the one-step rewards with the probabilistic structure of the underlying evaluation model; this includes the costs and QALYs, calculated both in the short-term drug switching model and the long-term CVD model.
Dostları ilə paylaş: |
|
|