A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions
2013
Biometrika
A dynamic treatment regime is a list of sequential decision rules for assigning treatment based on a patient's history. Q-and A-learning are two main approaches for estimating the optimal regime, i.e., that yielding the most beneficial outcome in the patient population, using data from a clinical trial or observational study. Q-learning requires postulated regression models for the outcome, while A-learning involves models for that part of the outcome regression representing treatment contrasts
doi:10.1093/biomet/ast014
pmid:24302771
pmcid:PMC3843953
fatcat:waa54cpndncj5jegplyvs7th7q