Continuous Action Space

In this section, we focus on single-stage policy learning where the action is a continuous variable, such as the dosage of insulin.