Continuous Action Space#
In this section, we concentrate on single-stage policy learning with an emphasis on continuous action variables, such as the dosage of insulin.
In this section, we concentrate on single-stage policy learning with an emphasis on continuous action variables, such as the dosage of insulin.