Practice#
In this section, you will be tasked with solving a control problem from start to finish.
Feel free to proceed as you wish. You could use a mathematical model or learn a model from data and then attempt to control it.
We will be using the Pendulum environment from gymnasium.
The system consists of a pendulum attached at one end to a fixed point, with the other end free. The pendulum starts in a random position, and we can apply torque to rotate the free end.
As seen below, the pendulum is represented in red and the joint is represented in black.
env = create_pendulum_environment()
result = simulate_environment(env)
show_video(result.frames, fps=env.metadata["render_fps"])
The environment allows the use of the following control (action):

| Index | Action | Unit | Min | Max |
|---|---|---|---|---|
| 0 | apply torque to the actuated joint | torque (N m) | -2 | 2 |
and the following measurements (observation):

| Index | Observation | Min | Max |
|---|---|---|---|
| 0 | \(\cos(\theta)\) | \(-1\) | \(1\) |
| 1 | \(\sin(\theta)\) | \(-1\) | \(1\) |
| 2 | \(\dot{\theta}\) | \(-8\) | \(8\) |
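Since the angle \(\theta\) never appears directly in the observation, it can be useful to recover it from the \((\cos\theta, \sin\theta)\) pair, e.g. for plotting or analysis. A small helper (ours, not part of the environment):

```python
import numpy as np

def angle_from_observation(obs):
    """Recover theta in (-pi, pi] from a (cos(theta), sin(theta), theta_dot) observation."""
    cos_theta, sin_theta = obs[0], obs[1]
    # arctan2 handles all four quadrants, unlike arccos or arcsin alone
    return np.arctan2(sin_theta, cos_theta)

obs = np.array([np.cos(0.3), np.sin(0.3), 0.0])
theta = angle_from_observation(obs)
```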
First Goal#
The first goal is to apply torques on the actuated joint to swing the pendulum into an upright position and keep it there.
Second Goal#
The second goal is to apply torques on the actuated joint to make the pendulum rotate as fast as possible.
Exercise 1#
Exercise 8 (Pendulum Model)
Use one of the previously seen methods to learn a model of the system from data.
Hint
If you would like to use a mathematical model of the system either for the control or just as help when learning a model, then please refer to the following page for the equations.
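For orientation, the dynamics can also be sketched in a few lines. The update rule and parameter values below follow our reading of the gymnasium Pendulum-v1 source (g = 10, m = 1, l = 1, dt = 0.05, semi-implicit Euler, angular velocity clipped to ±8, θ = 0 upright); treat this as a reference sketch, not the authoritative model:

```python
import numpy as np

# Approximate re-implementation of one Pendulum-v1 step
# (parameter values are the gymnasium defaults).
G, M, L, DT, MAX_SPEED, MAX_TORQUE = 10.0, 1.0, 1.0, 0.05, 8.0, 2.0

def pendulum_step(theta, theta_dot, u):
    """Semi-implicit Euler step of the pendulum dynamics."""
    u = np.clip(u, -MAX_TORQUE, MAX_TORQUE)
    # theta_ddot = 3g/(2l) sin(theta) + 3/(m l^2) u
    theta_ddot = 3 * G / (2 * L) * np.sin(theta) + 3.0 / (M * L**2) * u
    theta_dot = np.clip(theta_dot + theta_ddot * DT, -MAX_SPEED, MAX_SPEED)
    theta = theta + theta_dot * DT
    return theta, theta_dot
```

Note that hanging at rest (θ = π, no torque) is an equilibrium, while the upright position (θ = 0) is unstable, which is why a swing-up strategy is needed.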
Solution to Exercise 8 (Pendulum Model)
Data
Show code cell source
env = create_pendulum_environment(max_steps=1000)

observations = dict()
actions = dict()
frames = dict()

controllers = {
    "Sinusoid": SineController(env, np.asarray([0.5 * env.max_torque]), frequency=1),
    "Schroeder Sweep": SchroederSweepController(
        env,
        n_time_steps=1000,
        n_harmonics=10,
        frequency=10,
    ),
    "PRBS": PRBSController(np.asarray([env.max_torque])),
}

for controller_name, controller in controllers.items():
    result = simulate_environment(env, controller=controller)
    observations[controller_name] = result.observations
    actions[controller_name] = result.actions
    frames[controller_name] = result.frames
training_controller_name = "PRBS"
testing_controller_name = "Sinusoid"
X_train = observations[training_controller_name][:-1].copy()
U_train = actions[training_controller_name].copy()
t_train = np.arange(0, len(X_train)) * env.dt
X_val = observations[testing_controller_name][:-1].copy()
U_val = actions[testing_controller_name].copy()
t_val = np.arange(0, len(X_val)) * env.dt
SINDYc
Show code cell source
optimizer = ps.STLSQ(threshold=0.9)
feature_library = ps.PolynomialLibrary(degree=2)
sindy_model = ps.SINDy(optimizer=optimizer, feature_library=feature_library)
sindy_model.fit(X_train, u=U_train, t=t_train)
SINDy(differentiation_method=FiniteDifference(),
      feature_library=PolynomialLibrary(),
      feature_names=['x0', 'x1', 'x2', 'u0'], optimizer=STLSQ(threshold=0.9))
(x0)' = -0.981 x1 x2
(x1)' = 0.983 x0 x2
(x2)' = 14.766 x1 + 1.438 u0
Model score: 0.990928
X_sindy = sindy_model.simulate(X_val[0], t_val, u=U_val)
X_sindy = np.vstack([X_val[0][np.newaxis, :], X_sindy])
The results look good, at least up to a certain number of steps, which should be sufficient for our purpose. We could also use hyper-parameter optimization to find the best model; however, we would have to be careful about overfitting.
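For intuition, the STLSQ optimizer used above alternates least-squares fits with hard thresholding of small coefficients. A minimal numpy sketch of that idea (ours, not the pysindy implementation):

```python
import numpy as np

def stlsq(Theta, dXdt, threshold, n_iter=10):
    """Sequentially thresholded least squares: the core idea behind STLSQ."""
    Xi, *_ = np.linalg.lstsq(Theta, dXdt, rcond=None)
    for _ in range(n_iter):
        small = np.abs(Xi) < threshold           # coefficients to prune
        Xi[small] = 0.0
        for k in range(dXdt.shape[1]):           # refit the surviving terms
            big = ~small[:, k]
            if big.any():
                Xi[big, k], *_ = np.linalg.lstsq(Theta[:, big], dXdt[:, k], rcond=None)
    return Xi

# Toy check: recover dx/dt = -2x from noiseless data with a [x, x^2] library.
x = np.linspace(-1, 1, 50)[:, None]
Theta = np.hstack([x, x**2])
Xi = stlsq(Theta, -2 * x, threshold=0.5)
```

The thresholding is what produces the sparse, interpretable models printed above: terms whose coefficients fall below the threshold are removed entirely.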
EDMDc
Let’s also fit a model with EDMDc (a DMD variant that accounts for control inputs) so that we can compare the two methods and select the better model.
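At its core, (E)DMD with control lifts the states through the observables and then fits a linear model \(z_{k+1} \approx A z_k + B u_k\) by least squares. A minimal numpy sketch of that regression step (pykoopman adds the lifting and projection on top of this):

```python
import numpy as np

def fit_linear_with_control(Z, U):
    """Least-squares fit of z_{k+1} = A z_k + B u_k from snapshot data.

    Z: (n_steps, n_lifted) lifted states; U: (n_steps - 1, n_inputs) inputs.
    """
    Z0, Z1 = Z[:-1], Z[1:]
    Omega = np.hstack([Z0, U])                        # stacked regressors [z_k, u_k]
    G, *_ = np.linalg.lstsq(Omega, Z1, rcond=None)    # solves Omega @ G ~ Z1
    n = Z.shape[1]
    return G[:n].T, G[n:].T                           # A, B

# Toy check: recover a known linear system from input-driven data.
rng = np.random.default_rng(0)
A_true = np.array([[0.9, 0.1], [0.0, 0.8]])
B_true = np.array([[0.0], [1.0]])
Z = np.zeros((200, 2))
U = rng.normal(size=(199, 1))
for k in range(199):
    Z[k + 1] = A_true @ Z[k] + B_true @ U[k]
A, B = fit_linear_with_control(Z, U)
```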
Show code cell source
regressor = pk.regression.EDMDc()
observables = pk.observables.Polynomial(degree=2)
dmd_model = pk.Koopman(observables=observables, regressor=regressor)
dmd_model.fit(X_train, u=U_train, dt=env.dt)
Koopman(observables=Polynomial(), regressor=EDMDc())
X_dmd = dmd_model.simulate(X_val[0], U_val, n_steps=X_val.shape[0] - 1)
X_dmd = np.vstack([X_val[0][np.newaxis, :], X_dmd])
Model score: 5.936378
The results do not look good enough. We could again use hyper-parameter optimization to search for a better model; however, we would have to be careful about overfitting.
We will use the SINDYc model for the second exercise.
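A model-agnostic way to back this comparison is to measure the prediction error of each simulated trajectory against the validation data, for example the per-state RMSE (the helper below is ours, not part of either library):

```python
import numpy as np

def trajectory_rmse(X_pred, X_true):
    """Root-mean-square error, per state, between a simulated and a measured trajectory."""
    n = min(len(X_pred), len(X_true))  # guard against off-by-one trajectory lengths
    return np.sqrt(np.mean((X_pred[:n] - X_true[:n]) ** 2, axis=0))

# Hypothetical usage with the trajectories computed above:
# print("SINDYc:", trajectory_rmse(X_sindy, X_val))
# print("EDMDc: ", trajectory_rmse(X_dmd, X_val))
```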
Exercise 2#
Exercise 9 (Pendulum Control)
Use the learned model and synthesize a controller to achieve the goals described above.
Solution to Exercise 9 (Pendulum Control)
For this exercise, we will use the SINDYc model along with an MPC controller to achieve our control objectives.
To use the SINDYc model with do-mpc, we first have to convert it to a CasADi model.
Model
mpc_model = build_sindy_model(sindy_model)
To make sure that our model is correct, we simulate the system using it
simulator = Simulator(mpc_model)

params_simulator = {
    "integration_tool": "idas",
    "abstol": 1e-8,
    "reltol": 1e-8,
    "t_step": env.dt,
}
simulator.set_param(**params_simulator)
simulator.setup()
Show code cell source
%%capture
x0 = X_val[0]
simulator.reset_history()
simulator.x0 = x0
for u in U_val:
    simulator.make_step(u.reshape((-1, 1)))
Controller
setpoint = np.array([1.0, 0.0, 0.0])
cost = casadi.norm_2(mpc_model.x.cat - setpoint) - 100 * mpc_model.x["x0"]
terminal_cost = cost
stage_cost = cost
print(f"Stage Cost = {stage_cost}")
print(f"Terminal Cost = {terminal_cost}")
Stage Cost = (sqrt(((sq((x0-1))+sq(x1))+sq(x2)))-(100*x0))
Terminal Cost = (sqrt(((sq((x0-1))+sq(x1))+sq(x2)))-(100*x0))
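As a sanity check on the cost design: the Euclidean distance to the upright setpoint is dominated by the \(-100 \, x_0\) term, which strongly rewards \(x_0 = \cos\theta \approx 1\), i.e. the upright position. A quick numeric check of the same expression (a numpy stand-in for the CasADi cost, written by us):

```python
import numpy as np

setpoint = np.array([1.0, 0.0, 0.0])

def stage_cost(x):
    """Numpy version of the MPC cost: ||x - setpoint||_2 - 100 * cos(theta)."""
    return np.linalg.norm(x - setpoint) - 100 * x[0]

upright = np.array([1.0, 0.0, 0.0])   # theta = 0
hanging = np.array([-1.0, 0.0, 0.0])  # theta = pi
```

Evaluating the cost at the two extremes confirms that the upright state is much cheaper than the hanging state, so the optimizer is pushed toward the swing-up.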
u_limits = {"u0": np.array([-2, 2])}
u_penalty = {"u0": 0.00}
x_limits = {"x0": np.array([-1, 1]), "x1": np.array([-1, 1]), "x2": np.array([-8, 8])}
mpc_controller = build_mpc_controller(
    model=mpc_model,
    t_step=env.dt,
    n_horizon=50,
    stage_cost=stage_cost,
    terminal_cost=terminal_cost,
    x_limits=x_limits,
    u_penalty=u_penalty,
    u_limits=u_limits,
)
Simulation
Show code cell source
%%capture
mpc_controller.reset_history()
simulator.reset_history()
x = np.zeros((3, 1))
# random angle
theta0 = np.random.uniform(low=-np.pi, high=np.pi)
# cosine and sine
x[0] = np.cos(theta0)
x[1] = np.sin(theta0)
# angular velocity
x[2] = np.random.uniform(low=-8, high=8)
simulator.x0 = x
mpc_controller.x0 = x
mpc_controller.set_initial_guess()
for k in range(100):
    u = mpc_controller.make_step(x)
    x = simulator.make_step(u)
Environment
class MPCController:
    def __init__(self, mpc: MPC) -> None:
        self.mpc = mpc
        self.mpc.reset_history()
        x0 = np.zeros((3, 1))
        # random angle
        theta0 = np.random.uniform(low=-np.pi, high=np.pi)
        # cosine and sine
        x0[0] = np.cos(theta0)
        x0[1] = np.sin(theta0)
        # angular velocity
        x0[2] = np.random.uniform(low=-8, high=8)
        self.mpc.x0 = x0
        self.mpc.set_initial_guess()

    def act(self, observation: NDArray) -> NDArray:
        return self.mpc.make_step(observation.reshape(-1, 1)).ravel()
%%capture
controller = MPCController(mpc_controller)
results = simulate_environment(env, max_steps=100, controller=controller)