Skip to content

Latest commit

 

History

History
40 lines (29 loc) · 2.41 KB

File metadata and controls

40 lines (29 loc) · 2.41 KB

Dynamic Structures with Reinforcement Learning

We are also interested in producing more structured trajectories, primarily lemniscates. More recently, we have started investigating the use of reinforcement learning to optimize the orientation of the larger structure.

Related work

  1. P. T. Jardine and S. N. Givigi, "Flocks, Mobs, and Figure Eights: Swarming as a Lemniscatic Arch", IEEE Transactions on Network Science and Engineering, 2022.

  2. P. T. Jardine and S. Givigi, "Emergent homeomorphic curves in swarms" in Automatica, vol. 176, 2025.

Circles and Twisted Circles

Dynamic Structures: Lemniscate trajectories.

Dynamic Structures: Encirclement.

Reinforce Learning

The curves above are produced using a geometric embedding technique, which allows us to produce a wide variety of emergent trajectories using a single underlying control policy. The orientation of this embedding can be optimized using reinforcement learning. Below is an exampled using Continuous Action Learning Automata (CALA).

Reinforcement Learning: Initial results for learning optimal embedding orientations.

Reinforcement Learning: Illustration of the distribution of learned policies over time.

Reinforcement Learning: Illustration of the explored action space.