Skip to main navigation Skip to search Skip to main content

Sim-to-Real Learning-Based Nonlinear MPC for UAV Navigation and Collision Avoidance in Unknown Cluttered Environments

  • Oualid Doukhi
  • , Deok Jin Lee*
  • *Corresponding author for this work
  • Jeonbuk National University

Research output: Contribution to journalJournal articlepeer-review

Abstract

Quadrotor control in dynamic and unstructured environments requires robust and adaptive solutions to achieve agile and autonomous navigation. Nonlinear Model Predictive Control (NMPC) offers precise control through a nonlinear dynamical system model and online optimization over a short prediction horizon. However, its adaptability can be significantly enhanced through reinforcement learning. This study proposes a multimodal NMPC (MP-NMPC) policy, leveraging deep reinforcement learning to improve online optimization. Conditioned on the quadrotor’s local observations, the trained reinforcement learning policy dynamically selects adaptation parameters and feedforward control commands for the low-level NMPC controller. The search for adaptation variables and feedforward control signals is formulated as a Markov Decision Process (MDP), enabling reinforcement learning to address this optimization challenge. A multimodal deep reinforcement learning policy that combines depth images with inertial data is trained in simulation and transferred to a real quadrotor in a zero-shot fashion, facilitating online hyperparameter adaptation in highly dynamic unstructured environments. The significance of this approach is demonstrated through its application to challenging tasks such as agile navigation and obstacle avoidance, where it enables autonomous flight through complex environments and outperforms standard reinforcement learning methods.

Original languageEnglish
Pages (from-to)46249-46262
Number of pages14
JournalIEEE Access
Volume13
DOIs
StatePublished - 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Reinforcement learning
  • agile navigation
  • autonomous quadrotors
  • multimodal policy
  • nonlinear model predictive control (NMPC)
  • obstacle avoidance
  • simulation to real transfer

Quacquarelli Symonds(QS) Subject Topics

  • Materials Science
  • Computer Science & Information Systems

Fingerprint

Dive into the research topics of 'Sim-to-Real Learning-Based Nonlinear MPC for UAV Navigation and Collision Avoidance in Unknown Cluttered Environments'. Together they form a unique fingerprint.

Cite this