Introduction

The evolution of UAVs over the past decade has significantly transformed the landscape of numerous scientific and industrial domains1,2,3,4. Initially envisioned as aerial sensors, UAVs demonstrated unparalleled utility in surveillance5,6, forest fire monitoring7, building inspection8,9, forestry10 and terrain mapping11 due to their agility and accessibility to challenging environments12,13. However, despite their remarkable success in sensing and monitoring, UAVs initially struggled in scenarios that demanded physical interaction with the environment.

Emerging as a solution, the field of aerial manipulation sought to equip UAVs with robotic manipulators14,15,16 and claw-like grippers17,18,19. This marked the beginning of a new era where UAVs could engage in tasks requiring direct physical contact20. However, these traditional claw-like rigid grippers required precise alignment and positioning relative to the target object21. Even minor deviations could prevent successful grasping, making them susceptible to inaccuracies and necessitating exhaustive grasp analysis.

Soft aerial grasping emerged as an evolution to address the challenges posed by traditional grasping mechanisms. Soft grippers, with their adaptability and compliance, can seamlessly fit various object geometries, significantly minimizing the need for detailed grasp analysis22,23,24,25,26. Soft grippers are inherently light and versatile thanks to the passive intelligence baked into their mechanical structure. Not only does this reduce the mechanical impact during interactions, but it also makes the grippers particularly suitable for aerial applications. The inherent compliance of soft robotics presents an innovative solution to the difficulties related to tolerance in object grasping25. Their ability to passively conform to objects leverages the concept of morphological computation, where the gripper’s passive mechanical structures supplement the role of active controls27,28.

A particular type of soft gripper, the universal jamming gripper (UG), was introduced in29. The UG is based on the principle of granular jamming, whereby certain materials transition from a fluid- to a solid-like state when subjected to external pressure30. This unique mechanism allows particles to flow and take arbitrary shapes, then solidify in place under external pressure31. The UG’s lightweight composition features a membrane filled with such a material that hardens under vacuum, establishing a firm grasp on the englobed object via a combination of suction, physical interlocking, and friction. Unlike many other soft grippers, the UG is specially crafted to engage with a wide array of object geometries32,33,34 and delicate manipulations35. Its symmetric design, omnidirectional grasping capabilities, and passive compliance sidestep some of the complexities inherent to traditional grippers, giving room to simpler control strategies.

Guidance and control strategies are the foundational pillars in aerial manipulation literature. Typically, trajectory generation and low-level tracking control are used to meet the mission requirements36,37. The complexity of the tracking controller generally depends on the type of aerial manipulator. AMs equipped with heavy serial link manipulators require their dynamics to be taken into account38. In contrast, simple claw-like grippers can often be treated as an unknown disturbance, effectively handled by common model-free robust control schemes such as PID or SMC. Trajectory generation often distinguishes between global and local trajectories, where global trajectories require path planning utilizing some form of map representation37. Local, short-term, trajectory generation is frequently formulated as an optimization problem, specifically adapted to the nature of the mission. For instance, B-spline and sequential quadratic programming were used in39 to generate minimum-time trajectories. Real-time minimal jerk trajectories for aerial perching were shown in40, while multi-objective optimization-based trajectory planning was demonstrated in41. Dynamic artificial potential fields were used to follow a moving ground target in42. Lastly, stochastic model predictive control (MPC) and dynamic movement primitives were applied in43 and44.

MPC is a control strategy that leverages online optimization techniques to compute optimal control inputs over a receding horizon according to a given objective function. It has been used with great success in many domains45,46 thanks to its ability to work with non-linear system models and to respect arbitrary constraints. A common problem of this technique is the computational burden of the optimization step, which was especially problematic for systems exhibiting fast dynamics with limited compute resources (e.g., UAVs). However, in recent years, embedded systems have become more powerful, and optimization algorithms have become more efficient47. In UAV trajectory generation, MPC can fully exploit the system’s dynamics while respecting constraints imposed by the environment and the manipulation task. Its ability to handle multiple (potentially conflicting) objectives and its robustness to dynamic uncertainties, thanks to the receding horizon approach, make it a prime candidate for local trajectory generation.

Our previous work34 experimentally validated the UG’s grasping capabilities on a real UAV in an open-loop setup, highlighting the benefits of an automated approach. The reliance on a human in the loop inherently limits the scalability and adaptability of UAV operations in grasping tasks. Addressing this shortcoming, this study melds the inherent passive intelligence of the gripper with a simple, robust control scheme, marking a significant leap towards fully automated grasping.

The main objective of this paper is to introduce a novel control architecture for aerial grasping with soft jamming grippers, addressing the prevalent challenges in UAV deployments for material handling and grasping tasks. This initiative is realized by customizing a soft universal jamming gripper for aerial grasping and developing a model predictive controller to constrain the UAV’s motion while enhancing operational autonomy and safety. The contributions of this research encompass:

  • Formulation and implementation of a constrained path planning algorithm, specifically designed for navigating UAVs in complex aerial environments, utilizing model predictive control (MPC) adapted to the UG.

  • Integration and application of a force control strategy, ensuring both efficacy and robustness in grasping operations.

  • Presentation and validation of a scenario that showcases the framework’s efficacy, contributing to elevating the level of autonomy in aerial grasping tasks, verified through simulations and virtual experiments.

The salient features of this framework are its straightforward architecture and the synergy between the control system and the passive intelligence of the UG, which together solve an otherwise complex problem.

The paper is structured as follows: We begin by articulating the problem statement and outlining the necessary technical background, followed by a deep dive into MPC where we elaborate on its formulation and adaptations tailored for the proposed grasping mechanism. Next, we introduce the force control strategy that dictates the gripping actions, ensuring both secure and reliable grasping. After this, we present the numerical simulation process, followed by a description of the virtual experiments conducted to validate our framework, highlighting the simulations and results that underscore the effectiveness of our approach. We then discuss and interpret the findings, explore their implications, and suggest future research directions. Finally, we conclude by summarizing the key takeaways of our research.

Problem statement and preliminaries

Problem statement

The main objective of this research is to enhance autonomy in aerial grasping tasks involving a UAV equipped with a UG. Achieving this autonomy necessitates overcoming several challenges:

Optimal Approach Planning. The UAV, armed with a UG, must efficiently navigate toward the target object in cluttered and potentially changing environments. The planning algorithm should be robust, adaptable, and capable of handling real-time environmental perturbations, ensuring that the UAV can modify its trajectory dynamically to avoid obstacles and save energy.

Figure 1

Schematics of autonomous grasping setup with a UG-equipped UAV. (1) The UAV approaches the payload avoiding all obstacles by tracking the MPC-generated trajectory. The payload’s position is roughly known, and then refined as it appears in the FoV of the camera. (2) As the gripper makes contact with the payload, the grasping procedure is started and the UAV switches into force control mode (SMC). (3) Once a reliable grasp is established, the UAV departs with the payload. (4) Upon reaching its destination, the payload is released. (a) The membrane is filled with air and a granular material. (b) An activation force pushes the membrane against the target, partially englobing the object. (c) Air is removed from the membrane. It shrinks and jams the particles, turning itself into a solid-like state. The contact force has to be maintained during this transition to establish a reliable grasp. (d) The resulting friction, geometric interlocking, and suction create a firm grasp. An integrated load cell measures the weight of the payload, used as feed-forward in the control architecture. (e) Pumping air into the membrane releases the object.

Effective grasping. The grasping mechanism utilizes particle jamming and a soft membrane to establish a firm yet gentle grasp on the target object, as depicted in Fig. 1a–c. The control strategy must regulate the activation force to mitigate the risk of damage during the interaction, maintain contact throughout the closing transition of the gripper (a necessary condition for a strong grasp), and cope with the gripper’s changing physical properties during the jamming transition (from soft to rigid).

Adaptability post-grasping. Following the capture of the payload, the UAV, in its role as an aerial manipulator (AM), must efficiently transport the payload to a designated location. This phase leverages the payload’s measured mass as a feed-forward element in the AM’s control system for rapid adjustment (Fig. 1d). This swift recalibration, essential for maintaining flight stability under increased weight, ensures safe and effective delivery.

The overarching goal is to craft a comprehensive control framework that synergizes with the innovative mechanics of the UG, resulting in a fully automated aerial grasping system. This framework should be capable of navigating the complexities of real-world environments, ensuring the seamless execution of aerial grasping tasks from approach to object retrieval and final deposition. The proposed framework and its integral components, aimed at solving the outlined problems, will be evaluated through numerical simulations and virtual experiments to validate their effectiveness and applicability in practical scenarios. The envisioned automated grasping setup is depicted in Fig. 1.

UG modeling

This section summarizes crucial insights into the modeling of the UG, laying the foundation for the virtual experiments discussed later in the manuscript. For an in-depth exploration of the UG’s design, fabrication, integration, modeling, and experiments on a UAV, readers are directed to34.

Key to the UG’s capabilities is its ability to shift from a soft to a rigid state, see Fig. 1b–c. This adaptability significantly enhances the robustness of aerial manipulation tasks. Notably, the UG’s ability to adapt to various object geometries negates the need for precise control based on object orientation, thereby drastically simplifying the control complexity.

To render the UG’s behavior in robotic simulators, we propose the contact model in Fig. 2. This model melds two non-linear compression springs, \(k_{lmp}\) and \(k_{air}\). The former captures the lumped stiffness of the jammed filler material. The latter encapsulates the dynamics of the air-filled membrane during contact, lumping together numerous parameters, including the effective contact area and the non-linear elastic behavior of the membrane, akin to an air spring48. However, pinpointing a precise model proves challenging and offers limited practical utility in this context.

Figure 2

UG contact model. (a) Semantic model of the UG showing its soft (fluidic) state, and its rigid (closed) state. (b) The corresponding model delineates the membrane into two components: the air-filled elastic part and the system’s remainder (filler material and structural components), represented as compression springs \(k_{air}\) and \(k_{lmp}\) in a parallel configuration. (c) The derived simulated system encompasses a disk (body) with a mass m, coupled to a prismatic joint with finite, state-dependent travel, governed by the combined elastic force \(F_{ug}\) that substitutes the parallel spring arrangement.

We employed non-linear regression analysis to identify the interrelation between the depth of entrance x, the payload diameter D, and the resultant elastic force \(F_{air}\). Concurrently, the system’s lumped elastic force is denoted as \(F_{lmp}(x)\). The combined elastic force \(F_{ug}\) is then derived from the equation

$$\begin{aligned} F_{ug} = F_{air}(h(x), D) + F_{lmp}(h(x - x_a)), \end{aligned}$$
(1)

where h(y) represents the compression-only action of the springs, defined as \(h(y) = 0\) if \(y < 0\), and \(h(y) = y\) if \(y \ge 0\).
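As a minimal sketch, the contact force of Eq. (1) translates directly into code. The fitted spring laws \(F_{air}\) and \(F_{lmp}\) below are hypothetical placeholders, since the regression results are not reproduced here:

```python
import numpy as np

def h(y):
    """Compression-only action of the springs (Eq. 1): h(y) = max(y, 0)."""
    return np.maximum(y, 0.0)

def F_ug(x, D, x_a, F_air, F_lmp):
    """Combined elastic force of the UG contact model, Eq. (1).

    x: depth of entrance, D: payload diameter, x_a: free length.
    F_air(x, D) and F_lmp(x) are the fitted non-linear spring laws
    (identified by regression in the paper; not reproduced here).
    """
    return F_air(h(x), D) + F_lmp(h(x - x_a))

# Hypothetical spring laws for illustration only (NOT the paper's fitted models):
F_air_fit = lambda x, D: 120.0 * x**1.5 * D
F_lmp_fit = lambda x: 8000.0 * x**2

# Below the free length x_a, only the air spring contributes:
force = F_ug(0.03, 0.06, 0.0408, F_air_fit, F_lmp_fit)
```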

The free length \(x_a\), which depends on the normalized free volume \(\beta\), follows the linear mapping \(x_a = x^u_a \cdot \beta\). Here, \(x^u_a\), determined experimentally to be 40.8 mm, denotes the free length associated with the minimal volume that the filler material occupies in the jammed state. \(\beta\) serves as a pivotal parameter in the model, representing the ratio of the current free volume to the maximum free volume within the membrane. It operates as a dynamic entity, delineating the transitions between the gripper’s fluidic and rigid states, thereby facilitating the control and adaptation of the UG to various object geometries. In the s-domain, \(\beta\) is described by the equation

$$\begin{aligned} \beta (s) = \frac{R(s)}{1+s T}, \end{aligned}$$
(2)

with R(s) serving as a unit step, s representing the Laplace variable, and T denoting the system’s time constant, approximated as 4.3 s34. Variable \(\beta\) distinguishes three discrete states, outlined as

$$\begin{aligned} k_{gr}(\beta ) = {\left\{ \begin{array}{ll} \text {closed/jammed} & \text {if } \beta \le 1\% , \\ \text {opened/fluidized} & \text {if } \beta \ge 99\% , \\ \text {in transition} & \text {otherwise}, \end{array}\right. } \end{aligned}$$
(3)

highlighting the system’s increased stiffness as \(\beta\) approaches zero. This contact model is pivotal in developing a digital copy of the UG for robotic simulators such as Gazebo.
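The membrane volume dynamics of Eq. (2) and the state classification of Eq. (3) can be sketched as follows; the forward-Euler step size and simulation length are illustrative choices:

```python
import numpy as np

T = 4.3  # identified time constant of the membrane volume dynamics, in seconds

def simulate_beta(r, dt=0.01, t_end=25.0, beta0=0.0):
    """Forward-Euler simulation of Eq. (2): T * dbeta/dt + beta = r (step input r)."""
    n = int(t_end / dt)
    beta = np.empty(n)
    b = beta0
    for i in range(n):
        b += dt / T * (r - b)
        beta[i] = b
    return beta

def gripper_state(beta):
    """Discrete gripper states of Eq. (3); beta given as a fraction in [0, 1]."""
    if beta <= 0.01:
        return "closed/jammed"
    if beta >= 0.99:
        return "opened/fluidized"
    return "in transition"

beta = simulate_beta(r=1.0)   # fluidizing: beta rises from 0 toward 1
# After one time constant (t = T) the first-order response reaches ~63%,
# so the gripper is still "in transition".
```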

UAV modeling

The optimal trajectory of the AM is determined by balancing multiple objectives:

  1. Energy efficiency: Minimize control effort to ensure the path is energetically cost-effective.

  2. Visual servoing: Maintain the targeted object within the camera’s field of view for accurate localization.

  3. Grasping approach: Initially, maintain a distance \(z_{\text {grab}}\) from the floor to prevent collision with the payload and other objects during the approach. Subsequently, when close, descend following a near-vertical path to engage the gripper with the payload.

For the problem visualized in Fig. 1, the following state vector is defined

$$\begin{aligned} \pmb {\MakeLowercase {x}} = \left( {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}, {{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}} \right) \in \mathbb {R}^{8}, \end{aligned}$$
(4)

with \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}} = \left( x, y, z, \psi \right)\), and \({{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}} = \left( v_x, v_y, v_z, v_\psi \right)\), where \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}\) is the position, \(\psi\) the yaw angle of the UAV, and \({{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}}\) the velocity of the robot. The superscript indicates the frame of reference: \(\texttt{W}\) for world and \(\texttt{B}\) for body. Lastly, \(\pmb {\MakeLowercase {z}}\) is defined as the vector of parameters which includes

$$\begin{aligned} \pmb {\MakeLowercase {z}} = \left( {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}}, v_0, v_N, {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}, c_{lock}, z_{safe}, {{}^{\mathtt {}}\pmb {\MakeLowercase {o}}^{\mathtt {}}_{\texttt{1}}}, \ldots , {{}^{\mathtt {}}\pmb {\MakeLowercase {o}}^{\mathtt {}}_{\mathtt {N_o}}} \right) \in \mathbb {R}^{11+4 N_o}, \end{aligned}$$
(5)

with \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}} = \left( x_{ref}, y_{ref}, z_{ref}, \psi _{ref}\right) ,\) \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}} = \left( u_{ref}, v_{ref}, w_{ref}\right) ,\) and \({{}^{\mathtt {}}\pmb {\MakeLowercase {o}}^{\mathtt {}}_{\texttt{i}}} = \left( o_{x,i}, o_{y,i}, o_{sx, i}, o_{sy, i}\right) ,\) where \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}}\) is the desired position of the UAV, \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\) is the visual target location (point of interest), \(v_0\) and \(v_N\) are the desired velocity at the start, resp., at the end of the horizon, \(c_{lock}\) is the binary state indicating whether the visual lock has been acquired or not (i.e., whether a visual target is present), \(z_{safe}\) designates the safety altitude and \(o_{(\cdot ),i}\) represents the xy-centerpoint and minor and major axis of the i’th (out of \(N_o\)) ellipsoidal obstacles. The separation of the visual target and the desired position in combination with \(c_{lock}\) gives a great deal of flexibility. By having \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}} \ne {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\), the AM can be tasked to explore the vicinity of the object of interest while keeping it in the field of view of the sensor. At the same time, the sensor lock can be disabled by setting \(c_{lock}=0\) to prevent the controller from tracking the target in certain cases (e.g., when moving to the drop-off area). For the actual grasping, \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}} \approx {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\) as the visual target position generally corresponds to the payload position.
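As a quick consistency check on the dimensions of Eq. (5), the parameter vector can be assembled as follows (all numeric values are placeholders):

```python
import numpy as np

def pack_parameters(p_ref, v0, vN, p_vis, c_lock, z_safe, obstacles):
    """Assemble the MPC parameter vector z of Eq. (5).

    p_ref: (x, y, z, psi) desired pose         -> 4 entries
    v0, vN: desired start / end velocities     -> 2 entries
    p_vis: (u, v, w) visual target position    -> 3 entries
    c_lock: 1 if a visual lock is acquired     -> 1 entry
    z_safe: safety altitude                    -> 1 entry
    obstacles: list of (ox, oy, osx, osy)      -> 4 entries per obstacle
    """
    z = [*p_ref, v0, vN, *p_vis, float(c_lock), z_safe]
    for o in obstacles:
        z.extend(o)
    return np.asarray(z)

# Placeholder mission data with N_o = 2 ellipsoidal obstacles:
z = pack_parameters((1.0, 2.0, 1.5, 0.0), 0.0, 0.0, (1.0, 2.0, 0.0), 1, 1.2,
                    [(3.0, 3.0, 0.5, 0.8), (5.0, 1.0, 0.4, 0.4)])
assert z.size == 11 + 4 * 2   # matches z in R^{11 + 4 N_o}
```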

Regarding the state dynamics, we propose a basic kinematic UAV hover model that greatly simplifies the actual UAV dynamics. High-performance applications generally require more faithful models; however, our main concern is to generate a feasible trajectory that the relatively slow-flying UAV can track. This simple model has the advantage of being easily identifiable and computationally lightweight. The state dynamics are thus defined as follows:

$$\begin{aligned} f\left( \pmb {\MakeLowercase {v}}, \pmb {\MakeLowercase {u}}\right) = {\left\{ \begin{array}{ll} {{}^{\texttt{W}}\dot{\pmb {\MakeLowercase {p}}}^{\mathtt {}}_{\mathtt {}}} &= \begin{pmatrix} {}^{\texttt{W}}\pmb {\MakeUppercase {R}}_{\texttt{B}}\left( \psi \right) {{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {[xyz]}}} \\ {{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {[\psi ]}}} \end{pmatrix}, \\ {{}^{\texttt{B}}\dot{\pmb {\MakeLowercase {v}}}^{\mathtt {}}_{\mathtt {}}} &= \frac{1}{\pmb {\MakeLowercase {\tau }}} \left( \pmb {\MakeLowercase {u}} \odot \pmb {\MakeLowercase {k}} - {{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}}\right) , \end{array}\right. } \end{aligned}$$
(6)

where \({}^{\texttt{W}}\pmb {\MakeUppercase {R}}_{\texttt{B}} \in SO(3)\) is the rotation from frame \(\texttt{B}\) to \(\texttt{W}\), \(\odot\) designates the component-wise vector multiplication and the subscript in brackets indicates the components of the respective vector, e.g., \({{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {[xyz]}}} \in \mathbb {R}^3\) indicates the xyz components of \({{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}} \in \mathbb {R}^4\).
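A minimal implementation of the hover model (6) might look as follows; the gains and time constants used in the example call are placeholders, not the identified values of Table 1:

```python
import numpy as np

def f(p, v, u, k, tau):
    """Kinematic hover model of Eq. (6).

    p = (x, y, z, psi) in world frame W, v = (vx, vy, vz, vpsi) in body frame B,
    u: velocity commands, k: gains, tau: time constants (component-wise).
    Returns (p_dot, v_dot).
    """
    psi = p[3]
    c, s = np.cos(psi), np.sin(psi)
    R_wb = np.array([[c, -s, 0.0],
                     [s,  c, 0.0],
                     [0.0, 0.0, 1.0]])             # yaw-only rotation from B to W
    p_dot = np.concatenate([R_wb @ v[:3], v[3:]])  # world position and yaw rates
    v_dot = (u * k - v) / tau                      # first-order velocity response
    return p_dot, v_dot

# From hover, a unit forward command accelerates the body x velocity at k/tau:
p_dot, v_dot = f(np.zeros(4), np.zeros(4),
                 np.array([1.0, 0.0, 0.0, 0.0]),
                 k=np.ones(4), tau=np.full(4, 0.3))
```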

The gains \(k_i\) and time constants \(\tau _i\) of the system were determined from the simulated AM with the classical step-response tangent method:

$$\begin{aligned} \tau _i&= \frac{3}{2} \left( t_{y=63\%}-t_{y=28\%}\right) , \end{aligned}$$
(7a)
$$\begin{aligned} k_i&= \frac{y(\infty )}{u(\infty )}. \end{aligned}$$
(7b)

The results of the system identification are given in Table 1.

Table 1 Identified time constants and gains of the simulated first-order AM system.
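The two-point identification of Eqs. (7a) and (7b) can be sketched and verified on a synthetic first-order step response (the true \(\tau = 0.5\) s and \(k = 2\) here are arbitrary test values):

```python
import numpy as np

def identify_first_order(t, y, u_inf):
    """Two-point (tangent-method) identification, Eqs. (7a)-(7b).

    t, y: time stamps and measured step response; u_inf: final input amplitude.
    Uses the 28% / 63% crossing times of the steady-state value.
    """
    y_inf = y[-1]
    t28 = t[np.argmax(y >= 0.28 * y_inf)]
    t63 = t[np.argmax(y >= 0.63 * y_inf)]
    tau = 1.5 * (t63 - t28)   # Eq. (7a)
    k = y_inf / u_inf         # Eq. (7b)
    return tau, k

# Synthetic first-order response with known tau = 0.5 s and k = 2:
t = np.linspace(0.0, 5.0, 5001)
y = 2.0 * (1.0 - np.exp(-t / 0.5))
tau_hat, k_hat = identify_first_order(t, y, u_inf=1.0)
```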

Trajectory optimization using model predictive control

The MPC-based trajectory generation subsystem calculates the optimal velocity commands based on the cost function J developed hereafter. This subsystem is embedded in the AM controller as shown in Fig. 3.

Figure 3

The control architecture consists of a force control subsystem and the MPC trajectory generator orchestrated by the mission planner (a state machine). Depending on the state of the mission, \(\zeta\) switches between the force tracking and the trajectory tracking control modes. Both control modes share a common low-level controller made from the sliding-mode altitude controller and the xy-position controller (cascading PID controller). The force controller commands a cumulative force \(u_T\) to minimize the error \(e_f\) via an intermediate altitude error \(e_z\). The trajectory controller feeds velocity commands to the low-level controller. Furthermore, it takes the measured weight of the payload as a feed-forward signal to quickly adapt to the changed dynamics. The gripper controller closes the gripper by emitting \(u_{close}\) once a steady state is reached. The mission planner keeps track of all states and executes the mission.

Problem formulation

The discrete-time optimization problem formulation is given by49:

$$\begin{aligned} \underset{\pmb {\MakeLowercase {x}}, \pmb {\MakeLowercase {u}}}{\text {Minimize}}\, \sum _n J(\pmb {\MakeLowercase {x}}_n, \pmb {\MakeLowercase {u}}_n) + E(\pmb {\MakeLowercase {x}}_N) \end{aligned}$$
(8a)
$$\begin{aligned} \text {subject to:} \quad \pmb {\MakeLowercase {x}}(0) = \pmb {\MakeLowercase {x}}_0 \end{aligned}$$
(8b)
$$\begin{aligned} \pmb {\MakeLowercase {x}}_{n+1} = f\left( \pmb {\MakeLowercase {x}}_n, \pmb {\MakeLowercase {u}}_n, t\right) \end{aligned}$$
(8c)
$$\begin{aligned} \pmb {\MakeLowercase {u}}_n \in U_n \end{aligned}$$
(8d)
$$\begin{aligned} H_n\left( \pmb {\MakeLowercase {x}}_n, \pmb {\MakeLowercase {u}}_n\right) \le 0 \end{aligned}$$
(8e)
$$\begin{aligned} H_N\left( \pmb {\MakeLowercase {x}}_N\right) \le 0 \end{aligned}$$
(8f)
$$\begin{aligned} n \in \left\{ 1, \dots , N \right\} \end{aligned}$$
(8g)

where J and E are the stage and terminal costs, respectively. \(H_n\) and \(H_N\) represent general inequality constraints, f represents the system dynamics (6), n is the stage number with a total of N stages in the control horizon, and \(\pmb {\MakeLowercase {u}}\) are the control inputs.

Within this work, the proximal averaged Newton-type method for optimal control (PANOC) algorithm49 is used to solve the non-convex optimal control problem (OCP) that is part of the non-linear MPC formulation. PANOC solves the underlying fixed-point equations using a fast-converging line-search in a single direct-shooting formulation. Being a first-order method, it is particularly well suited for embedded onboard applications with limited memory and compute power, and it generally outperforms sequential quadratic programming (SQP) methods.

Remark 1

The solution to problem (8) is only optimal within the fixed-length moving horizon. Globally, the resulting trajectory can be suboptimal, with the risk of getting trapped in a local minimum.
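To illustrate the receding-horizon scheme of problem (8) on a toy scalar system, the sketch below re-solves a small OCP at every step and applies only the first input. A plain projected-gradient loop (a first-order method, loosely in the spirit of PANOC, but not the OpEn implementation) stands in for the actual solver; all dynamics, weights, and bounds are illustrative:

```python
import numpy as np

N, dt, u_max = 10, 0.1, 2.0   # horizon length, step size, input bound

def rollout(x0, u):
    """Toy scalar dynamics x_{n+1} = x_n + dt * u_n, standing in for (8c)."""
    xs = np.empty(N)
    x = x0
    for n in range(N):
        x = x + dt * u[n]
        xs[n] = x
    return xs

def grad(u, x0, x_ref):
    """Analytic gradient of a stage + terminal cost in the shape of (8a):
    sum (x_n - x_ref)^2 + 0.01 sum u_n^2 + 10 (x_N - x_ref)^2."""
    e = rollout(x0, u) - x_ref
    g = 2.0 * dt * np.array([e[m:].sum() for m in range(N)])  # tracking term
    g += 0.02 * u                                             # control effort term
    g += 20.0 * e[-1] * dt                                    # terminal cost term
    return g

def solve_ocp(x0, x_ref, iters=200, lr=0.2):
    """Projected gradient descent; the box constraint |u| <= u_max of (8d)
    is enforced by projection after each step."""
    u = np.zeros(N)
    for _ in range(iters):
        u = np.clip(u - lr * grad(u, x0, x_ref), -u_max, u_max)
    return u

# Receding horizon: re-solve at every step, apply only the first input.
x, x_ref = 0.0, 1.0
for _ in range(30):
    x = x + dt * solve_ocp(x, x_ref)[0]
```

Per Remark 1, each solve is only optimal over its own horizon; the closed-loop behavior emerges from the repeated re-solving.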

Cost function: perception

Figure 4

The UAV’s camera detects the payload in green and assigns it a position within its frame of reference. The centroid of the payload is projected back into the camera’s image plane, where it is assigned the coordinates \(s_x\) and \(s_y\).

For a safe and reliable approach, the targeted object should preferably stay in the field of view (FoV) of the AM. Herein, we follow the development in50 to impose this behavior on the AM. Imposing this as a hard constraint would in many cases lead to an infeasible problem. By softening those constraints, the MPC gets the flexibility to find a solution that is a more favorable compromise between the objectives, e.g., saving energy at the expense of slightly violating the perception constraint.

The violation of a constraint is a binary state: either it is violated (‘1’) or not (‘0’). If it is, a non-zero cost is associated with it, which incentivizes the optimizer to find a solution that satisfies the constraint. This binary behavior is akin to the Heaviside step function

$$\begin{aligned} H(x) = {\left\{ \begin{array}{ll} 1, & x > 0,\\ 0, & \text {otherwise}. \end{array}\right. } \end{aligned}$$
(9)

However, its discontinuous nature makes it unsuitable for numerical optimization. Better candidates are found in the family of logistic functions. Herein we use the sigmoid function instead of the Heaviside step function as it is continuously differentiable over its entire domain. The sigmoid function is defined as

$$\begin{aligned} \sigma \left( x, x_0, k_s, k_g\right) = \frac{k_g}{1+\exp \left( -k_s \left( x-x_0\right) \right) }, \end{aligned}$$
(10)

where \(k_s\) defines the steepness of the transition between 0 and \(k_g\). The parameter \(x_0\) defines the center point of the transition, i.e., \(\sigma \left( x, x_0, k_s, k_g\right) =k_g/2\) for \(x=x_0\). Based on the sigmoid function, the unit pulse function is defined as

$$\begin{aligned} \text {pulse}\left( x, k_s, k_g\right) = 4 k_g \sigma \left( x, x_0=0, k_s, k_g=1\right) \left( 1- \sigma \left( x, x_0=0, k_s, k_g=1\right) \right) . \end{aligned}$$
(11)
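Eqs. (10) and (11) translate directly into code; the steepness and gain values below are arbitrary illustrations:

```python
import numpy as np

def sigmoid(x, x0, ks, kg):
    """Eq. (10): smooth transition from 0 to kg, centered at x0 with steepness ks."""
    return kg / (1.0 + np.exp(-ks * (x - x0)))

def pulse(x, ks, kg):
    """Eq. (11): smooth unit pulse of height kg, centered at x = 0."""
    s = sigmoid(x, 0.0, ks, 1.0)
    return 4.0 * kg * s * (1.0 - s)

# The sigmoid passes through kg/2 at its center; the pulse peaks at height kg:
assert abs(sigmoid(0.0, 0.0, 10.0, 2.0) - 1.0) < 1e-12
assert abs(pulse(0.0, 10.0, 0.5) - 0.5) < 1e-12
```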

To satisfy the perception constraints, the MPC has to ensure that the centroid of the target object stays within the image plane of the camera as depicted in Fig. 4. The formulation herein does not force the MPC to keep the target in the center of the image plane and thus avoids unnecessary movements, keeping the AM more stable.

The object detection pipeline provides the position of the target object in world coordinates denoted as \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\), resp., in local camera frame \(\texttt{C}\) as \({{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}} = \left( {}^{\texttt{W}}\pmb {\MakeUppercase {T}}_{\texttt{B}} {}^{\texttt{B}}\pmb {\MakeUppercase {T}}_{\texttt{C}}\right) ^{-1} {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\), with \({}^{\mathtt {}}\pmb {\MakeUppercase {T}}_{\mathtt {}} \in SE(3)\) being the transformation matrix between the frames in super-, resp., subscript. Its coordinates \(\pmb {\MakeLowercase {s}}\) in the virtual image plane are then obtained with the help of the following relationship:

$$\begin{aligned} \pmb {\MakeLowercase {s}}\left( {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\right) = \begin{pmatrix} {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[x]} / {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} \\ {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[y]} / {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} \end{pmatrix}. \end{aligned}$$
(12)

Equation (12) has a singularity at \({{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} = 0\). In practice, this is rarely an issue since the distance from the camera to the target can never be zero (the RealSense D435 has a minimum sensing distance of 8 cm). Nevertheless, Eq. (12) is conditioned to eliminate the singularity at \({{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]}=0\) as follows:

$$\begin{aligned} \pmb {\MakeLowercase {s}}\left( {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\right)&= \begin{pmatrix} {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[x]} / z^* \\ {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[y]} / z^* \end{pmatrix}, \end{aligned}$$
(13a)
$$\begin{aligned} z^*&= {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} + s_p, \end{aligned}$$
(13b)

where \(s_p = \text {pulse}\left( x={{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]}, x_0=0, k_s=80, k_g\right)\) and therefore \(\lim _{{{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} \rightarrow \pmb {\MakeLowercase {0}}} z^* = k_g\), with \(k_g\) being a very small positive number.
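The conditioned projection of Eqs. (13a) and (13b) can be sketched as follows, with \(k_s=80\) as in the text and an assumed small value for \(k_g\):

```python
import numpy as np

def project(p_cam, ks=80.0, kg=1e-4):
    """Singularity-free image-plane projection, Eqs. (13a)-(13b).

    p_cam: target position in camera frame C. Near z = 0, the pulse term keeps
    the denominator at roughly kg instead of 0; away from z = 0 it vanishes.
    ks = 80 follows the text; kg = 1e-4 is an assumed 'very small positive number'.
    """
    x, y, z = p_cam
    s = 1.0 / (1.0 + np.exp(-ks * z))        # sigmoid of Eq. (10) with x0 = 0, kg = 1
    z_star = z + 4.0 * kg * s * (1.0 - s)    # Eq. (13b): z* = z + pulse(z)
    return np.array([x / z_star, y / z_star])

s_far = project(np.array([0.1, 0.05, 1.0]))   # far from singularity: ~ (x/z, y/z)
s_zero = project(np.array([0.1, 0.05, 0.0]))  # at z = 0 the projection stays finite
```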

The inequality constraints that keep \(\pmb {\MakeLowercase {s}}\) in the FoV of the camera (defined by \(\alpha _h\) and \(\alpha _v\)) are thus \(|{\pmb {\MakeLowercase {s}}_{[x]}}| < \tan {\frac{\alpha _h}{2}}\) and \(|{\pmb {\MakeLowercase {s}}_{[y]}}| < \tan {\frac{\alpha _v}{2}}\), or more conveniently expressed as an inequality that restricts \(\pmb {\MakeLowercase {s}}\) to lie within an ellipse centered around \(\pmb {\MakeLowercase {0}}\) with axes \(w=\tan \alpha _h\) and \(h=\tan \alpha _v\) such that

$$\begin{aligned} \frac{\pmb {\MakeLowercase {s}}_{[x]}^2}{(w/2)^2} + \frac{\pmb {\MakeLowercase {s}}_{[y]}^2}{(h/2)^2} < 1. \end{aligned}$$
(14)

By making use of the sigmoid logistic function, (14) becomes

$$\begin{aligned} c_{p,1}\left( {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\right) = \sigma \left( \frac{\pmb {\MakeLowercase {s}}_{[x]}^2}{(w/2)^2} + \frac{\pmb {\MakeLowercase {s}}_{[y]}^2}{(h/2)^2}, 1, k_{s1}, k_{g1} \right) . \end{aligned}$$
(15)

Eq. (15) has, however, two problems: first, it represents a double elliptical cone, i.e., the constraint is satisfied even if the target is behind the camera; second, \({\nabla c_{p,1}}\approx 0\) in regions where the constraint is violated, which makes recovering from constraint violations unlikely. The first problem is solved by adding a constraint that guarantees \({{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} > 0\) (target in front of the camera), i.e., \(c_{p,z} = \sigma \left( -{{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]}, 0, k_{s2}, k_{g2}\right)\), which extends (15) to become

$$\begin{aligned} c_{p,2}\left( {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\right) = c_{p,z} + c_{p,1} \left( 1-c_{p,z}\right) . \end{aligned}$$
(16)

Lastly, a quadratic term and a bias term are added to the formulation such that the gradient in non-compliant regions fulfills \({\nabla c_{p}} > 0\):

$$\begin{aligned} c_{p}\left( {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}\right) = c_{p,2} \left( 1+k_g \left\| { \begin{pmatrix} {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[x]} \\ {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[y]} \\ c_{p,z} {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[z]} \end{pmatrix} }\right\| ^2 + c_{p,z} \cdot \text {pulse}\left( \left( {{}^{\texttt{C}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{vis}}}_{[x]}\right) ^2, 0, k_{s3}, k_{g3}\right) \right) . \end{aligned}$$
(17)

The quadratic term includes the condition \(c_{p,z}\) such that the distance of the target from the camera in the positive z-direction is not penalized. The bias term, in the form of the pulse function, gives the optimizer an extra incentive to turn the UAV towards the target. The constraint function (17) is included as the perception cost function

$$\begin{aligned} J_P\left( \pmb {\MakeLowercase {x}}\right) = k_P \cdot c_{lock} \cdot c_p, \end{aligned}$$
(18)

in (8), where \(k_P\) is a positive weight and \(c_{lock} \in \left\{ 0,1\right\}\) handles the case where no target is detected and the perception cost should consequently be ignored. This also accounts for cases where the UAV loses sight of the visual target, e.g., due to an obstacle blocking the line of sight.
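To make the smoothing concrete, the following Python sketch evaluates (15)–(16) for a target point given in the camera frame. The logistic form of \(\sigma\), the projection step, and all steepness values are illustrative assumptions (the paper's \(\sigma\) carries an additional gain parameter \(k_g\), omitted here):

```python
import numpy as np

def sigmoid(x, x0, ks):
    # Assumed logistic form: smooth step rising from 0 to 1 around x0.
    # The paper's sigma carries an extra gain parameter k_g, omitted here.
    return 1.0 / (1.0 + np.exp(-ks * (x - x0)))

def perception_cost(p_cam, w=2.0, h=1.5, ks1=10.0, ks2=10.0):
    """Soft FoV cost following Eqs. (15)-(16): ~0 when the target lies
    inside the elliptical image cone AND in front of the camera."""
    x, y, z = p_cam
    s = np.array([x, y]) / max(z, 1e-6)  # projection onto the virtual image plane
    e = s[0]**2 / (w / 2)**2 + s[1]**2 / (h / 2)**2
    c_p1 = sigmoid(e, 1.0, ks1)          # ~1 outside the elliptical cone
    c_pz = sigmoid(-z, 0.0, ks2)         # ~1 behind the camera
    return c_pz + c_p1 * (1.0 - c_pz)    # Eq. (16)
```

A point centered in front of the camera yields a cost near zero, while points behind the camera or far outside the elliptical cone saturate at one.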

Cost function: tracking

During approach and departure, the AM is commanded to reach a given target position \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}}\) and yaw angle \(\psi _{ref}\) with its current position and orientation given by \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}\) and \(\psi\). Herein, a cost function is created that encourages the UAV to approach the commanded location for translation and rotation, starting with the latter.

The yaw angle error \(\psi _{err} = \psi _{ref} - \psi\) is not a useful quantity to feed into the OCP due to the discontinuity at \({0}^\circ\) resp. \({360}^\circ\). Instead, the orientation is encoded in the two-dimensional direction vector

$$\begin{aligned} \pmb {\MakeLowercase {q}}\left( \psi \right) = \begin{pmatrix} \cos \psi \\ \sin \psi \end{pmatrix}, \end{aligned}$$
(19)

with the orientation error being

$$\begin{aligned} {{}^{\mathtt {}}\pmb {\MakeLowercase {q}}^{\mathtt {}}_{\texttt{err}}}\left( \pmb {\MakeLowercase {x}}\right) = \frac{1}{2} \left( {{}^{\mathtt {}}\pmb {\MakeLowercase {q}}^{\mathtt {}}_{\texttt{ref}}}\left( \psi _{ref}\right) - \pmb {\MakeLowercase {q}}\left( \psi \right) \right) . \end{aligned}$$
(20)
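The benefit of this encoding can be verified numerically; the short Python check below (illustrative) compares the raw yaw difference with the encoded error just across the wraparound:

```python
import numpy as np

def q(psi):
    # Eq. (19): encode the yaw angle as a point on the unit circle
    return np.array([np.cos(psi), np.sin(psi)])

def q_err(psi_ref, psi):
    # Eq. (20): orientation error; its norm varies smoothly in [0, 1]
    return 0.5 * (q(psi_ref) - q(psi))

# The raw yaw error jumps by almost 360 deg across the wrap...
raw_err = np.degrees(np.radians(359.0) - np.radians(1.0))
# ...while the encoded error correctly reflects a ~2 deg misalignment.
enc_err = np.linalg.norm(q_err(np.radians(359.0), np.radians(1.0)))
```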

Likewise, the position error is defined as

$$\begin{aligned} {{}^{\mathtt {}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{err}}} = {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}} - {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}. \end{aligned}$$
(21)

Lastly, to have some control over the velocity at the start \(v_0\) and the end \(v_N\) of the horizon, the following quantity is defined

$$\begin{aligned} v_{ref}&= \left( 1-\alpha \right) v_0 + \alpha v_N, \end{aligned}$$
(22a)
$$\begin{aligned} v_{err}&= \left\| {{{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}}}\right\| - v_{ref}, \end{aligned}$$
(22b)
$$\begin{aligned} \alpha&= n/N, \end{aligned}$$
(22c)

where N is the number of stages in the horizon, and n is the current stage. The reference velocity varies linearly from \(v_0\) at the beginning of the moving horizon to \(v_N\) at the end to incentivize a gradual (linear) change of velocity and thus avoid an aggressive braking maneuver when approaching the commanded location.

The cost function \(J_T\) associated with position tracking is composed of (20), (21) and (22). To account for the varying objectives of that cost function, \(c_{lock}\) switches between tracking the reference yaw angle and tracking the targeted object via the perception constraint. The cost equation to include in (8) is therefore

$$\begin{aligned} J_T\left( \pmb {\MakeLowercase {x}}\right) = k_{T,1} \left\| {{{}^{\mathtt {}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{err}}}}\right\| ^2 + k_{T,2} v_{err}^2 + k_{T,3} \dot{\psi }^2 + \left( 1-c_{lock}\right) k_{T,4} \left\| {{{}^{\mathtt {}}\pmb {\MakeLowercase {q}}^{\mathtt {}}_{\texttt{err}}}}\right\| ^2, \end{aligned}$$
(23)

where \(k_{T,i}\) are positive weights.
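A minimal Python sketch of (20)–(23), with placeholder gains rather than the tuned values of Table 3, illustrates how the orientation term is gated by \(c_{lock}\):

```python
import numpy as np

def v_ref(n, N, v0, vN):
    # Eq. (22): reference speed blended linearly along the horizon
    a = n / N
    return (1 - a) * v0 + a * vN

def tracking_cost(p_err, v_body, psi_dot, q_err, c_lock, n, N,
                  v0, vN, k=(1.0, 1.0, 0.1, 1.0)):
    # Eq. (23): position, speed, yaw-rate, and (only when no target is
    # locked) orientation terms; gains k are illustrative placeholders
    v_err = np.linalg.norm(v_body) - v_ref(n, N, v0, vN)
    return (k[0] * np.dot(p_err, p_err) + k[1] * v_err**2
            + k[2] * psi_dot**2
            + (1 - c_lock) * k[3] * np.dot(q_err, q_err))
```

With \(c_{lock}=1\) the orientation error drops out of the cost entirely, leaving the perception cost in charge of the heading.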

Cost function: grasping

The grasping cost function takes the particularities of the UG into account; it guarantees that the gripper stays at a safe altitude from the ground as long as it is not located above the target, and it only allows the AM to descend while being in close vicinity. At the same time, if the UAV for some reason, e.g., a wind gust, gets dislocated from the target’s location, it is forced to climb again. For that reason, two constraint functions are defined. The altitude penalty constraint is defined as

$$\begin{aligned} c_4 = 1 - \sigma \left( z, z_{min}, k_{s4}, k_{g4}\right) , \end{aligned}$$
(24)

where \(z_{min}\) marks the safe minimal altitude the AM has to keep. However, as this would prevent the AM from descending to the payload, an additional term is needed based on the xy-position relative to the target, i.e.,

$$\begin{aligned} c_5 = 1-\text {pulse}\left( \left\| {{{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}}_{[xy]} - {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}_{[xy]}}\right\| ^2, k_{s5}, k_{g5} \right) , \end{aligned}$$
(25)

where \(k_s\) is chosen as a function of the radius r of the cone through which the UAV is funneled to the target. Herein \(k_s\left( r={0.1} \text{m}\right) =176.27\) was determined from solving the following equations:

$$\begin{aligned} x_{0.5}(k_s)&:=1-\text {pulse}\left( x, k_s, k_g=1\right) = 0.5 \end{aligned}$$
(26a)
$$\begin{aligned} k_s(r)&:=x_{0.5}(k_s) = r^2. \end{aligned}$$
(26b)

The pulse function has the particularity of \(\text {pulse}\left( x, k_s, k_g\right) =0\) at \(x=0\), resulting in no residual cost at \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}}_{xy} = {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}_{xy}\). The total cost associated with the grasping constraints is thus defined as

$$\begin{aligned} J_G\left( \pmb {\MakeLowercase {x}}\right) = k_{G,1} c_4 c_5 \left( 1 + k_{G,2}\left( z_{min}-z\right) ^2\right) + k_{G,3} (1-c_5) \left\| { {{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}} }\right\| ^2, \end{aligned}$$
(27)

where \(k_{G,i}\) are tuneable weights and \(k_{G,3} (1-c_5) \left\| { {{}^{\texttt{B}}\pmb {\MakeLowercase {v}}^{\mathtt {}}_{\mathtt {}}} }\right\| ^2\) penalizes any velocity close to the target.
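The interplay of (24)–(27) can be sketched as follows in Python. The sigmoid and pulse primitives are stand-ins (a simple exponential bump replaces the paper's pulse function, which additionally vanishes exactly at x = 0), and all gains are illustrative:

```python
import numpy as np

def sigmoid(x, x0, ks):
    # Assumed logistic form of the paper's sigma (gain omitted)
    return 1.0 / (1.0 + np.exp(-ks * (x - x0)))

def pulse(x, ks):
    # Stand-in bump: ~1 near x = 0, decaying to 0. The paper's pulse
    # function additionally vanishes exactly at x = 0 (simplified here).
    return np.exp(-ks * x)

def grasping_cost(p, p_ref, v_body, z_min=0.5, kG=(1.0, 1.0, 1.0),
                  ks4=20.0, ks5=50.0):
    # Eqs. (24)-(27): penalize low altitude unless above the target,
    # and penalize residual speed once hovering over it.
    z = p[2]
    d2 = np.sum((p_ref[:2] - p[:2])**2)
    c4 = 1.0 - sigmoid(z, z_min, ks4)    # ~1 below the safe altitude
    c5 = 1.0 - pulse(d2, ks5)            # ~0 directly above the target
    return (kG[0] * c4 * c5 * (1 + kG[1] * (z_min - z)**2)
            + kG[2] * (1 - c5) * np.dot(v_body, v_body))
```

Far from the target the altitude term dominates and forbids descending below \(z_{min}\); directly above the target it vanishes and only residual velocity is penalized.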

Cost function: repulsion

In many applications, it is useful to define areas that should be avoided during flight, e.g., pedestrians, lamp posts, walls or cars. Herein, it is assumed that those areas can be approximated with one or multiple ellipsoids. The obstacles are assumed to be static, localized, and impossible to overfly (for safety resp. physical reasons), thus reducing the problem to two dimensions in the xy-plane.

On that account, an axis-aligned ellipse is defined, located at the world position \(\left( o_x, o_y\right)\) with its minor and major axes given by \(\left( o_{sx}, o_{sy}\right)\). A point \(\left( p_x, p_y\right)\) lies inside the ellipse if

$$\begin{aligned} \frac{\left( p_x - o_x\right) ^2}{\left( o_{sx}/2\right) ^2} + \frac{\left( p_y - o_y\right) ^2}{\left( o_{sy}/2\right) ^2} < 1. \end{aligned}$$
(28)

Reformulating that expression using the sigmoid logistic function yields

$$\begin{aligned} c_R = 1 - \sigma \left( \frac{\Delta p_x^2}{\left( o_{sx}/2\right) ^2} + \frac{\Delta p_y^2}{\left( o_{sy}/2\right) ^2}, 1, k_{s6}, k_{g6}\right) . \end{aligned}$$
(29)

The repulsion cost function for a single area of repulsion i is thus

$$\begin{aligned} J_{R,i} = c_{R,i} \cdot \sum \left( \begin{pmatrix} o_{sx,i}^2 \\ o_{sy,i}^2 \end{pmatrix} - \begin{pmatrix} \Delta p_{x,i}^2 \\ \Delta p_{y,i}^2 \end{pmatrix} \right) , \end{aligned}$$
(30)

where a quadratic term was added to improve the gradient in the non-compliant region.

The total repulsion cost over all areas is therefore

$$\begin{aligned} J_R\left( \pmb {\MakeLowercase {x}}\right) = k_R \sum _i^{N_o} J_{R,i}. \end{aligned}$$
(31)

Herein (31) is included as a soft constraint, rather than as a hard constraint. This has the advantage of being computationally cheaper but may lead to collisions if the repulsion cost term is dominated by other cost functions.

An environment may contain hundreds of obstacles, and including each of them in (5) makes solving the OCP computationally very expensive. However, there are generally only a handful of obstacles near the UAV that must be considered. Therefore, by feeding only the momentarily closest obstacles into the OCP’s parameter vector, a large environment with many obstacles can be handled while \(N_o\) remains reasonably small (here, \(N_o=2\)). Furthermore, the AM’s body size can be accounted for by simply expanding the ellipse of the obstacle by the radius \(r_B\) of the body, i.e., \(\left( \hat{o}_{sx}, \hat{o}_{sy}\right) = \left( o_{sx} + r_B, o_{sy} + r_B\right)\). Finally, an obstacle with a non-ellipsoidal shape can be approximated by placing several ellipses within its contour.
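The closest-obstacle heuristic and the ellipse inflation can be sketched together in Python (illustrative gains; the center distance is used as a proxy for the true point-to-ellipse distance):

```python
import numpy as np

def repulsion_cost(p, obstacles, kR=1.0, ks6=10.0, N_o=2, r_B=0.3):
    """Eqs. (29)-(31) with the closest-obstacle heuristic: only the N_o
    nearest ellipses (inflated by the body radius r_B) enter the cost.
    obstacles: list of (ox, oy, osx, osy) tuples."""
    def sigmoid(x, x0, ks):
        return 1.0 / (1.0 + np.exp(-ks * (x - x0)))
    # sort by squared distance of the UAV to each ellipse center
    # (a cheap proxy for the true point-to-ellipse distance)
    obstacles = sorted(obstacles,
                       key=lambda o: (p[0] - o[0])**2 + (p[1] - o[1])**2)[:N_o]
    J = 0.0
    for ox, oy, osx, osy in obstacles:
        osx, osy = osx + r_B, osy + r_B          # inflate by the body radius
        dx, dy = p[0] - ox, p[1] - oy
        e = dx**2 / (osx / 2)**2 + dy**2 / (osy / 2)**2
        cR = 1.0 - sigmoid(e, 1.0, ks6)          # ~1 inside the ellipse
        J += cR * ((osx**2 - dx**2) + (osy**2 - dy**2))  # Eq. (30)
    return kR * J
```

The quadratic factor keeps the gradient alive deep inside the non-compliant region, while the sigmoid makes the cost vanish smoothly outside the inflated ellipse.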

Force control

For optimal operation, the UG requires the activation force to be controlled, i.e., kept constant, over the grasping interval. More precisely, the force should not exceed a certain level as it may cause damage to the gripper (or to the payload), and it must also not drop below a certain threshold as this severely degrades the grasping performance, as discussed in our previous work34.

The fundamental problem is thus to control the contact force, as was shown in51 and52 for bridge inspection and in53 in the context of aerial writing. The noticeable difference here is the strong nonlinearity of the elastic element and the shrinkage of the membrane, which changes the stiffness of the system during operation and may also cause a complete loss of contact. The cascading force control approach followed herein is inspired by54 and55. Since the contact force measured by the load-cell and the position of the UAV are linked, the contact force tracking problem can be seen as a position tracking problem where the reference position is a function of the force tracking error. Motivated by this observation, the cascading control architecture shown in Fig. 3 is developed.

Problem formulation

The physical properties and grasping performance of the UG were rigorously analyzed in34 with the main conclusion being that the system is highly non-linear and state-dependent. In particular, the stiffness of the elastic element changes from very soft to rigid-like.

The equivalent model of the AM, shown in Fig. 5, consists of a mass-spring-damper system, where the stiffness k is non-linear, compression-only and dependent on the gripper’s state \(\beta\) and the penetration depth z. The unknown damping d captures various effects such as the friction between the filler particles and air resistance. The mass m is the total mass of the AM, with \(u_T\) being the cumulative thrust applied by the aircraft’s propulsion system.

Figure 5
figure 5

The UG in contact with the payload is modeled as a mass-spring-damper system, where the spring force is non-linear with respect to the penetration depth z and further depends on the UG state \(\beta\). The contact force is measured by a load-cell between the UG’s membrane and the rest of the AM (represented by m). The damping d represents the friction of the grains in the filler as well as other effects such as air resistance.

The dynamics of the system are modeled according to the following set of equations

$$\begin{aligned} \ddot{z}&= \frac{1}{m} \left[ -f_{kd}\left( z, \dot{z}\right) \vert _{z>0} + m g_0 - u_T \right] , \end{aligned}$$
(32a)
$$\begin{aligned} z\left( 0\right)&= 0, \end{aligned}$$
(32b)
$$\begin{aligned} \dot{z}\left( 0\right)&= v_0, \hspace{0.1cm} v_0 > 0, \end{aligned}$$
(32c)
$$\begin{aligned} f_{kd}\left( z, \dot{z}\right)&= f_k\left( z\right) + f_d\left( \dot{z}\right) , \end{aligned}$$
(32d)

where \(u_T\) is the input (applied thrust) to the system, the states \(\dot{z}\) and z are measurable, and \(f_d\left( \dot{z}\right)\) and \(f_k\left( z\right)\) are smooth, positive definite functions. It is assumed that the system has an initial downward velocity \(\dot{z}\left( 0\right) =v_0\). The initial position is defined as \(z(0)=0\), which marks the position at which the gripper touches the ground. Assuming ground contact, the load-cell measures the contact force \(f_m\) at 100 Hz as

$$\begin{aligned} f_m = f_{kd}\left( z, v_z\right) . \end{aligned}$$
(33)

Force tracking

The activation force is indirectly tracked by controlling the position (altitude) of the drone. The control solution thus consists of a force tracker that feeds into a robust sliding mode position tracker (Fig. 3) as explained in the following.

Sliding mode position tracker

The position tracker has been realized with a classic sliding mode control (SMC) method56,57. The system can be treated as uni-axial (altitude only) since the UAV hovers (stationary in the xy-plane, held in position mostly by friction) during the grasp. The non-linear uni-axial system is considered as

$$\begin{aligned} \dot{v}_z = f\left( \pmb {\MakeLowercase {x}}\right) + g\left( \pmb {\MakeLowercase {x}}\right) u\left( t\right) + \vartheta \left( t\right) , \end{aligned}$$
(34)

where \(f\left( \cdot \right)\) and \(g\left( \cdot \right)\) are non-linear smooth functions and \(\pmb {\MakeLowercase {x}}\) is the state vector defined in (4). The unknown disturbance is assumed to be bounded, \(|{\vartheta }| < D\). The sliding surface is defined as

$$\begin{aligned} s&= \dot{e_z} - c e_z, \end{aligned}$$
(35a)
$$\begin{aligned} \dot{s}&= \ddot{e_z} - c \dot{e_z}, \end{aligned}$$
(35b)

where \(c>0\) is a design parameter and \(e_z=z_r-z\) is the tracking error. Assuming a constant rate reaching-law

$$\begin{aligned} \dot{s} = -\eta \cdot \text {sign}\left( s\right) \text {, } \eta > 0, \end{aligned}$$
(36)

the input can be defined as

$$\begin{aligned} u_{T}=\frac{1}{g} \left( -\eta \cdot \text {sign}\left( s\right) - \ddot{z}_r + f + c\dot{e_z} \right) , \end{aligned}$$
(37)

which stabilizes the system under the condition that \(\eta > D\)56. To reduce chatter, the sign function is replaced by the continuous saturation function56, as

$$\begin{aligned} u_{T}=\frac{1}{g} \left( -\eta \cdot \text {sat}\left( s\right) - \ddot{z}_r + f + c\dot{e_z} \right) . \end{aligned}$$
(38)

With the UAV in hover-position, the uni-axial dynamics can be stated as

$$\begin{aligned} f_z = {\left\{ \begin{array}{ll} \dot{v}_z &{}= f\left( \pmb {\MakeLowercase {x}}\right) + g u\left( t\right) + \vartheta \left( t\right) , \\ \dot{z} &{}= v_z, \end{array}\right. } \end{aligned}$$
(39)

with \(f\left( \pmb {\MakeLowercase {x}}\right) = \frac{1}{m} \left( f_m - g_0 \right) ,\) and \(g = \frac{1}{m}.\) Considering the sliding surface as defined in (35a) and the reaching law in (36), the control output \(u_T\) of the tracker is defined as

$$\begin{aligned} u_{T}\left( t\right) = -f_m + m \left( g_0 - c\dot{e_z} - \eta \cdot \text {sat}\left( s\right) \right) . \end{aligned}$$
(40)

Force tracker

The force applied by the UG to the payload, as measured by the sensor, depends directly on the system states z and \(\dot{z}\). Therefore, the force can be regulated by commanding and tracking a specific position (altitude) \(z_r\). To that end, the force tracker is defined as

$$\begin{aligned} z_r&= k_p e_f + w_{e_f}, \end{aligned}$$
(41a)
$$\begin{aligned} \dot{w}_{e_f}&= k_i e_f, \end{aligned}$$
(41b)

where \(k_p\) and \(k_i\) are positive design variables. The gripper is closed once the system has reached steady-state, i.e., \(|{e_f}| < c_{e1}\) and \(|{\dot{e}_f}| < c_{e2}\) with \(c_{e1}\), \(c_{e2}\) being small positive values. Closing the gripper triggers its state transition from soft to rigid. The complete control architecture is depicted in Fig. 3. The simulated system response for the closed-loop system with an initial position \(z\left( 0\right) ={0}\,\hbox {m}\) and velocity of \(\dot{z}\left( 0\right) = {0.1}\,\hbox {ms}^{-1}\) is shown in Fig. 6. The design variables were experimentally chosen as stated in Table 2.

Table 2 The empirically tuned constants of the SMC.

Remark 2

Herein z is also synonymous with the UAV’s altitude relative to the position at which the gripper first came into contact with the payload, i.e., the point where \(f_m>\delta\) and \(\dot{f}_m>0\) is measured (\(\delta\) being a small threshold value). By definition, the initial position z(0) is thus zero. An initial velocity \(\dot{z}(0)>0\) is required for the AM to engage with the payload (commanded by the MPC).

The gripper controller sends the close command around the \(t={4}\,\hbox {s}\) mark as the closed-loop system reaches steady-state. This triggers the UG’s state transition from soft to rigid, during which the membrane shrinks; consequently, the UAV has to descend to compensate for the diminishing contact force.
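The closed-loop behavior of Fig. 6 can be reproduced qualitatively with the Python sketch below, which combines the contact dynamics (32), the SMC output (40), and the PI force tracker (41). A linear spring-damper stands in for the UG's nonlinear contact model, the conventionally signed surface \(s=\dot{e}_z + c\,e_z\) is used (under which (40) recovers the reaching law), and all numerical values are illustrative rather than the tuned constants of Table 2:

```python
import numpy as np

def simulate_force_tracking(f_ref=2.0, m=2.0, k=200.0, d=5.0, g0=9.81,
                            kp=0.01, ki=0.02, c=5.0, eta=5.0,
                            dt=0.001, T=10.0):
    """Cascaded force control (Fig. 3): an outer PI force tracker,
    Eq. (41), commands a penetration depth z_r to an inner SMC, Eq. (40).
    z is the penetration depth (positive down), vz its rate."""
    z, vz, w_ef = 0.0, 0.1, 0.0             # initial depth, downward speed
    for _ in range(int(T / dt)):
        f_m = max(k * z + d * vz, 0.0)      # measured contact force, Eq. (33)
        e_f = f_ref - f_m                   # force tracking error
        w_ef += ki * e_f * dt               # PI integrator, Eq. (41b)
        z_r = kp * e_f + w_ef               # commanded depth, Eq. (41a)
        e_z = z_r - z
        s = -vz + c * e_z                   # sliding surface (z_r ~ const)
        sat_s = np.clip(s, -1.0, 1.0)       # saturation against chatter
        u_T = -f_m + m * (g0 - c * (-vz) - eta * sat_s)  # Eq. (40)
        a = (-f_m + m * g0 - u_T) / m       # contact dynamics, Eq. (32a)
        vz += a * dt                        # forward Euler integration
        z += vz * dt
    return f_m
```

Despite the crude contact model, the measured force settles at the reference, mirroring the steady-state condition that triggers the gripper closure.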

Figure 6
figure 6

Simulated closed-loop response of the force controller. The gripper is closed once the system reaches steady-state (around the 4 s mark). The chattering on the input \(u\left( t\right)\) is effectively mitigated by the saturation function.

Numerical validation and virtual experiment

Numerical validation

The objective of this section is to verify that the developed cost functions produce the desired outcomes and behaviors. To that aim the developed cost functions are included in (8) as the sum of the individual cost functions for perception (18), position tracking (23), grasping (27), repulsion (31), and an additional term \(J_U\) that penalizes for energy-inefficient trajectories:

$$\begin{aligned} J\left( \pmb {\MakeLowercase {x}}, \pmb {\MakeLowercase {u}}\right) = J_P\left( \pmb {\MakeLowercase {x}}\right) + J_T\left( \pmb {\MakeLowercase {x}}\right) + J_G\left( \pmb {\MakeLowercase {x}}\right) + J_R\left( \pmb {\MakeLowercase {x}}\right) + J_U\left( \pmb {\MakeLowercase {u}}\right) . \end{aligned}$$
(42)

The quadratic control effort penalty is defined as

$$\begin{aligned} J_U\left( \pmb {\MakeLowercase {u}}\right) = \pmb {\MakeLowercase {u}}^T \pmb {\MakeUppercase {Q}}_u \pmb {\MakeLowercase {u}}. \end{aligned}$$
(43)

A potential issue can arise as a consequence of using soft constraints on the objectives (18), (27) and (31) where any non-bounded cost in (42) can dominate all other objectives, which, ultimately, leads to a violation of the soft constraints (in particular the repulsion constraint). Herein, the position tracking cost (23) represents such a non-bounded cost. Therefore, our implementation includes a carrot-chasing law for position tracking that feeds a virtual target location \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref,virt}}}\) to the MPC that cannot exceed a certain distance from the UAV’s current position. This effectively means that (23) is bounded, regardless of the actual distance to the target. The virtual target displacement \(\pmb {\MakeLowercase {w}}\) is defined as follows

$$\begin{aligned} \pmb {\MakeLowercase {w}}&= {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref,virt}}} - {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\mathtt {}}}, \end{aligned}$$
(44a)
$$\begin{aligned} \left\| {\pmb {\MakeLowercase {w}}}\right\|&\le w_{max}, \end{aligned}$$
(44b)
$$\begin{aligned} w_{max}&= N \cdot t_s \cdot v_{max}. \end{aligned}$$
(44c)

where \(w_{max}\) represents the maximum distance that can be covered during the moving horizon at speed \(v_{max}\), N is the number of stages in the horizon, and \(t_s\) is the time step. The virtual target point \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref,virt}}}\) is fed to the MPC as \({{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref}}} = {{}^{\texttt{W}}\pmb {\MakeLowercase {p}}^{\mathtt {}}_{\texttt{ref,virt}}}\) with the reference velocities \(v_0\), and \(v_N\) defined as follows

$$\begin{aligned} v_0&= \alpha \cdot v_{max}, \end{aligned}$$
(45)
$$\begin{aligned} v_N&= {\left\{ \begin{array}{ll} 0 &{}\text {if } \left\| {\pmb {\MakeLowercase {w}}}\right\| < w_{max}, \\ v_{max} &{}\text {otherwise}, \end{array}\right. } \end{aligned}$$
(46)
$$\begin{aligned} \alpha&= \left\| {\pmb {\MakeLowercase {w}}}\right\| / w_{max} \in \left[ 0;1\right] . \end{aligned}$$
(47)

The UAV is thus encouraged to travel at maximum speed if the virtual target is located at a distance \(\left\| {\pmb {\MakeLowercase {w}}}\right\| \ge w_{max}\), i.e., the target cannot be reached within the horizon. Once the target gets closer (\(\left\| {\pmb {\MakeLowercase {w}}}\right\| < w_{max}\)), the UAV has to slow down and is incentivized to reach zero velocity at the end of the moving horizon. The initial reference velocity \(v_0\) is gradually reduced as the robot gets closer to the target point.
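The carrot-chasing law (44)–(47) reduces to a few lines; the sketch below is a direct transcription using the paper's \(N=26\) and \(t_s={0.1}\,\hbox {s}\) as defaults:

```python
import numpy as np

def virtual_target(p, p_ref, v_max=1.0, N=26, t_s=0.1):
    """Carrot-chasing reference, Eqs. (44)-(47): clamp the commanded
    target to a distance reachable within the horizon and derive the
    boundary reference speeds v0, vN."""
    w_max = N * t_s * v_max                 # Eq. (44c)
    w = p_ref - p                           # Eq. (44a)
    dist = np.linalg.norm(w)
    if dist >= w_max:                       # Eq. (44b): clamp displacement
        w = w / dist * w_max
        alpha, vN = 1.0, v_max              # Eqs. (46), (47)
    else:
        alpha, vN = dist / w_max, 0.0
    v0 = alpha * v_max                      # Eq. (45)
    return p + w, v0, vN
```

A distant target thus yields full speed across the horizon, while a nearby one commands a linear deceleration to standstill.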

The developed MPC is validated numerically in three different scenarios, testing the perception, tracking, grasping, and obstacle avoidance components. The implementation uses the Optimal Control Problem (OCP) described in (8) within the OpEn58 framework that also manages code generation and auto-differentiation using CasADi59, and it employs the fast PANOC solver for embedded applications.

As indicated in60, the prediction horizon must be chosen carefully such that a change to the input \(\pmb {\MakeLowercase {u}}\) has a chance to realize an effect on the system states. Herein, the duration of the moving horizon is therefore based on the slowest subsystem of (6), which has the time constant \(\tau = {0.54}\,\hbox {s}\) (see Table 1). Since a first order system reaches steady-state after \(5\tau\), and given an adequate sampling interval of \(T_s = {0.1}\,\hbox {s}\), the number of stages can be calculated as

$$\begin{aligned} N = \frac{5\tau }{T_s} = 26. \end{aligned}$$
(48)

The simulation performed inside MATLAB applies the first set of inputs from the MPC to the model defined in (6) and updates the system states using the forward Euler integration scheme. The empirically tuned weights of the MPC used throughout the simulated scenarios are as indicated in Table 3.

Table 3 The empirically tuned gains of the MPC.

Scenario 1 (perception, inside FoV)

In this scenario, the UAV starts with its initial state set to \(\left( {2}\,\hbox {m},-{2}\,\hbox {m},{1}\,\hbox {m},{225}^{\circ }\right)\) and with the visual target located at \(\left( 0,0,0\right) \hbox {m}\). The commanded trajectory is a circle in the xy-plane centered around the origin with a radius of \({2}\,\hbox {m}\). The robot starts with the visual target in view, but due to the circular trajectory, it has to continuously adjust its heading to keep up with the target. The soft constraint concerning the vertical location of the target in the virtual image plane allows the UAV to keep the commanded altitude even though the target is not at the center of the frame. Furthermore, an obstacle, defined as \(\pmb {\MakeLowercase {o}}_1 = \left( 0,2,2,1\right) \,\hbox {m}\), was added. Due to its repulsion zone, the robot leaves its circular reference trajectory to avoid the obstacle. The 3D trajectory taken by the robot and the corresponding graphs are shown in Fig. 7.

Scenario 2 (perception, outside FoV)

This scenario is identical to scenario 1, except that the UAV starts with the camera pointing away from the target, i.e., with starting position \(\left( {2}\,\hbox {m},-{2}\,\hbox {m},{1}\,\hbox {m},{45}^{\circ }\right)\), which represents a worst-case scenario. Without the bias term in the perception cost, the optimization tends to converge to a local minimum in which the camera keeps pointing away from the visual target. But with the formulation in (18), as shown in Fig. 8, the robot does manage to orient itself correctly towards the target.

Scenario 3 (grasping approach)

In this scenario, the UAV starts with its initial state set to \(\left( {2}\,\hbox {m},{0}\,\hbox {m},{2}\,\hbox {m},{-90}^{\circ }\right)\) and has the positional and visual target set to \(\left( -{1.5}\,\hbox {m},{0}\,\hbox {m},{0}\,\hbox {m},{45}^{\circ }\right)\). Since the target is in the FoV, the MPC ignores the desired orientation and instead ensures that the target stays in the FoV. A safety distance of 0.5 m was defined, which keeps the robot at a safe distance from the ground and only allows it to descend when close to the target (in the xy-plane). At the same time, the perception constraint is dropped close to the target to avoid over-constraining the problem. The scenario is successfully executed as depicted in Fig. 9.

Figure 7
figure 7

Scenario 1: The UAV starts with the visual target in the FoV of the camera. A circular trajectory is commanded to the UAV, which in turn keeps the visual target in the FoV and simultaneously avoids collision with the obstacle placed in the path. The \(+x\) direction of the drone is represented by the red segment and the \(+z\) direction of the camera is shown with a green arrow.

Figure 8
figure 8

Scenario 2: The UAV starts with the visual target outside of the FoV of the camera and consequently has to perform almost a 180\(^{\circ }\) turn to regain vision on the target, which represents a worst-case scenario to the optimizer.

Figure 9
figure 9

Scenario 3: The UAV approaches the target as if it would pick it up. It is not allowed to drop below 1 m until it is relatively close to the target in xy-plane and only then is the constraint lifted. At the same time, the perception constraint is faded out such that it does not over-constrain the problem.

Virtual experiment

Object detection pipeline

Visual servoing is an essential part of autonomous grasping as it guides the AM towards the payload using sensor feedback. By doing so, it relies heavily on the information provided by the visual sensors (e.g., cameras) as well as the processing chain used to extract various pieces of information such as position, orientation, size, or material.

For this application, the simulated UAV is equipped with the stereo depth camera RealSense D435 that provides an RGB image with an associated depth channel (see Fig. 4). Blob detection is applied to the RGB image to detect objects based on their color and size. Using a pinhole camera model, the centroid of the detected blob is deprojected to a 3D point with the help of the depth information. Note that since the orientation of the payload is not required for the UG to operate, the vision pipeline can be kept lean and efficient, ideal for low-power embedded hardware.
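Assuming a pinhole model with illustrative intrinsics (not a D435 calibration), the deprojection step can be sketched as:

```python
import numpy as np

def deproject(u, v, depth, fx=615.0, fy=615.0, cx=320.0, cy=240.0):
    """Pinhole deprojection of a detected blob centroid (u, v) and its
    depth reading into a 3D point in the camera frame. The intrinsics
    fx, fy, cx, cy are illustrative placeholder values."""
    return np.array([(u - cx) * depth / fx,
                     (v - cy) * depth / fy,
                     depth])
```

A centroid at the principal point maps onto the camera's optical axis at the measured depth, which is all the lean pipeline needs since the payload's orientation is irrelevant to the UG.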

Autonomous grasping in simulation

In this section, the components developed throughout this work are applied in a model scenario whose goal is to pick up and relocate a grasping object. The scenario is set up in Gazebo, a robotics simulator that performs a realistic simulation of the flight dynamics and sensors as well as the autopilot of a UAV inside a virtual environment.

Figure 10
figure 10

Virtual experiment; (a) The AM setup inside the Gazebo simulation consists of a hexacopter carrying the simulated UG and a stereo-vision camera (RealSense D435) for the localization of the payload in three dimensions; (b) The parking lot scenario in Gazebo. The payload was identified between the pickup truck and a pillar in a parking lot. The drone equipped with the UG is located at the starting position ’S’. It enters the parking via the entrance avoiding the walls and pillars. After establishing a successful grasp, the AM then transports the cylindrical object to the dropoff site; (c) The obstacles (walls, pillars, cars) are modeled as ellipses. The path taken by the UAV is traced in teal, and the path taken by the object is shown in green. The object is picked up and placed inside in the dropoff area ’D’ before the UAV returns and lands at the location ’S’.

The AM used throughout this work is depicted in Fig. 10a and consists of a hexacopter carrying the simulated UG as developed in the subsection on UAV modeling. It also features a stereo-vision camera that is used to localize the payload. A modified PX461 serves as the autopilot for the drone and handles communication with the UG via a custom interface module. The custom module further exposes the state information of the gripper to the outside world via MAVLink62 and consequently to ROS63 via a custom MAVROS plugin. Furthermore, the commands from ROS are forwarded to the gripper, making it fully integrated into the ROS ecosystem despite being directly connected to the Pixhawk autopilot.

The complete system architecture is depicted in Fig. 3. The mission planner pulls in all relevant information from ROS and provides the autopilot and gripper with the relevant commands based on the state of the mission. The intention is that the operator provides a rough description of the scenario (environment, obstacles, location of the payload), and, based on that, the mission planner coordinates all the components to ensure the successful execution of the mission. The payload’s initial position can be a rough estimate as it gets updated automatically once it enters the FoV of the camera. In addition to the mission planner, a computer vision ROS node handles object detection based on the color and shape (blob detection) and depth images and exposes the detected 3D coordinates of the payload to the mission planner.

To stay within the scope of this paper, the MPC is made aware of potential obstacles by manually specifying their bounds before launching the simulation. For real applications, online visual SLAM can be used to create a map of the environment in real-time, e.g.,64, and to consequently generate and feed a list of obstacles to the MPC. As described in the subsection on the repulsion cost function, the list of obstacles is sorted by their distance to the UAV (shortest distance to the ellipse), and only the closest \(N_o\) obstacles are fed into the MPC’s parameter vector (5).

Pick and place scenario

A grasping object has been identified in a parking garage close to a car. The goal of this simulation is to pick up the approximately localized object (\(\pm {0.5}\,\hbox {m}\)) and transport it to a dropoff site. The AM uses its onboard camera to further refine the localization of the object during the approach. The MPC handles the trajectory generation throughout the simulation and thus guides the UAV to the object respecting the FoV of the camera and vertical descent imposed by the gripper, while avoiding any obstacles such as the pillars, walls, and cars in the parking garage. Having reached the object, the control scheme is changed to force control via \(\zeta\), where a nominal activation force of 2 N is tracked. After reaching steady-state on the exerted force, the simulated UG is closed, and the object is grasped successfully. The MPC now takes the AM to the dropoff site where the object is released. Finally, the AM returns to its starting position, where it lands again on the UG. The scenario is visualized in Fig. 10b.

Results

The path taken by the UAV is visualized in Fig. 10c alongside the obstacles known to the MPC. The xy-position and altitude are plotted separately in Fig. 13. The UAV successfully avoids all obstacles on its way and gracefully approaches the target such that it can be grasped by the gripper. The robot then moves to the dropoff area, again avoiding all obstacles along its path. The MPC commands velocities (x, y, z, and yaw) at 100 Hz that are tracked via a set of cascading PID controllers for the roll, pitch, and yaw axes, resp. the velocities in x and y. The aforementioned SMC tracks the commanded velocity in z. The commanded and actual velocities are visualized in Fig. 11. It can be seen that the commanded velocity in the z-direction approaches a very gentle − 0.06 \(\text{ms}^{-1}\), which allows for a relatively smooth transition to the force-control scheme, shown in detail in Fig. 12 between the 35 s and 43 s mark. A nominal force of 2 N is tracked by the SMC during the grasping interval. Once the activation force reaches steady-state, the gripper is closed at the 37 s mark. After having acquired a secure grasp, the UAV takes off (Fig. 13, 43 s mark). Consequently, the gripper’s load cell registers the weight of the payload (200 g), which is fed back to the SMC, making it aware of the changed mass of the system. A comprehensive video detailing the entire experiment, showcasing the UAV’s path, obstacle avoidance, approach to the object, and the grasping process, is available for a more nuanced understanding and visualization of the results (see Movie S1 in the Supplementary Materials).

Figure 11
figure 11

The MPC commands velocities in x, y, z, and yaw directions, which are tracked by the low-level controllers, including the SMC for altitude. The payload enters the field of view of the camera at the 30 s mark, where the MPC takes corrective actions such that the payload stays in view. The UAV carefully approaches the payload at the 37 s mark and departs towards the drop zone at the end of the grasping phase. After dropping the payload, the UAV lands back on the gripper at the start location.

Figure 12
figure 12

A nominal force of 2 N is tracked during the contact period between 35 s and 43 s. The gripper is closed after reaching steady state at \(t={37}\,\hbox {s}\). After takeoff, the force sensor is used to estimate the weight of the payload, here 200 g, i.e., − 1.95 N. At the 78 s mark, the payload is dropped in the drop-off area. The state variable \(\beta\) represents the internal state of the gripper (1 for fluidized, 0 for stiff).

Figure 13
figure 13

The xy-position and yaw angle (top) and the altitude (bottom). The visual constraints cause the altitude to first rise before sharply decreasing as the UAV comes very close to the payload and the visual constraint is phased out. Likewise, a small adjustment in yaw can be observed at that point (29 s mark).

Discussion

The primary findings of this study underscore the positive synergistic effects between mechanical intelligence and state-of-the-art control techniques for AMs. Thanks to the passive intelligence built into the mechanics of the UG, the control system of the AM can be kept relatively straightforward, a net benefit over existing solutions, which often require complicated and expensive algorithms to determine the payload's orientation and calculate the optimal angle of approach.

Leveraging the principles of granular jamming, the UG showcased adeptness in handling various objects, irrespective of their geometrical intricacies34. The trajectory optimization, when coupled with trajectory tracking, furnishes the UAV with the capability to achieve optimal flight paths in real-time. This combination of real-time trajectory optimization and force control paves the way for increased autonomy, enabling AMs to operate efficiently without constant human intervention.

The current research landscape, while burgeoning with innovations, often leans towards compensating for shortcomings, e.g., pin-point accuracy requirements, by enhancing the UAV's agility and precision. Although viable, this approach adds substantial complexity and cost, and it is diametrically opposed to our proposed solution, which shifts the complexity back to the mechanical system of the gripper, where it is handled by the passive intelligence baked into the jamming membrane. Our approach offers an integrated solution that synergizes the strengths of both the mechanical system and the control system to address the multifaceted challenges of aerial grasping.

This research opens promising avenues for a broad spectrum of sectors. With advancements in aerial grasping capabilities, industries demanding precise material handling in remote or elevated locations, such as construction or agriculture, stand to benefit immensely. The potential of the research is particularly striking in scenarios like disaster response, where the urgency to operate efficiently in challenging terrains is paramount. Moreover, the findings could catalyze a more extensive deployment of UAVs in diverse areas, including environmental monitoring and urban delivery tasks. On the design front, the insights from this study could guide the evolution of future UAV designs, optimizing them for more general aerial grasping tasks.

The UG’s design mainly favors grasping objects from the floor, which makes alternate orientations such as horizontal grasping challenging, as the granular material would accumulate unevenly around the object. Although this allows for simple grasping strategies, it limits the use of the AM to cases with enough space for the drone’s body not to collide with the surroundings. We believe this issue can be addressed in part through design changes to the membrane; on the control side, inspiration can be taken from various research articles dealing with building inspection requiring physical contact65.

Conclusion

In our previous work, we developed a universal jamming gripper for UAVs, demonstrating its grasping capabilities in real experiments. Although the grasping procedure requires neither precise positioning nor contact force control (major advantages of our design), the requirement for an operator in the loop was still detrimental to its reliability, as controlling several parameters simultaneously quickly overburdened the pilots. This study thus marks the first step towards automated aerial grasping with UGs by taking the human out of the loop. Based on the experimental results of our previous work, a simulation model of a UG was developed, upon which an adequate control strategy was built, consisting of a force tracker and an optimizing, constraint-aware trajectory generator. The control strategy was validated using numerical simulations and virtual experiments.

Future work consists of deploying and testing the developed solution on real hardware. Furthermore, the current blob-detection-based vision pipeline is too simplistic to cope with real-world objects, and we believe the system could benefit from modern learning-based methods. This could also enable the AM to detect suitable landing locations and thus capitalize on the AM’s ability to land or rest directly on the gripper even in unconventional places, e.g., on branches or posts. Prop wash is still likely to dislocate lightweight objects on approach, although the trajectory taken by the AM reduces its effects. A solution to this could entail further mechanical changes to the AM, e.g., increasing the distance between the gripper and the UAV, or adding some form of shielding against the airflow. Moreover, global trajectory planning (pathfinding) is currently missing from our solution, limiting the AM’s operation space. Lastly, the AM could be a good candidate for inclusion in a multi-robot cooperative aerial manipulation scheme for larger and heavier payloads, exploiting recent developments in inter-robot management schemes66 and combining those with the simplicity and unique abilities of our grasping solution.