A machine learning approach for package size estimation using UHF RFID interrogation signature

Vales-Alonso, Javier; López-Matencio, Pablo

doi:10.1007/s10489-024-05412-2

A machine learning approach for package size estimation using UHF RFID interrogation signature

Open access
Published: 04 May 2024

Volume 54, pages 6053–6068, (2024)
Cite this article

Download PDF

You have full access to this open access article

Applied Intelligence Aims and scope Submit manuscript

A machine learning approach for package size estimation using UHF RFID interrogation signature

Download PDF

440 Accesses
Explore all metrics

Abstract

This paper introduces a new approach for performing package classification and sizing using Radio-Frequency Identification (RFID) systems. This technique is applicable when packages are labeled with or contain multiple RFID-tagged items. During the interrogation of the tags, received signal strength (RSS) statistics and other information, such as the frame count or the reading time, are collected by the reader and used to predict the package type from a set of candidate classes using an Artificial Neural Network (ANN). The primary challenge lies in acquiring sufficient training data for a target scenario to ensure reliable predictions. To address this, a two-phase training process based on transfer learning is adopted. Initially, a base model is developed using synthetic data generated from a detailed RFID simulator, designed to suit diverse scenarios, establish detailed link budgets, and comprehensively simulate the communication protocols. This model is then refined using a small dataset collected experimentally in the actual scenario. This method was validated in a real testbed with four different package types. The base model was trained using 1000 synthetic samples per package type (4000 in total), whereas the refined model was trained with a dataset consisting of only 25 real interrogation traces (samples) per package type (100 in total). The experimental samples were obtained using a software-defined radio unit, the Ettus B210 Universal Software Radio Peripheral (USRP) platform. This experiment achieved an accuracy of over 92%. In summary, this approach introduces a new feature to existing RFID setups, demonstrating potential for advanced package handling and cost optimization in the logistics sector.

Machine Learning for RF Fingerprinting Extraction and Identification of Soft-Defined Radio Devices

Package and Classify Wireless Product Features to Their Sales Items and Categories Automatically

Intelligent Radar Signal Recognition and Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Measuring and classifying packages is a fundamental activity in logistics, crucial for determining how each package will be handled, stored, delivered, and the costs associated to these operations. Various technologies are available to automate this task. Devices using light (e.g., [17]) or ultrasonic sensors (e.g., [27]) can measure the distance to the package faces along each axis, enabling accurate measurements of rectangular boxes with high resolution and fast processing speeds (less than one second per package). For irregular shapes, devices equipped with sensor arrays, such as light beams (e.g., [28]), are also available. However, these devices tend to be more expensive and slower, as the package must move alongside the sensing device while being measured. All of the previous systems require specific installation and operational procedures, such as controlled spaces where boxes must be aligned with specific marks before sizing. A more flexible alternative is the use of 3D cameras (e.g., [29, 36]) based on stereo-vision, time-of-flight (ToF), Light Detection and Ranging (LiDAR), and other technologies (refer to [44]). These cameras can detect the shapes of objects within their line of sight by processing images and fitting the observations to predefined candidate object models. They operate indoors at fast speeds but come at higher costs than the approaches mentioned previously.

Cost-effective alternatives include traditional vision-based measurement solutions, but these may incur uncertainties due to calibration issues [22], or face operational challenges in dynamic or cluttered processing environments. These challenges encompass movement, varying lighting conditions, and obstructions, limiting their deployment to controlled and homogeneous spaces. The integration of deep learning-based image analysis (see [25] and references therein) has enabled object detection and classification in less controlled environments by fitting the package to a set of candidate shapes. However, these systems require large training datasets, even when adapting pre-existing models. Although some public datasets with package images are available (e.g., [1, 35]), they primarily focus on box detection rather than class/size recognition.

In addition to the technologies previously mentioned, this work explores the application of Ultra High Frequency (UHF) Radio Frequency Identification (RFID) systems for classification and sizing tasks. RFID technology is widely utilized in warehouses and logistics facilities. In RFID systems, a reader can query nearby tags for their identification. Tags utilize the power from the reader’s signal to energize themselves and respond to queries through backscattering. Originally conceived for item tracking, RFID has evolved to facilitate a variety of remote sensing tasks [9], which can be performed either by implementing specialized tags capable of executing sensing operations and transmitting supplementary data to the reader, or by post-processing the tags’ signals in the reader.

Examples of the first type include works [24, 30]. In [24], the authors develop an RFID tag capable of measuring and transmitting the pH of athletes’ sweat. Similarly, the work in [30] designs a tag for measuring and relaying information about object vibrations and tilt to the reader.

The second approach is based on collecting low-level information from the tags’ transmissions (typically, received signal strengths -RSSs- and phases) and implementing remote sensing tasks by correlating this information with the target magnitude. For example, in [39], researchers utilize the phases of a tag’s backscattered signals to estimate temperature. Another example introduces a method to measure soil moisture, based on the RSS from a tag’s backscattered signal, as described in [31]. It is also possible to analyze responses from multiple tags. For instance, in [8], authors deploy tags on the walls of a room and correlate the variance in the tags’ signal RSSs with the number of people in that room. Another strategy involves examining the temporal evolution of tags’ responses. For example, [46] detects drivers’ fatigue by analyzing time-series readings from tags placed in a hat, and [40] introduces a method to deduce breathing periods by analyzing both RSS and phase time-series from backscattered signals of several tags.

Building on these concepts, this work proposes a novel RFID-sensing application that utilizes low-level information from the RFID identification process as a signature to classify package types among a set of candidates (similarly to computer vision systems), premised on the assumption that each package will be labeled with or contain multiple RFID-tagged items. In the context of this work, the RSSs statistics are used as main information for the signature, operating under the hypothesis that RSSs range and variance should correlate with the maximal distance among tags, which, in turn, should depend on the package type and size. Moreover, additional statistics from the interrogation process, such as frame count or total reading time, are also integrated into the signature as they can be indicative of the package type. For example, a higher frame count or prolonged reading time may suggest multiple interrogation attempts to identify the most distant tags, implying larger package sizes.

The proposed approach has the ability to provide reasonable capabilities without incurring additional hardware costs, thus opening the door to various applications, such as unattended package classification, ensuring adherence to package manifests, identifying packages with unusual distribution that require alternative processing, and more. Furthermore, unlike the previously discussed methods, which require direct line of sight, RFID offers a unique advantage in measuring packages even when they are not directly accessible, such as when they are inside other containers.

A supervised machine learning model based on shallow Artificial Neural Networks (ANNs) has been considered as the predictive structure that relates the identification signature to the package class. Moreover, classes have been sorted by size to keep the sizing error minimal (since classification errors will be more likely between neighbor types). The main challenge in this scheme is to create a suitable training strategy that does not require gathering large datasets from real scenarios, which would be prohibitive in practice. To that end, a two-stage transfer learning strategy has been adopted. It consists of building first a base model, which is trained using a synthetic dataset derived from a simulator adjusted to closely match the real scenario, and then a fine-tuned model, where a small set of experimental samples obtained from the actual setup is used to recalibrate the base model.

To explore the operation of this strategy, we start by describing related works in Section 2 and by defining reference scenarios in Section 3. These scenarios encompass typical elements found in standard RFID installations. The performance of the classifier defined in Section 3.4 is analyzed in Section 5 using training data gathered using the simulator described in Section 4. Section 6 explores the impact of deviations from the reference scenario and the application of transfer learning methodologies to correct this issue. Section 7 describes the application of this method in a real testbed aimed at mail classification tasks. Finally, Section 8 provides the concluding remarks.

2 Related work

Multiple machine learning predictors have been explored in various RFID sensing studies. ANNs have been utilized in works such as [38] and [4], while Recurrent Neural Networks (RNNs), specifically Long Short-Term Memory (LSTM) structures, have been examined in [46] and [2]. Deep learning architectures have also been employed, for example in [45] (refer to survey [12] for details). The majority of these studies rely on extensive real datasets to develop the predictive models. Akin to our case, some works on RFID sensing, like [41], employ transfer learning from synthetic to experimental scenarios to mitigate data scarcity.

Regarding the specific application of package size/class estimation, we are not aware of any previous proposals in the literature dealing with this challenge, aside from our preliminary model presented in a prior conference paper [34]. Nevertheless, some works have tackled different aspects of package characterization. For instance, in [3] authors describe how to perform a 3D reconstruction of RFID-tagged packages to determine their orientation and stacking, using the phases of the backscattered signals. Moreover, a method for identifying the direction of goods passing through an RFID gate in a warehouse was introduced in [2]. RFID technology was also applied by Li et al. [20] for locating packages on shelves, with the approach being extended in [21] to include a drone-based reader. Other research focusing on package location includes [7, 13, 15, 23, 26, 43], aimed at determining the relative or absolute position of tagged items or estimating their pose, as investigated in [33]. Additionally, the concept of interrogation signatures was examined by Khadka et al. [19], who studied the unique physical-layer identification signature of passive UHF RFID tags and its implications for tag holder’s privacy.

In summary, while numerous studies have demonstrated the versatility of RFID to create innovative solutions beyond simple identification tasks, its application to package class/size estimation remains largely unexplored in the literature.

3 Reference scenarios

Our target scenario is composed of a single reader with one pair of bistatic dislocated antennas, as shown in Fig. 1. The gate has dimensions: d (distance between antennas) ranging from 2 to 3 meters depending on the configuration, $h_a$ (antenna height) of 2.5 m, and $h_b$ (package placement height) of 1 m (these are normal setups in RFID installations). Packages with UHF RFID-tagged items inside arrive at the reading area for identification, and their location, orientation, and other parameters are subject to a random component. Three increasingly complex setups have been considered:

1.
Ideal scenario. In this case, packages are always placed in the same reading spot (x in Fig. 1), resting on the largest face, and without any kind of rotation (L edge is parallel to the dashed line connecting RX and TX antennas, see Figs. 1b-c). The number of tags and their position inside the package is random (see Section 3.1). Tags always rest on the horizontal plane and parallel to the edge W of the package (see Figs. 1b-c). Although simplistic, this scenario may occur in practice, e.g., when packages are automatically placed on a conveyor belt and the interrogation process is triggered by a photodetector, that always stops the belt at the same spot.
2.
Simple scenario. A slight amount of randomness is assumed for the packages in this scenario. Packages are placed near, but not exactly on, the reading spot. The actual position $x'$ is given by adding independent random variables $\mathcal {N}(0,\frac{L}{10})$ to both X and Y axes. The packages are assumed to rest on their largest face, but now only roughly aligned with the TX-RX line ($\phi $ $\sim $ $\mathcal {N}(0,\frac{\pi }{12})$ radians, see Fig. 1). Like in the ideal scenario, the number of tags and their position inside the package is random, but the orientation of the whole bunch may vary and be either vertical or horizontal (all tags are coherently aligned). This scenario may correspond to a semi-automatic placement process, with some small differences among packages.
3.
Complex scenario. In this scenario, more intense changes occur in the position (position displacement is $\mathcal {N}(0,\frac{L}{5})$ for both axes), and in the orientation ($\phi $ $\sim $ $\mathcal {N}(0,\frac{\pi }{6})$). In addition, packages are randomly rotated (i.e., they can rest on any face). The number of tags and their position are random, as in the previous scenarios, but now each particular tag orientation is selected independently (tags are not coherently aligned). This case would correspond to a scenario where packages are manually prepared and positioned.

3.1 Package types and distribution of the number of tags

As stated above, the number of tags inside each package is considered random in the reference scenario. The expected number of tags in a package is assumed to be proportional to the package volume to represent a realistic situation. Three possible sets of package sizes have been considered. Table 1 summarizes them: (i) a small set with 4 types of packages (4P), (ii) a medium set with 8 types (8P), and (iii) a large set with 16 types (16P). The sizes correspond to products from the company UK packaging^{Footnote 1}. The tag distribution is given by $\max \{1,P(\lambda )\}$, being $P(\lambda )$ a Poisson random variable with rate $\lambda $. This distribution guarantees that all packages have at least one tag. Besides, a uniform density of 100 tags/m$^3$ has been considered, which yields the mean number of tags per package shown in the last column of Table 1.

Table 1 Package dimensions (L/W/H) [m] and mean number of tags per package ($\lambda $) [tags]

Full size table

3.2 RFID interrogation

Tag identification is performed using the well-known Framed Slotted Aloha (FSA) anti-collision protocol, which corrects situations where several tags respond simultaneously. In FSA, the interrogation process involves multiple frames, divided into slots. In a given frame, non-identified tags select a random slot to communicate their identity. If no collisions occur, the reader acknowledges their IDs. If tags do not receive acknowledgment, they will attempt to identify themselves in subsequent frames. The FSA protocol helps to reduce the possibility of collisions and improve the efficiency of the system. The number of slots allocated at each interrogation round (frame) is considered fixed and equal to 16, which is a common setting in commercial readers. During this interrogation process different data and statistics can be collected. The following ones have been considered in this work:

(i)
Number of tags read,
(ii)
Total interrogation time,
(iii)
Total number of interrogation frames required,
(iv)
Average RSS of singleton slots (slots where a tag response can be correctly decoded),
(v)
Average RSS of slots with collisions (slots where a tag response is detected but cannot be decoded),
(vi)
Minimum RSS of singleton slots, and
(vii)
Maximum RSS of singleton slots.

Depending on which features are used, three possible information models have been defined:

Tag model. It uses only the first feature (number of tags read). It is the simplest model that can be created. It serves as the baseline reference to compare with other models.
Basic model. It comprises standard information collected by the RFID readers, features (i) to (iii). Similar variables are available in off-the-shelf RFID readers such as Impinj^{Footnote 2} or Alien^{Footnote 3}.
Full model. Comprises all the features: (i) to (vii), and should provide the best performance. Low-level data is available in some commercial readers. For example, RF phase angle, Doppler frequency, and peak RSS can be obtained in Impinj models^{Footnote 4} and have been used in research works related to object sensing (e.g., [42, 43]). Another option to collect this info is using custom software-defined radio (SDR) readers as in the tesbed developed in this work (see Section 7).

3.3 Data representation

Figure 2 shows a representation of the dataset for the simple, d $=$ 2.5 meters, 8P and 16P cases, obtained using the simulator described in the next section. As can be seen in the figure, the classification task is challenging. If only the number of tags read is considered (tag model) it is not possible to correctly classify the package types (there are significant overlaps in the range of read tags for each type of package, as shown in the Figs. 2a and e). When more features are added, e.g., the number of interrogation frames or the difference between the maximum and minimum RSS, the classification can be improved (see second and third columns of Fig. 2, respectively). For instance, a greater RSS difference or higher number of frames indicates that packages are larger (higher class). This effect is particularly noticeable with the RSS difference since it clearly separates small and medium types. Larger packages are still difficult to differentiate among them. However, if all the features in the example are used (3D representations in the last column of Fig. 2), the largest classes can be better separated for small RSS differences (below 2 nW), since they tend to require a higher number of interrogation frames. In summary, the classification task is challenging, and adding additional features to the data information models can lead to notable accuracy improvements.

Table 2 ANN layouts

Full size table

Table 3 Simulator configuration

Full size table

Table 4 Training dataset: interrogation process sample

Full size table

3.4 Predictive system

The predictive problem addressed in this work is categorized as supervised classification, for which various learning structures are aptly suited. ANNs were selected due to their inherent flexibility and prevalent use in similar applications. Various ANN layouts, described in Table 2, underwent testing, including configurations with identical layouts but incorporating a 20% link dropout between layers to enhance generalization. These layouts are henceforth referred to as L1-L4, and L1D-L4D when dropout is included. The number of inputs aligns with the chosen information model: tag, basic, or full. The output layer comprises 16 nodes (some may be deactivated during training), representing the possible candidate types, and employs a softmax activation function to select the class with the highest probability. The multi-class cross-entropy serves as the loss function, as a common practice for such problems. Besides, as package types are sort by their size, as shown in Table 1, the size error is minimized (misclassifications occur more frequently between neighbor classes, as discussed in Section 5).

Datasets were generated using the simulator, which is detailed in the next section. For each scenario (ideal, simple, and complex), the dataset includes 20000 samples, uniformly distributed among the different package types (i.e., 5000, 2500, and 1250 packages per class for the 4P, 8P, and 16P cases, respectively).

Implementation of the ANN was achieved using Keras over Tensorflow. In addition to dropout, training over-fitting was mitigated using an early stopping mechanism with a patience parameter set to 100 epochs, acting on a validation data set. Accuracy results were computed utilizing the repeated holdout cross-validation resampling procedure. This algorithm operates as follows. At each repetition:

Algorithm 1

Box interrogation simulation.

Full size image

1.
Selects a random partition of 80% of the data for training, 10% for validation, and 10% for testing,
2.
Trains the model using the training set using early stopping on the validation set,
3.
Evaluates the model using the testing set, and
4.
Adds the accuracy sample to compute the mean and the confidence level

This procedure is run until the accuracy is statistically (using the t-test) within a confidence interval ±2.5% for its mean with a confidence level over 99% (using at least 20 samples). ANN training is performed using the minibatch gradient descent method with a learning rate of 0.01 and a batch size of 16.

4 RFID gate simulator

To construct the learning dataset, the model described in the previous section has been simulated in a fully-detailed UHF RFID gate simulator. Using a simulated setup has the advantage of allowing to gather a rich set of data, which could be prohibitively to collect in a real test-bed. To obtain reliable results this simulator implements a comprehensive channel model where the link budgets include:

The power-up and the backscatter links between antennas.
Fading due to multipath propagation between the tag and reader antennas, to include clutter effects.
Efficiency reduction due to the materials where the tags are attached.
The difference in load states during tags’ modulation, which reduces the amount of the reflected power towards the reader.

Table 3 summarizes the main parameters and characteristics of this simulator. At decoding, we consider bit error rate due to Additive White Gaussian Noise [6], and Miller [10] coding. Tag and reader antennas include gain variations due to their relative orientations [5, 16]. Besides, the simulator comprises a detailed implementation of the FSA tag anti-collision ISO 18000-6C protocol, including capture effect and tag outage computations. The simulations assume that tags are attached to cardboard material. Different materials (metal, plastic, aluminum, etc.) would affect the radio-electric characteristics of the tag, such as the radiation pattern of the antenna, and the values of the tags’ impedance. The simulator can be adapted to other materials with estimations of losses, and radio-electric changes in the tags [14].

The interrogation process is performed in the simulator for each package, using FSA with a fixed frame length of 16 slots. The process is finished when a given number of stop frames (7 were used in the simulator) are received totally empty. This experiment has been repeated 20000 times (each with a random package configuration) to construct the datasets for training the predictive system. The simulation process for each package is summarized in Alg. 1 and receives the corresponding power-up and backscattering link path losses for each reader-tag pair as inputs. These path losses are computed using a line-of-sight channel model and adding a Rician fading with 3 dB factor.

Table 5 Training dataset: frame information sample

Full size table

P

Table 6 Accuracy [%] / Dimensional error [%] obtained for the best performing layout. L1 (blue), L2 (peach), L3 (white), L4 (gray), Ties (purple)

Full size table

An example of the statistics collected is shown in Tables 4 and 5. The former summarizes, at each row, the high-level statistics for the interrogation procedure of each package, and the latter provides low-level statistics for each interrogation frame (note that the interrogation procedure for each package usually comprises multiple frames). For example, the first package was inventoried in the 12 first frames in the simulator (see Table 4). Note that in this example, the frames 10 and 11 are empty (see Table 5), but since the number of stop frames is higher, the reader continues the interrogation. In the frame 12, a last tag is identified (the empty stop frames afterward are not stored in the table). Besides, Table 4 stores also the random conditions (e.g., size, tags, orientation, etc.) under which the test has been performed, and information about the real number of tags in the package and whether or not all tags have been read. This information is not provided to the machine learning model, but stored for analysis purposes.

5 Results

Results have been computed using the repeated holdout cross-validation resampling procedure outlined in Section 3.4. Table 6 summarizes the average accuracies obtained for each scenario/model/gate configuration (parameter d represents the gate width, see Table 3) obtained with the best performing ANN layout for each configuration. Experiments shown important differences between these layouts, being L2 the best performing. The average difference between the best and worst-performing layouts is 2.9%, and reaches 11.9% for the simple 8P scenario with $d$ $=$2.5 m., where the L4D layout (worst) gets 69.6% accuracy compared to L2 (best), which gets 81.5%.

Regarding the absolute results, the classifier operates better for scenarios with less randomness, a smaller number of package candidates, and using the full information model. This is consistent with the expected performance. For example, for the 4P case, d $=$ 2.5 m, simple scenario, the accuracy virtually reaches 100%, whereas, for the 8P case it achieves nearly 88% and drops to 82% for the 16P case. If the scenario configuration is more difficult (complex case), the accuracy drops in all cases with respect to the ideal and simple ones, but it can still reach about 99.5%, 80.9%, and 57.5%, for 16P, 8P and 4P configurations, respectively. Besides, the gate dimensions also affect the results (accuracy is reduced by about 7% when comparing the best and the worst cases). The use of better information models yields improvements in all the experiments. For instance, in the simple-16P case, the full model improves accuracy at least a 17% with respect to the tag model. These improvements are smaller in the complex setup but still significant (around a 7% compared to the other information models). Since the tag model constituted our baseline estimation, the previous results demonstrate that this estimator can be notably improved using additional data from the RFID signature, validating the hypothesis proposed in this work.

A deeper insight into the results also indicates the “smooth” behavior of the predictor (mistakes occur most likely between similar classes). Figure 3 shows the confusion matrices for the best performing ANN layout both for the simple and complex scenarios for the 8P and 16P cases. These matrices were computed by averaging the results on the test datasets with the repeated holdout procedure. The types are mainly mistaken by the most similar ones (as can be seen in Table 1 similar-sized packages have near types). This characteristic reduces the absolute size estimation error (see Section 5.1). Errors are more noticeable in the larger packages, due to the higher variance of the signature data (due, indeed, to a higher number of tags and longer distances inside the package). A higher variance makes it easier to mistake similar packages. This effect can be also seen in Fig. 2. For example, attending to the number of tags read or the number of frames used in the interrogation process, the possible input range is wider for larger packages (e.g., for the package type s15 in the Fig. 2f, the possible frame number values range from 7 to beyond 50, while for the intermediate-size or small packages this range is much narrower). This effect also causes a noticeable overlapping between the largest packages in Figs. 2c and g. Overlapping can be corrected (to some extent) by the predictive model, as shown in the confusion matrices.

5.1 Size estimation error

In order to compute the size estimation error, let $p_{ij}$ be the classification ratio for the i-th row, j-th column of the confusion matrix, and W, H, L the package dimensions shown in Table 1. The dimensional error relative to the true package size, $\varepsilon ^\%$, is defined as:

$$\begin{aligned} \varepsilon ^\% {=} \sum _{i,j} p_{ij} \frac{\sqrt{(W_i - W_j)^2 + (H_i - H_j)^2 + (L_i-L_j)^2}}{W_j+H_j+L_j} \end{aligned}$$

These errors are also shown in the Table 6. Using the full information model, the error is below 3% for the 16P simple scenario, while it increases to 6.1% for the complex one. The error rises to an 8.8% and 7.6% respectively for the tag or basic models in the simple scenario, and to 8% and 8.1% in the complex one. In conclusion, it is possible to enhance the package size estimation by using extended data of the RFID interrogation signature.

6 Transfer learning

Implementing the package type predictor requires experimental data or accurate scenario simulations. The last option is challenging since real cases could be subject to many unknown variations which affect the final RFID interrogation performance. Therefore, using actual data seems the only suitable alternative. However, obtaining and labeling such a large dataset, like the one used in the previous section, is quite lengthy or directly infeasible. A first alternative could be to reduce the dataset to an achievable size in practice. To study this idea, the accuracy results for the simple and complex scenarios with $d=2.5$ m using the full information model have been computed using increasingly larger datasets (from 20 to 1000 records). The experiment has been repeated 100 times for each dataset size, selecting a random dataset (from the total 20000 record dataset) at each run. Figure 4 shows the average and the worst-case-scenario (wcs) results for this experiment. Besides, these figures also show (red horizontal line) the maximum accuracy in each scenario, shown in Table 6 and obtained by training the ANN with the complete 20000 record dataset. Each figure has been computed with the best-performing ANN layout corresponding to that specific scenario-information model pair.

Results reveal that more than 1000 records are necessary to achieve an average accuracy within a 5% interval of the upper limit accuracy in the simple scenario and more than 500 in the complex one. Moreover, when considering the wcs performance, this is degraded in comparison to the averaged one, and then at least 5000 samples (not shown in the figure) are needed to achieve a 10% interval goal. In short, the training dataset should include many samples (each obtained with randomized package setups) to guarantee reasonable accuracy. However, even datasets with 1000 records could be cumbersome or infeasible to obtain in practice.

One way to overcome the previous issue is to rely on transfer learning (TL), a well-known technique to address problems with reduced training datasets (e.g., [32, 37]). In our TL approach, the ANN should be firstly trained with a large set of synthetic data (e.g., from a reference simulated scenario), and later, a small sample dataset obtained with the actual system should be used to perform a fine-tuning of the prediction network.

To study this idea we have analyzed the performance of three TL cases. In all of them the reference scenario has been the 8P simple scenario with d $=$ 2.5 m. The three TL cases analyzed are:

TL1, derived from the reference scenario, but assuming different gate dimensions: $d$ $=$2.25 m, $h_a$ $=$2.25 m, $h_b$ $=$ 1.2 m.
TL2, derived from the reference scenario, but considering a different package set (appending s10 and s13 from Table 1) to the regular 8P types).
TL3, derived from the reference scenario, but considering new random conditions for the packages placement: (i) $x'$ $=$ $x-(1/4,0)+\Delta x$ being $\Delta x$ a variable with random Gaussian length with an average of one-eight of the longest edge of the package (L) and rotationally uniform, and (ii) a random rotation angle with zero mean and standard deviation $\frac{\pi }{10}$.

To implement and test the TL approach, the ANN is first trained (base model) using the whole lot of 20000 samples dataset for the simple 8P scenario. Then, weights in the ANN are fixed in all but the last layer, and an additional fine-tune training step is performed using a new dataset explicitly obtained for the TL scenario using a reduced learning rate of 0.001. The ANN layout used for the TL experiments is the L4D with sigmoid activation in the output layer, instead of the original softmax activation, since the TL has performed better using this configuration. Like in previous sections, the repeated holdout cross-validation resampling procedure is used. At each repetition, the TL dataset (which contains 20000 records) is sampled to the target size (up to 300 samples, as would correspond to the limited adjustment that can be performed in an existing facility). Then, this reduced dataset is divided into training (80%), validation (10%), and test (10%) datasets, which are used to train and evaluate the model. Note that the number of samples of each package type can be different in this case since the dataset is randomly drawn. However, their average number is similar (e.g., if the dataset is reduced to 100 samples, then the expected number of packages of each type for the 4P, 8P, and 16P cases would be 25, 12.5, and 6.25, respectively).

Figures 5a-c show the average and worst-case accuracies obtained for the TL approach and the average accuracy obtained by an ANN trained only with the new TL dataset (’New’ curve). Besides, the figures show the accuracy achievable in the TL scenario using the best performing ANN for the reference scenario (“No TL” line).

The TL can outperform the base setup with a small training dataset in all cases. For example, in TL1, with 100 samples, the TL achieves roughly $+$10% improvement over that base setup, and nearly reaches $+$20% for 300 samples. In TL2, the transfer learning can provide more than $+$15% improvement with 300 samples, while in TL3 the improvement is reduced, but still significant ($+$7% with 300 samples). In comparison, if the ANN is only trained with the new data, the results are drastically worse than those from the retrained network ($+$30% for TL1 and TL3, and $+$15% for TL2 with 300 samples, $+$40% for TL1, $+$30% for TL2, $+$50% for TL3 with 100 samples). Globally, these results support the TL approach.

7 Experimental testbed

Building upon the preceding results, this section presents a real case study to validate the 2-stage learning approach under actual conditions. To this end, an experimental testbed was set up, leveraging the software-defined radio (SDR) Ettus B210 Universal Software Radio Peripheral (USRP) platform. The Ettus B210 facilitates precise control over RFID interrogation signals and accurate capture of backscattered signals for subsequent processing. The interrogation software was based on the implementation by Nikos Kargas^{Footnote 5}, described in-depth in [18]. This implementation allows for the collection of signature variables proposed in Section 3. Figure 6 displays the test setup. Similar to the system presented in Section 4, it consists of a bistatic pair of antennas aiming at a target area where packages are loosely positioned, akin to the simple scenario described in Section 3. For the experiments, four specific types of packages were selected, showcased in Fig. 7. These consist of (i) an envelope, (ii) a poster tube, (iii) a large box, and (iv) a small box. In this case, for feasibility and unlike the scenarios studied through simulation, the tags were placed directly on the package surface. A total of 25 interrogation traces were taken for each type of package using 2 tags, 25 with 4 tags, and 25 with 6 tags, totaling 300 samples. The tags were placed at random spots on a regular 3x3 grid on the package surface for the envelope and boxes, and uniformly for the poster tube. As can be seen in the examples from Fig. 7, the points were located a short distance from the package edges (approximately 1-2 cm). In all experiments, tags from the Confident U8_7014 model^{Footnote 6} were used. It should be highlighted that in the tests, in a significant number of cases, the number of tags read for each package was not always the total, as some of them could not be activated due to their position.

In order to obtain an initial ANN model, a simulation suited to the test scenario was also carried out, using an RFID gate with size as indicated in Fig. 6. In this simulation, the parameters from Table 3 were adjusted to fit the scenario configuration. An operating frequency of 910 MHz and a B210 transmission gain of 60 dBs were selected. Tags were positioned as mentioned in the previous paragraph, and 1000 interrogation traces were obtained for each configuration following the random variations proposed for the simple scenario. This model was trained and achieves an average accuracy of 97.5% (similar to the levels achieved in Section 5), tested on simulated data.

After this stage, the transfer-learning method was applied with the actual data obtained with the experimental setup. As in Section 6, 10% of samples were reserved for ANN validation, another 10% for final testing, and the remaining 80% for model training. The transfer-learning training was repeated 10 times (each with a random subset of the actual samples), and the averaged precision results are provided in Table 7, for each tagging size. In this case, and unlike the results presented in previous sections, it has not been assumed that the number of tags depends on the package size, so the classification is performed only with the low-level parameters of the RFID signature. The last cell in the table shows what happens when we assume a variable number of tags depending on the type of package. It has been considered that we will have large boxes with 6 tags with a probability of 75%, or 4 tags with a probability of 25%. For the poster tube, the possible cases are 6-tags (25%) and 4-tags (75%). For the small box: 4-tags (75%) or 2-tags (75%). And, finally, for the envelope: 4-tags (75%) or 2-tags (25%). It can be observed that assuming these conditions substantially improves the worst fixed case studied (the 2-tags configuration). In general, the result has been satisfactory, achieving precisions that may allow applications that do not mandate high measurement accuracy but do require a reasonable knowledge of the object’s size.

Table 7 Accuracy results for the testbed

Full size table

8 Conclusions

In this work, a novel capability for RFID gates has been proposed: the determination of the class and size of packages containing tagged items. By utilizing features derived from the tags’ interrogation process, an ANN implementation achieves moderate accuracy and small dimensional error. This method does not necessitate additional hardware to existing RFID gates and can furnish valuable information for cargo management. Moreover, transfer learning emerges as a viable approach to navigate a real scenario, initiating from an approximate simulated configuration and subsequently refining it using a limited set of samples from actual gate operation, as demonstrated in both simulated and experimental conditions. This validates the applicability and robustness of the proposed RFID signature method in real-world scenarios, aimed at cases were high measurement accuracy is not paramount, but a reasonable estimation of object size is sufficient.

Future work will consider new features for the interrogation signature as well as exploit multiple readings of the same tags to enhance the model’s accuracy, in addition to conduct further tests on prototypes.

Availability of data

The datasets generated and analysed during the current study are available in the author’s GitHub repository (https://github.com/plopezmp/package-size-estimation-using-UHF-RFID-signature.

Notes

References

ABC (2023) Package V2 dataset. https://universe.roboflow.com/abc-d9ezq/package-v2. Accessed 09 Feb 2024
Alvarez-Narciandi G, Motroni A, Pino MR et al (2019) A UHF-RFID gate control system based on a recurrent neural network. IEEE Antennas Wirel Propag Lett 18(11):2330–2334. https://doi.org/10.1109/LAWP.2019.2929416
Bu Y, Xie L, Gong Y et al (2019) RF-3DScan: RFID-based 3D reconstruction on tagged packages. IEEE Trans Mob Comput 20(2):722–738. https://doi.org/10.1109/TMC.2019.2943853
Buffi A, D’Andrea E, Lazzerini B et al (2017) UHF-RFID smart gate: tag action classifier by artificial neural networks. In: 2017 IEEE International conference on RFID technology application (RFID-TA), pp 45–50. https://doi.org/10.1109/RFID-TA.2017.8098900
Ciftler BS, Kadri A, Güvenç I (2017) IoT localization for bistatic passive UHF RFID systems with 3-D radiation pattern. IEEE Internet Things J 4(4):905–916. https://doi.org/10.1109/JIOT.2017.2699976
Clester IJ (2020) RFID localization for interactive systems. PhD thesis, Massachusetts Institute of Technology. https://hdl.handle.net/1721.1/129201
DiGiampaolo E, Martinelli F (2018) A robotic system for localization of passive UHF-RFID tagged objects on shelves. IEEE Sens J 18(20):8558–8568. https://doi.org/10.1109/JSEN.2018.2865339
Ding H, Han J, Liu AX et al (2018) Counting human objects using backscattered radio frequency signals. IEEE Trans Mob Comput 18(5):1054–1067. https://doi.org/10.1109/TMC.2018.2852627
Elbasani E, Siriporn P, Choi JS (2020) A Survey on RFID in industry 4.0. Internet of things for Industry 4.0: design, challenges and solutions pp 1–16, Springer International Publishing. https://doi.org/10.1007/978-3-030-32530-5_1
EPCglobal G (2018) EPC radio-frequency identity protocols generation-2 UHF RFID; specification for RFID air interface protocol for communications at 860 MHz–960 MHz. Accessed 10 Apr 2021
ETSI E (2016) 302 208 V3. 1.1 (2016-11) Radio frequency identification equipment operating in the band 865 MHz to 868 MHz with power levels up to 2 W and in the band 915 MHz to 921 MHz with power levels up to 4 W; harmonised standard covering the essential requirements of article 3.2 of the directive 2014/53/EU. European Telecommunications Standards Institute
Fan X, Wang F, Wang F et al (2019) When RFID meets deep learning: exploring cognitive intelligence for activity identification. IEEE Wirel Commun 26(3):19–25. https://doi.org/10.1109/MWC.2019.1800405
Fu H, Ma Y, Gong X et al (2022) Device-free multitarget localization with weighted intersection multidimensional feature for passive UHF RFID. IEEE Sensors J 22(7):7300–7310. https://doi.org/10.1109/JSEN.2022.3151386
Galappaththige DAL, Rezaei F, Tellambura C et al (2022) Link budget analysis for backscatter-based passive IoT. IEEE Access 10:128,890-128,922. https://doi.org/10.1109/ACCESS.2022.3227499
Giannelos E, Andrianakis E, Skyvalakis K et al (2021) Robust RFID localization in multipath with phase-based particle filtering and a mobile robot. IEEE J Radio Freq Identif 5(3):302–310. https://doi.org/10.1109/JRFID.2021.3086759
Greene CE (2006) Area of operation for a radio-frequency identification (RFID) tag in the far-field. PhD thesis, University of Pittsburgh. http://d-scholarship.pitt.edu/id/eprint/6418
Group D (2024) SMART QBING: automatic in-motion dimensioning and weighing system. https://www.digisystem.com/products/PRD00324/. Accessed 02 Feb 2024
Kargas N, Mavromatis F, Bletsas A (2015) Fully-coherent reader with commodity SDR for Gen2 FM0 and computational RFID. IEEE Wirel Commun Lett 4(6):617–620. https://doi.org/10.1109/LWC.2015.2475749
Article Google Scholar
Khadka G, Ray B, Karmakar NC et al (2022) Physical-layer detection and security of printed chipless RFID tag for Internet of Things applications. IEEE Internet Things J 9(17):15,714-15,724. https://doi.org/10.1109/JIOT.2022.3151364
Li C, Tanghe E, Plets D et al (2020) ReLoc: hybrid RSSI-and phase-based relative UHF-RFID tag localization with COTS devices. IEEE Trans Instrum Meas 69(10):8613–8627. https://doi.org/10.1109/TIM.2020.2991564
Li C, Tanghe E, Suanet P et al (2021) ReLoc 2.0: UHF-RFID relative localization for drone-based inventory management. IEEE Trans Instrum Meas 70:1–13. https://doi.org/10.1109/TIM.2021.3069377
Lins RG, Santos REd, Gaspar R (2023) Vision-based measurement for quality control inspection in the context of Industry 4.0: a comprehensive review and design challenges. J Braz Soc Mech Sci Eng 45(4):229. https://doi.org/10.1007/s40430-023-04050-y
Ma Y, Zhang Y, Wang B et al (2020) SCLA-RTI: a novel device-free multi-target localization method based on link analysis in passive UHF RFID environment. IEEE Sens J 21(3):3879–3887. https://doi.org/10.1109/JSEN.2020.3023096
Mazzaracchio V, Fiore L, Nappi S et al (2021) Medium-distance affordable, flexible and wireless epidermal sensor for pH monitoring in sweat. Talanta 222:121502. https://doi.org/10.1016/j.talanta.2020.121502
Mi C, Huang Y, Fu C et al (2021) Vision-based measurement: actualities and developing trends in automated container terminals. IEEE Instrum Meas Mag 24(4):65–76. https://doi.org/10.1109/MIM.2021.9448257
Article Google Scholar
Motroni A, Buffi A, Nepa P (2021) A survey on indoor vehicle localization through RFID technology. IEEE Access 9:17,921-17,942. https://doi.org/10.1109/ACCESS.2021.3052316
Quantronix I (2024) CubiScan 100 automatic dimensioner optimizes warehouse. https://cubiscan.com/cubiscan-100/. Accessed 12 Feb 2024
Quantronix I (2024) CubiScan 225: In-line dimensioning on-demand packaging. https://cubiscan.com/cubiscan-225-2/. Accessed 12 Feb 2024
Quantronix I (2024) Cubiscan 75 pro: best performing package dimensioner. https://cubiscan.com/cubiscan-75-pro/. Accessed 12 Feb 2024
Rahmadya B, Chen X, Takeda S et al (2020) Measurement of a UHF RFID-based battery-less vibration frequency sensitive sensor tag using Tilt/Vibration switches. IEEE Sens J 20(17):9901–9909. https://doi.org/10.1109/JSEN.2020.2992345
Rodić LD, Županović T, Perković T et al (2021) Machine learning and soil humidity sensing: signal strength approach. ACM Transactions on Internet Technology (TOIT) 22(2):1–21. https://doi.org/10.1145/3418207
Ross NS, Sheeba PT, Shibi CS et al (2023) A novel approach of tool condition monitoring in sustainable machining of ni alloy with transfer learning models. J Intell Manuf 35(2):757–775. https://doi.org/10.1007/s10845-023-02074-8
Tang J, Gong Z, Wu H et al (2021) RFID-based pose estimation for moving objects using classification and phase-position transformation. IEEE Sensors J 21(18):20,606-20,615. https://doi.org/10.1109/JSEN.2021.3098314
Vales-Alonso J, López-Matencio P (2021) Box size estimation using ANNs in UHF RFID gates from interrogation process features. In: 2021 6th International conference on smart and sustainable technologies (SpliTech). IEEE, pp 1–5. https://doi.org/10.23919/SpliTech52315.2021.9566409
Vision V (2023) Packages dataset. https://universe.roboflow.com/computer-vision-2gfz5/packages-vsudh. Accessed 09 Feb 2024
Vision Systems Design (2016) 3D cameras measure packaged product volume. Vision Systems Design. https://www.vision-systems.com/home/article/16748047/3d-cameras-measure-packaged-product-volume. Accessed 12 Feb 2024
Wang H, Lu W, Tang S et al (2022) Predict industrial equipment failure with time windows and transfer learning. Appl Intell 52(3):2346–2358. https://doi.org/10.1007/s10489-021-02441-z
Wang T, He Y, Li B et al (2018) Transformer fault diagnosis using self-powered RFID sensor and deep learning approach. IEEE Sens J 18(15):6399–6411. https://doi.org/10.1109/JSEN.2018.2844799
Wang X, Zhang J, Yu Z et al (2019) On remote temperature sensing using commercial UHF RFID tags. IEEE Internet Things J 6(6):10,715-10,727. https://doi.org/10.1109/JIOT.2019.2941023
Wang Y, Zheng Y (2019) TagBreathe: monitor breathing with commodity RFID systems. IEEE Trans Mob Comput 19(4):969–981. https://doi.org/10.1109/TMC.2019.2900214
Wang Z, Xu M, Xiao F (2021) Recognizing 3D orientation of a two-RFID-tag labeled object in multipath environments using deep transfer learning. In: 2021 IEEE 41st International conference on distributed computing systems (ICDCS), pp 652–662. https://doi.org/10.1109/ICDCS51616.2021.00068
Xu G, Sharma P, Hui X et al (2021) 3-d indoor device-free object detection by passive radio frequency identification. IEEE Trans Instrum Meas 70:1–13. https://doi.org/10.1109/TIM.2021.3059309
Xu G, Sharma P, Hysell DL et al (2021) Indoor object sensing using radio-frequency identification with inverse methods. IEEE Sensors J 22(12):11336–11344. https://doi.org/10.1109/JSEN.2021.3086700
Xu H, Xu J, Xu W (2019) Survey of 3D modeling using depth cameras. Virtual Reality & Intelligent Hardware 1(5):483–499
Article Google Scholar
Yang C, Wang X, Mao S (2020) RFID-pose: vision-aided three-dimensional human pose estimation with radio-frequency identification. IEEE Trans Reliab 70(3):1218–1231. https://doi.org/10.1109/TR.2020.3030952
Yang C, Wang X, Mao S (2020) Unsupervised drowsy driving detection with RFID. IEEE Trans Veh Technol 69(8):8151–8163. https://doi.org/10.1109/TVT.2020.2995835
Article Google Scholar

Download references

Funding

Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. Partial funding through grant AriSe, (Ref. PID2020-116329GB / AEI / 10.13039/501100011033).

Author information

Authors and Affiliations

Department of Communication and Information Technologies, Technical University of Cartagena, Plaza del Hospital 1, Cartagena, 30202, Murcia, Spain
Javier Vales-Alonso & Pablo López-Matencio

Authors

Javier Vales-Alonso
View author publications
You can also search for this author in PubMed Google Scholar
Pablo López-Matencio
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Original idea, design, work coordination and analysis of results were performed by Javier Vales-Alonso. The simulator and the experimental testbeds were conducted by Pablo López-Matencio. Both authors contributed to the material preparation. The manuscript was reviewed and revised by all authors until the final version was approved.

Corresponding author

Correspondence to Javier Vales-Alonso.

Ethics declarations

Competing interests

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vales-Alonso, J., López-Matencio, P. A machine learning approach for package size estimation using UHF RFID interrogation signature. Appl Intell 54, 6053–6068 (2024). https://doi.org/10.1007/s10489-024-05412-2

Download citation

Accepted: 22 March 2024
Published: 04 May 2024
Issue Date: April 2024
DOI: https://doi.org/10.1007/s10489-024-05412-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A machine learning approach for package size estimation using UHF RFID interrogation signature

Abstract

Similar content being viewed by others

Machine Learning for RF Fingerprinting Extraction and Identification of Soft-Defined Radio Devices

Package and Classify Wireless Product Features to Their Sales Items and Categories Automatically

Intelligent Radar Signal Recognition and Classification

1 Introduction

2 Related work

3 Reference scenarios