Abstract
Most recent solutions for users’ authentication in Industry 4.0 scenarios are based on unique biological characteristics that are captured from users and recognized using artificial intelligence and machine learning technologies. These biometric applications tend to be computationally heavy, so to monitor users in an unobtrusive manner, sensing and processing modules are physically separated and connected through point-to-point wireless communication technologies. However, in this approach, sensors are very resource constrained, and common cryptographic techniques to protect private users’ information while traveling in the radio channel cannot be implemented because their computational cost. Thus, new security solutions for those biometric authentication systems in their short-range wireless communications are needed. Therefore, in this paper, we propose a new cryptographic approach addressing this scenario. The proposed solution employs lightweight operations to create a secure symmetric encryption solution. This cipher includes a pseudo-random number generator based, also, on simple computationally low-cost operations in order to create the secret key. In order to preserve and provide good security properties, the key generation and the encryption processes are fed with a chaotic number sequence obtained through the numerical integration of a new four-order hyperchaotic dynamic. An experimental analysis and a performance evaluation are provided in the experimental section, showing the good behavior of the described solution.
Keywords

Introduction
Industry 4.0 [1] represents an innovative technological approach, where Cyber-Physical Systems [2], Artificial Intelligence [35, 72], automatic decision systems [23], optimization solutions [18], exponential technologies [74], robotics [73] and circular business models are building the productive fabric [3, 4].
In Industry 4.0, production activities and workplaces are user-centric [5] and personalized [7]. Therefore, Industry 4.0 systems must implement user authentication and identification solutions [8]. However, most traditional mechanisms for user authentication (based on explicit interactions through plastic cards, keyboards and readers) are considered massive, intrusive, and too pollutants to be part of Industry 4.0 solutions. Then, to address this situation, in the last years, authentication technologies based on personal biometric characteristics have been reported [10]. Nevertheless, recognition algorithms tend to be computationally heavy, and large processing servers are needed. If those servers are placed into the user’s living environment, their wellbeing could be reduced. Thus, an edge computing architecture [11] is employed in most recent authentication systems. In that scheme, sensors capturing people information [13] are placed close to final users, and heavy processing algorithms are deployed in large hidden gateways or servers [12]. This solution, on the other hand, opens new problems to be addressed [14]. In particular, employed microcontrollers are typically very resource constrained, and they cannot implement common privacy preservation solutions (such as standard cryptographic algorithms) because of their computational cost [15]. Actually, as sensors are not connected to the global Internet, the risks are lower than in other scenarios [16], but biometric information is a very critical and valuable data, and clear communications should not be distributed in a public radio medium. Even in those short ranges, attackers could try to collect data in an unauthorized manner.
Therefore, the objective of this paper is to describe a new lightweight encryption mechanism for short-range wireless communications in Industry 4.0 biometric systems.
Hereinafter, the term “lightweight” refers to mechanisms with a very reduced computational cost or power [6]. In our work, we are looking for an algorithm requiring a very limited computation time. However, Industry 4.0 systems need lightweight algorithms in all senses [9]. Thus, we are also evaluating the memory consumption of the proposed encryption algorithm, looking for a solution with a reduced memory usage.
The proposed solution is focused on enabling a good quality privacy preservation technique, by using only simple binary operations. The cipher includes only lightweight functions, including permutations and rotations; and a XOR gate combining the private information with a secret pseudorandom key. The secret key is obtained through a pseudorandom number generator (PRNG), where only binary operations are employed. In order to guarantee the resulting scheme is strong enough and presents good characteristics in terms of entropy, key space, key sensitivity, etc., the PRNG and the cipher are fed with a chaotic number sequence, generated by numerically integrating a hyperchaotic four-order dynamic. This dynamic introduces important properties such as the sensitivity to the initial conditions. Besides, as an integer dynamic is employed, the computational cost remains acceptable contrary to other approaches based on fractional chaos.
The remainder of the paper is organized as follows. Section 2 presents the state of the art on lightweight encryption mechanisms. Section 3 presents the proposed solution, including all the considered elements and modules. Section 4 describes the experimental evaluation; and Section 5 concludes the paper.
State of the art
Proposals about lightweight encryption mechanisms may be classified into two basic groups [17]: on the one hand, cryptographic primitives and, on the other hand, application-specific technologies. Cryptographic primitives are generic algorithms or mathematical functions that can be integrated into different encryption schemes for different scenarios. Application-specific solutions are vertical security technologies specifically designed to adapt the characteristics of certain scenarios.
The following subsections analyze works on each one of these two groups.
State of the art on cryptographic primitives
State of the art on cryptographic primitives
Cryptographic primitives may be based on four basic implementation technologies [19]: block ciphers, stream ciphers, hash functions and hardware cryptosystems.
Block ciphers. A first group of lightweight block ciphers try to improve the performance of DES (Data Encryption Standard) and AES (Advanced Encryption Standard) algorithm through different strategies. A large collection of AES-like lightweight ciphers following a Substitution-Permutation Network (SPN) structure have been reported. From solutions where only small differences are applied, such as AES-128 [20], to more innovative approaches such as KLEIN [21], SKYNNY [24] or LED [22]. Other lightweight encryption mechanisms such as PRINCE [25] or Hummingbird2 [26] introduce larger differences and, even, modify the main core algorithm. Other AES-like lightweight encryption schemes simplify the key management, so the global number of operations and their complexity are also reduced. Solutions such as PRESENT [27], TWINE [28] or mCRYPTON [29] employ keys whose length is greatly below 128 bits of traditional AES. Moreover, although apparently using large keys, some AES-like ciphers such as TEA [30] chop the original key into small subkeys. On the other hand, schemes such as PRIDE [31], RECTANGLE [32] or Neokeon [33] replace traditional complex operations in AES algorithm for simple binary operations which reduce the global computational cost. Finally, although because of the intrinsic insecurity of DES they are less common, some lightweight ciphers based on DES may be also found. DESL [34] is probably the most known.
A second relevant group of lightweight block ciphers are those based on Feistel networks. Feistel functions may be ARX-based (only additions, rotations and XOR operations are allowed) or may be general functions. Encryption mechanism such as SIMON [39], RC5 [40], SPECK [39] or XTEA [41] belong to the first group. While technologies such as KASUMI [42] or MISTY [43] are based on the second approach. Solutions such as RoadRunneR [44] have been studied in Industry 4.0 scenarios, but they show a poor key structure to guarantee its lightweight properties.
All these previous block encryption schemes, however, present two common problems. First, most of them are not completely secure anymore. And, secondly, current block ciphers still consume large amount of resources. Even solutions with the lowest key length and employing only bitwise operations have reported important memory and processing time consumptions [15].
Stream ciphers. Lightweight stream ciphers are sparse, and, although they perfectly meet the characteristics of resource-constrained devices (some of them are even focused on these scenarios, such as TRIVIUM [45]), most of them are already broken because of their simple structure and low key length (designed to operate at a very high speed). MICKEY [46] and GRAIN [47] ciphers are examples of this situation. Hash functions. Around 2010, there was a great interest to create fast, lightweight hash functions supporting asymmetric encryption schemes in IoT devices. However, reported solutions such as PHOTON [48], QUARK [49] or SPOGENT [50] reduced the output size, so the probability of collision dramatically increased. Thus, most of these schemes are unsecure in typical practical applications. Hardware cryptosystems. Hardware supported cryptographic mechanisms have received a lot of attention in the last years. ASIC (application-specific integrated circuit) and FPGA (field-programmable gate array) are employed to build computationally low-cost cryptographic functions [51]. The main practical problem of cryptographic hardware is its sparse flexibility and lack of commercial platforms, what increases the maintenance and replacement costs.
Table 1 summarizes all the information about light-weight cryptographic primitives.
For the best of our knowledge, no lightweight encryption technology for Industry 4.0 systems has been reported. Although different lightweight schemes for Industry 4.0 scenarios [52] have been described, the focus of our work is totally different. Currently, lightweight security technologies are being applied to three basic scenarios: cloud computing systems, Internet of things (IoT) deployments and sensor networks.
Cloud scenarios. These solutions tend to be focused on reaching computationally fast algorithms, taking profit of the large resources that are available in the cloud. In that way, attribute-based encryption schemes [53] and parametric encryption solutions [38] may be found, where keys are generated by complex procedures from attribute descriptions between both devices communicating [54]. Moreover, as cloud systems are communicating through the global Internet, solutions based on proxies that consume a little amount of private information and re-encrypt and transmit the packets [55], all this in a very fast and efficient manner, have been also reported. On the other hand, specific lightweight encryption technologies for mobile devices accessing to cloud services may be found. Solutions such as the Very Lightweight Proxy Re-Encryption (VL-PRE) [56] or the Lightweight Homomorphic Encryption (LHE) [57] where the key generation process (based on asymmetric mechanisms such as RSA [70]) is improved to reduce its computational cost. Furthermore, lightweight encryption systems whose purpose is reducing the power consumption may be also found [58]. IoT systems. Lightweight encryption schemes for IoT deployments are typically based on reduced symmetric and/or asymmetric ciphers. Solutions consisting of simple symmetric algorithms where bits are mixed according to a random sequence [15] and simplified Elliptic Curve Cryptography (ECC) mechanisms [59] may be found. The main problem of these proposals is to balance between a high security level and a fast encryption delay. Some hybrid mechanisms have been proposed, combining lightweight symmetric and asymmetric techniques [60, 17], but they are hard to adapt to Industry 4.0 scenarios. Sensor networks. Schemes for wireless sensor networks [62], Smart Homes [61] and 5G networks [63] may be found. The main drawback of these works is the practical impossibility to adapt application-specific technologies to new scenarios in an efficient manner. Among solutions for sensor networks, there is a group of mechanisms that are very relevant for our works: chaos-based solutions. Chaos-based cryptography employs schemes such as masking [64] or modulation [65] to build solutions, such as watermarking mechanisms [67]. These schemes, however, are vulnerable; and more complex dynamics [66] and fractional chaos [36] have been investigated. Even optical chaos has been employed at physical level [68, 14]. Any case, these approaches are still weak as, among other things, transmitter and receptor must be coupled and synchronized [37], and they force the user to send the private key as a stream in a parallel channel, which can be captured by attackers. To address this problem, in our proposal, chaos is not employed as main encryption system, but as a way to increase the entropy and sensibility of the cipher.
The proposed cryptosystem (see Fig. 1) includes three basic modules: the chaos generation module, the pseudo-random number generator (PRNG) and the encryption module.
Scheme for the proposed cryptosystem.
The chaos generation module is software component producing a self-maintained (autonomous, no extra energy input is needed) numerical oscillating trajectory, based on a four-order chaotic dynamic. It generates four different chaotic signals, whose divergence rate may reach very high values (depending on the configuration). These signals are obtained through numerical procedures and take values from the set of real numbers. Thus, these signals must suffer an adaptation process to transform them into a chaotic sequence of positive integer numbers, as private information and PRNG operate only over the set of natural numbers.
Although chaos presents a sensible and complex behavior, it is a deterministic signal. Then, if directly employed to encrypt private information, statistical analyses and similar techniques could discover the underlying structure. To avoid this problem, a random signal must be employed as key in the cipher. To cover this function, a PRNG fed with a random seed is producing a pseudo-random numerical flow. The proposed PRNG, known as Trifork, only employed bitwise operations to create a high quality pseudo-random signal. The PRNG must be fed with a seed, which is calculated and updated at real-time from the chaotic signals and the previous random number sequences. Thus, the randomness of the final number sequence employed as secret key is even increased, guaranteeing its does not follow any pattern and it cannot be easily replicated by seed variation techniques.
Finally, the secret key from the PRNG and the chaotic signals are introduced in a cipher, which follows a hybrid approach between stream and block ciphers. Private information is divided into small cells, so the proposed encryption schemes may employ block encryption techniques, but macroscopically, it acts as a stream cipher at real time. The proposed cipher integrates only parametric bitwise operations and simple matrix manipulations, but enriched with chaotic signals so the sensibility, confusion and diffusion properties are largely increased compared to previously reported lightweight ciphers. To even increase more the induced confusion and diffusion in the encrypted messages, the cipher implements a feedback loop providing a complex bit mixture. Next subsections are describing all details about each one of these modules.
Traditional chaotic dynamics have been reported to be vulnerable [66]. Systems such as the Lorenz dynamic present structural problems as variables are highly coupled and divergence is limited to only one dimension. Therefore, dynamics with a more complex behavior and higher order are being studied. In this manner, we are considering hyperchaotic dynamics [66], which show a wide catalogue of trajectories as main core of this chaos generation module.
The proposed dynamics is defined by four continuous ordinary differential Eq. (3.1), where four different continuous real time-dependent variables
As resource-constrained devices in Industry 4.0 work with a limited word size (usually less than 16 bits), we look for employing only unsigned arithmetic and data formats, so we can take advantage of all bits (no bit for sign is required) and improve the numerical precision in calculations. As a consequence, we are considering
In order to determine the parameters space of this dynamics, we must consider that the system must be globally stable Eq. (5), although unstable in some directions of the phase space , so the volume
represents the dynamics in vector format
Now, the dynamics may be linearized through the Jacobian matrix Eq. (8), evaluated in a generic equilibrium point
From this linearized system, three characteristic equations are deducted Eqs (3.1–3.1).
A numerical evaluation of the four eigenvalues associated to each characteristic equation shows that, for any valid combination of the bifurcation parameters, there is always, at least, one unstable equilibrium point. This is a very important result, as it guarantees the chaos generation module is generating an oscillating signal for many possible parameter configurations. This guarantees the PRNG seed does not follow a simple pattern and the cipher has a high entropy. Although any other ranges could be selected, in this paper we are assuming the bifurcation parameters vary in specific bounded intervals Eq. (3.1).
Bifurcation diagrams. (a) 
In those regions, the bifurcation diagrams (see Fig. 2) show a large catalogue of structures, including regular and chaotic trajectories. The corresponding chaotic attractors, besides, can be considered self-existent as their basin of attraction is vast and an equivalent structure is obtained for almost every possible set of initial conditions.
For those ranges of the bifurcation parameters, Lyapunov exponents and Kaplan-York dimension (see Table 2) reach extremely high values, showing the relevant complexity and sensibility to initial conditions and small variation in the bifurcation parameters of the product chaotic signals. Figure 3 shows the most representative attractors in that area.
Most representative attractor. (a) 
Lyapunov exponents for the proposed chaotic dynamic
In order to implement a software chaos generation module, the proposed continuous differential dynamics must be transformed into a sequence of simple mathematical numerical operations (i.e. additions, subtractions, multiplications, etc.). This is especially relevant to meet the requirements of Industry 4.0 resource-constrained devices. To do that, we propose to evaluate the chaotic trajectories using a Runge-Kutta numerical method, which has been proved to solve chaotic differential problems with a good precision and low error.
Although some previous works have successfully integrated chaotic trajectories using four-order numerical methods [14], in order to increase the entropy and security of our cipher and guarantee a good representation and calculation of hyperchaotic trajectories, we propose a more complex scheme. Specifically, we are adapting the Huta’s formula [71] to the proposed dynamic. Huta’s traditional formula is defined for unidimensional problems, so we are adapting this definition to vectorial functions and trajectories
In those conditions, the Huta’s formula is formally identical to a sixth order eight-stage Runge-Kutta method. On the other hand, the Huta numerical method tends to fluctuate between very small and very high numerical values, which is not an adequate behavior for precision-limited devices as they could overflow (in Industry 4.0 many devices show an 8-bit architecture, for example). Then, the operating range of this numerical method may be modified by adapting some coefficients [69] in the original Huta’s proposal. The resulting numerical approximation can be directly applied to the dynamics, in order to calculate hyperchaotic trajectories
This Runge-Kutta method defines an initial value problem which requires a four-dimensional vector of initial conditions
This primary key starts the encryption process by triggering the generation of the chaotic signals that feed the PRNG and the encryption module. Nevertheless, this key is not directly involved in the encryption process and, as the chaotic dynamics presents a high entropy, it cannot be deducted from the encrypted messages nor the chaotic signals (see Section 4).
This primary key must be shared between both devices to be communicated. This process can be done using many existing solutions and protocols [14, 15]. As this operation will be occasional, its impact in the long-term computational cost will be negligible.
With this approach, four real chaotic signals are generated, in a way that meets the Industry 4.0 characteristics. First, as can be seen Eq. (13), this six-order method only requires eight numerical substitutions to generate a new sample; while a traditional four-order Runge-Kutta method needs twelve. Then, the proposed approach is computationally less complex. But, second, at the same time, it is much more accurate. In fact, the error in this modified Huta’s formula can be approximated by the error in a seven-point Newton Cotes formula [69] Eq. (17); while in a standard Runge-Kutta method the error is in the order
These real functions, however, are not compatible with integer symbols in PRNG and encryption schemes. Thus, they must be adapted. In other words, data flows taking positive and negative values must be mapped to only take positive values. This operation will be performed through a simple algebraic function
At this point, the four integer chaotic signals
Group
(a) Proposed scheme for the PRNG (b) internal configuration of Trifork PRNG.
Among all lightweight PRNG one of the most efficient in computational terms is the Linear Feedback Shift Register (LFSR), where
Generating, then, long pseudo-random sequences needs big registers (and memory space), which are not always available in Industry 4.0. Two basic strategies have been reported in the literature to address this problem. On the one hand, the introduction of additional control parameters produces the Tausworthe generators Eq. (23).
This PRNG is a feedback scheme including a set
Tausworthe generators produce longer sequences but, however, may cause microcontrollers overflow because of natural multiplications. On the contrary, LFGs perfectly meet the requirements of Industry 4.0 devices, but sequences present poorer cryptographic properties [15]. Then, in this work, we propose a hybrid technique, where control parameters and modular arithmetic are employed at the same time. To solve the overflow risk, multiplications are replaced by binary left-shift and right-shift operations (from a numerical perspective the result is quite similar), which are lighter and cannot overflow the controller. To improve the cryptographic properties of LFG, the calculation procedure Eq. (25) is complicated by adding additional variables to increase the entropy of the final secret key.
To do that, we are representing the LFG as a trinomial Eq. (27) over the Galois Field of two elements,
However, although the output sequence only repeats after
This standard LFG configuration is weak against modern statistical attacks. Then, a very efficient manner to increase entropy in LFG is to perturbate LSB in the samples to increase its period and make the probability of samples more uniform. These perturbations may take the form of any arithmetic operation, but if additional internal samples are obtained, the computational cost of the global PRNG will go up. To avoid this problem, perturbations are introduced by manipulating the bits inside each sample. The resulting scheme Eq. (32) is known as Perturbed Lagged Fibonacci Generators (PLFGs), where
In this case, in order to reduce the PRNG computational cost, we are putting together the multiplications introduced by Tausworthe generators and perturbation from PLFG Eq. (33). The result is a lightweight PRNG where
This expression Eq. (33) perturbates LSB in every sample, while (at the same time) they represent an arithmetic multiplication inherited from Tausworthe generators Eq. (35).
At this point, the proposed PRNG Eq. (33) includes three important innovations that adapt this technology to Industry 4.0 scenarios:
As a response, we create complex PRNG by interconnecting (as branches) different PLFG Eq. (33). Specifically, in this work we are using a three-branch scheme, named Trifork (see Fig. 4). The three branches are connected as follows: the final global samples are obtained through a XOR operation applied to branches, the branches will be totally hidden for external users and the final number of samples depends on several parameters, so statistical attacks cannot infer neither the internal system parameters, the current or past system state nor the secret keys. With this approach, Trifork PRNG Eq. (36) produces random sequences much longer than the ones obtained from conventional PLFG, but with a lower number of operations (and computational cost) than three independent PLFG or one complex PLFG including all operation in only one expression.
Proposed encryption module.
Experimentally, it has been proved that values around
Despite all the previous designs, Trifork needs to be initialized at random, and the randomness of the seed is a key factor conditioning the final behavior of the entire PRNG. Therefore, the seed is not introduced by users, but obtained from chaotic signals
As can be seen in Fig. 4, chaotic signals are introduced in a serial-to-parallel register, mixing samples in an alternative way from both signals in
This alternance is controlled through a simple 2-to-1 multiplexer and a cyclic binary counter (with any length), where the LSB is employed to control the multiplexer.
As said before, Industry 4.0 devices may be operating for very long periods, and even complex PRNG such as Trifork may show a periodical behavior if infinite time is considered. To avoid that problem, the proposed PRNG includes a reset mechanism. The Trifork output is monitored by a cyclic counter, so each time a new sample is generated the count is increased. After reaching the maximum value, the seed is refreshed with the current content in the serial-to-parallel register. This seed will be totally different from the previous one, thanks to the chaotic signals, and (then) the random number sequence will start again, avoiding periodical behaviors as much as possible.
The final random sequence of integer numbers is employed as operational secret key at the encryption module, where private biometric information is finally protected.
Once the operational secret key
The proposed encryption module presents the following characteristics:
The encryption process and the decryption process are identical. Thus, the operational secret key The proposed scheme is flexible, and a variable number of rounds may be considered. In this article we are describing two different approaches, but any other may be also valid. Input data, in this work, are considered to be already serialized and packed to be sent through the communication system. However, other information types may also be considered, if just the adequate microfragmentation and wrapping function is included. The cipher is byte oriented, to match the internal structure of the encryption module and the PRNG.
The raw input data stream
These microcells are then wrapped following a zig-zag scheme Eq. (40) to create matrices
One of the key problems in block cipher is the plaintext attack. In biometric authentication solutions this is especially critical. Exposing the biometric system to a certain particular input, we can force the cipher to work with critical inputs (for example, a null matrix). In that case, the system may expose the internal structure or the private key. To avoid this problem, in the first encryption round, the input information is introduced in a destabilization phase.
In this phase, chaotic signal
Then, the second chaotic signal
The bitConfusion function
The permutationRow function
Finally, the rotation function
After this process, the matrix
It is important to note that sequence
Once the final encrypted cell
In order to decrypt the encrypted matrix
Then, the XOR encryption is easily reversed, as it is enough to re-apply the same operation using the same secret key (53).
Besides, as both chaotic dynamics are synchronized, the rotation and confusion steps can be also undone. In this case, permutations and rotations must be undone in the exact opposite order in which they were applied. Thus, indexes
In order to evaluate and test the proposed technology, we conducted an experimental validation, based on simulation scenarios and tools. This experimental phase consisted of two phases: the first phase includes a security analysis of the proposed technique, while the second part considers a performance analysis (focused on the encryption delay).
All the experiments were supported by a simulation scenario describing a complex biometric system including three different devices: cameras for facial recognition, fingerprint readers and iris readers. This system represents a large infrastructure, for example an airport, where different biometric techniques are employed for different purposes (such as access control or criminal identification). Twenty-five devices belonging to each type are considered in all simulations. Each different device model is created according to commercial solutions, to simulate a realistic data stream (regarding both factors, data format and communication protocol). The fingerprint reader model was created according to R307 fingerprint Arduino module. Iris scanner was coded to simulate the behavior of IriMagic 100BK platform. And, finally, facial recognition is simulated to be performed by ESP-EYE embedded cameras.
Each subsystem was connected with a different authentication sever, where biometric information is decrypted and processed. Other effects such as packet losses or electromagnetic interferences are not considered in this experimental section.
To perform the experiments, the simulation scenario was implemented and executed using MATLAB 2017a software. All simulations were performed using a Linux architecture (Ubuntu 20.04 LTS) with the following hardware characteristics: Dell R540 Rack 2U, 96 GB RAM, two processors Intel Xeon Silver 4114 2.2G, HD 2TB SATA 7,2K rpm.
All simulations represented an operation time of seventy-two (72) hours. Each simulation was repeated twelve times, and final results were obtained as the average of all partial results. In order to perform all the described simulations, the system was configured using the parameters described in Table 3. Parameter
Configuration parameters
Configuration parameters
In order to formally analyze the privacy and security level reached by the proposed technology, three different approaches may be done: the Kerckhoff’s approach, based on analyzing the theoretical characteristics of the secret key; the Shannon’s theory, where the statistical relation between information in the raw and the encrypted streams is numerically calculated through different indicators; and the Diffie-Hellman’s approach, based on studying the resilience of the proposed solution against some key attacks. In this work, we are including a first study including all three perspectives.
The Kerckhoff’s approach considers no security mechanism can be secret for an indefinite time, so the security level of any solution, at the end, depends on the properties of the key. In particular, three different indicators are analyzed: the key sensitivity, the key space and the resilience against the known-plaintext attack and the chosen-plaintext attack.
The key sensitivity represents how different are two encrypted messages when protected using similar keys. Strong security mechanisms generate totally different encrypted messages even if very similar keys are employed. The number (or percentage) of bits changing the final encrypted message (
Finally, the resilience against the known-plaintext attack and the chosen-plaintext attack represents how different are two encrypted messages when two similar clear messages are employed. As when talking about the key sensitivity, the number (or percentage) of bits changing the final encrypted message (
On the other hand, the Shannon’s security analysis is based on statistical tests. These tests study how much information from the private original biometric streams is present in the encrypted messages. Many different indicators and tests may be employed to evaluate this value, but in this work, we are using six of them: the statistical correlation between encrypted and original messages, the entropy of encrypted messages and the mutual information between encrypted and original messages, the histogram variance (strong cryptosystems generate encrypted messages uniformly distributed), the Number of Byte Change Rate (NBCR) and the Unified Average Changing Intensity (UACI) evaluating how different (in bits) are the original and the encrypted messages (in percentage and per bit, respectively), the sequence test that evaluates how random are the encrypted messages and, finally, the NIST PRNG suite analyzing how random are the secret operation keys.
Eight hundred (800) random messages, with a length of twelve (12) kilobytes were generated and encrypted using the proposed scheme. Raw and encrypted messages were introduced in standard libraries for correlation, entropy, and mutual information calculation. The final result is obtained as the average of all these 800 measures. In order to calculate the histogram variance, encrypted messages are formatted as integer numbers, and they are introduced in a standard library for histogram calculation and manipulation. The same process is performed to introduce encrypted messages in the sequence test and evaluate their randomness. Finally, the NBCR and UACI are calculated through the XOR operation of every pair of raw and encrypted messages. The number of bits set to the unit in the result is then measured (percentage is easily calculated considering the message length). The final results are obtained as the average value of all these 800 measures.
Finally, the Diffie-Hellman’s scheme analyzes how resilient the proposed solution is against two basic cyber-attacks: the Known Message Attack (KMA) and the Encrypted Only Attack (EOA). In KMA, attackers have access to encrypted and original messages; while in EOA they only have access to encrypted messages. In strong crypto solutions, secret keys must not be revealed in any case.
Performance analysis: methodology
The proposed solution must show a performance compatible with Industry 4.0 requirements and scenarios. In particular, biometric information must flow at real time and the resource consumption must be low enough to allow deploying the proposed technology in small microcontrollers (typical in Industry 4.0 applications). In order to analyze these factors, two different experiments were carried out.
The first experiment evaluates the encryption delay introduced by the proposed solution in biometric authentication systems. The experiments consider different values for
For both experiments, a real ESP-EYE device was employed to code and deploy the proposed security mechanism (face recognition). That device was operating for 72 hours, and results about resource consumption and processing delay were collected and processed using MATLAB software. Real Industry 4.0 scenarios could integrate other devices such as fingerprint readers or iris scans. However, we decided to perform our experiments using only cameras because they are the most demanding devices, as they generate the highest data bitrate among all biometric devices and support the most complex data processing algorithms. Thus, if proposed solution is lightweight enough to operate with cameras, it is expected to work with other biometric devices.
Results and discussion
First, we are showing and discussing the results for the formal security analysis. We are first studying the key space. For a N-bit PRNG the maximum number we can obtain is
Thus, we can always find a configuration with a key space large enough for any given application.
Figure 6 shows the key sensitivity for different system configurations. As can be seen, the growing rate in all cases is exponential, although it goes up as the number of rounds is increased, showing that schemes with higher round numbers are safer. Besides, as can be seen, differences between schemes with ten or more rounds are very small, including the scheme where
Key sensitivity: Results.
Any case, for schemes with
Resilience against known-plaintext attack and chosen-plaintext attack: results.
Figure 7 shows system resilience against known-plaintext and chosen-plaintext attacks. As can be seen, in this case the evolution is also exponential. The results are pretty similar to Fig. 6, as both the secret key and the raw message follow similar paths. Both flows are divided into cells and combined through the XOR operation which is a commutative operator. However, raw messages are randomized using additional rotation and confusion steps, so the resilience is expected to be higher than the key sensitivity given the same cipher configuration. Actually, as can be seen, given a difference of 20% in the secret key the key sensitivity is close to 70%, while for the same difference, the resilience is near 80%. Moreover, in this case, it is a clear improvement when using
As a conclusion, the proposed system is secure, as it includes a clear configuration allowing a total resilience.
Now, we are analyzing the security analysis results according to Shannon’s perspective. Table 4 shows the obtained results for the considered statistical indicators.
As can be seen, globally, the deviation of the real values from the ideal theoretical ones is around 3%. This amount is acceptable, and similar to other previously reported schemes [15]. In general, theoretical values are associated to totally random encrypted messages, so mutual information, correlation and variance must be null, and entropy equal to the number of bits in each cell. Deviation in entropy, mutual information and correlation may be considered negligible, and caused by pseudorandom flows. Variance shows a higher deviation that is caused by the underlying structure in PRNG and chaotic signal. This analysis is also supported by the sequence test, that shows a residual deterministic behavior in encrypted messages. This statistical test evaluates the significance level of the deterministic hypothesis (i.e. encrypted messages follow a predictable pattern). In this case, the hypothesis is rejected (results is below proposed value for
Shannon’s cryptoanalysis for the proposed solution
The values obtained for NBCR and UACI are coherent with all previous discussions, showing that the proposed scheme can also be considered secure from Shannon’s perspective. Specifically, NBCR refers how many bits the encrypted and raw messages have in common (if both messages are totally different, NBCR takes value 100%). As can be see, the number of common bits is negligible.
The last test related to Shannon’s view is the NIST PRNG test suite. Table 5 summarizes the obtained results. As can be seen, all tests were approved, even with a very high score. Thus, the operation secret key is good and random enough to ensure the security of the global proposed technology.
NIST PRNG test suite: Results
Finally, Figs 8 and 9 show the resilience of the proposed cryptosystem against the Known Message Attack (KMA) and the Encrypted Only Attack (EOA), following the Diffie Hellman’s approach.
Resilience against known message attack (KMA): Results.
Resilience against encrypted only attack (EOA): Results
As can be seen, schemes where the number of rounds
In standard scenarios, the number of captured packets would tend to be small or medium, as large amounts are only possible in Man-in-the-Middle attacks, which are not possible in offline biometric information exchanges between sensors and servers in edge computing architecture.
In fact, the security level is usually defined as the required number of packets to reach a success rate of 90%. In this case, the security level against KMA is equal to
Once the security properties of the proposed mechanism are proved, the resource consumption of the technology must be analyzed, to study if it matches the Industry 4.0 requirements. Table 6 shows the memory and computational consumption of the proposed solution. RAM memory percentage refers the usage of the memory space for dynamic variables (compared to the available space in an ESP EYE device); while program space percentage refers the usage of the memory space for firmware in ESP EYE devices. As can be seen, even in resource-constrained devices, the memory usage caused by the proposed algorithm is globally around 10%, while the number of operations per encrypted cell is lower than other similar lightweight encryption schemes based on chaos [8]. In fact, the proposed scheme improves the resource consumption of similar proposals in around 25%.
Resource consumption: Results
Finally, Fig. 10 shows the processing delay caused by the proposed encryption solution, for different number of rounds in the encryption module (only non-chaotic configurations for
Processing delay: Results.
In this paper, we propose a new lightweight secure symmetric encryption solution for Industry 4.0. This cipher includes a pseudo-random number generator based on simple computationally low-cost operations to create the secret key. To preserve and provide good security properties, the key generation and the encryption processes are fed with a chaotic number sequence obtained through the numerical integration of a new four-order hyperchaotic dynamic.
In general, we can conclude that the use of chaotic signal in encryption schemes improves their performance and security properties (key sensitivity and resilience), while the computation delay and jitter increase. Besides, with the proposed approach, based on simple binary operations, the resource consumption reduces up to 25% compared to the state-of-the-art mechanisms. This low-level approach requires the use of low-level programming languages and efficient data structures, so technological experts are essential to implement and deploy this new encryption system. In order to make easier its adoption in Industry 4.0 scenarios, prosumer mechanisms will be considered in future works.
Footnotes
Acknowledgments
This work is supported by the Ministry of Science, Innovation and Universities through the COGNOS project (PID2019-105484RB-I00) and by the European Commission by the Cities2030 project (H2020-FNR-2020-1. Grant no: 101000640).
