Abstract
Handoff management is the method in which the mobile node maintains its connection active when it shifts from location to other. The devastating success of mobile devices as well as wireless communications is emphasizing the requirement for the expansion of mobility-aware facilities. Moreover, the mobility of devices requires services adapting their behavior to abrupt context variations and being conscious of handoffs, which make an intermittent discontinuities and unpredictable delays. Thus, the heterogeneity of wireless network devices confuses the situation, since a dissimilar treatment of handoffs and context-awareness is essential for every solution. Hence, this paper introduced the Deep Q network-based Firefly Aquila Optimizer (DQN-FAO) for performing the handoff management. In order to establish the handoff management, the process of selecting network is very important. Here, the network is selected based on the devised FAO algorithm, which is the consolidation of Aquila Optimizer (AO) and Firefly algorithm (FA) that considers the metrics, such as Jitter, Handoff latency, and Received Signal Strength Indicator (RSSI) as fitness function. Moreover, the handover decision is taken by the DQN, where the hyper-parameters are tuned by the devised FAO algorithm. According to the hand over decision taken, the context aware video streaming is happened by adjusting the bit rate of the videos using network bandwidth. Besides, the devised scheme attained the superior performance based on the call drop, energy consumption, handover delay, throughput, handoff latency, and PSNR of 0.5122, 7.086 J, 10.54 ms, 13.17 Mbps, 93.80 ms and 46.89 dB.
Keywords
Introduction
People can now migrate from one location to another and adapt to various networks based on their needs. Numerous mobile nodes (MNs) are interconnected with diverse wireless systems, like Worldwide Microwave Communication Interoperability (WiMAX) and Wireless Fidelity (Wi-Fi). Hence, heterogeneous networks (HetNets) are required to keep the users linked with any one of the networks as per choice of the user at specified time. It has a well-defined structure that contains micro cells, such as femtos, picos and Wi-Fi, which are overlaid by the macro level networks [18]. The HetNets poses a greater number of real-world tasks related to the complex system resources in a regimented system [33]. These networks are extensively used in both business and education to manage high data rate as well as the rising demand for global coverage [10]. HetNet resources are efficiently utilized on the basis of Radio Resource Management (RRM) approach along with handoff mechanism. The handover mechanism is significant for assuring heterogeneous wireless networks’ performance [15,28]. Furthermore, vertical handoff time decision approach has to be established, because it directly influences the transmission characteristics as well as streaming video quality. When vertical handoff measures as well as network chosen period are assessed, the handoff process is carried out, streaming video series are subjected to the target network over serving network. In vertical handoff streaming video broadcast ecosystem, the transmitted video has to be corrupted. Consequently, streaming video server and client play a role in the vertical handoff environment to guarantee the quality of the streaming video [23].
The data about the MN position and deployment management in certain region of radio communication networks are kept in the mobility management system. It enables the stable connectivity between the networks by means of handoff technique. Hence, hi-tech handoff systems are established to encounter the various Quality of service (QoS) constrains. Handoff is the technique employed to transfer mobile nodes from one network to further networks without altering the connectivity. The upcoming wireless networks need to assist wide-range applications, users as well as constraints for connectivity. The quick access has to be achieved by means of small sized cells. Despite, if the base station’s intensity is increased, the handoff frequency is also maximized [30,33]. Moreover, vertical handoff approach is classified into three phases. At first, mobile node should know about the easily accessible remote structures, this process is termed as system detection. This method is used to identify the appropriate method which fulfils the user gradients with requisite QoS [9]. In the handoff process, mobile node measures vertical handoff constraints associated with novel radio communication networking to reconcile by handoff choice. After choosing the mobile node, the next phase is termed as handoff implementation. The optimal range of vertical handoff is depending on various parameters, such as cost, Bit Error ratio (BER), Bandwidth, Available Bit Rate (ABR), Signal to Noise ratio (SNR), and throughput [31,34]. These parameters are must observed along with flag feature in the convoluted heterogeneous circumstances [22].
Several algorithms, such as fuzzy logic [14] and artificial neural networks [2] are developed to achieve vertical handoff mechanism. These approaches have the ability for robust data processing and it minimize the ping pong effect effectually and also maximize the precision of decision making. In [13], the enhanced vertical handoff approach is designed by considering accuracy of Received Signal Strength (RSS) as well as Kalman filter along with fuzzy logic algorithm. Kalman filter is employed to filter Gaussian noise in RSS to attain accurate constraints and can enhance efficiency of decision making. On the other hand, the fuzzy logic approach is applied to manage huge amount of information. However, the fuzzy inference rules are exponentially increased, which creates a great impact on the system to make more complex. In [11], researchers developed the neural network approach for network selection, which enhances the system adaptive capability in network inconsistent state, but it is failed to contemplate user’s gratification as well as QoS. The artificial neural network [35] is utilized to ensure the Quality of Experience (QoE) in vertical approach for enhancing client satisfaction. The novel approach is designed on the basis of Q-learning for handoff mechanism. In this methods, QoE assessment mechanism is developed on the basis of Recurrent Neural Network (RNN) to enhance the QoE in some extend. Despite, the terminal devices has become unfit for solving complex computations due to the restricted computing proficiencies. Moreover, the decision tree based vertical handover approach [16] is used to minimize the possibility of fault handoff and more precise decisions are attained by means of Kalman filtering, thus enhance the firmness of the technique [28].
The main goal of this research is to establish the handoff management scheme in heterogeneous Wireless Networks. For that, the devised scheme follows various steps, such as network selection, handover decision and QoS metric computation. The network is selected by the devised FAO algorithm, which utilizes jitter, handoff latency and RSSI as fitness function. The handover is initiated when the data rate measured by the sensor node is lower than the predefined threshold value. Moreover, the handover decision is performed by DQN, where the decision taken by the DQN is based on distance of Mobile Station (MS) from access point, SNR, MS speed along with the fitness parameters. If the decision taken by the DQN is ‘Yes’, then calculate network bandwidth and finally adjust the bit rate of the video.
The prime contribution of this article is deliberated below:
The structure of this research paper is explained in this section. The review of various handoff management techniques are deliberated in Section 2, system model is discussed in Section 3, invented handoff management scheme is elaborated in Section 4 and the final part of this research is explained in Section 5.
Motivation
As seen in mobile phones and laptops, the use of wireless networks, where they operate in accordance with other networks, is rapidly increasing. These are linked together via different network terminal nodes that change depending on the networks that are accessible. Rapid network change results in vertical handoffs that cause services to discontinue, such as call blocking and poor streaming video quality. A more effective handoff management methodology for heterogeneous wireless networks is required to eliminate these problems. This methodology should assist in selecting the best available network to prevent ping pong. This section enumerates the literature reviews of various existing vertical handoff techniques in heterogeneous networks along with advantages, disadvantages, and challenges to develop best technique that inspire the researchers.
Literature survey
This portion interprets the review of traditional techniques corresponding to handoff management in context aware video streaming-based heterogeneous Wireless Networks. Han, Z., et al. [7] developed deep reinforcement learning approach for enabling the network to explore the behaviors of real users and the status of network, which effectively improved the rate of data during handoff procedure but still the state of wireless local area network (WLAN) constructed with simulator may diverge from real situations. Chen, J., et al. [3] introduced quality of experience (QoE) method for determining the association among quality of experience (QoE) and quality of service (QoS) in heterogeneous networks. This method attained finest performance with handoff dropping probability and also obtained better performance of QoE in maintaining charges and fatal power consumption but it has highest call blocking probability. Wang, S., et al. [28] developed Multi objective model for solving the shortage of complete consideration of user as well as the consequences of network during handoff procedure that efficiently enhanced the service quality of user and the utilization of resources. Although, this technique failed to process a larger quantity of data due to high complexity, slow computation speed and not appropriate for rapid moving scenarios. Dhipa, M., et al. [4] incorporated telecardiology method with Trust and Privacy based Multi-attribute Vertical Handoff decision algorithm for deciding the best probability interconnections reduced the blocking possibilities but failed to reduce the power consumption and computation costs. Zaheeruddin and Mahajan, P., [33] introduced a novel optimized vertical handoff technique for faultless of users to wireless heterogeneous networks. In this technique, the time for computation is very less and provided continual services but still there was a need for reducing the count of call drops and hence call dropping rate can be improved further for proficient functioning of wireless network.
Patil, M.B. and Patil, R., [20] designed Fractional Squirrel–Dolphin Echolocation (FrSqDE) algorithm-based Deep Belief Networks (DBNs) for improving the energy effectiveness of different heterogeneous network minimized the delay and energy consumption although it did not include the adaption of QoS requirements in heterogeneous wireless network. Parambanchary, D. and Rao, V.M., [19] developed Whale Optimization Algorithm (WOA)-neural network (NN) for solving the handover indications in heterogeneous network. Here, the method highly increased the success rate whereas it did not perform in actual time utilizing many sophisticated optimization techniques. Pradeep, M. and Sampath, P., [22] presented an optimized multi-attribute vertical handoff technique for heterogeneous wireless network decreased the count of handoff failures as well as needless handoff cases though it failed to include the adaptation of parameters in Dynamic Network Selection Function (DNSF) for improving the performances. Politis, I., et al. [21] developed a scheme called QoE-driven handoff approach for scalable as well as single layer coding. This approach is very flexible and permits several precise rate adaptations, which maintains the Peak Signal to Noise Ratio (PSNR). It also minimized the count of handovers but further improvement of packet loss rate was not performed. Vallati, C., et al. [26] introduced Handoff procedure based on the link quality forecasting model, which reduced the video outages and hence improved the end-user QoE. However, this method failed to utilize cross-layer optimization QoS for changeable bit rate uses and not examined the prediction ability of the handoff technique. Pyun, J.Y., [23] devised a Context-Aware Streaming Video System for vertical handover (VHO) over wireless overlay network provides finest QoS and throughput during VHO, even if this technique did not solve the utilization of minimal bandwidth and the complexities of less computation at server region problems. The mean square error (MSE) obtained by the cross validation is taken into consideration as the fitness function for the Aquila to select the optimal features. Grace, M. et al. [6] selected the optimal features from the CSV file based on the prediction accuracy by cross validation using the Aquila optimizer. The proposed approach can predict malware from an Android application in real time. A two-dimensional reciprocal cross entropy multi-threshold-based lung parenchyma segmentation approach using an upgraded firefly algorithm was proposed by Guowei Wang et al. [27]. The findings of the experiment demonstrate that it not only segments COVID-19 lung parenchyma more correctly, but also does it with less processing time. Each lung parenchyma segmentation approach, however, has a robustness restriction due to the degree of variation in the anatomical properties of each person’s chest.
Main challenges
The challenges met by prevailing vertical handoff techniques in heterogeneous wireless network are explained below,
In [7], Deep Reinforcement Learning (DRL) method was established for Artificial Intelligence (AI)-enabled handoff management for the purpose of dense WLANs. However, in this technique DRL along actual scenarios having multiple agents or various mobile stations (STAs) were not included that lead to better performance output.
QoE was designed in [3] with better performance in power consumption in terminals and service charges. However, this method cut-off the overall capability of performance in network.
Multi objective model enabled vertical handoff algorithm was proposed in [28] for heterogeneous wireless networks, but this model lack in maintaining a balanced equilibrium between network resource utilization and terminal QoS.
The algorithm named, Trust and privacy based vertical handoff decision was developed in [4] for application in tele-cardiology heterogeneous wireless networks. Here, handover latency is increased by addition of more attributes and hence it is compulsory to deliver a proper balance between convergence speed and network selection attributes of handoff method to attain improved performance of network.
Vertical handoff is a term referred to switching off different networks. Mobile Terminals are dynamic and get affected by this vertical handoff, which is an important challenge for Mobile Terminal to pick the exact available network that could effectively eradicate the ping pong issue.
Call blocking and poor streaming video quality are only two examples of the services that have been discontinued as a result of the networks’ quick network changes and vertical handoffs. Better handoff management methodologies in heterogeneous wireless networks are required to eliminate these problems and assist users in selecting the best available networks to prevent ping pong.
System model
This section interprets the system model for handoff management in context aware video streaming-based heterogeneous wireless networks.
Assumptions
Let us consider a WLAN, which is deployed in a cellular coverage area with number of small cells. Suppose U is considered as the group of Access Point (APs) in the wireless coverage zone and it is given as,
Here, the term
Preliminaries
This part enumerates the preliminaries for comprehending the mechanism of structure in the heterogeneous wireless network. Such considerations are desirable and can be varied accordingly with the network scenario. Below, description of certain assumptions is given:
Heterogeneous devices
Each device involved in wireless network possesses various configurations. As an illustration, the diverse configuration involves computational abilities, mobility pattern of device, battery needs, network interface regulation, and so on.
Communication radius model
The coverage area and its interaction system of the device C with radius r and positioned at d from other component is represented as follows,
Here, the term C implies the coverage distance and
Scalable network (SN)
In an adjustable network, if the wireless area is regarded in the closed area and handover offered in various MNs, then every nodes preferred continual handoffs in the MSN. Let us assume that the 100 nodes are installed in the closed region with the area of
Introduced vertical handoff mechanism utilizing FAO
The fundamental intention of this article is to establish an efficient model for vertical handoff in heterogeneous wireless network utilizing designed FAO algorithm. This proposed scheme accomplishes the vertical handoff management depending upon the efficient network selection model. Initially, the sensor nodes ensure required data level and if the level is below a predefined threshold, it stimulates the handover process. Thereafter, the process of network selection is effectively done utilizing newly designed FAO by considering various factors, such as Jitter, Handoff latency, and RSSI. Moreover, the designed FAO algorithm is derived by the consolidation of AO and FA. The final step is the handover decision step, which is done utilizing DQN by considering the distance between MS and AP, SNR along with the fitness parameters, wherein the hyper parameters of DQN is effectively fine tuned using FAO in order to enhance the performance over the epochs. If the decision is ‘Yes’, then estimate network bandwidth and finally adjust the bit rate of the video. Figure 1 illustrates the schematic view of vertical handoff mechanism utilizing devised FAO algorithm.

Block diagram of vertical handoff mechanism using the proposed FAO algorithm.
In recent years, the smart phones are increasing in diverse applications, such as video chatting, management of position, transportation systems, online games, social networking and so forth. The above said applications are broadly divided into four parts, such as streaming, background, conversational, and the interactive. It is necessary to have low data rate to run an application over MN device without any complications. As an illustration, if a Skype application is processing in a MN device, it needs a minimum data rate of 128 kbps. Similarly, if L count of applications processing over MN device, then it is denoted as
The steps involved in the vertical handover are handover triggering, choosing of network, and handover execution. The triggering mechanism is employed for ensuring overall count of applications running in an MN device. If the application count is considered to be null, then the mechanism of stimulation gets stopped or else it ensures the needed data rate continuously processing over MN’s device. The threshold value is expressed
If the data level goes beyond
If the data level d is higher than
By following these two conditions, the data level guarantee is adjusted strongly after continuous checking and this always explores the data rate after the predefined period that decreases the effectiveness of the checking process. The overall mechanism of the handover triggering step is represented in the flow chart illustrated in Fig. 2.

Flow chart of handover triggering.
Most of the handover process are purely relied on link quality factors, like RSSI, bandwidth and SNR to select the best target radio range. Hence, in wireless networks, the context-aware will be highly sufficient by means of providing the user various services and network ability. Therefore, there is a crucial requirement for context-aware in order to choose the suitable networks, like RSSI, coverage region, available bandwidth, response time, delay, jitter, user preference, cost, and so on [17]. Here, the optimal network is selected for vertical handoff mechanism using proposed FAO.
Network selection
Once the handover stimulation is commenced, the MN begins to seek the obtainable neighborhood networks. If the MN is moving fast in a coverage zone of WiFi, it needs often transferring from one AP to another. This kind of switching results in high energy utilization, huge packet loss and connection breakage. To mitigate this situation, an effective algorithm is proposed for optimized network selection scheme using FAO.
The prime function is to select the optimal network by MN with minimum parameter result. The parameters chosen for network selection are indirectly proportional to QoS. Therefore, the fitness factor is employed to estimate the optimal network from the solution set by considering the factors, such as jitter, handoff latency, and RSSI. It is expressed as follows,
Here, K refers the total number of users,
The solution encoding refers the optimal selection of the network from the neighboring cells for handover mechanism. Figure 3 shows the solution encoding.

Solution encoding.
This section elaborates on the developed FAO algorithm for network selection, wherein the introduced FAO algorithm is developed by adapting the high soaring nature of the Aquila with a vertical stoop in the AO algorithm [1] by utilizing the behavior of firefly in the FA [32].
AO, a population-based optimization technique, draws its inspiration from the natural behaviors of Aquila as they pursue their prey. Thus, the four methods used in the AO algorithm’s optimization processes are: high soar with vertical stoop to select the search space; contour flight with short glide attack to explore within a diverge search space; low flight with slow descent attack to exploit within a converge search space; and walk and grab prey to swoop. The AO technique can handle real-world applications and has a fast rate of convergence.
In the tropical and temperate zones, the summer sky is a breathtaking sight when lit up by fireflies. There are over 2,000 different kinds of fireflies, and the majority of them generate brief, cyclical flashes. For a particular species, the flash pattern is frequently distinct. The bioluminescence process that causes the flashing light is still being debated as to the exact purposes of these communication systems. However, these flashes have two primary purposes: to attract possible prey and to communicate with potential mates. Furthermore, flashing might act as a safeguarding warning system. As part of the signal system that unites both sexes, the rhythmic flash, the rate of flashing, and the duration together play a role. In the same species, females react to a male’s distinctive flashing pattern, but in other species, like the photuris, female fireflies can imitate another species’ mating flashing pattern in order to entice and devour male fireflies who might misinterpret the flashes as a prospective appropriate partner. It is feasible to formulate the flashing light in a way that links it to the objective function that needs to be optimised, which opens up the possibility of creating novel optimization methods.
By merging AO and FA together, the developed FAO effectively determines the optimum solution in a short time and with high accuracy. The algorithmic procedures of the developed FAO algorithm are detailed below.
The population of Aquila is initialized first, wherein the location of the Aquila corresponds to the candidate solution and is represented by,
Here, A indicates the overall count of the candidate solutions and dim specifies the size of the problem, wherein each candidate is given by,
Wherein,
Fitness of the solution is already explained in part (i) of the current section, and the best solution corresponds to the solution with minimal fitness.
In this stage, Aquila discovers the location of food source and flies to a higher level above the ground and performs exploration of the prey. This approach is employed for catching flying birds, where the Aquila soars up with a vertical stoop and this is modeled using below expression.
Here, t expresses the current iteration and
Assume,
Now, consider the movement of firefly towards the brighter firefly in the FA algorithm, this can be expressed as,
Here,
Assume,
Substituting Eq. (13) in Eq. (9), we get
The above equation is utilized for determining the location of the Aquila in the next iteration after it soars up high and performs a vertical stoop.
In the second technique, Aquila flies at a lower level above the ground after finding the prey and prey is then encircled. The Aquila prepares itself for landing and attacking. This method is referred as contour flight with short glide attack and is employed for catching flying or running preys, such as seabirds, breeding grouse, or ground squirrels. The Aquila performs exploration of a specific area and this behaviour is indicated by,
Here, b and κ are arbitrary values with values in the limit of
Further, k and l are utilized to represent the spiral shape of the search and is given by,
Here,
In the third technique, the Aquila is prepared for its land and attacking; wherein it performs a vertical descend for attacking the prey. This technique is called low flight with very slow descent attack, where Aquila attacks the prey by selecting it and landing on the victim’s back or neck. This approach is used for catching slow preys, like tortoises, foxes, hedgehogs, and rattlesnakes. The Aquila performs exploitation of the area around the victim for catching it. This is expressed as,
Here,
In the fourth approach, Aquila walks on the land and pulls its victim, as soon as it gets closer to the victim. This approach is known as “walk and grab prey” and is employed for catching young ones of larger victims, like sheep or deer from the coverage area. This is expressed using below expression.
Here, R represents the quality factor employed to equilibrium the search techniques,
Here,
The best solution is computed by considering the fitness of the candidate solution, wherein the solution with the lowest fitness corresponds to the optimal result.
The aforementioned steps are iterated continuously till the optimal result is attained. The pseudo code of devised FAO is displayed using Algorithm 1.

Pseudo code of the proposed FAO algorithm
Thus the proposed FAO algorithm effectively performs network selection with high convergence rate. Further, the amalgamation of the AO and FA algorithms has effectively minimized the computation time and has enhanced the performance of the optimization process.
Once the optimal network is selected using proposed algorithm, the decision is to be made for handover mechanism using DQN by considering the parameters, such as distance, SNR, MS speed along with the fitness parameters.
DQN [29] is an eminent technique in reinforcement learning, which utilizes the Q-learning method and also employs CNN to approximate the action-value function referred as Q-function. Deep Q-Learning was developed primarily to manage environments with continuous activity and states. Small and discrete environments can benefit from the simple Q-Learning method. DQN also offers the option to anticipate the state-value function, which is an interesting feature. In rare cases, reinforcement learning is said to be unstable or even to different if the non-linear function approximator like neural network (NN) is employed to illustrate the Q-function [24]. The major reasons for this instability are the correlations existing in the series of state scrutinization
In order to process the experience replay, the agent’s experience at time period
The dataset W is also referred as replay memory. While learning, Q-learning upgrades are applied over experience samples
Here, y refers the award, the discount factor is denoted as γ, and
Here, the term

Architecture of DQN.
In order to provide better decisions for handover mechanism, the hyper parameters of DQN are optimally fine tuned using same proposed FAO algorithm, which is already described in Section 4.2.1. If the decision is ‘Yes’, calculate network bandwidth to adjust the bit level of video.
If the decision is made as ‘Yes’ by DQN to accomplish the handover process, it is necessary to determine the bit rate. Initially, the bandwidth of the selected network is calculated and then, the bit rate of the video is adjusted accordingly.
The network bandwidth [21] is continuously monitored by investigating the network with a default stream of duplicate RTP packets. The user is acknowledged for required capacity or the existing bandwidth by receiving the RTCP messages that conveys this data to the message leader.
Videos can be encoded into various bit rate versions with different file sizes. If the bit rate is low, the file size of the video is also smaller and it results low resolution with better smoothness level. Hence, it is significant to fine tune the bit rate to provide smoothness. The main objective of the bit rate adjustment is to guarantee the smooth playback in order to tune the bit rate version of mobile users and hence, the overall network traffic can be regulated [8]. Once the bandwidth is calculated, bit rate of the video is adjusted accordingly so that the user can enjoy the application without any interruption in the network. The graphical illustration of bit rate adjustment is given in Fig. 5.

Illustration of bit rate adjustment.
The simulation results of developed FAO-based DQN for handoff management in context aware video streaming-based heterogeneous wireless network are explicated in this part.
Experimental setup
The execution of developed FAO driven DQN is done in personal computer with python 3.7, pycharm, Intel i3 processor in Windows 10 operating system. Table 1 shows experimental setup details.
Experimental setup details
Experimental setup details
The developed FAO-based DQN is processed using Alankar Kotwal Implementation data [12]. The video frames are extracted from a video and it is stored in data folder. The collected video frame is existing in “RGB” color format.
Performance indicators
The metrics, like handoff latency, energy consumption, throughput, call drop, handover delay, and PSNR are considered for evaluating the performance of introduced handoff management system.
Call drop
It is defined as the ratio of quantity of call drop times to amount of call setup success times, which is represented as,
Energy consumption
It is computed by total quantity of energy consumed during implementation process.
Handover delay
It is referred as total delay produced by mobile node during handover, while re-establishing an enduring session from switch in source eNodeB to switch in destination eNodeB.
Throughput
This metric estimates total number of data packets transmitted by a channel in certain time interval.
Handoff latency
This metric is mainly depending on latency based on handover initialization, handover decision and execution.
PSNR
PSNR is estimated by proportion of highest possible signal power for corrupting noise power.

Experimental results for developed DQN-FAO (a) input image, (b) 360 pixel, (c), 480 pixel, (d) 720 pixel, and (e) 1080 pixel.
The experimental outcomes for vertical handoff management in context aware video streaming-based heterogeneous wireless network are explicated in this section. Figure 6 shows the sample result proposed DQN-FAO for hand off management. Figure 6(a), (b), (c), (d) and (e) depicts the input image 3360 pixel, 480 pixel, 720 pixel, and 1080 pixel
Comparative techniques
The prevailing vertical handoff techniques, such as Deep reinforcement learning [7], Random NN-based QoE estimation [3], Multi objective model [28] and DBN [20] are considered for comparing the performance of designed FAO technique. In addition, Sun Flower Optimization (SFO) algorithm [5], Sail Fish Optimization model [25], Aquila Optimizer (AO) [1], and Firefly Algorithm (FA) [32] are considered for evaluating the performance of developed FAO technique.
Comparative assessment
The comparative evaluation of the generated DQN FAO for various metrics with time and bit rate is shown in Fig. 7. The analysis of DQN-FAO for call drop, energy consumption, handover delay, throughput, and handoff latency and time duration 15 is shown in Fig. 7(a), (b), (c), (d), and (e). The developed DQN-FAO has a call drop of 0.4518, compared to existing techniques’ call drops of 0.5284, 0.5266, 0.5130, and 0.4943, and performance gains of 14.49%, 14.20%, 11.92%, and 8.60%. The existing and designed procedures achieved energy consumption of 7.918 J, 6.706 J, 7.638 J, 6.705 J, and 6.420 J in addition to obtaining handover delays of 14.60 ms, 14.29 ms, 14.53 ms, 14.46 ms, and 10.72 ms. DQN-FAO has improved performance by 18.92%, 4.26%, 15.94%, and 4.24% in terms of energy usage. Deep reinforcement learning has a throughput of 13.088 megabits per second (Mbps), Random NN based QoE estimation at 13.098 Mbps, multi-objective model at 13.102 Mbps, DBN at 13.103 Mbps, and DQN FAO at 13.108 Mbps. Deep reinforcement learning, Random NN based QoE calculation, the multi-objective approach, and DBN had handoff latency values of 85.325 ms, 81.385 ms, 82.038 ms, and 78.863 ms, respectively. The study of DQN FAO for PSNR is shown in Fig. 7(f). When bit rate 3 is taken into account, the PSNR of the existing and created DQN-FAO is 36.56 dB, 36.94 dB, 37.96 dB, 39.67 dB, and 43.26 dB.

Comparative analysis of devised DQN-FAO (a) call drop, (b) energy consumption, (c) handover delay, (d) throughput, (e) handoff latency and (f) PSNR.
Figure 8 shows the algorithm estimation of the designed FAO + DQN for a number of performance measures. The algorithm analysis of the created FAO + DQN for call drop, energy consumption, handover delay, handoff latency, and throughput is shown in Fig. 8(a), (b), (c), (d), and (e). At time 15, the strategies SFO + DQN, sailfish + DQN, AO + DQN, Firefly + DQN, and FAO + DQN achieved call drops of 0.510, 0.4934, 0.5092, 0.4744, and 0.4563, while energy consumption is 7.769 J, 8.029 J, 7.193 J, 7.027 J, and 6.677 J, respectively. When the time is 15, the handover latency of the designed FAO + DQN is 12.435 Mbps, compared to the 14.216 Mbps, 13.927 Mbps, 13.670 Mbps, and 12.657 Mbps of the existing techniques. The developed and implemented methods achieved handoff latencies of 93.255 Mbps, 85.818 Mbps, 76.353 Mbps, 76.284 Mbps, and 68.55 Mbps in 15 minutes. The throughput for SFO + DQN, Sailfish + DQN, AO + DQN, Firefly + DQN, and FAO + DQN is 12.696 Mbps, 12.857 Mbps, 12.889 Mbps, and 12.893 Mbps, respectively.

Comparative analysis of FAO algorithm (a) call drop, (b) energy consumption, (c) handover delay, (d) throughput, (e) handoff latency and (f) PSNR.
Table 2 discusses how the created DQN FAO compares to other traditional approaches for various parameters. The call drop of the planned DQN-FAO is 0.5122, compared to the 0.5869, 0.5708, 0.5494, and 0.5342 call drops of the existing techniques at time 20. In addition, handover delays of 14.55 milliseconds, 14.24 milliseconds, 11.43 milliseconds, 14.31 milliseconds, and 10.54 milliseconds were obtained using the existing and proposed procedures when time was 20. Deep reinforcement learning has a throughput of 13.16 Mbps, Random NN based QoE estimation, multi-objective model, DBN, and DQN FAO have throughputs of 13.16 Mbps, 13.16 Mbps, and 13.17 Mbps respectively for time 20. The designed DQN FAO is achieved for handoff latency at 93.80 ms, while deep reinforcement learning, Random NN based QoE estimation, multi-objective method, and DBN are at 103.61 ms, 102.78 ms, 100.71 ms, and 97.93 ms of time 20, respectively. The PSNR of the existing and developed DQN-FAO at bit rate 4 is 38.47 dB, 39.45 dB, 40.37 dB, 43.45 dB, and 46.89 dB.
Comparative discussion
Comparative discussion
This paper presents the devised DQN-FAO algorithm for performing the handoff management. The flow of steps involved in the developed system is network selection, handover decision, QoS parameter computation and bit rate adjustment. Here, the network selection is done using FAO algorithm, which is modeled by adapting the advantages of both FA and AO. FA is a metaheuristic optimization algorithm, which is modeled based on its flashing aspects. Generally, the fireflies utilize its flashing strategy to three functions, such as mating, communication and food exploration. Likewise, AO is also an optimization approach, which considers the food exploration aspects of Aquila. In the developed model, the attractiveness of firefly algorithm is adapted to update the location in AO such that the optimal network selection is obtained. The decision is made by DQN for handover mechanism, wherein the hyperparameters are optimally adjusted using proposed FAO. Based on the handover decision made by DQN, context aware video streaming is performed by adjusting the bit rate of the videos depending upon the network bandwidth. Moreover, the analysis is done by comparing the devised approach with some of the traditional methods in order to reveal the efficacy of invented scheme. From the analysis, the devised scheme provided the superior performance than the conventional methods based on the call drop, energy consumption, handover delay, throughput, handoff latency, and PSNR of 0.5122, 7.086 J, 10.54 ms, 13.17 Mbps, 93.80 ms and 46.89 dB. In future, the performance of handoff management is enhanced by including excess parameters to the fitness function.
