1 Introduction

Recent years have witnessed a rapid growth of ubiquitous applications in the Internet with a vast spread of multimedia streams. This makes providing differentiated quality of service (QoS) for such applications in Wireless Local Area Networks (WLANs) very challenging task. Besides, several wireless technologies have been risen from amongst them IEEE 802.11 [21] has assumed as a de facto standard in WLANs due to some of its key features like deployment flexibility, infrastructure simplicity and cost effectiveness [2]. In IEE 802.11 WLANs, the QoS of multimedia communications cannot be efficiently achieved due to frequent collisions and retransmission [1]. IEEE 802.11 introduces two channel access modes, namely Distributed Coordination Function (DCF) and Point Coordination Function (PCF). The former is the mandatory medium access method which is appropriate to serve best effort applications such as HTTP, FTP and SMTP. Multimedia streams that require a certain QoS level are served during the controlled mode (i.e. PCF) since it provides a contention-free polling-based access to the channel to provide the demanded QoS. However, due to the fact that PCF only operates on the Free-Contention period, which may considerably cause an increase in the transmission delay, especially with high bursty traffics it considered not efficient for serving the applications that required high QoS constraints. Therefore, IEEE 802.11 Task Group e (TGe) has established IEEE 802.11e protocol [23] which then introduced a revised version [22] with new technical enhancements on MAC and Physical layer.

The QoS support of IEEE 802.11 standard has been extended in IEEE 802.11e by means of hybrid coordination function (HCF). Enhanced distributed channel access function (EDCF) which extends DCF, provides a prioritized QoS throughout its distributed access manner to the wireless medium. HCF controlled channel access (HCCA) which extends PCF that works based on a centralized polling mechanism to provide differentiated service, according to rigid QoS parameters negotiated with the centralized coordination (HC). EDCA introduces a random access to the wireless medium by means of access categories (ACs). The traffics are mapped to ACs according to their priority. Every AC will be associated with a backoff timer so that the highest priority ACs will go through a shorter backoff process. Despite EDCA provides QoS support, it is still not efficient for application with rigid QoS requirements.

The QoS level required by the delay-sensitive application can be better met by HCCA since it is designed to minimize the control messaging overhead imposed by the distributed approach of EDCF. Besides, strict QoS support of HCCA can be exploited to transmit some TCP-based services such as web-enabled YouTube progressive video download [38]. Indeed, provisioning QoS of CBR applications such as G.711 [26] audio streams and (H.261/MPEG-1) video [43] is suitable. Whereas, HCCA fails to support VBR traffics based on their declared TSPEC parameters as they show irregularity in their profile during the time. Two-third of all traffics in the networks will be video by 2017 according to Cisco Visual Networking Index [24]. This motivates many researches to be carried out to improve the QoS provisioning in WLANs in terms of supporting QoS for such streams. Recently, Motion Picture Experts Group type 4 (MPEG-4/H.264) has become a prominent video the internet due to its scalability, error robustness and network-friendly features [35]. In HCCA, scheduling VBR streams with fixed TXOP throughout their lifetime can provide sufficient yet not favorable service quality as the data rate of such streams are likely varied from its mean characteristics during the time. This can lead to a noticeable increase in the packet delay and deterioration in channel utilization.

More recently, a large deal of video consumed on mobile devices has been prerecorded video which is considered non-realtime video [17]. The pre-recorded videos can be utilized in several applications and forms such as streaming of video surveillance, providing a large amount of access to knowledge in educational systems [8] and in peer-to-peer (P2P) streaming where users can share stored their video content [5]. Besides, the spreading of User-Generated Content (UGC) for instance, pre-recorded/stored videos such as YouTube, Dailymotion, Metacafe have become noticeable in Internet. In the stored video, the entire traffic can be precisely identified to allocate an appropriate amount of bandwidth so as to keep the cost functions (such as packet delay) minimized. In wireless infrastructure environments the scheduling of the uplink stream considered more complicated than downlink streams as the coordinator does not aware about the queue status of the terminals [45]. Motivated by the widespread of pre-recorded videos in nowadays wireless networks and the availability of the entire video traffic behavior prior to the traffic setup, we introduce a new adaptive scheduler. This scheduler exploits the availability of pre-recorded video profile in order to leverage the scheduling process. To the best of our knowledge, the scheduling of uplink stored continuous media in HCCA has not been addressed efficiently in WLANs. A novel scheduler is presented in this paper aiming at adapt to the fast changes of VBR video traffic profile. Some researches have highlighted this fact, such as that in [13, 19, 37]. The proposed scheduler utilizes the availability of the entire characteristics of pre-recorded streams for scheduling purposes. It computes the TXOP for a traffic stream based on knowledge about the actual frame size instead of assigning TXOP according to mean characteristics of the traffic which is unable to reflect the actual traffic. This scheduler exploits the Queue Size field (QS) of IEEE 802.11e MAC header to transfer this information to the HC. Furthermore, since the poll message can be overheard by all QSTAs within the BSS, broadcasting one multiple poll message to all QSTAs in the SI rather than send one poll for each will minimize the poll overhead and the packet delay with improving the bandwidth utilization. Hence, to leverage the benefit of the proposed algorithm a multi-polling feature has been integrated. We exploit similar multi-polling frame to that reported in [9, 28] which is illustrated in details in Sect. 3.1.

The rest of the paper is organized as follows. Section 2 illustrates the reference HCCA mechanism and its deficiency in supporting VBR Video streams and demonstrates some of the HCCA related works. Section 3 illustrates the proposed dynamic algorithm. The performance evaluation including simulation and analytical results is discussed in Sect. 4. Section 5 concludes the work presented in this paper.

2 Background and related works

This section presents a description of IEEE 802.11e HCCA scheduler and some MPEG-4 video characteristics. The deficiency of HCCA in supporting VBR is illustrated and some related works in enhancing its performance are also discussed.

Fig. 1
figure 1

Controlled channel access mechanism in IEEE 802.11e HCCA

2.1 IEEE 802.11e HCCA mechanism

In IEEE 802.11e, the channel divided into occurrences of subsequent superframe which includes Free Period (CFP) and Contention Period (CP). CFP is reserved for centralized transmission, which controlled by HCCA function. However, Controlled Access Phase (CAP) can be initiated during CFP and CP as well when the medium remains idle for at lease a PIFS time. This property is one of the key merits of HCCA that make HCCA overcome the legacy PCF function in that it shorten the time the packet waits in its transmission queue. The scheduling process of QSTAs is discussed in details in the next section. Figure 1 demonstrates an example of HCCA transmission during CFP and CP periods.

2.2 IEEE 802.11e HCCA scheduler

In order to initiate an uplink traffic, the QSTA issues a QoS reservation through transmitting an ADDTS-Request frame. This frame carries information about the TSPEC which is required by HC for scheduling purpose. The list of mandatory fields of the TSPEC is described in [18]. The HC which usually resides in the QoS-enabled Access Point (QAP) which maintains the TSPECs of all TSs in the polling list. The HC computes the TXOP for each QSTA.

Upon receiving an ADDTS-Request from a QSTA, the HCCA scheduler of the HC goes through the following steps to schedule the uplink traffics:

  1. 1.

    SI Assignment The scheduler calculates the Service Interval (\(SI\)) as the minimum of all Maximum Service Intervals (\(MSI\)) of all admitted traffic streams which is a submultiple of the beacon interval, the time between two subsequent TBTTs. The minimum \(MSI\) for each QSTA is obtained from Eq. (1).

    $$\begin{aligned} MSI_{min} = min(MSI_{i}),\quad i =1, 2, 3, \ldots , n \end{aligned}$$
    (1)

    where \(n\) is the number of admitted TSs and \(MSI_{i}\) is the maximum \(SI\) of the \(i^{th}\) stream. The \(SI\) is computed so that it satisfies the condition in Eq. (2).

    $$\begin{aligned} SI = \frac{BeaconInterval}{x} \le MSI_{min}, \end{aligned}$$
    (2)

    where the denominator, \(x\), is an integer number that divides the beacon interval into the largest number that is equal or less than the \(MSI_{min}\).

  2. 2.

    TXOP Allocation HC allocates a TXOP to each admitted QSTA so as to enable it to transmit its data with regards to the negotiated QoS parameters of the TSPEC. For the \(i^{th}\) QSTA, the scheduler computes the number of MSDUs arrived at the mean data rate \(\rho _{i}\) as in Eq. (3).

    $$\begin{aligned} N_{i} = \left\lceil \frac{SI \times \rho _{i}}{L_{i}} \right\rceil , \end{aligned}$$
    (3)

    where \(L_{i}\) is the nominal MSDU length for the \(i^{th}\) QSTA. Then the TXOP duration of the particular station, \(TXOP_{i}\), is calculated as the maximum of the time required to transmit \(N_{i}\) MSDU or the time to transmit one maximum MSDU at the minimum physical rate \(R_{i}\), as stated in Eq. (4).

    $$\begin{aligned} TXOP_{i} = max\left( \frac{N_{i} \times L_{i}}{R_{i}}, \frac{M{i}}{R_{i}}\right) + O \end{aligned}$$
    (4)

    where \(M\) is the maximum MSDU and \(O\) denotes the overhead, including MAC and PHY headers, inter-frame spaces (IFSs), and the acknowledgment and poll frames overheads.

  3. 3.

    Admission Control The ACU manages the TSs admission while maintaining the QoS of the already admitted ones. When a new TS demands an admission, the ACU First obtains a new \(SI\) as shown in the previous step and computes number of MSDUs arrived at the new \(SI\) using Eq. (3). Next, it calculates the \(TXOP_{i}\) for the particular TS using Eq. (4). Finally, ACU admits only the TS if the inequality in Eq. (5) is satisfied.

    $$\begin{aligned} \frac{TXOP_{ n + 1}}{SI} + \sum _{i = 1}^{n} \frac{TXOP_{i}}{SI} \le \frac{\varDelta T - T_{CP}}{\varDelta T} \end{aligned}$$
    (5)

    where \(n\) is the number of currently admitted TSs, so that (\(n + 1\)) represents the incoming TS, \(\varDelta T\) is the time interval between two consecutive TBTTs periods (i.e. \(\varDelta T= T_{i+1}^{TBTT}-T_{i}^{TBTT}\)), beacon interval and \(T_{CP}\) is the duration reserved for EDCA. The HC sends an acceptance message (ADDTS-Response) to the requested QSTA if the condition in Eq. (5) is true or send a rejection message otherwise. The accepted TS will be added to the polling list of the HC.

2.3 Variable bit rate MPEG-4 video traffic

Variable bit rate (VBR) video can be classified in terms of variability of the traffic into two types: variable in packet size such as MPEG-4; and variable on the generation interval such as H.263. As this research is aimed towards enhancing the TXOP allocation we have chosen MPEG-4 video.

MPEG-4 is an efficient video encoding covering a wide domain of bit rate coding ranging from low-bit-rate for wireless transmission up to higher quality beyond high-definition television (HDTV) [15]. For this reason, MPEG-4 video coding has become from among the prominent videos in the internet nowadays. This variability in the compression level is adequate to transmit the video packets over the limited wireless network resources such as low bit rate. Table 1 displays excerpt of video trace file of Jurassic Park 1 movie [14] encoded using MPEG-4 at high quality. We refer the reader to [29, 30, 44] for more details about MPEG-4 videos. Basically, the weakness of HCCA in supporting VBR traffic is because of the lack of information about the abrupt changes of the traffic profile during the time, more particular the traffic burstiness issue.

Table 1 A fragment of Jurassic Park 1 Trace file encoded using MPEG-4 at high bit rate

2.4 Transmission of MPEG-4 video in HCCA

Although HCCA guarantees a QoS for video traffic based on the required TSPEC parameters, there is a probability to have frames smaller than the mean negotiated MSDU size. Consequently, a larger TXOP than needed will be assigned to a QSTA causing wasting in wireless channel time and remarkable increase in the end-to-end delay. Figure 2 (upper part) illustrates the effect of assigning TXOPs for VBR traffics based on the mean TSPEC parameters on increasing the packet delay and on the poor wireless channel utilization. Suppose there are three stations sending uplink video traffics to QAP. HC will accordingly assign \(TXOP_{1}, TXOP_{2}\) and \(TXOP_{3}\) to \(QSTA_{1}, QSTA_{2}\) and \(QSTA_{3}\) respectively. Assume that in any \(SI\) some or all the frames sent is considerably smaller than the negotiated MSDU for the TS, in this case the QSTA will only utilize a portion of the scheduled TXOP for sending its data as explained in the \(SI_ {i} \) and \(SI_{i + 1}\). The next scheduled TXOP will initiate according to its scheduled time, regardless the actual exploited time in the previous TXOP causing increases in the delay and wasting the channel time as well. This issue may be noticeably severe when the number of QSTA with VBR traffics increase. Moreover, in the transmission of the pre-recorded video, the traffic behavior is known prior to the traffic setup [16]. These observations motivate us to present an enhancement to HCCA scheduler in which HC exploits information sent by QSTA about the changes in the traffic profile so as to accurately assign TXOP to QSTA and advance the consecutive TXOPs to minimize the delay and add the residual wireless channel time to EDCA period. The proposed scheduler is presented in details in the next section.

Fig. 2
figure 2

Adaptive TXOP assignment algorithm

Several approaches such as [6, 7, 27, 32, 33, 41] have been presented in the literature attempting to remedy the deficiency of the HCCA reference scheduler in supporting QoS for VBR traffics. However, these enhancements are still not sufficient to cope with the fast fluctuating nature of highly compressed video applications since the QSTAs are scheduled according to an estimation about the uplink TSs characteristic which may be far from the real traffics. Although, some other approaches such as [4, 9, 20, 46, 47] have been focused on enhancing the polling overhead of HCCA scheduler, there was a lack of consideration for transmitting pre-recorded streams in WLANs.

More particularly, with highly VBR applications, it is likely one of two cases happen at some certain SIs. One case is when the application data rate goes up in which the allocated time is not sufficient to empty the transmission queue of the uplink traffics at the QSTAs which in turn results in increasing the end-to-end delay. There are two possible solutions to remedy this problem [6, 40]. One is by maximizing the TXOP duration more than that allocated with regards to the average TXOP of the traffic. However, it will much degrade the bandwidth utilization when the data rate drops down. The other one is by applying a bandwidth reclaiming scheme [7, 34, 39, 42]. The other case is when data rate drops below its average value declared in the TSPEC of the TS. In this case, the assigned TXOP is not totally exhausted. This issue is addressed in this paper.

2.5 Adaptive TXOP scheduling algorithm

Adaptive transmission opportunity (ATXOP) scheduler [3] is a feedback-based technique which reschedules TS of each QSTA in an SI based on piggybacked information, transmitted to HC, about the next frame length. This algorithm gives an actual TXOP needed by stations and ensures that the end-to-end delay is minimized without jeopardizing the channel bandwidth. An example of the ATXOP algorithm process compared to HCCA scheduler is depicted in Fig. 2. More particularly, the QSTAs are scheduled as in Eq. (4) except that the Mean Size of MSDU (\(L_{i}\)) is the actual frame size reported in the previous received packet instead of using the mean value negotiated in the TS setup phase.

Fig. 3
figure 3

Dynamic TXOP assignment algorithm

3 Adaptive multi-polling TXOP scheduling algorithm

To leverage the advantage of the global scheduler and minimize the control messages overhead, we propose a multi-polling scheme called adaptive multi-polling TXOP scheduling algorithm (AMTXOP) which enhances the previously investigated algorithm in [3]. The benefits of the result scheduler over the previous scheme are twofold, on one hand, it further reduces the delay exposed by transmitting poll messages frame for all the stations that precede the current station. On the other hands, it enhances the channel utilization by freeing some space reserved for the single poll messages which replaced by only one multiple poll message. The delay experienced by the proceeding unused (wasted) TXOPs is minimized using this algorithm as illustrated in Fig. 3. The scheduling operations at both QSTA and QAP are described below.

3.1 Multi-polling frame structure

At the beginning of each CAP, the HC shall have collected information about the next frame size of all admitted TSs in the polling list. Accordingly, the HC compose one multi-polling frame to be broadcast to all QSTAs. The format of the multi-polling frame is shown in Fig. 4. The record number field indicates the number of the QSTAs to be polled in this service interval. The polling control includes: (1) associate identifier (AID) subfield which differentiate QSTAs in the BSS, (2) the TXOP subfield which is the TXOP duration for QSTA determined using corresponding AID. Other fields are described in details in the IEEE 802.11e standard [22]. It is worth noting that the length of the multi-polling frame is based on the number of the QSTAs admitted in the current SI. One can notice that the multi-polling frame proposed here is 1 byte smaller than that in [28] due to eliminating the rate subfield as the rate of the TS is declared in TSPEC request frame. Suppose that 802.11g wireless channel with basic physical rate of 2 is used, the transmission time of multiple single poll frames and one multi-polling frame will be as depicted in Table 2. It is obvious that the time required to transmit one multi-polling frame is for \(N\) STAs is much smaller than that of multiple single frames. The multi-polling transmission time of \(N\) QSTAs is computed as time to transmit \(13+(N\times 4)\) bytes while it is as much as transmitting \(Poll_{size} \times N\) bytes in the case of single poll frames. Moreover, the reduction in poll overhead can be implied from the gain ratio.

Fig. 4
figure 4

Multi-polling frame format

Table 2 Comparison between the transmission overhead of multiple single poll frames and one multi-polling frame measured in unit of \(\upmu \hbox {s}\)

3.2 Actions at the station

At the transmission of the packet of uplink traffic, the QSTA obtain information about the next MSDU frame size from the application layer via cross-layering. This information is carried in the QS subfield of 11e MAC header introduced in IEEE 802.11 standard [22]. This information is exploited by the HC for scheduling purpose, which is discussed in the following subsection.

Bearing in mind that the AMTXOP does not modify the polling list of the HCCA, the order of the polling records will represent the order the QSTA scheduled in. Assume without loss of generality that the polling records are in an ascending order based on their AID, hence, upon the reception of multi-poll frame, the \( QSTA_{i} \) computes the \(backoff_{i}\) that shall wait until it can transmit its data which can be obtained from the summation of \(TXOP{j}\) durations of all preceding stations, where \(j \in [1..i-1]\). Then it initiates a backoff timer waiting for all QSTA ahead to finish their transmission. At the expiration of the backoff timer, it commences its transmission for a duration equals to \(TXOP_{i}\). The algorithm at QSTA is reported in Algorithm 1.

figure e

3.3 Actions at the access point

Upon the reception of the first frame from a \(QSTA_{i}\) at HC, which includes the MSDU size of the next frame, the HC will start maintain this information in \(stations\) list until the next CAP begins. At that time, the TXOP duration will be computed as expressed in Eq. (6).

$$\begin{aligned} TXOP_{i}=\frac{Size_{i}}{R_{i}}+ O \end{aligned}$$
(6)

In the case of having more than one TS at \(QSTA_{i}\), the \(Size_{i}\) will represent a total summation of all next frames lengths in bytes. When no data packet is received in the current \(SI\) due to packet loss, the \(TXOP_{i}\) of \(QSTA_{i}\) will be computed as expressed in Eq. (4). It is worth noting that at the first CAP of any TS, the TXOP is calculated based on Eq. (4) because no information about the next packet size has been reported yet. Instead of sending a single poll for each station, the HC compose and broadcast one multi-polling frame which includes \(AID\) and computed \(TXOP_{i}\).

figure f

4 Performance evaluation

In order to evaluate the performance of the AMTXOP scheduler, we have used a network simulation tool. The simulation environment setup, and video traffic used as uplink traffics is described in details in this section. The performance of our scheduler is compared against the HCCA. The results of end-to-end delay and throughput are also discussed.

4.1 Simulation setup

The AMTXOP scheduler has been implemented in the well-known network simulator (ns-2) [36] version 2.27. The HCCA implementation framework ns2hcca [11] has been patched to provide the controlled access mode of IEEE 802.11e functions, HCCA. The ns-2 Traffic Trace [25] agent, is used for video stream generation.

Fig. 5
figure 5

Network topology

A star topology has been used for constructing the simulation scenario to form an infrastructure network with one QAP surrounded by varying number of the QSTAs ranging from 1 to 12. All QSTAs were distributed uniformly around the QAP with a radius of 30 feet as shown in Fig. 5. The stations were placed within the QAP coverage area, in the same basic service set BSS, and the wireless channel is assumed to be ideal. Since we focus on HCCA performance measurement, all the stations operate only on the contention-free mode by setting \(T_{CP}\) in Eq. (5) to zero. The QSTAs communicate directly without a hidden node problem. RTS/CTS mechanism, MAC level fragmentation, and multirate support were disabled. QAP is the sink receiver, while each QSTA is a video source because only one flow per station is supported by ns2hcca patch. Therefore, for simulating concurrent video streams multiple stations are added each with one flow. In order to leave an ample time for initialization, stations start their transmission after 20 (s) from the start of the simulation time and last until the simulation end. Wireless channel assumed to be an error-free, and no admission control used for the sake of investigating the maximum scheduling capability of each examined algorithm under heavy traffic conditions. Simulation parameters are summarized in Table 3. For evaluating the performance of our scheduler against the reference scheduler of HCCA, Jurassic Park 1 video sequence trace encoded using MPEG-4 was chosen from a publicly available library for video traces [15]. We tested the AMTXOP scheduler on Jurassic Park 1 and Formula 1 trace files which can be classified into movie and sport, respectively, which show different variability level. Table 4 demonstrates some statistics of the examined traces. The selected video is encoded using different Compression ratio, which results in varying quality. In this paper, we have tested the algorithms with low and high-quality video trace. It is worth noting that the variability of the selected videos is measured by the Coefficient of Variation (CoV), which is the standard deviation of the frame size divided by the average frame size. TSPEC parameters used for each video traffic are shown in Table 5 with regards to video QoS requirements.

Table 3 Simulation parameters
Table 4 Frame statistics of MPEG-4 video trace file
Table 5 Video traffic parameters
Fig. 6
figure 6

TXOP allocation of Formula 1 video. a Low-quality. b High-quality

4.2 Results and discussion

Simulations have been carried out to exhibit the performance of the examined algorithms, namely HCCA, an Adaptive TXOP assignment which we refer to as ATXOP onward and the proposed algorithm using a different variability level of the same videos. Since the main objective is to achieve superior QoS support by accurately granting TXOP to the station to fit its need, packet end-to-end delay of the uplink traffics has been measured which considered as one of the significant metrics to evaluate a QoS support of video streams. To validate the behavior of the examined algorithms, the measurements are done for an increasing number of TSs. The system throughput was also investigated to verify that the improvement in the delay is achieved without jeopardizing the channel bandwidth.

The behavior of the examined algorithms in terms of allocating TXOP in each SI is illustrated in Fig. 6 for the Formula 1 video sequence. The allocated TXOP for one flow is shown against a number of SIs for a duration of 30 s. The results reveal the fact of assigning fixed TXOP in HCCA for all SIs of the flow with accordance to Eq. (4). In this case, the HCCA computes TXOP duration based on the maximum MSDU size of the flow, namely 7032 bytes and 14,431 bytes for low and high-quality video respectively. Nevertheless, the both ATXOP and the proposed scheduler alike adaptively allocate a TXOP for each SI based on the actual frame size obtained from the feedback information which show that in some SIs the allocated TXOP duration in HCCA is much higher than the actual need of the flow which considered as over-allocation cases. It is obvious that the TXOP duration given in Fig. 6b is higher than that in Fig. 6a as the mean bit rate of high-quality encoded Formula 1 is considerably higher than that in low-quality video sequence, refer to Table 4.

4.2.1 End-to-end delay analysis

The end-to-end delay is defined as the time elapsed from the generation of the packet at the source QSTA application layer until it has been received at the QAP and is expressed in Eq. (7).

$$\begin{aligned} e2eDelay = \frac{ \sum _{i = 1}^{N} ( R_{i} - G_{i}) }{N}, \end{aligned}$$
(7)

where \(G_{i}\) is the generation time of packet \(i\) at the source QSTA, \(R_{i}\) is the receiving time of the particular packet at the MAC layer of the QAP, and \(N\) is the total number of packets for all flows in the system. The end-to-end delay has been measured for the three video types to study the efficiency of both HCCA and our algorithms with different traffic variability. Figure 7a–d depict the delay experienced by data packets for the low, medium and high-quality video, respectively. One can notice that the end-to-end delay boosts with the increase of the packet size, the highest quality exhibits higher end-to-end delay and vice versa. The increase of the delay in higher quality videos can be justified by the large amount of the allocated to each TS, as in Eq. (4), which leads to maximize the wasted TXOPs that keep the subsequent TSs awaiting in their transmission queue longer time. It is obvious that the proposed scheduler achieved delay enhancement over both HCCA and ATXOP for the examined videos due to the polling overhead reduction in the system. The proposed algorithm achieved delay enhancement up to 12 % over the Adaptive TXOP algorithm [3] and about 59 % over HCCA scheduler. The reason of achieving better improvement in Jurassic Park 1 is the higher packet size comparable to that in Formula 1 thereby the granted TXOP obtained in HCCA is far from the needed TXOP which in turn causes higher packet delay. Furthermore, the delay improvement in our scheduler is justified by the accurate calculation of the TXOP. Unlike the HCCA scheduler that only relies on the mean traffic characteristic which is not reflecting the actual traffic behavior.

Fig. 7
figure 7

Mean end-to-end delay as a function of the number of stations. a Low-quality Jurassic Park 1. b High-quality Jurassic Park 1. c Low-quality Formula 1. d High-quality Formula 1.

4.2.2 Throughput analysis

The aggregate throughput of the examined algorithms has been investigated against the number of stations. This is to verify that our scheduler is efficient in supporting QoS for VBR traffics which maintaining the utilization of the channel bandwidth. The aggregate throughput is calculated using Eq. (8).

$$\begin{aligned} AggregateThrp = \frac{ \sum _{i = 1}^{N} ( Size_{i}) }{time}, \end{aligned}$$
(8)

where \(Size_{i}\) is the received packet size at the QAP, \(time\) is the simulation time and \(N\) is the total number of the received packets at QAP during the simulation time. Figure 8 depicts the aggregate throughput with increasing the network load for the high-quality Jurassic Park 1 videos. The result shows that the throughput is the same as that achieved by the HCCA scheduling algorithm. This implies that the proposed approach enhanced the end-to-delay without jeopardizing the channel bandwidth.

Fig. 8
figure 8

Aggregate throughput as a function of the number of stations. a High-quality Jurassic Park 1

Fig. 9
figure 9

Aggregate TXOP duration as a function of the number of stations. a Low-quality Jurassic Park 1. b High-quality Jurassic Park 1. c Low-quality Formula 1. d High-quality Formula 1

4.2.3 Aggregate TXOP duration

To investigate the efficiency attained by the proposed scheduler in supporting pre-recorded videos, the aggregate TXOP duration is measured and can be defined as the total of TXOP duration, which is the total TXOP duration granted to all QSTAs during the simulation time in units of seconds. In Fig. 9, the aggregate TXOP is shown in the examined videos with increasing the network load. For Low-quality videos Fig. 9a, c demonstrate that allocating fixed TXOP for all TS frames in HCCA might exceed the need of the traffic. In that case only a small portion of the granted TXOP is exploited resulting in what’s called wasted TXOPs. On the contrary, since the proposed scheduler not only operates according to the actual information about frame size, but also further minimizes the polling overhead, the granted TXOP is considerably smaller than that in HCCA and ATXOP with ensuring the same achievable throughput. Thanks to multi-polling concept, the total TXOP duration assigned was as low as 32 and 25 % compared to Adaptive TXOP scheduler in low and high videos, respectively. In the proposed algorithm TXOP duration is smaller than both HCCA and that in [3] as the polling overhead in Eq. (4) is minimized by sending only one multi-poll frame for all QSTAs. In this case, the proposed algorithm shows better enhancement even when the number of station increase in the system.

Table 6 End-to-end delay in units of milliseconds with different physical layer

4.2.4 Number of admitted MSDUs

IEEE 802.11 Physical layer standards differ from each other in terms of physical data rate supported and interframe spaces such as SIFS and PIFS. It is worth noting that the physical layer may support various range of data rate as in IEEE 802.11g while slot time, SIFS and PIFS (referred to in [22] as aSlotTime, aSIFTime and PIFS) are fixed per PHY [22]. It is obvious in Eq. (4) that the TXOP duration of each QSTA is inversely proportional to the PHY rate \(R_{i}\) and directly proportional to SIFS and PIFS. To study this effect, the examined schemes have been also tested over 802.11b. Similar simulation parameters used as listed in Table 3 except physical layer, Slot time, PIFS, Physical preamble length, PLCP header length and Data rate were IEEE 802.11b, 20, 30\(\,\upmu \hbox {s}\), 18, 6 bytes and 11 Mbps respectively. Simulation results in Table 6 reveal that 802.11b only supports 5 Jurassic Park 1 videos while 802.11g can support more than 12 video traffics. In 802.11b, the calculated TXOP duration for the MSDU was 0.00714 s in this case the ACU will reject all new MSDU starting from the 6th MSDU as the CAP will become 0.04284 s which exceed the SI (0.04 s). Due to the high physical rate supported in 802.11g and smaller IFs, the number of accepted MSDU is higher.

4.3 Analytical study of the proposed schemes

Without loss of generality, assume that each \(QSTA_{i}\) in any SI has only one TS. In other words, \(N_{i}\) in Eq. (4) is set to 1, \(\forall i \in [1..N]\), where \(N\) is number of admitted TSs. Bearing in mind that the transmission of MSDU frame in IEEE 802.11 begins with a transmission of Physical Layer Convergence Procedure (PLCP) preamble and header parts, followed by the data payload. Therefore the transmission time of MSDU frame of size L, all in bytes, is calculated as follows:

$$\begin{aligned} T_{MSDU}=T_{preamble} + T_{PLCP} + T_{MAC} + T_{L}, \end{aligned}$$
(9)

where \(T_{preamble}, T_{PLCP}, T_{MAC}\) and \(T_{L}\) is the transmission time of PLCP preamble, PLCP header, IEEE 802.11e MAC header and data payload (bytes) respectively which can be calculated as follows:

$$\begin{aligned}&\!\!\! T_{preamble}=\frac{preamble*8}{PHY_{basic}} \end{aligned}$$
(10)
$$\begin{aligned}&\!\!\! T_{PLCP}=\frac{PLCP*8}{PHY_{basic}}\end{aligned}$$
(11)
$$\begin{aligned}&\!\!\! T_{MAC}=\frac{MAC*8}{PHY_{rate}}\end{aligned}$$
(12)
$$\begin{aligned}&\!\!\! T_{L}=\frac{L*8}{PHY_{rate}} \end{aligned}$$
(13)

The transmission time taken for ACK packet and poll message is calculated similarly as in Eqs. (14) and (15).

$$\begin{aligned} T_{ACK}= & {} T_{preamble} + T_{PLCP} + T_{MAC}, \end{aligned}$$
(14)
$$\begin{aligned} T_{poll}= & {} T_{preamble} + T_{PLCP} + T_{MAC}, \end{aligned}$$
(15)

Therefore, the TXOP duration of \(QSTA_{i}\) is computed as follows

$$\begin{aligned} TD_{i}=T_{L} + T_{poll} + T_{ACK} + 3 \times SIFS + D_{p} \end{aligned}$$
(16)

In order to ensure the accuracy of the calculation, the propagation delay, \(D_{p}\), between the source and receiving channel should be considered which is normally a small fixed value [22] as shown in Table 3.

Accordingly, analytical study of the end-to-end delay for examined schemes will be discussed in the following subsections.

4.3.1 Delay analysis of HCCA

Assuming that the transmission of the traffic for all QSTAs begins as the same time, the end-to-end delay of a \(QSTA_{i}\) can be computed as the total TXOP durations of the preceding STAs in the polling table including the transmission time of MSDU of the \(QSTA_{i}\). For the sake of the simplicity, we assume that the stations are scheduled in ascending order according to their index. That is, the preceding delay of \(QSTA_{i}\) in an SI is computed as in the following equation.

$$\begin{aligned} D_{SI}^{i}=\sum _{j=1}^{i-1} TD_{i} + T_{L}^{i} + T_{poll} + 2 \times SIFS \end{aligned}$$
(17)

As known, any period of time, \(T\), contains a number of consecutive SIs, \(M\), which can be computed for any experiment duration \(T\) as \(\left\lceil T/SI \right\rceil \). Thus, the average aggregated end-to-end delay for the a period of time \(T\) can be calculated as follows:

$$\begin{aligned} D= \frac{1}{M} \cdot \sum \limits _{k=1}^{M} \sum \limits _{i=1}^{N} D_{SI}^{i} \end{aligned}$$
(18)

4.3.2 Delay analysis of ATXOP

Based on the discussed deficiency of the HCCA accommodating to the fast changing in VBR traffic profile and the solution proposed in ATXOP in Sect. 2.5. When \(QSTA_{i}\), at any SI, exploits only portion of its allocated \(TXOP_{i}\) at the traffic setup time, namely \(T_{eff}^{i}\), leaving an unspent amount of \(T_{u}^{i}\). Thus, the following relation can be held:

$$\begin{aligned} \sum _{i=1}^{N}{T}'_{i}= & {} T_{eff}^{1} + T_{eff}^{2} +\cdots + T_{eff}^{N}\\= & {} TD_{1} - T^{1}_{u} + TD_{2} - T^{2}_{u} +\cdots +TD_{N} - T^{N}_{u} \\= & {} \sum _{i=1}^{N} TD_{i} - \sum _{i=1}^{N}T^{i}_{u} \\ \end{aligned}$$

where \(T^{i}_{u} \ge 0, \sum _{i=1}^{N}{T}D_{i}\) and \(\sum _{i=1}^{N}{T}'_{i}\) is the total TXOP scheduled in any SI used in HCCA and ATXOP, respectively. It is worth noting that \(TD_{i}\) is calculated from Eq. (16) which includes the poll overhead.

Thus, the delay of \(QSTA_{i}\) in an SI is computed as follows,

$$\begin{aligned} D_{SI}^{i}=\sum _{j=1}^{i-1} (TD_{i} - T^{i}_{u}) + T_{L}^{i} + T_{poll} + 2 \times SIFS \end{aligned}$$
(19)

The worst case scenario when all QSTAs used their \(TXOP_{i}\) duration that is, \( \sum _{i=1}^{N}T^{i}_{u}=0\) and consequently, \(\sum _{i=1}^{N}{T}'_{i} = \sum _{i=1}^{N}{TD}_{i}\). In this case, ATXOP performs similarly to HCCA, Thus the ATXOP ensures at least HCCA performance.

Having given different \(D_{SI}^{i}\) for HCCA and ATXOP, the end-to-end delay of both schemes can be computed as in Eq. (18).

4.3.3 Delay analysis of AMTXOP

Assume that the transmission time of a single and multi-polling frame is \(T_{poll}\) and \(T_{mpoll}\), respectively. Thus AMTXOP calculates the total TXOP in any SI as follows:

$$\begin{aligned} \sum _{i=1}^{N}{T}''_{i} = \sum _{i=1}^{N}({T}'_{i} - T_{poll}) + T_{mpoll} \end{aligned}$$
(20)

Thus, the delay of \(QSTA_{i}\) in an SI is computed as follows,

$$\begin{aligned} D_{SI}^{i}=&T_{mpoll}+\sum _{j=1}^{i-1}(TD_{i}-T^{i}_{u})- (i-1)\times T_{poll} \nonumber \\&+ T_{L}^{i} + SIFS \end{aligned}$$
(21)

Eventually, the delay enhancement in any SI when multi-polling scheme is integrated can be expressed as follows:

$$\begin{aligned} D= \frac{1}{M} \cdot \sum \limits _{k=1}^{M} \sum \limits _{i=1}^{N} \big (D_{SI}^{i} + T_{L}^{i} + SIFS\big ) \end{aligned}$$
(22)

The simulation results were validated on low and high quality jurassic park 1 video sequence using the setting in Tables 3 and 4 except for the simulation time which set to 50 s. The traffics start at second 20 from the beginning of the experiment run to leave an ample time for initialization process. For this reason, the number of SIs (\(M\)) was 750 which calculated as \(30\,\hbox {s}*1000/40\,\hbox {ms}\).

Fig. 10
figure 10

End-to-end delay validation for low-quality Jurassic Park 1

Figures 10 and 11 show both simulation; and analytical results for the HCCA, AXTOP scheduling schemes were computed using Eq. (18) and AMTXOP in Eq. (22). The simulation and analytical results reveal that the model and the simulation agree for both video sequences with acceptable difference up to maximum 10 % which caused by the disregarded overheads. The improvement achieved by ATXOP over HCCA can be straightforwardly implied from Eq. (19) where the unspent transmission time of a station will be available to the next stations in the polling list which in turn schedule the station. It is worth noting that ATXOP and AMTXOP shows superior enhancement with high-quality video, Fig. 11, compared to HCCA as the TXOP duration in the latter is computed for larger MSDU size and subsequently the unspent TXOP duration \(T^{i}_{u}\) might be high as well. The effect of multi-polling scheme in enhancing the packet delay in the SI of any \(QSTA_i\) is justified by the minimizing the overhead by \((i-1) \times T_{poll}\) in Eq. (21).

Fig. 11
figure 11

End-to-end delay validation for high-quality Jurassic Park 1

Fig. 12
figure 12

Mean delay with different PERs for low-quality Jurassic Park 1

4.4 Performance analysis over error-prone wireless networks

Aiming to measure the performance consistency of the proposed mechanism in error-prone wireless networks, we tested the performance with increasing Packet Error Rate (PER) in the system from 1 to 9 %. In [12, 31] the maximum PER measured is 10 %, we exclude the value of PER that is greater than 10 % as it is unsuitable for most applications [10]. We have chosen the case of 12 QSTAs transmitting to a QAP as a worst case scenario in these experiments. The PER was applied to the packets of the uplink traffics. In spite of the fact that the HC assigns the TXOPs based on information piggybacked in each frame, in the case of packet loss the TXOP will be computed as in the HCCA for one or more SI until the reception happen where the HC resume the Dynamic mechanism. Mean end-to-end delay and aggregate throughput metrics have been measured and the results are discussed as follows.

Fig. 13
figure 13

Mean delay with different PERs for low-quality Formula 1

Fig. 14
figure 14

Mean delay with different PERs for high-quality Jurassic Park 1

  1. 1.

    End-to-end delay analysis

    The results reveal that the proposed mechanism remains stable minimizing the delay even with increasing amount of packet loss. One can notice that, the increase of end-to-end delay of the packets in high-quality videos, Figs. 14 and 15, compared to that of the low-quality ones, Figs. 12 and 13, for the same reasons discussed in Sect. 4.2.1.

  2. 2.

    Throughput analysis

    The throughput achieved by the proposed mechanism by the presence of packet loss is depicted in Fig. 16. The proposed mechanism and HCCA perform similarly attaining the same throughput. Indeed, the achievable throughput is degrading as the PER increases due to the increase of packet loss on the system.

Fig. 15
figure 15

Mean delay with different PERs for high-quality Formula 1

Fig. 16
figure 16

Throughput with different PERs for high-quality Jurassic Park 1

Table 7 Mobility scenarios with different PHY rate ranges
Fig. 17
figure 17

Delay with mobility of speed 5 m/s. a HCCA. b AXTOP. c AMXTOP

Fig. 18
figure 18

Delay with mobility of speed 30 m/s

4.5 Performance analysis with mobility

In order to investigate the effect of the mobility on the performance of the examined schemes, we used similar topology to that discussed in Sect. 4.1 except that the stations have been placed to be moved within four diameters each with corresponding PHY rate setting as in Table 7. It is worth noting that in scenarios a, b and c, the stations experience higher data rate as they move towards the AP and vice versa. These scenarios have been investigated with different movement speeds, namely 5 and 30 meter per second.

The results in Fig. 17c reveals the robustness of the AMTXOP compared to ATXOP and HCCA (Fig. 17a, b) to minimizing the access of each TS using multi-poling scheme. The HCCA shows high delay due to its deficiency to adapt to the real time changes in which the TXOP is computed as a fixed duration at the traffic setup time and used during the traffic life regardless to the actual need of the traffic. Increasing the mobility speed to 30 m/s as illustrated in Fig. 18 shows marginally difference compared to that in Fig. 17 as the destination (AP) is only one hop far from the source. One can notice the effect of increasing the PHY on the number of admitted flows which were 4, 8, 13 and 18 TSs in 6, 18, 36 and 54 Mbps scenarios, respectively.

Fig. 19
figure 19

Channel utilization improvement for low-quality Jurassic Park 1

Fig. 20
figure 20

Channel utilization improvement for high-quality Jurassic Park 1

4.6 Channel utilization analysis

In this section, we study the improvement of the bandwidth utilization of the ATXOP and the AMTXOP compared to the HCCA protocol. The Bandwidth utilization improvement is calculated for each \(STA_{i}\) in an SI as in Eq. (23)

$$\begin{aligned} B_{im}^{i} = \frac{B_{HCCA}^{i}-B_{proposed}^{i}}{B_{HCCA}^{i}} \end{aligned}$$
(23)

where, \(B_{HCCA}\) is the channel utilization of HCCA which can be calculated as the proportion of fixed TXOP to the SI, whereas \(B_{proposed}\) is for ATXOP and AMTXOP. Figures 19 and 20 demonstrate the channel utilization improvement as the average of all admitted flows. The ATXOP achieved utilization improvement over HCCA of about 49 and 56 % for the low and high quality videos, respectively. Whereas, thanks to multi-polling scheme, the AMTXOP achieves up to 56 and 66 % over the HCCA.

5 Conclusion

In this paper, a novel scheduler called AMTXOP has been proposed to leverage the performance of the controlled access mode of IEEE 802.11e. The enhancements of AMTXOP is twofold. First, the HC polls QSTAs with regard to fast changing in the traffic profile so as to prevent QSTAs from receiving unnecessary large TXOP. Second, the surplus time of the wireless channel is conserved by reducing the number of poll frames using the multi-polling approach.

This work shows the significant impact of integrating the multi-polling scheme to the existing piggybacking mechanism in the VBR traffic scheduling. This integrated scheme reduces the packet delay and enhances the channel utilization as well.

Performance evaluation throughout simulation experiments and analytical study reveals the efficiency of the proposed scheduler over HCCA and ATXOP. Moreover, the schemes were evaluated with mobility scenarios to check the robustness over harsher environments. Thanks to multi-polling concept, the total TXOP duration assigned in AMTXOP was 32 and 25 % less than that in ATXOP in low and high videos, respectively. This remarkable improvement in the channel utilization has led to significant enhancements in the packet delay. The AMTXOP achieved delay enhancement up to 12 and 59 % over the ATXOP and HCCA, respectively. Moreover, the gain in the channel utilization is also considered as bonus merit.

6 Future directions

In the future, we intend to design an admission control unit (HCCA) to utilize the surplus bandwidth gain using AMTXOP mechanism and manage these bandwidth resources among the HCF functions, EDCA and HCCA for maximizing the number of admitted flows in the system and study the affect of increasing the admitted flows on the TCP flows of the EDCA. Moreover, an enhancement will be introduced to support live video transmission by exploiting the perfecting concept and govern the resulted additional delay budget. The other possible future direction is to enhance the proposed scheduler to cope with more sophisticated wireless conditions and with the MAC level fragmentation and multirate adaptation support.