American Journal of Operational Research
p-ISSN: 2324-6537 e-ISSN: 2324-6545
2017; 7(1): 1-14
doi:10.5923/j.ajor.20170701.01

Sushil Ghimire 1, Gyan Bahadur Thapa 1, Ram Prasad Ghimire 2, Sergei Silvestrov 3
1Pulchowk Campus, Institute of Engineering, Tribhuvan University, Nepal
2Department of Mathematical Sciences, School of Science, Kathmandu University, Nepal
3Division of Applied Mathematics, Mälardalen University, Box 883, 721 23 Västerås, Sweden
Correspondence to: Sushil Ghimire , Pulchowk Campus, Institute of Engineering, Tribhuvan University, Nepal.
| Email: | ![]() |
Copyright © 2017 Scientific & Academic Publishing. All Rights Reserved.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

Queuing systems consist of one or more servers that provide some sort of services to arriving customers. Almost everyone has some experience of tedious time being in a queue during several daily life activities. It is reasonable to accept that service should be provided to the one who arrives first in the queue. But this rule always may not work. Sometimes the last comer or the customer in the high priority gets service earlier than the one who is waiting in the queue for a long time. All these characteristics are the interesting areas of research in the queueing theory. In this paper, we present some of the previous works of various researchers with brief explanations. We then carry out some of the mathematical expressions which represent the different queueing behaviors. In almost all the literatures, these queueing behaviors are examined with the help of mathematical simulations. Based on the previous contributions of researchers, our specific point of attraction is to study the finite capacity queueing models in which limited number of customers are served by a single or multiple number of servers and the batch queueing models where arrival or service or both occur in a bulk. Furthermore, we present some performance measure equations of some queueing models together with necessary components used in the queueing theory. Finally, we report some applications of queueing systems in supply chain management pointing out some areas of research as further works.
Keywords: Queueing, Performance, Server, Customer, Capacity, Supply chain
Cite this paper: Sushil Ghimire , Gyan Bahadur Thapa , Ram Prasad Ghimire , Sergei Silvestrov , A Survey on Queueing Systems with Mathematical Models and Applications, American Journal of Operational Research, Vol. 7 No. 1, 2017, pp. 1-14. doi: 10.5923/j.ajor.20170701.01.
Probability of
number of customers and mean number of customers in the system at time
was calculated using asymptotic approximations approach. Kempa [45] derived explicit formulae for the queue size distribution of a finite-buffer GI/M/1/N transient queueing model. He calculated transient queue-size distribution convergence rate to the stationary distribution for the constant value given explicitly. Malligarjunan [46] evaluated performance measures and total expected cost rate for a single server queueing system under transient behaviour and entropy measures on an inventory system with two demand classes. Selinka et al. [47] developed a stationary backlog-carryover approach to compare the numerical results with simulations applicability in an analytical solution for a time-dependent performance evaluation of truck handling operations at an air cargo terminal. Ausina et al. [48] chose a single server queueing system in which Bayesian inference for the transient behaviour and duration of a busy period with general, unknown distributions for the inter-arrival and service times has been investigated. Batch or bulk arrival and service facility is the another area of research in queueing theory. Chang and Choi [49] studied finite-buffer discrete-time GeoX/GY/1/K+B queue with multiple vacations that has a wide range of applications including high-speed digital telecommunication system and various related areas presenting some performance analysis. Goswami and Laxmi [50] dealt a single server infinite or finite buffer bulk-service queues considering arbitrary inter-arrival and exponential service time distribution. The customers were served by a single server in accessible or non-accessible batches of maximum size ‘b’ with a minimum threshold value ‘a’. Ghimire et al. [51] established a bulk queueing model with the fixed batch size ‘b’ and obtained the expressions for mean waiting time in the queue, mean time spent in the system, mean number of customers/work pieces in the queue and in the system by using generating function method. Singh et al. [52] investigated retrial queue with bulk arrivals and unreliable servers providing m-optional services to observe the validity of performance measures and the effect of parameters for the queue size distribution. Banergee et al. [53] studied a single server bulk service finite capacity queue for the calculation of joint distribution of the random variables at various epochs in which service times depend on the batch size customers following Markovian arrival process. Luo et al. [54] dealt with a finite buffer GeoX/G//1/N queue for the observation of queue-length distributions at departure, arbitrary, pre-arrival epochs with single working vacation and different input rates combining two techniques of supplementary variable and embedded Markov chain method. Claeys et al. [55] analysed a versatile batch-service queueing model with correlation in the arrival process along with some performance evaluation and buffer management.![]() | Figure 1. General queueing system |
where
denotes inter-arrival time distribution,
is the service time distribution,
represents the number of servers and
denotes the maximum number of jobs that can be occupied in the system (waiting and in service) with infinite number of waiting positions for default. Likewise,
indicates queueing discipline (FCFS, LCFS, SIRO, RR etc.) having FCFS for default, and the last notation
is for population size from which customers rush to the system.For a single server queueing system,
denotes the traffic intensity [56] which is defined by
Assuming an infinite population system with arrival intensity
, which is reciprocal of the mean inter-arrival time, let the mean service time be denoted by
, then we have
If
, then the system is overloaded since the requests arrive faster than they are served. It shows that more servers are needed. Let
denotes the characteristic function of the event A, that is
Furthermore, let N(t) = 0 denotes the event that at time T the server is idle having no customers in the system. Therefore, utilization of the server during time T is defined by
where T is a long interval of time. As
, we get the utilization of the server denoted by
and the following relations holds with probability 1
where P0 is the steady-state probability that the server is idle.
and
denote the mean busy period and mean idle period of the server respectively.Theorem 1 [56]: Let X(t) be an ergodic Markov chain and A is a subset of its state space. Pi denotes the ergodic (stationary, steady-state distribution) of X(t). Then with probability 1
where
and
respectively denote the mean sojourn times of the chain in A and
during a cycle. In an m-server system, the mean number of arrivals to a given server during time T is
, provided that the arrivals are uniformly distributed over the servers. Thus the utilization of a given server is
The other important measure of the system is the throughput of the system which is defined as the mean number of requests serviced during a time unit. In an m-server system, the mean number of completed services is
and thus
However, if we consider a tagged customer, the waiting and response times are more important than the measures defined above. Let
and
denote waiting time and response time of the
customer respectively. Clearly, the waiting time is the time a customer spends in the queue waiting for service and response time is the time a customer spends in the system, that is
where
denotes service time;
and
are random variables and their means denoted by
and
, are appropriate for measuring the efficiency of the system. It is not easy in general to obtain their distribution function.Other characteristics of the system are the queue length and the number of customers in the system. Let the random variables Q(t) and N(t) are the number of customers in the queue and in the system at time t respectively. Clearly, in an m-server system we have
The primary aim is to get their distributions which always may not be possible. In many of the situations, we have only their mean values or their generating function.
and the service rate
, there are some standard notations and symbols used in the queuing equations which are as follows: C = Number of service channels M = Random arrival/serviceD = Deterministic service rate (constant rate)With these, we describe some of the queueing models with their respective formulae in the following subsections.![]() | Figure 2. Birth-death process |
All the above balanced equations can be expressed in terms of
as follows:
Continuing this way 
, that is, the inter-arrival times are independent, exponentially distributed random variables with parameter
. The service times are assumed to be independent and exponentially distributed with parameter
. Furthermore, all the involved random variables are supposed to be independent of each other.
Therefore,
. Now, the normalizing condition is
Thus,
Consequently, average number of customers in the system is
Summarizing the results, we have following conclusions:i. The probability of having zero customers in the system
ii. The probability of having N customers in the system
iii. Average number of customers in system
iv. Average number of customers in the queue
v. Average waiting time in the system
vi. Average waiting time in the queue 

and
have the usual meanings with all the random variables independent as described in the subsection 4.2. Followings are some of the formulae to for the performance measures of this model.i. The probability of having zero customers in the system
ii. Probability of having N customers in the system
iii. Average number of customers in the queue
iv. Average number of customers in system
v. Average waiting time in the system
vi. Average waiting time in the queue 
ii. Probability of having N customers in the system
iii. Average number of customers in the system
iv. Average number of customers in queue
v. Average waiting time in the system
vi. Average waiting time in the queue 
ii. Average number of customers in queue
iii. Average waiting time in the system
iv. Average waiting time in the queue 
where W denotes mean response time, the mean time spent in the queue and at the server, not just simply as the mean time spent waiting to be served; L refers to the average number of customers in the system and
stands for mean arrival rate as usual. Little’s law can be applied when we relate L to the average number of customers waiting to receive service denoted by
and W to the mean time spent waiting for service denoted by
. In this sense, the other well-known form of Little’s law is
It may be applied to separate parts of much larger queueing systems, such as subsystems in a queueing network. In such a case, L should be defined with respect to the number of customers in a subsystem and W with respect to the total time in that subsystem. Little’s law may also refer to a specific class of customer in a queueing system or to subgroups of customers, and so on. Its range of applicability is very wide indeed. Little’s law seems to be independent of [57]• Specific assumptions regarding the arrival distribution A(t) • Specific assumptions regarding the service time distribution B(t)• Number of servers• Particular queueing disciplineLittle’s law is important for three reasons [57]• It is widely applicable (it requires only very weak assumptions). It will be valuable to us in checking the consistency of measurement of data.• It is the main task in the algorithms for evaluating several queueing network models.• In studying computer system, we frequently find two of the quantities related by Little’s law (the average number of requests in a system and the throughput of that system) and desire to know the third (the average system residence time, in this case).Applications of Little’s Law [57]• On rainy days, streets and highways are more crowded.• Fast food restaurants need a smaller dining room than regular restaurants with the same customer arrival rate.• Large buffering together with large arrival rate cause large delays. Theorem 2 [58]: In a closed Gordon-Newell network with m queues, write
for the state of network. For a customer in transit to state i, let
denotes the probability that immediately before arrival the customer sees the state of the system is
Then the probability
is same as the steady state probability for state
for a network of the same type with one customer less.In any of the queue, the customers want them to be served as quickly as possible. But this may not happen in all the situations. One feels quite relaxed whenever her/his turn comes for the service. To describe the nature and feeling of customers, there are some popular facts about queue. They are called Murphy’s Laws and are described as follows [57]:• If a customer changes queue, the one s/he has left will start to move faster than the one s/he is in.• Customer feels that her/his queue always goes the slowest.• Whatever queue a customer joins, no matter how short it looks, will always take the longest for her/him to get served.
where,
.Likewise, if R denotes the response time and W is the waiting time in the queue, then mean response time and the mean waiting time has been expressed as
and
Average number of jobs found on the server has been determined by the formula
Boulaksil [61] proposed a model to determine the safety stock levels in supply chain systems which are facing demand uncertainty. He reported that supply chain would meet a high level of customer service if large portion of the safety stocks are placed downstream. Teimoury et al. [62] determined holding, back ordering and ordering cost function for GI/G/1 queueing model. They proposed an inventory model for batch products along with some numerical examples of manufacturing supply chain network to analyse performance evaluation. Liu et al. [63] evaluated the performance of serial manufacturing and supply systems with inventory control by developing a multi-stage inventory queue model and a job queue decomposition approach. Then they presented an efficient procedure to minimize the overall inventory in the system maintaining the required service level. Sivakumar et al. [64] studied a discrete time inventory model to evaluate joint probability distribution of the number of customers in the pool and the inventory level where demand during stock out periods either enter a pool having finite capacity
or leave the system with a predefined probability. Andriansyah et al. [65] used generalized expansion method to evaluate the performance of the systems in terms of throughput and compared results with simulation. Experiments for a large number of settings and different network topologies were also presented. They derived the formula for the throughput at node i as
where
is the probability of a customer being blocked for
queueing model. Mishra and Yadav [66] considered a clocked queueing network with renewal model and used it to develop a computational approach for the analysis of cost and profit structure in the system. They found its optimality with respect to arrival and service parameters of the system. Mary and Christina [67] proposed the procedure to find the total average cost in terms of crisp values for
with fuzzy parameters considering many other factors with some numerical example for the validity of the proposed system. Smith [68] used mean value analysis algorithm to study the material handling and transportation networks in finite buffer closed
queueing system. Babadi et al. [69] applied queueing systems in nylon plastic manufacturing and recycling centres using Jackson network to minimize the average delay to deliver products, total cost and transportation cost which was checked by the sensitivity analysis by changing the parameters. Vahdani and Mohammadi [70] proposed a bi-objective optimization model in a closed loop supply chain network in which general multi-priority and multi-server queuing system for parallel processing execution has been used to minimize the cost and maximize the profit. In order to calculate the queue waiting time of arrival products into the forward flow to the bidirectional facility, following formula has been used
where,
for all band
The other notations used in the model are described as follows:
Waiting time in the queue of forward flow of product with priority p in bidirectional facility b;
Number of service provider at bidirectional facility b;
Service rate at bidirectional facility b for forward products;
Arrival rate of forward flow of product p to bidirectional facility b.Diabat et al. [71] used queueing approach to determine the number and location of distribution centres, the assignment of retailers to distribution centres and the size and timing of orders for each distribution centres providing some numerical results. They proposed a model in which
Each opened distributions centres orders to the supplier when its inventory level is less than S + 1. Then the expected amount of recorders (RK) is calculated by
On the other hand, when the level at distribution centre located at site K is equal to zero and arriving demands are lost, then the expected amount of lost sales
is
And the expected amount of inventory
has been obtained by
where
and
are the recorder quantity and recorder point at distribution centre K. Wang et al. [72] collected a review defining supply chain and discussing literature in the areas, namely service supply management, service demand management and the coordination of service supply chains to observe the state in each area. He [73] derived supply risk sharing contracts for the equilibrium between the recycling price decision and the remanufacturing quantity decision using game theory illustrating some numerical examples for managerial results. Zhalechian et al. [74] studied environmental impact of sustainable closed-loop location - routing -inventory model using a stochastic-probabilistic programming approach presenting some real-world applications. Sadjadi et al. [75] derived optimization model using queuing approach for allocation of the retailers’ demands, and inventory replenishment decisions so as to minimize the total expected cost of location, transportation and inventory. Jin [76] formulated link transmission model and link queue model defining demand and supply to present queuing models for a point queue and their discrete versions.