International Journal of Statistics and Applications
p-ISSN: 2168-5193 e-ISSN: 2168-5215
2020; 10(4): 85-102
doi:10.5923/j.statistics.20201004.02
Received: July 30, 2020; Accepted: August 15, 2020; Published: September 5, 2020

Mohammed Eid Awad Alqatqat, Ma Tie Feng
Department of Statistics, Southwestern University of Finance and Economics, Chengdu, China
Correspondence to: Mohammed Eid Awad Alqatqat, Department of Statistics, Southwestern University of Finance and Economics, Chengdu, China.
| Email: | ![]() |
Copyright © 2020 The Author(s). Published by Scientific & Academic Publishing.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

This study aim to Comparison in fuzzy time series and fuzzy time and clustering and Proposed Method in the field production and consumption electric, the basic idea of three methods is to predictive the Possibility of future production and consumption electric based on historical data of production and consumption electric in the past, Nevertheless, Three methods have a different approach in transforming the value of production and consumption to the range of the random variables (the states). This paper considered the production and consumption data of electric in china from January, 2015 to December, 2019. The accuracy of three methods is verified by using the Mean Absolute Deviation (MAD), Mean Absolute Percentage Error (MAPE), the result shows that the Proposed Method has smaller MAPE, MAD, From fuzzy time series and fuzzy time clustering methods.
Keywords: Fuzzy time series, Clustering algorithm, Time Series, Prediction, Electric Energy
Cite this paper: Mohammed Eid Awad Alqatqat, Ma Tie Feng, Comparison Fuzzy Time Series -Clustering Applications in Production and Consumption Electric Prediction, International Journal of Statistics and Applications, Vol. 10 No. 4, 2020, pp. 85-102. doi: 10.5923/j.statistics.20201004.02.
Where
is a possible linguistic value of U then a fuzzy set
linguistic variable of U is defined by equation 1 following:![]() | (3.1.1) |
a membership function is fuzzy set
so
If the membership is from
to
is the degree that is owned by
against
.Definition 2: Let Y (t) (t =..., 0,1,2,...) subset
, become a universe discourse with the fuzzy set
defined and F (t) is a collection of
then F (t) is called fuzzy time series defined in Y (t) (t =..., 0,1,2,...). From this definition F (t) can be understood as a linguistic variable
of the linguistic probability value F (t). Because at different times, the value of F (t) can be different, F (t) as a fuzzy set is a function.From time t and universe discourse is different at each time so Y (t) is used for time t (Song and Chissom, 1993).Definition 3: Suppose F (t) is caused only by F (t-1) and appointed with
then there is Fuzzy Relations between F(t) and F(t-1) expressed by the formula![]() | (3.1.2) |
is the Max-Min composition operator. The relation R is called the first order model F (t).If fuzzy has relation R (t, t-1) of F (t). (t) Is time independent so for different times 
So that F (t) called time-invariant fuzzy time series.Definition 4: If F (t) is produced by several fuzzy sets
Then the fuzzy relationship is symbolized by:
Where
And such relationships are called
order fuzzy time series.Definition 5: Let F(t) is produced by F(t-1), F(t-2),…., and F(t-m)(m>0) simultaneously and the relation is time variant then F(t) called fuzzy time series and the relation can be expressed with a formula:![]() | (3.1.3) |
(T) and forecasting of production and consumption in the next month.Step 6: Defuzzifying the acquired outcomes or conversion of fuzzy values into qualitative values.
Calculate the threshold value for stopping condition of the proposed clustering algorithm, shown as follows:![]() | (3.2.1) |
Where the symbol “{}” denotes a cluster.Step 3: Assume that there are p clusters, calculate the cluster center cluster center k of each cluster k as follows:
Where dj is the data in Cluster k, r is the number of the data in Cluster k, and 1≤ k ≤ p.Calculate the distance
between any two adjacent cluster centers
and
shown as follows:![]() | (3.2.2) |
Step 5: If smallest distance
then combine the clusters having the smallest distance between them into a cluster and go to Step 3. Otherwise, go to Step 6.Step 6: Calculate the upper bound
of
and the lower bound
of 
![]() | (3.2.3) |
of the first cluster and the upper bound
of the last cluster can be calculated as follows:![]() | (3.2.4) |
form an
, which means that the upper bound
and the lower bound
of the cluster
are also the upper bound
and the lower bound
of the interval
, respectively. Calculate the middle value
of the interval
as follows:![]() | (3.2.5) |
obtained in Step 1, and then define linguistic terms
represented by fuzzy sets, shown as follows:
Step 3: Fuzzily each historical datum into a fuzzy set. If the datum is belonging to
, then the datum is fuzzified into
where 1≤ i ≤ n.Step 4: Construct the fuzzy logical relationship based on the fuzzified data obtained in Step 3. (Note: If the first order fuzzy time series is used and the fuzzified values of time t-1 and t are
and
, respectively, then construct the fuzzy logical relationship
where
are called the current state and the next state of the fuzzy logical relationship. If the nth order fuzzy time series is used and the fuzzified values of time t-n… t-2, t-1 and t are
respectively, then construct the Fuzzy logical relationship
where
are called the current state and the next state of the nth order fuzzy logical relationship). Based on the current state of the fuzzy logical relationships, let the fuzzy logical relationships having the same current state to form a fuzzy logical relationship group.Step 5: Calculate the forecasted output at time t by using the following principles:Principle 1: If the fuzzified values at time t-n… t-2, and t-1 are
respectively, and there is only one fuzzy logical relationship in the fuzzy logical relationship groups, shown as follows:
Then the forecasted value of time t is
where
are the middle value of the interval
and the maximum membership value of
occurs at interval
.Principle 2: If the fuzzified values at time t-n… t-2, and t-1 are
respectively, and there is only one fuzzy logical relationship in the fuzzy logical relationship groups, shown as follows:
Then the forecasted value of time t is calculated as follows:![]() | (3.2.6) |
denotes the number of fuzzy logical relationships
in the fuzzy logical relationship group,
and
are the middle value of the intervals
and
respectively, and the maximum membership values of
and
occur at interval uk1, uk2,…, and
, respectively.Principle 3: If the fuzzified values at time t-n… t-2, and t-1 are
and
respectively, and there is only one fuzzy logical relationship in the fuzzy logical relationship groups, shown as follows:
Then the forecasted value of time t is calculated as follows:![]() | (3.2.7) |
are the middle values of the intervals
respectively, and the maximum membership values of
and
occur at intervals
and
respectively.
is not related to any other group, i.e.
where
is the empty group, and the highest degree of affiliation with
is in the period
, then the results of the prediction are equal to the middle of the period
But we us this relation Mean of monthly variationThis calculated for every month by the equation below![]() | (3.3.1) |
is not related to any other group.We will get Forecasted Value of the fuzzy membership by mid value
(1+ average mean of monthly variation) (3.2.2).Fourth step: the difficult step and final step how many order equation I will use first, second, third order equation.In the method in section 3.2 use first order equation but in our Proposed Method twelve order equation but the condition here you must choose the data equal for example we choose data from January 2015 to December 2019 has 60 elements And we need prediction for 12 months so the data will be equal 60/12=5. ![]() | (4.1.1) |
![]() | (4.2.1) |
, where
is the smallest variation (Jan 2019),
is the greatest variation (Mar 2019),
Thus, the universal set U will be as follows: U= {-6900, 1500}.The second step: The universal set U must be divided into several equal intervals. In our case, this set U is divided into seven equally- length intervals: u1= [-6900,-5700], u2= [-5700,-4500], u3= [-4500,-3300], u4= [-3300,-2100], u5= [-2100,-900], u6= [-900,300], u7= [300, 1500]. Basically we calculated the length of the interval U which is (1500)—(6900) =8400 and divided it by 7:8400/7=1200, then we built our small intervals: (example: u1= [[-6900,-5700]] which has 1200 as magnitude.If we take into account the fact that forecasting with fuzzy time series exhibits the least average error, it’s necessary to find the middle points of the intervals: um1=-6300, um2=-5100, um3=-3900, um4=-2700, um5=-1500, um6=-300, um7=900..The third step: Fuzzy sets are defined on the universal set U. In this case “ the variation in total production” is a linguistic variable that assumes the following linguistic values: A1=(very low level production electric (VLLPE)); A2=(low level production electric (LLPE)); A3=(changeless production electric (CPE)); A4= (moderate production electric (MPE)); A5=(normal-level production electric (NLPE)); A6= (high-level production electric (HLPE)); A7=(very high-level production electric (VHLPE)). To every linguistic value here corresponds a fuzzy variable which, according to a certain rule is assigned against a corresponding fuzzy set determining the meaning of this variable.For example, the linguistic value “very-low-level production electric” is given by the fuzzy variable <VLLPE, = [-6900,-5700], A1>, where A1 is a fuzzy set defined on the domain = [-6900,-5700] of the universal set U. See example (3).The fuzzy set A1, A2… A7 is defined on the universal set U by the following formula (6.1.1):![]() | (6.1.1) |
is the middle point of the corresponding interval in (1); C is a constant. C is chosen in such a way that it ensures the conversion of definite quantitative values into fuzzy values or their belonging to the interval. (In our case C=0.0001);
is a fuzzy setIf the value of the variable U in formula (6.1.1) is accepted as the middle point of the corresponding interval, the fuzzy set
(i=1... 7) will be defined as follows:A1={(1/u1),(0.61/u2),(0.27/u3),(0.15/u4),(0.10/u5),(0.06/u6),(0.04/u7)}A2={(0,61/u1),(1/u2),(0.61/u3),(0.27/u4),(0.15/u5),(0.10/u6),(0.06/u7)}A3={(0,27/u1),(0.61/u2),(1/u3),(0.61/u4),(0.27/u5),(0.15/u6),(0.10/u7)}A4={(0,15/u1),(0.27/u2),(0.61/u3),(1/u4),(0.61/u5),(0.27/u6),(0.15/u7)}A5={(0,10/u1),(0.15/u2),(0.27/u3),(0.61/u4),(1/u5),(0.61/u6),(0.27/u7)}A6={(0,06/u1),(0.10/u2),(0.15/u3),(0.27/u4),(0.61/u5),(1/u6),(0.61/u7)}A7={(0,04/u1),(0.06/u2),(0.10/u3),(0.15/u4),(0.27/u5),(0.61/u6),(1/u7)}The fourth step: This step consists of the fuzzification of the variation calculated at the first step. This time, if
is a variation for the i-th month, then membership function for
is calculated by means of formula (6.1.1) by holding valid the equality
that’s to say, by separating the interval, to which belongs the considered variation, from the universal set U. Here,
is a fuzzy set of the corresponding variation for the year t= m×n where May2015<t Dec2019.The fifth step: We must select a basis w (1<w<l, where
is the number of months, prior to the current month included in experimental evaluation). Resting on the basis W or the past months, we calculate a fuzzy relationship matrix
by means of which is given a forecast. For this purpose, after the selection of w, we establish an operation matrix i×j
(here i is the number of rows, Criteria matrix
(a row matrix corresponding to fuzzy variation in total population for the month t-1). For example, by assuming that w=7, we can define the operation matrix
(which is the matrix of fuzzy variations in total production electric over the months t-2, t-3, t-4, t-5, t-6, t-7) and the criteria matrix
(which is the fuzzy variation matrix for the month t-1). Thus for w=7, the previous 8 months data are utilized (the total production electric of the (t-8) month must be known to find variation of the (t-7) month).At last, for example, in order to forecast the total production electric for July 2017, the operation matrix О7 (т) will be established as follows
О7 (1990) =
К (Jul-2017) = [fuzzy variation in total production electric for the Jun2017-th month] - [fuzzy variation in total population for Jun-2017], That is to say
According to the method, the relationship matrix R (t) is calculated at the next step![]() | (6.1.2) |
is an operation matrix;
is matrix of fuzzy sets,
an operation min
Later there is defined the forecasted value F (t) for the t year in a fuzzy form as follows.![]() | (6.1.3) |
Finally, the results obtained from population forecast for the July 2017 will be as follows.
Forecasting results for other years are calculated in an same manner.The sixth step: to fuzzify the obtained results of the 5-th step, the following formula is proposed![]() | (6.1.4) |
is the calculated value of membership function for the forecast year t,
are the middle points of intervals. For example, after calculating F (Jul-17) = -2.511164519, that is to say, anticipated production electric July 2017 equals to -2.511164519. In orders to estimate the forecasted total production electric for July 2017, we must add the calculated production electric to the total production for the June 2016. In other wordsN (July 2017) = 2959.83+-2.511164519=2957.32.Table 6.1.1 and Table 6.1.2 below show the actual and forecasted values of production and consumption electric for period from January 2016 to December 2019 in GWh the result has been rounded to the nearest integer.![]() | Table 6.1.1. Actual and forecasted values of production electric in GWh |
![]() | Table 6.1.2. Actual and forecasted values of consumption electric in GWh |
for stopping condition of the proposed clustering algorithm:
[Step 3] Put each datum in a cluster, shown as follows: {5814.57}, {435.09}, {435.09}, {1355.14}, {1798.59}, {2267.61}, {2759.49}, {3312.05}, {3877.16}, {4373.23}, {4864.69}, {5370.08}, {6133.16}, {465.77}, {465.77}, {1458.72}, {1938.24}, {2436.77}, {2959.83}, {3569.76}, {4165.94}, {4689.14}……………etc..[Step 4] Based on Eq. (2), calculate each cluster center
Based on Eq. (3.2.2), calculate the
shown as follows: Find the smallest distance smallest_distance, i.e., 30.68 (the distance distance 2, 3 between
Table 6.2.1 show the Distance between clusters for production electric
|
i.e., 30.68 < 144.2519077 is true, then
(i.e., {465.77}) and
(i.e., {522.73}) are combined into one cluster (i.e., {465.77, 522.73}), and go to Step 2 the iterations of Step 3 to Step 5 are repeatedly done until the condition
is false.![]() | Figure 1. Cluster centers of production electric |

Because there is no previous cluster before
, the lower bound of
of
is calculated using Eq. (3.2.4) and because there is no next cluster after the last cluster, i.e.,
the upper bound
is calculated using.
[Step 7] Let each
form an
and calculate the middle value using Eq. (3.2.5). The table 6.2.2 below shows Interval Generations from the Clusters and Final intervals from clusters show in table 6.2.3.
|
|
,shown as follows:
[Step 2] Fuzzify each datum that is belonging to ui, where 1≤ i ≤ 11 into Ai.[Step 3] Obtain the fuzzy logical relationships (FLR) of the first order fuzzy time series Table (6.2.4). Let the FLR having the same current state to form a FLR group (FLRG).
|
|
![]() | Table 6.2.6. Actual and forecasted values of production electric in GWh |
![]() | Table 6.2.7. Actual and forecasted values of consumption electric in GWh |
![]() | Figure 2. Monthly evolution of electric production |
.
|
![]() | Figure 3. Monthly evolution of electric production |
|
which state come after state A26the forecasted value of A27 by equation (3.2.2)Mid value (1+ average mean of monthly variation)
[Step 5] Calculate the forecasting value. In column 1 from table 6.3.3 we start forecasting our data by using states from the previous year. How we do this? Let's see some examples: We forecast the value of February 2016 by looking at the state of February 2015 and we affect the corresponding forecasted value from column C which is (491, 774505).We forecast the value of July 2016 by looking at the state of July 2015 and we affect the corresponding forecasted value from column C which is (3364, 194305). Table 6.3.4 and 6.3.5 below shows the actual and forecasted values of production and consumption electric for period from January 2016 to December 2019 in GWh, the result has been rounded to the nearest integer.
|
![]() | Table 6.3.4. Actual and forecasted values of production electric in GWh |
![]() | Table 6.3.5. Actual and forecasted values of consumption electric in GWh |
|
|
|
|
![]() | Figure 4. Balance in Energy Production and Consumption for Electric |