An integrated artificial neural network-genetic ... - Springer Link

J Intell Manuf (2011) 22:229–245 DOI 10.1007/s10845-009-0284-8

An integrated artificial neural network-genetic algorithm clustering ensemble for performance assessment of decision making units A. Azadeh · M. Saberi · M. Anvari · M. Mohamadi

Received: 27 February 2009 / Accepted: 17 June 2009 / Published online: 1 August 2009 © Springer Science+Business Media, LLC 2009

Abstract This study proposes a non-parametric efficiency frontier analysis method based on artificial neural network (ANN) and genetic algorithm clustering ensemble (GACE) for measuring efficiency as a complementary tool for the common techniques of the efficiency studies in the previous studies. The proposed ANN GA algorithm is able to find a stochastic frontier based on a set of input–output observational data and do not require explicit assumptions about the functional structure of the stochastic frontier. Furthermore, it uses a similar approach to econometric methods for calculating the efficiency scores. Moreover, the effect of the return to scale of decision making unit (DMU) on its efficiency is included and the unit used for the correction is selected based on its scale (under constant return to scale assumption). Also, in this algorithm, GA is used to cluster DMUs to increase

DMUs’ homogeneousness. It should be noted that data envelopment analysis (DEA) is sensitive to the presence of the outliers and statistical noise. It is also not capable of performing prediction and forecasting. This is shown by two examples related to outlier situations. However, the proposed algorithm is capable of handling outliers and noise and DEA is used as a benchmark to show advantages of the proposed algorithm. Also, the proposed algorithm and conventional algorithm are compared in viewpoint of DEA through statistical t-test. The proposed approach is applied to a set of actual conventional power plants to show its applicability and superiority.

A. Azadeh (B) Department of Industrial Engineering and Center of Excellence for Intelligent-Based Experimental Mechanics, College of Engineering, University of Tehran, P.O. Box 11365-4563, Tehran, Iran e-mail: [email protected]; [email protected]

Introduction

M. Saberi Institute for Digital Ecosystems & Business Intelligence, Curtin University of Technology, Perth, Australia e-mail: [email protected] M. Saberi Department of Industrial Engineering, Faculty of Engineering, University of Tafresh, Tafresh, Iran M. Anvari Department of Industrial Engineering, Iran University of Science and Technology, Tehran, Iran M. Mohamadi Department of Computer Engineering, Iran University of Science and Technology, Tehran, Iran

Keywords Performance assessment · Artificial neural network · Genetic algorithm · Decision making units

The appropriate use of limited resources with the available technology is referred to as technical efficiency. Efficiency frontier analysis has been an important approach of evaluating firms’ performance in private and public sectors. As alternatives to determine the efficiency boundaries, the international experience reports a significant number of methodologies with different approaches and methods to characterize such efficiency (Jamasb and Pollitt 2001). Basically, these methodologies can be classified according to how the frontier is estimated. There are two competing paradigms on efficiency analysis. Parametric and non-parametric approaches are widely-used in the efficiency measurement. The first include the estimation of both deterministic and stochastic frontier function (SFF) which is based on the econometric regression theory and has been widely accepted in the econometrics field. The latter include DEA and Free Disposal Hull (FDH) which are based on a mathematical programming

123

230 Table 1 ANN and DEA features

J Intell Manuf (2011) 22:229–245

DEA

ANN

Nonparametric approach

Nonparametric approach

Optimal weights to maximize the efficiency

Optimal weights to derive the best possible fit

No assumptions about the functional form

No assumptions about the functional form

Medium assumptions about functional form and data Medium flexibility

Low assumptions about functional form and data High flexibility

Many theoretical studies/applications on efficiency Very sensitive to the presence of the outliers and statistical noise Forecasting incapability

Few theoretical studies No sensitive to the presence of the outliers and statistical noise Forecasting capability

Global optimization in linear status

Local optimization in nonlinear status

Use continuous and integer inputs and outputs Medium computational expense

Use continuous and classification inputs and outputs

Black box

Black box

approach. Each of these two methodologies has its strength as well as major limitations. In all of these methodologies, the frontier is defined by the most efficient DMU of the sample. Mathematically, the frontier methods are introduced as a high-reliability analysis tool and have been largely used for studies in the electrical and industrial field (Pollit 1995; Sanhueza et al. 2004). There have been many efficiency frontier analysis methods reported in the literature (for instance only about power plants Cook and Green 2005; Golany et al. 1994; Goto and Tsutsui 1997; Knittel 2002; Lam and Shiu 2001; Olatfubi and Dismukes 2000; Park and Lesourd 2000; Sanhueza et al. 2004; Sueyoshi and Goto 2001). But, the assumptions made for each of these methods are restrictive. Conflicting conclusions of efficiency are often resulted by using the different methods due to the unsuitability of the assumptions. Their frontier is deterministic and sensitive to outliers. The non-parametric approaches make no assumption about the functional form of the frontier. Instead, it specifies certain assumptions about the underlying technology that in combination with the data set allows the construction of the production set. For instance, the DEA frontier is very sensitive to the presence of the outliers and statistical noise which indicates that the frontier derived from DEA analysis may be warped if the data are contaminated by statistical noise (Bauer 1990). On the other hand, DEA can hardly be used to predict the performance of other decision-making units. Although, benchmarking in DEA allows for the identification of targets for improvements, it has certain limitations. A difficulty addressed in the literature regarding this process is that an inefficient decision making unit (DMU) and its benchmarks may not be inherently similar to their operating practices. This is primarily due to the fact that the composite DMU that dominates the inefficient DMU does not

123

High computational expense

exist in reality. To overcome these problems researchers have utilized performance-based clustering methods for identifying more appropriate benchmarks (Doyle and Green 1994; Talluri and Sarkis 1997). These methods cluster inherently similar DMUs into groups, and the best performer in a particular cluster is utilized as a benchmark by other DMUs in the same cluster. Moreover, Wang indicates that the research to remove drawbacks and limitations in this area has been insufficient (Delgado 2005). However, his proposed algorithm is time consuming (using uncertain λ value and trial and error process and the need to run stochastic frontier analysis (SFA) and DEA jointly). The proposed algorithm uses new and powerful tools and removes the limitations of previous methods (Cook and Green 2005; Costa and Markellos 1997). It further improves SFF and DEA approaches. Table 1 shows the important features of ANN and DEA. Examination of this table shows that each of the two methods has its shortcoming and superiority. In present study, the conventional algorithm (that uses ANN as approximation tool) is improved with proposed algorithm1 (Athanassopoulos and Curram 1996). Also, the weakness of parametric method with respect to outliers (and statistical noise) and forecasting is removed by the proposed algorithm. Previous studies by Azadeh et al. show the applicability of ANN and ANN Fuzzy C-Means in selected power plants (Athanassopoulos and Curram 1996; Azadeh et al. 2006a,b, 2007a). However, this study presents an algorithm which is not only based on ANN but also uses GA as a clustering tool. The most important feature of the GA clustering is ability to extract optimum number of clusters. In addition, ANN Fuzzy C-Means which is proposed in previous studies 1 DEA is used as benchmark to show the advantage of the proposed algorithm.

J Intell Manuf (2011) 22:229–245

(Athanassopoulos and Curram 1996; Azadeh et al. 2006a,b, 2007a) is expanded for output oriented models. The proposed algorithm improves the conventional algorithm (that uses ANN as approximation tool). It should be noted that data envelopment analysis (DEA) is sensitive to the presence of the outliers and statistical noise. It is also not capable of performing prediction and forecasting. However, the proposed algorithm is capable of handling outliers and noise. It is also capable of performing forecasting for a given set of data. In present Study, we are not going to compare the result of DEA and the proposed algorithm and the result of DEA is used only as benchmark. The conventional algorithm requires explicit assumptions about the functional structure of the stochastic frontier. Moreover, the effect of the return to scale of decision making unit (DMU) on its efficiency isn’t included. However, the proposed algorithm doesn’t require explicit assumptions about the functional structure of the stochastic frontier because it uses ANN. Also, several studies show the superiority of ANN versus conventional econometric methods. Moreover, our emphasis is on philosophic drawback of conventional algorithm nor on the higher or lower results of each performance assessment method because this doesn’t show its superiority or weakness. As mentioned in the introduction, DEA has two problems: 1. Not capable of performing prediction and forecasting; 2. Sensitive to the presence of the outliers and statistical noise. Problem number one is removed in proposed algorithm (shown in step 8). Also, problem number two is removed in the proposed algorithm (step 9). The paper is organized as follows. Section “Introduction” provides an introduction to ANNs in efficiency analysis, where neural networks form a promising analysis tool together with known econometric models and non-parametric methods. This section concludes with a review of several published papers about ANNs and efficiency. DEA is introduced in “Data envelopment analysis”. Section “Genetic algorithm clustering ensemble” provides an introduction to GA method for clustering. Section “Clustering ensemble” is dedicated to the proposed ANN GA Clustering Ensemble for efficiency analysis and assessment of DMUs. An empirical illustration for measuring performance of thermal power plants in Iran is carried out in “The case study”. The capability of proposed algorithm in handling outliers and corrupted data is presented in “Examination of proposed algorithm in outliers and corrupted data situation”. The final section of the paper offers conclusions and suggests areas for the future research. Artificial Neural Networks (ANNs) are a promising alternative to econometric models and they are information

231

processing paradigms that are inspired by the way biological nervous systems, such as the brain, process information. ANNs, like people, learn by example. ANNs are configured for specific applications, such as pattern recognition, function approximation through learning process. Learning in biological systems involves adjustments to the synaptic connections that exist between the neurons. This is true for ANNs as well. They are made up of simple processing units which are linked by weighted connections to form structures that are able to learn relationships between sets of variables. Nowadays, many researches have done in field of ANN. For example, Wang et al. discuss the computational complexity the convergence rate and the stability of the single parameter dynamic searching algorithm for Multi-layer Neural Networks (Wang et al. 2008). Zhang et al. proposed learning algorithm that maintains some trends of quick descent to either global minimum or a local minimum, and at the same time has some chance of escaping from the local minima by permitting temporary error increases during learning (Zhang et al. 2008). Tamura and Tann propose a midpointvalidation method, which improves the generalization of neural networks (Tamura and Tann 2008). In this study application of ANNs in efficiency analysis is discussed. Within the efficiency literature, few applications have been developed in this field. Commonly, neural network technique is used as a complementary tool for parametric and non-parametric methods such as data envelopment analysis (DEA), to fit production functions andmeasure efficiency under non-linear contexts. In fact, applying ANNs can reduce the restrictive assumptions of each of these methods. This heuristic method can be useful for nonlinear process that has an unknown functional form. There has been a vast literature about ANNs, basically in the empirical field, indicating ANN comparability or superiority to conventional methods for estimating various functions (Azadeh et al. 2006c, 2007b,c; Chiang et al. 1996; Hill et al. 1996; Indro et al. 1999; Jhee and Lee 1993; Kohzadi et al. 1996; Stern 1996; Tang et al. 1991; Tang and Fishwick 1993). This study, uses ANN for estimating production function and consequently for performance evaluation of DMUs (Tang and Fishwick 1993). The efficiency prediction power of ANNs is unique and its flexibility to solve complex problems where the main information lies implicitly in the data is very applicable (Wu et al. 2006). The idea of combination of neural networks and DEA for classification and/or prediction was first introduced by Athanassopoulos and Curram (1996). They treated DEA as a preprocessing methodology to screen training cases in a bank with four inputs and three outputs. After selecting samples, ANN is trained to learn a nonlinear forecasting model. They assume that inefficiency distributions are semi-normal and exponential and conclude that DEA is superior to ANN for measurement purpose. Their study indicates that ANN

123

232

results are more similar with the constant returns to scale and less with the variable returns to scale results. Costa and Markellos (1997) analyzed the London underground efficiency with time series data for 1970–1994 where there are two inputs (fleet and workers) and one output (distance). They explain how the ANNs results are similar to corrected ordinary least squares (COLS) and DEA. They proposed two procedures: (a) similar method to COLS after neural training; (b) by an oversized network until some signal to noise ratio is reached. Then, inefficiency is determined as observation-frontier distance. However, ANN offers advantages in the decision making, the impact of constant versus variable returns to scale or congestion areas (Costa and Markellos 1997). Santin et al. (2004) infer that ANN is superior to econometric approach for frontier estimation. Pendharkar and Rodger (2003) used DEA as a data screening approach to create a sub sample training data set that is ‘approximately’ monotonic, which a key property is assumed in certain forecasting problems. Their results indicate that the predictive power of a trained ANN on the ‘efficient’ training data subset is stronger than the predictive performance of a trained ANN on the ‘inefficient’ training data subset. Santin et al. used a neural network approach for a simulated nonlinear production function and compared its performance with conventional alternatives such as stochastic frontier and DEA in different observations and noise scenarios. The results suggested that ANNs are a promising alternative to conventional approaches to fit production functions and measure efficiency under non-linear contexts. Wu et al. presented a DEA-ANN2 study for performance assessment of branches of a large Canadian bank. Their results are operable to the normal DEA results on the whole (Wu et al. 2006). They concluded that the DEA-ANN approach produces a more robust frontier and identifies more efficient units because better performance patterns are explored. Furthermore, for worse performers, it provides the guidance on how to improve their performance to different efficiency ratings. Ultimately, they concluded that neural network approach requires no assumptions about the production function (the major drawback of the parametric approach) and it is highly flexible. It is concluded that ANN has been viewed as an ideal tool to approximate numerous non-parametric and non-linear problems.

J Intell Manuf (2011) 22:229–245

Data envelopment analysis Data envelopment analysis is a non-parametric method that uses linear programming to calculate the efficiency in a given set of decision-making units (DMUs). The DMUs that make up a frontier envelop, the less efficient firms and the relative efficiency of the firms is calculated in terms of scores on a scale of 0 to 1, with the frontier firms receiving a score of 1. DEA models can be input- or output-oriented and can be specified as constant returns to scale (CRS) or variable returns to scale (VRS). Basic models of DEA The original fractional Charnes Cooper Rhodes (CCR) model (1) evaluates the relative efficiencies of n DMUs ( j = 1, . . ., n), each with m inputs and s outputs denoted by x1 j , x2 j , . . ., xm j and y1 j , y2 j , . . ., ys j , respectively (Charnes et al. 1978). This is achieved by maximizing the ratio of weighted sum of outputs to the weighted sum of inputs as shown in model (1): s u y rm=1 r r o i=1 vi xio s u y rm=1 r r j ≤ 1, i=1 vi xi j

Max θ = s.t.

j = 1, . . . , n, r = 1, . . . , s

(1)

u r , vi ≥ 0, i = 1, . . . , m, r = 1, . . . , s

In model (1), the efficiency of DMUo is θo and u r and vi are the factor weights. However, for computational convenience the fractional programming model (1) is re-expressed in linear program (LP) form as follows in model (2): Max θ = s.t.

s

s r =1

u r yr o

u r yr j −

r =1 m

m

vi xi j ≤ 0,

j = 1, . . . , n,

i=1

(2)

vi xio = 1

i=1

u r , vi ≥ ε, i = 1, . . . , m, r = 1, . . . , s

where ε is a non-Archimedean infinitesimal introduced to ensure that all the factor weights will have positive values in the solution. Model (3) evaluates the relative efficiencies of n DMUs ( j = 1, . . ., n), respectively, by minimizing inputs when outputs are constant. The dual of linear program (LP) model for input oriented CCR is as follows:

2

They defined an algorithm to implement the DEA-ANN model. Moreover, after collecting a data set, CCR method is used to calculate efficiency score of DMUs (Charnes et al. 1978). The preprocessed data set is obtained and is grouped into four categories based on the efficiency scores and the neural network is trained with some groups of data subset until the pre-specified epochs or accuracy is satisfied. Then the trained neural network model is applied to calculate efficiency scores of all DMUs and post process the calculated efficiency scores by regress analysis between DEA-NN results and CCR DEA results.

123

Min θ s.t. θ xio ≥ yr o ≤

n

λ j xi j , i = 1, . . . , m,

j=1 n j=1

λj ≥ 0

(3) λ j yr j , r = 1, . . . , s

J Intell Manuf (2011) 22:229–245

233

The output oriented CCR model is as follows: Max θ

n

s.t. xio ≥

λ j xi j , i = 1, . . . , m,

j=1

θ yr o ≤

(4)

n

λ j yr j , r = 1, . . . , s

j=1

1. Using different clustering algorithms to produce partitions for combination; 2. Changing initialization or other parameters of a clustering algorithm; 3. Using different features via feature extraction for subsequent clustering; 4. Partitioning different subsets of the original data.

λj ≥ 0 If λ j = 1( j = 1, . . ., n) is added to model (3), the Banker Charnes Cooper (BCC) model is obtained which is input oriented and its return to scale is variable. The calculations provide a maximal performance measure using piecewise linear optimization on each DMU with respect to the closest observation on the frontier. The linear programming system for the BCC input-oriented model is given in expression (5), and the output-oriented model is given in expression (6) (refer to Charnes et al. 1978). Min θ s.t. θ xio ≥ yr o ≤

n

λ j xi j , i = 1, . . . , m,

j=1 n

λ j yr j , r = 1, . . . , s

(5)

j=1

n

λj = 1

j=1

λ j ≥ 0, Max θ s.t. xio ≥

n j=1

θ yr o ≤ n

j = 1, . . . , n

λ j xi j , i = 1, . . . , m, n

λ j yr j , r = 1, . . . , s

(6)

j=1

λj = 1

j=1

λ j ≥ 0,

j = 1, . . . , n

Based on the diversity of components, a matrix is computed which contains combinational information. This matrix is co-association matrix. Co-association matrix can be computed by the following steps. Let D be a dataset of N data points in d-dimensional space. Suppose that, k-means algorithm has m running time. The value of co_assocition(i, j) is equal to:

i = j 1 i= j k m

(7)

Out of m times repeating the single clustering algorithm on the samples, the ith and jth samples are placed in the same cluster for the k times. It should be noted that the co-association matrix is symmetry matrix (Strehl and Ghosh 2002a). Numerous hierarchical algorithms can be applied to the co-association matrix to obtain the final partition, including Single Link (SL), Average Link (AL), Complete Link (CL) and voting k-means (Minaei et al. 2004; Strehl and Ghosh 2002b; Topchy et al. 2004). In this paper, we used genetic algorithm (GA) on co-association matrix to obtain the final partitions. The appropriateness of GA versus other hierarchical algorithms is shown in two recent studies (Azimi et al. 2006; Mohammadi et al. 2007). Also, the most important feature of our algorithm is ability to extract the number of optimum clusters. In this paper, ten different k-means algorithms are used to create the ensemble matrix. Then, GA is applied to the ensemble matrix. The superiority of GACE method is:

Genetic algorithm clustering ensemble Clustering ensemble Clustering ensemble method is a common and efficient method for clustering task which is used in several applications and algorithms (Minaei et al. 2004; Strehl and Ghosh 2002a,b; Topchy et al. 2004). The proposed algorithm uses single clustering algorithms on a same dataset and then combines the results and consequently improved solution for clustering task is identified. In order to optimally integrate clustering ensembles in a robust and stable manner, one needs a diversity of component partitions for combination. Generally, this diversity can be obtained from several sources:

1. Combination of the results of single clustering algorithms; 2. Ability to extract the number of optimum clusters; 3. Improved outputs of GACE versus hierarchical algorithms. GA applied on ensemble matrix The searching capability of GAs has been used to appropriately determine the number of cluster and final clusters. Genetic algorithm encompasses three main operators: (1) selection, (2) crossover, and (3) mutation (Davis 1991; Goldberg 1989).

123

234

J Intell Manuf (2011) 22:229–245

Representation (definition of individuals)

Selection operation

The first step in defining a GA is to link the “real world “to the “GA world”, that is to setup a bridge between the original problems context and the problem solving space. In this method, the individuals have two parts which are sample part and index part. Sample part contains all samples of dataset and each gene is a representative of a member of database. The length of this part is equal to the number of dataset members. The order of each sample in each individual is random (initialization step). Figure 1 shows an instance of sample part. This instance shows the dataset which has 15 samples with the following order. The boundary of each cluster is determined in index part. The length of this part is equal to number of clusters + 1. Figure 2 shows an instance of the index part. This instance contains 3 clusters. Members from index 1 to 5 belong to cluster 1, members from index 6 to 11 belong to cluster 2 and members from index 12 to 15 belong to cluster 3. Table 2 shows the member of each cluster. The value of genes in the index part is selected at random (initialization step), except first and last genes. The value of the first and last genes always are fixed (first = 0 and last = the number of samples).

The selection process selects individuals from the mating pool directed by the survival of the fitness concepts of natural genetic systems. In this paper, the tournament selection (2) is used as selection method (Davis 1991). In tournament selection (2), two individuals are chosen at random and the individual with better fitness is selected for next population.

5

6

7

12

13

11

2

15

9

1

4

3

8

10

Crossover operation The role of the crossover operation is to create new individuals from old ones. Crossover is a probabilistic process that exchanges information between two parent individuals for generating two child individuals. We use “cut-and-crossfill” as our crossover operation with a fixed crossover probability of μ (Davis 1991). In cut-and-crossfill crossover, a random integer (crossover point) is generated in the range of [1, l − 1] (l is the length of dataset). The portions of the individuals lying to the right of the crossover point are exchange to produce two offspring. This crossover has been used only on sample part. This crossover guarantees that no repeated gene is created and changes the value of repeated genes to the correct values. Mutation operation

14

Fig. 1 An example of sample part

Fig. 2 An example of index part

0

5

Mutation is applied to one individual and delivers a modified mutant (the child). Three Mutation methods are implemented to increase the performance of GA. Intelligence mutation is needed to approximate the results. Examination of Fig. 3 shows that in iteration 1,200 the diversity is decreased and consequently the approximate result is yielded. At each step (iteration) of GA, mutation rate is determined by the following procedure:

11 15

Table 2 Cluster members Cluster 1

Cluster 2

If (iteration < 1,200) then

Cluster 3

Swap rate = 10%; Creep rate = 20%;

Cluster members {5, 6, 7, 12, 13} {11, 2, 15, 9,1,4} {3, 8, 10, 14}

Fig. 3 Diversity rate

Intelligence rate = 0%

90

Diversity Rate

80 70 60 50 40 30 20 10

123

00

00

18

00

17

00

16

00

15

00

14

50

12

00

Iteration

11

11

90 0 10 00

80 0

0

70 0

0

60

0

50

40

30 0

20 0

1

10 0

0

J Intell Manuf (2011) 22:229–245

Else, Swap rate = 5%; Creep rate = 10%; Intelligence rate = 20% Mutation operation is selected from swap mutation, creep mutation and Intelligence mutation by the randomize function. This function generates the integer numbers from 1 to 100 to swap mutation rate, creep mutation rate and Intelligence mutation rate. Diversity is essential to the genetic algorithm because it enables the algorithm to search a larger region of the space. Three mutations are applied to create sufficient diversity. Each mutation creates one type of diversity (Davis 1991). Swap mutation This mutation simply changes the position of two samples at random (Aarts et al. 1989). This mutation is used only on sample part.

235

which is composed of two parts (Aarts et al. 1989). The first part gets more importance to the individual clusters which are very similar (based on co-association matrix). The second part considers clusters which have low association with the other clusters. Equations 8–10 presents final fitness, penalty cluster and fitness cluster, respectively. I Fitness_cluster(Ck ) Final fitness = k=1 I I I −1 Penalty_Cluster(i, j) i=1 j=i+1

−

Penalty_cluster(Ci , C j ) NC j NCi

i= j co_assocition(i, j) N C ∗ N C − N C ∩ N C 2 i j i j Fitness_cluster(Ck ) (i, j)∈(NCk ×NCk ) i≥ j co_assocition(i, j) 2 =

=

(i, j)∈(NCk ×NCk )

NC − NC k k

Intelligence mutation This mutation is used only on sample part. It is used after some iterations (in final iterations) when the algorithm obtains the approximate results. We introduce this mutation to increase the accuracy of the proposed algorithm. This mutation is named as intelligence mutation, because it selects the best candidate samples for mutation. The steps of special mutation are as follows: • A cluster is chosen at random (C). • A member of C with minimum similarity with other members of C is chosen (X ) (dissimilar sample in the same cluster). • The similarity of X with other cluster is calculated. • X is transformed to the most similar cluster (H ). The boundary of H is updated in the index part. The procedure for calculating similarity is the same as the following section (fitness function). Evaluation function (fitness function) The role of the fitness function is to adapt to the requirements. The proposed algorithm presents a fitness function

(9)

(10)

2

Creep mutation This mutation is used only on index part and it works by adding a small (positive or negative) value to selected gene with probability p. It means that this mutation can change the boundary of the selected cluster (increase or decrease) (Aarts et al. 1989).

(8)

I (I − 1)/2

In Eqs. 8–10 we have: Co_association(i, j): The value of entry (i, j) in the co-association matrix Ck : The kth cluster Nk: The set of Ck members index |Nk |: Number of samples in cluster k i: Number of clusters for current individual Fitness_cluster (Ck ): The fitness of cluster kth Penalty_cluster (Ci , C j ): The average similarity between Ci &C j Penalty: The average similarity between all clusters and V is balance parameter.

Method As mentioned, conventional and existing methods have several limitations for in assessment of DMUs. The sensitivity of those methods about outliers and corrupted data is the main reason to introduce the proposed algorithm. To select a method for performance assessment of a set of DMUs, it is suggested to pay great attention about the data, type of problem and DMUs. An ANN GACE algorithm is proposed to measure DMUs’ efficiency in current period. This algorithm can estimate efficiency by considering input (output) orientation through identifying production (cost) function ANN (same as econometric methods). Also, for simplicity we consider one output (input), but it is easy to extend to it various outputs (inputs). Figure 4 presents the integrated algorithm for performance

123

236

J Intell Manuf (2011) 22:229–245

1

Determination of input(s) and output(s) variables

2

Collect relative data set S in all available previous and current periods, normalize all data

3

Divide S in to two subsets: training (S1) and valid (S2) data

4

Use different ANN architecture to estimate relation between input(s) and output(s)

5

Run ANN* for Sc.

2.

3.

6 Cluster DMUs by GACE method into x clusters

Set j = 1

Calculate weigh of DMUi (Wi) i = 1… n for DMUs in jth cluster

7

Calculate the error between the real output and ANN model output for DMUs in jth cluster

8

4. Shift frontier function from ANN for DMUs in jth cluster

9

j=j+1

Yes

j ≤ x No

10

• Train the model using the training data (S). In this study, MLP Levenberg–Marquardt (MLP–LM) training algorithm is used. LM algorithm is selected by noting its capability to reach desired error in an appropriate time frame. • Evaluate the model using the test data (Sc ) and obtain MAPE error.

Calculate efficiency scores for all DMUs

Fig. 4 Integrated ANN-GACE algorithm for performance assessment of DMUs

assessment of DMUs. The proposed algorithm is composed of the following steps:

1.

Determination of input (or inputs) and output (P) variables under input oriented assumption (input (C) and output (or outputs) variables under output oriented assumption) of the model.

123

Collect data set S in all available previous periods which describes the input-output relationship for DMUs. Assume that there are n DMUs to be evaluated. Note that the current period data (Sc : test data3 ) does not belong to S. Then obtain the preprocessed data set (S and Sc ) after the data are scaled between 0 and 1. Divide S in to two subsets: training (S1 ) and valid (S2 ) data. The first subset is the training set, which is used for computing the gradient and updating the network weights and biases. The second subset is the validation set. The error on the validation set is monitored during the training process. The validation error will normally decrease during the initial phase of training, as does the training set error. However, when the network begins to over fit the data, the error on the validation set will typically begin to rise. When the validation error increases for a specified number of iterations, the training is stopped, and the weights and biases at the minimum of the validation error are returned. The validation is separated from training data and it should be a good representation of data across the range of outcomes. It is necessary to strike a balance between the training and validation data set sizes. When large amounts of data are available the selection of validation data can be done using a simple random choice. The extrapolation ability of ANN should be used and therefore the data for validation is chosen from the period which is closer to Sc . Use ANN method to estimate relation between input(s) and output(s). For this reason select architecture and training parameters. All networks used in this study have a single hidden layer because the single hidden layer network is found to be sufficient to model any function (Cybenko 1989; Patuwo et al. 1993). To find the appropriate number of hidden nodes, the following steps are performed for networks with one to q nodes in their hidden layer. The value of q is optional and should be changed if after the following next steps, the goal error has not been met.

Then the model which has the lowest error (MAPE) is selected to estimate the production function. The MAPE is calculated with following formula: 3

The test set error is not used during the training, but it is used to compare different ANN models.

J Intell Manuf (2011) 22:229–245

MAPE =

237

N 1 Actualvaluei − Setpointvaluei , N Setpointvaluei

on its efficiency is considered and the unit used for the correction is selected by noting its scale (CRS).

i=1

(N : the number of rows) 5. 6.

7.

Run ANN∗ for Sc . Cluster DMUs by GACE (x cluster are obtained after running GA)and for each of the clusters do the fowling steps. Calculate weight of DMUi (Wi )

Vi = Ci /Average (C1 , . . . , Ci−1 , Ci+1 , . . . , Ccj ) i = 1, . . . , n for input oriented model Wi = Vi / (Vi ) (11) where c j (1 ≤ j ≤ x) is the number of DMUs which belong to jth cluster of x clusters. Calculate the error between the real output (Preal(i) ) and ANN model output (PANN∗(i) ) in the period which you want to assess the efficiency of DMUs (Sc ): E i = Preal(i) − PANN∗ (i) i = 1, . . . , n for input oriented model E i = CANN∗ (i) − Creal(i) i = 1, . . . , n for output oriented model (12) Shift frontier function from neural network to obtain the effect of the largest positive error which is one of the unique features of this algorithm: E k = E i /Wi

i = 1, . . . , n

(13)

This option consists of not considering the largest error, but it calculates the frontier function by noting the DMU scale (Constant Returns to Scale (CRS)). Therefore: The largest E i indicates the DMU with best performance. Suppose that DMUk has the Largest E i and we have: E k i = max(E i )

(14)

Thus, the value of the shift for each of the DMUs is different and is calculated by: Shi = E k∗ Wi /Wk i = 1, . . . , n

Fi = Pi /(P(ANN∗ )i + Shi ) i = 1, . . . , n for input oriented model i = 1, . . . , n for input oriented model (16)

i = 1, . . . , n for input oriented model

9.

Calculate the efficiency scores The efficiency scores take values between 0 and 1. The maximum score is assigned to the unit used for the correction in each cluster:

Fi = (C(ANN∗ )i − Shi )/Ci

Vi = Pi /Average (P1 , . . . , Pi−1 , Pi+1 , . . . , Pcj )

8.

10.

(15)

In this approach in spite of the previous studies (Athanassopoulos and Curram (1996) called this measure “standardized efficiency”) the effect of the scale of DMUs

It should be noted that the efficient unit in cluster A is not more efficient than the efficient unit in cluster B although it may be better than it. In some cases, perhaps in a particular cluster, the error E i for all DMUs is negative. In this situation, by the proposed algorithm, frontier function from neural network is shifted to lower level of production. Moreover, the best unit is the DMU that has the lowest loss with respect to its scale. In this study, it has been noted that comparing various units with various scales and capitals is not logical. For example in power plant, increasing 100,000 MWh in the production of a 2,000 and 50 MW plants is more feasible for the 2,000 MW plant. In fact, if a 50 MW plant can produce 100,000 MWh more than predicted value of the ANN algorithm, its efficiency is more than a plant with the same added production of 2,000 MW plant. ANN modelling capability can be used to analyze DMUs’ status. The inputs of each DMU can be changed to foresee their impacts on the output. Moreover, DMUs can assess their own status with probability of changing inputs. Then DMUs can decide to increase or decrease each input with regard to their affect on efficiency. Also, DMUs can establish relationship with other attributes of DMU such as personnel attributes and yield efficiency (Azadeh and Javanmardi 2007). The case study The proposed algorithm is applied to an actual steam power generation plant. The proposed algorithm is applied and explained to the actual data set. The selected power plants (DMUs) essentially perform the same tasks and are homogeneous. This selection ensures that plants in the sample constitute a homogenous technology, thus forming a suitable sample for applying the proposed algorithm. The DMUs are 19 steam power plants in Iran for 1997–2004. It is assumed that the model is input-oriented because power plants have particular orders to fill (e.g., electricity generation) and hence the input quantities appear to be the primary decision variables.

123

238

J Intell Manuf (2011) 22:229–245

Table 3 Real data indicators for evaluating performance of conventional power plants (1997–2000) and (2001–2004) 1997 IC

1999 ICo

FC

GP

IC

ICo

FC

GP

Esfehan

835

329279

1099958

4363907

835

374496

1226427

4673887

Iranshahr

128

31081

132584

347549

128

53869

209113

650897

Besat

248

70690

345723

1197503

1280

439286

1864230

7005830

1280

64464

301947

1006278

466577

1814283

6677500

Bistoon

640

321182

987583

3905080

640

Tabriz

736

338000

1055127

4194640

736

314416

952885

3684730

376163

1219200

4820225

Ramin

1260

380375

1774332

7298784

1890

Zarand

60

11100

45238

121184

60

401456

1766489

7940506

15460

64018

240

99388

359522

1379867

183005

240

98732

359759

1360083

Rajaei

1000

412982

1487164

6280709

1000

385772

1498393

5888736

Salimi

1760

337938

Firoozi

50

14425

2644540

9962950

1760

320923

2713157

10092838

91093

233787

50

7059

44968

Madhaj

290

114861

73939

242386

1033094

290

87872

392738

1570995

Mofatteh

1000

370550

1083089

4633526

1000

385508

1211224

5142265

Montazeri

800

377682

1420517

5399730

1600

646582

2292959

8839074

Toos

600

232562

1050193

3756918

600

231469

1053040

3760082

Bandarabbas

Beheshti

247.5

Mashahd

133

58106

226835

661598

133

66235

247101

742750

Montazerghaem

626

254320

1057716

3920790

625

227990

940948

3529630

1998 IC

2000 ICo

FC

GP 4788579

IC

ICo

FC

GP

835

386886

1315642

128

67058

264044

834132

247.5

70575

326939

1089433

Esfehan

835

351887.5

1209563

Iranshahr

128

42475

246048

707111

Besat

247.5

67577

318269

1061026

452931.5

1808702

6602310

1280

483799

1936360

7076050

963206

3934980

640

334874

1048605

4181695

Bandarabbas Bistoon

1280 640

317799

5091712

Tabriz

736

357081.5

932911

3698715

736

381788

1241956

4853130

Ramin

1575

390915.5

1453842

6301687

1890

528661

2334391

10238875

Zarand

60

13280

62456

175639

60

14887

61964

172142

240

99060

324733

1252004

240

104456

417468

1576566

Rajaei

1000

399377

1337951

5797072

1000

413835

1507394

6483750

Salimi

1760

329430.5

2642707

9563050

1760

266939

2905615

10492131

Firoozi

50

73872

188691

50

3215

22023

56254

Madhaj

290

304735

1177177

290

108852

393838

1575351

1376720

5821394

Beheshti

10742 80905.5

Mofatteh

1000

378029

1142855

4945512

1000

409486

Montazeri

1200

512132

1620133

6117069

1600

733304

2631552

10247273

Toos

600

232015.5

997084

3553653

600

266989

1123230

3991182

Mashahd

133

62170.5

228970

654738

133

61515

536099

682881

Montazerghaem

626

942865

3469570

625

225110

935678

3426910

241155

2001 IC

2003 ICo

FC

GP

IC

ICo

FC 1353534

Esfehan

835

380422

1315319

5045028

835

383123

Iranshahr

128

65602

272739

865502

256

76538

123

322812.8

GP 5159787 1024768

J Intell Manuf (2011) 22:229–245

239

Table 3 continued 2001 IC Besat Bandarabbas

247.5 1280

2003 ICo

FC

GP

IC 247.5

ICo

97371

402697

1256959

500282

1759773

6306860

1280

521124

102313

FC 451699.9 2067974

GP 1354178 7329520

Bistoon

640

284381

882762

3574670

640

341280

1060294

4175810

Tabriz

736

327362

1015303

4053160

736

343049

1066862

4171440

Ramin

1890

635592

2606317

11327188

1890

595521

2484192

10516063

Zarand

60

22060

86974

243189

60

27925

111874

309276

1647855

Shazand

1300

365280

993077

4061178

1300

665394

Beheshti

240

109320

425810

1572894

240

101397

486229.9

7444968 1728820

Rajaei

1000

442179

1576409

6649454

1000

394500

1419424

6017929

Salimi

1760

337494

2845149

10241907

1760

315385

2681187

10003329

Firoozi

50

12301

87159

217896

50

13707

97763.7

241905

Madhaj

290

110374

396984

1587936

290

89828

299103.3

1196408

Mofatteh

1000

358077

1227777

5064299

1000

413842

1423815

5468079

Montazeri

1600

789265

2939317

11115901

1600

823033

3004335

11640505

Toos

600

265032

1046902

3725620

600

258318

1039862

3692738

Mashahd

120

580363

775823

120

72635

262383

771897

Montazerghaem

625.88

924527

3417070

242139

871940

3155480

67615.5 257902

2002 IC

625.88 2004

ICo

FC

GP

IC

ICo

FC

GP

Esfehan

835

413412

1431905

5486515

835

422673

1519828

5621431

Iranshahr

192

57733

237914

751759

256

140940

453860

1492847

Besat

91143

388880

1217997

1280

366441

1415261

5012560

1280

Bistoon

640

276674

858706

3455480

Tabriz

736

272689

833390

3367600

Ramin

1890

622654

2658594

Zarand

60

31085

111268

1300

607092

1558995

Bandarabbas

Shazand Beheshti

247.5

247.5

139505

489664

1500253

588855

2109162

7196540

640

350154

1101614

4210280

736

361080

1233226

4341330

11494462

1890

636643

2686567

10861867

309696

60

29698

124479

341402

6806304

1300

642909

1650253

7438002

240

111710

463475

1738341

240

85307

404894

1435991

Rajaei

1000

429868

1441756

6079938

1000

421015

1572620

6342203

Salimi

1760

362650

2672674

9867577

1760

301276

3085375

11310817

Firoozi

50

12858

95242

240363

50

13039

98275

212403

Madhaj

290

97091

361081

1444325

290

81674

227104

922587

Mofatteh

1000

398613

1333949

5411964

1000

390708

1335352

5134547

Montazeri

1600

802543

3298201

11273792

1600

796262

2949382

11137177

Toos

600

264541

1107494

3889653

600

271901

1078599

3831065

Mashahd

120

73716

272196

816277

120

63050

229087

665887

Montazerghaem

625.88

159212

903907

3318380

241139

891334

3297100

625.8

IC Install capacity (MW); ICo Internal consumption (MWh); EC Fuel consumption (TJ); GP Gross production (MWh)

Step 1: As shown by several authors, the production function for conventional thermal steam-electric production may be described conveniently within an engineering

framework. In this framework, pertinent inputs are the fuel quantity consumed and installed power, which is the maximum nominal power the plants are initially designed

123

240

J Intell Manuf (2011) 22:229–245

Table 4 The MAPE for the 20 Models Model

Number of neurons in hidden layera

Install Capacity

MAPE

1

1

0.066

2

2

0.045

3

3

0.080

4

4

0.016

5

5

0.051

6

6

0.089

7

7

0.045

8

8

0.019

9

9

0.035

10

10

0.036

11

11

0.028

12

12

0.081

13

13

0.097

14

14

0.044

15

15

0.056

16

16

0.058

17

17

0.094

18

18

0.066

19

19

0.079

20

20

0.056

a Hidden

layer activation function which is defined for this single layer is Tan–Sigmoid. This non-linear feature is introduced at the hidden transfer function. From the previous universal approximation studies, these transfer function must have mild regularity conditions: continuous, bounded, differentiable and monotonic increasing. The most popular transfer function is sigmoid or tanh. TanSig (n) calculates its output according to: n = 2/(1 + exp(−2 × n)) − 1. This is mathematically equivalent to tanh (n). This transfer functions bound the output to a finite range [−1, 1]

for. On the other hand, labor inputs contribute to production through control and maintenance services, which also require some capital. The output is, of course, electrical energy production. But by noting studies about efficiency measurement of thermal power generations in Iran which indicate that labor isn’t an effective factor (for instance, Emami Meibodi 1998), in our study, electric power (in megawatt hour) generated from thermal power plants in each DMU (GP) is used as the output variable, while Install Capacity (IC), Fuel Consumption (FC), and Internal Consumption (ICo) are three inputs used for power generation. Capital is measured in terms of installed thermal generating capacity in megawatt (MW (Fare et al. 1983; Hawdon 1997). Various natural elements have been used as fuel in the production of electric power in various steam plants in Iran (natural gas, gas oil and Mazute). The choice of fuel depends on many factors such as availability, cost and environmental concerns and each fuel has its limitations. Our figures measure fuel consumption in terms of Tera Joule (TJ). In other words, our figures have already adjusted for

123

1 Gross Production Internal Consumption

2 . . .

Fuel Consumption

Input layer

4

Hidden layers

Output layer

Fig. 5 The architecture of the selected ANN model (8th model)

the quality of fuel used in different plants. Internal power is the amount of energy consumed (in megawatt hour) within the site (for electrically powered equipment etc). Step 2: 148 rows of data are collected from 1997 to 2004. Table 3 shows the real data for the inputs and output used in the algorithm. Detailed information about power generation of thermal power plants in Iran, like total output, generation capacity and fuel consumption can be obtained from “Electric Power Industry in Iran” (including transmission and distribution) published by the Tavanir (1997– 2004). Current period (2004) is considered as Sc . Step 3: In our problem, each period have all range of outcomes. Therefore, all data for period can be selected for test or can be sorted by the value of the output variable, partitioned and one validation data point is chosen at random from each partition. In this way the stratification tries to ensure that validation data is chosen across the range of outcomes. S1 is data from 1997 to 2002 (110 rows of data) and S2 is 2003 data (19 rows of data). Step 4: In this study the value of the desired minimum error has been defined between 2 and 4% (96–98% confidence) and the value of q has been defined 20. The error is estimated by Mean Absolute Percentage Error (MAPE). In order to get the best ANN for the electricity production in steam power plants, 20 MLP–LM models are tested to find the best architecture. The architecture of the MLP– LM models and their MAPE values are shown in Table 4. It seems the 4th model (it has four neurons in single hidden layer) has the lowest MAPE error and consequently is chosen as the preferred model. In Fig. 5, the ANN architecture for the preferred network (4th model) is shown. Step 5: Therefore, this model is selected for estimating the electricity production in 2004.

J Intell Manuf (2011) 22:229–245

241

Table 5 Estimation of efficiency scores for all DMUs by proposed algorithm in 2004 DMU name

Preal(i)

PANN(i)

Ei

Wi

E i

Shi

Fi

Ranking

Cluster A Montazerghaem

0.280

0.282

−0.002

0.195

−0.008

0.004

0.982

3

Bistoon

0.359

0.354

0.005

0.274

0.019

0.005

1

1

Toos

0.326

0.325

0.001

0.240

0.004

0.005

0.989

2

Tabriz

0.370

0.398

−0.028

0.292

−0.097

0.006

0.916

4

0.125

0.154

−0.03

0.25

−0.13

0.015

0.70

4

−0.45

0.002

0.36

7

0.01

1

1

Cluster B Besat Firoozi

0.013

0.033

−0.02

0.04

Beheshti

0.119

0.109

0.01

0.16

0.059

Madhaj

0.075

0.084

−0.009

0.12

−0.073

0.007

0.82

2

Zarand

0.025

0.043

−0.018

0.05

−0.3

0.003

0.53

6

Mashhad

0.053

0.071

−0.018

0.1

−0.17

0.006

0.68

5

Iranshahr

0.124

0.15

−0.03

0.24

−0.13

0.014

0.75

3

Salimi

0.972

0.942

0.030

0.175

0.170

0.061

0.969

2

Shazand

0.637

0.601

0.036

0.104

0.350

0.036

1

1

Rajaei

0.543

0.538

0.005

0.092

0.051

0.032

0.952

3

Mofatteh

0.438

0.459

−0.021

0.077

−0.269

0.027

0.902

7

Ramin

0.933

0.967

−0.034

0.180

−0.189

0.063

0.906

6

Bandarabbas

0.616

0.619

−0.003

0.107

−0.025

0.038

0.939

5

Cluster C

Esfahan

0.480

0.518

−0.038

0.088

−0.429

0.031

0.875

8

Montazeri

0.957

0.953

0.004

0.177

0.020

0.062

0.942

4

Step 6: The best partitioning of the data is achieved with 3 clusters. To identify the best fitness, the required parameters on GA algorithm are as follows: • Population size: 100 • Iteration: 1,500 • Crossover rate: 80% Steps 7–10: The results of these steps for each of the clusters are shown in Table 5. It is observed that medium, small and large plants belong to clusters A, B, and C respectively. The proposed algorithm estimates the efficiencies by the best estimated neural networks for 2004. The main results of the proposed algorithm, proposed algorithm without clustering and DEA are summarized in the Table 6. The results are compared through t-test. Several differences are clearly appreciated from the statistical tests. Also, DEA which is a well known performance assessment tool is used for validation with proposed algorithm without clustering. It is compared with the results of proposed and proposed without clustering through paired t-test. • The mean technical efficiencies obtained from the proposed algorithm and DEA is relatively small ( p-value:

0.52; higher p-value). Therefore, the proposed algorithm is verified and validated by DEA. • Mean technical efficiencies obtained from the proposed algorithm without clustering and DEA is significantly different ( p-value: 0.16; small p-value). Therefore, clustering benefits the proposed algorithm.

Examination of proposed algorithm in outliers and corrupted data situation The capability of proposed algorithm in handling outliers and corrupted data is presented. We suppose that imaginary outlier data is yielded from human error. Also, it is assumed that outlier or corrupted data is occurred in inputs variable. Let us suppose that one of input variable of Besat DMU (Fuel Consumption) is changed from 0.13 to 0.013 that is yielded from human error. The result of proposed algorithm and DEA after this error is shown in Tables 7 and 8. MAPE is used to compare the results. Table 9 shows MAPE value for proposed algorithm. Examination of MAPE values show that the proposed algorithm is approximately eight times better than DEA. The same procedure is done for Bandarabbas DMU. Second input variable of Bandaabbas DMU (Internal Consumption) is changed from 0.71 to 0.071. The results

123

242 Table 6 Efficiency results for the conventional power plants in 2004

Table 7 The impact of outlier and corrupted data on proposed algorithm result

J Intell Manuf (2011) 22:229–245

DMU

Efficiency scores by the proposed algorithm

Efficiency scores by the proposed algorithm without clustering

Efficiency scores by DEA-CCR

Bandarabbas

0.93

0.91

0.81

Beheshti

1.00

1

0.92

Besat

0.72

0.71

0.83

Bistoon

1.00

0.92

0.98

Esfahan

0.87

0.83

0.97

Firoozi

0.38

0.38

0.64

Iranshahr

0.75

0.74

0.81

Madhaj

0.819

0.83

0.91

Mashhad

0.68

0.67

0.72

Mofatteh

0.90

0.86

0.80

Montazerghaem

0.98

0.91

0.82

Montazeri

0.94

0.91

1

Rajaei

0.95

0.92

1

Ramin

0.90

0.85

0.99

Salimi

0.969

0.91

1

Shazand

1.00

0.93

1

Tabriz

0.91

0.85

0.82

Toos

0.98

0.91

0.91

Zarand

0.52

0.53

0.81

Mean (μTE )

0.852

0.819

0.881

Proposed algorithm

DMU name

Cluster A

Montazerghaem Bistoon

Cluster B

97.99 100

Original efficiency score (%)

97.99 100

98.94

98.94

Tabriz

91.61

91.61

91.61

Besat

92.20

70.94

70.94

Firoozi

36.48

36.46

36.46

100

100

98.94

100

Madhaj

82.11

82.09

82.09

Zarand

53.78

53.74

53.74

Mashhad

68.78

68.75

68.75

Iranshahr

75.15

75.19

75.19

Salimi Shazand

123

97.99 100

Efficiency score after Bandarabbas noise (%)

Toos

Beheshti

Cluster C

Efficiency score after Besat noise (%)

96.93 100

96.95 100

96.93 100

Rajaei

95.29

95.29

95.29

Mofatteh

90.17

90.16

90.17

Ramin

90.62

90.64

90.62

Bandarabbas

93.56

73.87

93.56

Esfahan

87.50

87.50

87.50

Montazeri

94.34

94.35

94.34

J Intell Manuf (2011) 22:229–245

Table 8 The impact of outlier and corrupted data on DEA result

243

DEA

DMU name

Efficiency score after Besat noise

Original efficiency score

Bandarabbas

0.71

1

0.81

Beheshti

0.87

0.69

0.92

Besat

1

0.72

0.83

Bistoon

0.79

0.81

0.98

Esfahan

0.81

0.68

0.97

Firoozi

0.64

1

0.64

Iranshahr

0.71

0.77

0.81

Madhaj

0.64

0.62

0.91

Mashhad

0.57

0.8

0.72

Mofatteh

0.6

0.81

0.8

Montazerghaem

0.69

0.68

0.82

Montazeri

0.82

0.88

1

Rajaei

0.84

0.68

1

Ramin

0.83

0.65

0.99

Salimi

1

1

1

Shazand

0.75

0.58

1

Tabriz

0.68

0.56

0.82

Toos

0.83

0.71

0.91

Zarand

0.81

0.9

0.81

Table 9 The MAPE value for Besat and Bandarabbas in proposed algorithm and DEA

Besat Bandarabbas

Efficiency score after Bandarabbas noise

Proposed algorithm

DEA

0.015 0.011

0.14 0.13

show that the proposed algorithm is approximately 11 times better that DEA in viewpoint of handling outliers and corrupted data. This two sample show that the results of proposed algorithm are more robust than the DEA. Conclusion A highly unique flexible ANN GA Clustering Ensemble algorithm was proposed to measure the DMUs efficiency scores. It is composed of ten steps. The non-linearity of the neural networks and its universal approximations makes the algorithms highly flexible. Furthermore, it can handle outliers and corrupted data much better than conventional econometric approach and DEA. This is shown by two examples related to outlier situations. Examination of the two example show that the proposed algorithm is 9 and 11 times better than DEA in handling outliers and corrupted data. It is also capable of performing optimization analysis and forecast-

ing for a given set of data. It was applied to a set of actual steam power plants to show its applicability and superiority. Furthermore, the results and rankings were compared with the proposed algorithm without clustering. Results indicate that the proposed algorithm estimates the values of efficiency scores closer to the ideal efficiency in viewpoint of DEA through statistical t-test. In summary, the algorithm is able to find a stochastic frontier based on a set of input–output observational data and do not require explicit assumptions about the functional structure of the stochastic frontier. Furthermore, it uses a similar approach to econometric methods for calculating the efficiency scores. Moreover, the effect of the return to scale of decision making unit (DMU) on its efficiency is included and the unit used for the correction is selected by notice of its scale (under constant return to scale assumption). Also, in this algorithm, for increasing DMUs’ homogeneousness, GA is used to cluster DMUs. Although it is believed that ANNs can be a potential alternative for measuring technical efficiency and can outperform other techniques when the production process is unknown, there is still a lack of both theoretical and empirical work in efficiency analysis and consequently optimization analysis. In future, some cases with more outputs or inputs can be studied and also the proposed algorithm can be expanded by encountering the lack of data. Designing proper algorithms

123

244

for cases with data of only one period is a candidate for future research. In summary, it should be mentioned that pervious studies do not consider the all of following features of the proposed algorithm 1. High flexibility; 2. Handling outlier and statistical noise data and o sensitive to the presence of the outliers and statistical noise; 3. Forecasting capability; 4. Low assumptions about functional form and data and do not require explicit assumptions about the functional structure of the stochastic frontier; 5. Optimum clustering ability; 6. No sensitivity to initial points in clustering section.

Acknowledgments The authors are grateful for the valuable comments and suggestion from the respected reviewers. Their valuable comments and suggestions have enhanced the strength and significance of our paper.

References Aarts, E. H. L., Eiben, A. E., & vanHee, K. M. (1989). A general theory of genetic algorithms. Tech.Rep.89/08, Einndhoven University of Thechnology. Athanassopoulos, A. D., & Curram, S. P. (1996). A comparison of data envelopment analysis and artificial neural networks as tool for assessing the efficiency of decision making units. Journal of the Operational Research Society, 47(8), 1000–1016. Azadeh, A., Ghaderi, S. F., Anvari, M., & Saberi, M. (2006a). Measuring performance electric power generations using artificial neural networks and fuzzy clustering. In G. A. Capolino, L. G. Franquelo (Eds.), Proceedings of the 32nd annual conference of the IEEE industrial electronics society. Paris, France: IECON. Azadeh, A., Ghaderi, S. F., Anvari, M., & Saberi, M. (2007a). Performance assessment of electric power generations using an adaptive neural network algorithm. Energy Policy, 35, 3155–3166. Azadeh, A., Ghaderi, S. F., Anvari, M., Saberi, M., & Izadbakhsh, H. (2006b). An integrated artificial neural network and fuzzy clustering algorithm for performance assessment of decision making units. Applied Mathematics and Computation, 187(2), 584–599. Azadeh, A., Ghaderi, S. F., & Sohrabkhani, S. (2007b). Forecasting electrical consumption by integration of neural network, time series and ANOVA. Applied Mathematics and Computation, 186, 1753–1761. Azadeh, A., Ghaderi, S. F., Tarverdian, S., & Saberi, M. (2006c). Integration of artificial neural networks and GA to predict electrical energy consumption. In G. A Capolino & L. G. Franquelo (Eds.), Proceedings of the 32nd annual conference of the IEEE industrial electronics society-IECON’06. Paris, France: Conservatoire National des Arts & Metiers. Azadeh, A., Ghaderi, S. F., Tarverdian, S., & Saberi, M. (2007). Integration of artificial neural networks and genetic algorithm to predict electrical energy consumption. Applied Mathematics and Computation, 186, 1731–1741. Azadeh, A., & Javanmardi, L. (2007). Studying impact of decision making units feature on efficiency by integration of data

123

J Intell Manuf (2011) 22:229–245 envelopment analysis and data mining tools. In International conference on operations research. German Azimi, J., Mohammadi, M., Movaghar, M., Analoui, A., & Clustering, M. (2006). Ensembles using genetic algorithm. In International workshop on computer architecture for machine perception and sensing, 2006 CAMP (pp. 119–123). Bauer, P. W. (1990). Recent developments in the econometric estimation of frontiers. Journal of Econometrics, 46(1–2), 39–56. Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making units. European Journal of Operational Research, 2(6), 429–444. Chiang, W. C., Urban, T. L., & Baldridge, G. W. (1996). A neural network approach to mutual fund net asset value forecasting. Omega, The International Journal of Management Science, 24, 205–215. Cook, W. D., & Green, R. H. (2005). Evaluating power plant efficiency: A hierarchical model. Computers & Operations Research, 32(4), 813–823. Costa, A., & Markellos, R. N. (1997). Evaluating public transport efficiency with neural network models. Transportation Research C, 5(5), 301–312. Cybenko, G. (1989). Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2, 303– 314. Davis, L. (Ed.). (1991). Handbook of genetic algorithms. New York: Van Nostrand Reinhold. Delgado, F. J. (2005). Measuring efficiency with neural networks, an application to the public sector. Economics Bulletin, 3(15), 1–10. Doyle, J., & Green, R. (1994). Efficiency and cross-efficiency in DEA: Derivations, meanings and uses. Journal of the Operational Research Society, 45(5), 567–578. Emami Meibodi, A. (1998). Efficiency considerations in the electricity supply industry: The case of Iran. A thesis submitted to university of Surrey for the degree of Doctor of Philosophy. Fare, R., Grosskopf, S., & Lovell, C. A. K. (1983). The measurement of efficiency of production. Boston: Kluwer. Golany, B., Yaakov, R., & Rybak, D. (1994). Measuring efficiency of power plants in Israel by data envelopment analysis. IEEE Transactions on Engineering Management, 41(3), 291–301. Goldberg, D. E. (1989). Genetic algorithms in search optimization and machine learning. New York: Addison-Wesley. Goto, M., & Tsutsui, M. (1997). Comparison of productive and cost efficiencies among Japanese and US electric utilities. Tokyo, Japan: Central Research Institute of Electric Power Industry. Hawdon, D. (1997). Improving the performance of electricity industries in developing countries: Is World Bank policy on deregulation the way forward. Economics Departmental Research Seminar, University of Surrey. Hill, T., O’Connor, M., & Remus, W. (1996). Neural network models for time series forecasts. Management Science, 42(7), 1082–1092. Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feed forward networks are universal approximators. Neural Networks, 2, 359–366. Indro, D. C., Jiang, C. X., Patuwo, B. E., & Zhang, G. P. (1999). Predicting mutual fund performance using artificial neural networks. Omega, The International Journal of Management Science, 27, 373–380. Jamasb, T., & Pollitt, M. (2001). Benchmarking and regulation of electricity distribution and transmission utilities: Lessons from international experience. DEA Working Paper, Cambridge, UK: Department of Applied Economics, University of Cambridge. Jhee, W. C., & Lee, J. K. (1993). Performance of neural networks in managerial forecasting. Intelligent Systems in Accounting, Finance and Management, 2, 55–71. Knittel, C. R. (2002). Alternative regulatory methods and firm efficiency: Stochastic frontier evidence from the US electricity

J Intell Manuf (2011) 22:229–245 industry. The Review of Economics and Statistics, 84(3), 530– 540. Kohzadi, N., Boyd, M. S., Kermanshahi, B., & Kaastra, I. (1996). A comparison of artificial neural network and time series models for forecasting commodity prices. Neurocomputing, 10(3), 169–181. Lam, P., & Shiu, A. (2001). A data envelopment analysis of the efficiency of China’s thermal power generation. Utilities Policy, 10, 75–83. Minaei, B., Topchy, A., & Punch, W. F. (2004). Ensembles of Partitions via Data Resampling. In Proceedings of the international conference on information technology, ITCC 04. Las Vegas. Mohammadi, M., Davoodi, R., & Rahmani, A. (2007). A genetic based clustering method. In Proceedings of 12th annual international computer society of iran computer conference (CSICC). Olatfubi, W. O., & Dismukes, D. E. (2000). A data envelopment analysis of the level and determinants of coal-fired electric power generation performance. Utilities Policy, 9, 47–59. Park, S., & Lesourd, J. (2000). The efficiency of conventional fuel power plants in South Korea: A comparison of parametric and non-parametric approaches. International Journal of Production Economics, 63, 59–67. Patuwo, E., Hu, M. Y., & Hung, M. S. (1993). Two-group classification using neural networks. Decision Sciences, 24(4), 825–845. Pendharkar, P. C. J. A., & Rodger, J. A. (2003). Technical efficiencybased selection of learning cases to improve forecasting accuracy of neural networks under monotonicity assumption. Decision Support Systems, 36(1), 117–136. Pollit, M. G. (1995). Ownership and performance in electric utilities. Oxford, UK: Oxford University Press. Rumelhart, D. E., & McClelland, J. L. (1986). Parallel distributed processing: Explorations in the microstructure of cognition, Vol. 1, Foundations. Cambridge, MA: MIT Press. Sanhueza, R., Rudnick, H., & Lagunas, H. (2004). DEA efficiency for the determination of the electric power distribution added value. IEEE Transactions on Power Systems, 19(2), 919–925. Santin, D., Delgado, F. J., & Valino, A. (2004). The measurement of technical efficiency: A neural network approach. Applied Economics, 36, 627–635. Stern, H. S. (1996). Neural networks in applied statistics. Technometrics, 38(3), 205–214. Strehl, A., & Ghosh, J. (2002). Cluster ensembles—acknowledge reuse framework for combining multiple partitions. Journal on Machine Learning Research, 3, 583–617.

245 Strehl, A., & Ghosh, J. (2002). Cluster ensembles—a knowledge reuse framework for combining partitionings. In National Conference on Artificial Intelligence (pp. 93–98). AB, Canada. Sueyoshi, T., & Goto, M. (2001). Slack-adjusted DEA for time series analysis: Performance measurement of Japanese electric power generation industry in 1984–1993. European Journal of Operation Research, 133, 232–259. Talluri, S., & Sarkis, J. (1997). Extensions in efficiency measurement of alternate machine component grouping solutions via data envelopment analysis. IEEE Transactions on Engineering Management, 44(3), 299–304. Tamura, H., & Tann, K. (2008). Midpoint-validation method of neural networks for pattern classification problems. International Journal of Innovative Computing, Information and Control, 4(10), 2475– 2482. Tang, Z., Almeida, C., & Fishwick, P. (1991). A Time series forecasting using neural networks vs Box–Jenkins methodology. Simulation, 57(5), 303–310. Tang, Z., & Fishwick, P. A. (1993). Back-propagation neural nets as models for time series forecasting. ORSA Journal on Computing, 5(4), 374–385. Tavanir Management Organization. (1997–2004). Electric power industry in Iran 1997–2004. Tavanir management organization. Topchy, A., Minaei-Bidgoli B., Jain, A. K., & Punch, W. (2004). Adaptive Clustering ensembles. In Proceedings of the international conference on pattern recognition, ICPR’04 (pp. 272–275). Cambridge, UK. Wang, S. (2003). Adaptive non-parametric efficiency frontier analysis: a neural-network-based model. Computers & Operations Research, 2003(30), 279–295. Wang, X., Zhao, X., & Feng, Y. (2008). A single parameter dynamic searching algorithm for multi-layer neural networks (part II). International Journal of Innovative Computing, Information and Control, 4(11), 2941–2954. Wu, D., Yang, Z., & Liang, L. (2006). Using DEA-neural network approach to evaluate branch efficiency of a large Canadian bank. Expert Systems with Applications, 31, 108–115.Z Zhang, Z., Yang, G., Yi, J., Zhu, Y. Y., & Tang, Z. (2008). A new stochastic dynamic adaptive local search algorithm for Elman neural network. International Journal of Innovative Computing, Information and Control, 4(11), 2927–2940.

123