Chaos Genetic Algorithm For Nonlinear Optimization
Chaos Genetic Algorithm For Nonlinear Optimization
To reduce the computational amount and improve estimation accuracy for nonlinear
optimizations, a new algorithm, K-means clustering with Chaos Genetic Algorithm
(KCGA) is proposed, in which initial population are generated by chaos mapping and
refined by competition. Within each iteration of this approach, in addition to the
evolution of genetic algorithm (GA), the K-means Clustering algorithm is applied to
achieve faster convergence and lead to a quick evolution of the population.
The main purpose of the paper is to demonstrate how the GA optimizer can be
improved by incorporating a hybridization strategy. Experimental studies revealed
that the hybrid KCGA approach can produce much more accurate estimates of the
true optimum points than the other two optimization procedures, the chaos genetic
algorithm (CGA) and GA. Further, the proposed hybrid KCGA approach exhibits
superior convergence characteristics when compared to other algorithms in this paper
separately. On the whole, the new approach is demonstrated to be extremely effective
and efficient at locating optimal solutions and verified by an empirical example from
construction.
Keywords: Chaos; optimization; K-means clustering; Genetic Algorithm
____________________________________________________________________
1. Introduction
1
At the meantime, GA are highly parallel randomly searching algorithms that imitate
the life evolution as proposed in Darwinian survival of the fittest principle (Hibbert,
1993; Lavine and Moores, 1999). Critical genetic operations such as the encoding of
the solution of optimizing problem, the designing of the fitting function according to
its application, and the crossover and mutation for offspring, play important roles in
GA (Holland, 1975; Zhao, Chen & Hu, 2000).
In chaos, a small difference in the initial conditions may produce an enormous error
in the final phenomena. It is extremely sensitive to the initial conditions, and its
property sometimes referred to as the instability in the so-called butterfly effect or
Liapunove’s sense (Lorenze, 1963; Kim and Stringer, 1992). Sensitive dependence on
initial conditions is often exhibited by multiple elements with nonlinear interactions in
the systems. Owing to chaos characteristic, the system could be designed as an
efficient approach for maintaining the population diversity in the problem of interest.
Clustering is one of the most important and the most challenging of classifying
algorithms. A successful clustering algorithm is able to reliably find true natural
groupings in the data set. K-means is one of the well-known algorithms for clustering,
originally known as Forgy’s method (Forgy, 1965). K-means clustering is the process
2
of dispatching a set of objects into groups or clusters of similarities. Objects collected
in the same cluster have similar features, but others are not (Han & Kamber, 2001).
K-means is famous for its simplicity and computational efficiency in clustering
techniques. As aforementioned Chaotic algorithm is for population diversity in GA,
and K-means is for convergence efficiency in evolution. The former will keep the
system accuracy, and the later will decrease iteration times of GA significantly.
X n 1 f ( , X n ) X n (1 X n ) (1)
in which μ stands for a control parameter, X for a variable and n =0,1,2,3…. It is easy
to find that equation (1) is a deterministic dynamic system. The variable X is also
called as chaotic variable. The basic characteristic of chaos could be presented by Eq.
(1), for a very small difference in the initial value of X will cause large difference in
its long-term behavior.
The variation of control parameter μ of Eq. (1) will directly impact the behavior of
X greatly. Usually, [0, 4] has been defined as domain area of control parameter μ.
Different value in domain area of μ will determine whether X stabilizes at a constant
size or behaves chaotically in an unpredictable pattern. The track of chaotic variable
looks like in disorder. However, it can travel ergodically over the whole space of
3
interest especially under the condition of μ = 4. Then, a tiny difference in initial value
of the chaotic variable would result in considerable differences of the values of
chaotic variable later. Generally, there are three primary characteristics of the
variation of the chaotic variable, i.e. ergodicity, irregularity and pseudo-randomness
( Bountis, 1995; Li & Jiang, 1998; Ohya, 1998).
4
(3) Crossover: the crossover process was to choose two individuals as parents
randomly. This study applies one-point crossover process in which the point is
randomly selected in the list of fields. All the fields lying after this point was
exchanged between the two parents to create two new offspring.
(4) Mutation: The mutation process is a probability-based procedure in which a
heuristic operation was employed to find shortest path from a random point. Then,
a correction action is taken to keep individuals meeting the legal requirements, in
case of necessary.
The above four steps are iterated in this study until a satisfactory solution is found or
the terminating criterion is met.
In this study, while a crossover has finished, the new generated offspring may not
follow the designed rule to visit every node once and move back to the starting point.
A new offspring will compare with the swapped and original portion to verify if the
members are identical. Same members lead to a sound crossover while duplicated
members with parents need to be legalized. For instance, a one-point crossover was
introduced; the random selected point of field is 3 and 2 shown on the following two
tables.
Table 1
Legalization to crossover with identical members
Parents Selected field Swapping Operation Offspring
123541 123451 123451
Crossover on field 3 Equal to
132451 132541 132541
Table 2
Legalization to crossover with un-identical members
Parents Selected field Swapping Operation Offspring
123541 122451 132451
Crossover on field 2 Legalization
132451 133541 123541
5
variable. It can keep the individuals distributed ergodically in the defined space and
avoid from the premature of generations. And, CGA also takes the inherent advantage
of GA over convergence to overcome the randomness of the chaotic process and
hence to increase the probability of finding the global optimal solution.
6
in which cxi is the ith chaotic variable, and (k) and (k+1) denote the number of
iterations. Then a linear mapping function was used to convert chaotic variable to a
certain interval. In this study the linear mapping function could be described as:
in which xik 1 is the ith working variable, and (k) and (k+1) denote the number of
K-means plays a critical role in convergence of GA. Chaos algorithm can keep GA
population diversity and avoid from premature. To take advantages of the above
described benefits in GA, a novel algorithm combined K-means clustering and the
Chaos algorithms with genetic algorithms was proposed as a powerful hybrid
algorithm called KCGA (K-means and Chaos in Genetic Algorithm). Initial
population of KCGA should be generated from chaotic algorithm, and then chaos
function would adjust the individuals after mutation with the same proability. After
mutation, K-means clustering in this study will help to group population in several
clusters as pre-defined. Thus, location information of each centroid of cluster would
be treated as candidate individuals for next generation. A competing procedure was
employed to eliminate lower fitness value individuals, and reserved the others to
create formal population for KCGA iteration.
7
Generate initial Calculate Reserve top
Encode problem population by fitness fit individuals
chaos algorithm & rank to population size
Yes
Output the best Meet
individual criterion?
No
4. Experimental result
Construction work includes many inherently hazardous conditions and tasks such
as work at noise, dust, height, excavations, etc. For example, construction has about
6% of U.S. workers, but 20% of the fatalities - the largest number of fatalities
reported for any of the industry sectors. These were announced by National institute
for occupation safety and health (NIOSH) in 2008. In this study, a simulated case of
ten building-construction sites was used for an auditor of safety and health. The
auditor should start from one of the building-construction sites and travel to every site
before returning back to the same place. The target is to find out the shortest path
along every construction site.
After assigning each construction site an integer number, the distances between
each site could be recorded and create a matrix. The fitness function was designed to
calculate the total distance along the path. Any set of random integer number within
[1, 10] may stands for a different path. To comply with the real world, it is critical to
legalize offspring during the KCGA iteration, especially after the crossover and
mutation procedure. Chaotic algorithm will impact the visit priority of each site.
However, each centroid of cluster derived from K-means may not be integer. They
need to re-legalize again to get their integer sequence as one of the quasi individuals
8
for next generation. All experiments are completed on Core 2 CPU T5500 @
1.66GHz PCs with 2GB memory. The results reported are all averaged over 50
independent runs. The parameters, such as mutation rate, crossover rate, generation
limit, are given under the results.
Table 3
The performance of four methods
KCGA KGA CGA GA
(0.90) (0.80) (0.85) (0.85)
Avg. Std. E. Avg. Std. E. Avg. Std. E. Avg. Std. E.
Iteration 23.9 5.0 31.0 38.3 71.8 132.5 105.0 171.0
Time(sec.) 1.3 0.3 1.5 2.0 1.1 1.6 1.1 1.5
Min. 38.1 0.3 38.3 0.6 38.1 0.3 38.1 0.2
Max. 38.1 0.3 38.3 0.6 38.2 0.5 39.2 3.0
Fitness 2286.0 18.5 2295.0 33.0 2289.0 22.0 2291.0 26.8
Notes: mutation rate = 0.01, crossover rate = 0.8, population size = 60,
generation limit = 500, Avg.: Average, Std. E.: Standard Error
(*) = accurate ratio
GA, integrated with K-means clustering technique and chaos algorithm, could
promote its accuracy and reduce the converging time. Migration from GA to KCGA,
listed in table 3, has shown that KCGA improves the accuracy of GA, and diminishes
the amount of evolution runs significantly.
5. Conclusions
9
This study has proposed a procedure which joins K-means and chaos attributes
based on genetic algorithm. The proposed procedure is not only to enhance the
diversity of GA for more accuracy but also to extract clustering rules for achieving a
potential trend of evolution. Additionally, it can effectively improve some drawbacks
of traditional GA, such as long running time and getting trapped in local optima.
Furthermore, this proposed procedure can really contribute to construction
management in real world.
References
Han, J., & Kamber, M. (2001). Data mining: Concepts and techniques. San Francisco:
Morgan Kaufmann Publishers.
Hugget, A., Se´bastian, P., & Nadeau, J. P. (1999). Global optimization of a dryer by
using neural networks and genetic algorithms. American Institute of Chemical
Engineering Journal , 45 (6) 1227_/1238.
Kim, J.H., Stringer, J., (1992). Applied Chaos. Wiley, New York.
10
MacQueen, J. B. (1967). Some methods for classification and analysis of multivariate
observations. In Proceedings of 5th berkeley symposium on mathematical statistics
and probability (pp. 281–297). Berkeley: University of California Press.
May, R., (1976). Simple mathematical model with very complicated dynamics. Nature
261, 45–67.
11