Q-LSHADE-PS: An Individual-Level Adaptive Differential Evolution with Q-Learning and History-Based Parameter Adaptation

Yang Cao; Hedong Peng

doi:10.62051/ijcsit.v8n5.02

Authors

Yang Cao
Hedong Peng

DOI:

https://doi.org/10.62051/ijcsit.v8n5.02

Keywords:

Differential Evolution, Reinforcement Learning, Adaptive Strategy

Abstract

Differential Evolution (DE) is widely used for continuous optimization due to its simple structure and strong global search ability. However, classical and many adaptive DE variants often suffer from premature convergence and diversity loss on complex problems, where suitable operators may vary across individuals and search stages. To address this issue, this paper proposes Q-LSHADE-PS, an Linear Population Size Reduction Success History based Adaptive Differential Evolution (LSHADE) variant that equips each individual with state-conditioned, tabular Q-learning for mutation strategy selection, while preserving LSHADE’s success-history parameter adaptation, external archive, and linear population size reduction (LPSR). Each individual maintains a compact Q-table to adaptively select mutation strategies according to its stagnation state and the global search phase. In addition, a population-size-aware Q-table decay mechanism is introduced to prevent outdated strategy preferences from dominating after population reduction, thereby maintaining exploration capability under non-stationary search dynamics. Experimental results on standard benchmark suites demonstrate that the proposed algorithm achieves superior or highly competitive performance compared with several state-of-the-art DE variants, while introducing only negligible computational overhead. These results indicate that individual-level reinforcement learning provides an effective and practical mechanism for adaptive strategy control in modern DE frameworks.

Downloads

Download data is not yet available.

References

[1] Storn R, Price K. Differential evolution a simple and efficient heuristic for global optimization over continuous spaces [J]. Journal of Global Optimization, 1997, 11(4): 341-359.

[2] Tanabe R, Fukunaga A. Success-history based parameter adaptation for differential evolution [C]. IEEE Congress on Evolutionary Computation, Cancun, 2013: 71-78.

[3] Piotrowski A P. LSHADE optimization algorithms with population-wide inertia [J]. Information Sciences, 2018, 468: 117-141.

[4] Luo W, Yu X, Wei Y. Solving combined economic and emission dispatch problems using reinforcement learning-based adaptive differential evolution algorithm [J]. Engineering Applications of Artificial Intelligence, 2023, 126: 107002.

[5] Durgut R, Aydin M E, Atli I. Adaptive operator selection with reinforcement learning [J]. Information Sciences, 2021, 581: 773-790.

[6] Padakandla S, Prabuchandran K J, Bhatnagar S. Reinforcement learning in non-stationary environments [J]. Applied Intelligence, 2020, 50(11): 3591-3606.

[7] Giannopoulos P G, Malamas V, Dasaklis T K. Integration of evolutionary algorithms and machine learning techniques in routing-related problems: A review [C]. Panhellenic Conference on Informatics, 2025: 237-243.

[8] Brest J, Maucec M S, Boskovic B. Single objective real-parameter optimization: Algorithm jSO [C]. IEEE Congress on Evolutionary Computation, 2017: 1311-1318.

[9] Li Y, Han T, Wang X, Zhou H, Tang S, Huang C, Han B. MjSO: A modified differential evolution with a probability selection mechanism and a directed mutation strategy [J]. Swarm and Evolutionary Computation, 2023, 78: 101294.

[10] Stanovov V, Semenkin E. Success rate-based adaptive differential evolution L-SRTDE for CEC 2024 competition [C]. IEEE Congress on Evolutionary Computation, 2024: 1-8.

[11] Zhou B, Huang Y. An adaptive archive differential evolution with nonlinear population size reduction and selective pressure [J]. Information Sciences, 2024, 682: 121273.

[12] Qin A K, Suganthan P N, Huang V L. Differential evolution algorithm with strategy adaptation for global numerical optimization [J]. IEEE Transactions on Evolutionary Computation, 2009, 13(2): 398-417.

[13] Zhang J, Sanderson A C. JADE: Adaptive differential evolution with optional external archive [J]. IEEE Transactions on Evolutionary Computation, 2009, 13(5): 945-958.

[14] Lin X, Meng Z. An adaptative differential evolution with enhanced diversity and restart mechanism [J]. Expert Systems with Applications, 2024, 249: 123634.

[15] Auger A, Hansen N. A restart CMA evolution strategy with increasing population size [C]. IEEE Congress on Evolutionary Computation, 2005: 1769-1776.

[16] Price K V, Storn R M, Lampinen J A. Differential Evolution – A Practical Approach to Global Optimization [M]. Springer, 2005: 1-34.

[17] Fialho A, Ros R, Schoenauer M, Sebag M. Comparison-based adaptive strategy selection with bandits in differential evolution [C]. Parallel Problem Solving from Nature, 2010: 194-203.

[18] Matsushita H, Kinoshita W, Kurokawa H, Kousaka T. Nested-layer particle swarm optimization method for bifurcation point detection in non-autonomous systems [J]. Nonlinear Theory and Its Applications, IEICE, 2019, 10(3): 289-302.

[19] Li P, Hao J, Tang H, Fu X, Zheng Y, Tang K. Bridging evolutionary algorithms and reinforcement learning: A comprehensive survey on hybrid algorithms [J]. arXiv Preprint, 2024.

[20] Guo H, Ma S, Huang Z, Hu Z, Ma Z, Zhang X, Gong Y J. Reinforcement learning-based self-adaptive differential evolution through automated landscape feature learning [C]. Genetic and Evolutionary Computation Conference, 2025.

[21] Yu X, Xu P, Wang F, Wang X. Reinforcement learning-based differential evolution algorithm for constrained multi-objective optimization problems [J]. Engineering Applications of Artificial Intelligence, 2024, 131: 107817.

[22] Ding W, Qian M, Lu C, Yi J, Pu H, Luo J. Differential evolution with joint adaptation of mutation strategies and control parameters via distributed proximal policy optimization [J]. Tsinghua Science and Technology, 2026, 31(1): 101-124.

[23] Cao Z, Xu K, Jia H, Fu Y, Foh C H, Tian F. An autonomous differential evolution based on reinforcement learning for cooperative countermeasures of unmanned aerial vehicles [J]. Applied Soft Computing, 2025, 169: 112605.