%0 Conference Proceedings %T Proceedings of the 2011 IEEE Congress on Evolutionary Computation %D 2011 %8 jun %F toc:2011:cec %O CEC 2011 %R doi:10.1109/CEC.2011.5949582 %U http://dx.doi.org/doi:10.1109/CEC.2011.5949582 %0 Conference Proceedings %T 13th International Symposium MECHATRONIKA, 2010 %D 2010 %8 jun %F cover:2010:MECHATRONIKA %O MECHATRONIKA, 2010 %U http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=5521207 %0 Journal Article %T Genetic programming: Proceedings of the first annual conference 1996 : Edited by John R. Koza, David E. Goldberg, David B. Fogel and Rick L. Riolo. MIT Press, Cambridge, MA. (1996). 568 pages. $75.00 %J Computers & Mathematics with Applications %D 1997 %V 33 %N 5 %@ 0898-1221 %F tagkey1997126 %O tagkey1997126 %9 journal article %R doi:10.1016/S0898-1221(97)00025-4 %U http://www.sciencedirect.com/science/article/B6TYJ-3SNTGM2-D/2/23afe396341b39baf74fcd29db315b46 %U http://dx.doi.org/doi:10.1016/S0898-1221(97)00025-4 %P 126-127 %0 Journal Article %T Advances in genetic programming, volume 2 : Edited by Peter Angeline and Kenneth Kinnear, Jr. MIT Press, Cambridge, MA. (1996). 538 pages. $50.00 %J Computers & Mathematics with Applications %D 1997 %V 33 %N 5 %@ 0898-1221 %F tagkey1997129 %O tagkey1997129 %9 journal article %R doi:10.1016/S0898-1221(97)82933-1 %U http://www.sciencedirect.com/science/article/B6TYJ-3SNTGM2-T/2/4d3bcc2dda31e9aca679eba60ff95a3a %U http://dx.doi.org/doi:10.1016/S0898-1221(97)82933-1 %P 129 %0 Journal Article %T Advances in genetic programming, volume III : Edited by Lee Spector, William B. Langdon, Una-May O’Reilly and Peter J. Angeline. MIT Press, Cambridge, MA. (1999). 476 pages. $55.00 %J Computers & Mathematics with Applications %D 1999 %V 38 %N 11-12 %@ 0898-1221 %F tagkey1999291 %O tagkey1999291 %9 journal article %R doi:10.1016/S0898-1221(99)91267-1 %U http://www.sciencedirect.com/science/article/B6TYJ-48778B1-3H/2/1d6f4728f10e14a24f4f28189d15f818 %U http://dx.doi.org/doi:10.1016/S0898-1221(99)91267-1 %P 291-291 %0 Journal Article %T Genetic programming and data structures: Genetic programming + data STRUCTURES = automatic programming! : By W. B. Langdon. Kluwer Academic Publishers, Boston, MA. (1998). 278 pages. $125.00. NLG 285.00, GBP 85.00 %J Computers & Mathematics with Applications %D 1999 %V 37 %N 3 %@ 0898-1221 %F tagkey1999132 %O tagkey1999132 %9 journal article %R doi:10.1016/S0898-1221(99)90375-9 %U http://www.sciencedirect.com/science/article/B6TYJ-489YTT5-2T/2/13179f12104abafe66b36e402ef358d9 %U http://dx.doi.org/doi:10.1016/S0898-1221(99)90375-9 %P 132-132 %0 Journal Article %T Genetic programming II: Automatic discovery of reusable programs : By John R. Koza. MIT Press, Cambridge, MA. (1994). 746 pages. $45.00 %J Computers & Mathematics with Applications %D 1995 %V 29 %N 3 %@ 0898-1221 %F tagkey1995115 %O tagkey1995115 %9 journal article %R doi:10.1016/0898-1221(95)90099-3 %U http://www.sciencedirect.com/science/article/B6TYJ-48F4PJH-H/2/bd467ac24453cb0b3f9dbbf15075bedb %U http://dx.doi.org/doi:10.1016/0898-1221(95)90099-3 %P 115-115 %0 Journal Article %T Evolutionary algorithms in engineering and computer science: Recent advances in genetic algorithms, evolution strategies, evolutionary programming, genetic programming and industrial applications : Edited by K. Miettinen, P. Neittaanmaki, M. M. Makela and J. Periaux. John Wiley & Sons, Ltd., Chichester. (1999). pounds60.00 %J Computers & Mathematics with Applications %D 1999 %V 38 %N 11-12 %@ 0898-1221 %F tagkey1999282 %O tagkey1999282 %9 journal article %R doi:10.1016/S0898-1221(99)91189-6 %U http://www.sciencedirect.com/science/article/B6TYJ-48778B1-24/2/ee28594e33abf3bd7c4a9fc997b98492 %U http://dx.doi.org/doi:10.1016/S0898-1221(99)91189-6 %P 282-282 %0 Journal Article %T Automated generation of robust error recovery logic in assembly systems using genetic programming : Cem M. Baydar, Kazuhiro Saitou, v20, n1, 2001, pp55-68 %J Journal of Manufacturing Systems %D 2002 %V 21 %N 6 %@ 0278-6125 %F tagkey2002475 %O tagkey2002475 %9 journal article %R doi:10.1016/S0278-6125(02)80094-2 %U http://www.sciencedirect.com/science/article/B6VJD-4920DSC-1N/2/93bf79c7eb0d6ad94d169ed1b37ec77f %U http://dx.doi.org/doi:10.1016/S0278-6125(02)80094-2 %P 475-476 %0 Generic %T Intelligent Machines Evolutionary algorithm outperforms deep-learning machines at video games %D 2018 %8 18 jul %I MIT Technolgy Review %F 2018:MITtechreview %O MIT Technolgy Review %X Neural networks have garnered all the headlines, but a much more powerful approach is waiting in the wings. by Emerging Technology from the arXiv July 18, 2018 Summary of https://arxiv.org/pdf/1806.05695 See instead \citeWilson:2018:GECCO %K genetic algorithms, genetic programming %0 Journal Article %T Evolutionary Algorithms for Software Testing in Facebook %J SIGEVOlution %D 2018 %8 December %V 11 %N 2 %@ 1931-8499 %F Sapienz:2018:sigevolution %O SIGEVOlution %X Sapienz is an approach to Android testing that uses multi-objective evolutionary algorithms to automatically explore and optimise test sequences, minimising length, while simultaneously maximising coverage and fault revelation. It is in production now helping to improve the quality of Facebook software! %K genetic algorithms, genetic programming, SBSE, mobile computing, smart phone %9 journal article %R doi:10.1145/3264700.3264702 %U http://www.sigevolution.org/issues/SIGEVOlution1102.pdf %U http://dx.doi.org/doi:10.1145/3264700.3264702 %P 7 %0 Generic %T Store Steel 165 years %I Internal information magazine %F glasilo_1_16_ang %O Store Steel %K genetic algorithms, genetic programming %U http://www.store-steel.si/Data/InterniInformativniCasopis/glasilo_1_16_ang.pdf %0 Journal Article %T Genetic programming-based self-reconfiguration planning for metamorphic robot %A Ababsa, Tarek %A Djedl, Noureddine %A Duthen, Yves %J International Journal of Automation and Computing %D 2018 %V 15 %N 4 %F ababsa:2018:IJAC %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s11633-016-1049-4 %U http://link.springer.com/article/10.1007/s11633-016-1049-4 %U http://dx.doi.org/doi:10.1007/s11633-016-1049-4 %0 Conference Proceedings %T A SIMD Interpreter for Linear Genetic Programming %A Ababsa, Tarek %S 2022 International Symposium on iNnovative Informatics of Biskra (ISNIB) %D 2022 %8 dec %F Ababsa:2022:ISNIB %X Genetic programming (GP) has been applied as an automatic programming tool to solve various kinds of problems by genetically breeding a population of computer programs using biologically inspired operations. However, it is well known as a computationally demanding approach with a significant potential of parallelization. In this paper, we emphasize parallelizing the evaluation of genetic programs on Graphics Processing Unit (GPU). We used a compact representation for genotypes. This representation is a memory-efficient method that allows efficient evaluation of programs. Our implementation clearly distinguishes between an individual’s genotype and phenotype. Thus, the individuals are represented as linear entities (arrays of 32 bits integers) that are decoded and expressed just like nonlinear entities (trees). %K genetic algorithms, genetic programming, linear genetic programming, GPU, Graphics, Automatic programming, Sociology, Graphics processing units, Arrays, Statistics, Parallel Processing, GPGPU, symbolic regression %R doi:10.1109/ISNIB57382.2022.10075819 %U http://dx.doi.org/doi:10.1109/ISNIB57382.2022.10075819 %0 Conference Proceedings %T A Survey of Pattern Recognition Applications in Cancer Diagnosis %A Abarghouei, Amir Atapour %A Ghanizadeh, Afshin %A Sinaie, Saman %A Shamsuddin, Siti Mariyam %S International Conference of Soft Computing and Pattern Recognition, SOCPAR ’09 %D 2009 %8 dec %F Abarghouei:2009:SOCPAR %X In this paper, some of the image processing and pattern recognition methods that have been used on medical images for cancer diagnosis are reviewed. Previous studies on Artificial Neural Networks, Genetic Programming, and Wavelet Analysis are described with their working process and advantages. The definition of each method is provided in this study, and the acknowledgment is granted for previous related research activities. %K genetic algorithms, genetic programming, artificial neural networks, cancer diagnosis, image processing, medical images, pattern recognition applications, wavelet analysis, cancer, medical image processing, pattern recognition %R doi:10.1109/SoCPaR.2009.93 %U http://dx.doi.org/doi:10.1109/SoCPaR.2009.93 %P 448-453 %0 Journal Article %T Evolutionary computational intelligence algorithm coupled with self-tuning predictive model for water quality index determination %A Abba, S. I. %A Hadi, Sinan Jasim %A Sammen, Saad Sh. %A Salih, Sinan Q. %A Abdulkadir, R. A. %A Pham, Quoc Bao %A Yaseen, Zaher Mundher %J Journal of Hydrology %D 2020 %V 587 %@ 0022-1694 %F ABBA:2020:JH %X Anthropogenic activities affect the water bodies and result in a drastic reduction of river water quality (WQ). The development of a reliable intelligent model for evaluating the suitability of water remains a challenging task facing hydro-environmental engineers. The current study is investigated the applicability of Extreme Gradient Boosting (XGB) and Genetic Programming (GP) in obtaining feature importance, and then abstracted input variables were imposed into the predictive model (the Extreme Learning Machine (ELM)) for the prediction of water quality index (WQI). The stand-alone modeling schema is compared with the proposed hybrid models where the optimum variables are supplied into the GP, XGB, linear regression (LR), stepwise linear regression (SWLR) and ELM models. The WQ data is obtained from the Department of Environment (DoE) (Malaysia), and results are evaluated in terms of determination coefficient (R2) and root mean square error (RMSE). The results demonstrated that the hybrid GPELM and XGBELM models outperformed the standalone GP, XGB, and ELM models for the prediction of WQI at Kinta River basin. A comparison of the hybrid models showed that the predictive skill of GPELM (RMSE = 3.441 training and RMSE = 3.484 testing) over XGBELM improving the accuracy by decreasing the values of RMSE by 5percent and 9percent for training and testing, respectively with regards to XGBELM (RMSE = 3.606 training and RMSE = 3.816 testing). Although regressions are often proposed as reference models (LR and SWLR), when combined with computational intelligence, they still provide satisfactory results in this study. The proposed hybrid GPELM and XGBELM models have improved the prediction accuracy with minimum number of input variables and can therefore serve as reliable predictive tools for WQI at Kinta River basin %K genetic algorithms, genetic programming, Water quality index, Watershed management, Extreme Gradient Boosting, Extreme Learning Machine, Kinta River %9 journal article %R doi:10.1016/j.jhydrol.2020.124974 %U http://www.sciencedirect.com/science/article/pii/S0022169420304340 %U http://dx.doi.org/doi:10.1016/j.jhydrol.2020.124974 %P 124974 %0 Journal Article %T Multi Block based Image Watermarking in Wavelet Domain Using Genetic Programming %A Abbasi, Almas %A Seng, Woo Chaw %A Ahmad, Imran Shafiq %J The International Arab Journal of Information Technology %D 2014 %V 11 %N 6 %F journals/iajit/AbbasiSA14 %X The increased use of the Internet in sharing and distribution of digital data makes it is very difficult to maintain copyright and ownership of data. Digital watermarking offers a method for authentication and copyright protection. We propose a blind, still image, Genetic Programming (GP) based robust watermark scheme for copyright protection. In this scheme, pseudorandom sequence of real number is used as watermark. It is embedded into perceptually significant blocks of vertical and horizontal sub-band in wavelet domain to achieve robustness. GP is used to structure the watermark for improved imperceptibility by considering the Human Visual System (HVS) characteristics such as luminance sensitivity and self and neighbourhood contrast masking. We also present a GP function which determines the optimal watermark strength for selected coefficients irrespective of the block size. Watermark detection is performed using correlation. Our experiments show that in proposed scheme the watermark resists image processing attack, noise attack, geometric attack and cascading attack. We compare our proposed technique with other two genetic perceptual model based techniques. Comparison results show that our multiblock based technique is approximately 5percent, and 23percent more robust, then the other two compared techniques. %K genetic algorithms, genetic programming, Robust watermark, wavelet domain, digital watermarking, HVS %9 journal article %U http://ccis2k.org/iajit/?option=com_content&task=blogcategory&id=94&Itemid=364 %P 582-589 %0 Conference Proceedings %T Automated Behavior-based Malice Scoring of Ransomware Using Genetic Programming %A Abbasi, Muhammad Shabbir %A Al-Sahaf, Harith %A Welch, Ian %S IEEE Symposium Series on Computational Intelligence, SSCI 2021 %D 2021 %8 dec 5 7 %I IEEE %C Orlando, FL, USA %F DBLP:conf/ssci/AbbasiAW21 %X Malice or severity scoring models are a technique for detection of maliciousness. A few ransom-ware detection studies use malice scoring models for detection of ransomware-like behaviour. These models rely on the weighted sum of some manually chosen features and their weights by a domain expert. To automate the modelling of malice scoring for ransomware detection, we propose a method based on Genetic Programming (GP) that automatically evolves a behavior-based malice scoring model by selecting appropriate features and functions from the input feature and operator sets. The experimental results show that the best-evolved model correctly assigned a malice score, below the threshold value to over 85percent of the unseen goodware instances, and over the threshold value to more than 99percent of the unseen ransomware instances. %K genetic algorithms, genetic programming Symbolic regression, ransomware, malice scoring %R doi:10.1109/SSCI50451.2021.9660009 %U https://doi.org/10.1109/SSCI50451.2021.9660009 %U http://dx.doi.org/doi:10.1109/SSCI50451.2021.9660009 %P 1-8 %0 Journal Article %T Estimation of hydraulic jump on corrugated bed using artificial neural networks and genetic programming %A Abbaspour, Akram %A Farsadizadeh, Davood %A Ghorbani, Mohammad Ali %J Water Science and Engineering %D 2013 %V 6 %N 2 %@ 1674-2370 %F Abbaspour:2013:WSE %X Artificial neural networks (ANNs) and genetic programming (GP) have recently been used for the estimation of hydraulic data. In this study, they were used as alternative tools to estimate the characteristics of hydraulic jumps, such as the free surface location and energy dissipation. The dimensionless hydraulic parameters, including jump depth, jump length, and energy dissipation, were determined as functions of the Froude number and the height and length of corrugations. The estimations of the ANN and GP models were found to be in good agreement with the measured data. The results of the ANN model were compared with those of the GP model, showing that the proposed ANN models are much more accurate than the GP models. %K genetic algorithms, genetic programming, artificial neural networks, corrugated bed, Froude number, hydraulic jump %9 journal article %R doi:10.3882/j.issn.1674-2370.2013.02.007 %U http://www.sciencedirect.com/science/article/pii/S1674237015302362 %U http://dx.doi.org/doi:10.3882/j.issn.1674-2370.2013.02.007 %P 189-198 %0 Conference Proceedings %T AntTAG: A New Method to Compose Computer Programs Using Colonies of Ants %A Abbass, H. %A Hoai, N. X. %A McKay, R. I. (Bob) %S Proceedings, 2002 World Congress on Computational Intelligence %D 2002 %V 2 %I IEEE Press %F Abbass:2002:WCCI %X Genetic Programming (GP) plays the primary role for the discovery of programs through evolving the program’s set of parse trees. In this paper, we present a new technique for constructing programs through Ant Colony Optimisation (ACO) using the tree adjunct grammar (TAG) formalism. We call the method AntTAG and we show that the results are very promising. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2002.1004490 %U http://sc.snu.ac.kr/PAPERS/TAGACOcec02.pdf %U http://dx.doi.org/doi:10.1109/CEC.2002.1004490 %P 1654-1666 %0 Conference Proceedings %T Scout Algorithms and Genetic Algorithms: A Comparative Study %A Abbattista, Fabio %A Carofiglio, Valeria %A Koppen, Mario %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F abbattista:1999:SAGAACS %K genetic algorithms and classifier systems, poster papers %U http://gpbib.cs.ucl.ac.uk/gecco1999/GA-803.pdf %P 769 %0 Conference Proceedings %T Evolutionary Computing for Metals Properties Modelling %A Abbod, Maysam F. %A Mahfouf, M. %A Linkens, D. A. %A Sellars, C. M. %S THERMEC 2006 %S Materials Science Forum %D 2006 %8 jul 4 8 %V 539 %I Trans Tech Publications %C Vancouver %G en %F abbod2007 %X During the last decade Genetic Programming (GP) has emerged as an efficient methodology for teaching computers how to program themselves. This paper presents research work which uses GP for developing mathematical equations for the response surfaces that have been generated using hybrid modelling techniques for predicting the properties of materials under hot deformation. Collected data from the literature and experimental work on aluminium are used as the initial training data for the GP to develop the mathematical models under different deformation conditions and compositions. %K genetic algorithms, genetic programming, strain, alloy materials, modeling, material property, stress %R doi:10.4028/www.scientific.net/MSF.539-543.2449 %U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.1011.6271 %U http://dx.doi.org/doi:10.4028/www.scientific.net/MSF.539-543.2449 %P 2449-2454 %0 Conference Proceedings %T A GP Approach for Precision Farming %A Abbona, Francesca %A Vanneschi, Leonardo %A Bona, Marco %A Giacobini, Mario %Y Jin, Yaochu %S 2020 IEEE Congress on Evolutionary Computation, CEC 2020 %D 2020 %8 19 24 jul %I IEEE Press %C internet %F Abbona:2020:CEC %X Livestock is increasingly treated not just as food containers, but as animals that can be susceptible to stress and diseases, affecting, therefore, the production of offspring and the performance of the farm. The breeder needs a simple and useful tool to make the best decisions for his farm, as well as being able to objectively check whether the choices and investments made have improved or worsened its performance. The amount of data is huge but often dispersive: it is therefore essential to provide the farmer with a clear and comprehensible solution, that represents an additional investment. This research proposes a genetic programming approach to predict the yearly number of weaned calves per cow of a farm, namely the measure of its performance. To investigate the efficiency of genetic programming in such a problem, a dataset composed by observations on representative Piedmontese breedings was used. The results show that the algorithm is appropriate, and can perform an implicit feature selection, highlighting important variables and leading to simple and interpretable models. %K genetic algorithms, genetic programming, Cows, Precision Livestock Farming, PLF, Cattle Breeding, Piedmontese Bovines %R doi:10.1109/CEC48606.2020.9185637 %U http://dx.doi.org/doi:10.1109/CEC48606.2020.9185637 %P paperid24248 %0 Journal Article %T Towards modelling beef cattle management with Genetic Programming %A Abbona, Francesca %A Vanneschi, Leonardo %A Bona, Marco %A Giacobini, Mario %J Livestock Science %D 2020 %V 241 %@ 1871-1413 %F ABBONA:2020:LS %X Among the Italian Piemontese Beef Breedings, the yearly production of calves weaned per cow, that is the calves that survive during the period of 60 days following birth, is identified as the main target expressing the performance of a farm. modeling farm dynamics in order to predict the value of this parameter is a possible solution to investigate and highlight breeding strengths, and to find alternatives to penalizing factors. The identification of such variables is a complex but solvable task, since the amount of recorded data among livestock is nowadays huge and manageable through Machine Learning techniques. Besides, the evaluation of the effectiveness of the type of management allows the breeder to consolidate the ongoing processes or, on the contrary, to adopt new management strategies. To solve this problem, we propose a Genetic Programming approach, a white-box technique suitable for big data management, and with an intrinsic ability to select important variables, providing simple models. The most frequent variables encapsulated in the models built by Genetic Programming are highlighted, and their zoological significance is investigated a posteriori, evaluating the performance of the prediction models. Moreover, two of the final expressions selected only three variables among the 48 given in input, one of which is the best performing among GP models. The expressions were then analyzed in order to propose a zootechnical interpretation of the equations. Comparisons with other common techniques, including also black-box methods, are performed, in order to evaluate the performance of different type of methods in terms of accuracy and generalization ability. The approach entailed constructive and helpful considerations to the addressed task, confirming its key-role in the zootechnical field, especially in the beef breeding management %K genetic algorithms, genetic programming, Precision livestock farming, Evolutionary algorithms, Machine learning, Cattle breeding, Piemontese bovines %9 journal article %R doi:10.1016/j.livsci.2020.104205 %U https://iris.unito.it/retrieve/e27ce430-63b3-2581-e053-d805fe0acbaa/Abbona2020_LS_OA.pdf %U http://dx.doi.org/doi:10.1016/j.livsci.2020.104205 %P 104205 %0 Journal Article %T Towards a Vectorial Approach to Predict Beef Farm Performance %A Abbona, Francesca %A Vanneschi, Leonardo %A Giacobini, Mario %J Applied Sciences %D 2022 %V 12 %N 3 %@ 2076-3417 %F abbona:2022:AS %X Accurate livestock management can be achieved by means of predictive models. Critical factors affecting the welfare of intensive beef cattle husbandry systems can be difficult to be detected, and Machine Learning appears as a promising approach to investigate the hundreds of variables and temporal patterns lying in the data. In this article, we explore the use of Genetic Programming (GP) to build a predictive model for the performance of Piemontese beef cattle farms. In particular, we investigate the use of vectorial GP, a recently developed variant of GP, that is particularly suitable to manage data in a vectorial form. The experiments conducted on the data from 2014 to 2018 confirm that vectorial GP can outperform not only the standard version of GP but also a number of state-of-the-art Machine Learning methods, such as k-Nearest Neighbors, Generalized Linear Models, feed-forward Neural Networks, and long- and short-term memory Recurrent Neural Networks, both in terms of accuracy and generalizability. Moreover, the intrinsic ability of GP in performing an automatic feature selection, while generating interpretable predictive models, allows highlighting the main elements influencing the breeding performance. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/app12031137 %U https://www.mdpi.com/2076-3417/12/3/1137 %U http://dx.doi.org/doi:10.3390/app12031137 %0 Conference Proceedings %T Niches as a GA divide-and-conquer strategy %A Abbott, R. J. %Y Chapman, Art %Y Myers, Leonard %S Proceedings of the Second Annual AI Symposium for the California State University %D 1991 %I California State University %F aicsu-91:abbot %K genetic algorithms, genetic programming %P 133-136 %0 Conference Proceedings %T Object-Oriented Genetic Programming, An Initial Implementation %A Abbott, Russell J. %S Proceedings of the Sixth International Conference on Computational Intelligence and Natural Computing %D 2003 %8 sep 26 30 %C Embassy Suites Hotel and Conference Center, Cary, North Carolina USA %F abbott:2003:OOGP %X This paper describes oogp, an object-oriented genetic programming system. Oogp provides traditional genetic programming capabilities in an object-oriented framework. Among the advantages of object-oriented genetic programming are: (a) strong typing, (b) availability of existing class libraries for inclusion in generated programs, and (c) straightforward extensibility to include features such as iteration as object-oriented methods. Oogp is written in Java and makes extensive use of Java’s reflection capabilities. Oogp includes a relatively straightforward but apparently innovative simplification capability. %K genetic algorithms, genetic programming, object-oriented, STGP %U http://abbott.calstatela.edu/PapersAndTalks/OOGP.pdf %0 Conference Proceedings %T Guided Genetic Programming %A Abbott, Russ %A Guo, Jiang %A Parviz, Behzad %S The 2003 International Conference on Machine Learning; Models, Technologies and Applications (MLMTA’03) %D 2003 %8 23 26 jun %I CSREA Press %C las Vegas %F abbott:2003:MLMTA %X We argue that genetic programming has not made good on its promise to generate computer programs automatically. It then describes an approach that would allow that promise to be fulfilled by running a genetic programming engine under human guidance. %K genetic algorithms, genetic programming, guided genetic programming %U http://abbott.calstatela.edu/PapersAndTalks/Guided%20Genetic%20Programming.pdf %0 Conference Proceedings %T Genetic Programming Reconsidered %A Abbott, Russ %A Parviz, Behzad %A Sun, Chengyu %Y Arabnia, Hamid R. %Y Mun, Youngsong %S Proceedings of the International Conference on Artificial Intelligence, IC-AI ’04, Volume 2 & Proceedings of the International Conference on Machine Learning; Models, Technologies & Applications, MLMTA ’04 %D 2004 %8 jun 21 24 %V 2 %I CSREA Press %C Las Vegas, Nevada, USA %@ 1-932415-32-7 %F DBLP:conf/icai/AbbottPS04 %X Even though the Genetic Programming (GP) mechanism is capable of evolving any computable function, the means through which it does so is inherently flawed: the user must provide the GP engine with an evolutionary pathway toward a solution. Hence Genetic Programming is problematic as a mechanism for generating creative solutions to specific problems. %K genetic algorithms, genetic programming, evolutionary pathway, fitness function, teleological evolution, adaptive evolution %U http://abbott.calstatela.edu/PapersAndTalks/GeneticProgrammingReconsidered.pdf %P 1113-1116 %0 Journal Article %T (AI) in Infrastructure Projects-Gap Study %A Abdel-Kader, Mohamed Y. %A Ebid, Ahmed M. %A Onyelowe, Kennedy C. %A Mahdi, Ibrahim M. %A Abdel-Rasheed, Ibrahim %J Infrastructures %D 2022 %V 7 %N 10 %@ 2412-3811 %F abdel-kader:2022:Infrastructures %X Infrastructure projects are usually complicated, expensive, long-term mega projects; accordingly, they are the type of projects that most need optimisation in the design, construction and operation stages. A great deal of earlier research was carried out to optimise the performance of infrastructure projects using traditional management techniques. Recently, artificial intelligence (AI) techniques were implemented in infrastructure projects to improve their performance and efficiency due to their ability to deal with fuzzy, incomplete, inaccurate and distorted data. The aim of this research is to collect, classify, analyse and review all of the available previous research related to implementing AI techniques in infrastructure projects to figure out the gaps in the previous studies and the recent trends in this research area. A total of 159 studies were collected since the beginning of the 1990s until the end of 2021. This database was classified based on publishing date, infrastructure subject and the used AI technique. The results of this study show that implementing AI techniques in infrastructure projects is rapidly increasing. They also indicate that transportation is the first and the most AI-using project and that both artificial neural networks (ANN) and particle swarm optimisation (PSO) are the most implemented techniques in infrastructure projects. Finally, the study presented some opportunities for farther research, especially in natural gas projects. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/infrastructures7100137 %U https://www.mdpi.com/2412-3811/7/10/137 %U http://dx.doi.org/doi:10.3390/infrastructures7100137 %P ArticleNo.137 %0 Journal Article %T Interpretable soft computing predictions of elastic shear buckling in tapered steel plate girders %A AbdelAleem, Basem H. %A Ismail, Mohamed K. %A Haggag, May %A El-Dakhakhni, Wael %A Hassan, Assem A. A. %J Thin-Walled Structures %D 2022 %V 176 %@ 0263-8231 %F ABDELALEEM:2022:tws %X The complexity of the shear buckling in tapered plate girders has motivated researchers to conduct experimental and numerical investigations to understand the underlying mechanisms controlling such phenomenon, and subsequently develop related design-oriented expressions. However, existing predictive models have been developed and validated using limited datasets and/or traditional regression techniques-restricting both the model utility, when considering a wider range of design parameters, and the model generalizability, due to associated uncertainties. To address these issues, the present study employed a powerful soft computing technique-multi-gene genetic programming (MGGP), to develop design expressions to predict the elastic shear buckling strength of tapered end plate girder web panels. A dataset of 427 experimental and experimentally validated numerical results was used in training, validating, and testing the developed MGGP models. Guided by mechanics and findings from previous studies, the key parameters controlling the strength were identified, and MGGP were employed to reveal the interdependence between such parameters and subsequently develop interpretable predictive models. The prediction accuracy of the developed models was evaluated against that of other existing models using various statistical measures. Several filter and embedded variable importance techniques were used to rank the model input parameters according to their significance in predicting the elastic shear buckling strength. These techniques include the variable importance random forest and the relative influence gradient boosting techniques. Moreover, partial dependence plots were employed to explore the effect of the input variables on the strength. The results obtained from this study demonstrated the robustness of the developed MGGP expression for predicting the elastic shear buckling strength of tapered plate girder end web panel. The developed model also exhibited a superior prediction accuracy and generalizability compared to currently existing ones. Furthermore, the developed partial dependence plots facilitated interpreting the influence of all input variables on the predicted elastic shear buckling strength %K genetic algorithms, genetic programming, Data-driven models, Elastic shear buckling strength, Multi-gene genetic programming, Variable importance, Partial dependence plots, Tapered end web panel %9 journal article %R doi:10.1016/j.tws.2022.109313 %U https://www.sciencedirect.com/science/article/pii/S026382312200235X %U http://dx.doi.org/doi:10.1016/j.tws.2022.109313 %P 109313 %0 Conference Proceedings %T Gene Expression Programming Algorithm for Transient Security Classification %A Abdelaziz, Almoataz Y. %A Mekhamer, S. F. %A Khattab, H. M. %A Badr, M. L. A. %A Panigrahi, Bijaya Ketan %Y Panigrahi, Bijaya Ketan %Y Das, Swagatam %Y Suganthan, Ponnuthurai Nagaratnam %Y Nanda, Pradipta Kumar %S Proceedings of the Third International Conference on Swarm, Evolutionary, and Memetic Computing, SEMCCO 2012 %S Lecture Notes in Computer Science %D 2012 %8 dec 20 22 %V 7677 %I Springer %C Bhubaneswar, India %F Abdelaziz:2012:SEMCCO %X In this paper, a gene expression programming (GEP) based algorithm is implemented for power system transient security classification. The GEP algorithms as evolutionary algorithms for pattern classification have recently received attention for classification problems because they can perform global searches. The proposed methodology applies the GEP for the first time in transient security assessment and classification problems of power systems. The proposed algorithm is examined using different IEEE standard test systems. Power system three phase short circuit contingency has been used to test the proposed algorithm. The algorithm checks the static security status of the power system then classifies the transient security of the power system as secure or not secure. Performance of the algorithm is compared with other neural network based classification algorithms to show its superiority for transient security classification. %K genetic algorithms, genetic programming, gene expression programming %R doi:10.1007/978-3-642-35380-2_48 %U http://works.bepress.com/almoataz_abdelaziz/42 %U http://dx.doi.org/doi:10.1007/978-3-642-35380-2_48 %P 406-416 %0 Journal Article %T Applying Machine Learning Techniques for Classifying Cyclin-Dependent Kinase Inhibitors %A Abdelbaky, Ibrahim Z. %A Al-Sadek, Ahmed F. %A Badr, Amr A. %J International Journal of Advanced Computer Science and Applications %D 2018 %V 9 %N 11 %I The Science and Information (SAI) Organization %G eng %F Abdelbaky:2018:IJACSA %X The importance of protein kinases made them a target for many drug design studies. They play an essential role in cell cycle development and many other biological processes. Kinases are divided into different subfamilies according to the type and mode of their enzymatic activity. Computational studies targeting kinase inhibitors identification is widely considered for modelling kinase-inhibitor. This modelling is expected to help in solving the selectivity problem arising from the high similarity between kinases and their binding profiles. In this study, we explore the ability of two machine-learning techniques in classifying compounds as inhibitors or non-inhibitors for two members of the cyclin-dependent kinases as a subfamily of protein kinases. Random forest and genetic programming were used to classify CDK5 and CDK2 kinases inhibitors. This classification is based on calculated values of chemical descriptors. In addition, the response of the classifiers to adding prior information about compounds promiscuity was investigated. The results from each classifier for the datasets were analysed by calculating different accuracy measures and metrics. Confusion matrices, accuracy, ROC curves, AUC values, F1 scores, and Matthews correlation, were obtained for the outputs. The analysis of these accuracy measures showed a better performance for the RF classifier in most of the cases. In addition, the results show that promiscuity information improves the classification accuracy, but its significant effect was notably clear with GP classifiers. %K genetic algorithms, genetic programming, cdk inhibitors, random forest classification %9 journal article %R doi:10.14569/IJACSA.2018.091132 %U http://thesai.org/Downloads/Volume9No11/Paper_32-Applying_Machine_Learning_Techniques.pdf %U http://dx.doi.org/doi:10.14569/IJACSA.2018.091132 %P 229-235 %0 Conference Proceedings %T Applying Co-Evolutionary Particle Swam Optimization to the Egyptian Board Game Seega %A Abdelbar, Ashraf M. %A Ragab, Sherif %A Mitri, Sara %Y Cho, Sung-Bae %Y Hoai, Nguyen Xuan %Y Shan, Yin %S Proceedings of The First Asian-Pacific Workshop on Genetic Programming %D 2003 %8 August %C Rydges (lakeside) Hotel, Canberra, Australia %@ 0-9751724-0-9 %F Abdelbar:aspgp03 %X Seega is an ancient Egyptian two-phase board game that, in certain aspects, is more difficult than chess. The two-player game is played on either a 5 x 5, 7 x 7, or 9 x 9 board. In the first and more difficult phase of the game, players take turns placing one disk each on the board until the board contains only one empty cell. In the second phase players take turns moving disks of their colour; a disk that becomes surrounded by disks of the opposite color is captured and removed from the board. We have developed a Seega program that employs co-evolutionary particle swarm optimisation in the generation of feature evaluation scores. Two separate swarms are used to evolve White players and Black players, respectively; each particle represents feature weights for use in the position evaluation. Experimental results are presented and the performance of the full game engine is discussed. %K Particle Swarm Optimisation, Co-evolution, Game %U http://infoscience.epfl.ch/record/90539/ %P 9-15 %0 Conference Proceedings %T A Genetic Programming Ensemble Method for Learning Dynamical System Models %A Abdelbari, Hassan %A Shafi, Kamran %S Proceedings of the 8th International Conference on Computer Modeling and Simulation %D 2017 %I ACM %C Canberra, Australia %F Abdelbari:2017:ICCMS %X Modelling complex dynamical systems plays a crucial role to understand several phenomena in different domains such as physics, engineering, biology and social sciences. In this paper, a genetic programming ensemble method is proposed to learn complex dynamical systems underlying mathematical models, represented as differential equations, from system time series observations. The proposed method relies on decomposing the modelling space based on given variable dependencies. An ensemble of learners is then applied in this decomposed space and their output is combined to generate the final model. Two examples of complex dynamical systems are used to test the performance of the proposed methodology where the standard genetic programming method has struggled to find matching model equations. The empirical results show the effectiveness of the proposed methodology in learning closely matching structure of almost all system equations. %K genetic algorithms, genetic programming, complex dynamical systems, modelling and simulation, symbolic regression %R doi:10.1145/3036331.3036336 %U http://doi.acm.org/10.1145/3036331.3036336 %U http://dx.doi.org/doi:10.1145/3036331.3036336 %P 47-51 %0 Journal Article %T A System Dynamics Modeling Support System Based on Computational Intelligence %A Abdelbari, Hassan %A Shafi, Kamran %J Systems %D 2019 %V 7 %N 4 %@ 2079-8954 %F abdelbari:2019:Systems %X System dynamics (SD) is a complex systems modelling and simulation approach with wide ranging applications in various science and engineering disciplines. While subject matter experts lead most of the model building, recent advances have attempted to bring system dynamics closer to fast growing fields such as data sciences. This may prove promising for the development of novel support methods that augment human cognition and improve efficiencies in the model building process. A few different directions have been explored recently to support individual modelling stages, such as the generation of model structure, model calibration and policy optimisation. However, an integrated approach that supports across the board modelling process is still missing. In this paper, a prototype integrated modelling support system is presented for the purpose of supporting the modellers at each stage of the process. The proposed support system facilitates data-driven inferring of causal loop diagrams (CLDs), stock-flow diagrams (SFDs), model equations and the estimation of model parameters using computational intelligence (CI) techniques. The ultimate goal of the proposed system is to support the construction of complex models, where the human power is not enough. With this goal in mind, we demonstrate the working and utility of the proposed support system. We have used two well-known synthetic reality case studies with small models from the system dynamics literature, in order to verify the support system performance. The experimental results showed the effectiveness of the proposed support system to infer close model structures to target models directly from system time-series observations. Future work will focus on improving the support system so that it can generate complex models on a large scale. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/systems7040047 %U https://www.mdpi.com/2079-8954/7/4/47 %U http://dx.doi.org/doi:10.3390/systems7040047 %0 Journal Article %T Selecting the Best Forecasting-Implied Volatility Model Using Genetic Programming %A Abdelmalek, Wafa %A Ben Hamida, Sana %A Abid, Fathi %J Journal of Applied Mathematics and Decision Sciences %D 2009 %I Hindawi Publishing Corporation %@ 11739126 %G eng %F Abdelmalek:2009:JAMDS %X The volatility is a crucial variable in option pricing and hedging strategies. The aim of this paper is to provide some initial evidence of the empirical relevance of genetic programming to volatility’s forecasting. By using real data from S&P500 index options, the genetic programming’s ability to forecast Black and Scholes-implied volatility is compared between time series samples and moneyness-time to maturity classes. Total and out-of-sample mean squared errors are used as forecasting’s performance measures. Comparisons reveal that the time series model seems to be more accurate in forecasting-implied volatility than moneyness time to maturity models. Overall, results are strongly encouraging and suggest that the genetic programming approach works well in solving financial problems. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1155/2009/179230 %U http://downloads.hindawi.com/journals/ads/2009/179230.pdf %U http://dx.doi.org/doi:10.1155/2009/179230 %0 Journal Article %T Automatic modulation classification based on high order cumulants and hierarchical polynomial classifiers %A Abdelmutalab, Ameen %A Assaleh, Khaled %A El-Tarhuni, Mohamed %J Physical Communication %D 2016 %V 21 %@ 1874-4907 %F Abdelmutalab:2016:PC %X In this paper, a Hierarchical Polynomial (HP) classifier is proposed to automatically classify M-PSK and M-QAM signals in Additive White Gaussian Noise (AWGN) and slow flat fading environments. The system uses higher order cumulants (HOCs) of the received signal to distinguish between the different modulation types. The proposed system divides the overall modulation classification problem into several hierarchical binary sub-classifications. In each binary sub-classification, the HOCs are expanded into a higher dimensional space in which the two classes are linearly separable. It is shown that there is a significant improvement when using the proposed Hierarchical polynomial structure compared to the conventional polynomial classifier. Moreover, simulation results are shown for different block lengths (number of received symbols) and at different SNR values. The proposed system showed an overall improvement in the probability of correct classification that reaches 100percent using only 512 received symbols at 20 dB compared to 98percent and 98.33percent when using more complicated systems like Genetic Programming with KNN classifier (GP-KNN) and Support Vector Machines (SVM) classifiers, respectively. %K genetic algorithms, genetic programming, Modulation classification, Hierarchical polynomial classifiers, High order cumulants, Adaptive modulation %9 journal article %R doi:10.1016/j.phycom.2016.08.001 %U http://www.sciencedirect.com/science/article/pii/S1874490716301094 %U http://dx.doi.org/doi:10.1016/j.phycom.2016.08.001 %P 10-18 %0 Conference Proceedings %T Tackling Dead End Scenarios by Improving Follow Gap Method with Genetic Programming %A Abdelwhab, Mohamed %A Abouelsoud, A. A. %A Elbab, Ahmed M. R. Fath %S 2018 57th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE) %D 2018 %8 sep %C Nara, Japan %F Abdelwhab:2018:SICE %X In this paper the problem of local minimum in obstacle avoidance is solved using improved follow gap method (FGM) through combination with genetic programming (GP). Two stages of controller are proposed and applied on Robotino mobile robot equipped with nine infra-red sensors. The first stage implements FGM when there is a gap between front obstacles whereas the second stage deals with the case of no front gap through the use of GP. Simulation and experimental work prove the effectiveness of the proposed method. %K genetic algorithms, genetic programming %R doi:10.23919/SICE.2018.8492687 %U http://dx.doi.org/doi:10.23919/SICE.2018.8492687 %P 1566-1571 %0 Thesis %T Artificial Intelligence System for Continuous Affect Estimation from Naturalistic Human Expressions %A Abd Gaus, Yona Falinie %D 2018 %8 jan %C London, UK %C Brunel University %F AbdGaus:thesis %X The analysis and automatic affect estimation system from human expression has been acknowledged as an active research topic in computer vision community. Most reported affect recognition systems, however, only consider subjects performing well-defined acted expression, in a very controlled condition, so they are not robust enough for real-life recognition tasks with subject variation, acoustic surrounding and illumination change. In this thesis, an artificial intelligence system is proposed to continuously (represented along a continuum e.g., from -1 to +1) estimate affect behaviour in terms of latent dimensions (e.g., arousal and valence) from naturalistic human expressions. To tackle the issues, feature representation and machine learning strategies are addressed. In feature representation, human expression is represented by modalities such as audio, video, physiological signal and text modality. Hand- crafted features is extracted from each modality per frame, in order to match with consecutive affect label. However, the features extracted maybe missing information due to several factors such as background noise or lighting condition. Haar Wavelet Transform is employed to determine if noise cancellation mechanism in feature space should be considered in the design of affect estimation system. Other than hand-crafted features, deep learning features are also analysed in terms of the layer-wise; convolutional and fully connected layer. Convolutional Neural Network such as AlexNet, VGGFace and ResNet has been selected as deep learning architecture to do feature extraction on top of facial expression images. Then, multimodal fusion scheme is applied by fusing deep learning feature and hand-crafted feature together to improve the performance. In machine learning strategies, two-stage regression approach is introduced. In the first stage, baseline regression methods such as Support Vector Regression are applied to estimate each affect per time. Then in the second stage, subsequent model such as Time Delay Neural Network, Long Short-Term Memory and Kalman Filter is proposed to model the temporal relationships between consecutive estimation of each affect. In doing so, the temporal information employed by a subsequent model is not biased by high variability present in consecutive frame and at the same time, it allows the network to exploit the slow changing dynamic between emotional dynamic more efficiently. Following of two-stage regression approach for unimodal affect analysis, fusion information from different modalities is elaborated. Continuous emotion recognition in-the-wild is leveraged by investigating mathematical modelling for each emotion dimension. Linear Regression, Exponent Weighted Decision Fusion and Multi-Gene Genetic Programming are implemented to quantify the relationship between each modality. In summary, the research work presented in this thesis reveals a fundamental approach to automatically estimate affect value continuously from naturalistic human expression. The proposed system, which consists of feature smoothing, deep learning feature, two-stage regression framework and fusion using mathematical equation between modalities is demonstrated. It offers strong basis towards the development artificial intelligent system on estimation continuous affect estimation, and more broadly towards building a real-time emotion recognition system for human-computer interaction. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://bura.brunel.ac.uk/handle/2438/16348 %0 Conference Proceedings %T Linear and Non-Linear Multimodal Fusion for Continuous Affect Estimation In-the-Wild %A Gaus, Yona Falinie A. %A Meng, Hongying %S 2018 13th IEEE International Conference on Automatic Face Gesture Recognition (FG 2018) %D 2018 %8 may %F AbdGaus:2018:ieeeFG %X Automatic continuous affect recognition from multiple modality in the wild is arguably one of the most challenging research areas in affective computing. In addressing this regression problem, the advantages of the each modality, such as audio, video and text, have been frequently explored but in an isolated way. Little attention has been paid so far to quantify the relationship within these modalities. Motivated to leverage the individual advantages of each modality, this study investigates behavioural modelling of continuous affect estimation, in multimodal fusion approaches, using Linear Regression, Exponent Weighted Decision Fusion and Multi-Gene Genetic Programming. The capabilities of each fusion approach are illustrated by applying it to the formulation of affect estimation generated from multiple modality using classical Support Vector Regression. The proposed fusion methods were applied in the public Sentiment Analysis in the Wild (SEWA) multi-modal dataset and the experimental results indicate that employing proper fusion can deliver a significant performance improvement for all affect estimation. The results further show that the proposed systems is competitive or outperform the other state-of-the-art approaches. %K genetic algorithms, genetic programming %R doi:10.1109/FG.2018.00079 %U http://dx.doi.org/doi:10.1109/FG.2018.00079 %P 492-498 %0 Conference Proceedings %T Fast convergence strategy for Particle Swarm Optimization using spread factor %A Latiff, I. Abd %A Tokhi, M. O. %S Evolutionary Computation, 2009. CEC ’09. IEEE Congress on %D 2009 %8 may %F 4983280 %K PSO velocity equation, fast convergence strategy, inertia weight, particle swarm optimization, spread factor, convergence, particle swarm optimisation %R doi:10.1109/CEC.2009.4983280 %U http://dx.doi.org/doi:10.1109/CEC.2009.4983280 %P 2693-2700 %0 Journal Article %T Nonlinear mathematical modeling of seed spacing uniformity of a pneumatic planter using genetic programming and image processing %A Abdolahzare, Zahra %A Mehdizadeh, Saman Abdanan %J Neural Computing and Applications %D 2018 %V 29 %N 2 %F journals/nca/AbdolahzareM18 %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s00521-016-2450-1 %U http://dx.doi.org/doi:10.1007/s00521-016-2450-1 %P 363-375 %0 Journal Article %T Genetic programming for credit scoring: The case of Egyptian public sector banks %A Abdou, Hussein A. %J Expert Systems with Applications %D 2009 %V 36 %N 9 %@ 0957-4174 %F Abdou200911402 %X Credit scoring has been widely investigated in the area of finance, in general, and banking sectors, in particular. Recently, genetic programming (GP) has attracted attention in both academic and empirical fields, especially for credit problems. The primary aim of this paper is to investigate the ability of GP, which was proposed as an extension of genetic algorithms and was inspired by the Darwinian evolution theory, in the analysis of credit scoring models in Egyptian public sector banks. The secondary aim is to compare GP with probit analysis (PA), a successful alternative to logistic regression, and weight of evidence (WOE) measure, the later a neglected technique in published research. Two evaluation criteria are used in this paper, namely, average correct classification (ACC) rate criterion and estimated misclassification cost (EMC) criterion with different misclassification cost (MC) ratios, in order to evaluate the capabilities of the credit scoring models. Results so far revealed that GP has the highest ACC rate and the lowest EMC. However, surprisingly, there is a clear rule for the WOE measure under EMC with higher MC ratios. In addition, an analysis of the dataset using Kohonen maps is undertaken to provide additional visual insights into cluster groupings. %K genetic algorithms, genetic programming, Credit scoring, Weight of evidence, Egyptian public sector banks %9 journal article %R doi:10.1016/j.eswa.2009.01.076 %U http://www.sciencedirect.com/science/article/B6V03-4VJSRWK-1/2/a3b8516f289c76c474c6a1eb9d26d7ec %U http://dx.doi.org/doi:10.1016/j.eswa.2009.01.076 %P 11402-11417 %0 Thesis %T Credit Scoring Models for Egyptian Banks: Neural Nets and Genetic Programming versus Conventional Techniques %A Abdou, Hussein Ali Hussein %D 2009 %8 apr %C UK %C Plymouth Business School, University of Plymouth %F 2009AbdouEthosPhD %X Credit scoring has been regarded as a core appraisal tool of banks during the last few decades, and has been widely investigated in the area of finance, in general, and banking sectors, in particular. In this thesis, the main aims and objectives are: to identify the currently used techniques in the Egyptian banking credit evaluation process; and to build credit scoring models to evaluate personal bank loans. In addition, the subsidiary aims are to evaluate the impact of sample proportion selection on the Predictive capability of both advanced scoring techniques and conventional scoring techniques, for both public banks and a private banking case-study; and to determine the key characteristics that affect the personal loans’ quality (default risk). The stages of the research comprised: firstly, an investigative phase, including an early pilot study, structured interviews and a questionnaire; and secondly, an evaluative phase, including an analysis of two different data-sets from the Egyptian private and public banks applying average correct classification rates and estimated misclassification costs as criteria. Both advanced scoring techniques, namely, neural nets (probabilistic neural nets and multi-layer feed-forward nets) and genetic programming, and conventional techniques, namely, a weight of evidence measure, multiple discriminant analysis, probit analysis and logistic regression were used to evaluate credit default risk in Egyptian banks. In addition, an analysis of the data-sets using Kohonen maps was undertaken to provide additional visual insights into cluster groupings. From the investigative stage, it was found that all public and the vast majority of private banks in Egypt are using judgemental approaches in their credit evaluation. From the evaluative stage, clear distinctions between the conventional techniques and the advanced techniques were found for the private banking case-study; and the advanced scoring techniques (such as powerful neural nets and genetic programming) were superior to the conventional techniques for the public sector banks. Concurrent loans from other banks and guarantees by the corporate employer of the loan applicant, which have not been used in other reported studies, are identified as key variables and recommended in the specific environment chosen, namely Egypt. Other variables, such as a feasibility study and the Central Bank of Egypt report also play a contributory role in affecting the loan quality. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U https://pearl.plymouth.ac.uk/bitstream/handle/10026.1/379/2009AbdouEthosPhD.pdf %0 Conference Proceedings %T Genetic programming for evolving programs with loop structures for classification tasks %A Abdulhamid, Fahmi %A Neshatian, Kourosh %A Zhang, Mengjie %S 5th International Conference on Automation, Robotics and Applications (ICARA 2011) %D 2011 %8 June 8 dec %C Wellington, New Zealand %F Abdulhamid:2011:ICARA %X Object recognition and classification are important tasks in robotics. Genetic Programming (GP) is a powerful technique that has been successfully used to automatically generate (evolve) classifiers. The effectiveness of GP is limited by the expressiveness of the functions used to evolve programs. It is believed that loop structures can considerably improve the quality of GP programs in terms of both performance and interpretability. This paper proposes five new loop structures using which GP can evolve compact programs that can perform sophisticated processing. The use of loop structures in GP is evaluated against GP with no loops for both image and non-image classification tasks. Evolved programs using the proposed loop structures are analysed in several problems. The results show that loop structures can increase classification accuracy compared to GP with no loops. %K genetic algorithms, genetic programming, evolving program, image classification task, nonimage classification task, object classification task, object recognition task, program loop structure, robotics, image classification, learning (artificial intelligence), object recognition, robot vision %R doi:10.1109/ICARA.2011.6144882 %U http://dx.doi.org/doi:10.1109/ICARA.2011.6144882 %P 202-207 %0 Conference Proceedings %T Evolving Genetic Programming Classifiers with Loop Structures %A Abdulhamid, Fahmi %A Song, Andy %A Neshatian, Kourosh %A Zhang, Mengjie %Y Li, Xiaodong %S Proceedings of the 2012 IEEE Congress on Evolutionary Computation %D 2012 %8 October 15 jun %C Brisbane, Australia %@ 0-7803-8515-2 %F Abdulhamid:2012:CEC %X Loop structure is a fundamental flow control in programming languages for repeating certain operations. It is not widely used in Genetic Programming as it introduces extra complexity in the search. However in some circumstances, including a loop structure may enable GP to find better solutions. This study investigates the benefits of loop structures in evolving GP classifiers. Three different loop representations are proposed and compared with other GP methods and a set of traditional classification methods. The results suggest that the proposed loop structures can outperform other methods. Additionally the evolved classifiers can be small and simple to interpret. Further analysis on a few classifiers shows that they indeed have captured genuine characteristics from the data for performing classification. %K genetic algorithms, genetic programming, Conflict of Interest Papers, Classification, clustering, data analysis and data mining %R doi:10.1109/CEC.2012.6252877 %U http://dx.doi.org/doi:10.1109/CEC.2012.6252877 %P 2710-2717 %0 Journal Article %T The PARSEC machine: a non-Newtonian supra-linear super-computer %A Abdulkarimova, Ulviya %A Ouskova Leonteva, Anna %A Rolando, Christian %A Jeannin-Girardon, Anne %A Collet, Pierre %J Azerbaijan Journal of High Performance Computing %D 2019 %8 dec %V 2 %N 2 %F abdulkarimova:2019:ajhpc %X transfer-learning can turn a Beowulf cluster into a full super-computer with supra-linear qualitative acceleration. Harmonic Analysis is used as a real-world example to show the kind of result that can be achieved with the proposed super-computer architecture, that locally exploits absolute space-time parallelism on each machine (SIMD parallelism) and loosely-coupled relative space-time parallelisation between different machines (loosely coupled MIMD) %K genetic algorithms, genetic programming, beowulf cluster, relative space-time, supra-linear acceleration, qualitative acceleration, GPGPU, loosely coupled machines, artificial evolution, transfer learning, harmonic analysis, super-resolution,non-uniform sampling, fourier transform. %9 journal article %R doi:10.32010/26166127.2019.2.2.122.140 %U https://publis.icube.unistra.fr/docs/14472/easeaHPC.pdf %U http://dx.doi.org/doi:10.32010/26166127.2019.2.2.122.140 %P 122-140 %0 Thesis %T SINUS-IT: an evolutionary approach to harmonic analysis %A Abdulkarimova, Ulviya %D 2021 %8 February %C France %C Universite de Strasbourg %F abdulkarimova:tel-03700035 %X This PhD project is about harmonic analysis of signals coming from Fourier Transform Ion Cyclotron Resonance (FT-ICR) mass spectrometer. The analysis of these signals is usually done using Fourier Transform (FT) method. However, there are several limitations of this method, one of which is not being able to find the phase parameter. Mass spectrometers are used to determine the chemical composition of compounds. It is known that if the phase component is known, it would yield an improvement in mass accuracy and mass resolving power which would help to determine the composition of a given compound more accurately. In this PhD work we use evolutionary algorithm to overcome the limitations of the FT method. We explore different sampling, speed optimization and algorithm improvement methods. We show that our proposed method outperforms the FT method as it uses short transients to resolve the peaks and it automatically yields phase values. %K genetic algorithms, genetic programming, EASEA, NVIDA, CUDA, Artificial evolution, Evolution strategies, QAES, Fourier transform, FFT, Harmonic analysis, FT-ICR, Isotopic structure, GPU, GPGPU parallelisation, Island-based parallelization, Glutathione, binary radians, Brad2rad, Rad2brad, global random sampling, GRS %9 Ph.D. thesis %U https://theses.hal.science/tel-03700035/ %0 Thesis %T Android Malware Detection System using Genetic Programming %A Abdullah, Norliza Binti %D 2019 %8 mar %C UK %C Computer Science, University of York %F Abdullah:thesis %X Nowadays, smartphones and other mobile devices are playing a significant role in the way people engage in entertainment, communicate, network, work, and bank and shop online. As the number of mobile phones sold has increased dramatically worldwide, so have the security risks faced by the users, to a degree most do not realise. One of the risks is the threat from mobile malware. In this research, we investigate how supervised learning with evolutionary computation can be used to synthesise a system to detect Android mobile phone attacks. The attacks include malware, ransomware and mobile botnets. The datasets used in this research are publicly downloadable, available for use with appropriate acknowledgement. The primary source is Drebin. We also used ransomware and mobile botnet datasets from other Android mobile phone researchers. The research in this thesis uses Genetic Programming (GP) to evolve programs to distinguish malicious and non-malicious applications in Android mobile datasets. It also demonstrates the use of GP and Multi-Objective Evolutionary Algorithms (MOEAs) together to explore functional (detection rate) and non-functional (execution time and power consumption) trade-offs. Our results show that malicious and non-malicious applications can be distinguished effectively using only the permissions held by applications recorded in the application’s Android Package (APK). Such a minimalist source of features can serve as the basis for highly efficient Android malware detection. Non-functional tradeoffs are also highlight. %K genetic algorithms, genetic programming, Supervised Learning, Multi-objective Genetic Algorithm, SPEA2, MOGP, Android Malware %9 Ph.D. thesis %U https://etheses.whiterose.ac.uk/29027/ %0 Conference Proceedings %T An Empirical Comparison of Code Size Limit in Auto-Constructive Artificial Life %A Abdul rahim, A. B. %A Teo, J. %A Saudi, A. %S 2006 IEEE Conference on Cybernetics and Intelligent Systems %D 2006 %8 jun %I IEEE %C Bangkok %@ 1-4244-0023-6 %F Abdul-Rahim:2006:ccis %X This paper presents an evolving swarm system of flying agents simulated as a collective intelligence within the Breve auto-constructive artificial life environment. The behaviour of each agent is governed by genetically evolved program codes expressed in the Push programming language. There are two objectives in this paper, that is to investigate the effects of firstly code size limit and secondly two different versions of the Push genetic programming language on the auto-constructive evolution of artificial life. We investigated these genetic programming code elements on reproductive competence using a measure based on the self-sustainability of the population. Self-sustainability is the point in time when the current population’s agents are able to reproduce enough offspring to maintain the minimum population size without any new agents being randomly injected from the system. From the results, we found that the Push2 implementation showed slightly better evolvability than Push3 in terms of achieving self-sufficiency. In terms of code size limit, the reproductive competence of the collective swarm was affected quite significantly at certain parameter settings %K genetic algorithms, genetic programming, Push, Breve, ALife, PushGP %R doi:10.1109/ICCIS.2006.252308 %U http://dx.doi.org/doi:10.1109/ICCIS.2006.252308 %P 1-6 %0 Journal Article %T Classification of Retina Diseases from OCT using Genetic Programming %A Abdulrahman, Hadeel %A Khatib, Mohamed %J International Journal of Computer Applications %D 2020 %8 mar %V 177 %N 45 %I Foundation of Computer Science (FCS), NY, USA %C New York, USA %@ 0975-8887 %F Abdulrahman:2020:IJCA %X a fully automated method for feature extraction and classification of retina diseases is implemented. The main idea is to find a method that can extract the important features from the Optical Coherence Tomography (OCT) image, and acquire a higher classification accuracy. The using of genetic programming (GP) can achieve that aim. Genetic programming is a good way to choose the best combination of feature extraction methods from a set of feature extraction methods and determine the proper parameters for each one of the selected extraction methods. 800 OCT images are used in the proposed method, of the most three popular retinal diseases: Choroidal neovascularization (CNV), Diabetic Macular Edema (DME) and Drusen, beside the normal OCT images. While the set of the feature extraction methods that is used in this paper contains: Gabor filter, Local Binary Pattern (LBP), Gray Level Co-occurrence Matrix (GLCM), histogram of the image, and Speed Up Robust Filter (SURF). These methods are used for the both of global and local feature extraction. After that the classification process is achieved by the Support Vector Machine (SVM). The proposed method performed high accuracy as compared with the traditional methods. %K genetic algorithms, genetic programming, feature extraction, Optical Coherence Tomography, OCT image classification, OCT feature extraction %9 journal article %R doi:10.5120/ijca2020919973 %U https://www.ijcaonline.org/archives/volume177/number45/abdulrahman-2020-ijca-919973.pdf %U http://dx.doi.org/doi:10.5120/ijca2020919973 %P 41-46 %0 Conference Proceedings %T Genetic programming hyper-heuristic for solving dynamic production scheduling problem %A Abednego, Luciana %A Hendratmo, Dwi %S International Conference on Electrical Engineering and Informatics (ICEEI 2011) %D 2011 %8 17 19 jul %C Bandung, Indonesia %F Abednego:2011:ICEEI %X This paper investigates the potential use of genetic programming hyper-heuristics for solution of the real single machine production problem. This approach operates on a search space of heuristics rather than directly on a search space of solutions. Genetic programming hyper-heuristics generate new heuristics from a set of potential heuristic components. Real data from production department of a metal industries are used in the experiments. Experimental results show genetic programming hyper-heuristics outperforms other heuristics including MRT, SPT, LPT, EDD, LDD, dan MON rules with respect to minimum tardiness and minimum flow time objectives. Further results on sensitivity to changes indicate that GPHH designs are robust. Based on experiments, GPHH outperforms six other benchmark heuristics with number of generations 50 and number of populations 50. Human designed heuristics are result of years of work by a number of experts, while GPHH automate the design of the heuristics. As the search process is automated, this would largely reduce the cost of having to create a new set of heuristics. %K genetic algorithms, genetic programming, cost reduction, dynamic production scheduling problem, genetic programming hyper heuristics, metal industries, minimum flow time, minimum tardiness, single machine production problem, cost reduction, dynamic scheduling, heuristic programming, lead time reduction, metallurgical industries, single machine scheduling %R doi:10.1109/ICEEI.2011.6021768 %U http://dx.doi.org/doi:10.1109/ICEEI.2011.6021768 %P K3-2 %0 Book Section %T Using a Genetic Algorithm to Select Beam Configurations for Radiosurgery of the Brain %A Abernathy, Neil %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2000 %D 2000 %8 jun %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F abernathy:2000:UGASBCRB %K genetic algorithms %P 1-7 %0 Journal Article %T Comparing Predictability of Genetic Programming and ANFIS on Drilling Performance Modeling for GFRP Composites %A Abhishek, Kumar %A Panda, Biranchi Narayan %A Datta, Saurav %A Mahapatra, Siba Sankar %J Procedia Materials Science %D 2014 %V 6 %@ 2211-8128 %F Abhishek:2014:PMS %O 3rd International Conference on Materials Processing and Characterisation (ICMPC 2014) %X Drilling of glass fibre reinforced polymer (GFRP) composite material is substantially complicated from the metallic materials due to its high structural stiffness (of the composite) and low thermal conductivity of plastics. During drilling of GFRP composites, problems generally arise like fibre pull out, delamination, stress concentration, swelling, burr, splintering and micro cracking etc. which reduces overall machining performance. Now-a-days hybrid approaches have been received remarkable attention in order to model machining process behaviour and to optimise machining performance towards subsequent improvement of both quality and productivity, simultaneously. In the present research, spindle speed, feed rate, plate thickness and drill bit diameter have been considered as input parameters; and the machining yield characteristics have been considered in terms of thrust and surface roughness (output responses) of the drilled composite product. The study illustrates the applicability of genetic programming with the help of GPTIPS as well as Adaptive Neuro Fuzzy Inference System (ANFIS) towards generating prediction models for better understanding of the process behavior and for improving process performances in drilling of GFRP composites. %K genetic algorithms, genetic programming, Glass fibre reinforced polymer (GFRP), Adaptive Neuro Fuzzy Inference System (ANFIS), GPTIPS. %9 journal article %R doi:10.1016/j.mspro.2014.07.069 %U http://www.sciencedirect.com/science/article/pii/S2211812814004349 %U http://dx.doi.org/doi:10.1016/j.mspro.2014.07.069 %P 544-550 %0 Book Section %T Dynamic Hedging Using Generated Genetic Programming Implied Volatility Models %A Abid, Fathi %A Abdelmalek, Wafa %A Ben Hamida, Sana %E Ventura, Sebastian %B Genetic Programming - New Approaches and Successful Applications %D 2012 %I InTech %F Abid:2012:GPnew %K genetic algorithms, genetic programming %R doi:10.5772/48148 %U http://dx.doi.org/doi:10.5772/48148 %P 141-172 %0 Journal Article %T Estimating the subgrade reaction at deep braced excavation bed in dry granular soil using genetic programming (GP) %A Aboelela, Abdelrahman E. %A Ebid, Ahmed M. %A Fayed, Ayman L. %J Results in Engineering %D 2022 %V 13 %@ 2590-1230 %F ABOELELA:2022:RE %X Modulus of subgrade reaction (Ks) is a simplified and approximated approach to present the soil-structure interaction. It is widely used in designing combined and raft foundations due to its simplicity. (Ks) is not a soil propriety, its value depends on many factors including soil properties, shape, dimensions and stiffness of footing and even time (for saturated cohesive soils). Many earlier formulas were developed to estimate the (Ks) value. This research is concerned in studying the effect of de-stressing and shoring rigidity of deep excavation on the (Ks) value. A parametric study was carried out using 27 FEM models with different configurations to generate a database, then a well-known ’Genetic Programming’ technique was applied on the database to develop a formula to correlate the (Ks) value with the deep excavation configurations. The results indicated that (Ks) value increased with increasing the diaphragm wall stiffness and decreases with increasing the excavation depth %K genetic algorithms, genetic programming, Deep braced excavation, Modulus of subgrade reaction %9 journal article %R doi:10.1016/j.rineng.2021.100328 %U https://www.sciencedirect.com/science/article/pii/S2590123021001298 %U http://dx.doi.org/doi:10.1016/j.rineng.2021.100328 %P 100328 %0 Journal Article %T Estimation of dynamic viscosity of natural gas based on genetic programming methodology %A Abooali, Danial %A Khamehchi, Ehsan %J Journal of Natural Gas Science and Engineering %D 2014 %V 21 %@ 1875-5100 %F Abooali:2014:JNGSE %X Investigating the behaviour of natural gas can contribute to a detailed understanding of hydrocarbon reservoirs. Natural gas, alone or in association with oil in reservoirs, has a large impact on reservoir fluid properties. Thus, having knowledge about gas characteristics seems to be necessary for use in estimation and prediction purposes. In this project, dynamic viscosity of natural gas (mu_g), as an important quantity, was correlated with pseudo-reduced temperature (Tpr), pseudo-reduced pressure (Ppr), apparent molecular weight (Ma) and gas density (rhog) by operation of the genetic programming method on a large dataset including 1938 samples. The squared correlation coefficient (R2), average absolute relative deviation percent (AARDpercent) and average absolute error (AAE) are 0.999, 2.55percent and 0.00084 cp, respectively. The final results show that the obtained simple-to-use model can predict viscosity of natural gases with high accuracy and confidence. %K genetic algorithms, genetic programming, Natural gas, Dynamic viscosity, Correlation %9 journal article %R doi:10.1016/j.jngse.2014.11.006 %U http://www.sciencedirect.com/science/article/pii/S1875510014003394 %U http://dx.doi.org/doi:10.1016/j.jngse.2014.11.006 %P 1025-1031 %0 Journal Article %T A new empirical model for estimation of crude oil/brine interfacial tension using genetic programming approach %A Abooali, Danial %A Sobati, Mohammad Amin %A Shahhosseini, Shahrokh %A Assareh, Mehdi %J Journal of Petroleum Science and Engineering %D 2019 %V 173 %@ 0920-4105 %F ABOOALI:2019:JPSE %X Detailed understanding of the behavior of crude oils and their interactions with reservoir formations and other in-situ fluids can help the engineers to make better decisions about the future of oil reservoirs. As an important property, interfacial tension (IFT) between crude oil and brine has great impacts on the oil production efficiency in different recovery stages due to its effects on the capillary number and residual oil saturation. In the present work, a new mathematical model has been developed to estimate IFT between crude oil and brine on the basis of a number of physical properties of crude oil (i.e., specific gravity, and total acid number) and the brine (i.e., pH, NaCl equivalent salinity), temperature, and pressure. Genetic programming (GP) methodology has been implemented on a data set including 560 experimental data to develop the IFT correlation. The correlation coefficient (R2a =a 0.9745), root mean square deviation (RMSDa =a 1.8606a mN/m), and average absolute relative deviation (AARDa =a 3.3932percent) confirm the acceptable accuracy of the developed correlation for the prediction of IFT %K genetic algorithms, genetic programming, Interfacial tension, Correlation, Crude oil, Brine, Genetic programming (GP) %9 journal article %R doi:10.1016/j.petrol.2018.09.073 %U http://www.sciencedirect.com/science/article/pii/S0920410518308283 %U http://dx.doi.org/doi:10.1016/j.petrol.2018.09.073 %P 187-196 %0 Journal Article %T Characterization of physico-chemical properties of biodiesel components using smart data mining approaches %A Abooali, Danial %A Soleimani, Reza %A Gholamreza-Ravi, Saeed %J Fuel %D 2020 %V 266 %@ 0016-2361 %F ABOOALI:2020:Fuel %X Biodiesels are the most probable future alternatives for petroleum fuels due to their easy accessibility and extraction, comfortable transportation and storage and lower environmental pollutions. Biodiesels have wide range of molecular structures including various long chain fatty acid methyl esters (FAMEs) and fatty acid ethyl esters (FAEEs) with different thermos-physical properties. Therefore, reliable methods estimating the ester properties seems necessary to choose the appropriate one for a special diesel engine. In the present study, the effort was developing a set of novel and robust methods for estimation of four important properties of common long chain fatty acid methyl and ethyl esters including density, speed of sound, isentropic and isothermal compressibility, directly from a number of basic effective variables (i.e. temperature, pressure, molecular weight and normal melting point). Stochastic gradient boosting (SGB) and genetic programming (GP) as innovative and powerful mathematical approaches in this area were applied and implemented on large datasets including 2117, 1048, 483 and 310 samples for density, speed of sound, isentropic and isothermal compressibility, respectively. Statistical assessments revealed high applicability and accuracy of the new developed models (R2 > 0.99 and AARD < 1.7percent) and the SGB models yield more accurate and confident predictions %K genetic algorithms, genetic programming, Fatty acid ester, Density, Speed of sound, Isentropic and isothermal compressibility, Stochastic gradient boosting %9 journal article %R doi:10.1016/j.fuel.2020.117075 %U http://www.sciencedirect.com/science/article/pii/S0016236120300703 %U http://dx.doi.org/doi:10.1016/j.fuel.2020.117075 %P 117075 %0 Journal Article %T New predictive method for estimation of natural gas hydrate formation temperature using genetic programming %A Abooali, Danial %A Khamehchi, Ehsan %J Neural Comput. Appl. %D 2019 %V 31 %N 7 %F DBLP:journals/nca/AbooaliK19 %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s00521-017-3208-0 %U https://doi.org/10.1007/s00521-017-3208-0 %U http://dx.doi.org/doi:10.1007/s00521-017-3208-0 %P 2485-2494 %0 Conference Proceedings %T Web Usage Mining Using Artificial Ant Colony Clustering and Genetic Programming %A Abraham, Ajith %A Ramos, Vitorino %Y Sarker, Ruhul %Y Reynolds, Robert %Y Abbass, Hussein %Y Tan, Kay Chen %Y McKay, Bob %Y Essam, Daryl %Y Gedeon, Tom %S Proceedings of the 2003 Congress on Evolutionary Computation CEC2003 %D 2003 %8 August 12 dec %I IEEE Press %C Canberra %@ 0-7803-7804-0 %F abraham:2003:CEC %X The rapid e-commerce growth has made both business community and customers face a new situation. Due to intense competition on one hand and the customer’s option to choose from several alternatives business community has realized the necessity of intelligent marketing strategies and relationship management. Web usage mining attempts to discover useful knowledge from the secondary data obtained from the interactions of the users with the Web. Web usage mining has become very critical for effective Web site management, creating adaptive Web sites, business and support services, personalization, network traffic flow analysis and so on. The study of ant colonies behavior and their self-organizing capabilities is of interest to knowledge retrieval/management and decision support systems sciences, because it provides models of distributed adaptive organization, which are useful to solve difficult optimization, classification, and distributed control problems, among others. In this paper, we propose an ant clustering algorithm to discover Web usage patterns (data clusters) and a linear genetic programming approach to analyze the visitor trends. Empirical results clearly shows that ant colony clustering performs well when compared to a self-organizing map (for clustering Web usage patterns) even though the performance accuracy is not that efficient when comparared to evolutionary-fuzzy clustering (i-miner) approach. %K genetic algorithms, genetic programming, Web Usage Mining, Ant Systems, Stigmergy, Data-Mining, Linear Genetic Programming, Adaptive control, Ant colony optimization, Artificial intelligence, Communication system traffic control, Decision support systems, Knowledge management, Marketing management, Programmable control, Traffic control, Internet, artificial life, data mining, decision support systems, electronic commerce, self-organising feature maps, statistical analysis, Web site management, Web usage mining, artificial ant colony clustering algorithm, decision support systems, distributed adaptive organisation, distributed control problems, e-commerce, intelligent marketing strategies, knowledge discovery, knowledge retrieval, network traffic flow analysis, self-organizing map %R doi:10.1109/CEC.2003.1299832 %U http://alfa.ist.utl.pt/~cvrm/staff/vramos/Vramos-CEC03b.pdf %U http://dx.doi.org/doi:10.1109/CEC.2003.1299832 %P 1384-1391 %0 Report %T Soft Computing Models for Network Intrusion Detection Systems %A Abraham, Ajith %A Jain, Ravi %D 2004 %8 13 may 2004 %I OSU %F abraham:2004:0405046 %O Journal-ref: Soft Computing in Knowledge Discovery: Methods and Applications, Saman Halgamuge and Lipo Wang (Eds.), Studies in Fuzziness and Soft Computing, Springer Verlag Germany, Chapter 16, 20 pages, 2004 %X Security of computers and the networks that connect them is increasingly becoming of great significance. Computer security is defined as the protection of computing systems against threats to confidentiality, integrity, and availability. There are two types of intruders: external intruders, who are unauthorised users of the machines they attack, and internal intruders, who have permission to access the system with some restrictions. This chapter presents a soft computing approach to detect intrusions in a network. Among the several soft computing paradigms, we investigated fuzzy rule-based classifiers, decision trees, support vector machines, linear genetic programming and an ensemble method to model fast and efficient intrusion detection systems. Empirical results clearly show that soft computing approach could play a major role for intrusion detection. %K genetic algorithms, genetic programming, Cryptography and Security %U http://www.softcomputing.net/saman2.pdf %0 Journal Article %T Business Intelligence from Web Usage Mining %A Abraham, Ajith %J Journal of Information & Knowledge Management %D 2003 %V 2 %N 4 %F Abraham:2003:JIKM %X The rapid e-commerce growth has made both business community and customers face a new situation. Due to intense competition on the one hand and the customer’s option to choose from several alternatives, the business community has realized the necessity of intelligent marketing strategies and relationship management. Web usage mining attempts to discover useful knowledge from the secondary data obtained from the interactions of the users with the Web. Web usage mining has become very critical for effective Web site management, creating adaptive Web sites, business and support services, personalization, network traffic flow analysis and so on. This paper presents the important concepts of Web usage mining and its various practical applications. Further a novel approach called ’intelligent-miner’ (i-Miner) is presented. i-Miner could optimize the concurrent architecture of a fuzzy clustering algorithm (to discover web data clusters) and a fuzzy inference system to analyze the Web site visitor trends. A hybrid evolutionary fuzzy clustering algorithm is proposed to optimally segregate similar user interests. The clustered data is then used to analyze the trends using a Takagi-Sugeno fuzzy inference system learned using a combination of evolutionary algorithm and neural network learning. Proposed approach is compared with self-organizing maps (to discover patterns) and several function approximation techniques like neural networks, linear genetic programming and Takagi?Sugeno fuzzy inference system (to analyze the clusters). The results are graphically illustrated and the practical significance is discussed in detail. Empirical results clearly show that the proposed Web usage-mining framework is efficient. %K genetic algorithms, genetic programming, Web mining, knowledge discovery, business intelligence, hybrid soft computing, neuro-fuzzy-genetic system %9 journal article %R doi:10.1142/S0219649203000565 %U http://www.softcomputing.net/jikm.pdf %U http://dx.doi.org/doi:10.1142/S0219649203000565 %P 375-390 %0 Generic %T Business Intelligence from Web Usage Mining %A Abraham, Ajith %D 2004 %8 may 06 %F oai:arXiv.org:cs/0405030 %X The rapid e-commerce growth has made both business community and customers face a new situation. Due to intense competition on one hand and the customer’s option to choose from several alternatives business community has realized the necessity of intelligent marketing strategies and relationship management. Web usage mining attempts to discover useful knowledge from the secondary data obtained from the interactions of the users with the Web. Web usage mining has become very critical for effective Web site management, creating adaptive Web sites, business and support services, personalization, network traffic flow analysis and so on. In this paper, we present the important concepts of Web usage mining and its various practical applications. We further present a novel approach ’intelligent-miner’ (i-Miner) to optimize the concurrent architecture of a fuzzy clustering algorithm (to discover web data clusters) and a fuzzy inference system to analyze the Web site visitor trends. A hybrid evolutionary fuzzy clustering algorithm is proposed in this paper to optimally segregate similar user interests. The clustered data is then used to analyze the trends using a Takagi-Sugeno fuzzy inference system learned using a combination of evolutionary algorithm and neural network learning. Proposed approach is compared with self-organizing maps (to discover patterns) and several function approximation techniques like neural networks, linear genetic programming and Takagi-Sugeno fuzzy inference system (to analyze the clusters). The results are graphically illustrated and the practical significance is discussed in detail. Empirical results clearly show that the proposed Web usage-mining framework is efficient. %K genetic algorithms, genetic programming %U http://arxiv.org/abs/cs/0405030 %0 Book Section %T Evolutionary Computation in Intelligent Network Management %A Abraham, Ajith %E Ghosh, Ashish %E Jain, Lakhmi C. %B Evolutionary Computing in Data Mining %S Studies in Fuzziness and Soft Computing %D 2004 %V 163 %I Springer %@ 3-540-22370-3 %F abraham:2004:ECDM %X Data mining is an iterative and interactive process concerned with discovering patterns, associations and periodicity in real world data. This chapter presents two real world applications where evolutionary computation has been used to solve network management problems. First, we investigate the suitability of linear genetic programming (LGP) technique to model fast and efficient intrusion detection systems, while comparing its performance with artificial neural networks and classification and regression trees. Second, we use evolutionary algorithms for a Web usage-mining problem. Web usage mining attempts to discover useful knowledge from the secondary data obtained from the interactions of the users with the Web. Evolutionary algorithm is used to optimise the concurrent architecture of a fuzzy clustering algorithm (to discover data clusters) and a fuzzy inference system to analyse the trends. Empirical results clearly shows that evolutionary algorithm could play a major rule for the problems considered and hence an important data mining tool. %K genetic algorithms, genetic programming, Linear Genetic Programming, LGP, intrusion detection, ANN, www, fuzzy clustering, fuzzy inference, computer security, RIPPER, demes (ring topology), steady state 32-bit FPU machine code GP, SVM, decision trees, i-miner %U http://www.softcomputing.net/ec_web-chapter.pdf %P 189-210 %0 Book Section %T Evolutionary Computation: from Genetic Algorithms to Genetic Programming %A Abraham, Ajith %A Nedjah, Nadia %A de Macedo Mourelle, Luiza %E Nedjah, Nadia %E Abraham, Ajith %E de Macedo Mourelle, Luiza %B Genetic Systems Programming: Theory and Experiences %S Studies in Computational Intelligence %D 2006 %V 13 %I Springer %C Germany %@ 3-540-29849-5 %F intro:2006:GSP %X Evolutionary computation, offers practical advantages to the researcher facing difficult optimisation problems. These advantages are multi-fold, including the simplicity of the approach, its robust response to changing circumstance, its flexibility, and many other facets. The evolutionary approach can be applied to problems where heuristic solutions are not available or generally lead to unsatisfactory results. As a result, evolutionary computation have received increased interest, particularly with regards to the manner in which they may be applied for practical problem solving. we review the development of the field of evolutionary computations from standard genetic algorithms to genetic programming, passing by evolution strategies and evolutionary programming. For each of these orientations, we identify the main differences from the others. We also, describe the most popular variants of genetic programming. These include linear genetic programming (LGP), gene expression programming (GEP), multi-expression programming (MEP), Cartesian genetic programming (CGP), traceless genetic programming (TGP), gramatical evolution (GE) and genetic algorithm for deriving software (GADS). %K genetic algorithms, genetic programming, cartesian genetic programming %R doi:10.1007/3-540-32498-4_1 %U http://www.softcomputing.net/gpsystems.pdf %U http://dx.doi.org/doi:10.1007/3-540-32498-4_1 %P 1-20 %0 Book Section %T Evolving Intrusion Detection Systems %A Abraham, Ajith %A Grosan, Crina %E Nedjah, Nadia %E Abraham, Ajith %E de Macedo Mourelle, Luiza %B Genetic Systems Programming: Theory and Experiences %S Studies in Computational Intelligence %D 2006 %V 13 %I Springer %C Germany %@ 3-540-29849-5 %F abraham:2006:GSP %X An Intrusion Detection System (IDS) is a program that analyses what happens or has happened during an execution and tries to find indications that the computer has been misused. An IDS does not eliminate the use of preventive mechanism but it works as the last defensive mechanism in securing the system. We evaluate the performances of two Genetic Programming techniques for IDS namely Linear Genetic Programming (LGP) and Multi-Expression Programming (MEP). Results are then compared with some machine learning techniques like Support Vector Machines (SVM) and Decision Trees (DT). Empirical results clearly show that GP techniques could play an important role in designing real time intrusion detection systems. %K genetic algorithms, genetic programming %R doi:10.1007/3-540-32498-4_3 %U http://falklands.globat.com/~softcomputing.net/ids-chapter.pdf %U http://dx.doi.org/doi:10.1007/3-540-32498-4_3 %P 57-79 %0 Conference Proceedings %T Genetic Programming Approach for Fault Modeling of Electronic Hardware %A Abraham, Ajith %A Grosan, Crina %Y Corne, David %Y Michalewicz, Zbigniew %Y Dorigo, Marco %Y Eiben, Gusz %Y Fogel, David %Y Fonseca, Carlos %Y Greenwood, Garrison %Y Chen, Tan Kay %Y Raidl, Guenther %Y Zalzala, Ali %Y Lucas, Simon %Y Paechter, Ben %Y Willies, Jennifier %Y Guervos, Juan J. Merelo %Y Eberbach, Eugene %Y McKay, Bob %Y Channon, Alastair %Y Tiwari, Ashutosh %Y Volkert, L. Gwenn %Y Ashlock, Dan %Y Schoenauer, Marc %S Proceedings of the 2005 IEEE Congress on Evolutionary Computation %D 2005 %8 February 5 sep %V 2 %I IEEE Press %C Edinburgh, UK %@ 0-7803-9363-5 %F abraham:2005:CEC %X presents two variants of Genetic Programming (GP) approaches for intelligent online performance monitoring of electronic circuits and systems. Reliability modelling of electronic circuits can be best performed by the stressor - susceptibility interaction model. A circuit or a system is deemed to be failed once the stressor has exceeded the susceptibility limits. For on-line prediction, validated stressor vectors may be obtained by direct measurements or sensors, which after preprocessing and standardisation are fed into the GP models. Empirical results are compared with artificial neural networks trained using backpropagation algorithm. The performance of the proposed method is evaluated by comparing the experiment results with the actual failure model values. The developed model reveals that GP could play an important role for future fault monitoring systems. %K genetic algorithms, genetic programming, MEP, ANN, LGP %R doi:10.1109/CEC.2005.1554875 %U http://www.softcomputing.net/cec05.pdf %U http://dx.doi.org/doi:10.1109/CEC.2005.1554875 %P 1563-1569 %0 Journal Article %T Decision Support Systems Using Ensemble Genetic Programming %A Abraham, Ajith %A Grosan, Crina %J Journal of Information & Knowledge Management (JIKM) %D 2006 %8 dec %V 5 %N 4 %@ 0219-6492 %F journals/jikm/AbrahamG06 %O Special topic: Knowledge Discovery Using Advanced Computational Intelligence Tools %X This paper proposes a decision support system for tactical air combat environment using a combination of unsupervised learning for clustering the data and an ensemble of three well-known genetic programming techniques to classify the different decision regions accurately. The genetic programming techniques used are: Linear Genetic programming (LGP), Multi-Expression Programming (MEP) and Gene Expression Programming (GEP). The clustered data are used as the inputs to the genetic programming algorithms. Some simulation results demonstrating the difference of these techniques are also performed. Test results reveal that the proposed ensemble method performed better than the individual GP approaches and that the method is efficient. %K genetic algorithms, genetic programming, gene expression programming, Decision support systems, ensemble systems, evolutionary multi-objective optimisation %9 journal article %R doi:10.1142/S0219649206001566 %U http://dx.doi.org/doi:10.1142/S0219649206001566 %P 303-313 %0 Journal Article %T D-SCIDS: Distributed soft computing intrusion detection system %A Abraham, Ajith %A Jain, Ravi %A Thomas, Johnson %A Han, Sang Yong %J Journal of Network and Computer Applications %D 2007 %8 jan %V 30 %N 1 %F Abraham:2007:JNCS %X An Intrusion Detection System (IDS) is a program that analyses what happens or has happened during an execution and tries to find indications that the computer has been misused. A Distributed IDS (DIDS) consists of several IDS over a large network (s), all of which communicate with each other, or with a central server that facilitates advanced network monitoring. In a distributed environment, DIDS are implemented using co-operative intelligent agents distributed across the network(s). This paper evaluates three fuzzy rule-based classifiers to detect intrusions in a network. Results are then compared with other machine learning techniques like decision trees, support vector machines and linear genetic programming. Further, we modelled Distributed Soft Computing-based IDS (D-SCIDS) as a combination of different classifiers to model lightweight and more accurate (heavy weight) IDS. Empirical results clearly show that soft computing approach could play a major role for intrusion detection. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1016/j.jnca.2005.06.001 %U http://dx.doi.org/doi:10.1016/j.jnca.2005.06.001 %P 81-98 %0 Conference Proceedings %T Real time intrusion prediction, detection and prevention programs %A Abraham, Ajith %S IEEE International Conference on Intelligence and Security Informatics, ISI 2008 %D 2008 %8 jun %F Abraham:2008:ieeeISI %O IEEE ISI 2008 Invited Talk (VI) %X An intrusion detection program (IDP) analyzes what happens or has happened during an execution and tries to find indications that the computer has been misused. In this talk, we present some of the challenges in designing efficient intrusion detection systems (IDS) using nature inspired computation techniques, which could provide high accuracy, low false alarm rate and reduced number of features. Then we present some recent research results of developing distributed intrusion detection systems using genetic programming techniques. Further, we illustrate how intruder behavior could be captured using hidden Markov model and predict possible serious intrusions. Finally we illustrate the role of online risk assessment for intrusion prevention systems and some associated results. %K genetic algorithms, genetic programming, distributed intrusion detection systems, hidden Markov model, intrusion detection program, online risk assessment, real time intrusion detection, real time intrusion prediction, real time intrusion prevention, hidden Markov models, risk management, security of data %R doi:10.1109/ISI.2008.4565018 %U http://dx.doi.org/doi:10.1109/ISI.2008.4565018 %P xli-xlii %0 Conference Proceedings %T Programming Risk Assessment Models for Online Security Evaluation Systems %A Abraham, Ajith %A Grosan, Crina %A Snasel, Vaclav %S 11th International Conference on Computer Modelling and Simulation, UKSIM ’09 %D 2009 %8 25 27 mar %F Abraham:2009:UKSIM %X Risk assessment is often done by human experts, because there is no exact and mathematical solution to the problem.Usually the human reasoning and perception process cannot be expressed precisely. This paper propose a genetic programming approach for risk assessment. Preliminary results indicate that genetic programming methods are robust and suitable for this problem when compared to other risk assessment models. %K genetic algorithms, genetic programming, genetic programming methods, human reasoning, online security evaluation systems, perception process, programming risk assessment models, risk management, security of data %R doi:10.1109/UKSIM.2009.75 %U http://dx.doi.org/doi:10.1109/UKSIM.2009.75 %P 41-46 %0 Conference Proceedings %T Hierarchical Takagi-Sugeno Models for Online Security Evaluation Systems %A Abraham, Ajith %A Grosan, Crina %A Liu, Hongbo %A Chen, Yuehui %S Fifth International Conference on Information Assurance and Security, IAS ’09 %D 2009 %8 aug %V 1 %F Abraham:2009:IAS %X Risk assessment is often done by human experts, because there is no exact and mathematical solution to the problem. Usually the human reasoning and perception process cannot be expressed precisely. This paper propose a light weight risk assessment system based on an Hierarchical Takagi-Sugeno model designed using evolutionary algorithms. Performance comparison is done with neuro-fuzzy and genetic programming methods. Empirical results indicate that the techniques are robust and suitable for developing light weight risk assessment models, which could be integrated with intrusion detection and prevention systems. %K genetic algorithms, genetic programming, hierarchical Takagi-Sugeno models, human perception, human reasoning, intrusion detection, neuro-fuzzy programming, online security evaluation systems, risk assessment, fuzzy reasoning, hierarchical systems, human factors, interactive programming, risk management, security of data %R doi:10.1109/IAS.2009.348 %U http://dx.doi.org/doi:10.1109/IAS.2009.348 %P 687-692 %0 Book Section %T Complimentary Selection as an Alternative Method for Population Reproduction %A Abrams, Zoe %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2000 %D 2000 %8 jun %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F abrams:2000:CSAMPR %K genetic algorithms, genetic programming %P 8-15 %0 Conference Proceedings %T Classification using Cultural Co-Evolution and Genetic Programming %A Abramson, Myriam %A Hunter, Lawrence %Y Koza, John R. %Y Goldberg, David E. %Y Fogel, David B. %Y Riolo, Rick L. %S Genetic Programming 1996: Proceedings of the First Annual Conference %D 1996 %8 28–31 jul %I MIT Press %C Stanford University, CA, USA %F abramson:1996:cccGP %K genetic algorithms, genetic programming %U http://cognet.mit.edu/sites/default/files/books/9780262315876/pdfs/9780262315876_chap30.pdf %P 249-254 %0 Journal Article %T Automatic Modulation Classification Using Moments And Likelihood Maximization %A Abu-Romoh, M. %A Aboutaleb, A. %A Rezki, Z. %J IEEE Communications Letters %D 2018 %@ 1089-7798 %F Abu-Romoh:2018:ieeeCL %X Motivated by the fact that moments of the received signal are easy to compute and can provide a simple way to automatically classify the modulation of the transmitted signal, we propose a hybrid method for automatic modulation classification that lies in the intersection between likelihood-based and feature-based classifiers. Specifically, the proposed method relies on statistical moments along with a maximum likelihood engine. We show that the proposed method offers a good tradeoff between classification accuracy and complexity relative to the Maximum Likelihood (ML) classifier. Furthermore, our classifier outperforms state-of-the-art machine learning classifiers, such as genetic programming-based K-nearest neighbour (GP-KNN) classifiers, the linear support vector machine classifier (LSVM) and the fold-based Kolmogorov-Smirnov (FB-KS) algorithm. %K genetic algorithms, genetic programming, Feature extraction, Machine learning algorithms, Modulation, Probability density function, Receivers, Signal to noise ratio, Support vector machines %9 journal article %R doi:10.1109/LCOMM.2018.2806489 %U http://dx.doi.org/doi:10.1109/LCOMM.2018.2806489 %0 Conference Proceedings %T New universal gate library for synthesizing reversible logic circuit using genetic programming %A Abubakar, Mustapha Yusuf %A Jung, Low Tang %A Zakaria, Mohamed Nordin %A Younesy, Ahmed %A Abdel-Atyz, Abdel-Haleem %S 2016 3rd International Conference on Computer and Information Sciences (ICCOINS) %D 2016 %8 aug %F Abubakar:2016:ICCOINS %X We newly formed universal gate library, that includes NOT, CNOT (Feyman), Toffoli, Fredkin, Swap, Peres gates and a special gate called G gate. The gate G on its own is a universal gate, but using it alone in a library will result in large circuit realization. G gate combines the operations of Generalized Toffoli gates. For example a gate called G3 combines the operations of NOT, CNOT and T3 (3 - bit Toffoli) gates all in one place. The new library was used in synthesizing reversible circuits. The experiment was done using Genetic programming algorithm that is capable of allowing the choice of any type of gate library and optimising the circuit. The results were promising because the gate complexity in the circuits were drastically reduced compared to previously attempted synthesis. %K genetic algorithms, genetic programming %R doi:10.1109/ICCOINS.2016.7783234 %U http://dx.doi.org/doi:10.1109/ICCOINS.2016.7783234 %P 316-321 %0 Journal Article %T Reversible circuit synthesis by genetic programming using dynamic gate libraries %A Abubakar, Mustapha Yusuf %A Jung, Low Tang %A Zakaria, Nordin %A Younes, Ahmed %A Abdel-Aty, Abdel-Haleem %J Quantum Information Processing %D 2017 %V 16 %N 6 %F journals/qip/AbubakarJZYA17 %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s11128-017-1609-8 %U http://dx.doi.org/doi:10.1007/s11128-017-1609-8 %P 160 %0 Conference Proceedings %T Synthesis of Reversible Logic Using Enhanced Genetic Programming Approach %A Abubakar, Mustapha Yusuf %A Tang Jung, Low %S 2018 4th International Conference on Computer and Information Sciences (ICCOINS) %D 2018 %8 aug %F Abubakar:2018:ICCOINS %X A new enhanced reversible logic circuit synthesis method was developed using reversible gates that include NOT, CNOT (Feynman), Toffoli, Fredkin, Swap, and Peres gates. The synthesis method was done using newly developed genetic programming. Usually previous synthesis methods that uses genetic algorithms or other similar evolutionary algorithms suffers a problem known as blotting which is a sudden uncontrolled growth of an individual (circuit), which may render the synthesis inefficient because of memory usage, making the algorithm difficult to continue running and eventually stack in a local minima, there for an optimized reversible circuit may not be generated. In this method the algorithm used was blot free, the blotting was carefully controlled by fixing a suitable length and size of the individuals in the population. Following this approach, the cost of generating circuits was greatly reduced giving the algorithm to reach the end of the last designated generation to give out optimal or near optimal results. The results of the circuits generated using this method were compared with some of the results already in the literature, and in many cases, our results appeared to be better in terms of gate count and quantum cost metrics. %K genetic algorithms, genetic programming %R doi:10.1109/ICCOINS.2018.8510602 %U http://dx.doi.org/doi:10.1109/ICCOINS.2018.8510602 %0 Journal Article %T Novel electrochemical impedance simulation design via stochastic algorithms for fitting equivalent circuits %A Abud Kappel, Marco Andre %A Fabbri, Ricardo %A Domingos, Roberto P. %A Bastos, Ivan N. %J Measurement %D 2016 %V 94 %@ 0263-2241 %F Abud-Kappel:2016:Measurement %X Electrochemical impedance spectroscopy (EIS) is of great value to corrosion studies because it is sensitive to transient changes that occur in the metal-electrolyte interface. A useful way to link the results of electrochemical impedance spectroscopy to corrosion phenomena is by simulating equivalent circuits. Equivalent circuit models are very attractive because of their relative simplicity, enabling the monitoring of electrochemical systems that have a complex physical mechanism. In this paper, the stochastic algorithm Differential Evolution is proposed to fit an equivalent circuit to the EIS results for a wide potential range. EIS is often limited to the corrosion potential despite being widely used. This greatly hinders the analysis regarding the effect of the applied potential, which strongly affects the interface, as shown, for example, in polarization curves. Moreover, the data from both the EIS and the DC values were used in the proposed scheme, allowing the best fit of the model parameters. The approach was compared to the standard Simplex square residual minimization of EIS data. In order to manage the large amount of generated data, the EIS-Mapper software package, which also plots the 2D/3D diagrams with potential, was used to fit the equivalent circuit of multiple diagrams. Furthermore, EIS-Mapper also computed all simulations. The results of 67 impedance diagrams of stainless steel in a 3.5percent NaCl medium at 25C obtained in steps of 10mV, and the respective values of the fitted parameters of the equivalent circuit are reported. The present approach conveys new insight to the use of electrochemical impedance and bridges the gap between polarization curves and equivalent electrical circuits. %K genetic algorithms, genetic programming, Differential evolution, Electrochemical impedance, Impedance measurements, Corrosion, Optimization, Stochastic methods %9 journal article %R doi:10.1016/j.measurement.2016.08.008 %U https://www.sciencedirect.com/science/article/pii/S0263224116304699 %U http://dx.doi.org/doi:10.1016/j.measurement.2016.08.008 %P 344-354 %0 Thesis %T Stochastic computational techniques applied to the simulation of electrochemical impedance spectroscopy diagrams %A Kappel, Marco Andre Abud %D 2016 %8 August %C Nova Friburgo, Brazil %C Centro de Tecnologia e Ciencias, Instituto Politecnico, Universidadedo Estado do Rio de Janeiro %F Tese_MarcoAndreAbudKappel %X Electrochemical impedance spectroscopy is a widely used technique in electrochemical systems characterization. With applications in several areas, the technique is very useful in the study of corrosion because it is sensitive to transient changes that occur in the metal interface. The results from the technique can be expressed and interpreted in different ways, allowing different modeling and analysis methods, such as the use of kinetic models or equivalent circuits. In corrosion, the technique is usually applied only in a few specific potentials, such as the corrosion potential, the most important. With the motivation of improving the impedance modeling and analysis process, taking into consideration that the electrochemical phenomena are strongly linked to the potential, this work introduces the possibility to express the impedance data in a wide potential range, and use them to equivalent circuits fitting. Thus, different phenomena can be modeled adequately by equivalent circuits corresponding to different potentials. For this purpose, the related inverse problem is solved for each potential through a complex nonlinear optimization process. In addition to the transient data obtained by the spectroscopy, stationary data are also used in the optimization as a regularisation factor, supporting a consistent solution to the physical phenomena involved, from the maximum experimental frequency to theoretical zero frequency. An analysis, modeling and simulation software was developed with the following features: 1) validation of experimental data, through the Kramers-Kronig relations; 2) simultaneous visualization of impedance results for a wide potential range; 3) fitting different equivalent circuits for different ranges using transient and stationary experimental data, in conjunction with deterministic or stochastic methods; 4) generation of confidence regions for the estimated parameters, making them statistically significant; 5) simulations using the fitted equivalent circuits in computer cluster; 6) parameter sensitivity analysis according to the applied potential, revealing important physical characteristics involved in the electrochemical processes. Finally, experimental fitting results and the corresponding simulations are shown and discussed. Results show that the use of a population-based stochastic optimization method not only increases the odds of finding the global optimum, but also enables the generation of confidence regions around the found values. Furthermore, only the circuit fitted with the new objective function has equivalence with both transient data and stationary data for the entire potential range involved. %K genetic algorithms, genetic programming, Electrochemical impedance spectroscopy, Corrosion, Complex nonlinear optimization, Equivalent electrical circuit, Stochastic methods %9 Ph.D. thesis %U http://www.bdtd.uerj.br/handle/1/13692 %0 Journal Article %T A study of equivalent electrical circuit fitting to electrochemical impedance using a stochastic method %A Abud Kappel, Marco Andre %A Peixoto, Fernando Cunha %A Platt, Gustavo Mendes %A Domingos, Roberto Pinheiro %A Bastos, Ivan Napoleao %J Applied Soft Computing %D 2017 %8 jan %V 50 %@ 1568-4946 %F Abud-Kappel:2017:ASC %X Modeling electrochemical impedance spectroscopy is usually done using equivalent electrical circuits. These circuits have parameters that need to be estimated properly in order to make possible the simulation of impedance data. Despite the fitting procedure is an optimization problem solved recurrently in the literature, rarely statistical significance of the estimated parameters is evaluated. In this work, the optimization process for the equivalent electrical circuit fitting to the impedance data is detailed. First, a mathematical development regarding the minimization of residual least squares is presented in order to obtain a statistically valid objective function of the complex nonlinear regression problem. Then, the optimization method used in this work is presented, the Differential Evolution, a global search stochastic method. Furthermore, it is shown how a population-based stochastic method like this can be used directly to obtain confidence regions to the estimated parameters. A sensitivity analysis was also conducted. Finally, the equivalent circuit fitting is done to model synthetic experimental data, in order to demonstrate the adopted procedure. %K genetic algorithms, genetic programming, Differential evolution, Electrochemical impedance, Optimization, Stochastic method, Statistical analysis %9 journal article %R doi:10.1016/j.asoc.2016.11.030 %U https://www.sciencedirect.com/science/article/pii/S1568494616305993 %U http://dx.doi.org/doi:10.1016/j.asoc.2016.11.030 %P 183-193 %0 Conference Proceedings %T Cartesian Genetic Programing Applied to Equivalent Electric Circuit Identification %A Abud Kappel, Marco Andre %A Domingos, Roberto Pinheiro %A Bastos, Ivan Napoleao %Y Rodrigues, H. C. %Y Herskovits, J. %Y Mota Soares, C. M. %Y Araujo, A. L. %Y Guedes, J. M. %Y Folgado, J. O. %Y Moleiro, F. %Y Madeira, J. F. A. %S Proceedings of the 6th International Conference on Engineering Optimization. EngOpt 2018 %D 2018 %8 17 19 sep %I Springer %C Lisbon, Portugal %F Abud-Kappel:2018:EngOpt %X Equivalent electric circuits are widely used in electrochemical impedance spectroscopy (EIS) data modeling. EIS modeling involves the identification of an electrical circuit physically equivalent to the system under analysis. This equivalence is based on the assumption that each phenomenon of the electrode interface and the electrolyte is represented by electrical components such as resistors, capacitors and inductors. This analogy allows impedance data to be used in simulations and predictions related to corrosion and electrochemistry. However, when no prior knowledge of the inner workings of the process under analysis is available, the identification of the circuit model is not a trivial task. The main objective of this work is to improve both the equivalent circuit topology identification and the parameter estimation by using a different approach than the usual Genetic Programming. In order to accomplish this goal, a methodology was developed to unify the application of Cartesian Genetic Programming to tackle system topology identification and Differential Evolution for optimization of the circuit parameters. The performance and effectiveness of this methodology were tested by performing the circuit identification on four different known systems, using numerically simulated impedance data. Results showed that the applied methodology was able to identify with satisfactory precision both the circuits and the values of the components. Results also indicated the necessity of using a stochastic method in the optimization process, since more than one electric circuit can fit the same dataset. The use of evolutionary adaptive metaheuristics such as the Cartesian Genetic Programming allows not only the estimation of the model parameters, but also the identification of its optimal topology. However, due to the possibility of multiple solutions, its application must be done with caution. Whenever possible, restrictions on the search space should be added, bearing in mind the correspondence of the model to the studied physical phenomena. %K genetic algorithms, genetic programming, Cartesian Genetic Programming, Differential Evolution, Complex nonlinear optimization, Equivalent electric circuit identification %R doi:10.1007/978-3-319-97773-7_79 %U http://dx.doi.org/doi:10.1007/978-3-319-97773-7_79 %P 913-925 %0 Conference Proceedings %T Action Scheduling Optimization using Cartesian Genetic Programming %A Abud Kappel, Marco Andre %S 2019 8th Brazilian Conference on Intelligent Systems (BRACIS) %D 2019 %8 oct %F Abud-Kappel:2019:BRACIS %X Action scheduling optimisation is a problem that involves chronologically organizing a set of actions, jobs or commands in order to accomplish a pre-established goal. This kind of problem can be found in a number of areas, such as production planning, delivery logistic organization, robot movement planning and behavior programming for intelligent agents in games. Despite being a recurrent problem, selecting the appropriate time and order to execute each task is not trivial, and typically involves highly complex techniques. The main objective of this work is to provide a simple alternative to tackle the action scheduling problem, by using Cartesian Genetic Programming as an approach. The proposed solution involves the application of two simple main steps: defining the set of available actions and specifying an objective function to be optimized. Then, by the means of the evolutionary algorithm, an automatically generated schedule will be revealed as the most fitting to the goal. The effectiveness of this methodology was tested by performing an action schedule optimization on two different problems involving virtual agents walking in a simulated environment. In both cases, results showed that, throughout the evolutionary process, the simulated agents naturally chose the most efficient sequential and parallel combination of actions to reach greater distances. The use of evolutionary adaptive metaheuristics such as Cartesian Genetic Programming allows the identification of the best possible schedule of actions to solve a problem. %K genetic algorithms, genetic programming, Cartesian Genetic Programming %R doi:10.1109/BRACIS.2019.00059 %U http://dx.doi.org/doi:10.1109/BRACIS.2019.00059 %P 293-298 %0 Conference Proceedings %T A Genetic Algorithm for Solving the P-Median Problem %A Abu Dalhoum, Abdel Latif %A Al Zoubi, Moh’d %A de la Cruz, Marina %A Ortega, Alfonso %A Alfonseca, Manuel %Y Teixeira, J. Manuel Feliz %Y E.Carvalho Brito, A. %S European Simulation and Modeling Conference ESM’2005 %D 2005 %8 oct 24 26 %I http://www.eurosis.org %C Porto, Portugal %@ 90-77381-22-8 %F AbuDalhoum:2005:ESM %X One of the most popular location-allocation models among researchers is the p-median. Most of the algorithmic research on these models has been devoted to developing heuristic solution procedures. The major drawback of heuristic methods is that the time required finding solutions can become unmanageable. In this paper, we propose a new algorithm, using different variants of grammar evolution, to solve the p-median problem. %K genetic algorithms, genetic programming, grammatical evolution, Christiansen grammar, location allocation, p-median model, grammar evolution %U http://arantxa.ii.uam.es/~alfonsec/docs/confint/pmedian.pdf %P 141-145 %0 Journal Article %T Predicting failure pressure of the corroded offshore pipelines using an efficient finite element based algorithm and machine learning techniques %A Abyani, Mohsen %A Bahaari, Mohammad Reza %A Zarrin, Mohamad %A Nasseri, Mohsen %J Ocean Engineering %D 2022 %8 15 jun %V 254 %@ 0029-8018 %F ABYANI:2022:oceaneng %X This paper aims to predict the failure pressure of corroded offshore pipelines, employing different machine learning techniques. To this end, an efficient finite element based algorithm is programmed to numerically estimate the failure pressure of offshore pipelines, subjected to internal corrosion. In this process, since the computational effort of such numerical assessment is very high, the application of reliable machine learning methods is used as an alternative solution. Thus, 1815 realizations of four variables are generated, and each one is keyed into the numerical model of a sample pipeline. Thereafter, the machine learning models are constructed based on the results of the numerical analyses, and their performance are compared with each other. The results indicate that Gaussian Process Regression (GPR) and MultiLayer Perceptron (MLP) have the best performance among all the chosen models. Considering the testing dataset, the squared correlation coefficient and Root Mean Squared Error (RMSE) values of GPR and MLP models are 0.535, 0.545 and 0.993 and 0.992, respectively. Moreover, the Maximum Von-Mises Stress (MVMS) of the pipeline increases as the water depth grows at low levels of Internal Pressure (IP). Inversely, increase in water depth leads to reduction in the MVMS values at high IP levels %K genetic algorithms, genetic programming, Offshore pipelines, Corrosion, Artificial neural network, ANN, Genetic programing, Support vector machine, SVM, Random forest, Gaussian process regression %9 journal article %R doi:10.1016/j.oceaneng.2022.111382 %U https://www.sciencedirect.com/science/article/pii/S0029801822007697 %U http://dx.doi.org/doi:10.1016/j.oceaneng.2022.111382 %P 111382 %0 Report %T Intensional Encapsulations of Database Subsets by Genetic Programming %A Acar, Aybar C. %A Motro, Amihai %D 2005 %8 feb %N ISE-TR-05-01 %I Information and Software Engineering Department, The Volgenau School of Information Technology and Engineering, George Mason University %F AcarM05tr %X Finding intensional encapsulations of database subsets is the inverse of query evaluation. Whereas query evaluation transforms an intensional expression (the query) to its extension (a set of data values), intensional encapsulation assigns an intensional expression to a given set of data values. We describe a method for deriving intensional representations of subsets of records in large database tables. Our method is based on the paradigm of genetic programming. It is shown to achieve high accuracy and maintain compact expression size, while requiring cost that is acceptable to all applications, but those that require instantaneous results. Intensional encapsulation has a broad range of applications including cooperative answering, information integration, security and data mining. %K genetic algorithms, genetic programming %U http://ise.gmu.edu/techrep/2005/05_01.pdf %0 Conference Proceedings %T Intensional Encapsulations of Database Subsets via Genetic Programming %A Acar, Aybar C. %A Motro, Amihai %Y Andersen, Kim Viborg %Y Debenham, John K. %Y Wagner, Roland %S Database and Expert Systems Applications, 16th International Conference, DEXA 2005, Proceedings %S Lecture Notes in Computer Science %D 2005 %8 aug 22 26 %V 3588 %I Springer %C Copenhagen, Denmark %@ 3-540-28566-0 %F conf/dexa/AcarM05 %X Finding intensional encapsulations of database subsets is the inverse of query evaluation. Whereas query evaluation transforms an intensional expression (the query) to its extension (a set of data values), intensional encapsulation assigns an intensional expression to a given set of data values. We describe a method for deriving intensional representations of subsets of records in large database tables. Our method is based on the paradigm of genetic programming. It is shown to achieve high accuracy and maintain compact expression size, while requiring cost that is acceptable to all applications, but those that require instantaneous results. Intensional encapsulation has a broad range of applications including cooperative answering, information integration, security and data mining. %K genetic algorithms, genetic programming %R doi:10.1007/11546924_36 %U http://dx.doi.org/doi:10.1007/11546924_36 %P 365-374 %0 Thesis %T Query Consolidation: Interpreting Queries Sent to Independent Heterogenous Databases %A Acar, Aybar C. %D 2008 %8 23 jul %C Fairfax, VA, USA %C The Volgenau School of Information Technology and Engineering, George Mason University %F Acar:thesis %X This dissertation introduces the problem of query consolidation, which seeks to interpret a set of disparate queries submitted to independent databases with a single global query. The problem has multiple applications, from improving virtual database design, to aiding users in information retrieval, to protecting against inference of sensitive data from a seemingly innocuous set of apparently unrelated queries. The problem exhibits attractive duality with the much-researched problem of query decomposition, which has been addressed intensively in the context of multidatabase environments: How to decompose a query submitted to a virtual database into a set of local queries that are evaluated in individual databases. The new problem is set in the architecture of a canonical multidatabase system, using it in the reverse direction. The reversal is built on the assumption of conjunctive queries and source descriptions. A rational and efficient query decomposition strategy is also assumed, and this decomposition is reversed to arrive at the original query by analyzing the decomposed components. The process incorporates several steps where a number of solutions must be considered, due to the fact that query decomposition is not injective. Initially, the problem of finding the most likely join plan between component queries is investigated. This is accomplished by leveraging the referential constraints available in the underlying multidatabase, or by approximating these constraints from the data when not available. This approximation is done using the information theoretic concept of conditional entropy. Furthermore, the most likely join plans are enhanced by the expansion of their projections and adding precision to their selection constraints by estimating the selection constraints that would be applied to these consolidations offline. Additionally, the extraction of a set of queries related to the same retrieval task from an ongoing sequence of incoming queries is investigated. A conditional random field model is trained to segment and label incoming query sequences. Finally, the candidate consolidations are re-encapsulated with a genetic programming approach to find simpler intentional descriptions that are extensionally equivalent to discover the original intent of the query. The dissertation explains and discusses all of the above operations and validates the methods developed with experimentation on synthesised and real-world data. The results are highly encouraging and verify that the accuracy, time performance, and scalability of the methods would make it possible to exploit query consolidation in production environments. %K genetic algorithms, genetic programming, Databases, Information Integration, Query Processing, Machine Learning %9 Ph.D. thesis %U http://hdl.handle.net/1920/3223 %0 Journal Article %T Automatic design of specialized algorithms for the binary knapsack problem %A Acevedo, Nicolas %A Rey, Carlos %A Contreras-Bolton, Carlos %A Parada, Victor %J Expert Systems with Applications %D 2020 %V 141 %@ 0957-4174 %F ACEVEDO:2020:ESA %X Not all problem instances of a difficult combinatorial optimization problem have the same degree of difficulty for a given algorithm. Surprisingly, apparently similar problem instances may require notably different computational efforts to be solved. Few studies have explored the case that the algorithm that solves a combinatorial optimization problem is automatically designed. In consequence, the generation of the best algorithms may produce specialized algorithms according to the problem instances used during the constructive step. Following a constructive process based on genetic programming that combines heuristic components with an exact method, new algorithms for the binary knapsack problem are produced. We found that most of the automatically designed algorithms have better performance when solving instances of the same type used during construction, although the algorithms also perform well with other types of similar instances. The rest of the algorithms are partially specialized. We also found that the exact method that only solves a small knapsack problem has a key role in such results. When the algorithms are produced without considering such a method, the errors are higher. We observed this fact when the algorithms were constructed with a combination of instances from different types. These results suggest that the better the pre-classification of the instances of an optimization problem, the more specific and more efficient are the algorithms produced by the automatic generation of algorithms. Consequently, the method described in this article accelerates the search for efficient methods for NP-hard optimization problems %K genetic algorithms, genetic programming, Automatic generation of algorithms, Binary knapsack problem, Hyperheuristic, Generative design of algorithms %9 journal article %R doi:10.1016/j.eswa.2019.112908 %U http://www.sciencedirect.com/science/article/pii/S0957417419306268 %U http://dx.doi.org/doi:10.1016/j.eswa.2019.112908 %P 112908 %0 Journal Article %T A novel fitness function in genetic programming to handle unbalanced emotion recognition data %A Acharya, Divya %A Goel, Shivani %A Asthana, Rishi %A Bhardwaj, Arpit %J Pattern Recognition Letters %D 2020 %V 133 %@ 0167-8655 %F ACHARYA:2020:PRL %X In the area of behavioral psychology, real-time emotion recognition by using physiological stimuli is an active topic of interest. This research considers the recognition of two class of emotions i.e., positive and negative emotions using EEG signals in response to happy, horror, sad, and neutral genres. In a noise-free framework for data acquisition of 50 participants, NeuroSky MindWave 2 is used. The dataset collected is unbalanced i.e., there are more instances of positive classes than negative ones. Therefore, accuracy is not a useful metric to assess the results of the unbalanced dataset because of biased results. So, the primary goal of this research is to address the issue of unbalanced emotion recognition dataset classification, for which we are proposing a novel fitness function known as Gap score (G score), which learns about both the classes by giving them equal importance and being unbiased. The genetic programming (GP) framework in which we implemented G score is named as G-score GP (GGP). The second goal is to assess how distinct genres affect human emotion recognition process and to identify an age group that is more active emotionally when their emotions are elicited. Experiments were conducted on EEG data acquired with a single-channel EEG device. We have compared the performance of GGP for the classification of emotions with state-of-the-art methods. The analysis shows that GGP provides 87.61percent classification accuracy by using EEG. In compliance with the self-reported feelings, brain signals of 26 to 35 years of age group provided the highest emotion recognition rate %K genetic algorithms, genetic programming, Emotion recognition, Fitness function, EEG, Fast Fourier transformation %9 journal article %R doi:10.1016/j.patrec.2020.03.005 %U http://www.sciencedirect.com/science/article/pii/S0167865520300830 %U http://dx.doi.org/doi:10.1016/j.patrec.2020.03.005 %P 272-279 %0 Journal Article %T Emotion recognition using fourier transform and genetic programming %A Acharya, Divya %A Billimoria, Anosh %A Srivastava, Neishka %A Goel, Shivani %A Bhardwaj, Arpit %J Applied Acoustics %D 2020 %8 jul %V 164 %@ 0003-682X %F ACHARYA:2020:AA %X In cognitive science, the real-time recognition of humans emotional state is pertinent for machine emotional intelligence and human-machine interaction. Conventional emotion recognition systems use subjective feedback questionnaires, analysis of facial features from videos, and online sentiment analysis. This research proposes a system for real-time detection of emotions in response to emotional movie clips. These movie clips elicitate emotions in humans, and during that time, we have recorded their brain signals using Electroencephalogram (EEG) device and analyze their emotional state. This research work considered four class of emotions (happy, calm, fear, and sadness). This method leverages Fast Fourier Transform (FFT) for feature extraction and Genetic Programming (GP) for classification of EEG data. Experiments were conducted on EEG data acquired with a single dry electrode device NeuroSky Mind Wave 2. To collect data, a standardized database of 23 emotional Hindi film clips were used. All clips individually induce different emotions, and data collection was done based on these emotions elicited as the clips contain emotionally inductive scenes. Twenty participants took part in this study and volunteered for data collection. This system classifies four discrete emotions which are: happy, calm, fear, and sadness with an average of 89.14percent accuracy. These results demonstrated improvements in state-of-the-art methods and affirmed the potential use of our method for recognising these emotions %K genetic algorithms, genetic programming, Electroencephalogram, Fast Fourier Transform, Emotion recognition, Movie clips, Cinema Films %9 journal article %R doi:10.1016/j.apacoust.2020.107260 %U http://lrcdrs.bennett.edu.in:80/handle/123456789/1183 %U http://dx.doi.org/doi:10.1016/j.apacoust.2020.107260 %P 107260 %0 Journal Article %T An enhanced fitness function to recognize unbalanced human emotions data %A Acharya, Divya %A Varshney, Nandana %A Vedant, Anindiya %A Saxena, Yashraj %A Tomar, Pradeep %A Goel, Shivani %A Bhardwaj, Arpit %J Expert Systems with Applications %D 2021 %V 166 %@ 0957-4174 %F ACHARYA:2021:ESA %X In cognitive science and human-computer interaction, automatic human emotion recognition using physiological stimuli is a key technology. This research considers two-class (positive and negative) of emotions recognition using electroencephalogram (EEG) signals in response to an emotional clip from the genres happy, amusement, sad, and horror. This paper introduces an enhanced fitness function named as eD-score to recognize emotions using EEG signals. The primary goal of this research is to assess how genres affect human emotions. We also analyzed human behaviour based on age and gender responsiveness. We have compared the performance of Multilayer Perceptron (MLP), K-nearest neighbors (KNN), Support Vector Machine (SVM), D-score Genetic Programming (DGP), and enhanced D-score Genetic Programming (eDGP) for classification of emotions. The analysis shows that for two class of emotion eDGP provides classification accuracy as 83.33percent, 84.69percent, 85.88percent, and 87.61percent for 50-50, 60-40, 70-30, and 10-fold cross-validations. Generalizability and reliability of this approach is evaluated by applying the proposed approach to publicly available EEG datasets DEAP and SEED. When participants in this research are exposed to amusement genre, their reaction is positive emotion. In compliance with the self-reported feelings, brain signals of 26-35 years of age group provided the highest emotional identification. Among genders, females are more emotionally active as compared to males. These results affirmed the potential use of our method for recognizing emotions %K genetic algorithms, genetic programming, Emotion recognition, Fitness function, EEG, Fast Fourier Transformation, Unbalanced dataset %9 journal article %R doi:10.1016/j.eswa.2020.114011 %U https://www.sciencedirect.com/science/article/pii/S0957417420307843 %U http://dx.doi.org/doi:10.1016/j.eswa.2020.114011 %P 114011 %0 Conference Proceedings %T Evolving patches for software repair %A Ackling, Thomas %A Alexander, Bradley %A Grunert, Ian %Y Krasnogor, Natalio %Y Lanzi, Pier Luca %Y Engelbrecht, Andries %Y Pelta, David %Y Gershenson, Carlos %Y Squillero, Giovanni %Y Freitas, Alex %Y Ritchie, Marylyn %Y Preuss, Mike %Y Gagne, Christian %Y Ong, Yew Soon %Y Raidl, Guenther %Y Gallager, Marcus %Y Lozano, Jose %Y Coello-Coello, Carlos %Y Silva, Dario Landa %Y Hansen, Nikolaus %Y Meyer-Nieberg, Silja %Y Smith, Jim %Y Eiben, Gus %Y Bernado-Mansilla, Ester %Y Browne, Will %Y Spector, Lee %Y Yu, Tina %Y Clune, Jeff %Y Hornby, Greg %Y Wong, Man-Leung %Y Collet, Pierre %Y Gustafson, Steve %Y Watson, Jean-Paul %Y Sipper, Moshe %Y Poulding, Simon %Y Ochoa, Gabriela %Y Schoenauer, Marc %Y Witt, Carsten %Y Auger, Anne %S GECCO ’11: Proceedings of the 13th annual conference on Genetic and evolutionary computation %D 2011 %8 December 16 jul %I ACM %C Dublin, Ireland %F Ackling:2011:GECCO %X Defects are a major concern in software systems. Unsurprisingly, there are many tools and techniques to facilitate the removal of defects through their detection and localisation. However, there are few tools that attempt to repair defects. To date, evolutionary tools for software repair have evolved changes directly in the program code being repaired. In this work we describe an implementation: pyEDB, that encodes changes as a series of code modifications or patches. These modifications are evolved as individuals. We show pyEDB to be effective in repairing some small errors, including variable naming errors in Python programs. We also demonstrate that evolving patches rather than whole programs simplifies the removal of spurious errors. %K genetic algorithms, genetic programming, genetic improvement, APR, SBSE, Debugging, fault-repair, Python %R doi:10.1145/2001576.2001768 %U https://hdl.handle.net/2440/70777 %U http://dx.doi.org/doi:10.1145/2001576.2001768 %P 1427-1434 %0 Journal Article %T Learning to Assemble Classifiers via Genetic Programming %A Acosta-Mendoza, Niusvel %A Morales-Reyes, Alicia %A Escalante, Hugo Jair %A Alonso, Andres Gago %J IJPRAI %D 2014 %V 28 %N 7 %F journals/ijprai/Acosta-MendozaMEA14 %K genetic algorithms, genetic programming %9 journal article %U http://dx.doi.org/10.1142/S0218001414600052 %0 Book Section %T Computers from Plants We Never Made: Speculations %A Adamatzky, Andrew %A Harding, Simon %A Erokhin, Victor %A Mayne, Richard %A Gizzie, Nina %A Baluska, Frantisek %A Mancuso, Stefano %A Sirakoulis, Georgios Ch. %E Stepney, Susan %E Adamatzky, Andrew %B Inspired by Nature: Essays Presented to Julian F. Miller on the Occasion of his 60th Birthday %S Emergence, Complexity and Computation %D 2017 %V 28 %I Springer %F Adamatzky:2017:miller %X Plants are highly intelligent organisms. They continuously make distributed processing of sensory information, concurrent decision making and parallel actuation. The plants are efficient green computers per se. Outside in nature, the plants are programmed and hardwired to perform a narrow range of tasks aimed to maximize the plants ecological distribution, survival and reproduction. To persuade plants to solve tasks outside their usual range of activities, we must either choose problem domains which homomorphic to the plants natural domains or modify biophysical properties of plants to make them organic electronic devices. We discuss possible designs and prototypes of computing systems that could be based on morphological development of roots, interaction of roots, and analogue electrical computation with plants, and plant-derived electronic components. In morphological plant processors data are represented by initial configuration of roots and configurations of sources of attractants and repellents; results of computation are represented by topology of the roots network. Computation is implemented by the roots following gradients of attractants and repellents, as well as interacting with each other. Problems solvable by plant roots, in principle, include shortest-path, minimum spanning tree, Voronoi diagram, alpha-shapes, convex subdivision of concave polygons. Electrical properties of plants can be modified by loading the plants with functional nanoparticles or coating parts of plants of conductive polymers. Thus, we are in position to make living variable resistors, capacitors, operational amplifiers, multipliers, potentiometers and fixed-function generators. The electrically modified plants can implement summation, integration with respect to time, inversion, multiplication, exponentiation, logarithm, division. Mathematical and engineering problems to be solved can be represented in plant root networks of resistive or reaction elements. Developments in plant-based computing architectures will trigger emergence of a unique community of biologists, electronic engineering and computer scientists working together to produce living electronic devices which future green computers will be made of. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-319-67997-6_17 %U http://dx.doi.org/doi:10.1007/978-3-319-67997-6_17 %P 357-387 %0 Conference Proceedings %T Forecasting the MagnetoEncephaloGram (MEG) of Epileptic Patients Using Genetically Optimized Neural Networks %A Adamopoulos, Adam V. %A Georgopoulos, Efstratios F. %A Likothanassis, Spiridon D. %A Anninos, Photios A. %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 2 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F adamopoulos:1999:FMEPUGONN %K real world applications %U http://gpbib.cs.ucl.ac.uk/gecco1999/RW-767.pdf %P 1457-1462 %0 Book Section %T Creation of Simple, Deadline, and Priority Scheduling Algorithms using Genetic Programming %A Adams, Thomas P. %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2002 %D 2002 %8 jun %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F adams:2002:CSDPSAGP %K genetic algorithms, genetic programming %U http://www.genetic-programming.org/sp2002/Adams.pdf %P 1-10 %0 Conference Proceedings %T Computational Scientific Discovery and Cognitive Science Theories %A Addis, Mark %A Sozou, Peter D. %A Lane, Peter C. %A Gobet, Fernand %Y Mueller, Vincent C. %S Computing and Philosophy: Selected Papers from IACAP 2014 %D 2016 %I Springer %F Addis:2014:IACAP %X This study is concerned with processes for discovering new theories in science. It considers a computational approach to scientific discovery, as applied to the discovery of theories in cognitive science. The approach combines two ideas. First, a process-based scientific theory can be represented as a computer program. Second, an evolutionary computational method, genetic programming, allows computer programs to be improved through a process of computational trial and error. Putting these two ideas together leads to a system that can automatically generate and improve scientific theories. The application of this method to the discovery of theories in cognitive science is examined. Theories are built up from primitive operators. These are contained in a theory language that defines the space of possible theories. An example of a theory generated by this method is described. These results support the idea that scientific discovery can be achieved through a heuristic search process, even for theories involving a sequence of steps. However, this computational approach to scientific discovery does not eliminate the need for human input. Human judgement is needed to make reasonable prior assumptions about the characteristics of operators used in the theory generation process, and to interpret and provide context for the computationally generated theories. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-319-23291-1_6 %U http://eprints.lse.ac.uk/66168/ %U http://dx.doi.org/doi:10.1007/978-3-319-23291-1_6 %P 83-97 %0 Conference Proceedings %T Regression genetic programming for estimating trend end in foreign exchange market %A Adegboye, Adesola %A Kampouridis, Michael %A Johnson, Colin G. %S 2017 IEEE Symposium Series on Computational Intelligence (SSCI) %D 2017 %8 27 nov 1 dec %C Honolulu, HI, USA %F Adegboye:2017:ieeeSSCI) %X Most forecasting algorithms use a physical time scale for studying price movement in financial markets, making the flow of physical time discontinuous. The use of a physical time scale can make companies oblivious to significant activities in the market, which poses a risk. Directional changes is a different and newer approach, which uses an event-based time scale. This approach summarises data into alternating trends called upward directional change and downward directional change. Each of these trends are further dismembered into directional change (DC) event and overshoot (OS) event. We present a genetic programming (GP) algorithm that evolves equations that express linear and non-linear relationships between the length of DC and OS events in a given dataset. This allows us to have an expectation when a trend will reverse, which can lead to increased profitability. This novel trend reversal estimation approach is then used as part of a DC-based trading strategy. We aim to appraise whether the new knowledge can lead to greater excess return. We assess the efficiency of the modified trading strategy on 250 different directional changes datasets from five different thresholds and five different currency pairs, consisting of intraday data from the foreign exchange (Forex) spot market. Results show that our algorithm is able to return profitable trading strategies and statistically outperform state-of-the-art financial trading strategies, such as technical analysis, buy and hold and other DC-based trading strategies. %K genetic algorithms, genetic programming %R doi:10.1109/SSCI.2017.8280833 %U http://dx.doi.org/doi:10.1109/SSCI.2017.8280833 %0 Journal Article %T Machine learning classification and regression models for predicting directional changes trend reversal in FX markets %A Adegboye, Adesola %A Kampouridis, Michael %J Expert Systems with Applications %D 2021 %8 January %V 173 %@ 0957-4174 %F ADEGBOYE:2021:ESA %X Most forecasting algorithms in financial markets use physical time for studying price movements, making the flow of time discontinuous. The use of physical time scale can make traders oblivious to significant activities in the market, which poses a risk. Directional changes (DC) is an alternative approach that uses event-based time to sample data. In this work, we propose a novel DC-based framework, which uses machine learning algorithms to predict when a trend will reverse. This allows traders to be in a position to take an action before this happens and thus increase their profitability. We combine our approach with a novel DC-based trading strategy and perform an in-depth investigation, by applying it to 10-min data from 20 foreign exchange markets over a 10-month period. The total number of tested datasets is 1,000, which allows us to argue that our results can be generalised and are widely applicable. We compare our results to ten benchmarks (both DC and non-DC based, such as technical analysis and buy-and-hold). Our findings show that our proposed approach is able to return a significantly higher profit, as well as reduced risk, and statistically outperform the other trading strategies in a number of different performance metrics %K genetic algorithms, genetic programming, Directional changes, Regression, Classification, Forex/FX, Machine learning %9 journal article %R doi:10.1016/j.eswa.2021.114645 %U https://kar.kent.ac.uk/89886/1/Adegboye-INT2021_preprint.pdf %U http://dx.doi.org/doi:10.1016/j.eswa.2021.114645 %P 114645 %0 Thesis %T Estimating Directional Changes Trend Reversal in Forex Using Machine Learning %A Adegboye, Adesola Tolulope Noah %D 2022 %8 mar %C UK %C University of Kent %F Adegboye:thesis %X Most forecasting algorithms use a physical time scale data to study price movement in financial markets by taking snapshots in fixed schedule, making the flow of time discontinuous. The use of a physical time scale can make traders oblivious to significant activities in the market, which poses risks. For example, currency risk, the risk that exchange rate will change. Directional changes is a different and newer approach of taking snapshot of the market, which uses an event-based time scale. This approach summarises data into alternating trends called upward directional change and downward directional change according to a change in price a trader considers to be significant, which is expressed as a threshold. The trends in the summary are split into directional change (DC) and overshoot (OS) events. In this work, we propose a novel DC-based framework, which uses machine learning algorithms to forecast when the next, alternate trend is expected to begin. First, we present a genetic programming (GP) algorithm that evolves equations that express linear and non-linear relationships between the length of DC and OS events in a given dataset. Awareness of DC event and OS event lengths provide traders with an idea of when DC trends are expected to reverse and thus take appropriate action to increase profit or mitigate risk. Second, DC trends can be categorised into two distinct types: (1) trends with OS events; and (2) trends without OS events(i.e. OS event length is 0). Trends with OS events are those that continue beyond a period when they were first observed and trends without OS event are others that ends as soon as they were observed. To further improve trend reversal estimation accuracy, we identified these two categorises using classification techniques and estimated OS event length for trends that belong in the first category. We appraised whether this new knowledge could lead to an even greater excess return. Third, our novel trend reversal estimation approach was then used as part of a novel genetic algorithm (GA) based trading strategy. The strategy embedded an optimised trend reversal forecasting algorithm that was based on trend reversal point forecasted by multiple thresholds. We assessed the efficiency of our framework (i.e., a novel trend reversal approach and an optimised trading strategy) by performing an in-depth investigation. To assess our approach and evaluate the extent to which it could be generalised in Forex markets, we used five tailored thresholds to create 1000 DC datasets from 10, monthly 10 minute physical time data of 20 major Forex markets (i.e 5 thresholds * 10 months * 20 currency pairs). We compared our results to six benchmarks techniques, both DC and non-DC based, such as technical analysis and buy-and-hold. Our findings showed that our proposed approach can return a significantly higher profit at reduced risk, and statistically outperformed the other trading strategies compareds in a number of different performance metrics. %K genetic algorithms, genetic programming %9 Ph.D. thesis %R doi:10.22024/UniKent/01.02.94107 %U https://kar.kent.ac.uk/94107/ %U http://dx.doi.org/doi:10.22024/UniKent/01.02.94107 %0 Journal Article %T Effect of varied fiber alkali treatments on the tensile strength of Ampelocissus cavicaulis reinforced polyester composites: Prediction, optimization, uncertainty and sensitivity analysis %A Adeyi, Abiola John %A Adeyi, Oladayo %A Oke, Emmanuel Olusola %A Olalere, Olusegun Abayomi %A Oyelami, Seun %A Ogunsola, Akinola David %J Advanced Industrial and Engineering Polymer Research %D 2021 %V 4 %N 1 %@ 2542-5048 %F ADEYI:2021:AIEPR %X Studies on modeling and optimization of alkali treatment, investigation of experimental uncertainty and sensitivity analysis of alkali treatment factors of natural fibers are important to effective natural fiber reinforced polymer composite development. In this contribution, response surface methodology (RSM) was employed to investigate and optimize the effect of varied treatment factors (sodium hydroxide concentration (NaOH) and soaking time (ST)) of the alkali treatment of Ampelocissus cavicaulis natural fiber (ACNF) on the tensile strength (TS) of alkali treated ACNF reinforced polyester composite. RSM and multi gene genetic programming (MGGP) were comparatively employed to model the alkali treatment. The best model was applied in Oracle Crystal Ball (OCB) to investigate the uncertainty of the treatment results and sensitivity of the treatment factors. Results showed that increased NaOH and ST increased the TS of the alkali treated ACNF reinforced polyester composite up to 28.3500 MPa before TS decreased. The coefficient of determination (R2) and root mean square error (RMSE) of RSM model were 0.8920 and 0.6528 while that of MGGP were 0.9144 and 0.5812. The optimum alkali treatment established by RSM was 6.23percent of NaOH at 41.99 h of ST to give a TS of 28.1800 MPa with a desirability of 0.9700. The TS of the validated optimum alkali treatment condition was 28.2200 MPa. The certainty of the experimental results was 71.2580percent. TS was 13.8000percent sensitive to NaOH and 86.2000percent sensitive to ST. This work is useful for effective polymer composite materials production to reduce the enormous material and energy losses that usually accompany the process %K genetic algorithms, genetic programming, Response surface methodology, Multigene genetic programming, Oracle crystal ball, Uncertainty and sensitivity analysis %9 journal article %R doi:10.1016/j.aiepr.2020.12.002 %U https://www.sciencedirect.com/science/article/pii/S2542504820300580 %U http://dx.doi.org/doi:10.1016/j.aiepr.2020.12.002 %P 29-40 %0 Journal Article %T Process integration for food colorant production from Hibiscus sabdariffa calyx: A case of multi-gene genetic programming (MGGP) model and techno-economics %A Adeyi, Oladayo %A Adeyi, Abiola J. %A Oke, Emmanuel O. %A Okolo, Bernard I. %A Olalere, Abayomi O. %A Otolorin, John A. %A Okhale, Samuel %A Taiwo, Abiola E. %A Oladunni, Sunday O. %A Akatobi, Kelechi N. %J Alexandria Engineering Journal %D 2022 %V 61 %N 7 %@ 1110-0168 %F ADEYI:2022:AEJ %X This work presents an integrated heat-assisted extraction process for the production of crude anthocyanins powder (CAnysP) from Hibiscus sabdariffa calyx using SuperPro Designer. The influence of process scale-up (0.04 -1000L) and variables (temperature, time and ethanol proportion in solvent) were investigated by adopting a circumscribed central composite design on techno-economic parameters such as annual production rate (APR) and unit production cost (UPC) CAnysP. The individual runs in the CCCD were taken as different process scenario and simulated independently. Multi-gene genetic programming (MGGP) was further used to develop robust predictive models. The robustness of the model and sensitivity analysis were ascertained using the Monte Carlo simulation. The process scenario at 30 min, 30 degreeC, 50percent and 1000 L possessed the highest CAnysP APR and lowest UPC. MGGP- models predicted R2 = 0.9984 for CAnysP APR and R2 = 0.9643 for UPC and certainty (99.98percent for CAnysP APR and 98.47percent for UPC) %K genetic algorithms, genetic programming, Heat assisted technology based process, Multi-gene genetic programming, Annual production rate, Unit production cost, Techno-economics, calyx %9 journal article %R doi:10.1016/j.aej.2021.10.049 %U https://www.sciencedirect.com/science/article/pii/S1110016821006931 %U http://dx.doi.org/doi:10.1016/j.aej.2021.10.049 %P 5235-5252 %0 Conference Proceedings %T Shear Force Analysis and Modeling for Discharge Estimation Using Numerical and GP for Compound Channels %A Adhikari, Alok %A Adhikari, Nibedita %A Patra, K. C. %S Soft Computing in Data Analytics %D 2019 %I Springer %F adhikari:2019:SCDA %K genetic algorithms, genetic programming %R doi:10.1007/978-981-13-0514-6_32 %U http://link.springer.com/chapter/10.1007/978-981-13-0514-6_32 %U http://dx.doi.org/doi:10.1007/978-981-13-0514-6_32 %0 Journal Article %T Genetic Programming: A Complementary Approach for Discharge Modelling in Smooth and Rough Compound Channels %A Adhikari, Alok %A Adhikari, N. %A Patra, K. C. %J Journal of The Institution of Engineers (India): Series A %D 2019 %8 sep %V 100 %N 3 %@ 2250-2149 %F adhikari:JIEIa %X Use of genetic programming (GP) in the field of river engineering is rare. During flood when the water overflows beyond its main course known as floodplain encounters various obstacles through rough materials and vegetation. Again the flow behaviour becomes more complex in a compound channel section due to shear at different regions. Discharge results from the experimental channels for varying roughness surfaces, along with data from a compound river section, are used in the GP. Model equations are derived for prediction of discharge in the compound channel using five hydraulic parameters. Derived models are tested and compared with other soft computing techniques. Few performance parameters are used to evaluate all the approaches taken for analysis. From the sensitivity analysis, the effects of parameters responsible for the flow behaviour are inferred. GP is found to give the most potential results with the highest level of accuracy. This work aims to benefit the researchers studying machine learning approaches for application in stream flow analysis. %K genetic algorithms, genetic programming, FIS, ANFIS, GP %9 journal article %R doi:10.1007/s40030-019-00367-x %U http://link.springer.com/article/10.1007/s40030-019-00367-x %U http://dx.doi.org/doi:10.1007/s40030-019-00367-x %P 395-405 %0 Journal Article %T Genetic programming-based ordinary Kriging for spatial interpolation of rainfall %A Adhikary, Sajal Kumar %A Muttil, Nitin %A Yilmaz, Abdullah %J Journal of Hydrologic Engineering %D 2016 %8 feb %V 21 %N 2 %I American Society of Civil Engineers %F vu29881 %X Rainfall data provide an essential input for most hydrologic analyses and designs for effective management of water resource systems. However, in practice, missing values often occur in rainfall data that can ultimately influence the results of hydrologic analysis and design. Conventionally, stochastic interpolation methods such as Kriging are the most frequently used approach to estimate the missing rainfall values where the variogram model that represents spatial correlations among data points plays a vital role and significantly impacts the performance of the methods. In the past, the standard variogram models in ordinary kriging were replaced with the universal function approximator-based variogram models, such as artificial neural networks (ANN). In the current study, applicability of genetic programming (GP) to derive the variogram model and use of this GP-derived variogram model within ordinary kriging for spatial interpolation was investigated. Developed genetic programming-based ordinary kriging (GPOK) was then applied for estimating the missing rainfall data at a rain gauge station using the historical rainfall data from 19 rain gauge stations in the Middle Yarra River catchment of Victoria, Australia. The results indicated that the proposed GPOK method outperformed the traditional ordinary kriging as well as the ANN-based ordinary kriging method for spatial interpolation of rainfall. Moreover, the GP-derived variogram model is shown to have advantages over the standard and ANN-derived variogram models. Therefore, the GP-derived variogram model seems to be a potential alternative to variogram models applied in the past and the proposed GPOK method is recommended as a viable option for spatial interpolation. %K genetic algorithms, genetic programming, rainfall data, management of water resource systems, missing values, programming %9 journal article %R doi:10.1061/(ASCE)HE.1943-5584.0001300 %U https://vuir.vu.edu.au/29881/ %U http://dx.doi.org/doi:10.1061/(ASCE)HE.1943-5584.0001300 %0 Thesis %T Optimal Design of a Rain Gauge Network to Improve Streamflow Forecasting %A Adhikary, Sajal Kumar %D 2017 %8 20 mar %C Melbourne, Australia %C College of Engineering and Science, Victoria University %F vu35054 %O This thesis includes 1 published article for which access is restricted due to copyright (Chapters 3, 4 (first paper). Details of access to these papers have been inserted in the thesis, replacing the articles themselves. %X Enhanced streamflow forecasting has always been an important task for researchers and water resources managers. However, streamflow forecasting is often challenging owing to the complexity of hydrologic systems. The accuracy of streamflow forecasting mainly depends on the input data, especially rainfall as it constitutes the key input in transforming rainfall into runoff. This emphasizes the need for incorporating accurate rainfall input in streamflow forecasting models in order to achieve enhanced streamflow forecasting. Based on past research, it is well-known that an optimal rain gauge network is necessary to provide high quality rainfall estimates. Therefore, this study focused on the optimal design of a rain gauge network and integration of the optimal network-based rainfall input in artificial neural network (ANN) models to enhance the accuracy of streamflow forecasting. The Middle Yarra River catchment in Victoria, Australia was selected as the case study catchment, since the management of water resources in the catchment is of great importance to the majority of Victorians. The study had three components. First, an evaluation of existing Kriging methods and universal function approximation techniques such as genetic programming (GP) and ANN were performed in terms of their potentials and suitability for the enhanced spatial estimation of rainfall. The evaluation confirmed that the fusion of GP and ordinary kriging is highly effective for the improved estimation of rainfall and the ordinary cokriging using elevation can enhance the spatial estimation of rainfall. Second, the design of an optimal rain gauge network was undertaken for the case study catchment using the kriging-based geostatistical approach based on the variance reduction framework. It is likely that an existing rain gauge network may consist of redundant stations, which have no contribution to the network performance for providing quality rainfall estimates. Therefore, the optimal network was achieved through optimal placement of additional stations (network augmentation) as well as eliminating or optimally relocating of redundant stations (network rationalization). In order to take the rainfall variability caused by climatic factors like El Nino Southern Oscillation into account, the network was designed using rainfall records for both El Nino and La Nina periods. The rain gauge network that gives the improved estimates of areal average and point rainfalls for both the El Nino and La Nina periods was selected as the optimal network. It was found that the optimal network outperformed the existing one in estimating the spatiotemporal estimates of areal average and point rainfalls. Additionally, optimal positioning of redundant stations was found to be highly effective to achieve the optimal rain gauge network. Third, an ANN-based enhanced streamflow forecasting approach was demonstrated, which incorporated the optimal rain gauge network-based input instead of using input from an existing non-optimal network to achieve the enhanced streamflow forecasting. The approach was found to be highly effective in improving the accuracy of stream-flow forecasting, particularly when the current operational rain gauge network is not an optimal one. For example, it was found that use of the optimal rain gauge network-based input results in the improvement of streamflow forecasting accuracy by 7.1percent in terms of normalised root mean square error (NRMSE) compared to the current rain gauge network based-input. Further improvement in streamflow forecasting was achieved through augmentation of the optimal network by incorporating additional fictitious rain gauge stations. The fictitious stations were added in sub-catchments that were delineated based on the digital elevation model. It was evident from the results that 18.3percent improvement in streamflow forecasting accuracy was achieved in terms of NRMSE using the augmented optimal rain gauge network-based input compared to the current rain gauge network-based input. The ANN-based input selection technique that was employed in this study for streamflow forecasting offers a viable technique for significant input variables selection as this technique is capable of learning problems involving very non-linear and complex data. %K genetic algorithms, genetic programming, rivers, water basins, streams, stream-flow simulation, modeling, water supply, spatial interpolation, genetic programming-based ordinary kriging, thesis by publication %9 Ph.D. thesis %U https://vuir.vu.edu.au/35054/ %0 Journal Article %T A Rigorous Wavelet-Packet Transform to Retrieve Snow Depth from SSMIS Data and Evaluation of its Reliability by Uncertainty Parameters %A Adib, Arash %A Zaerpour, Arash %A Kisi, Ozgur %A Lotfirad, Morteza %J Water Resources Management %D 2021 %8 jul %V 35 %I springer %F Adib:2021:WRM %X This study demonstrates the application of wavelet transform comprising discrete wavelet transform, maximum overlap discrete wavelet transform (MODWT), and multiresolution-based MODWT (MODWT-MRA), as well as wavelet packet transform (WP), coupled with artificial intelligence (AI)-based models including multi-layer perceptron, radial basis function, adaptive neuro-fuzzy inference system (ANFIS), and gene expression programming to retrieve snow depth (SD) from special sensor microwave imager sounder obtained from the national snow and ice data center. Different mother wavelets were applied to the passive microwave (PM) frequencies; afterward, the dominant resultant decomposed subseries comprising low frequencies (approximations) and high frequencies (details) were detected and inserted into the AI-based models. The results indicated that the WP coupled with ANFIS (WP-ANFIS) outperformed the other studied models with the determination coefficient of 0.988, root mean square error of 3.458 cm, mean absolute error of 2.682 cm, and Nash–Sutcliffe efficiency of 0.987 during testing period. The final verification also confirmed that the WP is a promising pre-processing technique to improve the accuracy of the AI-based models in SD evaluation from PM data. %K genetic algorithms, genetic programming, gene expression programming, passive microwave, special sensor microwave imager sounder, snow depth retrieval, discrete wavelet transform, wavelet-packet transform %9 journal article %R doi:10.1007/s11269-021-02863-x %U http://link.springer.com/10.1007/s11269-021-02863-x %U http://dx.doi.org/doi:10.1007/s11269-021-02863-x %P 2723-2740 %0 Conference Proceedings %T LooperGP: A Loopable Sequence Model for Live Coding Performance using GuitarPro Tablature %A Adkins, Sara %A Sarmento, Pedro %A Barthet, Mathieu %Y Johnson, Colin %Y Rodriguez-Fernandez, Nereida %Y Rebelo, Sergio M. %S 12th International Conference on Artificial Intelligence in Music, Sound, Art and Design, EvoMusArt 2023 %S LNCS %D 2023 %8 apr 12 14 %V 13988 %I Springer Verlag %C Brno, Czech Republic %F Adkins:2023:evomusart %X Despite their impressive offline results, deep learning models for symbolic music generation are not widely used in live performances due to a deficit of musically meaningful control parameters and a lack of structured musical form in their outputs. To address these issues we introduce LooperGP, a method for steering a Transformer-XL model towards generating loopable musical phrases of a specified number of bars and time signature, enabling a tool for live coding performances. We show that by training LooperGP on a dataset of 93681 musical loops extracted from the DadaGP dataset [Data GuitarPro], we are able to steer its generative output towards generating three times as many loopable phrases as our baseline. In a subjective listening test conducted by 31 participants, LooperGP loops achieved positive median ratings in originality, musical coherence and loop smoothness, demonstrating its potential as a performance tool. %K genetic algorithms, genetic programming, Controllable Music Generation, Sequence Models, Live Coding, Transformers, AI Music, Loops, Guitar Tabs %R doi:10.1007/978-3-031-29956-8_1 %U http://dx.doi.org/doi:10.1007/978-3-031-29956-8_1 %P 3-19 %0 Generic %T Improving Readability of Scratch Programs with Search-based Refactoring %A Adler, Felix %A Fraser, Gordon %A Gruendinger, Eva %A Koerber, Nina %A Labrenz, Simon %A Lerchenberger, Jonas %A Lukasczyk, Stephan %A Schweikl, Sebastian %D 2021 %8 16 aug %I arXiv %F adler2021improving %X Block-based programming languages like Scratch have become increasingly popular as introductory languages for novices. These languages are intended to be used with a tinkering approach which allows learners and teachers to quickly assemble working programs and games, but this often leads to low code quality. Such code can be hard to comprehend, changing it is error-prone, and learners may struggle and lose interest. The general solution to improve code quality is to refactor the code. However, Scratch lacks many of the common abstraction mechanisms used when refactoring programs written in higher programming languages. In order to improve Scratch code, we therefore propose a set of atomic code transformations to optimise readability by (1) rewriting control structures and (2) simplifying scripts using the inherently concurrent nature of Scratch programs. By automating these transformations it is possible to explore the space of possible variations of Scratch programs. In this paper, we describe a multi-objective search-based approach that determines sequences of code transformations which improve the readability of a given Scratch program and therefore form refactorings. Evaluation on a random sample of 1000 Scratch programs demonstrates that the generated refactorings reduce complexity and entropy in 70.4percent of the cases, and 354 projects are improved in at least one metric without making any other metric worse. The refactored programs can help both novices and their teachers to improve their code. %K genetic algorithms, genetic programming, genetic improvement, SBSE %U https://arxiv.org/abs/2108.07114 %0 Conference Proceedings %T Improving Readability of Scratch Programs with Search-based Refactoring %A Adler, Felix %A Fraser, Gordon %A Gruendinger, Eva %A Koerber, Nina %A Labrenz, Simon %A Lerchenberger, Jonas %A Lukasczyk, Stephan %A Schweikl, Sebastian %S 21st IEEE International Working Conference on Source Code Analysis and Manipulation, SCAM 2021 %D 2021 %8 sep 27 28 %C Luxembourg %F DBLP:conf/scam/AdlerFGKLLLS21 %O 16000 GP entry %X Block-based programming languages like SCRATCH have become increasingly popular as introductory languages for novices. These languages are intended to be used with a tinkering approach which allows learners and teachers to quickly assemble working programs and games, but this often leads to low code quality. Such code can be hard to comprehend, changing it is error-prone, and learners may struggle and lose interest. The general solution to improve code quality is to refactor the code. However, SCRATCH lacks many of the common abstraction mechanisms used when refactoring programs written in higher programming languages. In order to improve SCRATCH code, we therefore propose a set of atomic code transformations to optimise readability by (1) rewriting control structures and (2) simplifying scripts using the inherently concurrent nature of SCRATCH programs. By automating these transformations it is possible to explore the space of possible variations of SCRATCH programs. In this paper, we describe a multi-objective search-based approach that determines sequences of code transformations which improve the readability of a given SCRATCH program and therefore form refactorings. Evaluation on a random sample of 1000 SCRATCH programs demonstrates that the generated refactorings reduce complexity and entropy in 70.4% of the cases, and 354 projects are improved in at least one metric without making any other metric worse. The refactored programs can help both novices and their teachers to improve their code. %K genetic algorithms, genetic programming, genetic improvement, grammatical evolution, SBSE, NSGA-II, LitterBox, JSON, refactoring, Java %R doi:10.1109/SCAM52516.2021.00023 %U https://arxiv.org/abs/2108.07114 %U http://dx.doi.org/doi:10.1109/SCAM52516.2021.00023 %P 120-130 %0 Conference Proceedings %T A cellular-programming approach to pattern classification %A Adorni, Giovanni %A Bergenti, Federico %A Cagnoni, Stefano %Y Banzhaf, Wolfgang %Y Poli, Riccardo %Y Schoenauer, Marc %Y Fogarty, Terence C. %S Proceedings of the First European Workshop on Genetic Programming %S LNCS %D 1998 %8 14 15 apr %V 1391 %I Springer-Verlag %C Paris %@ 3-540-64360-5 %F adorni:1998:cpapc %X In this paper we discuss the capability of the cellular programming approach to produce non-uniform cellular automata performing two-dimensional pattern classification. More precisely, after an introduction to the evolutionary cellular automata model, we describe a general approach suitable for designing cellular classifiers. The approach is based on a set of non-uniform cellular automata performing specific classification tasks, which have been designed by means of a cellular evolutionary algorithm. The proposed approach is discussed together with some preliminary results obtained on a benchmark data set consisting of car-plate digits. %K genetic algorithms, genetic programming %R doi:10.1007/BFb0055934 %U http://dx.doi.org/doi:10.1007/BFb0055934 %P 142-150 %0 Conference Proceedings %T Genetic Programming of a Goal-Keeper Control Strategy for the RoboCup Middle Size Competition %A Adorni, Giovanni %A Cagnoni, Stefano %A Mordonini, Monica %Y Poli, Riccardo %Y Nordin, Peter %Y Langdon, William B. %Y Fogarty, Terence C. %S Genetic Programming, Proceedings of EuroGP’99 %S LNCS %D 1999 %8 26 27 may %V 1598 %I Springer-Verlag %C Goteborg, Sweden %@ 3-540-65899-8 %F adorni:1999:GPgkcsrcmsc %X In this paper we describe a genetic programming approach to the design of a motion-control strategy for a goalkeeper robot created to compete in the RoboCup99, the robot soccer world championships which have been held yearly since 1997, as part of the Italian middle size robot team (ART, Azzurra Robot Team). The evolved program sends a motion command to the robot, based on the analysis of information received from a human-coded vision sub-system. The preliminary results obtained on a simulator are encouraging. They suggest that even using very simple fitness functions and training sets including only a small sub-set of the situations that the goalkeeper is required to tackle, it is possible to evolve a complex behaviour that permits the goalkeeper to perform well also in more challenging real-world conditions. %K genetic algorithms, genetic programming %R doi:10.1007/3-540-48885-5_9 %U http://dx.doi.org/doi:10.1007/3-540-48885-5_9 %P 109-119 %0 Conference Proceedings %T Efficient low-level vision program design using Sub-machine-code Genetic Programming %A Adorni, Giovanni %A Cagnoni, Stefano %A Mordonini, Monica %Y Gori, Marco %S AIIA 2002, Workshop sulla Percezione e Visione nelle Macchine %D 2002 %8 October 13 sep %C Siena, Italy %G en %F oai:CiteSeerPSU:539182 %X Sub-machine-code Genetic Programming (SmcGP) is a variant of GP aimed at exploiting the intrinsic parallelism of sequential CPUs. The paper describes an approach to low-level vision algorithm design for real-time applications by means of Sub-machine-code Genetic Programming(SmcGP), a variant of GP aimed at exploiting the intrinsic parallelism of sequential CPUs. The SmcGPbased design of two processing modules of a license-plate recognition system is taken into consideration as a case study to show the potential of the approach. The paper reports results obtained in recognizing the very low-resolution binary patterns that have to be classified in such an application along with preliminary results obtained using SmcGP to design a license-plate extraction algorithm. %K genetic algorithms, genetic programming %U http://www-dii.ing.unisi.it/aiia2002/paper/PERCEVISIO/adorni-aiia02.pdf %0 Conference Proceedings %T Design of Explicitly or Implicitly Parallel Low-resolution Character Recognition Algorithms by Means of Genetic Programming %A Adorni, Giovanni %A Cagnoni, Stefano %Y Roy, Rajkumar %Y Köppen, Mario %Y Ovaska, Seppo %Y Furuhashi, Takeshi %Y Hoffmann, Frank %S Soft Computing and Industry Recent Applications %D 2001 %8 October %I Springer-Verlag %@ 1-85233-539-4 %F adorni:2001:wsc6 %O Published 2002 %K genetic algorithms, genetic programming %U https://link.springer.com/book/10.1007/978-1-4471-0123-9 %P 387-398 %0 Generic %T Automated conjecturing of Frobenius numbers via grammatical evolution %A Adzaga, Nikola %D 2014 %8 feb 17 %F oai:arXiv.org:1410.0532 %O Comment: 8 pages, 2 tables; added a clear introduction, otherwise reduced text significantly %X Conjecturing formulae and other symbolic relations occurs frequently in number theory and combinatorics. If we could automate conjecturing, we could benefit not only from speeding up, but also from finding conjectures previously out of our grasp. Grammatical evolution, a genetic programming technique, can be used for automated conjecturing in mathematics. Concretely, this work describes how one can interpret the Frobenius problem as a symbolic regression problem, and then apply grammatical evolution to it. In this manner, a few formulas for Frobenius numbers of specific quadruples were found automatically. The sketch of the proof for one conjectured formula, using lattice point enumeration method, is provided as well. Same method can easily be used on other problems to speed up and enhance the research process. %K genetic algorithms, genetic programming, grammatical evolution, mathematics, number theory, mathematics, combinatorics %U http://arxiv.org/abs/1410.0532 %0 Journal Article %T Automated Conjecturing of Frobenius Numbers via Grammatical Evolution %A Adzaga, Nikola %J Experimental Mathematics %D 2017 %V 26 %N 2 %I Taylor & Francis %@ 1058-6458 %F Adzaga:2017:EM %X Conjecturing formulas and other symbolic relations occurs frequently in number theory and combinatorics. If we could automate conjecturing, we could benefit not only from faster conjecturing but also from finding conjectures previously out of our grasp. Grammatical evolution (GE), a genetic programming technique, can be used for automated conjecturing in mathematics. Concretely, this work describes how one can interpret the Frobenius problem as a symbolic regression problem, and then apply GE to it. In this manner, a few formulas for Frobenius numbers of specific quadruples were found automatically. The sketch of the proof of one conjectured formula, using lattice point enumeration method, is provided as well. The same method can easily be used on other problems to speed up and enhance the research process. %K genetic algorithms, genetic programming, grammatical evolution, automated conjecturing, Frobenius problem %9 journal article %R doi:10.1080/10586458.2016.1175393 %U http://dx.doi.org/doi:10.1080/10586458.2016.1175393 %P 247-252 %0 Conference Proceedings %T Lexicase selection in learning classifier systems %A Aenugu, Sneha %A Spector, Lee %Y Lopez-Ibanez, Manuel %Y Stuetzle, Thomas %Y Auger, Anne %Y Posik, Petr %Y Peprez Caceres, Leslie %Y Sutton, Andrew M. %Y Veerapen, Nadarajen %Y Solnon, Christine %Y Engelbrecht, Andries %Y Doncieux, Stephane %Y Risi, Sebastian %Y Machado, Penousal %Y Volz, Vanessa %Y Blum, Christian %Y Chicano, Francisco %Y Xue, Bing %Y Mouret, Jean-Baptiste %Y Liefooghe, Arnaud %Y Fieldsend, Jonathan %Y Lozano, Jose Antonio %Y Arnold, Dirk %Y Ochoa, Gabriela %Y Yu, Tian-Li %Y Hoos, Holger %Y Jin, Yaochu %Y Hu, Ting %Y Nicolau, Miguel %Y Purshouse, Robin %Y Baeck, Thomas %Y Petke, Justyna %Y Antoniol, Giuliano %Y Lengler, Johannes %Y Lehre, Per Kristian %S GECCO ’19: Proceedings of the Genetic and Evolutionary Computation Conference %D 2019 %8 13 17 jul %I ACM %C Prague, Czech Republic %F Aenugu:2019:GECCO %X The lexicase parent selection method selects parents by considering performance on individual data points in random order instead of using a fitness function based on an aggregated data accuracy. While the method has demonstrated promise in genetic programming and more recently in genetic algorithms, its applications in other forms of evolutionary machine learning have not been explored. In this paper, we investigate the use of lexicase parent selection in Learning Classifier Systems (LCS) and study its effect on classification problems in a supervised setting. We further introduce a new variant of lexicase selection, called batch-lexicase selection, which allows for the tuning of selection pressure. We compare the two lexicase selection methods with tournament and fitness proportionate selection methods on binary classification problems. We show that batch-lexicase selection results in the creation of more generic rules which is favourable for generalization on future data. We further show that batch-lexicase selection results in better generalization in situations of partial or missing data. %K genetic algorithms, LCS, Learning Classifier Systems, Parent Selection, Lexicase Selection %R doi:10.1145/3321707.3321828 %U http://dx.doi.org/doi:10.1145/3321707.3321828 %P 356-364 %0 Conference Proceedings %T Offspring Selection: A New Self-Adaptive Selection Scheme for Genetic Algorithms %A Affenzeller, M. %A Wagner, S. %Y Ribeiro, Bernardete %Y Albrecht, Rudolf F. %Y Dobnikar, Andrej %Y Pearson, David W. %Y Steele, Nigel C. %S Proceedings of the seventh International Conference Adaptive and Natural Computing Algorithms %D 2005 %8 21 23 mar %I Springer %C Coimbra, Portugal %F Affenzeller:2005:ICANNGA %X In terms of goal orientedness, selection is the driving force of Genetic Algorithms (GAs). In contrast to crossover and mutation, selection is completely generic, i.e. independent of the actually employed problem and its representation. GA-selection is usually implemented as selection for reproduction (parent selection). In this paper we propose a second selection step after reproduction which is also absolutely problem independent. This self-adaptive selection mechanism, which will be referred to as offspring selection, is closely related to the general selection model of population genetics. As the problem- and representation-specific implementation of reproduction in GAs (crossover) is often critical in terms of preservation of essential genetic information, offspring selection has proven to be very suited for improving the global solution quality and robustness concerning parameter settings and operators of GAs in various fields of applications. The experimental part of the paper discusses the potential of the new selection model exemplarily on the basis of standardized real-valued test functions in high dimensions %K genetic algorithms, genetic programming, OS-GP %R doi:10.1007/3-211-27389-1_52 %U https://link.springer.com/chapter/10.1007/3-211-27389-1_52 %U http://dx.doi.org/doi:10.1007/3-211-27389-1_52 %P 218-221 %0 Book %T Genetic Algorithms and Genetic Programming: Modern Concepts and Practical Applications %A Affenzeller, Michael %A Winkler, Stephan %A Wagner, Stefan %A Beham, Andreas %S Numerical Insights %D 2009 %I CRC Press %C Singapore %@ 1-58488-629-3 %F Affenzeller:GAGP %X Genetic Algorithms and Genetic Programming: Modern Concepts and Practical Applications discusses algorithmic developments in the context of genetic algorithms (GAs) and genetic programming (GP). It applies the algorithms to significant combinatorial optimisation problems and describes structure identification using HeuristicLab as a platform for algorithm development. The book focuses on both theoretical and empirical aspects. The theoretical sections explore the important and characteristic properties of the basic GA as well as main characteristics of the selected algorithmic extensions developed by the authors. In the empirical parts of the text, the authors apply GAs to two combinatorial optimisation problems: the traveling salesman and capacitated vehicle routing problems. To highlight the properties of the algorithmic measures in the field of GP, they analyze GP-based nonlinear structure identification applied to time series and classification problems. Written by core members of the HeuristicLab team, this book provides a better understanding of the basic workflow of GAs and GP, encouraging readers to establish new bionic, problem-independent theoretical concepts. By comparing the results of standard GA and GP implementation with several algorithmic extensions, it also shows how to substantially increase achievable solution quality. %K genetic algorithms, genetic programming %U http://gagp2009.heuristiclab.com/ %0 Journal Article %T Effective allele preservation by offspring selection: an empirical study for the TSP %A Affenzeller, Michael %A Wagner, Stefan %A Winkler, Stephan M. %J International Journal of Simulation and Process Modelling %D 2010 %8 apr 11 %V 6 %N 1 %I Inderscience Publishers %@ 1740-2131 %G eng %F Affenzeller:2010:IJSPM %X The basic selection ideas of the different representatives of evolutionary algorithms are sometimes quite diverse. The selection concept of Genetic Algorithms (GAs) and Genetic Programming (GP) is basically realised by the selection of above-average parents for reproduction, whereas Evolution Strategies (ES) use the fitness of newly evolved offspring as the basis for selection (survival of the fittest due to birth surplus). This contribution considers aspects of population genetics and ES in order to propose an enhanced and generic selection model for GAs which is able to preserve the alleles which are part of a high quality solution. Some selected aspects of these enhanced techniques are discussed exemplary on the basis of the Travelling Salesman Benchmark (TSP) problem instances. %K genetic algorithms, genetic programming, soft computing, evolutionary computation, GAs selection, self adaptation, population genetics, evolution strategies, modelling, allele preservation, offspring selection, travelling salesman problem %9 journal article %R doi:10.1504/IJSPM.2010.032655 %U https://pure.fh-ooe.at/en/publications/effective-allele-preservation-by-offspring-selection-an-empirical-2 %U http://dx.doi.org/doi:10.1504/IJSPM.2010.032655 %P 29-39 %0 Conference Proceedings %T New Genetic Programming Hypothesis Search Strategies for Improving the Interpretability in Medical Data Mining Applications %A Affenzeller, M. %A Fischer, C. %A Kronberger, G. K. %A Winkler, S. M. %A Wagner, S. %S Proccedings of 23rd IEEE European Modeling & Simulation Symposium EMSS 2011 %D 2011 %8 sep %C Roma, Italy %F 2453 %K genetic algorithms, genetic programming %U http://research.fh-ooe.at/files/publications/2453_EMSS_2011_Affenzeller.pdf %0 Conference Proceedings %T Enhanced Confidence Interpretations of GP Based Ensemble Modeling Results %A Affenzeller, Michael %A Winkler, Stephan M. %A Forstenlechner, Stefan %A Kronberger, Gabriel %A Kommenda, Michael %A Wagner, Stefan %A Stekel, Herbert %Y Jimenez, Emilio %Y Sokolov, Boris %S The 24th European Modeling and Simulation Symposium, EMSS 2012 %D 2012 %8 sep 19 21 %C Vienna, Austria %F Affenzeller:2012:EMSS %X In this paper we describe the integration of ensemble modelling into genetic programming based classification and discuss concepts how to use genetic programming specific features for achieving new confidence indicators that estimate the trustworthiness of predictions. These new concepts are tested on a real world dataset from the field of medical diagnosis for cancer prediction where the trustworthiness of modeling results is of highest importance %K genetic algorithms, genetic programming, data mining, ensemble modelling, medical data analysis %U http://research.fh-ooe.at/en/publication/2935 %P 340-345 %0 Conference Proceedings %T Improving the Accuracy of Cancer Prediction by Ensemble Confidence Evaluation %A Affenzeller, Michael %A Winkler, Stephan M. %A Stekel, Herbert %A Forstenlechner, Stefan %A Wagner, Stefan %Y Moreno-Diaz, Roberto %Y Pichler, Franz %Y Quesada-Arencibia, Alexis %S Computer Aided Systems Theory - EUROCAST 2013 %S Lecture Notes in Computer Science %D 2013 %8 feb 10 15 %V 8111 %I Springer %C Las Palmas de Gran Canaria, Spain %G English %F Affenzeller:2013:EUROCAST %O Revised Selected Papers, Part I %X This paper discusses a novel approach for the prediction of breast cancer, melanoma and cancer in the respiratory system using ensemble modelling techniques. For each type of cancer, a set of unequally complex predictors are learnt by symbolic classification based on genetic programming. In addition to standard ensemble modeling, where the prediction is based on a majority voting of the prediction models, two confidence parameters are used which aim to quantify the trustworthiness of each single prediction based on the clearness of the majority voting. Based on the calculated confidence of each ensemble prediction, predictions might be considered uncertain. The experimental part of this paper discusses the increase of accuracy that can be obtained for those samples which are considered trustable depending on the ratio of predictions that are considered trustable. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-53856-8_40 %U http://dx.doi.org/10.1007/978-3-642-53856-8_40 %U http://dx.doi.org/doi:10.1007/978-3-642-53856-8_40 %P 316-323 %0 Book Section %T Gaining Deeper Insights in Symbolic Regression %A Affenzeller, Michael %A Winkler, Stephan M. %A Kronberger, Gabriel %A Kommenda, Michael %A Burlacu, Bogdan %A Wagner, Stefan %E Riolo, Rick %E Moore, Jason H. %E Kotanchek, Mark %B Genetic Programming Theory and Practice XI %S Genetic and Evolutionary Computation %D 2013 %8 September 11 may %I Springer %C Ann Arbor, USA %F Affenzeller:2013:GPTP %X A distinguishing feature of symbolic regression using genetic programming is its ability to identify complex nonlinear white-box models. This is especially relevant in practice where models are extensively scrutinised in order to gain knowledge about underlying processes. This potential is often diluted by the ambiguity and complexity of the models produced by genetic programming. In this contribution we discuss several analysis methods with the common goal to enable better insights in the symbolic regression process and to produce models that are more understandable and show better generalisation. In order to gain more information about the process we monitor and analyse the progresses of population diversity, building block information, and even more general genealogy information. Regarding the analysis of results, several aspects such as model simplification, relevance of variables, node impacts, and variable network analysis are presented and discussed. %K genetic algorithms, genetic programming, Symbolic regression, Algorithm analysis, Population diversity Building block analysis, Genealogy, Variable networks %R doi:10.1007/978-1-4939-0375-7_10 %U http://dx.doi.org/doi:10.1007/978-1-4939-0375-7_10 %P 175-190 %0 Conference Proceedings %T Offspring Selection Genetic Algorithm Revisited: Improvements in Efficiency by Early Stopping Criteria in the Evaluation of Unsuccessful Individuals %A Affenzeller, Michael %A Burlacu, Bogdan %A Winkler, Stephan M. %A Kommenda, Michael %A Kronberger, Gabriel K. %A Wagner, Stefan %Y Moreno-Diaz, Roberto %Y Pichler, Franz %Y Quesada-Arencibia, Alexis %S 16th International Conference on Computer Aided Systems Theory, EUROCAST 2017 %S Lecture Notes in Computer Science %D 2017 %8 feb %V 10671 %I Springer %C Las Palmas de Gran Canaria, Spain %F 6339 %X This paper proposes some algorithmic extensions to the general concept of offspring selection which itself is an algorithmic extension of genetic algorithms and genetic programming. Offspring selection is characterized by the fact that many offspring solution candidates will not participate in the ongoing evolutionary process if they do not achieve the success criterion. The algorithmic enhancements proposed in this contribution aim to early estimate if a solution candidate will not be accepted based on partial solution evaluation. The qualitative characteristics of offspring selection are not affected by this means. The discussed variant of offspring selection is analysed for several symbolic regression problems with offspring selection genetic programming. The achievable gains in terms of efficiency are remarkable especially for large data-sets. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-319-74718-7_51 %U https://link.springer.com/chapter/10.1007/978-3-319-74718-7_51 %U http://dx.doi.org/doi:10.1007/978-3-319-74718-7_51 %P 424-431 %0 Conference Proceedings %T Dynamic Observation of Genotypic and Phenotypic Diversity for Different Symbolic Regression GP Variants %A Affenzeller, Michael %A Winkler, Stephan M. %A Burlacu, Bogdan %A Kronberger, Gabriel %A Kommenda, Michael %A Wagner, Stefan %S Proceedings of the Genetic and Evolutionary Computation Conference Companion %S GECCO ’17 %D 2017 %8 15 19 jul %I ACM %C Berlin, Germany %F Affenzeller:2017:GECCO %X Understanding the relationship between selection, genotype-phenotype map and loss of population diversity represents an important step towards more effective genetic programming (GP) algorithms. This paper describes an approach to capture dynamic changes in this relationship. We analyse the frequency distribution of points in the diversity plane defined by structural and semantic similarity measures. We test our methodology using standard GP (SGP) on a number of test problems, as well as Offspring Selection GP (OS-GP), an algorithmic flavour where selection is explicitly focused towards adaptive change. We end with a discussion about the implications of diversity maintenance for each of the tested algorithms. We conclude that diversity needs to be considered in the context of fitness improvement, and that more diversity is not necessarily beneficial in terms of solution quality. %K genetic algorithms, genetic programming, genetic and phenotypic diversity, offspring selection, population dynamics, symbolic regression %R doi:10.1145/3067695.3082530 %U http://doi.acm.org/10.1145/3067695.3082530 %U http://dx.doi.org/doi:10.1145/3067695.3082530 %P 1553-1558 %0 Conference Proceedings %T Symbolic Regression in Materials Science: Discovering Interatomic Potentials from Data %A Burlacu, Bogdan %A Kommenda, Michael %A Kronberger, Gabriel %A Winkler, Stephan M. %A Affenzeller, Michael %Y Trujillo, Leonardo %Y Winkler, Stephan M. %Y Silva, Sara %Y Banzhaf, Wolfgang %S Genetic Programming Theory and Practice XIX %S Genetic and Evolutionary Computation %D 2022 %8 jun 2 4 %I Springer %C Ann Arbor, USA %F Affenzeller:2022:GPTP %X Particle-based modeling of materials at atomic scale plays an important role in the development of new materials and the understanding of their properties. The accuracy of particle simulations is determined by interatomic potentials, which allow calculating the potential energy of an atomic system as a function of atomic coordinates and potentially other properties. First-principles-based ab initio potentials can reach arbitrary levels of accuracy, however, their applicability is limited by their high computational cost. Machine learning (ML) has recently emerged as an effective way to offset the high computational costs of ab initio atomic potentials by replacing expensive models with highly efficient surrogates trained on electronic structure data. Among a plethora of current methods, symbolic regression (SR) is gaining traction as a powerful “white-box” approach for discovering functional forms of interatomic potentials. This contribution discusses the role of symbolic regression in Materials Science (MS) and offers a comprehensive overview of current methodological challenges and state-of-the-art results. A genetic programming-based approach for modeling atomic potentials from raw data (consisting of snapshots of atomic positions and associated potential energy) is presented and empirically validated on ab initio electronic structure data. %K genetic algorithms, genetic programming %R doi:10.1007/978-981-19-8460-0_1 %U http://dx.doi.org/doi:10.1007/978-981-19-8460-0_1 %P 1-30 %0 Conference Proceedings %T GP in Prescriptive Analytics %A Affenzeller, Michael %Y Hu, Ting %Y Ofria, Charles %Y Trujillo, Leonardo %Y Winkler, Stephan %S Genetic Programming Theory and Practice XX %S Genetic and Evolutionary Computation %D 2023 %8 jun 1 3 %C Michigan State University, USA %F Affenzeller:2023:GPTP %K genetic algorithms, genetic programming %0 Journal Article %T The added utility of nonlinear methods compared to linear methods in rescaling soil moisture products %A Afshar, M. H. %A Yilmaz, M. T. %J Remote Sensing of Environment %D 2017 %V 196 %@ 0034-4257 %F Afshar:2017:RSE %X In this study, the added utility of nonlinear rescaling methods relative to linear methods in the framework of creating a homogenous soil moisture time series has been explored. The performances of 31 linear and nonlinear rescaling methods are evaluated by rescaling the Land Parameter Retrieval Model (LPRM) soil moisture datasets to station-based watershed average datasets obtained over four United States Department of Agriculture (USDA) Agricultural Research Service (ARS) watersheds. The linear methods include first-order linear regression, multiple linear regression, and multivariate adaptive regression splines (MARS), whereas the nonlinear methods include cumulative distribution function matching (CDF), artificial neural networks (ANN), support vector machines (SVM), Genetic Programming (GEN), and copula methods. MARS, GEN, SVM, ANN, and the copula methods are also implemented to use lagged observations to rescale the datasets. The results of a total of 31 different methods show that the nonlinear methods improve the correlation and error statistics of the rescaled product compared to the linear methods. In general, the method that yielded the best results using training data improved the validation correlations, on average, by 0.063, whereas ELMAN ANN and GEN, using lagged observations methods, yielded correlation improvements of 0.052 and 0.048, respectively. The lagged observations improved the correlations when they were incorporated into rescaling equations in linear and nonlinear fashions, with the nonlinear methods (particularly SVM and GEN but not ANN and copula) benefitting from these lagged observations more than the linear methods. The overall results show that a large majority of the similarities between the LPRM and watershed average datasets are due to linear relations; however, nonlinear relations clearly exist, and the use of nonlinear rescaling methods clearly improves the accuracy of the rescaled product. %K genetic algorithms, genetic programming, Soil moisture, Rescaling, Linear, Nonlinear, Remote sensing %9 journal article %R doi:10.1016/j.rse.2017.05.017 %U http://www.sciencedirect.com/science/article/pii/S003442571730216X %U http://dx.doi.org/doi:10.1016/j.rse.2017.05.017 %P 224-237 %0 Conference Proceedings %T A Turing Test for Genetic Improvement %A Afzal, Afsoon %A Lacomis, Jeremy %A Le Goues, Claire %A Timperley, Christopher Steven %Y Petke, Justyna %Y Stolee, Kathryn %Y Langdon, William B. %Y Weimer, Westley %S GI-2018, ICSE workshops proceedings %D 2018 %8 February %I ACM %C Gothenburg, Sweden %F Timperley:2018:GI %X Genetic improvement is a research field that aims to develop searchbased techniques for improving existing code. GI has been used to automatically repair bugs, reduce energy consumption, and to improve run-time performance. In this paper, we reflect on the often-overlooked relationship between GI and developers within the context of continually evolving software systems. We introduce a distinction between transparent and opaque patches based on intended lifespan and developer interaction. Finally, we outline a Turing test for assessing the ability of a GI system to produce opaque patches that are acceptable to humans. This motivates research into the role GI systems will play in transparent development contexts. %K genetic algorithms, genetic programming, genetic improvement %R doi:10.1145/3194810.3194817 %U http://dx.doi.org/10.1145/3194810.3194817 %U http://dx.doi.org/doi:10.1145/3194810.3194817 %P 17-18 %0 Thesis %T Automated Testing of Robotic and Cyberphysical Systems %A Afzal, Afsoon %D 2021 %8 may 2021 %C Pittsburgh, PA 15213, USA %C Institute for Software Research, School of Computer Science, Carnegie Mellon University %F Afsoon_Afzal:thesis %X Robotics and cyberphysical systems are increasingly being deployed to settings where they are in frequent interaction with the public. Therefore, failures in these systems can be catastrophic by putting human lives in danger and causing extreme financial loss. Large-scale assessment of the quality of these systems before deployment can prevent these costly damages. Because of the complexity and other special features of these systems, testing,and more specifically automated testing, faces challenges. In this dissertation, I study the unique challenges of testing robotics and cyberphysical systems, and propose an end-to-end automated testing pipeline to provide tools and methods that can help roboticists in large-scale, automated testing of their systems. My key insight is that we can use (low-fidelity) simulation to automatically test robotic and cyber-physical systems, and identify many potentially catastrophic failures in advance at low cost. My core thesis is: Robotic and cyberphysical systems have unique features such as interacting with the physical world and integrating hardware and software components, which creates challenges for automated, large-scale testing approaches. An automated testing framework using software-in-the-loop (low-fidelity) simulation can facilitate automated testing for these systems. This framework can be offered using a clustering approach as an automated oracle, and an evolutionary-based automated test input generation with scenario coverage fitness functions. To support this thesis, I conduct a number of qualitative, quantitative, and mixed method studies that 1) identify main challenges of testing robotic and cyberphysical systems, 2) show that low-fidelity simulation can be an effective approach in detecting bugs and errors with low cost, and 3) identify challenges of using simulators in automated testing. Additionally, I propose automated techniques for creating oracles and generating test inputs to facilitate automated testing of robotic and cyberphysical systems. I present an approach to automatically generate oracles for cyberphysical systems using clustering, which can observe and identify common patterns of system behavior.These patterns can be used to distinguish erroneous behavior of the system and act as an oracle. I evaluate the quality of test inputs for robotic systems with respect to their reliability, and effectiveness in revealing faults in the system. I observe a high rate of non-determinism among test executions that complicates test input generation and evaluation, and show that coverage-based metrics are generally poor indicators of test input quality. Finally, I present an evolutionary-based automated test generation approach with a fitness function that is based on scenario coverage. The automated oracle and automated test input generation approaches contribute to a fully automated testing framework that can perform large-scale, automated testing on robotic and cyberphysical systems in simulation. %K SBSE, testing cyber-physical systems, robotics testing, automated quality assurance, simulation-based testing, challenges of testing, automated oracle inference, automated test generation %9 Ph.D. thesis %U https://afsafzal.github.io/materials/thesis.pdf %0 Journal Article %T SOSRepair: Expressive Semantic Search for Real-World Program Repair %A Afzal, Afsoon %A Motwani, Manish %A Stolee, Kathryn T. %A Brun, Yuriy %A Le Goues, Claire %J IEEE Transactions on Software Engineering %D 2021 %V 47 %N 10 %@ 0098-5589 %F Afzal:2021:TSE %X Automated program repair holds the potential to significantly reduce software maintenance effort and cost. However, recent studies have shown that it often produces low-quality patches that repair some but break other functionality. We hypothesize that producing patches by replacing likely faulty regions of code with semantically-similar code fragments, and doing so at a higher level of granularity than prior approaches can better capture abstraction and the intended specification, and can improve repair quality. We create SOSRepair, an automated program repair technique that uses semantic code search to replace candidate buggy code regions with behaviorally-similar (but not identical) code written by humans. SOSRepair is the first such technique to scale to real-world defects in real-world systems. On a subset of the ManyBugs benchmark of such defects, SOSRepair produces patches for 23 (35percent) of the 65 defects, including 3, 5, and 8 defects for which previous state-of-the-art techniques Angelix, Prophet, and GenProg do not, respectively. On these 23 defects, SOSRepair produces more patches (8, 35percent) that pass all independent tests than the prior techniques. We demonstrate a relationship between patch granularity and the ability to produce patches that pass all independent tests. We then show that fault localization precision is a key factor in SOSRepair’s success. Manually improving fault localisation allows SOSRepair to patch 24 (37percent) defects, of which 16 (67percent) pass all independent tests. We conclude that (1) higher-granularity, semantic-based patches can improve patch quality, (2) semantic search is promising for producing high-quality real-world defect repairs, (3) research in fault localization can significantly improve the quality of program repair techniques, and (4) semi-automated approaches in which developers suggest fix locations may produce high-quality patches. %K genetic algorithms, genetic programming, genetic improvement, APR %9 journal article %R doi:10.1109/TSE.2019.2944914 %U https://doi.org/10.1109/TSE.2019.2944914 %U http://dx.doi.org/doi:10.1109/TSE.2019.2944914 %P 2162-2181 %0 Conference Proceedings %T A Systematic Mapping Study on Non-Functional Search-based Software Testing %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %S Proceedings of the 20th International Conference on Software Engineering and Knowledge Engineering (SEKE ’08) %D 2008 %8 jul 1 3 %I Knowledge Systems Institute Graduate School %C San Francisco, CA, USA %@ 1-891706-22-5 %F AfzalTF08 %X Automated software test generation has been applied across the spectrum of test case design methods; this includes white-box (structural), black-box (functional), grey-box (combination of structural and functional) and non-functional testing. In this paper, we undertake a systematic mapping study to present a broad review of primary studies on the application of search-based optimization techniques to non-functional testing. The motivation is to identify the evidence available on the topic and to identify gaps in the application of search-based optimization techniques to different types of non-functional testing. The study is based on a comprehensive set of 35 papers obtained after using a multi-stage selection criteria and are published in workshops, conferences and journals in the time span 1996–2007. We conclude that the search-based software testing community needs to do more and broader studies on non-functional search-based software testing (NFSBST) and the results from our systematic map can help direct such efforts. %K genetic algorithms, genetic programming %U http://www.torkar.se/resources/A-systematic-mapping-study-on-non-functional-search-based-software-testing.pdf %P 488-493 %0 Conference Proceedings %T Suitability of Genetic Programming for Software Reliability Growth Modeling %A Afzal, Wasif %A Torkar, Richard %S The 2008 International Symposium on Computer Science and its Applications (CSA’08) %D 2008 %8 13 15 oct %I IEEE Computer Society %C Hobart, ACT %F Afzal08e %X Genetic programming (GP) has been found to be effective in finding a model that fits the given data points without making any assumptions about the model structure. This makes GP a reasonable choice for software reliability growth modeling. This paper discusses the suitability of using GP for software reliability growth modeling and highlights the mechanisms that enable GP to progressively search for fitter solutions. %K genetic algorithms, genetic programming, software reliability data points, software reliability growth modeling, SBSE %R doi:10.1109/CSA.2008.13 %U http://dx.doi.org/doi:10.1109/CSA.2008.13 %P 114-117 %0 Conference Proceedings %T A comparative evaluation of using genetic programming for predicting fault count data %A Afzal, Wasif %A Torkar, Richard %S Proceedings of the Third International Conference on Software Engineering Advances (ICSEA’08) %D 2008 %8 26 31 %C Sliema, Malta %F Afzal08d %X There have been a number of software reliability growth models (SRGMs) proposed in literature. Due to several reasons, such as violation of models’ assumptions and complexity of models, the practitioners face difficulties in knowing which models to apply in practice. This paper presents a comparative evaluation of traditional models and use of genetic programming (GP) for modeling software reliability growth based on weekly fault count data of three different industrial projects. The motivation of using a GP approach is its ability to evolve a model based entirely on prior data without the need of making underlying assumptions. The results show the strengths of using GP for predicting fault count data. %K genetic algorithms, genetic programming, prediction, software reliability growth modeling, SBSE %R doi:10.1109/ICSEA.2008.9 %U http://dx.doi.org/doi:10.1109/ICSEA.2008.9 %P 407-414 %0 Conference Proceedings %T Prediction of fault count data using genetic programming %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %S Proceedings of the 12th IEEE International Multitopic Conference (INMIC’08) %D 2008 %8 23 24 dec %I IEEE %C Karachi, Pakistan %F Afzal08b %X Software reliability growth modeling helps in deciding project release time and managing project resources. A large number of such models have been presented in the past. Due to the existence of many models, the models’ inherent complexity, and their accompanying assumptions; the selection of suitable models becomes a challenging task. This paper presents empirical results of using genetic programming (GP) for modeling software reliability growth based on weekly fault count data of three different industrial projects. The goodness of fit (adaptability) and predictive accuracy of the evolved model is measured using five different measures in an attempt to present a fair evaluation. The results show that the GP evolved model has statistically significant goodness of fit and predictive accuracy. %K genetic algorithms, genetic programming, SBSE, fault count data, prediction %R doi:10.1109/INMIC.2008.4777762 %U http://drfeldt.googlepages.com/afzal_submitted0805icsea_prediction_.pdf %U http://dx.doi.org/doi:10.1109/INMIC.2008.4777762 %P 349-356 %0 Conference Proceedings %T Search-Based Prediction of Fault Count Data %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %Y Di Penta, Massimiliano %Y Poulding, Simon %S Proceedings 1st International Symposium on Search Based Software Engineering SSBSE 2009 %D 2009 %8 13 15 may %I IEEE %C Windsor, UK %F Afzal:2009:SSBSE %X Symbolic regression, an application domain of genetic programming (GP), aims to find a function whose output has some desired property, like matching target values of a particular data set. While typical regression involves finding the coefficients of a pre-defined function, symbolic regression finds a general function, with coefficients, fitting the given set of data points. The concepts of symbolic regression using genetic programming can be used to evolve a model for fault count predictions. Such a model has the advantages that the evolution is not dependent on a particular structure of the model and is also independent of any assumptions, which are common in traditional time-domain parametric software reliability growth models. This research aims at applying experiments targeting fault predictions using genetic programming and comparing the results with traditional approaches to compare efficiency gains. %K genetic algorithms, genetic programming, SBSE, search-based prediction, software fault count data, software reliability growth model, symbolic regression, regression analysis, software fault tolerance %R doi:10.1109/SSBSE.2009.17 %U http://dx.doi.org/doi:10.1109/SSBSE.2009.17 %P 35-38 %0 Journal Article %T A systematic review of search-based testing for non-functional system properties %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %J Information and Software Technology %D 2009 %8 jun %V 51 %N 6 %@ 0950-5849 %F Afzal2009 %X Search-based software testing is the application of metaheuristic search techniques to generate software tests. The test adequacy criterion is transformed into a fitness function and a set of solutions in the search space are evaluated with respect to the fitness function using a metaheuristic search technique. The application of metaheuristic search techniques for testing is promising due to the fact that exhaustive testing is infeasible considering the size and complexity of software under test. Search-based software testing has been applied across the spectrum of test case design methods; this includes white-box (structural), black-box (functional) and grey-box (combination of structural and functional) testing. In addition, metaheuristic search techniques have also been applied to test non-functional properties. The overall objective of undertaking this systematic review is to examine existing work into non-functional search-based software testing (NFSBST). We are interested in types of non-functional testing targeted using metaheuristic search techniques, different fitness functions used in different types of search-based non-functional testing and challenges in the application of these techniques. The systematic review is based on a comprehensive set of 35 articles obtained after a multi-stage selection process and have been published in the time span 1996-2007. The results of the review show that metaheuristic search techniques have been applied for non-functional testing of execution time, quality of service, security, usability and safety. A variety of metaheuristic search techniques are found to be applicable for non-functional testing including simulated annealing, tabu search, genetic algorithms, ant colony methods, grammatical evolution, genetic programming (and its variants including linear genetic programming) and swarm intelligence methods. The review reports on different fitness functions used to guide the search for each of the categories of execution time, safety, usability, quality of service and security; along with a discussion of possible challenges in the application of metaheuristic search techniques. %K genetic algorithms, genetic programming, Systematic review, Non-functional system properties, Search-based software testing %9 journal article %R doi:10.1016/j.infsof.2008.12.005 %U http://drfeldt.googlepages.com/afzal_submitted0805ist_sysrev_nfr_sb.pdf %U http://dx.doi.org/doi:10.1016/j.infsof.2008.12.005 %P 957-976 %0 Thesis %T Search-Based Approaches to Software Fault Prediction and Software Testing %A Afzal, Wasif %D 2009 %C Sweden %C School of Engineering, Dept. of Systems and Software Engineering, Blekinge Institute of Technology %G eng %F Afzal:Licentiate %X Software verification and validation activities are essential for software quality but also constitute a large part of software development costs. Therefore efficient and cost-effective software verification and validation activities are both a priority and a necessity considering the pressure to decrease time-to-market and intense competition faced by many, if not all, companies today. It is then perhaps not unexpected that decisions related to software quality, when to stop testing, testing schedule and testing resource allocation needs to be as accurate as possible. This thesis investigates the application of search-based techniques within two activities of software verification and validation: Software fault prediction and software testing for non-functional system properties. Software fault prediction modeling can provide support for making important decisions as outlined above. In this thesis we empirically evaluate symbolic regression using genetic programming (a search-based technique) as a potential method for software fault predictions. Using data sets from both industrial and open-source software, the strengths and weaknesses of applying symbolic regression in genetic programming are evaluated against competitive techniques. In addition to software fault prediction this thesis also consolidates available research into predictive modeling of other attributes by applying symbolic regression in genetic programming, thus presenting a broader perspective. As an extension to the application of search-based techniques within software verification and validation this thesis further investigates the extent of application of search-based techniques for testing non-functional system properties. Based on the research findings in this thesis it can be concluded that applying symbolic regression in genetic programming may be a viable technique for software fault prediction. We additionally seek literature evidence where other search-based techniques are applied for testing of non-functional system properties, hence contributing towards the growing application of search-based techniques in diverse activities within software verification and validation. %K genetic algorithms, genetic programming, SBSE, Software Engineering, Computer Science, Artificial Intelligence %9 Licentiate Dissertation %9 Masters thesis %U http://www.bth.se/fou/forskinfo.nsf/all/f0738b5fc4ca0bbac12575980043def3/$file/Afzal_lic.pdf %0 Book Section %T Genetic Programming for Cross-Release Fault Count Predictions in Large and Complex Software Projects %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %A Gorschek, Tony %E Chis, Monica %B Evolutionary Computation and Optimization Algorithms in Software Engineering: Applications and Techniques %D 2010 %8 jun %I IGI Global %F Afzal:2010:ECoaSE %X Software fault prediction can play an important role in ensuring software quality through efficient resource allocation. This could, in turn, reduce the potentially high consequential costs due to faults. Predicting faults might be even more important with the emergence of short-timed and multiple software releases aimed at quick delivery of functionality. Previous research in software fault prediction has indicated that there is a need i) to improve the validity of results by having comparisons among number of data sets from a variety of software, ii) to use appropriate model evaluation measures and iii) to use statistical testing procedures. Moreover, cross-release prediction of faults has not yet achieved sufficient attention in the literature. In an attempt to address these concerns, this paper compares the quantitative and qualitative attributes of 7 traditional and machine-learning techniques for modelling the cross-release prediction of fault count data. The comparison is done using extensive data sets gathered from a total of 7 multi-release open-source and industrial software projects. These software projects together have several years of development and are from diverse application areas, ranging from a web browser to a robotic controller software. Our quantitative analysis suggests that genetic programming (GP) tends to have better consistency in terms of goodness of fit and accuracy across majority of data sets. It also has comparatively less model bias. Qualitatively, ease of configuration and complexity are less strong points for GP even though it shows generality and gives transparent models. Artificial neural networks did not perform as well as expected while linear regression gave average predictions in terms of goodness of fit and accuracy. Support vector machine regression and traditional software reliability growth models performed below average on most of the quantitative evaluation criteria while remained on average for most of the qualitative measures. %K genetic algorithms, genetic programming, SBSE %R doi:10.4018/978-1-61520-809-8.ch006 %U http://dx.doi.org/doi:10.4018/978-1-61520-809-8.ch006 %P 94-126 %0 Conference Proceedings %T Search-based Prediction of Fault-slip-through in Large Software Projects %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %A Wikstrand, Greger %S Second International Symposium on Search Based Software Engineering (SSBSE 2010) %D 2010 %8 July 9 sep %C Benevento, Italy %F Afzal:2010:SSBSE %X A large percentage of the cost of rework can be avoided by finding more faults earlier in a software testing process. Therefore, determination of which software testing phases to focus improvements work on, has considerable industrial interest. This paper evaluates the use of five different techniques, namely particle swarm optimization based artificial neural networks (PSO-ANN), artificial immune recognition systems (AIRS), gene expression programming (GEP), genetic programming (GP) and multiple regression (MR), for predicting the number of faults slipping through unit, function, integration and system testing phases. The objective is to quantify improvement potential in different testing phases by striving towards finding the right faults in the right phase. We have conducted an empirical study of two large projects from a telecommunication company developing mobile platforms and wireless semiconductors. The results are compared using simple residuals, goodness of fit and absolute relative error measures. They indicate that the four search-based techniques (PSO-ANN, AIRS, GEP, GP) perform better than multiple regression for predicting the fault-slip-through for each of the four testing phases. At the unit and function testing phases, AIRS and PSO-ANN performed better while GP performed better at integration and system testing phases. The study concludes that a variety of search-based techniques are applicable for predicting the improvement potential in different testing phases with GP showing more consistent performance across two of the four test phases. %K genetic algorithms, genetic programming, gene expression programming, sbse, AIRS, GEP, GP, MR, PSO-ANN, artificial immune recognition system, artificial neural network, fault-slip-through, multiple regression, particle swarm optimisation, search-based prediction, software project, software testing process, artificial immune systems, fault tolerant computing, neural nets, particle swarm optimisation, program testing, regression analysis %R doi:10.1109/SSBSE.2010.19 %U http://dx.doi.org/doi:10.1109/SSBSE.2010.19 %P 79-88 %0 Conference Proceedings %T Using Faults-Slip-Through Metric as a Predictor of Fault-Proneness %A Afzal, Wasif %S 17th Asia Pacific Software Engineering Conference (APSEC 2010) %D 2010 %8 nov 30 dec 3 %F Afzal:2010:APSEC %X Background: The majority of software faults are present in small number of modules, therefore accurate prediction of fault-prone modules helps improve software quality by focusing testing efforts on a subset of modules. Aims: This paper evaluates the use of the faults-slip-through (FST) metric as a potential predictor of fault-prone modules. Rather than predicting the fault-prone modules for the complete test phase, the prediction is done at the specific test levels of integration and system test. Method: We applied eight classification techniques, to the task of identifying fault prone modules, representing a variety of approaches, including a standard statistical technique for classification (logistic regression), tree-structured classifiers (C4.5 and random forests), a Bayesian technique (Naive Bayes), machine-learning techniques (support vector machines and back-propagation artificial neural networks) and search-based techniques (genetic programming and artificial immune recognition systems) on FST data collected from two large industrial projects from the telecommunication domain. Results: Using area under the receiver operating characteristic (ROC) curve and the location of (PF, PD) pairs in the ROC space, the faults slip-through metric showed impressive results with the majority of the techniques for predicting fault-prone modules at both integration and system test levels. There were, however, no statistically significant differences between the performance of different techniques based on AUC, even though certain techniques were more consistent in the classification performance at the two test levels. Conclusions: We can conclude that the faults-slip-through metric is a potentially strong predictor of fault-proneness at integration and system test levels. The faults-slip-through measurements interact in ways that is conveniently accounted for by majority of the data mining techniques. %K genetic algorithms, genetic programming, sbse, Bayesian technique, artificial immune recognition systems, back-propagation artificial neural networks, data mining, fault-proneness predictor, faults-slip-through metric, logistic regression, machine-learning techniques, receiver operating characteristic curve, search-based techniques, software faults, software quality, standard statistical technique, support vector machines, system test levels, tree-structured classifiers, backpropagation, data mining, neural nets, program testing, software quality, statistical analysis, support vector machines %R doi:10.1109/APSEC.2010.54 %U http://dx.doi.org/doi:10.1109/APSEC.2010.54 %P 414-422 %0 Journal Article %T On the application of genetic programming for software engineering predictive modeling: A systematic review %A Afzal, Wasif %A Torkar, Richard %J Expert Systems with Applications %D 2011 %V 38 %N 9 %@ 0957-4174 %F Afzal201111984 %X The objective of this paper is to investigate the evidence for symbolic regression using genetic programming (GP) being an effective method for prediction and estimation in software engineering, when compared with regression/machine learning models and other comparison groups (including comparisons with different improvements over the standard GP algorithm). We performed a systematic review of literature that compared genetic programming models with comparative techniques based on different independent project variables. A total of 23 primary studies were obtained after searching different information sources in the time span 1995-2008. The results of the review show that symbolic regression using genetic programming has been applied in three domains within software engineering predictive modeling: (i) Software quality classification (eight primary studies). (ii) Software cost/effort/size estimation (seven primary studies). (iii) Software fault prediction/software reliability growth modelling (eight primary studies). While there is evidence in support of using genetic programming for software quality classification, software fault prediction and software reliability growth modelling; the results are inconclusive for software cost/effort/size estimation. %K genetic algorithms, genetic programming, Systematic review, Symbolic regression, Modelling %9 journal article %R doi:10.1016/j.eswa.2011.03.041 %U http://www.sciencedirect.com/science/article/B6V03-52C8FT6-5/2/668361024e4b2bcf9a4a73195271591c %U http://dx.doi.org/doi:10.1016/j.eswa.2011.03.041 %P 11984-11997 %0 Thesis %T Search-Based Prediction of Software Quality: Evaluations And Comparisons %A Afzal, Wasif %D 2011 %8 May %C Sweden %C School of Computing, Blekinge Institute of Technology %F Afzal:thesis %X Software verification and validation (V&V) activities are critical for achieving software quality; however, these activities also constitute a large part of the costs when developing software. Therefore efficient and effective software V&V activities are both a priority and a necessity considering the pressure to decrease time-to-market and the intense competition faced by many, if not all, companies today. It is then perhaps not unexpected that decisions that affects software quality, e.g., how to allocate testing resources, develop testing schedules and to decide when to stop testing, needs to be as stable and accurate as possible. The objective of this thesis is to investigate how search-based techniques can support decision-making and help control variation in software V&V activities, thereby indirectly improving software quality. Several themes in providing this support are investigated: predicting reliability of future software versions based on fault history; fault prediction to improve test phase efficiency; assignment of resources to fixing faults; and distinguishing fault-prone software modules from non-faulty ones. A common element in these investigations is the use of search-based techniques, often also called metaheuristic techniques, for supporting the V&V decision-making processes. Search-based techniques are promising since, as many problems in real world, software V&V can be formulated as optimisation problems where near optimal solutions are often good enough. Moreover, these techniques are general optimization solutions that can potentially be applied across a larger variety of decision-making situations than other existing alternatives. Apart from presenting the current state of the art, in the form of a systematic literature review, and doing comparative evaluations of a variety of metaheuristic techniques on large-scale projects (both industrial and open-source), this thesis also presents methodological investigations using search-based techniques that are relevant to the task of software quality measurement and prediction. The results of applying search-based techniques in large-scale projects, while investigating a variety of research themes, show that they consistently give competitive results in comparison with existing techniques. Based on the research findings, we conclude that search-based techniques are viable techniques to use in supporting the decision-making processes within software V&V activities. The accuracy and consistency of these techniques make them important tools when developing future decision support for effective management of software V&V activities. %K genetic algorithms, genetic programming, SBSE %9 Ph.D. thesis %U http://www.bth.se/fou/forskinfo.nsf/0/dd0dcce8cc126a52c125784500410306/$file/Dis%20Wasif%20Afzal%20thesis.pdf %0 Journal Article %T Prediction of faults-slip-through in large software projects: an empirical evaluation %A Afzal, Wasif %A Torkar, Richard %A Feldt, Robert %A Gorschek, Tony %J Software Quality Journal %D 2014 %8 mar %V 22 %N 1 %I Springer US %@ 0963-9314 %G English %F Afzal:2013:SQJ %X A large percentage of the cost of rework can be avoided by finding more faults earlier in a software test process. Therefore, determination of which software test phases to focus improvement work on has considerable industrial interest. We evaluate a number of prediction techniques for predicting the number of faults slipping through to unit, function, integration, and system test phases of a large industrial project. The objective is to quantify improvement potential in different test phases by striving toward finding the faults in the right phase. The results show that a range of techniques are found to be useful in predicting the number of faults slipping through to the four test phases; however, the group of search-based techniques (genetic programming, gene expression programming, artificial immune recognition system, and particle swarm optimisation (PSO) based artificial neural network) consistently give better predictions, having a representation at all of the test phases. Human predictions are consistently better at two of the four test phases. We conclude that the human predictions regarding the number of faults slipping through to various test phases can be well supported by the use of search-based techniques. A combination of human and an automated search mechanism (such as any of the search-based techniques) has the potential to provide improved prediction results. %K genetic algorithms, genetic programming, SBSE, Prediction, Empirical, Faults-slip-through, Search-based %9 journal article %R doi:10.1007/s11219-013-9205-3 %U http://www.bth.se/fou/forskinfo.nsf/all/3d40224f7cbf862dc1257b7800251e66?OpenDocument %U http://dx.doi.org/doi:10.1007/s11219-013-9205-3 %P 51-86 %0 Book Section %T Towards Benchmarking Feature Subset Selection Methods for Software Fault Prediction %A Afzal, Wasif %A Torkar, Richard %E Pedrycz, Witold %E Succi, Giancarlo %E Sillitti, Alberto %B Computational Intelligence and Quantitative Software Engineering %S Studies in Computational Intelligence %D 2016 %V 617 %I Springer %F Afzal2016 %X Despite the general acceptance that software engineering datasets often contain noisy, irrelevant or redundant variables, very few benchmark studies of feature subset selection (FSS) methods on real-life data from software projects have been conducted. This paper provides an empirical comparison of state-of-the-art FSS methods: information gain attribute ranking (IG); Relief (RLF); principal component analysis (PCA); correlation-based feature selection (CFS); consistency-based subset evaluation (CNS); wrapper subset evaluation (WRP); and an evolutionary computation method, genetic programming (GP), on five fault prediction datasets from the PROMISE data repository. For all the datasets, the area under the receiver operating characteristic curve, the AUC value averaged over 10-fold cross-validation runs, was calculated for each FSS method-dataset combination before and after FSS. Two diverse learning algorithms, C4.5 and naive Bayes (NB) are used to test the attribute sets given by each FSS method. The results show that although there are no statistically significant differences between the AUC values for the different FSS methods for both C4.5 and NB, a smaller set of FSS methods (IG, RLF, GP) consistently select fewer attributes without degrading classification accuracy. We conclude that in general, FSS is beneficial as it helps improve classification accuracy of NB and C4.5. There is no single best FSS method for all datasets but IG, RLF and GP consistently select fewer attributes without degrading classification accuracy within statistically significant boundaries. %K genetic algorithms, genetic programming, SBSE, Feature subset selection, Fault prediction, Empirical %R doi:10.1007/978-3-319-25964-2_3 %U http://dx.doi.org/doi:10.1007/978-3-319-25964-2_3 %P 33-58 %0 Conference Proceedings %T A Genetic Programming Approach for Constructing Foreground and Background Saliency Features for Salient Object Detection %A Afzali, Shima %A Al-Sahaf, Harith %A Xue, Bing %A Hollitt, Christopher %A Zhang, Mengjie %Y Mitrovic, Tanja %Y Xue, Bing %Y Li, Xiaodong %S Australasian Joint Conference on Artificial Intelligence %S LNCS %D 2018 %8 dec 11 14 %V 11320 %I Springer %C Wellington, New Zealand %F afzali:2018:AJCAI %K genetic algorithms, genetic programming %R doi:10.1007/978-3-030-03991-2_21 %U http://link.springer.com/chapter/10.1007/978-3-030-03991-2_21 %U http://dx.doi.org/doi:10.1007/978-3-030-03991-2_21 %0 Conference Proceedings %T Genetic Programming for Feature Selection and Feature Combination in Salient Object Detection %A Afzali, Shima %A Al-Sahaf, Harith %A Xue, Bing %A Hollitt, Christopher %A Zhang, Mengjie %Y Kaufmann, Paul %Y Castillo, Pedro A. %S 22nd International Conference, EvoApplications 2019 %S LNCS %D 2019 %8 24 26 apr %V 11454 %I Springer Verlag %C Leipzig, Germany %F Afzali:2019:evoapplications %X Salient Object Detection (SOD) aims to model human visual attention system to cope with the complex natural scene which contains various objects at different scales. Over the past two decades, a wide range of saliency features have been introduced in the SOD field, however feature selection has not been widely investigated for selecting informative, non-redundant, and complementary features from the existing features. In SOD, multi-level feature extraction and feature combination are two fundamental stages to compute the final saliency map. However, designing a good feature combination framework is a challenging task and requires domain-expert intervention. In this paper, we propose a genetic programming (GP) based method that is able to automatically select the complementary saliency features and generate mathematical function to combine those features. The performance of the proposed method is evaluated using four benchmark datasets and compared to nine state-of-the-art methods. The qualitative and quantitative results show that the proposed method significantly outperformed, or achieved comparable performance to, the competitor methods. %K genetic algorithms, genetic programming, Salient Object Detection, Feature combination, Feature selection %R doi:10.1007/978-3-030-16692-2_21 %U http://dx.doi.org/doi:10.1007/978-3-030-16692-2_21 %P 308-324 %0 Thesis %T Evolutionary Computation for Feature Manipulation in Salient Object Detection %A Afzali Vahed Moghaddam, Shima %D 2020 %C New Zealand %C Computer Science, Victoria University of Wellington %F Afzali:thesis %X The human visual system can efficiently cope with complex natural scenes containing various objects at different scales using the visual attention mechanism. Salient object detection (SOD) aims to simulate the capability of the human visual system in prioritizing objects for high-level processing. SOD is a process of identifying and localizing the most attention grabbing object(s) of a scene and separating the whole extent of the object(s) from the scene. In SOD, significant research has been dedicated to design and introduce new features to the domain. The existing saliency feature space suffers from some difficulties such as having high dimensionality, features are not equally important, some features are irrelevant, and the original features are not informative enough. These difficulties can lead to various performance limitations. Feature manipulation is the process which improves the input feature space to enhance the learning quality and performance. Evolutionary computation (EC) techniques have been employed in a wide range of tasks due to their powerful search abilities. Genetic programming (GP) and particle swarm optimization (PSO) are well-known EC techniques which have been used for feature manipulation. The overall goal of this thesis is to develop feature manipulation methods including feature weighting, feature selection, and feature construction using EC techniques to improve the input feature set for SOD. This thesis proposes a feature weighting method using PSO to explore the relative contribution of each saliency feature in the feature combination process. Saliency features are referred to the features which are extracted from different levels (e.g., pixel, segmentation) of an image to compute the saliency values over the entire image. The experimental results show that different datasets favour different weights for the employed features. The results also reveal that by considering the importance of each feature in the combination process, the proposed method has achieved better performance than that of the competitive methods. This thesis proposes a new bottom-up SOD method to detect salient objects by constructing two new informative saliency features and designing a new feature combination framework. The proposed method aims at developing features which target to identify different regions of the image. The proposed method makes a good balance between computational time and performance. This thesis proposes a GP-based method to automatically construct foreground and background saliency features. The automatically constructed features do not require domain-knowledge and they are more informative compared to the manually constructed features. The results show that GP is robust towards the changes in the input feature set (e.g., adding more features to the input feature set) and improves the performance by introducing more informative features to the SOD domain. This thesis proposes a GP-based SOD method which automatically produces saliency maps (a 2-D map containing saliency values) for different types of images. This GP-based SOD method applies feature selection and feature combination during the learning process for SOD. GP with built-in feature selection process which selects informative features from the original set and combines the selected features to produce the final saliency map. The results show that GP can potentially explore a large search space and find a good way to combine different input features. This thesis introduces GP for the first time to construct high-level saliency features from the low-level features for SOD, which aims to improve the performance of SOD, particularly on challenging and complex SOD tasks. The proposed method constructs fewer features that achieve better saliency performance than the original full feature set. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://hdl.handle.net/10063/8897 %0 Journal Article %T An automatic feature construction method for salient object detection: A genetic programming approach %A Afzali Vahed Moghaddam, Shima %A Al-Sahaf, Harith %A Xue, Bing %A Hollitt, Christopher %A Zhang, Mengjie %J Expert Systems with Applications %D 2021 %V 186 %@ 0957-4174 %F Afzali:2021:ESA %X Over the last two decades, salient object detection (SOD) has received increasingly more attention due to its ability to handle complex natural scenes and its various real-world applications. The performance of an SOD method mainly relies on saliency features that are extracted with different levels of information. Low-level saliency features are often effective in simple scenarios, but they are not always robust in challenging scenarios. With the recent prevalence of high-level saliency features such as deep convolutional neural networks (CNNs) features, a remarkable progress has been achieved in the SOD field. However, CNN-based constructed high-level features unavoidably drop the location information and low-level fine details (e.g., edges and corners) of salient object(s), leading to unclear/blurry boundary predictions. In addition, deep CNN methods have difficulties to generalize and accurately detect salient objects when they are trained with limited number of images (e.g. small datasets). This paper proposes a new automatic feature construction method using Genetic Programming (GP) to construct informative high-level saliency features for SOD. The proposed method takes low-level and hand-crafted saliency features as input to construct high-level features. The constructed GP-based high-level features not only detect the general objects, but they are also good at capturing details and edges/boundaries. The GP-based constructed features have better interpretability compared to CNN-based features. The proposed GP-based method can potentially cope with a small number of samples for training to obtain a good generalization as long as the given training data has enough information to represent the distribution of the data. The experiments on six datasets reveal that the new method achieves consistently high performance compared to twelve state-of-the-art SOD methods %K genetic algorithms, genetic programming, Salient object detection, Feature construction %9 journal article %R doi:10.1016/j.eswa.2021.115726 %U https://www.sciencedirect.com/science/article/pii/S0957417421011076 %U http://dx.doi.org/doi:10.1016/j.eswa.2021.115726 %P 115726 %0 Conference Proceedings %T Random Systems with Complete Connections %A Agapie, Alexandru %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F agapie:1999:RSCC %K genetic algorithms and classifier systems, poster papers %U http://gpbib.cs.ucl.ac.uk/gecco1999/GA-862.ps %P 770 %0 Conference Proceedings %T Learning Recursive Functions with Object Oriented Genetic Programming %A Agapitos, Alexandros %A Lucas, Simon M. %Y Collet, Pierre %Y Tomassini, Marco %Y Ebner, Marc %Y Gustafson, Steven %Y Ekárt, Anikó %S Proceedings of the 9th European Conference on Genetic Programming %S Lecture Notes in Computer Science %D 2006 %8 October 12 apr %V 3905 %I Springer %C Budapest, Hungary %@ 3-540-33143-3 %F eurogp06:AgapitosLucas %X This paper describes the evolution of recursive functions within an Object-Oriented Genetic Programming (OOGP) system. We evolved general solutions to factorial, Fibonacci, exponentiation, even-n-Parity, and nth-3. We report the computational effort required to evolve these methods and provide a comparison between crossover and mutation variation operators, and also undirected random search. We found that the evolutionary algorithms performed much better than undirected random search, and that mutation outperformed crossover on most problems. %K genetic algorithms, genetic programming %R doi:10.1007/11729976_15 %U http://dx.doi.org/doi:10.1007/11729976_15 %P 166-177 %0 Conference Proceedings %T Evolving Efficient Recursive Sorting Algorithms %A Agapitos, Alexandros %A Lucas, Simon M. %Y Yen, Gary G. %Y Wang, Lipo %Y Bonissone, Piero %Y Lucas, Simon M. %S Proceedings of the 2006 IEEE Congress on Evolutionary Computation %D 2006 %8 16 21 jul %I IEEE Press %C Vancouver %@ 0-7803-9487-9 %F Agapitos:2006:CEC %X Object Oriented Genetic Programming (OOGP) is applied to the task of evolving general recursive sorting algorithms. We studied the effects of language primitives and fitness functions on the success of the evolutionary process. For language primitives, these were the methods of a simple list processing package. Five different fitness functions based on sequence disorder were evaluated. The time complexity of the successfully evolved algorithms was measured experimentally in terms of the number of method invocations made, and for the best evolved individuals this was best approximated as O(n log(n)). This is the first time that sorting algorithms of this complexity have been evolved. %K genetic algorithms, genetic programming, computational complexity, evolutionary computation, object-oriented languages, object-oriented programming, OOGP, evolutionary process, fitness function, language primitives, object oriented genetic programming, recursive sorting algorithms, time complexity %R doi:10.1109/CEC.2006.1688643 %U http://privatewww.essex.ac.uk/~aagapi/papers/AgapitosLucasEvolvingSort.pdf %U http://dx.doi.org/doi:10.1109/CEC.2006.1688643 %P 9227-9234 %0 Conference Proceedings %T Evolving a Statistics Class Using Object Oriented Evolutionary Programming %A Agapitos, Alexandros %A Lucas, Simon M. %Y Ebner, Marc %Y O’Neill, Michael %Y Ekárt, Anikó %Y Vanneschi, Leonardo %Y Esparcia-Alcázar, Anna Isabel %S Proceedings of the 10th European Conference on Genetic Programming %S Lecture Notes in Computer Science %D 2007 %8 November 13 apr %V 4445 %I Springer %C Valencia, Spain %@ 3-540-71602-5 %F eurogp07:agapitos1 %X Object Oriented Evolutionary Programming is used to evolve programs that calculate some statistical measures on a set of numbers. We compared this technique with a more standard functional representation. We also studied the effects of scalar and Pareto-based multi-objective fitness functions to the induction of multi-task programs. We found that the induction of a program residing in an OO representation space is more efficient, yielding less fitness evaluations, and that scalar fitness performed better than Pareto-based fitness in this problem domain. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-540-71605-1_27 %U http://dx.doi.org/doi:10.1007/978-3-540-71605-1_27 %P 291-300 %0 Conference Proceedings %T Evolving Modular Recursive Sorting Algorithms %A Agapitos, Alexandros %A Lucas, Simon M. %Y Ebner, Marc %Y O’Neill, Michael %Y Ekárt, Anikó %Y Vanneschi, Leonardo %Y Esparcia-Alcázar, Anna Isabel %S Proceedings of the 10th European Conference on Genetic Programming %S Lecture Notes in Computer Science %D 2007 %8 November 13 apr %V 4445 %I Springer %C Valencia, Spain %@ 3-540-71602-5 %F eurogp07:agapitos2 %X A fundamental issue in evolutionary learning is the definition of the solution representation language. We present the application of Object Oriented Genetic Programming to the task of coevolving general recursive sorting algorithms along with their primitive representation alphabet. We report the computational effort required to evolve target solutions and provide a comparison between crossover and mutation variation operators, and also undirected random search. We found that the induction of evolved method signatures (typed parameters and return type) can be realized through an evolutionary fitness-driven process. We also found that the evolutionary algorithm outperformed undirected random search, and that mutation performed better than crossover in this problem domain. The main result is that modular sorting algorithms can be evolved. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-540-71605-1_28 %U http://dx.doi.org/doi:10.1007/978-3-540-71605-1_28 %P 301-310 %0 Conference Proceedings %T Evolving controllers for simulated car racing using object oriented genetic programming %A Agapitos, Alexandros %A Togelius, Julian %A Lucas, Simon Mark %Y Thierens, Dirk %Y Beyer, Hans-Georg %Y Bongard, Josh %Y Branke, Jurgen %Y Clark, John Andrew %Y Cliff, Dave %Y Congdon, Clare Bates %Y Deb, Kalyanmoy %Y Doerr, Benjamin %Y Kovacs, Tim %Y Kumar, Sanjeev %Y Miller, Julian F. %Y Moore, Jason %Y Neumann, Frank %Y Pelikan, Martin %Y Poli, Riccardo %Y Sastry, Kumara %Y Stanley, Kenneth Owen %Y Stutzle, Thomas %Y Watson, Richard A. %Y Wegener, Ingo %S GECCO ’07: Proceedings of the 9th annual conference on Genetic and evolutionary computation %D 2007 %8 July 11 jul %V 2 %I ACM Press %C London %F 1277271 %X The Probabilistic Adaptive Mapping Developmental Genetic Programming (PAM DGP) algorithm that cooperatively Co-evolves a population of adaptive mappings and associated genotypes is used to learn recursive solutions given a function set consisting of general (not implicitly recursive) machine-language instructions. PAM DGP using redundant encodings to model the evolution of the biological genetic code is found to more efficiently learn 2nd and 3rd order recursive Fibonacci functions than related developmental systems and traditional linear GP. PAM DGP using redundant encoding is also demonstrated to produce the semantically highest quality solutions for all three recursive functions considered (Factorial, 2nd and 3rd order Fibonacci). PAM DGP is then shown to have produced such solutions by evolving redundant mappings to select and emphasise appropriate subsets of the function set useful for producing the naturally recursive solutions. %K genetic algorithms, genetic programming, evolutionary computer games, evolutionary robotics, homologous uniform crossover, neural networks, object oriented, subtree macro-mutation %R doi:10.1145/1276958.1277271 %U http://gpbib.cs.ucl.ac.uk/gecco2007/docs/p1543.pdf %U http://dx.doi.org/doi:10.1145/1276958.1277271 %P 1543-1550 %0 Conference Proceedings %T Multiobjective Techniques for the Use of State in Genetic Programming Applied to Simulated Car Racing %A Agapitos, Alexandros %A Togelius, Julian %A Lucas, Simon M. %Y Srinivasan, Dipti %Y Wang, Lipo %S 2007 IEEE Congress on Evolutionary Computation %D 2007 %8 25 28 sep %I IEEE Press %C Singapore %@ 1-4244-1340-0 %F Agapitos:2007:cec %X Multi-objective optimisation is applied to encourage the effective use of state variables in car controlling programs evolved using Genetic Programming. Three different metrics for measuring the use of state within a program are introduced. Comparisons are performed among multi- and single-objective fitness functions with respect to learning speed and final fitness of evolved individuals, and attempts are made at understanding whether there is a trade-off between good performance and stateful controllers in this problem domain. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2007.4424659 %U 1977.pdf %U http://dx.doi.org/doi:10.1109/CEC.2007.4424659 %P 1562-1569 %0 Conference Proceedings %T Learning to recognise mental activities: genetic programming of stateful classifiers for brain-computer interfacing %A Agapitos, Alexandros %A Dyson, Matthew %A Lucas, Simon M. %A Sepulveda, Francisco %Y Keijzer, Maarten %Y Antoniol, Giuliano %Y Congdon, Clare Bates %Y Deb, Kalyanmoy %Y Doerr, Benjamin %Y Hansen, Nikolaus %Y Holmes, John H. %Y Hornby, Gregory S. %Y Howard, Daniel %Y Kennedy, James %Y Kumar, Sanjeev %Y Lobo, Fernando G. %Y Miller, Julian Francis %Y Moore, Jason %Y Neumann, Frank %Y Pelikan, Martin %Y Pollack, Jordan %Y Sastry, Kumara %Y Stanley, Kenneth %Y Stoica, Adrian %Y Talbi, El-Ghazali %Y Wegener, Ingo %S GECCO ’08: Proceedings of the 10th annual conference on Genetic and evolutionary computation %D 2008 %8 December 16 jul %I ACM %C Atlanta, GA, USA %F Agapitos:2008:gecco %X Two families (stateful and stateless) of genetically programmed classifiers were tested on a five class brain-computer interface (BCI) data set of raw EEG signals. The ability of evolved classifiers to discriminate mental tasks from each other were analysed in terms of accuracy, precision and recall. A model describing the dynamics of state usage in stateful programs is introduced. An investigation of relationships between the model attributes and associated classification results was made. The results show that both stateful and stateless programs can be successfully evolved for this task, though stateful programs start from lower fitness and take longer to evolve %K genetic algorithms, genetic programming, Brain computer interface, classification on Raw signal, stateful representation, statistical signal primitives %R doi:10.1145/1389095.1389326 %U http://gpbib.cs.ucl.ac.uk/gecco2008/docs/p1155.pdf %U http://dx.doi.org/doi:10.1145/1389095.1389326 %P 1155-1162 %0 Conference Proceedings %T On the genetic programming of time-series predictors for supply chain management %A Agapitos, Alexandros %A Dyson, Matthew %A Kovalchuk, Jenya %A Lucas, Simon Mark %Y Keijzer, Maarten %Y Antoniol, Giuliano %Y Congdon, Clare Bates %Y Deb, Kalyanmoy %Y Doerr, Benjamin %Y Hansen, Nikolaus %Y Holmes, John H. %Y Hornby, Gregory S. %Y Howard, Daniel %Y Kennedy, James %Y Kumar, Sanjeev %Y Lobo, Fernando G. %Y Miller, Julian Francis %Y Moore, Jason %Y Neumann, Frank %Y Pelikan, Martin %Y Pollack, Jordan %Y Sastry, Kumara %Y Stanley, Kenneth %Y Stoica, Adrian %Y Talbi, El-Ghazali %Y Wegener, Ingo %S GECCO ’08: Proceedings of the 10th annual conference on Genetic and evolutionary computation %D 2008 %8 December 16 jul %I ACM %C Atlanta, GA, USA %F Agapitos2:2008:gecco %K genetic algorithms, genetic programming, Iterated single-step prediction, prediction/forecasting, single-step prediction, statistical time-series Features %R doi:10.1145/1389095.1389327 %U http://gpbib.cs.ucl.ac.uk/gecco2008/docs/p1163.pdf %U http://dx.doi.org/doi:10.1145/1389095.1389327 %P 1163-1170 %0 Conference Proceedings %T Generating Diverse Opponents with Multiobjective Evolution %A Agapitos, Alexandros %A Togelius, Julian %A Lucas, Simon M. %A Schmidhuber, Jurgen %A Konstantinidis, Andreas %S Proceedings of the 2008 IEEE Symposium on Computational Intelligence and Games %D 2008 %8 dec 15 18 %I IEEE %C Perth, Australia %F Agapitos:2008:CIG %X For computational intelligence to be useful in creating game agent AI, we need to focus on creating interesting and believable agents rather than just learn to play the games well. To this end, we propose a way to use multiobjective evolutionary algorithms to automatically create populations of NPCs, such as opponents and collaborators, that are interestingly diverse in behaviour space. Experiments are presented where a number of partially conflicting objectives are defined for racing game competitors, and multiobjective evolution of GP-based controllers yield Pareto fronts of interesting controllers. %K genetic algorithms, genetic programming, Reinforcement Learning, Multiobjective Evolution, AI in Computer Games, EMOA, Car Racing, MOGA, AI game agent, computational intelligence, diverse opponent generation, game play learning, multiobjective evolutionary algorithm, nonplayer character, computer games, evolutionary computation, learning (artificial intelligence), multi-agent systems %R doi:10.1109/CIG.2008.5035632 %U http://julian.togelius.com/Agapitos2008Generating.pdf %U http://dx.doi.org/doi:10.1109/CIG.2008.5035632 %P 135-142 %0 Conference Proceedings %T Evolutionary Learning of Technical Trading Rules without Data-mining Bias %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %Y Schaefer, Robert %Y Cotta, Carlos %Y Kolodziej, Joanna %Y Rudolph, Guenter %S PPSN 2010 11th International Conference on Parallel Problem Solving From Nature %S Lecture Notes in Computer Science %D 2010 %8 November 15 sep %V 6238 %I Springer %C Krakow, Poland %F agapitos_etal:ppsn2010 %X In this paper we investigate the profitability of evolved technical trading rules when controlling for data-mining bias. For the first time in the evolutionary computation literature, a comprehensive test for a rule’s statistical significance using Hansen’s Superior Predictive Ability is explicitly taken into account in the fitness function, and multi-objective evolutionary optimisation is employed to drive the search towards individual rules with better generalisation abilities. Empirical results on a spot foreign-exchange market index suggest that increased out-of-sample performance can be obtained after accounting for data-mining bias effects in a multi-objective fitness function, as compared to a single-criterion fitness measure that considers solely the average return. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-15844-5_30 %U http://dx.doi.org/doi:10.1007/978-3-642-15844-5_30 %P 294-303 %0 Conference Proceedings %T Evolutionary Prediction of Total Electron Content over Cyprus %A Agapitos, Alexandros %A Konstantinidis, Andreas %A Haralambous, Haris %A Papadopoulos, Harris %Y Papadopoulos, Harris %Y Andreou, Andreas %Y Bramer, Max %S 6th IFIP Advances in Information and Communication Technology AIAI 2010 %S IFIP Advances in Information and Communication Technology %D 2010 %8 oct 6 7 %V 339 %I Springer %C Larnaca, Cyprus %F Agapitos:2010:AIAI %X Total Electron Content (TEC) is an ionospheric characteristic used to derive the signal delay imposed by the ionosphere on trans-ionospheric links and subsequently overwhelm its negative impact in accurate position determination. In this paper, an Evolutionary Algorithm (EA), and particularly a Genetic Programming (GP) based model is designed. The proposed model is based on the main factors that influence the variability of the predicted parameter on a diurnal, seasonal and long-term time-scale. Experimental results show that the GP-model, which is based on TEC measurements obtained over a period of 11 years, has produced a good approximation of the modeled parameter and can be implemented as a local model to account for the ionospheric imposed error in positioning. The GP-based approach performs better than the existing Neural Network-based approach in several cases. %K genetic algorithms, genetic programming, Evolutionary Algorithms, Global Positioning System, Total Electron Content %R doi:10.1007/978-3-642-16239-8_50 %U http://dx.doi.org/doi:10.1007/978-3-642-16239-8_50 %P 387-394 %0 Conference Proceedings %T Promoting the generalisation of genetically induced trading rules %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %Y Kapetanios, G. %Y Linton, O. %Y McAleer, M. %Y Ruiz, E. %S Proceedings of the 4th International Conference on Computational and Financial Econometrics CFE’10 %D 2010 %8 October 12 dec %I ERCIM %C Senate House, University of London, UK %F agapitosetal:2010:cfe %X The goal of Machine Learning is not to induce an exact representation of the training patterns themselves, but rather to build a model of the underlying pattern-generation process. One of the most important aspects of this computational process is how to obtain general models that are representative of the true concept, and as a result, perform efficiently when presented with novel patterns from that concept. A particular form of evolutionary machine learning, Genetic Programming, tackles learning problems by means of an evolutionary process of program discovery. In this paper we investigate the profitability of evolved technical trading rules when accounting for the problem of over-fitting. Out-of-sample rule performance deterioration is a well-known problem, and has been mainly attributed to the tendency of the evolved models to find meaningless regularities in the training dataset due to the high dimensionality of features and the rich hypothesis space. We present a review of the major established methods for promoting generalisation in conventional machine learning paradigms. Then, we report empirical results of adapting such techniques to the Genetic Programming methodology, and applying it to discover trading rules for various financial datasets. %K genetic algorithms, genetic programming %U http://www.cfe-csda.org/cfe10/LondonBoA.pdf %P E678 %0 Conference Proceedings %T Maximum Margin Decision Surfaces for Increased Generalisation in Evolutionary Decision Tree Learning %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %A Theodoridis, Theodoros %Y Silva, Sara %Y Foster, James A. %Y Nicolau, Miguel %Y Giacobini, Mario %Y Machado, Penousal %S Proceedings of the 14th European Conference on Genetic Programming, EuroGP 2011 %S LNCS %D 2011 %8 27 29 apr %V 6621 %I Springer Verlag %C Turin, Italy %F agapitos:2011:EuroGP %X Decision tree learning is one of the most widely used and practical methods for inductive inference. We present a novel method that increases the generalisation of genetically-induced classification trees, which employ linear discriminants as the partitioning function at each internal node. Genetic Programming is employed to search the space of oblique decision trees. At the end of the evolutionary run, a (1+1) Evolution Strategy is used to geometrically optimise the boundaries in the decision space, which are represented by the linear discriminant functions. The evolutionary optimisation concerns maximising the decision-surface margin that is defined to be the smallest distance between the decision-surface and any of the samples. Initial empirical results of the application of our method to a series of datasets from the UCI repository suggest that model generalisation benefits from the margin maximisation, and that the new method is a very competent approach to pattern classification as compared to other learning algorithms. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-20407-4_6 %U http://dx.doi.org/doi:10.1007/978-3-642-20407-4_6 %P 61-72 %0 Conference Proceedings %T Stateful program representations for evolving technical trading rules %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %Y Krasnogor, Natalio %Y Lanzi, Pier Luca %Y Engelbrecht, Andries %Y Pelta, David %Y Gershenson, Carlos %Y Squillero, Giovanni %Y Freitas, Alex %Y Ritchie, Marylyn %Y Preuss, Mike %Y Gagne, Christian %Y Ong, Yew Soon %Y Raidl, Guenther %Y Gallager, Marcus %Y Lozano, Jose %Y Coello-Coello, Carlos %Y Silva, Dario Landa %Y Hansen, Nikolaus %Y Meyer-Nieberg, Silja %Y Smith, Jim %Y Eiben, Gus %Y Bernado-Mansilla, Ester %Y Browne, Will %Y Spector, Lee %Y Yu, Tina %Y Clune, Jeff %Y Hornby, Greg %Y Wong, Man-Leung %Y Collet, Pierre %Y Gustafson, Steve %Y Watson, Jean-Paul %Y Sipper, Moshe %Y Poulding, Simon %Y Ochoa, Gabriela %Y Schoenauer, Marc %Y Witt, Carsten %Y Auger, Anne %S GECCO ’11: Proceedings of the 13th annual conference companion on Genetic and evolutionary computation %D 2011 %8 December 16 jul %I ACM %C Dublin, Ireland %F Agapitos:2011:GECCOcomp %X A family of stateful program representations in grammar-based Genetic Programming are being compared against their stateless counterpart in the problem of binary classification of sequences of daily prices of a financial asset. Empirical results suggest that stateful classifiers learn as fast as stateless ones but generalise better to unseen data, rendering this form of program representation strongly appealing to the automatic programming of technical trading rules. %K genetic algorithms, genetic programming: Poster %R doi:10.1145/2001858.2001969 %U http://dx.doi.org/doi:10.1145/2001858.2001969 %P 199-200 %0 Conference Proceedings %T Learning Environment Models in Car Racing Using Stateful Genetic Programming %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %A Theodoridis, Theodoros %S Proceedings of the 2011 IEEE Conference on Computational Intelligence and Games %D 2011 %8 31 aug 3 sep %I IEEE %C Seoul, South Korea %F Agapitos:2011:CIG %X For computational intelligence to be useful in creating game agent AI we need to focus on methods that allow the creation and maintenance of models for the environment, which the artificial agents inhabit. Maintaining a model allows an agent to plan its actions more effectively by combining immediate sensory information along with a memories that have been acquired while operating in that environment. To this end, we propose a way to build environment models for non-player characters in car racing games using stateful Genetic Programming. A method is presented, where general-purpose 2-dimensional data-structures are used to build a model of the racing track. Results demonstrate that model-building behaviour can be cooperatively coevolved with car-controlling behaviour in modular programs that make use of these models in order to navigate successfully around a racing track. %K genetic algorithms, genetic programming, Reinforcement Learning, Multiobjective Evolution, AI in Computer Games, Car Racing, AI game agent, computational intelligence, diverse opponent generation, game play learning, nonplayer character, computer games, evolutionary computation, learning (artificial intelligence), multi-agent systems, 2D data structures, artificial agents, car racing games, learning environment models, model building behaviour, modular programs, non player characters, cognition, computer games, data structures, learning (artificial intelligence), multi-agent systems %R doi:10.1109/CIG.2011.6032010 %U http://cilab.sejong.ac.kr/cig2011/proceedings/CIG2011/papers/paper54.pdf %U http://dx.doi.org/doi:10.1109/CIG.2011.6032010 %P 219-226 %0 Book Section %T An Evolutionary Algorithmic Investigation of US Corporate Payout Policy %A Agapitos, Alexandros %A Goyal, Abhinav %A Muckley, Cal %E Brabazon, Anthony %E O’Neill, Michael %E Maringer, Dietmar %B Natural Computing in Computational Finance (Volume 4) %S Studies in Computational Intelligence %D 2012 %V 380 %I Springer %F Agapitos:NCFE:2011 %X This Chapter examines cash dividends and share repurchases in the United States during the period 1990 to 2008. In the extant literature a variety of classical statistical methodologies have been adopted, foremost among these is the method of panel regression modelling. Instead, in this Chapter, we have informed our model specifications and our coefficient estimates using a genetic program. Our model captures effects from a wide range of pertinent proxy variables related to the agency cost-based life cycle theory, the signalling theory and the catering theory of corporate payout policy determination. In line with the extant literature, our findings indicate the predominant importance of the agency-cost based life cycle theory. The adopted evolutionary algorithm approach also provides important new insights concerning the influence of firm size, the concentration of firm ownership and cash flow uncertainty with respect to corporate payout policy determination in the United States. %K genetic algorithms, genetic programming, US Corporate Payout Policy, Symbolic Regression %R doi:10.1007/978-3-642-23336-4_7 %U http://hdl.handle.net/10197/3552 %U http://dx.doi.org/doi:10.1007/978-3-642-23336-4_7 %P 123-139 %0 Conference Proceedings %T Evolving Seasonal Forecasting Models with Genetic Programming in the Context of Pricing Weather-Derivatives %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %Y Di Chio, Cecilia %Y Agapitos, Alexandros %Y Cagnoni, Stefano %Y Cotta, Carlos %Y Fernandez de Vega, F. %Y Di Caro, Gianni A. %Y Drechsler, Rolf %Y Ekart, Aniko %Y Esparcia-Alcazar, Anna I. %Y Farooq, Muddassar %Y Langdon, William B. %Y Merelo, Juan J. %Y Preuss, Mike %Y Richter, Hendrik %Y Silva, Sara %Y Simoes, Anabela %Y Squillero, Giovanni %Y Tarantino, Ernesto %Y Tettamanzi, Andrea G. B. %Y Togelius, Julian %Y Urquhart, Neil %Y Uyar, A. Sima %Y Yannakakis, Georgios N. %S Applications of Evolutionary Computing, EvoApplications 2012: EvoCOMNET, EvoCOMPLEX, EvoFIN, EvoGAMES, EvoHOT, EvoIASP, EvoNUM, EvoPAR, EvoRISK, EvoSTIM, EvoSTOC %S LNCS %D 2011 %8 November 13 apr %V 7248 %I Springer Verlag %C Malaga, Spain %F agapitos:evoapps12 %X In this study we evolve seasonal forecasting temperature models, using Genetic Programming (GP), in order to provide an accurate, localised, long-term forecast of a temperature profile as part of the broader process of determining appropriate pricing model for weather-derivatives, financial instruments that allow organisations to protect themselves against the commercial risks posed by weather fluctuations. Two different approaches for time-series modelling are adopted. The first is based on a simple system identification approach whereby the temporal index of the time-series is used as the sole regressor of the evolved model. The second is based on iterated single-step prediction that resembles autoregressive and moving average models in statistical time-series modelling. Empirical results suggest that GP is able to successfully induce seasonal forecasting models, and that autoregressive models compose a more stable unit of evolution in terms of generalisation performance for the three datasets investigated. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-29178-4_14 %U http://dx.doi.org/doi:10.1007/978-3-642-29178-4_14 %P 135-144 %0 Book Section %T Genetic Programming for the Induction of Seasonal Forecasts: A Study on Weather Derivatives %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %E Michael, Doumpos %E Constantin, Zopounidis %E Panos, Pardalos %B Financial Decision Making Using Computational Intelligence %S Springer Optimization and Its Applications %D 2012 %V 70 %I Springer %F Agapitos:FDMCI:2012 %O Due: July 31, 2012 %K genetic algorithms, genetic programming, Weather derivatives pricing, Seasonal temperature forecasting, Autoregressive models, Supervised ensemble learning, Generalisation %U http://www.springer.com/mathematics/applications/book/978-1-4614-3772-7 %P 153-182 %0 Conference Proceedings %T Controlling Overfitting in Symbolic Regression Based on a Bias/Variance Error Decomposition %A Agapitos, Alexandros %A Brabazon, Anthony %A O’Neill, Michael %Y Coello Coello, Carlos A. %Y Cutello, Vincenzo %Y Deb, Kalyanmoy %Y Forrest, Stephanie %Y Nicosia, Giuseppe %Y Pavone, Mario %S Parallel Problem Solving from Nature, PPSN XII (part 1) %S Lecture Notes in Computer Science %D 2012 %8 sep 1 5 %V 7491 %I Springer %C Taormina, Italy %F conf/ppsn/Agapitos12 %X We consider the fundamental property of generalisation of data-driven models evolved by means of Genetic Programming (GP). The statistical treatment of decomposing the regression error into bias and variance terms provides insight into the generalisation capability of this modelling method. The error decomposition is used as a source of inspiration to design a fitness function that relaxes the sensitivity of an evolved model to a particular training dataset. Results on eight symbolic regression problems show that new method is capable on inducing better-generalising models than standard GP for most of the problems. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-32937-1_44 %U http://dx.doi.org/doi:10.1007/978-3-642-32937-1_44 %P 438-447 %0 Conference Proceedings %T Adaptive Distance Metrics for Nearest Neighbour Classification based on Genetic Programming %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %Y Krawiec, Krzysztof %Y Moraglio, Alberto %Y Hu, Ting %Y Uyar, A. Sima %Y Hu, Bin %S Proceedings of the 16th European Conference on Genetic Programming, EuroGP 2013 %S LNCS %D 2013 %8 March 5 apr %V 7831 %I Springer Verlag %C Vienna, Austria %F agapitos:2013:EuroGP %X Nearest Neighbour (NN) classification is a widely-used, effective method for both binary and multi-class problems. It relies on the assumption that class conditional probabilities are locally constant. However, this assumption becomes invalid in high dimensions, and severe bias can be introduced, which degrades the performance of the method. The employment of a locally adaptive distance metric becomes crucial in order to keep class conditional probabilities approximately uniform, whereby better classification performance can be attained. This paper presents a locally adaptive distance metric for NN classification based on a supervised learning algorithm (Genetic Programming) that learns a vector of feature weights for the features composing an instance query. Using a weighted Euclidean distance metric, this has the effect of adaptive neighbourhood shapes to query locations, stretching the neighbourhood along the directions for which the class conditional probabilities don’t change much. Initial empirical results on a set of real-world classification datasets showed that the proposed method enhances the generalisation performance of standard NN algorithm, and that it is a competent method for pattern classification as compared to other learning algorithms. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-37207-0_1 %U http://dx.doi.org/doi:10.1007/978-3-642-37207-0_1 %P 1-12 %0 Conference Proceedings %T Higher Order Functions for Kernel Regression %A Agapitos, Alexandros %A McDermott, James %A O’Neill, Michael %A Kattan, Ahmed %A Brabazon, Anthony %Y Nicolau, Miguel %Y Krawiec, Krzysztof %Y Heywood, Malcolm I. %Y Castelli, Mauro %Y Garcia-Sanchez, Pablo %Y Merelo, Juan J. %Y Rivas Santos, Victor M. %Y Sim, Kevin %S 17th European Conference on Genetic Programming %S LNCS %D 2014 %8 23 25 apr %V 8599 %I Springer %C Granada, Spain %F agapitos:2014:EuroGP %X Kernel regression is a well-established nonparametric method, in which the target value of a query point is estimated using a weighted average of the surrounding training examples. The weights are typically obtained by applying a distance-based kernel function, which presupposes the existence of a distance measure. This paper investigates the use of Genetic Programming for the evolution of task-specific distance measures as an alternative to Euclidean distance. Results on seven real-world datasets show that the generalisation performance of the proposed system is superior to that of Euclidean-based kernel regression and standard GP. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-662-44303-3_1 %U http://dx.doi.org/doi:10.1007/978-3-662-44303-3_1 %P 1-12 %0 Conference Proceedings %T Ensemble Bayesian Model Averaging in Genetic Programming %A Agapitos, Alexandros %A O’Neill, Michael %A Brabazon, Anthony %Y Coello Coello, Carlos A. %S Proceedings of the 2014 IEEE Congress on Evolutionary Computation %D 2014 %8 June 11 jul %C Beijing, China %@ 0-7803-8515-2 %F Agapitos:2014:CEC %X This paper considers the general problem of function estimation via Genetic Programming (GP). Data analysts typically select a model from a population of models, and then proceed as if the selected model had generated the data. This approach ignores the uncertainty in model selection, leading to over-confident inferences and lack of generalisation. We adopt a coherent method for accounting for this uncertainty through a weighted averaging of all models competing in a population of GP. It is a principled statistical method for post-processing a population of programs into an ensemble, which is based on Bayesian Model Averaging (BMA). Under two different formulations of BMA, the predictive probability density function (PDF) of a response variable is a weighted average of PDFs centred around the individual predictions of component models that take the form of either standalone programs or ensembles of programs. The weights are equal to the posterior probabilities of the models generating the predictions, and reflect the models’ skill on the training dataset. The method was applied to a number of synthetic symbolic regression problems, and results demonstrate that it generalises better than standard methods for model selection, as well as methods for ensemble construction in GP. %K genetic algorithms, Genetic programming, Data mining, Classification, clustering and data analysis %R doi:10.1109/CEC.2014.6900567 %U http://dx.doi.org/doi:10.1109/CEC.2014.6900567 %P 2451-2458 %0 Conference Proceedings %T Deep Evolution of Feature Representations for Handwritten Digit Recognition %A Agapitos, Alexandros %A O’Neill, Michael %A Nicolau, Miguel %A Fagan, David %A Kattan, Ahmed %A Curran, Kathleen %Y Murata, Yadahiko %S Proceedings of 2015 IEEE Congress on Evolutionary Computation (CEC 2015) %D 2015 %I IEEE Press %C Sendai, Japan %F agapitos:cec2015 %X A training protocol for learning deep neural networks, called greedy layer-wise training, is applied to the evolution of a hierarchical, feed-forward Genetic Programming based system for feature construction and object recognition. Results on a popular handwritten digit recognition benchmark clearly demonstrate that two layers of feature transformations improves generalisation compared to a single layer. In addition, we show that the proposed system outperforms several standard Genetic Programming systems, which are based on hand-designed features, and use different program representations and fitness functions. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2015.7257189 %U http://dx.doi.org/doi:10.1109/CEC.2015.7257189 %P 2452-2459 %0 Conference Proceedings %T Genetic Programming with Memory For Financial Trading %A Agapitos, Alexandros %A Brabazon, Anthony %A O’Neill, Michael %Y Squillero, Giovanni %Y Burelli, Paolo %S 19th European Conference on the Applications of Evolutionary Computation %S Lecture Notes in Computer Science %D 2016 %8 mar 30 apr 1 %V 9597 %I Springer %C Porto, Portugal %F EvoBafin16Agapitosetal %X A memory-enabled program representation in strongly-typed Genetic Programming (GP) is compared against the standard representation in a number of financial time-series modelling tasks. The paper first presents a survey of GP systems that use memory. Thereafter, a number of simulations show that memory-enabled programs generalise better than their standard counterparts in most datasets of this problem domain. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-319-31204-0_2 %U http://dx.doi.org/10.1007/978-3-319-31204-0_2 %U http://dx.doi.org/doi:10.1007/978-3-319-31204-0_2 %P 19-34 %0 Journal Article %T Recursion in tree-based genetic programming %A Agapitos, Alexandros %A O’Neill, Michael %A Kattan, Ahmed %A Lucas, Simon M. %J Genetic Programming and Evolvable Machines %D 2017 %8 jun %V 18 %N 2 %@ 1389-2576 %F Agapitos:2016:GPEM %X Recursion is a powerful concept that enables a solution to a problem to be expressed as a relatively simple decomposition of the original problem into sub-problems of the same type. We survey previous research about the evolution of recursive programs in tree-based Genetic Programming. We then present an analysis of the fitness landscape of recursive programs, and report results on evolving solutions to a range of problems. We conclude with guidelines concerning the choice of fitness function and variation operators, as well as the handling of the halting problem. The main findings are as follows. The distribution of fitness changes initially as we look at programs of increasing size but once some threshold has been exceeded, it shows very little variation with size. Furthermore, the proportion of halting programs decreases as size increases. Recursive programs exhibit the property of weak causality; small changes in program structure may cause big changes in semantics. Nevertheless, the evolution of recursive programs is not a needle-in-a-haystack problem; the neighbourhoods of optimal programs are populated by halting individuals of intermediate fitness. Finally, mutation-based variation operators performed the best in finding recursive solutions. Evolution was also shown to outperform random search. %K genetic algorithms, genetic programming, Evolutionary program synthesis Recursive programs, Variation operators, Fitness landscape analysis %9 journal article %R doi:10.1007/s10710-016-9277-5 %U http://dx.doi.org/doi:10.1007/s10710-016-9277-5 %P 149-183 %0 Journal Article %T Regularised Gradient Boosting for Financial Time-series Modelling %A Agapitos, Alexandros %A Brabazon, Anthony %A O’Neill, Michael %J Computational Management Science %D 2017 %8 jul %V 14 %N 3 %F Agapitos:2018:CMS %X Gradient Boosting (GB) learns an additive expansion of simple basis-models. This is accomplished by iteratively fitting an elementary model to the negative gradient of a loss function with respect to the expansion’s values at each training data-point evaluated at each iteration. For the case of squared-error loss function, the negative gradient takes the form of an ordinary residual for a given training data-point. Studies have demonstrated that running GB for hundreds of iterations can lead to overfitting, while a number of authors showed that by adding noise to the training data, generalisation is impaired even with relatively few basis-models. Regularisation is realised through the shrinkage of every newly-added basis-model to the expansion. This paper demonstrates that GB with shrinkage-based regularisation is still prone to overfitting in noisy datasets. We use a transformation based on a sigmoidal function for reducing the influence of extreme values in the residuals of a GB iteration without removing them from the training set. This extension is built on top of shrinkage-based regularisation. Simulations using synthetic, noisy data show that the proposed method slows-down overfitting and reduces the generalisation error of regularised GB. The proposed method is then applied to the inherently noisy domain of financial time-series modelling. Results suggest that for the majority of datasets the method generalises better when compared against standard regularised GB, as well as against a range of other time-series modelling methods. %K genetic algorithms, genetic programming, Boosting algorithms, Gradient boosting, Stagewise additive modelling, Regularisation, Financial time-series modelling, Financial forecasting, Feedforward neural networks, ANN, Noisy data, Ensemble learning %9 journal article %R doi:10.1007/s10287-017-0280-y %U https://ideas.repec.org/a/spr/comgts/v14y2017i3d10.1007_s10287-017-0280-y.html %U http://dx.doi.org/doi:10.1007/s10287-017-0280-y %P 367-391 %0 Journal Article %T A Survey of Statistical Machine Learning Elements in Genetic Programming %A Agapitos, Alexandros %A Loughran, Roisin %A Nicolau, Miguel %A Lucas, Simon %A O’Neill, Michael %A Brabazon, Anthony %J IEEE Transactions on Evolutionary Computation %D 2019 %8 dec %V 23 %N 6 %@ 1089-778X %F Agapitos:ieeeTEC %X Modern Genetic Programming operates within the Statistical Machine Learning framework. In this framework evolution needs to balance between approximation of an unknown target function on the training data and generalisation, which is the ability to predict well on new data. The article provides a survey and critical discussion of Statistical Machine Learning methods that enable Genetic Programming to generalise. %K genetic algorithms, genetic programming, Statistical Machine Learning, SML, Generalisation, Overfitting, Classification, Symbolic Regression, Model selection, Regularisation, Model Averaging, Bias-Variance trade-off %9 journal article %R doi:10.1109/TEVC.2019.2900916 %U http://ncra.ucd.ie/papers/08648159.pdf %U http://dx.doi.org/doi:10.1109/TEVC.2019.2900916 %P 1029-1048 %0 Conference Proceedings %T Computational Brittleness and the Evolution of Computer Viruses %A Agapow, Paul-Michael %Y Voigt, Hans-Michael %Y Ebeling, Werner %Y Rechenberg, Ingo %Y Schwefel, Hans-Paul %S Parallel Problem Solving From Nature IV. Proceedings of the International Conference on Evolutionary Computation %S LNCS %D 1996 %8 22 26 sep %V 1141 %I Springer-Verlag %C Berlin, Germany %@ 3-540-61723-X %F agapow:1996:cbecv %X In recent years computer viruses have grown to be of great concern. They have also been proposed as prototypical artificial life, but the possibility of their evolution has been dismissed due to modern computer programs being computationally brittle (i.e. a random change to a functional program will almost certainly render it non-functional) and the series of steps required for the evolution of a new virus being improbable. These allegations are examined by studying homology between functional program sequences. It is concluded that programs are far less brittle than expected. While the evolution of viruses de novo is still unlikely, evolution of pre-existing viruses and programs is feasible. This has significant implications for computer security and evolutionary computation. %K genetic algorithms, genetic programming %R doi:10.1007/3-540-61723-X_964 %U http://dx.doi.org/doi:10.1007/3-540-61723-X_964 %P 2-11 %0 Book Section %T Genetic Programming for Wafer Property Prediction After Plasma Enhanced %A Agarwal, Ashish %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2000 %D 2000 %8 jun %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F agarwal:2000:GPWPPAPE %K genetic algorithms, genetic programming %P 16-24 %0 Conference Proceedings %T Probabilistic Analysis of a Geosynthetic Reinforced Soil Retaining Wall Under Seismic Conditions Using Genetic Programming %A Agarwal, Ekansh %A Verma, Ajeet Kumar %A Pain, Anindya %A Sarkar, Shantanu %S Soil Dynamics, Earthquake and Computational Geotechnical Engineering %D 2023 %I Springer %F agarwal:2023:SDECGE %K genetic algorithms, genetic programming %R doi:10.1007/978-981-19-6998-0_20 %U http://link.springer.com/chapter/10.1007/978-981-19-6998-0_20 %U http://dx.doi.org/doi:10.1007/978-981-19-6998-0_20 %0 Journal Article %T A high Performance Algorithm for Solving large scale Travelling Salesman Problem using Distributed Memory Architectures %A Aggarwal, Khushboo %A Singh, Sunil Kumar %A Khattar, Sakar %J Indian Journal of Computer Science and Engineering %D 2011 %8 aug sep %V 2 %N 4 %@ 2231-3850 %G en %F Aggarwal:2011:ijcse %X In this paper, we present an intelligent solution system for travelling salesman problem. The solution has three stages. The first stage uses Clustering Analysis in Data Mining to classify all customers by a number of attributes, such as distance, demand level, the density of customer, and city layout. The second stage introduces how to generate feasible routing schemes for each vehicle type. Specifically, a depth-first search algorithm with control rules is presented to generate feasible routing schemes. In the last stage, a genetic programming model is applied to find the best possible solution. Finally, we present a paradigm for using this algorithm for distributed memory architectures to gain the benefits of parallel processing. %K genetic algorithms, genetic programming, TSP, traveling salesman problem, fitness functions %9 journal article %U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.300.6369 %P 516-521 %0 Generic %T Prediction of Protein Secondary Structure using Genetic Programming %A Aggarwal, Varun %D 2003 %I Summer Internship Project Report During June-July 2003 %F Aggarwal:intern %X Project 1:Using SOM and Genetic Programming to predict Protein Secondary structure Project 2: Improving PSIPRED Predictions using Genetic Programming %K genetic algorithms, genetic programming %U http://web.mit.edu/varun_ag/www/psspreport.pdf %0 Conference Proceedings %T Evolved Matrix Operations for Post-Processing Protein Secondary Structure Predictions %A Aggarwal, Varun %A MacCallum, Robert %Y Keijzer, Maarten %Y O’Reilly, Una-May %Y Lucas, Simon M. %Y Costa, Ernesto %Y Soule, Terence %S Genetic Programming 7th European Conference, EuroGP 2004, Proceedings %S LNCS %D 2004 %8 May 7 apr %V 3003 %I Springer-Verlag %C Coimbra, Portugal %@ 3-540-21346-5 %F maccallum:2004:eurogp %X Predicting the three-dimensional structure of proteins is a hard problem, so many have opted instead to predict the secondary structural state (usually helix, strand or coil) of each amino acid residue. This should be an easier task, but it now seems that a ceiling of around 76 percent per-residue three-state accuracy has been reached. Further improvements will require the correct processing of so-called ’long-range information’. We present a novel application of genetic programming to evolve high level matrix operations to post-process secondary structure prediction probabilities produced by the popular, state-of-the-art neural network based PSIPRED by David Jones. We show that global and long-range information may be used to increase three-state accuracy by at least 0.26 percentage points - a small but statistically significant difference. This is on top of the 0.14 percentage point increase already made by PSIPRED’s built-in filters. %K genetic algorithms, genetic programming: Poster %R doi:10.1007/978-3-540-24650-3_20 %U http://web.mit.edu/varun_ag/www/aggarwal-eurogp2004.pdf %U http://dx.doi.org/doi:10.1007/978-3-540-24650-3_20 %P 220-229 %0 Book Section %T Design of Posynomial Models for Mosfets: Symbolic Regression Using Genetic Algorithms %A Aggarwal, Varun %A O’Reilly, Una-May %E Riolo, Rick L. %E Soule, Terence %E Worzel, Bill %B Genetic Programming Theory and Practice IV %S Genetic and Evolutionary Computation %D 2006 %8 November 13 may %V 5 %I Springer %C Ann Arbor %@ 0-387-33375-4 %F Aggarwal:2006:GPTP %X Starting from a broad description of analogue circuit design in terms of topology design and sizing, we discuss the difficulties of sizing and describe approaches that are manual or automatic. These approaches make use of blackbox optimisation techniques such as evolutionary algorithms or convex optimization techniques such as geometric programming. Geometric programming requires posynomial expressions for a circuit’s performance measurements. We show how a genetic algorithm can be exploited to evolve a polynomial expression (i.e. model) of transistor (i.e. mosfet) behaviour more accurately than statistical techniques in the literature. %K genetic algorithms, genetic programming, circuit sizing, symbolic regression, posynomial models, geometric programming %R doi:10.1007/978-0-387-49650-4_14 %U http://people.csail.mit.edu/unamay/publications-dir/gptp06.pdf %U http://dx.doi.org/doi:10.1007/978-0-387-49650-4_14 %P 219-236 %0 Journal Article %T The use of ELM-WT (extreme learning machine with wavelet transform algorithm) to predict exergetic performance of a DI diesel engine running on diesel/biodiesel blends containing polymer waste %A Aghbashlo, Mortaza %A Shamshirband, Shahaboddin %A Tabatabaei, Meisam %A Yee, Por Lip %A Larimi, Yaser Nabavi %J Energy %D 2016 %V 94 %@ 0360-5442 %F Aghbashlo:2016:Energy %X In this study, a novel method based on Extreme Learning Machine with wavelet transform algorithm (ELM-WT) was designed and adapted to estimate the exergetic performance of a DI diesel engine. The exergetic information was obtained by calculating mass, energy, and exergy balance equations for the experimental trials conducted at various engine speeds and loads as well as different biodiesel and expanded polystyrene contents. Furthermore, estimation capability of the ELM-WT model was compared with that of the ELM, GP (genetic programming) and ANN (artificial neural network) models. The experimental results showed that an improvement in the exergetic performance modelling of the DI diesel engine could be achieved by the ELM-WT approach in comparison with the ELM, GP, and ANN methods. Furthermore, the results showed that the applied algorithm could learn thousands of times faster than the conventional popular learning algorithms. Obviously, the developed ELM-WT model could be used with a high degree of confidence for further work on formulating novel model predictive strategy for investigating exergetic performance of DI diesel engines running on various renewable and non-renewable fuels. %K genetic algorithms, genetic programming, Biodiesel, DI diesel engine, Exergetic performance parameters, Expanded polystyrene, Cost sensitivity analysis, Extreme learning machine-wavelet (ELM-WT) %9 journal article %R doi:10.1016/j.energy.2015.11.008 %U http://www.sciencedirect.com/science/article/pii/S0360544215015327 %U http://dx.doi.org/doi:10.1016/j.energy.2015.11.008 %P 443-456 %0 Journal Article %T Image classification: an evolutionary approach %A Agnelli, Davide %A Bollini, Alessandro %A Lombardi, Luca %J Pattern Recognition Letters %D 2002 %8 jan %V 23 %N 1-3 %@ 0167-8655 %F agnelli:2002:PRL %X Evolutionary algorithms are proving viable in solving complex optimization problems such as those typical of supervised learning approaches to image understanding. This paper presents an evolutionary approach to image classification and discusses some experimental results, suggesting that genetic programming could provide a convenient alternative to standard supervised learning methods. %K genetic algorithms, genetic programming, Image classification, Supervised learning %9 journal article %R doi:10.1016/S0167-8655(01)00128-3 %U http://dx.doi.org/doi:10.1016/S0167-8655(01)00128-3 %P 303-309 %0 Conference Proceedings %T Proofster: Automated Formal Verification %A Agrawal, Arpan %A First, Emily %A Kaufman, Zhanna %A Reichel, Tom %A Zhang, Shizhuo %A Zhou, Timothy %A Sanchez-Stern, Alex %A Ringer, Talia %A Brun, Yuriy %S Proceedings of the Demonstrations Track at the 45th International Conference on Software Engineering (ICSE) %D 2023 %8 14 20 may %C Melbourne %F Agrawal:2023:ICSE %X Formal verification is an effective but extremely work-intensive method of improving software quality. Verifying the correctness of software systems often requires significantly more effort than implementing them in the first place, despite the existence of proof assistants, such as Coq, aiding the process. Recent work has aimed to fully automate the synthesis of formal verification proofs, but little tool support exists for practitioners. This paper presents Proofster, a web-based tool aimed at assisting developers with the formal verification process via proof synthesis. Proofster inputs a Coq theorem specifying a property of a software system and attempts to automatically synthesize a formal proof of the correctness of that property. When it is unable to produce a proof, Proofster outputs the proof-space search tree its synthesis explored, which can guide the developer to provide a hint to enable Proofster to synthesize the proof. Proofster runs online at https://proofster.cs.umass.edu/ and a video demonstrating Proofster is available at https://youtu.be/xQAi66lRfwI/. %K genetic algorithms, genetic programming %R doi:10.1109/ICSE-Companion58688.2023.00018 %U http://dx.doi.org/doi:10.1109/ICSE-Companion58688.2023.00018 %P 26-30 %0 Conference Proceedings %T Reliability-Centered Maintenance Methodology-Based Fuzzy Classifier System Design for Fault Tolerance %A Aguilar, Jose L. %A Cerrada, Mariela %Y Koza, John R. %Y Banzhaf, Wolfgang %Y Chellapilla, Kumar %Y Deb, Kalyanmoy %Y Dorigo, Marco %Y Fogel, David B. %Y Garzon, Max H. %Y Goldberg, David E. %Y Iba, Hitoshi %Y Riolo, Rick %S Genetic Programming 1998: Proceedings of the Third Annual Conference %D 1998 %8 22 25 jul %I Morgan Kaufmann %C University of Wisconsin, Madison, Wisconsin, USA %@ 1-55860-548-7 %F aguilar:1998:rcmmcfssdft %K genetic algorithms, classifiers %P 621 %0 Conference Proceedings %T Approaches Based on Genetic Algorithms for Multiobjective Optimization Problems %A Aguilar, Jose %A Miranda, Pablo %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F aguilar:1999:ABGAMOP %K genetic algorithms and classifier systems %U http://gpbib.cs.ucl.ac.uk/gecco1999/GA-873.pdf %P 3-10 %0 Conference Proceedings %T Three Geometric Approaches for representing Decision Rules in a Supervised Learning System %A Aguilar, Jesus %A Riquelme, Jose %A Toro, Miguel %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F aguilar:1999:TGADRSLS %K genetic algorithms and classifier systems, poster papers %U http://gpbib.cs.ucl.ac.uk/gecco1999/GA-391.pdf %P 771 %0 Conference Proceedings %T Three geometric approaches for representing decision rules in a supervised learning system %A Aguilar, Jesus %A Riquelme, Jose %A Toro, Miguel %Y Brave, Scott %Y Wu, Annie S. %S Late Breaking Papers at the 1999 Genetic and Evolutionary Computation Conference %D 1999 %8 13 jul %C Orlando, Florida, USA %F aguilar:1999:T %X hyperrectangles, rotated hyperrectangles and hyperellipses %K Genetic Algorithms, data mining, supervised learning, hyper rectangles, rotated hyper rectangles, hyper ellipse %P 8-15 %0 Conference Proceedings %T Fuzzy Classifier System and Genetic Programming on System Identification Problems %A Aguilar, Jose %A Cerrada, Mariela %Y Spector, Lee %Y Goodman, Erik D. %Y Wu, Annie %Y Langdon, W. B. %Y Voigt, Hans-Michael %Y Gen, Mitsuo %Y Sen, Sandip %Y Dorigo, Marco %Y Pezeshk, Shahram %Y Garzon, Max H. %Y Burke, Edmund %S Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001) %D 2001 %8 July 11 jul %I Morgan Kaufmann %C San Francisco, California, USA %@ 1-55860-774-9 %F aguilar3:2001:gecco %K genetic algorithms, genetic programming, real world applications %U http://gpbib.cs.ucl.ac.uk/gecco2001/d24.pdf %P 1245-1251 %0 Conference Proceedings %T Genetic Programming-Based Approach for System Identification Applying Genetic Programming to obtain Separation %A Aguilar, Jose %A Cerrada, Mariela %Y Mastorakis, Nikos E. %S WSES International Conferences WSEAS NNA-FSFS-EC 2001 %D 2001 %8 feb 11 15 %C Puerto De La Cruz, Tenerife, Spain %F WSEAS_640_Aguilar %X In this work, an approach based on Genetic Programming is proposed for the input-output systems identification problem. Laguerre’s functions and the ARX method have been commonly used to solve the systems identification problem. Recently, approaches based on Artificial Neural Networks have been used to solve this problem. Genetic Programming is an Evolutionary Computation technique based on the evolution of mathematical symbols (constants, functions, variables, operators, etc.). To achieve the identification, a set of analysis trees is used to describe the different models (individuals) that our approach proposes during its execution. At the end of the evolutionary process, an input-output model of the system is proposed by our approach (it is the best individual). %K genetic algorithms, genetic programming, Genetic Programming, Evolutionary Computation, Identification Systems %U http://www.wseas.us/e-library/conferences/tenerife2001/papers/640.pdf %P 6401-6406 %0 Generic %T A Data Mining Algorithm Based on the Genetic Programming %A Aguilar, J. %A Altamiranda, J. %D 2004 %F Aguilar:2004:sci %X Data Mining is composed by a set of methods to extract knowledgement from large database. One of these methods is Genetic Programming. In this work we use this method to build a Data Mining System that define a set of patterns in order to classify the data. We define a grammar, which is used by the Genetic Programming in order to define the rules that represent the patterns. In this way, we can group the data in class and simplify the information in the database according to the set of patterns. %K genetic algorithms, genetic programming, Data Mining, Clustering %0 Conference Proceedings %T Data Extrapolation Using Genetic Programming to Matrices Singular Values Estimation %A Aguilar, Jose %A Gonzalez, Gilberto %Y Yen, Gary G. %Y Lucas, Simon M. %Y Fogel, Gary %Y Kendall, Graham %Y Salomon, Ralf %Y Zhang, Byoung-Tak %Y Coello, Carlos A. Coello %Y Runarsson, Thomas Philip %S Proceedings of the 2006 IEEE Congress on Evolutionary Computation %D 2006 %8 16 21 jul %I IEEE Press %C Vancouver, BC, Canada %@ 0-7803-9487-9 %F Aguilar:DEU:cec2006 %X In mathematical models where the dimensions of the matrices are very large, the use of classical methods to compute the singular values is very time consuming and requires a lot of computational resources. In this way, it is necessary to find new faster methods to compute the singular values of a very large matrix. We present a method to estimate the singular values of a matrix based on Genetic Programming (GP). GP is an approach based on the evolutionary principles of the species. GP is used to make extrapolations of data out of sample data. The extrapolations of data are achieved by irregularity functions which approximate very well the trend of the sample data. GP produces from just simple’s functions, operators and a fitness function, complex mathematical expressions that adjust smoothly to a group of points of the form (xi, yi). We obtain amazing mathematical formulas that follow the behaviour of the sample data. We compare our algorithm with two techniques: the linear regression and non linear regression approaches. Our results suggest that we can predict with some percentage of error the largest singular values of a matrix without computing the singular values of the whole matrix and using only some random selected columns of the matrix. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2006.1688718 %U http://ieeexplore.ieee.org/servlet/opac?punumber=11108 %U http://dx.doi.org/doi:10.1109/CEC.2006.1688718 %P 3227-3230 %0 Journal Article %T Genetic algorithms and Darwinian approaches in financial applications: A survey %A Aguilar-Rivera, Ruben %A Valenzuela-Rendon, Manuel %A Rodriguez-Ortiz, J. J. %J Expert Systems with Applications %D 2015 %8 30 nov %V 42 %N 21 %@ 0957-4174 %F AguilarRivera:2015:ESA %X This article presents a review of the application of evolutionary computation methods to solving financial problems. Genetic algorithms, genetic programming, multi-objective evolutionary algorithms, learning classifier systems, co-evolutionary approaches, and estimation of distribution algorithms are the techniques considered. The novelty of our approach comes in three different manners: it covers time lapses not included in other review articles, it covers problems not considered by others, and the scope covered by past and new references is compared and analysed. The results concluded the interest about methods and problems has changed through time. Although, genetic algorithms have remained the most popular approach in the literature. There are combinations of problems and solutions methods which are yet to be investigated. %K genetic algorithms, genetic programming, Evolutionary computation, Finance, Portfolio optimization, Survey %9 journal article %R doi:10.1016/j.eswa.2015.06.001 %U http://www.sciencedirect.com/science/article/pii/S0957417415003954 %U http://dx.doi.org/doi:10.1016/j.eswa.2015.06.001 %P 7684-7697 %0 Conference Proceedings %T A Genetic Programming Approach to Logic Function Synthesis by Means of Multiplexers %A Aguirre, Arturo Hernandez %A Coello, Carlos A. Coello %A Buckles, Bill P. %Y Stoica, Adrian %Y Keymeulen, Didier %Y Lohn, Jason %S Proceedings of the The First NASA/DOD Workshop on Evolvable Hardware %D 1999 %8 19 21 jul %I IEEE Computer Society %C Pasadena, California %@ 0-7695-0256-3 %F aguirre:1999:EH %X This paper presents an approach based on the use of genetic programming to synthesize logic functions. The proposed approach uses the 1-control line multiplexer as the only design unit, defining any logic function (defined by a truth table) through the replication of this single unit. Our fitness function first explores the search space trying to find a feasible design and then concentrates in the minimization of such (fully feasible) circuit. The proposed approach is illustrated using several sample Boolean functions. %K genetic algorithms, genetic programming, evolvable hardware, 1-control line multiplexer, Boolean functions, fitness function, genetic programming approach, logic function synthesis, minimisation, multiplexers, Boolean functions, logic design, minimisation, multiplexing equipment %R doi:10.1109/EH.1999.785434 %U http://dx.doi.org/doi:10.1109/EH.1999.785434 %P 46-53 %0 Conference Proceedings %T Cooperative Crossover and Mutation Operators in Genetic Algorithms %A Aguirre, Hernan E. %A Tanaka, Kiyoshi %A Sugimura, Tatsuo %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F aguirre:1999:CCMOGA %K genetic algorithms and classifier systems, poster papers %P 772 %0 Journal Article %T Evolutionary Synthesis of Logic Circuits Using Information Theory %A Aguirre, Arturo Hernandez %A Coello Coello, Carlos A. %J Artificial Intelligence Review %D 2003 %V 20 %N 3-4 %I Kluwer Academic Publishers %@ 0269-2821 %G English %F Aguirre:2003:AIR %X In this paper, we propose the use of Information Theory as the basis for designing a fitness function for Boolean circuit design using Genetic Programming. Boolean functions are implemented by replicating binary multiplexers. Entropy-based measures, such as Mutual Information and Normalised Mutual Information are investigated as tools for similarity measures between the target and evolving circuit. Three fitness functions are built over a primitive one. We show that the landscape of Normalized Mutual Information is more amenable for being used as a fitness function than simple Mutual Information. The evolutionary synthesised circuits are compared to the known optimum size. A discussion of the potential of the Information-Theoretical approach is given. %K genetic algorithms, genetic programming, circuit synthesis, computer-aided design, evolutionary algorithms, evolvable hardware, information theory %9 journal article %R doi:10.1023/B:AIRE.0000006603.98023.97 %U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.378.9801 %U http://dx.doi.org/doi:10.1023/B:AIRE.0000006603.98023.97 %P 445-471 %0 Conference Proceedings %T Mutual Information-based Fitness Functions for Evolutionary Circuit Synthesis %A Hernandez-Aguirre, Arturo %A Coello-Coello, Carlos %S Proceedings of the 2004 IEEE Congress on Evolutionary Computation %D 2004 %8 20 23 jun %V 2 %I IEEE Press %C Portland, Oregon %@ 0-7803-8515-2 %F Hernandez-Aguirre:2004:MIFFfECS %X Mutual Information and Normalised Mutual Information measures are investigated. The goal is the analysis of some fitness functions based in mutual information and what problems prevent them from common use. We identify and find a clear explanation to them, thereafter, we propose new fitness functions and ran several experiments to investigate their effect on the search space, convergence time, and quality of solutions. %K genetic algorithms, genetic programming, EHW, Evolutionary Design Automation, Evolutionary design & evolvable hardware %R doi:10.1109/CEC.2004.1331048 %U http://delta.cs.cinvestav.mx/~ccoello/conferences/cec04-muxmutual.pdf.gz %U http://dx.doi.org/doi:10.1109/CEC.2004.1331048 %P 1309-1316 %0 Journal Article %T Settling velocity of drill cuttings in drilling fluids: A review of experimental, numerical simulations and artificial intelligence studies %A Agwu, Okorie E. %A Akpabio, Julius U. %A Alabi, Sunday B. %A Dosunmu, Adewale %J Powder Technology %D 2018 %V 339 %@ 0032-5910 %F AGWU:2018:PT %X In this paper, a comprehensive review of experimental, numerical and artificial intelligence studies on the subject of cuttings settling velocity in drilling muds made by researchers over the last seven decades is brought to the fore. In this respect, 91 experimental, 13 numerical simulations and 7 artificial intelligence researches were isolated, reviewed, tabulated and discussed. A comparison of the three methods and the challenges facing each of these methods were also reviewed. The major outcomes of this review include: (1) the unanimity among experimental researchers that mud rheology, particle size and shape and wall effect are major parameters affecting the settling velocity of cuttings in wellbores; (2) the prevalence of cuttings settling velocity experiments done with the mud in static conditions and the wellbore in the vertical configuration; (3) the extensive use of rigid particles of spherical shape to represent drill cuttings due to their usefulness in experimental visualization, particle tracking, and numerical implementation; (4) the existence of an artificial intelligence technique - multi-gene genetic programming (MGGP) which can provide an explicit equation that can help in predicting settling velocity; (5) the limited number of experimental studies factoring in the effect of pipe rotation and well inclination effects on the settling velocity of cuttings and (6) the most applied numerical method for determining settling velocity is the finite element method. Despite these facts, there is need to perform more experiments with real drill cuttings and factor in the effects of conditions such as drillstring rotation and well inclination and use data emanating therefrom to develop explicit models that would include the effects of these. It should be noted however, that the aim of this paper is not to create an encyclopaedia of particle settling velocity research, but to provide to the researcher with a basic, theoretical, experimental and numerical overview of what has so far been achieved in the area of cuttings settling velocity in drilling muds %K genetic algorithms, genetic programming, Artificial Intelligence, Drill cuttings, Numerical simulations, Settling velocity %9 journal article %R doi:10.1016/j.powtec.2018.08.064 %U http://www.sciencedirect.com/science/article/pii/S0032591018307022 %U http://dx.doi.org/doi:10.1016/j.powtec.2018.08.064 %P 728-746 %0 Journal Article %T Modeling the downhole density of drilling muds using multigene genetic programming %A Agwu, Okorie Ekwe %A Akpabio, Julius Udoh %A Dosunmu, Adewale %J Upstream Oil and Gas Technology %D 2021 %V 6 %@ 2666-2604 %F AGWU:2021:UOGT %X The main objective of this paper is to use experimental measurements of downhole pressure, temperature and initial mud density to predict downhole density using multigene genetic programming. From the results, the mean square error for the WBM density model was 0.0012, with a mean absolute error of 0.0246 and the square of correlation coefficient (R2) was 0.9998; while for the OBM, the MSE was 0.000359 with MAE of 0.01436 and R2 of 0.99995. In assessing the OBM model’s generalization capability, the model had an MSE of 0.031, MAE of 0.138 and mean absolute percentage error (MAPE) of 0.95percent %K genetic algorithms, genetic programming, Multigene genetic programming, Downhole mud density, Drilling mud, HTHP %9 journal article %R doi:10.1016/j.upstre.2020.100030 %U https://www.sciencedirect.com/science/article/pii/S266626042030030X %U http://dx.doi.org/doi:10.1016/j.upstre.2020.100030 %P 100030 %0 Generic %T Modeling Time Series of Real Systems using Genetic Programming %A Ahalpara, Dilip P. %A Parikh, Jitendra C. %D 2006 %8 14 jul %I ArXiv Nonlinear Sciences e-prints %F nlin/0607029 %O Submitted to Physical Review E %X Analytic models of two computer generated time series (Logistic map and Rossler system) and two real time series (ion saturation current in Aditya Tokamak plasma and NASDAQ composite index) are constructed using Genetic Programming (GP) framework. In each case, the optimal map that results from fitting part of the data set also provides a very good description of rest of the data. Predictions made using the map iteratively range from being very good to fair. %K genetic algorithms, genetic programming %U http://arxiv.org/PS_cache/nlin/pdf/0607/0607029v1.pdf %0 Journal Article %T Genetic Programming based approach for Modeling Time Series data of real systems %A Ahalpara, Dilip P. %A Parikh, Jitendra C. %J International Journal of Modern Physics C, Computational Physics and Physical Computation %D 2008 %V 19 %N 1 %F Ahalpara:2008:IJMPC %X Analytic models of a computer generated time series (logistic map) and three real time series (ion saturation current in Aditya Tokamak plasma, NASDAQ composite index and Nifty index) are constructed using Genetic Programming (GP) framework. In each case, the optimal map that results from fitting part of the data set also provides a very good description of the rest of the data. Predictions made using the map iteratively are very good for computer generated time series but not for the data of real systems. For such cases, an extended GP model is proposed and illustrated. A comparison of these results with those obtained using Artificial Neural Network (ANN) is also carried out. %K genetic algorithms, genetic programming, Time series analysis, artificial neural networks %9 journal article %R doi:10.1142/S0129183108011942 %U http://dx.doi.org/doi:10.1142/S0129183108011942 %P 63-91 %0 Journal Article %T Characterizing and modelling cyclic behaviour in non-stationary time series through multi-resolution analysis %A Ahalpara, Dilip P. %A Verma, Amit %A Parikh, Jitendra C. %A Panigrahi, Prasanta K. %J Pramana %D 2008 %8 nov %V 71 %I Springer India, in co-publication with Indian Academy of Sciences %@ 0304-4289 %F 2008Prama..71..459A %X A method based on wavelet transform is developed to characterise variations at multiple scales in non-stationary time series. We consider two different financial time series, S&P CNX Nifty closing index of the National Stock Exchange (India) and Dow Jones industrial average closing values. These time series are chosen since they are known to comprise of stochastic fluctuations as well as cyclic variations at different scales. The wavelet transform isolates cyclic variations at higher scales when random fluctuations are averaged out; this corroborates correlated behaviour observed earlier in financial time series through random matrix studies. Analysis is carried out through Haar, Daubechies-4 and continuous Morlet wavelets for studying the character of fluctuations at different scales and show that cyclic variations emerge at intermediate time scales. It is found that Daubechies family of wavelets can be effectively used to capture cyclic variations since these are local in nature. To get an insight into the occurrence of cyclic variations, we then proceed to model these wavelet coefficients using genetic programming (GP) approach and using the standard embedding technique in the reconstructed phase space. It is found that the standard methods (GP as well as artificial neural networks) fail to model these variations because of poor convergence. A novel interpolation approach is developed that overcomes this difficulty. The dynamical model equations have, primarily, linear terms with additive Pade-type terms. It is seen that the emergence of cyclic variations is due to an interplay of a few important terms in the model. Very interestingly GP model captures smooth variations as well as bursty behaviour quite nicely. %K genetic algorithms, genetic programming, finance, Non-stationary time series, wavelet transform, Characterizing and modelling cyclic behaviour in non-stationary time series through multi-resolution analysis %9 journal article %R doi:10.1007/s12043-008-0125-x %U http://dx.doi.org/doi:10.1007/s12043-008-0125-x %P 459-485 %0 Conference Proceedings %T Genetic Programming Based Approach for Synchronization with Parameter Mismatches in EEG %A Ahalpara, Dilip %A Arora, Siddharth %A Santhanam, M. %Y Vanneschi, Leonardo %Y Gustafson, Steven %Y Moraglio, Alberto %Y De Falco, Ivanoe %Y Ebner, Marc %S Proceedings of the 12th European Conference on Genetic Programming, EuroGP 2009 %S LNCS %D 2009 %8 apr 15 17 %V 5481 %I Springer %C Tuebingen %F Ahalpara:2009:eurogp %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-01181-8_2 %U http://dx.doi.org/doi:10.1007/978-3-642-01181-8_2 %P 13-24 %0 Conference Proceedings %T Improved forecasting of time series data of real system using genetic programming %A Ahalpara, Dilip P. %Y Branke, Juergen %Y Pelikan, Martin %Y Alba, Enrique %Y Arnold, Dirk V. %Y Bongard, Josh %Y Brabazon, Anthony %Y Butz, Martin V. %Y Clune, Jeff %Y Cohen, Myra %Y Deb, Kalyanmoy %Y Engelbrecht, Andries P. %Y Krasnogor, Natalio %Y Miller, Julian F. %Y O’Neill, Michael %Y Sastry, Kumara %Y Thierens, Dirk %Y van Hemert, Jano %Y Vanneschi, Leonardo %Y Witt, Carsten %S GECCO ’10: Proceedings of the 12th annual conference on Genetic and evolutionary computation %D 2010 %8 July 11 jul %I ACM %C Portland, Oregon, USA %F Ahalpara:2010:gecco %X A study is made to improve short term forecasting of time series data of real system using Genetic Programming (GP) under the framework of time delayed embedding technique. GP based approach is used to make analytical model of time series data of real system using embedded vectors that help reconstruct the phase space. The map equations, involving non-linear symbolic expressions in the form of binary trees comprising of time delayed components in the immediate past, are first obtained by carrying out single-step GP fit for the training data set and usually they are found to give good fitness as well as single-step predictions. However while forecasting the time series based on multi-step predictions in the out-of-sample region in an iterative manner, these solutions often show rapid deterioration as we dynamically forward the solution in future time. With a view to improve on this limitation, it is shown that if the multi-step aspect is incorporated while making the GP fit itself, the corresponding GP solutions give multi-step predictions that are improved to a fairly good extent for around those many multi-steps as incorporated during the multi-step GP fit. Two different methods for multi-step fit are introduced, and the corresponding prediction results are presented. The modified method is shown to make better forecast for out-of-sample multi-step predictions for the time series of a real system, namely Electroencephelograph (EEG) signals. %K genetic algorithms, genetic programming, Poster %R doi:10.1145/1830483.1830658 %U http://dx.doi.org/doi:10.1145/1830483.1830658 %P 977-978 %0 Conference Proceedings %T A Sniffer Technique for an Efficient Deduction of Model Dynamical Equations using Genetic Programming %A Ahalpara, Dilip %A Sen, Abhijit %Y Silva, Sara %Y Foster, James A. %Y Nicolau, Miguel %Y Giacobini, Mario %Y Machado, Penousal %S Proceedings of the 14th European Conference on Genetic Programming, EuroGP 2011 %S LNCS %D 2011 %8 27 29 apr %V 6621 %I Springer Verlag %C Turin, Italy %F ahalpara:2011:EuroGP %X A novel heuristic technique that enhances the search facility of the standard genetic programming (GP) algorithm is presented. The method provides a dynamic sniffing facility to optimise the local search in the vicinity of the current best chromosomes that emerge during GP iterations. Such a hybrid approach, that combines the GP method with the sniffer technique, is found to be very effective in the solution of inverse problems where one is trying to construct model dynamical equations from either finite time series data or knowledge of an analytic solution function. As illustrative examples, some special function ordinary differential equations (ODEs) and integrable nonlinear partial differential equations (PDEs) are shown to be efficiently and exactly recovered from known solution data. The method can also be used effectively for solution of model equations (the direct problem) and as a tool for generating multiple dynamical systems that share the same solution space. %K genetic algorithms, genetic programming, local search, hill climbing %R doi:10.1007/978-3-642-20407-4_1 %U http://dx.doi.org/doi:10.1007/978-3-642-20407-4_1 %P 1-12 %0 Conference Proceedings %T Variations in Financial Time Series: Modelling Through Wavelets and Genetic Programming %A Ahalpara, Dilip P. %A Panigrahi, Prasanta K. %A Parikh, Jitendra C. %S Econophysics of Markets and Business Networks %D 2007 %I Springer %F ahalpara:2007:EMBN %K genetic algorithms, genetic programming %R doi:10.1007/978-88-470-0665-2_3 %U http://link.springer.com/chapter/10.1007/978-88-470-0665-2_3 %U http://dx.doi.org/doi:10.1007/978-88-470-0665-2_3 %0 Journal Article %T Modelling mechanical behaviour of rubber concrete using evolutionary polynomial regression %A Ahangar-Asr, Alireza %A Faramarzi, Asaad %A Javadi, Akbar A. %A Giustolisi, Orazio %J Engineering Computation %D 2011 %V 28 %N 4 %I Emerald Group Publishing Limited %@ 0264-4401 %F Ahangar-Asr:2011:EC %X Using discarded tyre rubber as concrete aggregate is an effective solution to the environmental problems associated with disposal of this waste material. However, adding rubber as aggregate in concrete mixture changes, the mechanical properties of concrete, depending mainly on the type and amount of rubber used. An appropriate model is required to describe the behaviour of rubber concrete in engineering applications. The purpose of this paper is to show how a new evolutionary data mining technique, evolutionary polynomial regression (EPR), is used to predict the mechanical properties of rubber concrete. Design/methodology/approach EPR is a data-driven method based on evolutionary computing, aimed to search for polynomial structures representing a system. In this technique, a combination of the genetic algorithm and the least square method is used to find feasible structures and the appropriate constants for those structures. Findings Data from 70 cases of experiments on rubber concrete are used for development and validation of the EPR models. Three models are developed relating compressive strength, splitting tensile strength, and elastic modulus to a number of physical parameters that are known to contribute to the mechanical behaviour of rubber concrete. The most outstanding characteristic of the proposed technique is that it provides a transparent, structured, and accurate representation of the behaviour of the material in the form of a polynomial function, giving insight to the user about the contributions of different parameters involved. The proposed model shows excellent agreement with experimental results, and provides an efficient method for estimation of mechanical properties of rubber concrete. Originality/value In this paper, a new evolutionary data mining approach is presented for the analysis of mechanical behaviour of rubber concrete. The new approach overcomes the shortcomings of the traditional and artificial neural network-based methods presented in the literature for the analysis of slopes. EPR provides a viable tool to find a structured representation of the system, which allows the user to gain additional information on how the system performs. %K genetic algorithms, genetic programming, Mechanical & Materials Engineering, Concretes, Mechanical behaviour of materials, Rubbers %9 journal article %R doi:10.1108/02644401111131902 %U http://dx.doi.org/doi:10.1108/02644401111131902 %P 492-507 %0 Thesis %T Application of an Evolutionary Data Mining Technique for Constitutive Modelling of Geomaterials %A AhangarAsr, Alireza %D 2012 %8 31 dec %C UK %C University of Exeter %F Ahangar-Asr:thesis %X Modelling behaviour of materials involves approximating the actual behaviour with that of an idealised material that deforms in accordance with some constitutive relationships. Several constitutive models have been developed for various materials many of which involve determination of material parameters with no physical meaning. ANN is a computer-based modelling technique for computation and knowledge representation inspired by the neural architecture and operation of the human brain. It has been shown by various researchers that ANNs offer outstanding advantages in constitutive modelling of material; however, these networks have some shortcoming. In this thesis, the Evolutionary Polynomial Regression (EPR) was introduced as an alternative approach to constitutive modelling of the complex behaviour of saturated and unsaturated soils and also modelling of a number of other civil and geotechnical engineering materials and systems. EPR overcomes the shortcomings of ANN by providing a structured and transparent model representing the behaviour of the system. In this research EPR is applied to modelling of stress-strain and volume change behaviour of unsaturated soils, modelling of SWCC in unsaturated soils, hydro-thermo-mechanical modelling of unsaturated soils, identification of coupling parameters between shear strength behaviour and chemical’s effects in compacted soils, modelling of permeability and compaction characteristics of soils, prediction of the stability status of soil and rock slopes and modelling the mechanical behaviour of rubber concrete. Comparisons between EPR-based material model predictions, the experimental data and the predictions from other data mining and regression modelling techniques and also the results of the parametric studies revealed the exceptional capabilities of the proposed methodology in modelling the very complicated behaviour of geotechnical and civil engineering materials. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://hdl.handle.net/10871/9925 %0 Journal Article %T An evolutionary-based polynomial regression modeling approach to predicting discharge flow rate under sheet piles %A Ahangar-Asr, Alireza %A Johari, A. %A Javadi, Akbar A. %J Engineering with Computers %D 2023 %V 39 %N 6 %F DBLP:journals/ewc/AhangarAsrJJ23 %K genetic algorithms, genetic programming, EPR, Sheet piles/cut-off walls, Seepage flow rate, Evolutionary computation, Data mining %9 journal article %R doi:10.1007/S00366-023-01872-1 %U https://rdcu.be/dPatP %U http://dx.doi.org/doi:10.1007/S00366-023-01872-1 %P 4093-4101 %0 Conference Proceedings %T Removal of Mixed Impulse noise and Gaussian noise using genetic programming %A Aher, R. P. %A Jodhanle, K. C. %S Signal Processing (ICSP), 2012 IEEE 11th International Conference on %D 2012 %V 1 %F Aher:2012:ICSP %X In this paper, we have put forward a nonlinear filtering method for removing mixed Impulse and Gaussian noise, based on the two step switching scheme. The switching scheme uses two cascaded detectors for detecting the noise and two corresponding estimators which effectively and efficiently filters the noise from the image. A supervised learning algorithm, Genetic programming, is employed for building the two detectors with complementary characteristics. Most of the noisy pixels are identified by the first detector. The remaining noises are searched by the second detector, which is usually hidden in image details or with amplitudes close to its local neighbourhood. Both the detectors designed are based on the robust estimators of location and scale i.e. Median and Median Absolute Deviation (MAD). Unlike many filters which are specialised only for a particular noise model, the proposed filters in this paper are capable of effectively suppressing all kinds of Impulse and Gaussian noise. The proposed two-step Genetic Programming filters removes impulse and Gaussian noise very efficiently, and also preserves the image details. %K genetic algorithms, genetic programming, Gaussian noise, image denoising, impulse noise, learning (artificial intelligence), nonlinear filters, Gaussian noise, Median Absolute Deviation, cascaded detectors, complementary characteristics, image details, impulse noise, local neighbourhood, noisy pixels, nonlinear filtering method, second detector, supervised learning algorithm, two step switching scheme, alpha trimmed mean estimator, CWM, Gaussian Noise, Impulse noise, Median, Median Absolute Deviation (MAD), Non-Linear filters, Supervised Learning, Switching scheme %R doi:10.1109/ICoSP.2012.6491563 %U http://dx.doi.org/doi:10.1109/ICoSP.2012.6491563 %P 613-618 %0 Conference Proceedings %T WES: Agent-based User Interaction Simulation on Real Infrastructure %A Ahlgren, John %A Berezin, Maria Eugenia %A Bojarczuk, Kinga %A Dulskyte, Elena %A Dvortsova, Inna %A George, Johann %A Gucevska, Natalija %A Harman, Mark %A Laemmel, Ralf %A Meijer, Erik %A Sapora, Silvia %A Spahr-Summers, Justin %Y Yoo, Shin %Y Petke, Justyna %Y Weimer, Westley %Y Bruce, Bobby R. %S GI @ ICSE 2020 %D 2020 %8 March %I ACM %C internet %F Ahlgren:2020:GI %O Invited Keynote %X We introduce the Web-Enabled Simulation (WES) research agenda, and describe FACEBOOK WW system. We describe the application of WW to reliability, integrity and privacy at FACEBOOK, where it is used to simulate social media interactions on an infrastructure consisting of hundreds of millions of lines of code. The WES agenda draws on research from many areas of study, including Search Based Software Engineering, Machine Learning, Programming Languages, Multi Agent Systems, Graph Theory, Game AI, and AI Assisted Game Play. We conclude with a set of open problems and research challenges to motivate wider investigation. %K genetic algorithms, genetic programming, genetic improvement, SBSE, social testing, APR, Connectivity, Data Science, Facebook AI Research, Human Computer Interaction, UX Human, Machine Learning %R doi:10.1145/3387940.3392089 %U https://research.fb.com/wp-content/uploads/2020/04/WES-Agent-based-User-Interaction-Simulation-on-Real-Infrastructure.pdf %U http://dx.doi.org/doi:10.1145/3387940.3392089 %P 276-284 %0 Conference Proceedings %T Testing Web Enabled Simulation at Scale Using Metamorphic Testing %A Ahlgren, John %A Berezin, Maria Eugenia %A Bojarczuk, Kinga %A Dulskyte, Elena %A Dvortsova, Inna %A George, Johann %A Gucevska, Natalija %A Harman, Mark %A Lomeli, Maria %A Meijer, Erik %A Sapora, Silvia %A Spahr-Summers, Justin %Y van Deursen, Arie %Y Xie, Tao %Y Dieste, Natalia Juristo Oscar %S Proceedings of the International Conference on Software Engineering, ICSE 2021 %D 2021 %8 25 28 may %I IEEE %F Ahlgren:2021:ICSE %X We report on Facebook deployment of MIA (Metamorphic Interaction Automaton). MIA is used to test Facebook’s Web Enabled Simulation, built on a web infrastructure of hundreds of millions of lines of code. MIA tackles the twin problems of test flakiness and the unknowable oracle problem. It uses metamorphic testing to automate continuous integration and regression test execution. MIA also plays the role of a test bot, automatically commenting on all relevant changes submitted for code review. It currently uses a suite of over 40 metamorphic test cases. Even at this extreme scale, a non-trivial metamorphic test suite subset yields outcomes within 20 minutes (sufficient for continuous integration and review processes). Furthermore, our offline mode simulation reduces test flakiness from approximately 50percent (of all online tests) to 0percent (offline). Metamorphic testing has been widely-studied for 22 years. This paper is the first reported deployment into an industrial continuous integration system. %K genetic algorithms, genetic programming, Genetic Improvement, SBSE, Metamorphic Testing, Oracle Problem, Scalability, Testing, Test Flakiness, Web-Enabled Simulation %R doi:10.1109/ICSE-SEIP52600.2021.00023 %U https://research.fb.com/publications/testing-web-enabled-simulation-at-scale-using-metamorphic-testing/ %U http://dx.doi.org/doi:10.1109/ICSE-SEIP52600.2021.00023 %P 140-149 %0 Conference Proceedings %T Facebook’s Cyber-Cyber and Cyber-Physical Digital Twins %A Ahlgren, John %A Bojarczuk, Kinga %A Drossopoulou, Sophia %A Dvortsova, Inna %A George, Johann %A Gucevska, Natalija %A Harman, Mark %A Lomeli, Maria %A Lucas, Simon M. %A Meijer, Erik %A Omohundro, Steve %A Rojas, Rubmary %A Sapora, Silvia %A Zhou, Norm %Y Chitchyan, Ruzanna %Y Li, Jingyue %Y Weber, Barbara %Y Yue, Tao %S EASE 2021: Evaluation and Assessment in Software Engineering %D 2021 %8 jun 21 24 %I ACM %C Trondheim, Norway %F DBLP:conf/ease/AhlgrenBDDGGHLL21 %X A cyber/cyber digital twin is a simulation of a software system. By contrast, a cyber-physical digital twin is a simulation of a non-software (physical) system. Although cyberphysical digital twins have received a lot of recent attention, their cyber–cyber counterparts have been comparatively overlooked. In this paper we show how the unique properties of cyber cyber digital twins open up exciting opportunities for research and development. Like all digital twins, the cyber–cyber digital twin is both informed by and informs the behaviour of the twin it simulates. It is therefore a software system that simulates another software system, making it conceptually truly a twin, blurring the distinction between the simulated and the simulator. Cyber-cyber digital twins can be twins of other cyber–cyber digital twins, leading to a hierarchy of twins. As we shall see, these apparently philosophical observations have practical ramifications for the design, implementation and deployment of digital twins at Meta. %K genetic algorithms, genetic programming, genetic improvement, SBSE, Web Enabled Simulation, Digital Twin, facebook, meta, social media, online, software engineering %R doi:10.1145/3463274.3463275 %U https://research.facebook.com/publications/facebooks-cyber-cyber-and-cyber-physical-digital-twins/ %U http://dx.doi.org/doi:10.1145/3463274.3463275 %0 Unpublished Work %T Using Genetic Programming to Play Mancala %A Ahlschwede, John %D 2000 %F ahlschwede:2000:ugppm %X This paper will explain what genetic programming is, what mancala is, how I used genetic programming to evolve mancala-playing programs, and the results I got from doing so. %K genetic algorithms, genetic programming %9 unpublished %U http://www.corngolem.com/john/gp/index.html %0 Conference Proceedings %T Co-Evolving Hierarchical Programs Using Genetic Programming %A Ahluwalia, Manu %A Fogarty, Terence C. %Y Koza, John R. %Y Goldberg, David E. %Y Fogel, David B. %Y Riolo, Rick L. %S Genetic Programming 1996: Proceedings of the First Annual Conference %D 1996 %8 28–31 jul %I MIT Press %C Stanford University, CA, USA %F ahluwalia:1996:ccpGP %K genetic algorithms, genetic programming %U http://cognet.mit.edu/sites/default/files/books/9780262315876/pdfs/9780262315876_chap58.pdf %P 419 %0 Conference Proceedings %T Co-evolving Functions in Genetic Programming: A Comparison in ADF Selection Strategies %A Ahluwalia, Manu %A Bull, Larry %A Fogarty, Terence C. %Y Koza, John R. %Y Deb, Kalyanmoy %Y Dorigo, Marco %Y Fogel, David B. %Y Garzon, Max %Y Iba, Hitoshi %Y Riolo, Rick L. %S Genetic Programming 1997: Proceedings of the Second Annual Conference %D 1997 %8 13 16 jul %I Morgan Kaufmann %C Stanford University, CA, USA %F Ahluwalia:1997: %K genetic algorithms, genetic programming %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/gp1997/Ahluwalia_1997_.pdf %P 3-8 %0 Conference Proceedings %T Co-evolving Functions in Genetic Programming: An Emergent Approach using ADFs and GLiB %A Ahluwalia, Manu %A Bull, Larry %A Fogarty, Terence C. %Y Koza, John R. %S Late Breaking Papers at the 1997 Genetic Programming Conference %D 1997 %8 13–16 jul %I Stanford Bookstore %C Stanford University, CA, USA %@ 0-18-206995-8 %F ahluwalia:1997:cfGPea %K genetic algorithms, genetic programming %P 1-6 %0 Conference Proceedings %T Co-evolving Functions in Genetic Programming: Dynamic ADF Creation using GLiB %A Ahluwalia, M. %A Bull, L. %Y Porto, V. William %Y Saravanan, N. %Y Waagen, D. %Y Eiben, A. E. %S Evolutionary Programming VII: Proceedings of the Seventh Annual Conference on Evolutionary Programming %S LNCS %D 1998 %8 25 27 mar %V 1447 %I Springer-Verlag %C Mission Valley Marriott, San Diego, California, USA %@ 3-540-64891-7 %F ahluwalia:1998:cfGP:ADF+GLiB %K genetic algorithms, genetic programming %R doi:10.1007/BFb0040753 %U http://dx.doi.org/doi:10.1007/BFb0040753 %P 809-818 %0 Conference Proceedings %T A Genetic Programming-based Classifier System %A Ahluwalia, Manu %A Bull, Larry %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F ahluwalia:1999:AGPCS %K genetic algorithms, genetic programming, classifier systems %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/gecco1999/ahluwalia_1999_agpcs.pdf %P 11-18 %0 Conference Proceedings %T Coevolving Functions in Genetic Programming: Classification using K-nearest-neighbour %A Ahluwalia, Manu %A Bull, Larry %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 2 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F ahluwalia:1999:CFGPCK %K genetic algorithms, genetic programming %U http://gpbib.cs.ucl.ac.uk/gecco1999/GP-413.ps %P 947-952 %0 Thesis %T Co-evolving functions in genetic programming %A Ahluwalia, Manu %D 2000 %C UK %C University of the West of England at Bristol %F Ahluwalia:thesis %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.322427 %0 Journal Article %T Coevolving functions in genetic programming %A Ahluwalia, Manu %A Bull, Larry %J Journal of Systems Architecture %D 2001 %8 jul %V 47 %N 7 %@ 1383-7621 %F Ahluwalia:2001:SA %X In this paper we introduce a new approach to the use of automatically defined functions (ADFs) within genetic programming. The technique consists of evolving a number of separate sub-populations of functions which can be used by a population of evolving main programs. We present and refine a set of mechanisms by which the number and constitution of the function sub-populations can be defined and compare their performance on two well-known classification tasks. A final version of the general approach, for use explicitly on classification tasks, is then presented. It is shown that in all cases the coevolutionary approach performs better than traditional genetic programming with and without ADFs. %K genetic algorithms, genetic programming, ADF, Classification, EDF, Feature selection/extraction, Hierarchical programs, Knn, Speciation %9 journal article %R doi:10.1016/S1383-7621(01)00016-9 %U http://www.sciencedirect.com/science/article/B6V1F-43RV156-3/1/16dd3ab5502922479ef7bb1ca4f7b9c3 %U http://dx.doi.org/doi:10.1016/S1383-7621(01)00016-9 %P 573-585 %0 Conference Proceedings %T Breast cancer detection using cartesian genetic programming evolved artificial neural networks %A Ahmad, Arbab Masood %A Khan, Gul Muhammad %A Mahmud, Sahibzada Ali %A Miller, Julian Francis %Y Soule, Terry %Y Auger, Anne %Y Moore, Jason %Y Pelta, David %Y Solnon, Christine %Y Preuss, Mike %Y Dorin, Alan %Y Ong, Yew-Soon %Y Blum, Christian %Y Silva, Dario Landa %Y Neumann, Frank %Y Yu, Tina %Y Ekart, Aniko %Y Browne, Will %Y Kovacs, Tim %Y Wong, Man-Leung %Y Pizzuti, Clara %Y Rowe, Jon %Y Friedrich, Tobias %Y Squillero, Giovanni %Y Bredeche, Nicolas %Y Smith, Stephen L. %Y Motsinger-Reif, Alison %Y Lozano, Jose %Y Pelikan, Martin %Y Meyer-Nienberg, Silja %Y Igel, Christian %Y Hornby, Greg %Y Doursat, Rene %Y Gustafson, Steve %Y Olague, Gustavo %Y Yoo, Shin %Y Clark, John %Y Ochoa, Gabriela %Y Pappa, Gisele %Y Lobo, Fernando %Y Tauritz, Daniel %Y Branke, Jurgen %Y Deb, Kalyanmoy %S GECCO ’12: Proceedings of the fourteenth international conference on Genetic and evolutionary computation conference %D 2012 %8 July 11 jul %I ACM %C Philadelphia, Pennsylvania, USA %F Ahmad:2012:GECCO %X A fast learning neuro-evolutionary technique that evolves Artificial Neural Networks using Cartesian Genetic Programming (CGPANN) is used to detect the presence of breast cancer. Features from breast mass are extracted using fine needle aspiration (FNA) and are applied to the CGPANN for diagnosis of breast cancer. FNA data is obtained from the Wisconsin Diagnostic Breast Cancer website and is used for training and testing the network. The developed system produces fast and accurate results when compared to contemporary work done in the field. The error of the model comes out to be as low as 1percent for Type-I (classifying benign sample falsely as malignant) and 0.5percent for Type-II (classifying malignant sample falsely as benign). %K genetic algorithms, genetic programming, Cartesian Genetic Programming, real world applications, Algorithms, Design, Performance, Breast Cancer, Fine Needle Aspiration, FNA, ANN, Artificial Neural Network, Neuro-evolution %R doi:10.1145/2330163.2330307 %U http://dx.doi.org/doi:10.1145/2330163.2330307 %P 1031-1038 %0 Conference Proceedings %T Bio-signal Processing Using Cartesian Genetic Programming Evolved Artificial Neural Network (CGPANN) %A Ahmad, Arbab Masood %A Khan, Gul Muhammad %S Frontiers of Information Technology (FIT), 2012 10th International Conference on %D 2012 %F Ahmad:2012:FIT %X The aim of this paper is to explore the application of Neuro-Evolutionary Techniques to the diagnosis of various diseases. We applied the evolutionary technique of Cartesian Genetic programming Evolved Artificial Neural Network (CG-PANN) for the detection of three important diseases. Some cases showed excellent results while others are in the process of experimentation. In the first case we worked on diagnosing the extent of Parkinson’s disease using a computer based test. Experiments in this case are in progress. In the second case, we applied the Fine Needle Aspirate (FNA) data for Breast Cancer from the WDBC website to our network to classify the samples as either benign (non-cancerous) or malignant (cancerous). The results from these experiments were highly satisfactory. In the third case, we developed a modified form of Pan-Tompkins’s algorithm to detect the fiducial points from ECG signals and extracted key features from them. The features shall be applied to our network to classify the signals for the different types of Arrhythmias. Experimentation is still in progress. %K genetic algorithms, genetic programming, cardiology, diseases, electrocardiography, feature extraction, medical signal processing, neural nets, signal classification, CG-PANN, Cartesian genetic programming evolved artificial neural network, ECG signal, FNA data, Pan-Tompkins algorithm, Parkinson disease, arrhythmia, benign cancer, bio-signal processing, breast cancer, electrocardiography, experimentation process, feature extraction, fiducial point, fine needle aspirate, malignant cancer, neuro-evolutionary technique, Artificial neural networks, Cancer, Diseases, Electrocardiography, Feature extraction, Training, Breast Cancer detection, CGPANN, Cardiac Arrhythmias, FNA, Parkinson’s Disease %R doi:10.1109/FIT.2012.54 %U http://dx.doi.org/doi:10.1109/FIT.2012.54 %P 261-268 %0 Conference Proceedings %T Classification of Arrhythmia Types Using Cartesian Genetic Programming Evolved Artificial Neural Networks %A Ahmad, Arbab Masood %A Khan, Gul Muhammad %A Mahmud, Sahibzada Ali %Y Iliadis, Lazaros S. %Y Papadopoulos, Harris %Y Jayne, Chrisina %S Proceedings of 14th International Conference on Engineering Applications of Neural Networks (EANN 2013), Part I %S Communications in Computer and Information Science %D 2013 %8 sep 13 16 %V 383 %I Springer %C Halkidiki, Greece %F conf/eann/AhmadKM13 %X Cartesian Genetic programming Evolved Artificial Neural Network (CGPANN) is explored for classification of different types of arrhythmia and presented in this paper. Electrocardiography (ECG) signal is preprocessed to acquire important parameters and then presented to the classifier. The parameters are calculated from the location and amplitudes of ECG fiducial points, determined with a new algorithm inspired by Pan-Tompkins’s algorithm [14]. The classification results are satisfactory and better than contemporary methods introduced in the field. %K genetic algorithms, genetic programming, cartesian genetic programming, CGPANN, artificial neural network, neuro-evolution, CVD, cardiac arrhythmias, classification, fiducial points, LBBB beats, RBBB beats %R doi:10.1007/978-3-642-41013-0_29 %U http://dx.doi.org/10.1007/978-3-642-41013-0 %U http://dx.doi.org/doi:10.1007/978-3-642-41013-0_29 %P 282-291 %0 Conference Proceedings %T Classification of Mammograms Using Cartesian Genetic Programming Evolved Artificial Neural Networks %A Ahmad, Arbab Masood %A Khan, Gul Muhammad %A Mahmud, Sahibzada Ali %Y Iliadis, Lazaros S. %Y Maglogiannis, Ilias %Y Papadopoulos, Harris %S Proceedings 10th IFIP WG 12.5 International Conference Artificial Intelligence Applications and Innovations, AIAI 2014 %S IFIP Advances in Information and Communication Technology %D 2014 %V 436 %I Springer %C Rhodes, Greece, September 19-21, 2014 %F conf/ifip12/AhmadKM14 %X We developed a system that classifies masses or microcalcifications observed in a mammogram as either benign or malignant. The system assumes prior manual segmentation of the image. The image segment is then processed for its statistical parameters and applied to a computational intelligence system for classification. We used Cartesian Genetic Programming Evolved Artificial Neural Network (CGPANN) for classification. To train and test our system we selected 2000 mammogram images with equal number of benign and malignant cases from the well-known Digital Database for Screening Mammography (DDSM). To find the input parameters for our network we exploited the overlay files associated with the mammograms. These files mark the boundaries of masses or microcalcifications. A Gray Level Co-occurrence matrix (GLCM) was developed for a rectangular region enclosing each boundary and its statistical parameters computed. Five experiments were conducted in each fold of a 10-fold cross validation strategy. Testing accuracy of 100 percent was achieved in some experiments. %K genetic algorithms, genetic programming, cartesian genetic programming, mammogram image classification, GLCM, CGPANN, haralick’s parameters %R doi:10.1007/978-3-662-44654-6_20 %U http://dx.doi.org/10.1007/978-3-662-44654-6_20 %U http://dx.doi.org/doi:10.1007/978-3-662-44654-6_20 %P 203-213 %0 Conference Proceedings %T A comparison of semantic-based initialization methods for genetic programming %A Ahmad, Hammad %A Helmuth, Thomas %Y Cotta, Carlos %Y Ray, Tapabrata %Y Ishibuchi, Hisao %Y Obayashi, Shigeru %Y Filipic, Bogdan %Y Bartz-Beielstein, Thomas %Y Dick, Grant %Y Munetomo, Masaharu %Y Fernandez Alzueta, Silvino %Y Stuetzle, Thomas %Y Pellicer, Pablo Valledor %Y Lopez-Ibanez, Manuel %Y Tauritz, Daniel R. %Y Oliveto, Pietro S. %Y Weise, Thomas %Y Wrobel, Borys %Y Zamuda, Ales %Y Auger, Anne %Y Bect, Julien %Y Brockhoff, Dimo %Y Hansen, Nikolaus %Y Le Riche, Rodolphe %Y Picheny, Victor %Y Derbel, Bilel %Y Li, Ke %Y Li, Hui %Y Li, Xiaodong %Y Zapotecas, Saul %Y Zhang, Qingfu %Y Doncieux, Stephane %Y Duro, Richard %Y Auerbach, Joshua %Y de Vladar, Harold %Y Fernandez-Leiva, Antonio J. %Y Merelo, J. J. %Y Castillo-Valdivieso, Pedro A. %Y Camacho-Fernandez, David %Y Chavez de la O, Francisco %Y Akman, Ozgur %Y Alyahya, Khulood %Y Branke, Juergen %Y Doherty, Kevin %Y Fieldsend, Jonathan %Y Marano, Giuseppe Carlo %Y Lagaros, Nikos D. %Y Nakayama, Koichi %Y Oshima, Chika %Y Wagner, Stefan %Y Affenzeller, Michael %Y Naujoks, Boris %Y Volz, Vanessa %Y Tusar, Tea %Y Kerschke, Pascal %Y Alshammari, Riyad %Y Makanju, Tokunbo %Y Alexander, Brad %Y Haraldsson, Saemundur O. %Y Wagner, Markus %Y Woodward, John R. %Y Yoo, Shin %Y McCall, John %Y Sanchez-Pi, Nayat %Y Marti, Luis %Y Vasconcellos, Danilo %Y Nakata, Masaya %Y Stein, Anthony %Y Veerapen, Nadarajen %Y Liefooghe, Arnaud %Y Verel, Sebastien %Y Ochoa, Gabriela %Y Smith, Stephen L. %Y Cagnoni, Stefano %Y Patton, Robert M. %Y La Cava, William %Y Olson, Randal %Y Orzechowski, Patryk %Y Urbanowicz, Ryan %Y De Falco, Ivanoe %Y Della Cioppa, Antonio %Y Tarantino, Ernesto %Y Scafuri, Umberto %Y Baltus, P. G. M. %Y Iacca, Giovanni %Y Hallawa, Ahmed %Y Yaman, Anil %Y Rahat, Alma %Y Wang, Handing %Y Jin, Yaochu %Y Walker, David %Y Everson, Richard %Y Oyama, Akira %Y Shimoyama, Koji %Y Kumar, Hemant %Y Chiba, Kazuhisa %Y Palar, Pramudita Satria %S GECCO ’18: Proceedings of the Genetic and Evolutionary Computation Conference Companion %D 2018 %8 15 19 jul %I ACM %C Kyoto, Japan %F Ahmad:2018:GECCOcomp %X During the initialization step, a genetic programming (GP) system traditionally creates a population of completely random programs to populate the initial population. These programs almost always perform poorly in terms of their total error—some might not even output the correct data type. In this paper, we present new methods for initialization that attempt to generate programs that are somewhat relevant to the problem being solved and/or increase the initial diversity (both error and behavioural diversity) of the population prior to the GP run. By seeding the population—and thereby eliminating worthless programs and increasing the initial diversity of the population—we hope to improve the performance of the GP system. Here, we present two novel techniques for initialization (Lexicase Seeding and Pareto Seeding) and compare them to a previous method (Enforced Diverse Populations) and traditional, non-seeded initialization. Surprisingly, we found that none of the initialization m %K genetic algorithms, genetic programming %R doi:10.1145/3205651.3208218 %U http://dx.doi.org/doi:10.1145/3205651.3208218 %P 1878-1881 %0 Conference Proceedings %T CirFix: automatically repairing defects in hardware design code %A Ahmad, Hammad %A Huang, Yu %A Weimer, Westley %Y Falsafi, Babak %Y Ferdman, Michael %Y Lu, Shan %Y Wenisch, Thomas F. %S ASPLOS 2022: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems %D 2022 %8 28 feb 4 mar %I ACM %C Lausanne, Switzerland %F DBLP:conf/asplos/Ahmad0W22 %X CirFix, is a framework for automatically repairing defects in hardware designs implemented in languages like Verilog. We propose a novel fault localization approach based on assignments to wires and registers, and a fitness function tailored to the hardware domain to bridge the gap between software-level automated program repair and hardware descriptions. We also present a benchmark suite of 32 defect scenarios corresponding to a variety of hardware projects. Overall, CirFix produces plausible repairs for 21/32 and correct repairs for 16/32 of the defect scenarios. This repair rate is comparable to that of successful program repair approaches for software, indicating CirFix is effective at bringing over the benefits of automated program repair to the hardware domain for the first time. %K genetic algorithms, genetic programming, genetic improvement, automated program repair, APR, hardware designs, HDL benchmark %R doi:10.1145/3503222.3507763 %U https://doi.org/10.1145/3503222.3507763 %U http://dx.doi.org/doi:10.1145/3503222.3507763 %P 990-1003 %0 Conference Proceedings %T Digging into Semantics: Where Do Search-Based Software Repair Methods Search? %A Ahmad, Hammad %A Cashin, Padraic %A Forrest, Stephanie %A Weimer, Westley %Y Rudolph, Guenter %Y Kononova, Anna V. %Y Aguirre, Hernan E. %Y Kerschke, Pascal %Y Ochoa, Gabriela %Y Tusar, Tea %S Parallel Problem Solving from Nature - PPSN XVII - 17th International Conference, PPSN 2022, Proceedings, Part II %S Lecture Notes in Computer Science %D 2022 %8 sep 10 14 %V 13399 %I Springer %C Dortmund, Germany %F DBLP:conf/ppsn/AhmadCFW22 %X Search-based methods are a popular approach for automatically repairing software bugs, a field known as automated program repair (APR). There is increasing interest in empirical evaluation and comparison of different APR methods, typically measured as the rate of successful repairs on benchmark sets of buggy programs. Such evaluations, however, fail to explain why some approaches succeed and others fail. Because these methods typically use syntactic representations, i.e., source code, we know little about how the different methods explore their semantic spaces, which is relevant for assessing repair quality and understanding search dynamics. We propose an automated method based on program semantics, which provides quantitative and qualitative information about different APR search-based techniques. Our approach requires no manual annotation and produces both mathematical and human-understandable insights. In an empirical evaluation of 4 APR tools and 34 defects, we investigate the relationship between search-space exploration, semantic diversity and repair success, examining both the overall picture and how the tools search unfolds. Our results suggest that population diversity alone is not sufficient for finding repairs, and that searching in the right place is more important than searching broadly, highlighting future directions for the research community. %K genetic algorithms, genetic programming, genetic improvement, APR, SBSE, Semantic search spaces, Program repair, Patch diversity, Daikon, Defects4J %R doi:10.1007/978-3-031-14721-0_1 %U https://web.eecs.umich.edu/~weimerw/p/weimer-asplos2022.pdf %U http://dx.doi.org/doi:10.1007/978-3-031-14721-0_1 %P 3-18 %0 Journal Article %T Genetic Programming In Clusters %A Ahmad, Ishfaq %J IEEE Concurrency %D 2000 %8 jul \slash sep %V 8 %N 3 %I IEEE Computer Society %C Los Alamitos, CA, USA %@ 1092-3063 %F Ahmad:2000:CCGc %K genetic algorithms, genetic programming %9 journal article %R doi:10.1109/MCC.2000.10016 %U http://csdl.computer.org/comp/mags/pd/2000/03/p3toc.htm %U http://dx.doi.org/doi:10.1109/MCC.2000.10016 %P 10-11,13 %0 Conference Proceedings %T Evolving MIMO multi-layered artificial neural networks using grammatical evolution %A Ahmad, Qadeer %A Rafiq, Atif %A Raja, Muhammad Adil %A Javed, Noman %Y Hung, Chih-Cheng %Y Papadopoulos, George A. %S Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, SAC 2019 %D 2019 %8 apr 8 12 %I ACM %C Limassol, Cyprus %F conf/sac/AhmadRRJ19 %K genetic algorithms, genetic programming, grammatical evolution, ANN %R doi:10.1145/3297280.3297408 %U http://dx.doi.org/doi:10.1145/3297280.3297408 %P 1278-1285 %0 Journal Article %T Application of an artificial intelligence technique enhanced with intelligent water drops for monthly reference evapotranspiration estimation %A Ahmadi, Farshad %A Mehdizadeh, Saeid %A Mohammadi, Babak %A Pham, Quoc Bao %A Doan, Thi Ngoc Canh %A Vo, Ngoc Duong %J Agricultural Water Management %D 2021 %V 244 %@ 0378-3774 %F Ahmadi:2021:AWM %X Reference evapotranspiration (ET0) is one of the most important parameters, which is required in many fields such as hydrological, agricultural, and climatological studies. Therefore, its estimation via reliable and accurate techniques is a necessity. The present study aims to estimate the monthly ET0 time series of six stations located in Iran. To achieve this objective, gene expression programming (GEP) and support vector regression (SVR) were used as standalone models. A novel hybrid model was then introduced through coupling the classical SVR with an optimisation algorithm, namely intelligent water drops (IWD) (i.e., SVR$-$IWD). Two various types of scenarios were considered, including the climatic data- and antecedent ET0 data-based patterns. In the climatic data-based models, the effective climatic parameters were recognised by using two pre-processing techniques consisting of τ Kendall and entropy. It is worthy to mention that developing the hybrid SVR-IWD model as well as using the τ Kendall and entropy approaches to discern the most influential weather parameters on ET0 are the innovations of current research. The results illustrated that the applied pre-processing methods introduced different climatic inputs to feed the models. The overall results of present study revealed that the proposed hybrid SVR-IWD model outperformed the standalone SVR one under both the considered scenarios when estimating the monthly ET0. In addition to the mentioned models, two types of empirical equations were also used including the Hargreaves$-$Samani (H$-$S) and Priestley$-$Taylor (P$-$T) in their original and calibrated versions. It was concluded that the calibrated versions showed superior performances compared to their original ones. %K genetic algorithms, genetic programming, gene expression programming, empirical models, intelligent water drops, reference evapotranspiration, support vector regression %9 journal article %R doi:10.1016/j.agwat.2020.106622 %U https://www.sciencedirect.com/science/article/pii/S0378377420321697 %U http://dx.doi.org/doi:10.1016/j.agwat.2020.106622 %P 106622 %0 Journal Article %T Artificial neural network development by means of a novel combination of grammatical evolution and genetic algorithm %A Ahmadizar, Fardin %A Soltanian, Khabat %A AkhlaghianTab, Fardin %A Tsoulos, Ioannis %J Engineering Applications of Artificial Intelligence %D 2015 %8 mar %V 39 %@ 0952-1976 %F journals/eaai/AhmadizarSAT15 %X The most important problems with exploiting artificial neural networks (ANNs) are to design the network topology, which usually requires an excessive amount of expert’s effort, and to train it. In this paper, a new evolutionary-based algorithm is developed to simultaneously evolve the topology and the connection weights of ANNs by means of a new combination of grammatical evolution (GE) and genetic algorithm (GA). GE is adopted to design the network topology while GA is incorporated for better weight adaptation. The proposed algorithm needs to invest a minimal expert’s effort for customisation and is capable of generating any feedforward ANN with one hidden layer. Moreover, due to the fact that the generalisation ability of an ANN may decrease because of over fitting problems, the algorithm uses a novel adaptive penalty approach to simplify ANNs generated through the evolution process. As a result, it produces much simpler ANNs that have better generalization ability and are easy to implement. The proposed method is tested on some real world classification datasets, and the results are statistically compared against existing methods in the literature. The results indicate that our algorithm outperforms the other methods and provides the best overall performance in terms of the classification accuracy and the number of hidden neurons. The results also present the contribution of the proposed penalty approach in the simplicity and generalisation ability of the generated networks. %K genetic algorithms, genetic programming, grammatical evolution, Neural networks, ANN, Adaptive penalty approach, Classification problems %9 journal article %U http://www.sciencedirect.com/science/article/pii/S0952197614002759 %P 1-13 %0 Conference Proceedings %T Evolutionary fusion of local texture patterns for facial expression recognition %A Ahmed, Faisal %A Paul, Padma Polash %A Gavrilova, Marina L. %S 2015 IEEE International Conference on Image Processing (ICIP) %D 2015 %8 sep %F Ahmed:2015:ieeeICIP %X This paper presents a simple, yet effective facial feature descriptor based on evolutionary synthesis of different local texture patterns. Unlike the traditional face descriptors that exploit visually-meaningful facial features, the proposed method adopts a genetic programming-based feature fusion approach that uses different local texture patterns and a set of linear and nonlinear operators in order to synthesise new features. The strength of this approach lies in fusing the advantages of different state-of-the-art local texture descriptors and thus, obtaining more robust composite features. Recognition performance of the proposed method is evaluated using the Cohn-Kanade (CK) and the Japanese female facial expression (JAFFE) database. In our experiments, facial features synthesised based on the proposed approach yield an improved recognition performance, as compared to some well-known face feature descriptors. %K genetic algorithms, genetic programming %R doi:10.1109/ICIP.2015.7350956 %U http://dx.doi.org/doi:10.1109/ICIP.2015.7350956 %P 1031-1035 %0 Journal Article %T A novel genetic-programming based differential braking controller for an 8x8 combat vehicle %A Ahmed, Moataz %A El-Gindy, Moustafa %A Lang, Haoxiang %J International Journal of Dynamics and Control %D 2020 %V 8 %N 4 %F ahmed:2020:IJDC %X Lateral stability of multi-axle vehicle’s was not considered and studied widely despite its advantages and use in different fields such as transportation, commercial, and military applications. In this research, a novel adaptive Direct Yaw moment Control based on Genetic-Programming (GPDB) is developed and compared with an Adaptive Neuro-Fuzzy Inference System (ANFIS). In addition, a phase-plane analysis of the vehicles nonlinear model is also discussed to introduce the activation criteria to the proposed controller in order to prevent excessive control effort. The controller is evaluated through a series of severe maneuvers in the simulator. The developed GPDB resulting in comparable performance to the ANFIS controller with better implementation facility and design procedure, where a single equation replaces multiple fuzzy rules. The results show fidelity and the ability of the developed controller to stabilize the vehicle near limit-handling driving conditions %K genetic algorithms, genetic programming, Stability control, Direct yaw control, Differential braking, Adaptive neuro-fuzzy, Fuzzy logic %9 journal article %R doi:10.1007/s40435-020-00693-0 %U http://link.springer.com/article/10.1007/s40435-020-00693-0 %U http://dx.doi.org/doi:10.1007/s40435-020-00693-0 %0 Thesis %T Integrated Chassis Control Strategies For Multi-Wheel Combat Vehicle %A Ahmed, Moataz Aboelfadl %D 2021 %8 nov %C Oshawa, Ontario, Canada %C Department of Automotive and Mechatronics Engineering Faculty of Engineering and Applied Science, University of Ontario Institute of Technology %F Aboelfadl_Ahmed_Moataz %X Combat vehicles are exposed to high risks due to their high ground clearance and nature of operation in harsh environments. This requires robust stability controllers to cope with the rapid change and uncertainty of driving conditions on various terrains. Moreover, it is required to enhance vehicle stability and increase safety to reduce accidents fatality probability. This research focuses on investigating the effectiveness of different lateral stability controllers and their integration in enhancing the cornering performance of an 8x8 combat vehicle when driving at limited handling conditions. In this research, a new Active Rear Steering (ARS) stability controller for an 8x8 combat vehicle is introduced. This technique is extensively investigated to show its merits and effectiveness for human and autonomous operation. For human operation, the ARS is developed using Linear Quadratic Regulator (LQR) control method, which is compared with previous techniques. Furthermore, the controller is extended and tested for working in a rough and irregular road profile using a novel adaptive Integral Sliding Mode Controller (ISMC). In the case of autonomous operation, a frequency domain analysis is conducted to show the benefits of considering the steering of the rear axles in the path-following performance at different driving conditions. The study compared two different objectives for the controller; the first is including the steering of the rear axles in the path following controller, while the second is to integrate it as a stability controller with a front-steering path-following controller. In addition, this research introduces a novel Differential Braking (DB) controller. The proposed control prevents the excessive use of braking forces and consequently the longitudinal dynamics deterioration. Besides, it introduces an effective DB controller with less dependency and sensitivity to the reference yaw model. Eventually, two various Integrated Chassis Controllers (ICC) are developed and compared. The first is developed by integrating the ISMC-ARS with the DB controller using a fuzzy logic controller. The second ICC integrates the ISMC-ARS with a developed robust Torque Vectoring Controller (TVC). This integration is designed based on a performance map that shows the effective region of each controller using a new technique based on Machine Learning (ML). %K genetic algorithms, genetic programming, Chassis control, Lateral stability, Intelligent control, Multi-axle, Combat vehicles %9 Ph.D. thesis %U https://hdl.handle.net/10155/1380 %0 Conference Proceedings %T Genetic Programming for Biomarker Detection in Mass Spectrometry Data %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %Y Thielscher, Michael %Y Zhang, Dongmo %S 25th Joint Conference Australasian Conference on Artificial Intelligence, AI 2012 %S Lecture Notes in Computer Science %D 2012 %8 dec 4 7 %V 7691 %I Springer %C Sydney, Australia %F DBLP:conf/ausai/AhmedZP12 %X Classification of mass spectrometry (MS) data is an essential step for biomarker detection which can help in diagnosis and prognosis of diseases. However, due to the high dimensionality and the small sample size, classification of MS data is very challenging. The process of biomarker detection can be referred to as feature selection and classification in terms of machine learning. Genetic programming (GP) has been widely used for classification and feature selection, but it has not been effectively applied to biomarker detection in the MS data. In this study we develop a GP based approach to feature selection, feature extraction and classification of mass spectrometry data for biomarker detection. In this approach, we firstly use GP to reduce the redundant features by selecting a small number of important features and constructing high-level features, then we use GP to classify the data based on selected features and constructed features. This approach is examined and compared with three well known machine learning methods namely decision trees, naive Bayes and support vector machines on two biomarker detection data sets. The results show that the proposed GP method can effectively select a small number of important features from thousands of original features for these problems, the constructed high-level features can further improve the classification performance, and the GP method outperforms the three existing methods, namely naive Bayes, SVMs and J48, on these problems. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-35101-3_23 %U http://dx.doi.org/doi:10.1007/978-3-642-35101-3_23 %P 266-278 %0 Conference Proceedings %T Feature Selection and Classification of High Dimensional Mass Spectrometry Data: A Genetic Programming Approach %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %Y Vanneschi, Leonardo %Y Bush, William S. %Y Giacobini, Mario %S 11th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, EvoBIO 2013 %S LNCS %D 2013 %8 apr 3 5 %V 7833 %I Springer Verlag %C Vienna, Austria %F Ahmed:2013:evobio %X Biomarker discovery using mass spectrometry (MS) data is very useful in disease detection and drug discovery. The process of biomarker discovery in MS data must start with feature selection as the number of features in MS data is extremely large (e.g. thousands) while the number of samples is comparatively small. In this study, we propose the use of genetic programming (GP) for automatic feature selection and classification of MS data. This GP based approach works by using the features selected by two feature selection metrics, namely information gain (IG) and relief-f (REFS-F) in the terminal set. The feature selection performance of the proposed approach is examined and compared with IG and REFS-F alone on five MS data sets with different numbers of features and instances. Naive Bayes (NB), support vector machines (SVMs) and J48 decision trees (J48) are used in the experiments to evaluate the classification accuracy of the selected features. Meanwhile, GP is also used as a classification method in the experiments and its performance is compared with that of NB, SVMs and J48. The results show that GP as a feature selection method can select a smaller number of features with better classification performance than IG and REFS-F using NB, SVMs and J48. In addition, GP as a classification method also outperforms NB and J48 and achieves comparable or slightly better performance than SVMs on these data sets. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-37189-9_5 %U http://dx.doi.org/doi:10.1007/978-3-642-37189-9_5 %P 43-55 %0 Conference Proceedings %T Enhanced Feature Selection for Biomarker Discovery in LC-MS Data using GP %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %Y de la Fraga, Luis Gerardo %S 2013 IEEE Conference on Evolutionary Computation %D 2013 %8 jun 20 23 %V 1 %C Cancun, Mexico %F Ahmed:2013:CEC %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2013.6557621 %U http://dx.doi.org/doi:10.1109/CEC.2013.6557621 %P 584-591 %0 Conference Proceedings %T GPMS: A Genetic Programming Based Approach to Multiple Alignment of Liquid Chromatography-Mass Spectrometry Data %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %Y Esparcia-Alcazar, Anna Isabel %Y Mora, Antonio Miguel %S 17th European Conference on the Applications of Evolutionary Computation %S LNCS %D 2014 %8 23 25 apr %V 8602 %I Springer %C Granada %F Ahmed:evoapps14 %X Alignment of samples from Liquid chromatography-mass spectrometry (LC-MS) measurements has a significant role in the detection of biomarkers and in metabolomic studies.The machine drift causes differences between LC-MS measurements, and an accurate alignment of the shifts introduced to the same peptide or metabolite is needed. In this paper, we propose the use of genetic programming (GP) for multiple alignment of LC-MS data. The proposed approach consists of two main phases. The first phase is the peak matching where the peaks from different LC-MS maps (peak lists) are matched to allow the calculation of the retention time deviation. The second phase is to use GP for multiple alignment of the peak lists with respect to a reference. In this paper, GP is designed to perform multiple-output regression by using a special node in the tree which divides the output of the tree into multiple outputs. Finally, the peaks that show the maximum correlation after dewarping the retention times are selected to form a consensus aligned map.The proposed approach is tested on one proteomics and two metabolomics LC-MS datasets with different number of samples. The method is compared to several benchmark methods and the results show that the proposed approach outperforms these methods in three fractions of the protoemics dataset and the metabolomics dataset with a larger number of maps. Moreover, the results on the rest of the datasets are highly competitive with the other methods %K genetic algorithms, genetic programming %R doi:10.1007/978-3-662-45523-4_74 %U http://dx.doi.org/doi:10.1007/978-3-662-45523-4_74 %P 915-927 %0 Conference Proceedings %T A New GP-Based Wrapper Feature Construction Approach to Classification and Biomarker Identification %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %Y Coello Coello, Carlos A. %S Proceedings of the 2014 IEEE Congress on Evolutionary Computation %D 2014 %8 June 11 jul %C Beijing, China %@ 0-7803-8515-2 %F Ahmed:2014:CEC %X Mass spectrometry (MS) is a technology used for identification and quantification of proteins and metabolites. It helps in the discovery of proteomic or metabolomic biomarkers, which aid in diseases detection and drug discovery. The detection of biomarkers is performed through the classification of patients from healthy samples. The mass spectrometer produces high dimensional data where most of the features are irrelevant for classification. Therefore, feature reduction is needed before the classification of MS data can be done effectively. Feature construction can provide a means of dimensionality reduction and aims at improving the classification performance. In this paper, genetic programming (GP) is used for construction of multiple features. Two methods are proposed for this objective. The proposed methods work by wrapping a Random Forest (RF) classifier to GP to ensure the quality of the constructed features. Meanwhile, five other classifiers in addition to RF are used to test the impact of the constructed features on the performance of these classifiers. The results show that the proposed GP methods improved the performance of classification over using the original set of features in five MS data sets. %K genetic algorithms, genetic programming, Evolutionary programming, Biometrics, bioinformatics and biomedical applications %R doi:10.1109/CEC.2014.6900317 %U http://dx.doi.org/doi:10.1109/CEC.2014.6900317 %P 2756-2763 %0 Conference Proceedings %T Multiple feature construction for effective biomarker identification and classification using genetic programming %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %A Xue, Bing %Y Igel, Christian %Y Arnold, Dirk V. %Y Gagne, Christian %Y Popovici, Elena %Y Auger, Anne %Y Bacardit, Jaume %Y Brockhoff, Dimo %Y Cagnoni, Stefano %Y Deb, Kalyanmoy %Y Doerr, Benjamin %Y Foster, James %Y Glasmachers, Tobias %Y Hart, Emma %Y Heywood, Malcolm I. %Y Iba, Hitoshi %Y Jacob, Christian %Y Jansen, Thomas %Y Jin, Yaochu %Y Kessentini, Marouane %Y Knowles, Joshua D. %Y Langdon, William B. %Y Larranaga, Pedro %Y Luke, Sean %Y Luque, Gabriel %Y McCall, John A. W. %Y Montes de Oca, Marco A. %Y Motsinger-Reif, Alison %Y Ong, Yew Soon %Y Palmer, Michael %Y Parsopoulos, Konstantinos E. %Y Raidl, Guenther %Y Risi, Sebastian %Y Ruhe, Guenther %Y Schaul, Tom %Y Schmickl, Thomas %Y Sendhoff, Bernhard %Y Stanley, Kenneth O. %Y Stuetzle, Thomas %Y Thierens, Dirk %Y Togelius, Julian %Y Witt, Carsten %Y Zarges, Christine %S GECCO ’14: Proceedings of the 2014 conference on Genetic and evolutionary computation %D 2014 %8 December 16 jul %I ACM %C Vancouver, BC, Canada %F Ahmed:2014:GECCOa %X Biomarker identification, i.e., detecting the features that indicate differences between two or more classes, is an important task in omics sciences. Mass spectrometry (MS) provide a high throughput analysis of proteomic and metabolomic data. The number of features of the MS data sets far exceeds the number of samples, making biomarker identification extremely difficult. Feature construction can provide a means for solving this problem by transforming the original features to a smaller number of high-level features. This paper investigates the construction of multiple features using genetic programming (GP) for biomarker identification and classification of mass spectrometry data. In this paper, multiple features are constructed using GP by adopting an embedded approach in which Fisher criterion and p-values are used to measure the discriminating information between the classes. This produces nonlinear high-level features from the low-level features for both binary and multi-class mass spectrometry data sets. Meanwhile, seven different classifiers are used to test the effectiveness of the constructed features. The proposed GP method is tested on eight different mass spectrometry data sets. The results show that the high-level features constructed by the GP method are effective in improving the classification performance in most cases over the original set of features and the low-level selected features. In addition, the new method shows superior performance in terms of biomarker detection rate. %K genetic algorithms, genetic programming %R doi:10.1145/2576768.2598292 %U http://doi.acm.org/10.1145/2576768.2598292 %U http://dx.doi.org/doi:10.1145/2576768.2598292 %P 249-256 %0 Conference Proceedings %T Prediction of detectable peptides in MS data using genetic programming %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %Y Igel, Christian %Y Arnold, Dirk V. %Y Gagne, Christian %Y Popovici, Elena %Y Auger, Anne %Y Bacardit, Jaume %Y Brockhoff, Dimo %Y Cagnoni, Stefano %Y Deb, Kalyanmoy %Y Doerr, Benjamin %Y Foster, James %Y Glasmachers, Tobias %Y Hart, Emma %Y Heywood, Malcolm I. %Y Iba, Hitoshi %Y Jacob, Christian %Y Jansen, Thomas %Y Jin, Yaochu %Y Kessentini, Marouane %Y Knowles, Joshua D. %Y Langdon, William B. %Y Larranaga, Pedro %Y Luke, Sean %Y Luque, Gabriel %Y McCall, John A. W. %Y Montes de Oca, Marco A. %Y Motsinger-Reif, Alison %Y Ong, Yew Soon %Y Palmer, Michael %Y Parsopoulos, Konstantinos E. %Y Raidl, Guenther %Y Risi, Sebastian %Y Ruhe, Guenther %Y Schaul, Tom %Y Schmickl, Thomas %Y Sendhoff, Bernhard %Y Stanley, Kenneth O. %Y Stuetzle, Thomas %Y Thierens, Dirk %Y Togelius, Julian %Y Witt, Carsten %Y Zarges, Christine %S GECCO Comp ’14: Proceedings of the 2014 conference companion on Genetic and evolutionary computation companion %D 2014 %8 December 16 jul %I ACM %C Vancouver, BC, Canada %F Ahmed:2014:GECCOcomp %X The use of mass spectrometry to verify and quantify biomarkers requires the identification of the peptides that can be detectable. In this paper, we propose the use of genetic programming (GP) to measure the detection probability of the peptides. The new GP method is tested and verified on two different yeast data sets with increasing complexity and shows improved performance over other state-of-art classification and feature selection algorithms. %K genetic algorithms, genetic programming, biological and biomedical applications: Poster %R doi:10.1145/2598394.2598421 %U http://doi.acm.org/10.1145/2598394.2598421 %U http://dx.doi.org/doi:10.1145/2598394.2598421 %P 37-38 %0 Journal Article %T Improving Feature Ranking for Biomarker Discovery in Proteomics Mass Spectrometry Data using Genetic Programming %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %J Connection Science %D 2014 %V 26 %N 3 %@ 0954-0091 %F Ahmed:2014:CS %X Feature selection on mass spectrometry (MS) data is essential for improving classification performance and biomarker discovery. The number of MS samples is typically very small compared with the high dimensionality of the samples, which makes the problem of biomarker discovery very hard. In this paper, we propose the use of genetic programming for biomarker detection and classification of MS data. The proposed approach is composed of two phases: in the first phase, feature selection and ranking are performed. In the second phase, classification is performed. The results show that the proposed method can achieve better classification performance and biomarker detection rate than the information gain (IG) based and the RELIEF feature selection methods. Meanwhile, four classifiers, Naive Bayes, J48 decision tree, random forest and support vector machines, are also used to further test the performance of the top ranked features. The results show that the four classifiers using the top ranked features from the proposed method achieve better performance than the IG and the RELIEF methods. Furthermore, GP also outperforms a genetic algorithm approach on most of the used data sets. %K genetic algorithms, genetic programming, biomarker discovery, feature selection, classification %9 journal article %R doi:10.1080/09540091.2014.906388 %U http://dx.doi.org/doi:10.1080/09540091.2014.906388 %P 215-243 %0 Conference Proceedings %T Genetic Programming for Measuring Peptide Detectability %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %A Xue, Bing %Y Dick, Grant %Y Browne, Will N. %Y Whigham, Peter A. %Y Zhang, Mengjie %Y Bui, Lam Thu %Y Ishibuchi, Hisao %Y Jin, Yaochu %Y Li, Xiaodong %Y Shi, Yuhui %Y Singh, Pramod %Y Tan, Kay Chen %Y Tang, Ke %S Simulated Evolution and Learning - 10th International Conference, SEAL 2014, Dunedin, New Zealand, December 15-18, 2014. Proceedings %S Lecture Notes in Computer Science %D 2014 %V 8886 %I Springer %F conf/seal/AhmedZPX14 %K genetic algorithms, genetic programming %U http://dx.doi.org/10.1007/978-3-319-13563-2 %P 593-604 %0 Conference Proceedings %T A Multi-objective Genetic Programming Biomarker Detection Approach in Mass Spectrometry Data %A Ahmed, Soha %A Zhang, Mengjie %A Peng, Lifeng %A Xue, Bing %Y Squillero, Giovanni %Y Burelli, Paolo %S 19th European Conference on Applications of Evolutionary Computation, EvoApplications 2016 %S Lecture Notes in Computer Science %D 2016 %8 mar 30 – apr 1 %V 9597 %I Springer %C Porto, Portugal %F conf/evoW/AhmedZPX16 %X Mass spectrometry is currently the most commonly used technology in biochemical research for proteomic analysis. The main goal of proteomic profiling using mass spectrometry is the classification of samples from different clinical states. This requires the identification of proteins or peptides (biomarkers) that are expressed differentially between different clinical states. However, due to the high dimensionality of the data and the small number of samples, classification of mass spectrometry data is a challenging task. Therefore, an effective feature manipulation algorithm either through feature selection or construction is needed to enhance the classification performance and at the same time minimise the number of features. Most of the feature manipulation methods for mass spectrometry data treat this problem as a single objective task which focuses on improving the classification performance. This paper presents two new methods for biomarker detection through multi-objective feature selection and feature construction. The results show that the proposed multi-objective feature selection method can obtain better subsets of features than the single-objective algorithm and two traditional multi-objective approaches for feature selection. Moreover, the multi-objective feature construction algorithm further improves the performance over the multi-objective feature selection algorithm. This paper is the first multi-objective genetic programming approach for biomarker detection in mass spectrometry data %K genetic algorithms, genetic programming %R doi:10.1007/978-3-319-31204-0_8 %U http://dx.doi.org/doi:10.1007/978-3-319-31204-0_8 %P 106-122 %0 Journal Article %T Analysis and Optimization of Machining Hardened Steel AISI 4140 with Self-Propelled Rotary Tools %A Ahmed, Waleed %A Hegab, Hussien %A Mohany, Atef %A Kishawy, Hossam %J Materials %D 2021 %V 14 %N 20 %@ 1996-1944 %F ahmed:2021:Materials %X It is necessary to improve the machinability of difficult-to-cut materials such as hardened steel, nickel-based alloys, and titanium alloys as these materials offer superior properties such as chemical stability, corrosion resistance, and high strength to weight ratio, making them indispensable for many applications. Machining with self-propelled rotary tools (SPRT) is considered one of the promising techniques used to provide proper tool life even under dry conditions. In this work, an attempt has been performed to analyse, model, and optimise the machining process of AISI 4140 hardened steel using self-propelled rotary tools. Experimental analysis has been offered to (a) compare the fixed and rotary tools performance and (b) study the effect of the inclination angle on the surface quality and tool wear. Moreover, the current study implemented some artificial intelligence-based approaches (i.e., genetic programming and NSGA-II) to model and optimise the machining process of AISI 4140 hardened steel with self-propelled rotary tools. The feed rate, cutting velocity, and inclination angle were the selected design variables, while the tool wear, surface roughness, and material removal rate (MRR) were the studied outputs. The optimal surface roughness was obtained at a cutting speed of 240 m/min, an inclination angle of 20?, and a feed rate of 0.1 mm/rev. In addition, the minimum flank tool wear was observed at a cutting speed of 70 m/min, an inclination angle of 10?, and a feed rate of 0.15 mm/rev. Moreover, different weights have been assigned for the three studied outputs to offer different optimised solutions based on the designer’s interest (equal-weighted, finishing, and productivity scenarios). It should be stated that the findings of the current work offer valuable recommendations to select the optimised cutting conditions when machining hardened steel AISI 4140 within the selected ranges. %K genetic algorithms, genetic programming, modeling, machining, optimization, rotary tools %9 journal article %R doi:10.3390/ma14206106 %U https://www.mdpi.com/1996-1944/14/20/6106 %U http://dx.doi.org/doi:10.3390/ma14206106 %0 Journal Article %T Acoustic monitoring of an aircraft auxiliary power unit %A Ahmed, Umair %A Ali, Fakhre %A Jennions, Ian %J ISA Transactions %D 2023 %@ 0019-0578 %F AHMED:2023:isatra %X In this paper, the development and implementation of a novel approach for fault detection of an aircraft auxiliary power unit (APU) has been demonstrated. The developed approach aims to target the proactive identification of faults, in order to streamline the required maintenance and maximize the aircraft’s operational availability. The existing techniques rely heavily on the installation of multiple types of intrusive sensors throughout the APU and therefore present a limited potential for deployment on an actual aircraft due to space constraints, accessibility issues as well as associated development and certification requirements. To overcome these challenges, an innovative approach based on non-intrusive sensors i.e., microphones in conjunction with appropriate feature extraction, classification, and regression techniques, has been successfully demonstrated for online fault detection of an APU. The overall approach has been implemented and validated based on the experimental test data acquired from Cranfield University’s Boeing 737-400 aircraft, including the quantification of sensor location sensitivities on the efficacy of the acquired models. The findings of the overall analysis suggest that the acoustic-based models can accurately enable near real-time detection of faulty conditions i.e., Inlet Guide Vane malfunction, reduced mass flows through the Load Compressor and Bleed Valve malfunction, using only two microphones installed in the periphery of the APU. This study constitutes an enabling technology for robust, cost-effective, and efficient in-situ monitoring of an aircraft APU and potentially other associated thermal systems i.e., environmental control system, fuel system, and engines %K genetic algorithms, genetic programming, Aircraft, Auxiliary power unit, Condition monitoring, Acoustics, Signal processing, Machine learning, Sensors, Feature extraction, Fault detection, Microphones %9 journal article %R doi:10.1016/j.isatra.2023.01.014 %U https://www.sciencedirect.com/science/article/pii/S0019057823000149 %U http://dx.doi.org/doi:10.1016/j.isatra.2023.01.014 %0 Journal Article %T Towards Early Diagnosis and Intervention: An Ensemble Voting Model for Precise Vital Sign Prediction in Respiratory Disease %A Ahmed, Usman %A Lin, Jerry Chun-Wei %A Srivastava, Gautam %J IEEE Journal of Biomedical and Health Informatics %D 2023 %@ 2168-2208 %F Ahmed:JBHI %X Worldwide, cardiovascular and chronic respiratory diseases account for approximately 19 million deaths annually. Evidence indicates that the ongoing COVID-19 pandemic directly contributes to increased blood pressure, cholesterol, as well as blood glucose levels. Timely screening of critical physiological vital signs benefits both healthcare providers and individuals by detecting potential health issues. This study aims to implement a machine learning-based prediction and classification system to forecast vital signs associated with cardiovascular and chronic respiratory diseases. The system predicts patients’ health status and notifies caregivers and medical professionals when necessary. Using real-world data, a linear regression model inspired by the Facebook Prophet model was developed to predict vital signs for the upcoming 180 seconds. With 180 seconds of lead time, caregivers can potentially save patients’ lives through early diagnosis of their health conditions. For this purpose, a Naive Bayes classification model, a Support Vector Machine model, a Random Forest model, and genetic programming-based hyper tunning were employed. The proposed model outdoes previous attempts at vital sign prediction. Compared with alternative methods, the Facebook Prophet model has the best mean square in predicting vital signs. A hyperparameter-tuning is used to refine the model, yielding improved short- and long-term outcomes for each and every vital sign. Furthermore, the F-measure for the proposed classification model is 0.98 with an increase of 0.21. The incorporation of additional elements, such as momentum indicators, could increase the model’s flexibility with calibration. The findings of this study demonstrate that the proposed model is more accurate in predicting vital signs and trends. %K genetic algorithms, genetic programming, Diseases, Medical diagnostic imaging, Medical services, Heart, Predictive models, Machine learning, Decision trees, Artificial intelligence, Sensor readings, Heart disease, Long-term lung disease %9 journal article %R doi:10.1109/JBHI.2023.3270888 %U http://dx.doi.org/doi:10.1109/JBHI.2023.3270888 %0 Journal Article %T Multivariate time-series sensor vital sign forecasting of cardiovascular and chronic respiratory diseases %A Ahmed, Usman %A Lin, Jerry Chun-Wei %A Srivastava, Gautam %J Sustainable Computing: Informatics and Systems %D 2023 %V 38 %@ 2210-5379 %F AHMED:2023:suscom %X Approximately 19 million people die each year from cardiovascular and chronic respiratory diseases. As a result of the recent Covid-19 epidemic, blood pressure, cholesterol, and blood sugar levels have risen. Not only do healthcare institutions benefit from studying physiological vital signs, but individuals also benefit from being alerted to health problems in a timely manner. This study uses machine learning to categorize and predict cardiovascular and chronic respiratory diseases. By predicting a patient’s health status, caregivers and medical professionals can be alerted when needed. We predicted vital signs for 180 seconds using real-world vital sign data. A person’s life can be saved if caregivers react quickly and anticipate emergencies. The tree-based pipeline optimization method (TPOT) is used instead of manually adjusting machine learning classifiers. This paper focuses on optimizing classification accuracy by combining feature pre-processors and machine learning models with TPOT genetic programming making use of linear and Prophet models to predict important indicators. The TPOT tuning parameter combines predicted values with classical classification models such as Naive Bayes, Support Vector Machines, and Random Forests. As a result of this study, we show the importance of categorizing and increasing the accuracy of predictions. The proposed model achieves its adaptive behavior by conceptually incorporating different machine learning classifiers. We compare the proposed model with several state-of-the-art algorithms using a large amount of training data. Test results at the University of Queensland using 32 patient’s data showed that the proposed model outperformed existing algorithms, improving the classification of cardiovascular disease from 0.58 to 0.71 and chronic respiratory disease from 0.49 to 0.70, respectively, while minimizing the mean percent error in vital signs. Our results suggest that the Facebook Prophet prediction model in conjunction with the TPOT classification model can correctly diagnose a patient’s health status based on abnormal vital signs and enables patients to receive prompt medical attention %K genetic algorithms, genetic programming, Machine learning, Sensor data, Cardiovascular disease, Chronic respiratory disease. TPOT %9 journal article %R doi:10.1016/j.suscom.2023.100868 %U https://www.sciencedirect.com/science/article/pii/S2210537923000239 %U http://dx.doi.org/doi:10.1016/j.suscom.2023.100868 %P 100868 %0 Conference Proceedings %T A Genetic Programming Approach to Data Clustering %A Ahn, Chang Wook %A Oh, Sanghoun %A Oh, Moonyoung %Y Kim, Tai-Hoon %Y Adeli, Hojjat %Y Grosky, William I. %Y Pissinou, Niki %Y Shih, Timothy K. %Y Rothwell, Edward J. %Y Kang, Byeong Ho %Y Shin, Seung-Jung %S Proceedings of the International Conference on Multimedia, Computer Graphics and Broadcasting (MulGraB 2011) Part II %S Communications in Computer and Information Science %D 2011 %8 dec 8 10 %V 263 %I Springer %C Jeju Island, Korea %F conf/fgit/AhnOO11 %O Held as Part of the Future Generation Information Technology Conference, FGIT 2011, in Conjunction with GDC 2011 %X This paper presents a genetic programming (GP) to data clustering. The aim is to accurately classify a set of input data into their genuine clusters. The idea lies in discovering a mathematical function on clustering regularities and then use the rule to make a correct decision on the entities of each cluster. To this end, GP is incorporated into the clustering procedures. Each individual is represented by a parsing tree on the program set. Fitness function evaluates the quality of clustering with regard to similarity criteria. Crossover exchanges sub-trees between parental candidates in a positionally independent fashion. Mutation introduces (in part) a new sub-tree with a low probability. The variation operators (i.e., crossover, mutation) offer an effective search capability to obtain the improved quality of solution and the enhanced speed of convergence. Experimental results demonstrate that the proposed approach outperforms a well-known reference. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-27186-1_15 %U http://dx.doi.org/doi:10.1007/978-3-642-27186-1_15 %P 123-132 %0 Journal Article %T A genetic algorithm for fitting Lorentzian line shapes in Mossbauer spectra %A Ahonen, Hannu %A de Souza Jr., Paulo A. %A Garg, Vijayendra Kumar %J Nuclear Instruments and Methods in Physics Research B %D 1997 %8 May %V 124 %@ 0168583X %F Aho97 %X A genetic algorithm was implemented for finding an approximative solution to the problem of fitting a combination of Lorentzian lines to a measured Mossbauer spectrum. This iterative algorithm exploits the idea of letting several solutions (individuals) compete with each other for the opportunity of being selected to create new solutions (reproduction). Each solution was represend as a string of binary digits (chromossome). In addition, the bits in the new solutions may be switched randomly from zero to one or conversely (mutation). The input of the program that implements the genetic algorithm consists of the measured spectrum, the maximum velocity, the peak positions and the expected number of Lorentzian lines in the spectrum. Each line is represented with the help of three variables, which correspond to its intensity, full line width at hald maxima and peak position. An additional parameter was associated to the background level in the spectrum. A chi-2 test was used for determining the quality of each parameter combination (fitness). The results obtained seem to be very promising and encourage to further development of the algorithm and its implementation. %K genetic algorithms %9 journal article %P 633-638 %0 Conference Proceedings %T AutoQP: Genetic Programming for Quantum Programming %A Ahsan, Usama %A ul Amir Afsar Minhas, Fayyaz %S 2020 17th International Bhurban Conference on Applied Sciences and Technology (IBCAST) %D 2020 %8 jan %F Ahsan:2020:IBCAST %X Quantum computing is a new era in the field of computation which makes use of quantum mechanical phenomena such as superposition, entanglement, and quantum annealing. It is a very promising field and has given a new paradigm to efficiently solve complex computational problems. However, programming quantum computers is a difficult task. In this research, we have developed a system called AutoQP which can write quantum computer code through genetic programming on a classical computer provided the input and expected output of a quantum program. We have tested AutoQP on two different quantum algorithms: Deutsch Problem and the Bernstein-Vazirani problem. In our experimental analysis, AutoQP was able to generate quantum programs for solving both problems. The code generated by AutoQP was successfully tested on actual IBM quantum computers as well. It is expected that the proposed system can be very useful for the general development of quantum programs based on the IBM gate model. The source code for the proposed system is available at the URL: https://github.com/usamaahsan93/AutoQP. %K genetic algorithms, genetic programming %R doi:10.1109/IBCAST47879.2020.9044554 %U http://dx.doi.org/doi:10.1109/IBCAST47879.2020.9044554 %P 378-382 %0 Journal Article %T A Survey of Genetic Programming and Its Applications %A Ahvanooey, Milad Taleby %A Li, Qianmu %A Wu, Ming %A Wang, Shuo %J KSII Trans. Internet Inf. Syst. %D 2019 %V 13 %N 4 %F DBLP:journals/itiis/AhvanooeyLWW19 %K genetic algorithms, genetic programming %9 journal article %R doi:10.3837/tiis.2019.04.002 %U https://doi.org/10.3837/tiis.2019.04.002 %U http://dx.doi.org/doi:10.3837/tiis.2019.04.002 %P 1765-1794 %0 Thesis %T QoS-aware web service composition using genetic algorithms %A Ai, Lifeng %D 2011 %8 jun %C Australia %C Queensland University of Technology %F Lifeng_Ai_Thesis %X Web service technology is increasingly being used to build various e-Applications, in domains such as e-Business and e-Science. Characteristic benefits of web service technology are its inter-operability, decoupling and just-in-time integration. Using web service technology, an e-Application can be implemented by web service composition, by composing existing individual web services in accordance with the business process of the application. This means the application is provided to customers in the form of a value-added composite web service. An important and challenging issue of web service composition, is how to meet Quality-of-Service (QoS) requirements. This includes customer focused elements such as response time, price, throughput and reliability as well as how to best provide QoS results for the composites. This in turn best fulfils customers’ expectations and achieves their satisfaction. Fulfilling these QoS requirements or addressing the QoS-aware web service composition problem is the focus of this project. From a computational point of view, QoS-aware web service composition can be transformed into diverse optimisation problems. These problems are characterised as complex, large-scale, highly constrained and multi-objective problems. We therefore use genetic algorithms (GAs) to address QoS-based service composition problems. More precisely, this study addresses three important subproblems of QoS-aware web service composition; QoS-based web service selection for a composite web service accommodating constraints on inter-service dependence and conflict, QoS-based resource allocation and scheduling for multiple composite services on hybrid clouds, and performance-driven composite service partitioning for decentralised execution. Based on operations research theory, we model the three problems as a constrained optimisation problem, a resource allocation and scheduling problem, and a graph partitioning problem, respectively. Then, we present novel GAs to address these problems. We also conduct experiments to evaluate the performance of the new GAs. Finally, verification experiments are performed to show the correctness of the GAs. The major outcomes from the first problem are three novel GAs: a penaltybased GA, a min-conflict hill-climbing repairing GA, and a hybrid GA. These GAs adopt different constraint handling strategies to handle constraints on interservice dependence and conflict. This is an important factor that has been largely ignored by existing algorithms that might lead to the generation of infeasible composite services. Experimental results demonstrate the effectiveness of our GAs for handling the QoS-based web service selection problem with constraints on inter-service dependence and conflict, as well as their better scalability than the existing integer programming-based method for large scale web service selection problems. The major outcomes from the second problem has resulted in two GAs; a random-key GA and a cooperative coevolutionary GA (CCGA). Experiments demonstrate the good scalability of the two algorithms. In particular, the CCGA scales well as the number of composite services involved in a problem increases, while no other algorithms demonstrate this ability. The findings from the third problem result in a novel GA for composite service partitioning for decentralised execution. Compared with existing heuristic algorithms, the new GA is more suitable for a large-scale composite web service program partitioning problems. In addition, the GA outperforms existing heuristic algorithms, generating a better deployment topology for a composite web service for decentralised execution. These effective and scalable GAs can be integrated into QoS-based management tools to facilitate the delivery of feasible, reliable and high quality composite web services. %K genetic algorithms, quality of service, web services, composite web services, optimisation %9 Ph.D. thesis %U http://eprints.qut.edu.au/46666/1/Lifeng_Ai_Thesis.pdf %0 Conference Proceedings %T Cooperative Co-evolution Inspired Operators for Classical GP Schemes %A Aichour, Malek %A Lutton, Evelyne %Y Krasnogor, Natalio %Y Nicosia, Giuseppe %Y Pavone, Mario %Y Pelta, David %S Proceedings of International Workshop on Nature Inspired Cooperative Strategies for Optimization (NICSO ’07) %S Studies in Computational Intelligence %D 2007 %8 August 10 nov %V 129 %I Springer %C Acireale, Italy %F Aichour:2007:NICSO %X This work is a first step toward the design of a cooperative-coevolution GP for symbolic regression, which first output is a selective mutation operator for classical GP. Cooperative co-evolution techniques rely on the imitation of cooperative capabilities of natural populations and have been successfully applied in various domains to solve very complex optimisation problems. It has been proved on several applications that the use of two fitness measures (local and global) within an evolving population allow to design more efficient optimization schemes. We currently investigate the use of a two-level fitness measurement for the design of operators, and present in this paper a selective mutation operator. Experimental analysis on a symbolic regression problem give evidence of the efficiency of this operator in comparison to classical subtree mutation %K genetic algorithms, genetic programming %R doi:10.1007/978-3-540-78987-1_16 %U http://dx.doi.org/doi:10.1007/978-3-540-78987-1_16 %P 169-178 %0 Conference Proceedings %T A Genetic Programming Approach to Automatically Construct Informative Attributes for Mammographic Density Classification %A Ain, Qurrat Ul %A Xue, Bing %A Al-Sahaf, Harith %A Zhang, Mengjie %S 2022 IEEE International Conference on Data Mining Workshops (ICDMW) %D 2022 %8 nov %F Ain:2022:ICDMW %X Breast density is widely used as an initial indicator of developing breast cancer. At present, current classification methods for mammographic density usually require manual operations or expert knowledge that makes them expensive in real-time situations. Such methods achieve only moderate classification accuracy due to the limited model capacity and computational resources. In addition, most existing studies focus on improving classification accuracy using only raw images or the entire set of original attributes and remain unable to identify hidden patterns or causal information necessary to discriminate breast density classes. It is challenging to find high-quality knowledge when some attributes defining the data space are redundant or irrelevant. In this study, we present a novel attribute construction method using genetic programming (GP) for the task of breast density classification. To extract informative features from the raw mammographic images, wavelet decomposition, local binary patterns, and histogram of oriented gradients are used to include texture, local and global image properties. The study evaluates the goodness of the proposed method on two benchmark real-world mammographic image datasets and compares the results of the proposed GP method with eight conventional classification methods. The experimental results reveal that the proposed method significantly outperforms most of the commonly used classification methods in binary and multi-class classification tasks. Furthermore, the study shows the potential of G P for mammographic breast density classification by interpreting evolved attributes that highlight important breast density characteristics. %K genetic algorithms, genetic programming %R doi:10.1109/ICDMW58026.2022.00057 %U http://dx.doi.org/doi:10.1109/ICDMW58026.2022.00057 %P 378-387 %0 Journal Article %T Automatically Diagnosing Skin Cancers From Multimodality Images Using Two-Stage Genetic Programming %A Ain, Qurrat Ul %A Al-Sahaf, Harith %A Xue, Bing %A Zhang, Mengjie %J IEEE Transactions on Cybernetics %D 2022 %@ 2168-2275 %F Ain:2022:ieeeTC %X Developing a computer-aided diagnostic system for detecting various skin malignancies from images has attracted many researchers. Unlike many machine-learning approaches, such as artificial neural networks, genetic programming (GP) automatically evolves models with flexible representation. GP successfully provides effective solutions using its intrinsic ability to select prominent features (i.e., feature selection) and build new features (i.e., feature construction). Existing approaches have used GP to construct new features from the complete set of original features and the set of operators. However, the complete set of features may contain redundant or irrelevant features that do not provide useful information for classification. This study aims to develop a two-stage GP method, where the first stage selects prominent features, and the second stage constructs new features from these selected features and operators, such as multiplication in a wrapper approach to improve the classification performance. To include local, global, texture, color, and multiscale image properties of skin images, GP selects and constructs features extracted from local binary patterns and pyramid-structured wavelet decomposition. The accuracy of this GP method is assessed using two real-world skin image datasets captured from the standard camera and specialized instruments, and compared with commonly used classification algorithms, three state of the art, and an existing embedded GP method. The results reveal that this new approach of feature selection and feature construction effectively helps improve the performance of the machine-learning classification algorithms. Unlike other black-box models, the evolved models by GP are interpretable; therefore, the proposed method can assist dermatologists to identify prominent features, which has been shown by further analysis on the evolved models. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1109/TCYB.2022.3182474 %U http://dx.doi.org/doi:10.1109/TCYB.2022.3182474 %0 Journal Article %T Genetic programming for automatic skin cancer image classification %A Ain, Qurrat Ul %A Al-Sahaf, Harith %A Xue, Bing %A Zhang, Mengjie %J Expert Systems with Applications %D 2022 %V 197 %@ 0957-4174 %F AIN:2022:eswa %X Developing a computer-aided diagnostic system for detecting various types of skin malignancies from images has attracted many researchers. However, analyzing the behaviors of algorithms is as important as developing new systems in order to establish the effectiveness of a system in real-time situations which impacts greatly how well it can assist the dermatologist in making a diagnosis. Unlike many machine learning approaches such as Artificial Neural Networks, Genetic Programming (GP) automatically evolves models with its dynamic representation and flexibility. This study aims at analyzing recently developed GP-based approaches to skin image classification. These approaches have used the intrinsic feature selection and feature construction ability of GP to effectively construct informative features from a variety of pre-extracted features. These features encompass local, global, texture, color and multi-scale image properties of skin images. The performance of these GP methods is assessed using two real-world skin image datasets captured from standard camera and specialized instruments, and compared with six commonly used classification algorithms as well as existing GP methods. The results reveal that these constructed features greatly help improve the performance of the machine learning classification algorithms. Unlike ’black-box’ algorithms like deep neural networks, GP models are interpretable, therefore, our analysis shows that these methods can help dermatologists identify prominent skin image features. Further, it can help researchers identify suitable feature extraction methods for images captured from a specific instrument. Being fast, these methods can be deployed for making a quick and effective diagnosis in actual clinic situations %K genetic algorithms, genetic programming, Image classification, Dimensionality reduction, Feature selection, Feature construction %9 journal article %R doi:10.1016/j.eswa.2022.116680 %U https://www.sciencedirect.com/science/article/pii/S0957417422001634 %U http://dx.doi.org/doi:10.1016/j.eswa.2022.116680 %P 116680 %0 Conference Proceedings %T A New Genetic Programming Representation for Feature Learning in Skin Cancer Detection %A Ain, Qurrat Ul %A Al-Sahaf, Harith %A Xue, Bing %A Zhang, Mengjie %Y Silva, Sara %Y Paquete, Luis %Y Vanneschi, Leonardo %Y Lourenco, Nuno %Y Zamuda, Ales %Y Kheiri, Ahmed %Y Liefooghe, Arnaud %Y Xue, Bing %Y Bi, Ying %Y Pillay, Nelishia %Y Moser, Irene %Y Guijt, Arthur %Y Catarino, Jessica %Y Garcia-Sanchez, Pablo %Y Trujillo, Leonardo %Y Silva, Carla %Y Veerapen, Nadarajen %S Proceedings of the 2023 Genetic and Evolutionary Computation Conference %S GECCO ’23 %D 2023 %8 15 19 jul %I Association for Computing Machinery %C Lisbon, Portugal %F ain:2023:GECCOcomp %X The process of automatically extracting informative high-level features from skin cancer images is enhanced by integrating well-developed feature descriptors into learning algorithms. This paper develops a new genetic programming-based feature learning approach to automatically select and combine six well-developed descriptors to extract high-level features for skin cancer image classification. The new approach can automatically learn various global features for image classification. The experimental results show that the new approach achieves significantly better classification performance than the baseline approach and six commonly used feature descriptors on two real-world skin image datasets. %K genetic algorithms, genetic programming, feature learning, feature extraction, melanoma detection, image classification: Poster %R doi:10.1145/3583133.3590550 %U http://dx.doi.org/doi:10.1145/3583133.3590550 %P 707-710 %0 Conference Proceedings %T Skin Cancer Detection with Multimodal Data: A Feature Selection Approach Using Genetic Programming %A Ain, Qurrat Ul %A Xue, Bing %A Al-Sahaf, Harith %A Zhang, Mengjie %S Australasian Conference on Data Science and Machine Learning, AusDM 2023 %D 2023 %I Springer %F ain:2023:AusDM %K genetic algorithms, genetic programming %R doi:10.1007/978-981-99-8696-5_18 %U http://link.springer.com/chapter/10.1007/978-981-99-8696-5_18 %U http://dx.doi.org/doi:10.1007/978-981-99-8696-5_18 %0 Conference Proceedings %T Exploring Genetic Programming Models in Computer-Aided Diagnosis of Skin Cancer Images %A Ain, Qurrat UI %A Al-Sahaf, Harith %A Xue, Bing %A Zhang, Mengjie %Y Xue, Bing %S 2024 IEEE Congress on Evolutionary Computation (CEC) %D 2024 %8 30 jun 5 jul %I IEEE %C Yokohama, Japan %F ain:2024:CEC %X Extracting important information from complex skin lesion images is vital to effectively distinguish between different types of skin cancer images. In addition to providing high classification performance, such computer-aided diagnostic methods are needed where the models are interpretable and can provide knowledge about the discriminative features in skin lesion images. This underlying information can significantly assist dermatologists in identifying a particular stage or type of cancer. With its flexible representation and global search abilities, Genetic Programming (GP) is an ideal learning al-gorithm to evolve interpretable models and identify important features with significant information to discriminate between skin cancer classes. This paper provides an in-depth analysis of a recent GP-based feature learning method where different well-developed feature descriptors are integrated into the learning algorithms to extract high-level features for skin cancer image classification. The study explores the effectiveness of using feature learning for this complex task and designing program structure to suit the problem domain as it has shown promising results compared to commonly used feature descriptors and an existing GP-based feature learning method developed for general image classification. This study analyses the GP-evolved models to identify the prominent features and most effective feature descriptors important for the classification of these skin cancer images. The evolved models are interpretable, they provide knowledge that can assist dermatologists in making diagnoses in real-time clinical situations by identifying prominent skin cancer characteristics captured by the feature descriptors and learnt during the evolutionary process. %K genetic algorithms, genetic programming, Representation learning, Visualization, Computational modeling, Feature extraction, Skin, Lesions, Image Classification, Skin Cancer Detection %R doi:10.1109/CEC60901.2024.10612105 %U http://dx.doi.org/doi:10.1109/CEC60901.2024.10612105 %0 Conference Proceedings %T Genetic Programming Approaches for Minimum Cost Topology Optimisation of Optical Telecommunication Networks %A Aiyarak, P. %A Saket, A. S. %A Sinclair, M. C. %Y Zalzala, Ali %S Second International Conference on Genetic Algorithms in Engineering Systems: Innovations and Applications, GALESIA %D 1997 %8 January 4 sep %I IEE %C University of Strathclyde, Glasgow, UK %@ 0-85296-693-8 %F aiyarak:1997:GPtootn %X This paper compares the relative efficiency of three approaches for the minimum-cost topology optimisation of the COST 239 European Optical Network (EON) using genetic programming. The GP was run for the central nine nodes using three approaches: relational function set, decision trees, and connected nodes. Only the best two, decision trees and connected nodes, were run for the full EON. The results are also compared with earlier genetic algorithm work on the EON. %K genetic algorithms, genetic programming, telecommunication networks, topology %R doi:10.1049/cp:19971216 %U http://uk.geocities.com/markcsinclair/ps/galesia97_aiy.ps.gz %U http://dx.doi.org/doi:10.1049/cp:19971216 %P 415-420 %0 Conference Proceedings %T A novel estimation methodology for tracheal pressure in mechanical ventilation control %A Ajcevic, Milos %A De Lorenzo, Andrea %A Accardo, Agostino %A Bartoli, Alberto %A Medvet, Eric %S 8th International Symposium on Image and Signal Processing and Analysis (ISPA 2013) %D 2013 %8 April 6 sep %C Trieste, Italy %F Ajcevic:2013:ISPA %X High-frequency percussive ventilation (HFPV) is a non-conventional mechanical ventilatory strategy which has proved useful in the treatment of a number of pathological conditions. HFPV usually involves the usage of endotracheal tubes (EET) connecting the ventilator circuit to the airway of the patient. The pressure of the air flow insufflated by HFPV must be controlled very accurately in order to avoid barotrauma and volutrauma. Since the actual tracheal pressure cannot be measured, a model for estimating such a pressure based on the EET properties and on the air flow properties that can actually be measured in clinical practice is necessary. In this work we propose a novel methodology, based on Genetic Programming, for synthesising such a model. We experimentally evaluated our models against the state-of-the-art baseline models, crafted by human experts, and found that our models for estimating tracheal pressure are significantly more accurate. %K genetic algorithms, genetic programming, biomechanics, biomedical electronics, biomedical equipment, diseases, injuries, medical control systems, patient treatment, physiological models, air flow pressure, air flow properties, barotrauma, endotracheal tubes, estimation methodology, high-frequency percussive ventilation, mechanical ventilation control, nonconventional mechanical ventilatory strategy, pathological conditions, patient airway, patient treatment, state-of-the-art baseline models, tracheal pressure, ventilator circuit, volutrauma, Electron tubes, Lungs, Physiology, Pressure measurement, Testing, Ventilation %R doi:10.1109/ISPA.2013.6703827 %U http://dx.doi.org/doi:10.1109/ISPA.2013.6703827 %P 695-699 %0 Thesis %T Personalized setup of high frequency percussive ventilator by estimation of respiratory system viscoelastic parameters %A Ajcevic, Milos %D 2013/2014 %C Italy %C Universita degli studi di Trieste %F Ajcevic:thesis %X High Frequency Percussive Ventilation (HFPV) is a non-conventional ventilatory modality which has proven highly effective in patients with severe gas exchange impairment. However, at the present time, HFPV ventilator provides only airway pressure measurement. The airway pressure measurements and gas exchange analysis are currently the only parameters that guide the physician during the HFPV ventilator setup and treatment monitoring. The evaluation of respiratory system resistance and compliance parameters in patients undergoing mechanical ventilation is used for lung dysfunctions detection, ventilation setup and treatment effect evaluation. Furthermore, the pressure measured by ventilator represents the sum of the endotracheal tube pressure drop and the tracheal pressure. From the clinical point of view, it is very important to take into account the real amount of pressure dissipated by endotracheal tube to avoid lung injury. HFPV is pressure controlled logic ventilation, thus hypoventilation and hyperventilation cases are possible because of tidal volume variations in function of pulmonary and endotracheal tube impedance. This thesis offers a new approach for HFPV ventilator setup in accordance with protective ventilatory strategy and optimization of alveolar recruitment using estimation of the respiratory mechanics parameters and endotracheal pressure drop. Respiratory system resistance and compliance parameters were estimated, firstly in vitro and successively in patients undergoing HFPV, applying least squares regression on Dorkin high frequency model starting from measured respiratory signals. The Blasius model was identified as the most adequate to estimate pressure drop across the endotracheal tube during HFPV. Beside measurement device was developed in order to measure respiratory parameters in patients undergoing HFPV. The possibility to tailor HFPV ventilator setup, using respiratory signals measurement and estimation of respiratory system resistance, compliance and endotracheal tube pressure drop, provided by this thesis, opens a new prospective to this particular ventilatory strategy, improving its beneficial effects and minimizing ventilator-induced lung damage. %K genetic algorithms, genetic programming, High Frequency Percussive Ventilation, Respiratory signal processing, Parameter identification %9 Ph.D. thesis %U http://hdl.handle.net/10077/10976 %0 Journal Article %T Evolving Suspiciousness Metrics From Hybrid Data Set for Boosting a Spectrum Based Fault Localization %A Ajibode, Adekunle Akinjobi %A Shu, Ting %A Ding, Zuohua %J IEEE Access %D 2020 %V 8 %@ 2169-3536 %F Ajibode:2020:A %X Spectrum Based Fault Localization (SBFL) uses different metrics called risk evaluation formula to guide and pinpoint faults in debugging process. The accuracy of a specific SBFL method may be limited by the used formulae and program spectra. However, it has been demonstrated recently that Genetic Programming could be used to automatically design formulae directly from the program spectra. Therefore, this article presents Genetic Programming approach for proposing risk evaluation formula with the inclusion of radicals to evolve suspiciousness metric directly from the program spectra. 92 faults from Unix utilities of SIR repository and 357 real faults from Defect4J repository were used. The approach combines these data sets, used 2percent of the total faults (113) to evolve the formulae and the remaining 7percent (336) to validate the effectiveness of the metrics generated by our approach. The proposed approach then uses Genetic Programming to run 30 evolution to produce different 30 metrics. The GP-generated metrics consistently out-performed all the classic formulae in both single and multiple faults, especially OP2 on average of 2.2percent in single faults and 3.4percent in multiple faults. The experiment results conclude that the combination of Hybrid data set and radical is a good technique to evolve effective formulae for spectra-based fault localization. %K genetic algorithms, genetic programming, Measurement, Debugging, Boosting, Debugging, fault localization, SBFL %9 journal article %R doi:10.1109/ACCESS.2020.3035413 %U http://dx.doi.org/doi:10.1109/ACCESS.2020.3035413 %P 198451-198467 %0 Book Section %T Developing a Computer-Controller Opponent for a First-Person Simulation Game using Genetic Programming %A Akalin, Frederick R. %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2002 %D 2002 %8 jun %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F akalin:2002:DCOFSGGP %K genetic algorithms, genetic programming %P 11-20 %0 Journal Article %T Application of Fixed Length Gene Genetic Programming (FLGGP) in Hydropower Reservoir Operation %A Akbari-Alashti, Habib %A Haddad, Omid Bozorg %A Marino, Miguel A. %J Water Resources Management %D 2015 %V 29 %N 9 %F akbari-alashti:2015:WRM %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s11269-015-1003-1 %U http://link.springer.com/article/10.1007/s11269-015-1003-1 %U http://dx.doi.org/doi:10.1007/s11269-015-1003-1 %0 Conference Proceedings %T Derivation of Relational Fuzzy Classification Rules Using Evolutionary Computation %A Akbarzadeh, Vahab %A Sadeghian, Alireza %A dos Santos, Marcus V. %Y Wang, Jun %S 2008 IEEE World Congress on Computational Intelligence %D 2008 %8 January 6 jun %I IEEE Press %C Hong Kong %F Akbarzadeh:2008:fuzz %X An evolutionary system for derivation of fuzzy classification rules is presented. This system uses two populations: one of fuzzy classification rules, and one of membership function definitions. A constrained-syntax genetic programming evolves the first population and a mutation-based evolutionary algorithm evolves the second population. These two populations co-evolve to better classify the underlying dataset. Unlike other approaches that use fuzzification of continuous attributes of the dataset for discovering fuzzy classification rules, the system presented here fuzzifies the relational operators “greater than” and “less than” using evolutionary methods. For testing our system, the system is applied to the Iris dataset. Our experimental results show that our system outperforms previous evolutionary and non-evolutionary systems on accuracy of classification and derivation of interrelation between the attributes of the Iris dataset. The resulting fuzzy rules of the system can be directly used in knowledge-based systems. %K genetic algorithms, genetic programming, constrained-syntax genetic programming, evolutionary computation, knowledge-based systems, mutation-based evolutionary algorithm, relational fuzzy classification rules, fuzzy set theory, knowledge based systems %R doi:10.1109/FUZZY.2008.4630598 %U FS0398.pdf %U http://dx.doi.org/doi:10.1109/FUZZY.2008.4630598 %P 1689-1693 %0 Conference Proceedings %T Genetic Algorithms and Genetic Programming: Combining Strength in One Evolutionary Strategy %A Akbarzadeh-T., M.-R. %A Tunstel, E. %A Jamshidi, M. %S Proceedings of the 1997 WERC/HSRC Joint Conference on the Environment %D 1997 %8 26 29 apr %C Albuquerque, NM, USA %F Akbarzadeh:1997:jce %X Genetic Algorithms (GA) and Genetic Programs (GP) are two of the most widely used evolution strategies for parameter optimisation of complex systems. GAs have shown a great deal of success where the representation space is a string of binary or real-valued numbers. At the same time, GP has demonstrated success with symbolic representation spaces and where structure among symbols is explored. This paper discusses weaknesses and strengths of GA and GP in search of a combined and more evolved optimization algorithm. This combination is especially attractive for problem domains with non-homogeneous parameters. In particular, a fuzzy logic membership function is represented by numerical strings, whereas rule-sets are represented by symbols and structural connectives. Two examples are provided which exhibit how GA and GP are best used in optimising robot performance in manipulating hazardous waste. The first example involves optimisation for a fuzzy controller for a flexible robot using GA and the second example illustrates usage of GP in optimizing an intelligent navigation algorithm for a mobile robot. A novel strategy for combining GA and GP is presented. %K genetic algorithms, genetic programming %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/Akbarzadeh_1997_jce.pdf %P 373-377 %0 Conference Proceedings %T Soft computing paradigms for hybrid fuzzy controllers: experiments and applications %A Akbarzadeh-T., M. R. %A Tunstel, E. %A Kumbla, K. %A Jamshidi, M. %S Proceedings of the 1998 IEEE World Congress on Computational Intelligence %D 1998 %8 May 9 may %V 2 %I IEEE Press %C Anchorage, Alaska, USA %@ 0-7803-4863-X %F Akbarzadeh:1998:wcci %X Neural networks (NN), genetic algorithms (GA), and genetic programs (GP) are often augmented with fuzzy logic-based schemes to enhance artificial intelligence of a given system. Such hybrid combinations are expected to exhibit added intelligence, adaptation, and learning ability. In the paper, implementation of three hybrid fuzzy controllers are discussed and verified by experimental results. These hybrid controllers consist of a hierarchical NN-fuzzy controller applied to a direct drive motor, a GA-fuzzy hierarchical controller applied to a flexible robot link, and a GP-fuzzy behavior-based controller applied to a mobile robot navigation task. It is experimentally shown that all three architectures are capable of significantly improving the system response. %K genetic algorithms, genetic programming, neurocontrollers, fuzzy control, hierarchical systems, mobile robots, path planning, brushless DC motors, machine control, manipulators, soft computing paradigms, hybrid fuzzy controllers, neural networks, genetic programs, fuzzy logic-based schemes, added intelligence, adaptation, learning ability, direct drive motor, genetic algorithm-fuzzy hierarchical controller, flexible robot link, genetic programming-fuzzy behavior-based controller, mobile robot navigation task %R doi:10.1109/FUZZY.1998.686289 %U http://www-robotics.jpl.nasa.gov/people/Edward_Tunstel/fieee98.pdf %U http://dx.doi.org/doi:10.1109/FUZZY.1998.686289 %P 1200-1205 %0 Journal Article %T Soft computing for autonomous robotic systems %A Akbarzadeh-T., M.-R. %A Kumbla, K. %A Tunstel, E. %A Jamshidi, M. %J Computers and Electrical Engineering %D 2000 %V 26 %N 1 %F Akbarzadeh-T:2000:CEE %X Neural networks (NN), genetic algorithms (GA), and genetic programming (GP) are augmented with fuzzy logic-based schemes to enhance artificial intelligence of automated systems. Such hybrid combinations exhibit added reasoning, adaptation, and learning ability. In this expository article, three dominant hybrid approaches to intelligent control are experimentally applied to address various robotic control issues which are currently under investigation at the NASA Center for Autonomous Control Engineering. The hybrid controllers consist of a hierarchical NN-fuzzy controller applied to a direct drive motor, a GA-fuzzy hierarchical controller applied to position control of a flexible robot link, and a GP-fuzzy behavior based controller applied to a mobile robot navigation task. Various strong characteristics of each of these hybrid combinations are discussed and used in these control architectures. The NN-fuzzy architecture takes advantage of NN for handling complex data patterns, the GA-fuzzy architecture uses the ability of GA to optimize parameters of membership functions for improved system response, and the GP-fuzzy architecture uses the symbolic manipulation capability of GP to evolve fuzzy rule-sets. %K genetic algorithms, genetic programming, Soft computing, Neural networks, Fuzzy logic, Robotic control, Articial intelligence %9 journal article %U http://www.sciencedirect.com/science/article/B6V25-3Y6GXY5-2/1/6a6f9ff946815d4e95fe3884c98e74e5 %P 5-32 %0 Conference Proceedings %T Friendship Modeling for Cooperative Co-Evolutionary Fuzzy Systems: A Hybrid GA-GP Algorithm %A Akbarzadeh-T., M.-R. %A Mosavat, I. %A Abbasi, S. %S Proceedings of the 22nd International Conference of North American Fuzzy Information Processing Society, NAFIPS 2003 %D 2003 %8 24 26 jul %F Akbarzadeh:2003:ICNAFIPS %X A novel approach is proposed to combine the strengths of GA and GP to optimise rule sets and membership functions of fuzzy systems in a co-evolutionary strategy in order to avoid the problem of dual representation in fuzzy systems. The novelty of proposed algorithm is twofold. One is that GP is used for the structural part (Rule sets) and GA for the string part (Membership functions). The goal is to reduce/eliminate the problem of competing conventions by co-evolving pieces of the problem separately and then in combination. Second is exploiting the synergism between rules sets and membership functions by imitating the effect of ’matching’ and friendship in cooperating teams of humans, thereby significantly reducing the number of function evaluations necessary for evolution. The method is applied to a chaotic time series prediction problem and compared with the standard fuzzy table look-up scheme. demonstrate several significant improvements with the proposed approach; specifically, four times higher fitness and more steady fitness improvements as compared with epochal improvements observed in GP. %K genetic algorithms, genetic programming, Artificial neural networks, Chaos, Computational modelling, Convergence, Evolutionary computation, Fuzzy logic, Fuzzy systems, Genetic programming, Humans, Stochastic processes, cooperative systems, fuzzy systems, groupware, modelling, table lookup, time series, chaotic time series prediction, cooperative co-evolutionary fuzzy systems, friendship modeling, function evaluations, fuzzy lookup tables, hybrid GA-GP algorithm, membership functions, rules sets %R doi:10.1109/NAFIPS.2003.1226756 %U http://dx.doi.org/doi:10.1109/NAFIPS.2003.1226756 %P 61-66 %0 Generic %T Evolutionary Optimization of Model Merging Recipes %A Akiba, Takuya %E Wilson, Dennis G. %E Kalkreuth, Roman %E Medvet, Eric %E Nadizar, Giorgia %E Squillero, Giovanni %E Tonda, Alberto %E Lavinas, Yuri %D 2024 %8 14 jul %I Association for Computing Machinery %C Melbourne %F Akiba:2024:GGP %O Invited talk %K genetic algorithms, genetic programming, ANN, LLM %U https://arxiv.org/abs/2403.13187 %0 Conference Proceedings %T Multiple-Organisms Learning and Evolution by Genetic Programming %A Akira, Yoshida %Y McKay, Bob %Y Tsujimura, Yasuhiro %Y Sarker, Ruhul %Y Namatame, Akira %Y Yao, Xin %Y Gen, Mitsuo %S Proceedings of The Third Australia-Japan Joint Workshop on Intelligent and Evolutionary Systems %D 1999 %8 22 25 nov %C School of Computer Science Australian Defence Force Academy, Canberra, Australia %F Akira:1999:AJ %K genetic algorithms, genetic programming %0 Conference Proceedings %T Intraspecific Evolution of Learning by Genetic Programming %A Akira, Yoshida %Y Poli, Riccardo %Y Banzhaf, Wolfgang %Y Langdon, William B. %Y Miller, Julian F. %Y Nordin, Peter %Y Fogarty, Terence C. %S Genetic Programming, Proceedings of EuroGP’2000 %S LNCS %D 2000 %8 15 16 apr %V 1802 %I Springer-Verlag %C Edinburgh %@ 3-540-67339-3 %F akira:2000:moelGP %X Spatial dynamic pattern formations or trails can be observed in a simple square world where individuals move to look for scattered foods. They seem to show the emergence of co-operation, job separation, or division of territories when genetic programming controls the reproduction, mutation, crossing over of the organisms. We try to explain the co-operative behaviours among multiple organisms by means of density of organisms and their environment. Next, we add some interactions between organisms, and between organism and their environment to see that the more interaction make the convergence of intraspecific learning faster. At last, we study that MDL-based fitness evaluation is effective for improvement of generalisation of genetic programming. %K genetic algorithms, genetic programming: Poster %R doi:10.1007/978-3-540-46239-2_15 %U http://dx.doi.org/doi:10.1007/978-3-540-46239-2_15 %P 209-224 %0 Journal Article %T Software Defect Prediction Using Genetic Programming and Neural Networks %A Akour, Mohammed %A Melhem, Wasen Yahya %J International Journal of Open Source Software and Processes %D 2017 %V 8 %N 4 %@ 1942-3926 %F journals/ijossp/AkourM17 %X This article describes how classification methods on software defect prediction is widely researched due to the need to increase the software quality and decrease testing efforts. However, findings of past researches done on this issue has not shown any classifier which proves to be superior to the other. Additionally, there is a lack of research that studies the effects and accuracy of genetic programming on software defect prediction. To find solutions for this problem, a comparative software defect prediction experiment between genetic programming and neural networks are performed on four datasets from the NASA Metrics Data repository. Generally, an interesting degree of accuracy is detected, which shows how the metric-based classification is useful. Nevertheless, this article specifies that the application and usage of genetic programming is highly recommended due to the detailed analysis it provides, as well as an important feature in this classification method which allows the viewing of each attributes impact in the dataset. %K genetic algorithms, genetic programming, ANN, SBSE %9 journal article %R doi:10.4018/IJOSSP.2017100102 %U http://dx.doi.org/doi:10.4018/IJOSSP.2017100102 %P 32-51 %0 Journal Article %T Software Effort Estimation Using Multi Expression Programming %A Al-Saati, Najla Akram %A Alreffaee, Taghreed Riyadh %J AL-Rafidain Journal of Computer Sciences and Mathematics %D 2014 %V 11 %N 2 %I Mosul University %@ 1815-4816 %F Al-Saati:2014:mosul %X The process of finding a function that can estimate the effort of software systems is considered to be the most important and most complex process facing systems developers in the field of software engineering. The accuracy of estimating software effort forms an essential part of the software development phases. A lot of experts applied different ways to find solutions to this issue, such as the COCOMO and other methods. Recently, many questions have been put forward about the possibility of using Artificial Intelligence to solve such problems, different scientists made ​​several studies about the use of techniques such as Genetic Algorithms and Artificial Neural Networks to solve estimation problems. We use one of the Linear Genetic Programming methods (Multi Expression programming) which apply the principle of competition between equations encrypted within the chromosomes to find the best formula for resolving the issue of software effort estimation. As for to the test data, benchmark known datasets are employed taken from previous projects, the results are evaluated by comparing them with the results of Genetic Programming (GP) using different fitness functions. The gained results indicate the surpassing of the employed method in finding more efficient functions for estimating about 7 datasets each consisting of many projects. %K genetic algorithms, genetic programming, Effort Estimation, Multi Expression Programming %9 journal article %R doi:10.33899/csmj.2014.163756 %U https://csmj.mosuljournals.com/article_163756.html %U http://dx.doi.org/doi:10.33899/csmj.2014.163756 %P 53-71 %0 Journal Article %T Using Multi Expression Programming in Software Effort Estimation %A AL-Saati, Najla Akram %A Alreffaee, Taghreed Riyadh %J International Journal of Recent Research and Review %D 2017 %8 jun %V X %N 2 %@ 2277-8322 %F Akram:2017:ijrr %X Estimating the effort of software systems is an essential topic in software engineering, carrying out an estimation process reliably and accurately for a software forms a vital part of the software development phases. Many researchers have used different methods and techniques hopping to find solutions to this issue, such techniques include COCOMO, SEER-SEM,SLIM and others. Recently, Artificial Intelligent techniques are being used to solve such problems; different studies have been issued focusing on techniques such as Neural Networks NN, Genetic Algorithms GA, and Genetic Programming GP. This work uses one of the linear variations of GP, namely: Multi Expression Programming (MEP) aiming to find the equation that best estimates the effort of software. Benchmark datasets (based on previous projects) are used learning and testing. Results are compared with those obtained by GP using different fitness functions. Results show that MEP is far better in discovering effective functions for the estimation of about 6 datasets each comprising several projects. %K genetic algorithms, genetic programming, Multi Expression Programming, SBSE, Software Effort, Estimation, Software Engineering %9 journal article %U http://www.ijrrr.com/papers10-2/paper1-Using%20Multi%20Expression%20Programming%20in%20Software%20Effort%20Estimation.pdf %P 1-10 %0 Generic %T Using Multi Expression Programming in Software Effort Estimation %A Al-Saati, Najla Akram %A Alreffaee, Taghreed Riyadh %D 2018 %8 30 apr %I arXiv %F Akram:2018:arxiv %X Estimating the effort of software systems is an essential topic in software engineering, carrying out an estimation process reliably and accurately for a software forms a vital part of the software development phases. Many researchers have used different methods and techniques hopping to find solutions to this issue, such techniques include COCOMO, SEER-SEM,SLIM and others. Recently, Artificial Intelligent techniques are being used to solve such problems; different studies have been issued focusing on techniques such as Neural Networks NN, Genetic Algorithms GA, and Genetic Programming GP. This work uses one of the linear variations of GP, namely: Multi Expression Programming (MEP) aiming to find the equation that best estimates the effort of software. Benchmark datasets (based on previous projects) are used learning and testing. Results are compared with those obtained by GP using different fitness functions. Results show that MEP is far better in discovering effective functions for the estimation of about 6 datasets each comprising several projects. %K genetic algorithms, genetic programming, SBSE, ANN, software effort, estimation, multi expression programming %U http://arxiv.org/abs/1805.00090 %0 Journal Article %T Quality by Design Approach: Application of Artificial Intelligence Techniques of Tablets Manufactured by Direct Compression %A Aksu, Buket %A Paradkar, Anant %A Matas, Marcel %A Ozer, Ozgen %A Guneri, Tamer %A York, Peter %J AAPS PharmSciTech %D 2012 %8 sep 06 %V 13 %N 4 %I American Association of Pharmaceutical Scientists %G English %F Aksu:2012:AAPS %X The publication of the International Conference of Harmonization (ICH) Q8, Q9, and Q10 guidelines paved the way for the standardization of quality after the Food and Drug Administration issued current Good Manufacturing Practices guidelines in 2003. Quality by Design, mentioned in the ICH Q8 guideline, offers a better scientific understanding of critical process and product qualities using knowledge obtained during the life cycle of a product. In this scope, the knowledge space is a summary of all process knowledge obtained during product development, and the design space is the area in which a product can be manufactured within acceptable limits. To create the spaces, artificial neural networks (ANNs) can be used to emphasise the multidimensional interactions of input variables and to closely bind these variables to a design space. This helps guide the experimental design process to include interactions among the input variables, along with modelling and optimisation of pharmaceutical formulations. The objective of this study was to develop an integrated multivariate approach to obtain a quality product based on an understanding of the cause–effect relationships between formulation ingredients and product properties with ANNs and genetic programming on the ramipril tablets prepared by the direct compression method. In this study, the data are generated through the systematic application of the design of experiments (DoE) principles and optimisation studies using artificial neural networks and neurofuzzy logic programs. %K genetic algorithms, genetic programming, gene expression programming, artificial neural networks, ANNs, GEP, optimisation, quality by design (qbd) %9 journal article %R doi:10.1208/s12249-012-9836-x %U http://dx.doi.org/10.1208/s12249-012-9836-x %U http://dx.doi.org/doi:10.1208/s12249-012-9836-x %P 1138-1146 %0 Conference Proceedings %T A Genetic Programming Classifier Design Approach for Cell Images %A Akyol, Aydin %A Yaslan, Yusuf %A Erol, Osman Kaan %Y Mellouli, Khaled %S Proceedings of the 9th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty, ECSQARU %S Lecture Notes in Computer Science %D 2007 %8 oct 31 nov 2 %V 4724 %I Springer %C Hammamet, Tunisia %F conf/ecsqaru/AkyolYE07 %X This paper describes an approach for the use of genetic programming (GP) in classification problems and it is evaluated on the automatic classification problem of pollen cell images. In this work, a new reproduction scheme and a new fitness evaluation scheme are proposed as advanced techniques for GP classification applications. Also an effective set of pollen cell image features is defined for cell images. Experiments were performed on Bangor/Aberystwyth Pollen Image Database and the algorithm is evaluated on challenging test configurations. We reached at 96percent success rate on the average together with significant improvement in the speed of convergence. %K genetic algorithms, genetic programming, cell classification, classifier design, pollen classification %R doi:10.1007/978-3-540-75256-1_76 %U http://dx.doi.org/doi:10.1007/978-3-540-75256-1_76 %P 878-888 %0 Journal Article %T Adaptive Gene Level Mutation %A Al-Afandi, Jalal %A Horvath, Andras %J Algorithms %D 2021 %V 14 %N 1 %@ 1999-4893 %F al-afandi:2021:Algorithms %X Genetic Algorithms are stochastic optimisation methods where solution candidates, complying to a specific problem representation, are evaluated according to a predefined fitness function. These approaches can provide solutions in various tasks even, where analytic solutions can not be or are too complex to be computed. In this paper we will show, how certain set of problems are partially solvable allowing us to grade segments of a solution individually, which results local and individual tuning of mutation parameters for genes. We will demonstrate the efficiency of our method on the N-Queens and travelling salesman problems where we can demonstrate that our approach always results faster convergence and in most cases a lower error than the traditional approach. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/a14010016 %U https://www.mdpi.com/1999-4893/14/1/16 %U http://dx.doi.org/doi:10.3390/a14010016 %0 Thesis %T Image Reconstructing in Electrical Capacitance Tomography of Manufacturing Processes Using Genetic Programming %A Al-Afeef, Ala’ S. %D 2010 %8 jul %C Al-Salt, Jordan %C Al-Balqa Applied University %F Al-Afeef:mastersthesis %X Electrical capacitance tomography is considered the most attractive technique for industrial process imaging because of its low construction cost, safety, fast data acquisition , non-invasiveness, non-intrusiveness, simple structure, wide application field and suitability for most kinds of flask and vessels, however, the low accuracy of the reconstructed images is the main limitation of implementing an ECT system. In order to improve the imaging accuracy, one may 1) increase the number of measurements by raising number of electrodes, 2) improve the reconstruction algorithm so that more information can be extracted from the captured data, however, increasing the number of electrodes has a limited impact on the imaging accuracy improvement. This means that, in order to improve the reconstructed image, more accurate reconstruction algorithms must be developed. In fact, ECT image reconstruction is still an inefficiently resolved problem because of many limitations, mainly the Soft-field and Ill-condition characteristic of ECT. Although there are many algorithms to solve the image reconstruction problem, these algorithms are not yet able to present a single model that can relate between image pixels and capacitance measurements in a mathematical relationship. The originality of this thesis lies in introducing a new technique for solving the non-linear inverse problem in ECT based on Genetic Programming (GP) to handle the ECT imaging for conductive materials. GP is a technique that has not been applied to ECT. GP found to be efficient in dealing with the Non-linear relation between the measured capacitance and permittivity distribution in ECT. This thesis provides new implemented software that can handle the ECT based GP problem with a user-friendly interface. The developed simulation results are promising. %K genetic algorithms, genetic programming, Image Reconstructing, Electrical Capacitance Tomography %9 Masters thesis %U https://sites.google.com/site/alaaalfeef/home/Alaa_afeef_Thesis_Final.pdf %0 Conference Proceedings %T Image reconstruction of a metal fill industrial process using Genetic Programming %A Al-Afeef, Alaa %A Sheta, Alaa F. %A Al-Rabea, Adnan %S 10th International Conference on Intelligent Systems Design and Applications (ISDA), 2010 %D 2010 %8 29 nov 1 dec %C Cairo %F Al-Afeef:2010:ISDA %X Electrical Capacitance Tomography (ECT) is one of the most attractive technique for industrial process imaging because of its low construction cost, safety, non-invasiveness, non-intrusiveness, fast data acquisition, simple structure, wide application field and suitability for most kinds of flask and vessels. However, image reconstruction based ECT suffers many limitations. They include the Soft-field and Ill-condition characteristic of ECT. The basic idea of the ECT for image reconstruction for a metal fill problem is to model the image pixels as a function of the capacitance measurements. Developing this relationship represents a challenge for systems engineering community. In this paper, we presents our innovative idea on solving the non-linear inverse problem for conductive materials of the ECT using Genetic Programming (GP). GP found to be a very efficient algorithm in producing a mathematical model of image pixels in the form of Lisp expression. The reported results are promising. %K genetic algorithms, genetic programming, electrical capacitance tomography, ill-condition characteristic, image reconstruction, industrial process imaging, metal fill industrial process, soft-field characteristic, image reconstruction, industrial engineering, tomography, Process Tomography %R doi:10.1109/ISDA.2010.5687299 %U http://sites.google.com/site/alaaalfeef/home/8.pdf %U http://dx.doi.org/doi:10.1109/ISDA.2010.5687299 %P 12-17 %0 Book %T Image Reconstruction of a Manufacturing Process: A Genetic Programming Approach %A Al-Afeef, Alaa %A Sheta, Alaa %A Rabea, Adnan %D 2011 %8 apr %7 1 %I Lambert Academic Publishing %F AfeefBook2011 %X Product Description Evolutionary Computation (EC) is one of the most attractive techniques in the area of Computer Science. EC includes Genetic Algorithms (GAs), Genetic Programming (GP), Evolutionary Strategy (ES) and Evolutionary Programming (EP). GP have been widely used to solve a variety of problems in image enhancement, analysis and segmentation. This book explores the use of GP as a powerful approach to solve the image reconstruction problem for Lost Foam Casting (LFC) manufacturing process. The data set was collected using the Electrical Capacitance Tomography (ECT) technique. ECT is one of the most attractive technique for industrial process imaging because of its low construction cost, safety, non-invasiveness, non-intrusiveness, fast data acquisition, simple structure, wide application field and suitability for most kinds of flask and vessels. GP found to be a very efficient algorithm in producing a mathematical model of image pixels in a form of Lisp expression. A Graphical User Interface (GUI) Toolbox based Matlab was developed to help analysing and visualising the reconstructed images based GP problem. The reported results are promising. %K genetic algorithms, genetic programming %U https://www.morebooks.de/store/gb/book/image-reconstruction-of-a-manufacturing-process/isbn/978-3-8443-2569-0 %0 Journal Article %T GADS and Reusability %A Al-Bastaki, Y. %A Awad, W. %J Journal of Artificial Intelligence %D 2010 %V 3 %N 2 %I Asian Network for Scientific Information %@ 19945450 %G eng %F Al-Bastaki:2010:JAI %X Genetic programming is a domain-independent method that genetically breeds population of computer programs to solve problems. Genetic programming is considered to be a machine learning technique used to optimise a population of computer programs according to a fitness landscape determined by a program’s ability to perform a given computational task. There are a number of representation methods to illustrate these programs, such as LISP expressions and integer lists. This study investigated the effectiveness of genetic programming in solving the symbolic regression problem where, the population programs are expressed as integer sequences rather than lisp expressions. This study also introduced the concept of reusable program to genetic algorithm for developing software. %K genetic algorithms, genetic programming, GADS, reusability %9 journal article %U http://docsdrive.com/pdfs/ansinet/jai/2010/67-72.pdf %P 67-77 %0 Conference Proceedings %T An evolutionary computing approach for estimating global solar radiation %A Al-Hajj, Rami %A Assi, Ali %A Batch, Farhan %S 2016 IEEE International Conference on Renewable Energy Research and Applications (ICRERA) %D 2016 %8 20 23 nov %C Birmingham, UK %F Al-Hajj:2016:ICRERA %X This paper presents a non-linear regression model based on an evolutionary computing technique namely the genetic programming for estimating solar radiation. This approach aims to estimate the best formula that represents the function for estimating the global solar radiation on horizontals with respect to the measured climatological data. First, we present a reference approach to find one global formula that models the relation among the solar radiation amount and a set of weather factors. In the second step, we present an enhanced approach that consists of multi formulas of regression in a parallel structure. The performance of the proposed approaches has been evaluated using statistical analysis measures. The obtained results were promising and comparable to those obtained by other empirical and neural models conducted by other research groups. %K genetic algorithms, genetic programming, Decision support systems, Evolutionary computation, Hand-held computers, climatological data, evolutionary computation, global solar radiation %R doi:10.1109/ICRERA.2016.7884553 %U http://dx.doi.org/doi:10.1109/ICRERA.2016.7884553 %P 285-290 %0 Journal Article %T A Hybrid LSTM-Based Genetic Programming Approach for Short-Term Prediction of Global Solar Radiation Using Weather Data %A Al-Hajj, Rami %A Assi, Ali %A Fouad, Mohamad %A Mabrouk, Emad %J Processes %D 2021 %V 9 %N 7 %@ 2227-9717 %F al-hajj:2021:Processes %X The integration of solar energy in smart grids and other utilities is continuously increasing due to its economic and environmental benefits. However, the uncertainty of available solar energy creates challenges regarding the stability of the generated power the supply-demand balance’s consistency. An accurate global solar radiation (GSR) prediction model can ensure overall system reliability and power generation scheduling. This article describes a nonlinear hybrid model based on Long Short-Term Memory (LSTM) models and the Genetic Programming technique for short-term prediction of global solar radiation. The LSTMs are Recurrent Neural Network (RNN) models that are successfully used to predict time-series data. We use these models as base predictors of GSR using weather and solar radiation (SR) data. Genetic programming (GP) is an evolutionary heuristic computing technique that enables automatic search for complex solution formulas. We use the GP in a post-processing stage to combine the LSTM models’ outputs to find the best prediction of the GSR. We have examined two versions of the GP in the proposed model: a standard version and a boosted version that incorporates a local search technique. We have shown an improvement in terms of performance provided by the proposed hybrid model. We have compared its performance to stacking techniques based on machine learning for combination. The results show that the suggested method provides significant improvement in terms of performance and consistency. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/pr9071187 %U https://www.mdpi.com/2227-9717/9/7/1187 %U http://dx.doi.org/doi:10.3390/pr9071187 %0 Conference Proceedings %T Genetic Programming-Based Simultaneous Feature Selection and Imputation for Symbolic Regression with Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Palaiahnakote, Shivakumara %Y di Baja, Gabriella Sanniti %Y Wang, Liang %Y Yan, Wei Qi %S Pattern Recognition - 5th Asian Conference, ACPR 2019, Auckland, New Zealand, November 26-29, 2019, Revised Selected Papers, Part II %S Lecture Notes in Computer Science %D 2019 %V 12047 %I Springer %F DBLP:conf/acpr/Al-HelaliCXZ19 %K genetic algorithms, genetic programming %R doi:10.1007/978-3-030-41299-9_44 %U https://doi.org/10.1007/978-3-030-41299-9_44 %U http://dx.doi.org/doi:10.1007/978-3-030-41299-9_44 %P 566-579 %0 Conference Proceedings %T Genetic Programming for Imputation Predictor Selection and Ranking in Symbolic Regression with High-Dimensional Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Liu, Jixue %Y Bailey, James %S AI 2019: Advances in Artificial Intelligence - 32nd Australasian Joint Conference, Adelaide, SA, Australia, December 2-5, 2019, Proceedings %S Lecture Notes in Computer Science %D 2019 %V 11919 %I Springer %F DBLP:conf/ausai/Al-Helali00Z19 %K genetic algorithms, genetic programming %R doi:10.1007/978-3-030-35288-2_42 %U https://doi.org/10.1007/978-3-030-35288-2_42 %U http://dx.doi.org/doi:10.1007/978-3-030-35288-2_42 %P 523-535 %0 Conference Proceedings %T A Genetic Programming-based Wrapper Imputation Method for Symbolic Regression with Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %S 2019 IEEE Symposium Series on Computational Intelligence (SSCI) %D 2019 %8 dec %F Al-Helali:2019:SSCI %X Dealing with missing values is one of the challenges in symbolic regression on many real-world data sets. One of the popular approaches to address this challenge is to use imputation. Traditional imputation methods are usually performed based on the predictive features without considering the original target variable. In this work, a genetic programming-based wrapper imputation method is proposed, which wrappers a regression method to consider the target variable when constructing imputation models for the incomplete features. In addition to the imputation performance, the regression performance is considered for evaluating the imputation models. Genetic programming (GP) is used for building the imputation models and decision tree (DT) is used for evaluating the regression performance during the GP evolutionary process. The experimental results show that the proposed method has a significant advance in enhancing the symbolic regression performance compared with some state-of- the-art imputation methods. %K genetic algorithms, genetic programming %R doi:10.1109/SSCI44817.2019.9002861 %U http://dx.doi.org/doi:10.1109/SSCI44817.2019.9002861 %P 2395-2402 %0 Conference Proceedings %T Data Imputation for Symbolic Regression with Missing Values: A Comparative Study %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %S 2020 IEEE Symposium Series on Computational Intelligence (SSCI) %D 2020 %8 dec %F Al-Helali:2020:SSCI %X Symbolic regression via genetic programming is considered as a crucial machine learning tool for empirical modelling. However, in reality, it is common for real-world data sets to have some data quality problems such as noise, outliers, and missing values. Although several approaches can be adopted to deal with data incompleteness in machine learning, most studies consider the classification tasks, and only a few have considered symbolic regression with missing values. In this work, the performance of symbolic regression using genetic programming on real-world data sets that have missing values is investigated. This is done by studying how different imputation methods affect symbolic regression performance. The experiments are conducted using thirteen real-world incomplete data sets with different ratios of missing values. The experimental results show that although the performance of the imputation methods differs with the data set, CART has a better effect than others. This might be due to its ability to deal with categorical and numerical variables. Moreover, the superiority of the use of imputation methods over the commonly used deletion strategy is observed. %K genetic algorithms, genetic programming %R doi:10.1109/SSCI47803.2020.9308216 %U http://dx.doi.org/doi:10.1109/SSCI47803.2020.9308216 %P 2093-2100 %0 Conference Proceedings %T Hessian Complexity Measure for Genetic Programming-based Imputation Predictor Selection in Symbolic Regression with Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Hu, Ting %Y Lourenco, Nuno %Y Medvet, Eric %S EuroGP 2020: Proceedings of the 23rd European Conference on Genetic Programming %S LNCS %D 2020 %8 15 17 apr %V 12101 %I Springer Verlag %C Seville, Spain %F Al-Helali:2020:EuroGP %X Missing values bring several challenges when learning from real-world data sets. Imputation is a widely adopted approach to estimating missing values. However, it has not been adequately investigated in symbolic regression. When imputing the missing values in an incomplete feature, the other features that are used in the prediction process are called imputation predictors. In this work, a method for imputation predictor selection using regularized genetic programming (GP) models is presented for symbolic regression tasks on incomplete data. A complexity measure based on the Hessian matrix of the phenotype of the evolving models is proposed. It is employed as a regularizer in the fitness function of GP for model selection and the imputation predictors are selected from the selected models. In addition to the baseline which uses all the available predictors, the proposed selection method is compared with two GP-based feature selection variations: the standard GP feature selector and GP with feature selection pressure. The trends in the results reveal that in most cases, using the predictors selected by regularized GP models could achieve a considerable reduction in the imputation error and improve the symbolic regression performance as well. %K genetic algorithms, genetic programming, Symbolic regression, Incomplete data, Feature selection, Imputation, Model complexity %R doi:10.1007/978-3-030-44094-7_1 %U http://dx.doi.org/doi:10.1007/978-3-030-44094-7_1 %P 1-17 %0 Conference Proceedings %T Genetic Programming with Noise Sensitivity for Imputation Predictor Selection in Symbolic Regression with Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Jin, Yaochu %S 2020 IEEE Congress on Evolutionary Computation, CEC 2020 %D 2020 %8 19 24 jul %I IEEE Press %C internet %F Al-Helali:2020:CEC %X This paper presents a feature selection method that incorporates a sensitivity-based single feature importance measure in a context-based feature selection approach. The single-wise importance is based on the sensitivity of the learning performance with respect to adding noise to the predictive features. Genetic programming is used as a context-based selection mechanism, where the selection of features is determined by the change in the performance of the evolved genetic programming models when the feature is injected with noise. Imputation is a key strategy to mitigate the data incompleteness problem. However, it has been rarely investigated for symbolic regression on incomplete data. In this work, an attempt to contribute to filling this gap is presented. The proposed method is applied to selecting imputation predictors (features/variables) in symbolic regression with missing values. The evaluation is performed on real-world data sets considering three performance measures: imputation accuracy, symbolic regression performance, and features’ reduction ability. Compared with the benchmark methods, the experimental evaluation shows that the proposed method can achieve an enhanced imputation, improve the symbolic regression performance, and use smaller sets of selected predictors. %K genetic algorithms, genetic programming %R doi:10.1109/CEC48606.2020.9185526 %U http://dx.doi.org/doi:10.1109/CEC48606.2020.9185526 %P paperid24344 %0 Conference Proceedings %T Multi-Tree Genetic Programming-based Transformation for Transfer Learning in Symbolic Regression with Highly Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Jin, Yaochu %S 2020 IEEE Congress on Evolutionary Computation, CEC 2020 %D 2020 %8 19 24 jul %I IEEE Press %C internet %F Al-Helali:2020:CEC2 %X Transfer learning has been considered a key solution for the problem of learning when there is a lack of knowledge in some target domains. Its idea is to benefit from the learning on different (but related in some way) domains that have adequate knowledge and transfer what can improve the learning in the target domains. Although incompleteness is one of the main causes of knowledge shortage in many machine learning real-world tasks, it has received a little effort to be addressed by transfer learning. In particular, to the best of our knowledge, there is no single study to use transfer learning for the symbolic regression task when the underlying data are incomplete. The current work addresses this point by presenting a transfer learning method for symbolic regression on data with high ratios of missing values. A multi-tree genetic programming algorithm based feature-based transformation is proposed for transferring data from a complete source domain to a different, incomplete target domain. The experimental work has been conducted on real-world data sets considering different transfer learning scenarios each is determined based on three factors: missingness ratio, domain difference, and task similarity. In most cases, the proposed method achieved positive transductive transfer learning in both homogeneous and heterogeneous domains. Moreover, even with less significant success, the obtained results show the applicability of the proposed approach for inductive transfer learning. %K genetic algorithms, genetic programming %R doi:10.1109/CEC48606.2020.9185670 %U http://dx.doi.org/doi:10.1109/CEC48606.2020.9185670 %P paperid24250 %0 Conference Proceedings %T Multi-Tree Genetic Programming for Feature Construction-Based Domain Adaptation in Symbolic Regression with Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Coello Coello, Carlos Artemio %Y Aguirre, Arturo Hernandez %Y Uribe, Josu Ceberio %Y Fabre, Mario Garza %Y Toscano Pulido, Gregorio %Y Rodriguez-Vazquez, Katya %Y Wanner, Elizabeth %Y Veerapen, Nadarajen %Y Montes, Efren Mezura %Y Allmendinger, Richard %Y Marin, Hugo Terashima %Y Wagner, Markus %Y Bartz-Beielstein, Thomas %Y Filipic, Bogdan %Y Trautmann, Heike %Y Tang, Ke %Y Koza, John %Y Goodman, Erik %Y Langdon, William B. %Y Nicolau, Miguel %Y Zarges, Christine %Y Volz, Vanessa %Y Tusar, Tea %Y Naujoks, Boris %Y Bosman, Peter A. N. %Y Whitley, Darrell %Y Solnon, Christine %Y Helbig, Marde %Y Doncieux, Stephane %Y Wilson, Dennis G. %Y Fernandez de Vega, Francisco %Y Paquete, Luis %Y Chicano, Francisco %Y Xue, Bing %Y Bacardit, Jaume %Y Mostaghim, Sanaz %Y Fieldsend, Jonathan %Y Schuetze, Oliver %Y Arnold, Dirk %Y Ochoa, Gabriela %Y Segura, Carlos %Y Cotta, Carlos %Y Emmerich, Michael %Y Zhang, Mengjie %Y Purshouse, Robin %Y Ray, Tapabrata %Y Petke, Justyna %Y Ishikawa, Fuyuki %Y Lengler, Johannes %Y Neumann, Frank %S Proceedings of the 2020 Genetic and Evolutionary Computation Conference %S GECCO ’20 %D 2020 %8 jul 8 12 %I Association for Computing Machinery %C internet %F Al-Helali:2020:GECCO %X Nowadays, transfer learning has gained a rapid popularity in tasks with limited data available. While traditional learning limits the learning process to knowledge available in a specific (target) domain, transfer learning can use parts of knowledge extracted from learning in a different (source) domain to help learning in the target domain. This concept is of special importance when there is a lack of knowledge in the target domain. Consequently, since data incompleteness is a serious cause of knowledge shortage in real-world learning tasks, it can be typically addressed using transfer learning. One way to achieve that is feature construction-based domain adaptation. However, although it is considered as a powerful feature construction algorithm, Genetic Programming has not been fully for domain adaptation. In this work, a multi-tree genetic programming method is proposed for feature construction-based domain adaptation. The main idea is to construct a transformation from the source feature space to the target feature space, which maps the source domain close to the target domain. This method is used for symbolic regression with missing values. The experimental work shows encouraging potential of the proposed approach when applied to real-world tasks considering different transfer learning scenarios. %K genetic algorithms, genetic programming, transfer tearning, incomplete data, symbolic regression %R doi:10.1145/3377930.3390160 %U https://doi.org/10.1145/3377930.3390160 %U http://dx.doi.org/doi:10.1145/3377930.3390160 %P 913-921 %0 Journal Article %T A new imputation method based on genetic programming and weighted KNN for symbolic regression with incomplete data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %J Soft Computing %D 2021 %8 apr %V 25 %N 8 %@ 1432-7643 %F DBLP:journals/soco/Al-Helali00021 %X Incompleteness is one of the problematic data quality challenges in real-world machine learning tasks. A large number of studies have been conducted for addressing this challenge. However, most of the existing studies focus on the classification task and only a limited number of studies for symbolic regression with missing values exist. a new imputation method for symbolic regression with incomplete data is proposed. The method aims to improve both the effectiveness and efficiency of imputing missing values for symbolic regression. This method is based on genetic programming (GP) and weighted K-nearest neighbors (KNN). It constructs GP-based models using other available features to predict the missing values of incomplete features. The instances used for constructing such models are selected using weighted KNN. The experimental results on real-world data sets show that the proposed method outperforms a number of state-of-the-art methods with respect to the imputation accuracy, the symbolic regression performance, and the imputation time. %K genetic algorithms, genetic programming, Symbolic regression, Incomplete data, KNN, Imputation %9 journal article %R doi:10.1007/s00500-021-05590-y %U https://doi.org/10.1007/s00500-021-05590-y %U http://dx.doi.org/doi:10.1007/s00500-021-05590-y %P 5993-6012 %0 Journal Article %T Multi-Tree Genetic Programming with New Operators for Transfer Learning in Symbolic Regression with Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %J IEEE Transactions on Evolutionary Computation %D 2021 %8 dec %V 25 %N 6 %@ 1089-778X %F Al-Helali:ieeeTEC %X Lack of knowledge is a common consequence of data incompleteness when learning from real-world data. To deal with such a situation, this work uses transfer learning to re-use knowledge from different (yet related) but complete domains. Due to its powerful feature construction ability, genetic programming is used to construct feature-based transformations that map the feature space of the source domain to that of the target domain such that their differences are reduced. Particularly, this work proposes a new multi-tree genetic programming-based feature construction approach to transfer learning in symbolic regression with missing values. It transfers knowledge related to the importance of the features and instances in the source domain to the target domain to improve the learning performance. Moreover, new genetic operators are developed to encourage minimising the distribution discrepancy between the transformed domain and the target domain. A new probabilistic crossover is developed to make the well-constructed trees in the individuals more likely to be mated than the other trees. A new mutation operator is designed to give more probability for the poorly-constructed trees to be mutated. The experimental results show that the proposed method not only achieves better performance compared with different traditional learning methods but also advances two recent transfer learning methods on real-world data sets with various incompleteness and learning scenarios. %K genetic algorithms, genetic programming, Symbolic Regression, Incomplete Data, Transfer Learning, Evolutionary Learning %9 journal article %R doi:10.1109/TEVC.2021.3079843 %U http://dx.doi.org/doi:10.1109/TEVC.2021.3079843 %P 1049-1063 %0 Conference Proceedings %T Genetic Programming-Based Selection of Imputation Methods in Symbolic Regression with Missing Values %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Gallagher, Marcus %Y Moustafa, Nour %Y Lakshika, Erandi %S AI 2020: Advances in Artificial Intelligence - 33rd Australasian Joint Conference, AI 2020, Canberra, ACT, Australia, November 29-30, 2020, Proceedings %S Lecture Notes in Computer Science %D 2020 %V 12576 %I Springer %F DBLP:conf/ausai/Al-Helali00Z20 %K genetic algorithms, genetic programming %R doi:10.1007/978-3-030-64984-5_13 %U https://doi.org/10.1007/978-3-030-64984-5_13 %U http://dx.doi.org/doi:10.1007/978-3-030-64984-5_13 %P 163-175 %0 Conference Proceedings %T GP with a Hybrid Tree-vector Representation for Instance Selection and Symbolic Regression on Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %Y Ong, Yew-Soon %S 2021 IEEE Congress on Evolutionary Computation (CEC) %D 2021 %8 28 jun 1 jul %C Krakow, Poland %F Al-Helali:2021:CEC %X Data incompleteness is a pervasive problem in symbolic regression, and machine learning in general. Unfortunately, most symbolic regression methods are only applicable when the given data is complete. One common approach to handling this situation is data imputation. It works by estimating missing values based on existing data. However, which existing data should be used for imputing the missing values? The answer to this question is important when dealing with incomplete data. To address this question, this work proposes a mixed tree-vector representation for genetic programming to perform instance selection and symbolic regression on incomplete data. In this representation, each individual has two components: an expression tree and a bit vector. While the tree component constructs symbolic regression models, the vector component selects the instances that are used to impute missing values by the weighted k-nearest neighbour (WKNN) imputation method. The complete imputed instances are then used to evaluate the GP-based symbolic regression model. The obtained experimental results show the applicability of the proposed method on real-world data sets with different missingness scenarios. When compared with existing methods, the proposed method not only produces more effective symbolic regression models but also achieves more efficient imputations. %K genetic algorithms, genetic programming, Computational modeling, Machine learning, Evolutionary computation, Regression tree analysis, Symbolic Regression, Incomplete Data, Imputation, Instance Selection %R doi:10.1109/CEC45853.2021.9504767 %U http://dx.doi.org/doi:10.1109/CEC45853.2021.9504767 %P 604-611 %0 Journal Article %T Genetic Programming for Feature Selection Based on Feature Removal Impact in High-Dimensional Symbolic Regression %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %J IEEE Transactions on Emerging Topics in Computational Intelligence %@ 2471-285X %F Al-Helali:ETCI %O Early access %X Symbolic regression is increasingly important for discovering mathematical models for various prediction tasks. It works by searching for the arithmetic expressions that best represent a target variable using a set of input features. However, as the number of features increases, the search process becomes more complex. To address high-dimensional symbolic regression, this work proposes a genetic programming for feature selection method based on the impact of feature removal on the performance of SR models. Unlike existing Shapely value methods that simulate feature absence at the data level, the proposed approach suggests removing features at the model level. This approach circumvents the production of unrealistic data instances, which is a major limitation of Shapely value and permutation-based methods. Moreover, after calculating the importance of the features, a cut-off strategy, which works by injecting a number of random features and using their importance to automatically set a threshold, is proposed for selecting important features. The experimental results on artificial and real-world high-dimensional data sets show that, compared with state-of-the-art feature selection methods using the permutation importance and Shapely value, the proposed method not only improves the SR accuracy but also selects smaller sets of features. %K genetic algorithms, genetic programming, Feature extraction, Data models, Computational modelling, Task analysis, Predictive models, Machine learning, Feature selection, high dimensionality, symbolic regression %9 journal article %R doi:10.1109/TETCI.2024.3369407 %U http://dx.doi.org/doi:10.1109/TETCI.2024.3369407 %0 Journal Article %T Multitree Genetic Programming With Feature-Based Transfer Learning for Symbolic Regression on Incomplete Data %A Al-Helali, Baligh %A Chen, Qi %A Xue, Bing %A Zhang, Mengjie %J IEEE Transactions on Cybernetics %@ 2168-2275 %F Al-Helali:CYB %O Early access %X Data incompleteness is a serious challenge in real-world machine-learning tasks. Nevertheless, it has not received enough attention in symbolic regression (SR). Data missingness exacerbates data shortage, especially in domains with limited available data, which in turn limits the learning ability of SR algorithms. Transfer learning (TL), which aims to transfer knowledge across tasks, is a potential solution to solve this issue by making amends for the lack of knowledge. However, this approach has not been adequately investigated in SR. To fill this gap, a multitree genetic programming-based TL method is proposed in this work to transfer knowledge from complete source domains (SDs) to incomplete related target domains (TDs). The proposed method transforms the features from a complete SD to an incomplete TD. However, having many features complicates the transformation process. To mitigate this problem, we integrate a feature selection mechanism to eliminate unnecessary transformations. The method is examined on real-world and synthetic SR tasks with missing values to consider different learning scenarios. The obtained results not only show the effectiveness of the proposed method but also show its training efficiency compared with the existing TL methods. Compared to state-of-the-art methods, the proposed method reduced an average of more than 2.58percent and 4percent regression error on heterogeneous and homogeneous domains, respectively. %K genetic algorithms, genetic programming, Task analysis, Feature extraction, Data models, Transfer learning, Contracts, Adaptation models, Routing, incomplete data, symbolic regression (SR), transfer learning (TL) %9 journal article %R doi:10.1109/TCYB.2023.3270319 %U http://dx.doi.org/doi:10.1109/TCYB.2023.3270319 %0 Thesis %T Itemset size-sensitive interestingness measures for association rule mining and link prediction %A Aljandal, Waleed A. %D 2009 %8 may %C Manhattan, Kansas, USA %C Department of Computing and Information Sciences, Kansas State University %F WaleedAljandal2009 %X Association rule learning is a data mining technique that can capture relationships between pairs of entities in different domains. The goal of this research is to discover factors from data that can improve the precision, recall, and accuracy of association rules found using interestingness measures and frequent itemset mining. Such factors can be calibrated using validation data and applied to rank candidate rules in domain-dependent tasks such as link existence prediction. In addition, I use interestingness measures themselves as numerical features to improve link existence prediction. The focus of this dissertation is on developing and testing an analytical framework for association rule interestingness measures, to make them sensitive to the relative size of itemsets. I survey existing interestingness measures and then introduce adaptive parametric models for normalizing and optimizing these measures, based on the size of itemsets containing a candidate pair of co-occurring entities. The central thesis of this work is that in certain domains, the link strength between entities is related to the rarity of their shared memberships (i.e., the size of itemsets in which they co-occur), and that a data-driven approach can capture such properties by normalizing the quantitative measures used to rank associations. To test this hypothesis under different levels of variability in itemset size, I develop several test bed domains, each containing an association rule mining task and a link existence prediction task. The definitions of itemset membership and link existence in each domain depend on its local semantics. My primary goals are: to capture quantitative aspects of these local semantics in normalization factors for association rule interestingness measures; to represent these factors as quantitative features for link existence prediction, to apply them to significantly improve precision and recall in several real-world domains; and to build an experimental framework for measuring this improvement, using information theory and classification-based validation. %K genetic algorithms, data Mining, Association Rule, Interestingness Measures, Link Prediction %9 Ph.D. thesis %U https://krex.k-state.edu/dspace/handle/2097/1245 %0 Journal Article %T Thunderstorms Prediction using Genetic Programming %A Al-Jundi, Ruba %A Yasen, Mais %A Al-Madi, Nailah %J International Journal of Information Systems and Computer Sciences %D 2018 %V 7 %N 1 %I WARSE %@ 2319-7595 %F ThunderStormGP %O Special Issue of ICSIC 2017, Held during 23-24 September 2017 in Amman Arab University, Amman, Jordan %X Thunderstorms prediction is a major challenge for efficient flight planning and air traffic management. As the inaccurate forecasting of weather poses a danger to aviation, it increases the need to build a good prediction model. Genetic Programming (GP) is one of the evolutionary computation techniques that is used for classification process. Genetic Programming has proven its efficiency especially for dynamic and nonlinear classification. This research proposes a thunderstorm prediction model that makes use of Genetic Programming and takes real data of Lake Charles Airport (LCH) as a case study. The proposed model is evaluated using different metrics such as recall, F-measure and compared with other well-known classifiers. The results show that Genetic Programming got higher recall value of predicting thunderstorms in comparison with the other classifiers. %K genetic algorithms, genetic programming, Evolutionary computation, Machine Learning, Weather Prediction. %9 journal article %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/al-madi/Thunderstorm_Prediction.pdf %0 Conference Proceedings %T Adaptive genetic programming applied to classification in data mining %A Al-Madi, N. %A Ludwig, S. A. %S Proceedings of the Fourth World Congress on Nature and Biologically Inspired Computing, NaBIC 2012 %D 2012 %F Al-Madi:2012:NaBIC %X Classification is a data mining method that assigns items in a collection to target classes with the goal to accurately predict the target class for each item in the data. Genetic programming (GP) is one of the effective evolutionary computation techniques to solve classification problems, however, it suffers from a long run time. In addition, there are many parameters that need to be set before the GP is run. In this paper, we propose an adaptive GP that automatically determines the best parameters of a run, and executes the classification faster than standard GP. This adaptive GP has three variations. The first variant consists of an adaptive selection process ensuring that the produced solutions in the next generation are better than the solutions in the previous generation. The second variant adapts the crossover and mutation rates by modifying the probabilities ensuring that a solution with a high fitness is protected. And the third variant is an adaptive function list that automatically changes the functions used by deleting the functions that do not favourably contribute to the classification. These proposed variations were implemented and compared to the standard GP. The results show that a significant speedup can be achieved by obtaining similar classification accuracies. %K genetic algorithms, genetic programming, data mining, pattern classification, adaptive GP, adaptive genetic programming, classification accuracies, crossover rates, data mining, mutation rates, Accuracy, Evolutionary computation, Sociology, Standards, Statistics, Adaptive Genetic Programming, Classification, Evolutionary Computation %R doi:10.1109/NaBIC.2012.6402243 %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/al-madi/Adaptive_Genetic_Programming_applied_to_Classification_in_Data_Mining.pdf %U http://dx.doi.org/doi:10.1109/NaBIC.2012.6402243 %P 79-85 %0 Conference Proceedings %T Improving genetic programming classification for binary and multiclass datasets %A Al-Madi, Nailah %A Ludwig, Simone A. %Y Hammer, Barbara %Y Zhou, Zhi-Hua %Y Wang, Lipo %Y Chawla, Nitesh %S IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2013 %D 2013 %8 16 19 apr %C Singapore %F Al-Madi:2013:SSCI %X Genetic Programming (GP) is one of the evolutionary computation techniques that is used for the classification process. GP has shown that good accuracy values especially for binary classifications can be achieved, however, for multiclass classification unfortunately GP does not obtain high accuracy results. In this paper, we propose two approaches in order to improve the GP classification task. One approach (GP-K) uses the K-means clustering technique in order to transform the produced value of GP into class labels. The second approach (GP-D) uses a discretization technique to perform the transformation. A comparison of the original GP, GP-K and GP-D was conducted using binary and multiclass datasets. In addition, a comparison with other state-of-the-art classifiers was performed. The results reveal that GP-K shows good improvement in terms of accuracy compared to the original GP, however, it has a slightly longer execution time. GP-D also achieves higher accuracy values than the original GP as well as GP-K, and the comparison with the state-of-the-art classifiers reveal competitive accuracy values. %K genetic algorithms, genetic programming, Evolutionary Computation, Classification, Multiclass, Binary Classification %R doi:10.1109/CIDM.2013.6597232 %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/al-madi/improving_GP.pdf %U http://dx.doi.org/doi:10.1109/CIDM.2013.6597232 %P 166-173 %0 Conference Proceedings %T Segment-based genetic programming %A Al-Madi, Nailah %A Ludwig, Simone A. %Y Blum, Christian %Y Alba, Enrique %Y Bartz-Beielstein, Thomas %Y Loiacono, Daniele %Y Luna, Francisco %Y Mehnen, Joern %Y Ochoa, Gabriela %Y Preuss, Mike %Y Tantar, Emilia %Y Vanneschi, Leonardo %Y McClymont, Kent %Y Keedwell, Ed %Y Hart, Emma %Y Sim, Kevin %Y Gustafson, Steven %Y Vladislavleva, Ekaterina %Y Auger, Anne %Y Bischl, Bernd %Y Brockhoff, Dimo %Y Hansen, Nikolaus %Y Mersmann, Olaf %Y Posik, Petr %Y Trautmann, Heike %Y Iqbal, Muhammad %Y Shafi, Kamran %Y Urbanowicz, Ryan %Y Wagner, Stefan %Y Affenzeller, Michael %Y Walker, David %Y Everson, Richard %Y Fieldsend, Jonathan %Y Stonedahl, Forrest %Y Rand, William %Y Smith, Stephen L. %Y Cagnoni, Stefano %Y Patton, Robert M. %Y Pappa, Gisele L. %Y Woodward, John %Y Swan, Jerry %Y Krawiec, Krzysztof %Y Tantar, Alexandru-Adrian %Y Bosman, Peter A. N. %Y Vega-Rodriguez, Miguel %Y Chaves-Gonzalez, Jose M. %Y Gonzalez-Alvarez, David L. %Y Santander-Jimenez, Sergio %Y Spector, Lee %Y Keijzer, Maarten %Y Holladay, Kenneth %Y Tusar, Tea %Y Naujoks, Boris %S GECCO ’13 Companion: Proceeding of the fifteenth annual conference companion on Genetic and evolutionary computation conference companion %D 2013 %8 June 10 jul %I ACM %C Amsterdam, The Netherlands %F AL-Madi:2013:GECCOcomp %X Genetic Programming (GP) is one of the successful evolutionary computation techniques applied to solve classification problems, by searching for the best classification model applying the fitness evaluation. The fitness evaluation process greatly impacts the overall execution time of GP and is therefore the focus of this research study. This paper proposes a segment-based GP (SegGP) technique that reduces the execution time of GP by partitioning the dataset into segments, and using the segments in the fitness evaluation process. Experiments were done using four datasets and the results show that SegGP can obtain higher or similar accuracy results in shorter execution time compared to standard GP. %K genetic algorithms, genetic programming %R doi:10.1145/2464576.2464648 %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/al-madi/Segment-Based_Genetic_Programming.pdf %U http://dx.doi.org/doi:10.1145/2464576.2464648 %P 133-134 %0 Conference Proceedings %T Scaling Genetic Programming for Data Classification using MapReduce Methodology %A Al-Madi, Nailah %A Ludwig, Simone A. %Y Ludwig, Simone %Y Melin, Patricia %Y Abraham, Ajith %Y Madureira, Ana Maria %Y Nygard, Kendall %Y Castillo, Oscar %Y Muda, Azah Kamilah %Y Ma, Kun %Y Corchado, Emilio %S 5th World Congress on Nature and Biologically Inspired Computing %D 2013 %8 December 14 aug %I IEEE %C Fargo, USA %F Al-Madi:2013:nabic %X Genetic Programming (GP) is an optimisation method that has proved to achieve good results. It solves problems by generating programs and applying natural operations on these programs until a good solution is found. GP has been used to solve many classifications problems, however, its drawback is the long execution time. When GP is applied on the classification task, the execution time proportionally increases with the dataset size. Therefore, to manage the long execution time, the GP algorithm is parallelised in order to speed up the classification process. Our GP is implemented based on the MapReduce methodology (abbreviated as MRGP), in order to benefit from the MapReduce concept in terms of fault tolerance, load balancing, and data locality. MRGP does not only accelerate the execution time of GP for large datasets, it also provides the ability to use large population sizes, thus finding the best result in fewer numbers of generations. MRGP is evaluated using different population sizes ranging from 1,000 to 100,000 measuring the accuracy, scalability, and speedup %K genetic algorithms, genetic programming, Evolutionary computation, data classification, Parallel Processing, MapReduce, Hadoop %R doi:10.1109/NaBIC.2013.6617851 %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/al-madi/MRGP.pdf %U http://dx.doi.org/doi:10.1109/NaBIC.2013.6617851 %P 132-139 %0 Thesis %T Improved genetic programming techniques for data classification %A Al-Madi, Nailah Shikri %D 2013 %8 dec %C Fargo, North Dakota, USA %C Computer Science, North Dakota State University %F Al-Madi:thesis %X Evolutionary algorithms are one category of optimisation techniques that are inspired by processes of biological evolution. Evolutionary computation is applied to many domains and one of the most important is data mining. Data mining is a relatively broad field that deals with the automatic knowledge discovery from databases and it is one of the most developed fields in the area of artificial intelligence. Classification is a data mining method that assigns items in a collection to target classes with the goal to accurately predict the target class for each item in the data. Genetic programming (GP) is one of the effective evolutionary computation techniques to solve classification problems. GP solves classification problems as an optimization tasks, where it searches for the best solution with highest accuracy. However, GP suffers from some weaknesses such as long execution time, and the need to tune many parameters for each problem. Furthermore, GP can not obtain high accuracy for multiclass classification problems as opposed to binary problems. In this dissertation, we address these drawbacks and propose some approaches in order to overcome them. Adaptive GP variants are proposed in order to automatically adapt the parameter settings and shorten the execution time. Moreover, two approaches are proposed to improve the accuracy of GP when applied to multiclass classification problems. In addition, a Segment-based approach is proposed to accelerate the GP execution time for the data classification problem. Furthermore, a parallelisation of the GP process using the MapReduce methodology was proposed which aims to shorten the GP execution time and to provide the ability to use large population sizes leading to a faster convergence. The proposed approaches are evaluated using different measures, such as accuracy, execution time, sensitivity, specificity, and statistical tests. Comparisons between the proposed approaches with the standard GP, and with other classification techniques were performed, and the results showed that these approaches overcome the drawbacks of standard GP by successfully improving the accuracy and execution time. %K genetic algorithms, genetic programming, Artificial intelligence, Computer science, Applied sciences, Data classification, Data mining, MRGP %9 Ph.D. thesis %U https://library.ndsu.edu/ir/handle/10365/27097 %0 Journal Article %T Mike Preuss: Multimodal optimization by means of evolutionary algorithms %A Al-Madi, Nailah %J Genetic Programming and Evolvable Machines %D 2016 %8 sep %V 17 %N 3 %@ 1389-2576 %F Al-Madi:2016:GPEM %O Book review %K genetic algorithms %9 journal article %R doi:10.1007/s10710-016-9272-x %U https://rdcu.be/dR8cf %U http://dx.doi.org/doi:10.1007/s10710-016-9272-x %P 315-316 %0 Journal Article %T Genetic Programming Approach to Hierarchical Production Rule Discovery %A Al-Maqaleh, Basheer M. %A Bharadwaj, Kamal K. %J International Science Index %D 2007 %V 1 %N 11 %I World Academy of Science, Engineering and Technology %@ 1307-6892 %G en %F Al-Maqaleh:2007:isi %X Automated discovery of hierarchical structures in large data sets has been an active research area in the recent past. This paper focuses on the issue of mining generalised rules with crisp hierarchical structure using Genetic Programming (GP) approach to knowledge discovery. The post-processing scheme presented in this work uses flat rules as initial individuals of GP and discovers hierarchical structure. Suitable genetic operators are proposed for the suggested encoding. Based on the Subsumption Matrix(SM), an appropriate fitness function is suggested. Finally, Hierarchical Production Rules (HPRs) are generated from the discovered hierarchy. Experimental results are presented to demonstrate the performance of the proposed algorithm. %K genetic algorithms, genetic programming, hierarchy, knowledge discovery in database, subsumption matrix. k %9 journal article %U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.308.1481 %P 531-534 %0 Conference Proceedings %T Genetic Algorithm Approach to Automated Discovery of Comprehensible Production Rules %A Al-Maqaleh, Basheer Mohamad Ahmad %S Second International Conference on Advanced Computing Communication Technologies (ACCT 2012) %D 2012 %8 jan %F Al-Maqaleh:2012:ACCT %X In the recent past, there has been an increasing interest in applying evolutionary methods to Knowledge Discovery in Databases (KDD) and a number of successful applications of Genetic Algorithms (GA) and Genetic Programming (GP) to KDD have been demonstrated. The most predominant representation of the discovered knowledge is the standard Production Rules (PRs) in the form If P Then D. This paper presents a classification algorithm based on GA approach that discovers comprehensible rules in the form of PRs. The proposed approach has flexible chromosome encoding, where each chromosome corresponds to a PR. For the proposed scheme a suitable and effective fitness function and appropriate genetic operators are proposed for the suggested representation. Experimental results are presented to demonstrate the performance of the proposed algorithm. %K genetic algorithms, GA, KDD, PR, automated discovery, chromosome encoding, comprehensible production rules, genetic algorithm approach, genetic operators, knowledge discovery in databases, production rules, data mining, database management systems %R doi:10.1109/ACCT.2012.57 %U http://dx.doi.org/doi:10.1109/ACCT.2012.57 %P 69-71 %0 Journal Article %T A New Software Reliability Growth Model: Genetic-Programming-Based Approach %A Al-Rahamneh, Zainab %A Reyalat, Mohammad %A Sheta, Alaa F. %A Bani-Ahmad, Sulieman %A Al-Oqeili, Saleh %J Journal of Software Engineering and Applications %D 2011 %8 aug %V 4 %N 8 %I Scientific Research Publishing %@ 19453116 %G eng %F Al-Rahamneh:2011:JSEA %X A variety of Software Reliability Growth Models (SRGM) have been presented in literature. These models suffer many problems when handling various types of project. The reason is; the nature of each project makes it difficult to build a model which can generalise. In this paper we propose the use of Genetic Programming (GP) as an evolutionary computation approach to handle the software reliability modelling problem. GP deals with one of the key issues in computer science which is called automatic programming. The goal of automatic programming is to create, in an automated way, a computer program that enables a computer to solve problems. GP will be used to build a SRGM which can predict accumulated faults during the software testing process. We evaluate the GP developed model and compare its performance with other common growth models from the literature. Our experiments results show that the proposed GP model is superior compared to Yamada S-Shaped, Generalised Poisson, NHPP and Schneidewind reliability models. %K genetic algorithms, genetic programming, SBSE, software reliability, modelling, software faults %9 journal article %R doi:10.4236/jsea.2011.48054 %U http://www.scirp.org/journal/PaperDownload.aspx?DOI=10.4236/jsea.2011.48054 %U http://dx.doi.org/doi:10.4236/jsea.2011.48054 %P 476-481 %0 Conference Proceedings %T Hybrid Multi-Agent Architecture (HMAA) for meeting scheduling %A Al-Ratrout, Serein %A Siewe, Francois %A Al-Dabbas, Omar %A Al-Fawair, Mai %S 2010 7th International Multi- Conference on Systems, Signals and Devices %D 2010 %8 27 30 jun %I IEEE %C Amman, Jordan %G en %F Al-Ratrout:2010:SSD %X This paper presents a novel multi-agent architecture for meeting scheduling. The proposed architecture is a new Hybrid Multi-Agent Architecture (HMAA) that generates new heuristics for solving NP-hard problems. Moreover, the paper investigates the feasibility of running computationally intensive algorithms on multi-agent architectures while preserving the ability of small agents to run on small devices, including mobile devices. Three experimental groups are conducted in order to test the feasibility of the proposed architecture. The results show that the performance of the proposed architecture is better than those of many existing meeting scheduling frameworks. Moreover, it has been proved that HMAA preserves small agents’ mobility (i.e. the ability to run on small devices) while implementing evolutionary algorithms. %K genetic algorithms, genetic programming, multiagent, meeting scheduling, heuristic %R doi:10.1109/SSD.2010.5585505 %U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.1011.3891 %U http://dx.doi.org/doi:10.1109/SSD.2010.5585505 %0 Journal Article %T Employing Gene Expression Programming in Estimating Software Effort %A Al-Saati, Najla Akram %A Al-Reffaee, Taghreed Riyadh %J International Journal of Computer Applications %D 2018 %8 aug %V 182 %N 8 %I Foundation of Computer Science (FCS), NY, USA %C New York, USA %@ 0975-8887 %F Al-Saati:2018:IJCA %X The problem of estimating the effort for software packages is one of the most significant challenges encountering software designers. The precision in estimating the effort or cost can have a huge impact on software development. Various methods have been investigated in order to discover good enough solutions to this problem; lately evolutionary intelligent techniques are explored like Genetic Algorithms, Genetic Programming, Neural Networks, and Swarm Intelligence. In this work, Gene Expression Programming (GEP) is investigated to show its efficiency in acquiring equations that best estimates software effort. Datasets employed are taken from previous projects. The comparisons of learning and testing results are carried out with COCOMO, Analogy, GP and four types of Neural Networks, all show that GEP outperforms all these methods in discovering effective functions for the estimation with robustness and efficiency. %K genetic algorithms, genetic programming, Gene Expression Programming, Effort Estimation, Software Engineering, Artificial Intelligence %9 journal article %R doi:10.5120/ijca2018917619 %U http://www.ijcaonline.org/archives/volume182/number8/29837-2018917619 %U http://dx.doi.org/doi:10.5120/ijca2018917619 %P 1-8 %0 Generic %T Applying Gene Expression Programming for Solving One-Dimensional Bin-Packing Problems %A Al-Saati, Najla Akram %D 2020 %8 nov %I arXiv %F journals/corr/abs-2001-09923 %K genetic algorithms, genetic programming, gene expression programming %U https://arxiv.org/abs/2001.09923 %0 Conference Proceedings %T A genetic programming approach to feature selection and construction for ransomware, phishing and spam detection %A Al-Sahaf, Harith. %A Welch, Ian %Y Allmendinger, Richard %Y Cotta, Carlos %Y Doerr, Carola %Y Oliveto, Pietro S. %Y Weise, Thomas %Y Zamuda, Ales %Y Auger, Anne %Y Brockhoff, Dimo %Y Hansen, Nikolaus %Y Tusar, Tea %Y Varelas, Konstantinos %Y Camacho-Fernandez, David %Y Vasile, Massimiliano %Y Riccardi, Annalisa %Y Derbel, Bilel %Y Li, Ke %Y Li, Xiaodong %Y Zapotecas, Saul %Y Zhang, Qingfu %Y Akman, Ozgur %Y Alyahya, Khulood %Y Branke, Juergen %Y Fieldsend, Jonathan %Y Chugh, Tinkle %Y Hakanen, Jussi %Y Ceberio Uribe, Josu %Y Santucci, Valentino %Y Baioletti, Marco %Y McCall, John %Y Hart, Emma %Y Tauritz, Daniel R. %Y Woodward, John R. %Y Nakayama, Koichi %Y Oshima, Chika %Y Wagner, Stefan %Y Affenzeller, Michael %Y Osaba, Eneko %Y Del Ser, Javier %Y Kerschke, Pascal %Y Naujoks, Boris %Y Volz, Vanessa %Y Esparcia-Alcazar, Anna I. %Y Alshammari, Riyad %Y Hemberg, Erik %Y Makanju, Tokunbo %Y Alexander, Brad %Y Haraldsson, Saemundur O. %Y Wagner, Markus %Y Fernandez Alzueta, Silvino %Y Valledor Pellicer, Pablo %Y Stuetzle, Thomas %Y Walker, David %Y Johns, Matt %Y Ross, Nick %Y Keedwell, Ed %Y Nakata, Masaya %Y Stein, Anthony %Y Tatsumi, Takato %Y Veerapen, Nadarajen %Y Liefooghe, Arnaud %Y Verel, Sebastien %Y Ochoa, Gabriela %Y Smith, Stephen %Y Cagnoni, Stefano %Y Patton, Robert M. %Y La Cava, William %Y Olson, Randal %Y Orzechowski, Patryk %Y Urbanowicz, Ryan %Y Oyama, Akira %Y Shimoyama, Koji %Y Singh, Hemant Kumar %Y Chiba, Kazuhisa %Y Palar, Pramudita Satria %Y Rahat, Alma %Y Everson, Richard %Y Wang, Handing %Y Jin, Yaochu %Y Gallagher, Marcus %Y Preuss, Mike %Y Teytaud, Olivier %Y Lezama, Fernando %Y Soares, Joao %Y Vale, Zita %S GECCO ’19: Proceedings of the Genetic and Evolutionary Computation Conference Companion %D 2019 %8 13 17 jul %I ACM %C Prague, Czech Republic %F Al-Sahaf:2019:GECCOcomp %K genetic algorithms, genetic programming %R doi:10.1145/3319619.3322083 %U http://dx.doi.org/doi:10.1145/3319619.3322083 %P 332-333 %0 Journal Article %T A survey on evolutionary machine learning %A Al-Sahaf, Harith %A Bi, Ying %A Chen, Qi %A Lensen, Andrew %A Mei, Yi %A Sun, Yanan %A Tran, Binh %A Xue, Bing %A Zhang, Mengjie %J Journal of the Royal Society of New Zealand %D 2019 %V 49 %N 2 %I Taylor & Francis %F Al-Sahaf:2019:JRSNZ %O The 2019 Annual Collection of Reviews %X Artificial intelligence (AI) emphasises the creation of intelligent machines/systems that function like humans. AI has been applied to many real-world applications. Machine learning is a branch of AI based on the idea that systems can learn from data, identify hidden patterns, and make decisions with little/minimal human intervention. Evolutionary computation is an umbrella of population-based intelligent/learning algorithms inspired by nature, where New Zealand has a good international reputation. This paper provides a review on evolutionary machine learning, i.e. evolutionary computation techniques for major machine learning tasks such as classification, regression and clustering, and emerging topics including combinatorial optimisation, computer vision, deep learning, transfer learning, and ensemble learning. The paper also provides a brief review of evolutionary learning applications, such as supply chain and manufacturing for milk/dairy, wine and seafood industries, which are important to New Zealand. Finally, the paper presents current issues with future perspectives in evolutionary machine learning. %K genetic algorithms, genetic programming, TPOT, AI, ANN, EML, GPU, EMO, autoML, artificial intelligence, machine learning, evolutionary computation, classification, regression, clustering, combinatorial optimisation, deep learning, transfer learning, ensemble learning %9 journal article %R doi:10.1080/03036758.2019.1609052 %U https://doi.org/10.1080/03036758.2019.1609052 %U http://dx.doi.org/doi:10.1080/03036758.2019.1609052 %P 205-228 %0 Conference Proceedings %T The Influence of Input Data Standardization Methods on the Prediction Accuracy of Genetic Programming Generated Classifiers %A Al Shorman, Amaal R. %A Faris, Hossam %A Castillo, Pedro A. %A Guervos, Juan Julian Merelo %A Al-Madi, Nailah %Y Sabourin, Christophe %Y Guervos, Juan Julian Merelo %Y Linares-Barranco, Alejandro %Y Madani, Kurosh %Y Warwick, Kevin %S Proceedings of the 10th International Joint Conference on Computational Intelligence, IJCCI 2018, Seville, Spain, September 18-20, 2018 %D 2018 %I SciTePress %F DBLP:conf/ijcci/ShormanFCGA18 %K genetic algorithms, genetic programming %R doi:10.5220/0006959000790085 %U https://doi.org/10.5220/0006959000790085 %U http://dx.doi.org/doi:10.5220/0006959000790085 %P 79-85 %0 Thesis %T Multi-objective search-based approach for software project management %A Al-Zubaidi, Wisam Haitham Abbood %D 2019 %8 31 mar %C Wollongong, NSW 2522, Australia %C University of Wollongong %F Al-Zubaidi:thesis %X Project management covers the entire lifecycle of software, underpinning the success or failure of many software projects. Managing modern software projects often follows the incremental and iterative process where a software product is incrementally developed through a number of iterations. In each iteration, the development team needs to complete a number of issues, each of which can be implementing a new feature for the software, modifying an existing functionality, fixing a bug or conducting some other project tasks. Although this agile approach reduces the risk of project failures, managing projects at the level of issues and iterations is still highly difficult due to the inherent dynamic nature of software, especially in large-scale software projects. Challenges in this context can be in many forms such as making accurate estimations of the resolution time and effort of resolving issues or selecting suitable issues for upcoming iterations. These integral parts of planning is highly challenging since many factors need considering such as customer business value and the team historical estimations, capability and performance. Challenges also exist at the implementation level, such as managing the reviewing of code changes made to resolve issues. There is currently a serious lack of automated support which help project managers and software development teams address those challenges. This thesis aims to fill those gaps. We leverage a huge amount of historical data in software projects to generate valuable insight for dealing with those challenges in managing iterations and issues. We reformulate those project management problems as search-based optimization problems and employ a range of evolutionary meta-heuristics search techniques to solve them. The search is simultaneously guided by a number of multiple fitness functions that express different objectives (e.g. customer business value, developer expertise and workload, and complexity of estimation models) and constraints (e.g. a team historical capability and performance) in the context of modern software projects. Using this approach, we build novel models for estimating issue resolution time and effort, suggesting appropriate issues for upcoming iterations in iteration planning and recommending suitable reviewers for code changes made to resolve issues. An extensive empirical evaluation on a range of large software projects (including Mesos, Usergrid, Aurora, Slider, Kylin, Mahout, Common, Hdfs, MapReduce, Yarn, Apstud, Mule, Dnn, Timob, Tisud, Xd, Nexus, Android, LibreOffice, Qt, and Openstack) demonstrates the highly effective performance of our approach against other alternative techniques (improvement between 1.83 to 550 percent) to show the effectiveness of our approach. %K genetic algorithms, genetic programming, SBSE, Iteration Planning, Agile Development, Effort Estimation, MOGP %9 Ph.D. thesis %U https://ro.uow.edu.au/theses1/690/ %0 Conference Proceedings %T Intrinsic Evolution of Large Digital Circuits Using a Modular Approach %A Alagesan, Shri Vidhya %A Kannan, Sruthi %A Shanthi, G. %A Shanthi, A. P. %A Parthasarathi, Ranjani %S NASA/ESA Conference on Adaptive Hardware and Systems, AHS ’08 %D 2008 %8 jun %F Alagesan:2008:AHS %X This work pioneers a generic and flexible approach to intrinsically evolve large digital circuits. One of the popular ways of handling the scalability problem prevalent in evolvable hardware (EHW) and evolve large circuits is to partition the circuit, evolve the individual partitions and then compact them. However, as the partition sizes become larger, this method also fails. This drawback is overcome by the modular developmental Cartesian genetic programming (MDCGP) technique, which still uses partitioning, but augments it further with horizontal and vertical reuse. The results obtained are promising and show that there is 100percent evolvability for 128-bit partitions, the largest partitions evolved so far. The fitness evaluation for the evolved partitions is done by downloading them onto Xilinx Virtex II Pro board. This work is the first step towards the development of a flexible evolvable framework which harnesses the power of hardware for the time consuming fitness evaluation and at the same time provides flexibility by carrying out the other parts using the easily modifiable software platform. %K genetic algorithms, genetic programming, Cartesian genetic programming, Xilinx Virtex II Pro board, evolvable hardware, large digital circuits, modular approach, modular developmental Cartesian genetic programming, scalability problem, software platform, time consuming fitness evaluation, digital circuits %R doi:10.1109/AHS.2008.52 %U http://dx.doi.org/doi:10.1109/AHS.2008.52 %P 19-26 %0 Journal Article %T Hybrid approach of using bi-objective genetic programming in well control optimization of waterflood management %A Al-Aghbari, Mohammed %A M. Gujarathi, Ashish %J Geoenergy Science and Engineering %D 2023 %V 228 %@ 2949-8910 %F ALAGHBARI:2023:geoen %X A new hybrid optimization approach is proposed by applying bi-objective genetic programming (BioGP) algorithm along with NSGA-II algorithm to expand the diversity of the Pareto solutions and speed up the convergence. The novel methodology is used in two distinct cases: the benchmark model for the Brugge field and a Middle Eastern oil-field sector model. The Brugge field includes twenty producing wells and ten injecting wells, but the real sector model has three injectors and four producers. The two primary objectives applied are to optimize the total volume of produced oil and reduce cumulative produced water. In the optimization process, the injection rate (qwi) and the bottom-hole pressure (BHP) are the control parameters for injection and producing wells, respectively. The hybrid technique of applying BioGP guided NSGA-II in the Brugge field model demonstrated a 50percent acceleration in the convergence speed when compared to the NSGA-II solution. The calculated Pareto solutions for the Middle-Eastern sector model by the proposed methodology at various generations exhibited better diversity and convergence in comparison to the NSGA-II solutions. The highest cumulative produced oil of 550.45 times 103 m3 is obtained by the proposed hybrid methodology in comparison to the NSGA-II’s highest cumulative of 522 times 103 m3. The two solution points A’ and B’ achieved using the BioGP guided NSGA-II have lower WOR by 17percent and 15percent, respectively, than A and B solutions established by NSGA-II alone. Pareto solution ranking is performed using the net flow method (NFM) and the best optimum solution determined for BioGP guided NSGA-II is 532.38 times 103 m3 oil using equal-based weight compared to 505.44 times 103 m3 using the entropy-based weights of 41percent oil & 59percent water. Overall, the optimal Pareto solutions achieved by the proposed methodology of using BioGP guided NSGA-II algorithm has better diversity with improvement in convergence speed in comparison to the NSGA-II %K genetic algorithms, genetic programming, Multi-objective optimization, Bi-objective genetic programming, BioGP, NSGA-II, Net-flow method, Waterflood optimization, Reservoir simulation %9 journal article %R doi:10.1016/j.geoen.2023.211967 %U https://www.sciencedirect.com/science/article/pii/S2949891023005547 %U http://dx.doi.org/doi:10.1016/j.geoen.2023.211967 %P 211967 %0 Conference Proceedings %T Detection and Quantitative Prediction of Diplocarpon earlianum Infection Rate in Strawberry Leaves using Population-based Recurrent Neural Network %A Alajas, Oliver John %A Concepcion, Ronnie %A Bandala, Argel %A Sybingco, Edwin %A Vicerra, Ryan Rhay %A Dadios, Elmer P. %A Mendigoria, Christan Hail %A Aquino, Heinrick %A Ambata, Leonard %A Duarte, Bernardo %S 2022 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) %D 2022 %8 jun %F Alajas:2022:IEMTRONICS %X Fragaria ananassa, a member of the rose family’s flowering plants, commonly recognized as strawberry, is prone to Diplocarpon earlianum infection that causes leaf scorch. Assessment via visual inspection of strawberries by farmers is normally ineffective, destructive, and laborious. To address this challenge, the use of integrated computer vision and machine learning techniques was done to classify a healthy from a scorch-infected strawberry leaf image and to estimate the leaf region infection rate (LRIR). A dataset made up of 204 normally healthy and 161 scorch-infected strawberry leaf images was used. Images were initially preprocessed and segmented via graph-cut segmentation to extract the region of interest for feature extraction and selection. The hybrid combination of neighborhood and principal component analysis (NCA-PCA) was used to select desirable features. Multigene genetic programming (MGGP) was used to formulate the fitness function that will be essential for determining the optimized neuron configurations of the recurrent neural network (RNN) through genetic algorithm (GA), and cuckoo search algorithm (CSA), and artificial bee colony (ABC). Four classification machine learning models were configured in which the classification tree (CTree) bested other detection models with an accuracy of 10percent and exhibited the shortest inference time of 14.746 s. The developed ABC-RNN3 model outperformed GA-RNN3 and CSA-RNN3 in performing non-invasive LRIR prediction with an R2 value of 0.948. With the use of the NCA-PCA-CTree3-ABC-RNN3 hybrid model, for crop disease detection and infection rate prediction, plant disease assessment proved to be more efficient and labor cost-effective than manual disease inspection methods. %K genetic algorithms, genetic programming %R doi:10.1109/IEMTRONICS55184.2022.9795744 %U http://dx.doi.org/doi:10.1109/IEMTRONICS55184.2022.9795744 %0 Conference Proceedings %T Grape Phaeomoniella chlamydospora Leaf Blotch Recognition and Infected Area Approximation Using Hybrid Linear Discriminant Analysis and Genetic Programming %A Alajas, Oliver John %A Concepcion II, Ronnie %A Bandala, Argel %A Sybingco, Edwin %A Dadios, Elmer %A Mendigoria, Christan Hail %A Aquino, Heinrick %S 2022 IEEE 14th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM) %D 2022 %8 January 04 dec %C Boracay Island, Philippines %F Alajas:2022:HNICEM %X Grapes, scientifically called Vitis vinifera, are vulnerable against Phaeomoniella chlamydospora, the microorganism that causes Esca (black measles) to the leaves, trunks, cordons, and fruit of a young vineyard. Manual visual examination via the naked eye can prove to be challenging especially if done in large-scale vineyards. To address this issue, merging the use of computer vision, image processing, and machine learning was employed as a means of performing blotch identification and leaf blotch area prediction. The dataset is made up of 543 images, comprised of healthy and Esca infected leaves which were captured by an RGB camera. Images were preprocessed and segmented to isolate the diseased pixels and compute the ground truth pixel area. Desirable leaf signatures (G, B, contrast, H, R, S, a*, b*, Cb, and Cr) derived from the feature extraction process using a classification tree. The LDA12 was able to accurately distinguish the healthy from the blotch-infected leaves with a whopping 98.77percent accuracy compared to NB, KNN, and SVM. The MGSR12, with an R2 of 0.9208, topped other models such as RTree, GPR, and RLinear. The hybrid CTree-LDA12-MGSR12 algorithm proved to be ideal in performing leaf health classification and blotched area assessment of grape phenotypes which is important in plant disease identification and fungal spread prevention. %K genetic algorithms, genetic programming, Support vector machines, SVM, Image segmentation, Visualization, Image recognition, Computational modelling, Pipelines, Process control, image processing, plant disease detection, machine learning, computer vision, soft computing, black measles %R doi:10.1109/HNICEM57413.2022.10109613 %U http://dx.doi.org/doi:10.1109/HNICEM57413.2022.10109613 %0 Report %T An Indexed Bibliography of Genetic Programming %A Alander, Jarmo T. %D 1995 %N 94-1-GP %I Department of Information Technology and Industrial Management, University of Vaasa %C Finland %F Alander:1995:ibGP %X 220 references. Indexed by subject, publication type and author %K genetic algorithms, genetic programming %9 Report Series no %U ftp://ftp.uwasa.fi/cs/report94-1/gaGPbib.ps.Z %0 Book %T An Indexed Bibliography of Genetic Algorithms: Years 1957–1993 %A Alander, Jarmo T. %D 1994 %I Art of CAD ltd %C Vaasa, Finland %F Alander:1994:bib %K genetic algorithms, genetic programming %U http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.53.4481&rep=rep1&type=pdf %0 Conference Proceedings %T 2nd order equation %A Alander, Jarmo T. %A Moghadampour, Ghodrat %A Ylinen, Jari %Y Alander, Jarmo T. %S Proceedings of the Second Nordic Workshop on Genetic Algorithms and their Applications (2NWGA) %S Proceedings of the University of Vaasa, Nro. 13 %D 1996 %8 19. 23. aug %I University of Vaasa %C Vaasa (Finland) %F ga96fAlander %X In this work we have tried to use genetic programming to solve the simple second order equation %K genetic algorithms, genetic programming, mathematics, algebra %U ftp://ftp.uwasa.fi/cs/2NWGA/Ghodrat2.ps.Z %P 215-218 %0 Journal Article %T Process optimization, multi-gene genetic programming modeling and reliability assessment of bioactive extracts recovery from Phyllantus emblica %A Alanzi, Hamdan %A Alenezi, Hamoud %A Adeyi, Oladayo %A Adeyi, Abiola J. %A Olusola, Emmanuel %A Gan, Chee-Yuen %A Olalere, Olusegun Abayomi %J Journal of Engineering Research %D 2024 %@ 2307-1877 %F ALANZI:2024:jer %X This study investigates the feasibility of extracting bioactive antioxidants from Phyllantus emblica leaves using a combination of ethanol-water mixture (0-100percent) and heat-assisted extraction technology (HAE-T). Operating temperature (30-50degreeC), solid-to-liquid ratio (1:20-1:60g/mL), and extraction time (45-180min) were varied to determine their effects on extract total phenolic content (TPC), yield (EY), and antioxidant activity (AA). The Box-Behnken experimental design (BBD) within response surface methodology (RSM) was employed, with multi-objective process optimization using the desirability function algorithm to find the optimal process variables for maximizing TPC, EY, and AA simultaneously. The extraction process was modeled using BBD-RSM and multi-gene genetic programming (MGGP) algorithm, with model reliability assessed via Monte Carlo simulation. HPLC characterization identified betulinic acid, gallic acid, chlorogenic acid, caffeic acid, ellagic acid, and ferulic acid as bioactive constituents in the extract. The study found that a 50percent ethanol solution yielded the best extraction efficiency. The optimal process parameters for maximum EY (21.6565percent), TPC (67.116mg GAE/g), and AA (3.68583uM AAE/g) were determined as OT of 41.61degreeC, S:L of 1:60g/mL, and ET of 180min. Both BBD-RSM and MGGP-based models satisfactorily predicted the observed process responses, with BBD-RSM models showing slightly better performance. Reliability analysis indicated high certainty in the predictions, with BBD-RSM models achieving 99.985percent certainty for TPC, 97.569percent for EY, and 98.661percent for AA values %K genetic algorithms, genetic programming, leaf, bioactive extract, Heat-assisted technology, multi gene genetic programming, reliability assessment %9 journal article %R doi:10.1016/j.jer.2024.02.020 %U https://www.sciencedirect.com/science/article/pii/S2307187724000476 %U http://dx.doi.org/doi:10.1016/j.jer.2024.02.020 %0 Journal Article %T Machine learning based prediction models for spilt tensile strength of fiber reinforced recycled aggregate concrete %A Alarfaj, Mohammed %A Qureshi, Hisham Jahangir %A Shahab, Muhammad Zubair %A Javed, Muhammad Faisal %A Arifuzzaman, Md %A Gamil, Yaser %J Case Studies in Construction Materials %D 2024 %V 20 %@ 2214-5095 %F ALARFAJ:2024:cscm %X The demand for concrete production has led to a significant annual requirement for raw materials, resulting in a substantial amount of waste concrete. In response, recycled aggregate concrete has emerged as a promising solution. However, it faces challenges due to the vulnerability of the hardened mortar attached to natural aggregates, leading to susceptibility to cracking and reduced strength. This study focuses on predicting the split tensile strength of fiber reinforced recycled aggregate concrete using five prediction models, including two deep neural network models DNN1 and DNN2, one optimizable Gaussian process regression (OGPR), and two genetic programming based GEP1 and GEP2 models. The models exhibited high accuracy in predicting spilt tensile strength with robust R2, RMSE, and MAE values. DNN2 has the highest R2 value of 0.94 and GEP1 has the lowest R2 value of 0.76. DNN2 model R2 was 3.3percent and 13.5percent higher than OGPR and GEP2. Similarly, DNN2 and GEP2 model performed 9.3percent and 9.21percent better than DNN1 and GEP1 respectively in terms of R2. DNN2 model performed 20.32percent and 31.5percent better than OGPR and GEP2 in terms of MAE. Similarly, GEP2 and DNN2 MAE were 13.1percent and 31.5percent better than GEP1 and DNN1. Sensitivity analysis using the relevance factor and permutation feature importance revealed that the most significant positive factors are cement, natural coarse aggregates, density of recycle aggregates, and superplasticizer while recycle aggregate concrete, max size, and water content of recycle aggregates and water content have the most negative effect on STS values. The proposed ML methods, especially DNN2 and OGPR can be effectively used in practical projects, saving time and cost for eco-friendly fiber reinforced recycled aggregate concrete mixes. However, it is required to study more input variables and use hybrid models to further enhance the accuracy and reliability of the models %K genetic algorithms, genetic programming, Gene expression programming, Fiber reinforced Recycled Aggregate Concrete, Machine Learning, Sustainability, Eco-friendly Concrete, Spilt Tensile Strength, Deep neural networks, ANN, Optimizable gaussian process regression %9 journal article %R doi:10.1016/j.cscm.2023.e02836 %U https://www.sciencedirect.com/science/article/pii/S2214509523010173 %U http://dx.doi.org/doi:10.1016/j.cscm.2023.e02836 %P e02836 %0 Journal Article %T Comparative study of genetic programming-based algorithms for predicting the compressive strength of concrete at elevated temperature %A Alaskar, Abdulaziz %A Alfalah, Ghasan %A Althoey, Fadi %A Abuhussain, Mohammed Awad %A Javed, Muhammad Faisal %A Deifalla, Ahmed Farouk %A Ghamry, Nivin A. %J Case Studies in Construction Materials %D 2023 %V 18 %@ 2214-5095 %F ALASKAR:2023:cscm %X The elevated temperature severely influences the mixed properties of concrete, causing a decrease in its strength properties. Accurate proportioning of concrete components for obtaining the required compressive strength (C-S) at elevated temperatures is a complicated and time-taking process. However, using evolutionary programming techniques such as gene expression programming (GEP) and multi-expression programming (MEP) provides the accurate prediction of concrete C-S. This article presents the genetic programming-based models (such as gene expression programming (GEP) and multi-expression programming (MEP)) for forecasting the concrete compressive strength (C-S) at elevated temperatures. In this regard, 207 C-S values at elevated temperatures were obtained from previous studies. In the model’s development, C-S was considered as the output parameter with the nine most influential input parameters, including; Nano silica, cement, fly ash, water, temperature, silica fume, superplasticizer, sand, and gravels. The efficacy and accuracy of the GEP and MEP-based models were assessed by using statistical measures such as mean absolute error (MAE), correlation coefficient (R2), and root mean square error (RMSE). Moreover, models were also evaluated for external validation using different validation criteria recommended by previous studies. In comparing GEP and MEP models, GEP gave higher R2 and lower RMSE and MAE values of 0.854, 5.331 MPa, and 0.018 MPa respectively, indicating a strong correlation between actual and anticipated outputs. Thus, the GEP-based model was used further for sensitivity analysis, which revealed that cement is the most influencing factor. In addition, the proposed GEP model provides simple mathematical expression that can be easily implemented in practice %K genetic algorithms, genetic programming, gene expression programming %9 journal article %R doi:10.1016/j.cscm.2023.e02199 %U https://www.sciencedirect.com/science/article/pii/S2214509523003790 %U http://dx.doi.org/doi:10.1016/j.cscm.2023.e02199 %P e02199 %0 Journal Article %T Development of multiple explicit data-driven models for accurate prediction of CO2 minimum miscibility pressure %A Alatefi, Saad %A Agwu, Okorie Ekwe %A Azim, Reda Abdel %A Alkouh, Ahmad %A Dzulkarnain, Iskandar %J Chemical Engineering Research and Design %D 2024 %@ 0263-8762 %F ALATEFI:2024:cherd %X multiple data-driven models for predicting CO2 minimum miscibility pressure (MMP). The aim is to address the issue of existing models lacking explicit presentation. With a database of 155 data points, five models were developed using artificial neural network (ANN), multigene genetic programming (MGGP), support vector regression (SVR), multivariate adaptive regression splines (MARS), and multiple linear regression (MLR). Comparative analysis was conducted using statistical metrics (R2, MSE, MAE, RMSE), and sensitivity analysis was performed on input variables. The results showed that ANN and SVR had comparable predictive performance (ANN: R2 = 0.982, MSE = 0.00676, MAE = 0.9765, RMSE = 0.082), SVR (R2 = 0.935, MSE = 0.0041, MAE = 0.72, RMSE = 0.064) followed by MARS, MLR, and MGGP. Sensitivity analysis revealed that reservoir temperature was the most influential parameter across all models, except for the MLR algorithm where injected CO2 amount was crucial. These models can be used for a wide range of CO2 MMP ranging from 940psi to 5830psi, thus rendering them useful for any reservoir globally. These models offer improved accuracy and computational efficiency compared to existing ones, potentially reducing costs associated with laboratory experiments and providing rapid and precise CO2 MMP predictions %K genetic algorithms, genetic programming, Artificial intelligence, CO2, Explicit models, Gas flooding, Minimum miscibility pressure %9 journal article %R doi:10.1016/j.cherd.2024.04.033 %U https://www.sciencedirect.com/science/article/pii/S0263876224002351 %U http://dx.doi.org/doi:10.1016/j.cherd.2024.04.033 %0 Conference Proceedings %T Hybrid evolutionary designer of modular robots %A Alattas, R. %S 2016 Annual Connecticut Conference on Industrial Electronics, Technology Automation (CT-IETA) %D 2016 %8 oct %F Alattas:2016:CT-IETA %X The majority of robotic design approaches start with designing morphology, then designing the robot control. Even in evolutionary robotics, the morphology tends to be fixed while evolving the robot control, which considered insufficient since the robot control and morphology are interdependent. Moreover, both control and morphology are highly interdependent with the surrounding environment, which affects the used optimisation strategies. Therefore, we propose in this paper a novel hybrid GP/GA method for designing autonomous modular robots that co-evolves the robot control and morphology and also considers the surrounding environment to allow the robot of achieving behaviour specific tasks and adapting to the environmental changes. The introduced method is automatically designing feasible robots made up of various modules. Then, our new evolutionary designer is evaluated using a benchmark problem in modular robotics, which is a walking task where the robot has to move a certain distance. %K genetic algorithms, genetic programming %R doi:10.1109/CT-IETA.2016.7868256 %U http://dx.doi.org/doi:10.1109/CT-IETA.2016.7868256 %0 Conference Proceedings %T Soft Computing Based Approaches for High Performance Concrete %A Alavi, A. H. %A Heshmati, A. A. %A Salehzadeh, H. %A Gandomi, A. H. %A Askarinejad, A. %Y Papadrakakis, M. %Y Topping, B. H. V. %S Proceedings of the Sixth International Conference on Engineering Computational Technology %S Civil-Comp Proceedings %D 2008 %8 February 5 sep %V 89 %I Civil-Comp Press %C Athens %F Alavi:2008:ICECT %X High performance concrete (HPC) is a class of concrete that provides superior performance than those of conventional types. The enhanced performance characteristics of HPC are generally achieved by the addition of various cementitious materials and chemical and mineral admixtures to conventional concrete mix designs. These parameters considerably influence the compressive strength and workability properties of HPC mixes. An extensive understanding of the relation between these parameters and properties of the resulting matrix is required for developing a standard mix design procedure for HPC mix. To avoid testing several mix proportions to generate a successful mix and also simulating the behaviour of strength and workability improvement to an arbitrary degree of accuracy that often lead to savings in cost and time, it is idealistic to develop prediction models so that the performance characteristics of HPC mixes can be evaluated from the influencing parameters. Therefore, in this paper, linear genetic programming (LGP) is used for the first time in the literature to develop mathematical models to be able to predict the strength and slump flow of HPC mixes in terms of the variables responsible. Subsequently, the LGP based prediction results are compared with the results of proposed multilayer perceptron (MLP) in terms of prediction performance. Sand-cement ratio, coarse aggregate-cement ratio, water-cement ratio, percentage of silica fume and percentage of superplasticiser are used as the input variables to the models to predict the strength and slump flow of HPC mixes. A reliable database was obtained from the previously published literature in order to develop the models. The results of the present study, based on the values of performance measures for the models, demonstrated that for the prediction of compressive strength the optimum MLP model outperforms both the best team and the best single solution that have been created by LGP. It can be seen that for the slump flow the best LGP team solution has produced better results followed by the LGP best single solution and the MLP model. It can be concluded that LGPs are able to reach a prediction performance very close to or even better than the MLP model and as promising candidates can be used for solving such complex prediction problems. %K genetic algorithms, genetic programming, linear genetic programming, high performance concrete, multilayer perceptron, compressive strength, workability, mix design %R doi:10.4203/ccp.89.86 %U http://www.civil-comp.com/pubs/catalog.htm?t=contents&f=26_3 %U http://dx.doi.org/doi:10.4203/ccp.89.86 %P Paper86 %0 Conference Proceedings %T Utilisation of Computational Intelligence Techniques for Stabilised Soil %A Alavi, A. H. %A Heshmati, A. A. %A Gandomi, A. H. %A Askarinejad, A. %A Mirjalili, M. %Y Papadrakakis, M. %Y Topping, B. H. V. %S Proceedings of the Sixth International Conference on Engineering Computational Technology %S Civil-Comp Proceedings %D 2008 %8 February 5 sep %V 89 %I Civil-Comp Press %C Athens %F Alavi:2008:ICECT2 %X In the present study, two branches of computational intelligence techniques namely, the multilayer perceptron (MLP) and linear genetic programming (LGP), are employed to simulate the complex behaviour of the strength improvement in a chemical stabilisation process. Due to a need to avoid extensive and cumbersome experimental stabilisation tests on soils on every new occasion, it was decided to develop mathematical models to be able to estimate the unconfined compressive strength (UCS) as a quality of the stabilised soil after both compaction and curing by using particle size distribution, liquid limit, plasticity index, linear shrinkage as the properties of natural soil before compaction and stabilisation and the quantities and types of stabiliser. A comprehensive and reliable set of data including 219 previously published UCS test results were used to develop the prediction models. Based on the values of performance measures for the models, it was observed that all models are able to predict the UCS value to an acceptable degree of accuracy. The results demonstrated that the optimum MLP model with one hidden layer and thirty six neurons outperforms both the best single and the best team program that have been created by LGP. It can also be concluded that the best team program evolved by LGP has a better performance than the best single evolved program. This investigation revealed that, on average, LGP is able to reach a prediction performance similar to the MLP model. Moreover, LGP as a white-box model provides the programs of an imperative language or machine language that can be inspected and evaluated to provide a better understanding of the underlying relationship between the different interrelated input and output data. %K genetic algorithms, genetic programming, linear genetic programming, stabilised soil, multilayer perceptron, textural properties of soil, cement, lime, asphalt, unconfined compressive strength %R doi:10.4203/ccp.89.175 %U http://www.civil-comp.com/pubs/catalog.htm?t=contents&f=26_3 %U http://dx.doi.org/doi:10.4203/ccp.89.175 %P Paper175 %0 Journal Article %T Comment on ’Sivapragasam C, Maheswaran R, Venkatesh V. 2008. Genetic programming approach for flood routing in natural channels. Hydrological Processes 22: 623-628’ %A Alavi, A. H. %A Gandomi, A. H. %A Gandomi, M. %J Hydrological Processes %D 2010 %8 15 mar %V 24 %N 6 %I John Wiley & Sons, Ltd. %@ 1099-1085 %F Alavi:2010:HP %K genetic algorithms, genetic programming, AIMGP, Discipulus %9 journal article %R doi:10.1002/hyp.7511 %U http://onlinelibrary.wiley.com/doi/10.1002/hyp.7511/abstract %U http://dx.doi.org/doi:10.1002/hyp.7511 %P 798-799 %0 Journal Article %T Multi Expression Programming: A New Approach to Formulation of Soil Classification %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %A Sahab, Mohammad Ghasem %A Gandomi, Mostafa %J Engineering with Computers %D 2010 %8 apr %V 26 %N 2 %F Alavi:2010:EwC %X This paper presents an alternative approach to formulation of soil classification by means of a promising variant of genetic programming (GP), namely multi expression programming (MEP). Properties of soil, namely plastic limit, liquid limit, colour of soil, percentages of gravel, sand, and fine-grained particles are used as input variables to predict the classification of soils. The models are developed using a reliable database obtained from the previously published literature. The results demonstrate that the MEP-based formulae are able to predict the target values to high degree of accuracy. The MEP-based formulation results are found to be more accurate compared with numerical and analytical results obtained by other researchers. %K genetic algorithms, genetic programming, Multi expression programming, Soil classification, Formulation %9 journal article %R doi:10.1007/s00366-009-0140-7 %U http://dx.doi.org/doi:10.1007/s00366-009-0140-7 %P 111-118 %0 Journal Article %T High-Precision Modeling of Uplift Capacity of Suction Caissons Using a Hybrid Computational Method %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %A Mousavi, Mehdi %A Mollahasani, Ali %J Geomechanics and Engineering %D 2010 %8 dec %V 2 %N 4 %F Alavi:2010:GeoMechEng %X A new prediction model is derived for the uplift capacity of suction caissons using a hybrid method coupling genetic programming (GP) and simulated annealing (SA), called GP/SA. The predictor variables included in the analysis are the aspect ratio of caisson, shear strength of clayey soil, load point of application, load inclination angle, soil permeability, and loading rate. The proposed model is developed based on well established and widely dispersed experimental results gathered from the literature. To verify the applicability of the proposed model, it is employed to estimate the uplift capacity of parts of the test results that are not included in the modelling process. Traditional GP and multiple regression analyses are performed to benchmark the derived model. The external validation of the GP/SA and GP models was further verified using several statistical criteria recommended by researchers. Contributions of the parameters affecting the uplift capacity are evaluated through a sensitivity analysis. A subsequent parametric analysis is carried out and the obtained trends are confirmed with some previous studies. Based on the results, the GP/SA-based solution is effectively capable of estimating the horizontal, vertical and inclined uplift capacity of suction caissons. Furthermore, the GP/SA model provides a better prediction performance than the GP, regression and different models found in the literature. The proposed simplified formulation can reliably be employed for the pre-design of suction caissons. It may be also used as a quick check on solutions developed by more time consuming and in-depth deterministic analyses. %K genetic algorithms, genetic programming, suction caissons, uplift capacity, simulated annealing, nonlinear modelling %9 journal article %R doi:10.12989/gae.2010.2.4.253 %U http://technopress.kaist.ac.kr/?page=container&journal=gae&volume=2&num=4 %U http://dx.doi.org/doi:10.12989/gae.2010.2.4.253 %P 253-280 %0 Journal Article %T A Robust Data Mining Approach for Formulation of Geotechnical Engineering Systems %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %J International Journal of Computer Aided Methods in Engineering-Engineering Computations %D 2011 %V 28 %N 3 %@ 0264-4401 %F Alavi:2010:ijcamieec %X Purpose- The complexity of analysis of geotechnical behaviour is due to multivariable dependencies of soil and rock responses. In order to cope with this complex behavior, traditional forms of engineering design solutions are reasonably simplified. Incorporating simplifying assumptions into the development of the traditional models may lead to very large errors. In the present study, capabilities of promising variants of genetic programming (GP), namely linear genetic programming (LGP), gene expression programming (GEP) and multi expression programming (MEP) are illustrated by applying them to the formulation of several complex geotechnical engineering problems. Design/methodology/approach- LGP, GEP and MEP are new variants of GP that make a clear distinction between the genotype and the phenotype of an individual. Compared with the traditional GP, the LGP, GEP and MEP techniques are more compatible with computer architectures. This results in a significant speedup in their execution. These methods have a great ability to directly capture the knowledge contained in the experimental data without making assumptions about the underlying rules governing the system. This is one their major advantages over most of the traditional constitutive modeling methods. Findings- In order to demonstrate the simulation capabilities of LGP, GEP and MEP, they were applied to the prediction of (i) relative crest settlement of concrete-faced rockfill dams, (ii) slope stability, (iii) settlement around tunnels, and (iv) soil liquefaction. The results are compared with those obtained by other models presented in the literature and found to be more accurate. LGP has the best overall behaviour for the analysis of the considered problems in comparison with GEP and MEP. The simple and straightforward constitutive models developed using LGP, GEP and MEP provide valuable analysis tools accessible to practising engineers. Originality/value- The LGP, GEP and MEP approaches overcome the shortcomings of different methods previously presented in the literature for the analysis of geotechnical engineering systems. Contrary to artificial neural networks and many other soft computing tools, LGP, GEP and MEP provide prediction equations that can readily be used for routine design practice. The constitutive models derived using these methods can efficiently be incorporated into the finite element or finite difference analyses as material models. They may also be used as a quick check on solutions developed by more time consuming and in-depth deterministic analyses. %K genetic algorithms, genetic programming, gene expression programming, multi expression programming, Linear-based genetic programming, Data mining, Data collection, Geotechnical engineering, Programming and algorithm theory, Systems analysis, Formulation %9 journal article %R doi:10.1108/02644401111118132 %U http://www.emeraldinsight.com/journals.htm?articleid=1912293 %U http://dx.doi.org/doi:10.1108/02644401111118132 %P 242-274 %0 Conference Proceedings %T Nonlinear Modeling of Liquefaction Behavior of Sand-Silt Mixtures in terms of Strain Energy %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %Y Scinteie, Rodian %Y Plescan, Costel %S Proceedings of the 8th International Symposium on Highway and Bridge Engineering, Technology and Innovation in Transportation Infrastructure, 2010 %D 2010 %8 October %C Iasi, Romania %F Alavi:2010:HBE %K genetic algorithms, genetic programming, GPLAB, Discipulus, simulated annealing, capacity energy, Matlab %U http://www.intersections.ro/Conferences/HBE2010.pdf %P 50-69 %0 Journal Article %T Formulation of Flow Number of Asphalt Mixes Using a Hybrid Computational Method %A Alavi, Amir Hossein %A Ameri, Mahmoud %A Gandomi, Amir Hossein %A Mirzahosseini, Mohammad Reza %J Construction and Building Materials %D 2011 %8 mar %V 25 %N 3 %@ 0950-0618 %F Alavi:2010:CBM %X A high-precision model was derived to predict the flow number of dense asphalt mixtures using a novel hybrid method coupling genetic programming and simulated annealing, called GP/SA. The proposed constitutive model correlates the flow number of Marshall specimens with the percentages of filler, bitumen, voids in mineral aggregate, Marshall stability and flow. The comprehensive experimental database used for the development of the model was established upon a series of uniaxial dynamic creep tests conducted in this study. Generalised regression neural network and multiple regression-based analyses were performed to benchmark the GP/SA model. The contributions of the variables affecting the flow number were evaluated through a sensitivity analysis. A subsequent parametric study was carried out and the trends of the results were confirmed with the results of the experimental study. The results indicate that the proposed GP/SA model is effectively capable of evaluating the flow number of asphalt mixtures. The derived model is remarkably straightforward and provides an analysis tool accessible to practising engineers. %K genetic algorithms, genetic programming, Asphalt concrete mixture, Flow number, Simulated annealing, Marshall mix design, Regression analysis %9 journal article %R doi:10.1016/j.conbuildmat.2010.09.010 %U http://dx.doi.org/doi:10.1016/j.conbuildmat.2010.09.010 %P 1338-1355 %0 Journal Article %T Discussion on ’Soft computing approach for real-time estimation of missing wave heights’ by S.N. Londhe [Ocean Engineering 35 (2008) 1080-1089] %A Alavi, A. H. %A Gandomi, A. H. %A Heshmati, A. A. R. %J Ocean Engineering %D 2010 %8 sep %V 37 %N 13 %@ 0029-8018 %F Alavi20101239 %X The paper studied by Londhe (2008) \citeLondhe20081080 uses genetic programming (GP) for estimation of missing wave heights. The paper includes some problems about the fundamental aspects and use of the GP approach. In this discussion, some controversial points of the paper are given. %K genetic algorithms, genetic programming, Linear genetic programming, Tree structure, Wave forecasts %9 journal article %R doi:10.1016/j.oceaneng.2010.06.003 %U http://www.sciencedirect.com/science/article/B6V4F-50DXD90-1/2/b2489a1aebf49e771abca1b27d3b24b4 %U http://dx.doi.org/doi:10.1016/j.oceaneng.2010.06.003 %P 1239-1240 %0 Journal Article %T Genetic-based modeling of uplift capacity of suction caissons %A Alavi, Amir Hossein %A Aminian, Pejman %A Gandomi, Amir Hossein %A Arab Esmaeili, Milad %J Expert Systems with Applications %D 2011 %8 15 sep %V 38 %N 10 %@ 0957-4174 %F Alavi2011 %X In this study, classical tree-based genetic programming (TGP) and its recent variants, namely linear genetic programming (LGP) and gene expression programming (GEP) are used to develop new prediction equations for the uplift capacity of suction caissons. The uplift capacity is formulated in terms of several inflecting variables. An experimental database obtained from the literature is employed to develop the models. Further, a conventional statistical analysis is performed to benchmark the proposed models. Sensitivity and parametric analyses are conducted to verify the results. TGP, LGP and GEP are found to be effective methods for evaluating the horizontal, vertical, and inclined uplift capacity of suction caissons. The TGP, LGP and GEP models reach a prediction performance better than or comparable with the models found in the literature. %K genetic algorithms, genetic programming, Gene expression programming, Suction caissons, Uplift capacity, Formulation %9 journal article %R doi:10.1016/j.eswa.2011.04.049 %U http://www.sciencedirect.com/science/article/pii/S0957417411005653 %U http://dx.doi.org/doi:10.1016/j.eswa.2011.04.049 %P 12608-12618 %0 Journal Article %T New Ground-Motion Prediction Equations Using Multi Expression Programing %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %A Modaresnezhad, Minoo %A Mousavi, Mehdi %J Journal of Earthquake Engineering %D 2011 %V 15 %N 4 %@ 1363-2469 %F Alavi:2011:JEQE %X High-precision attenuation models were derived to estimate peak ground acceleration (PGA), velocity (PGV), and displacement (PGD) using a new variant of genetic programming, namely multi expression programming (MEP). The models were established based on an extensive database of ground-motion recordings released by Pacific Earthquake Engineering Research Center (PEER). For more validity verification, the models were employed to predict the ground-motion parameters of the Iranian plateau earthquakes. The results indicate that the MEP attenuation models are capable of effectively estimating the peak ground-motion parameters. The proposed models are able to reach a prediction performance comparable with the attenuation relationships found in the literature. %K genetic algorithms, genetic programming, Multi-Expression Programming, Time-Domain Ground-Motion Parameters, Attenuation Relationship, Nonlinear Modelling %9 journal article %R doi:10.1080/13632469.2010.526752 %U http://www.tandfonline.com/doi/abs/10.1080/13632469.2010.526752#.UlMR6NKc_G0 %U http://dx.doi.org/doi:10.1080/13632469.2010.526752 %P 511-536 %0 Journal Article %T Energy-based numerical models for assessment of soil liquefaction %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %J Geoscience Frontiers %D 2012 %V 3 %N 4 %@ 1674-9871 %F Alavi2012541 %X This study presents promising variants of genetic programming (GP), namely linear genetic programming (LGP) and multi expression programming (MEP) to evaluate the liquefaction resistance of sandy soils. Generalised LGP and MEP-based relationships were developed between the strain energy density required to trigger liquefaction (capacity energy) and the factors affecting the liquefaction characteristics of sands. The correlations were established based on well established and widely dispersed experimental results obtained from the literature. To verify the applicability of the derived models, they were employed to estimate the capacity energy values of parts of the test results that were not included in the analysis. The external validation of the models was verified using statistical criteria recommended by researchers. Sensitivity and parametric analyses were performed for further verification of the correlations. The results indicate that the proposed correlations are effectively capable of capturing the liquefaction resistance of a number of sandy soils. The developed correlations provide a significantly better prediction performance than the models found in the literature. Furthermore, the best LGP and MEP models perform superior than the optimal traditional GP model. The verification phases confirm the efficiency of the derived correlations for their general application to the assessment of the strain energy at the onset of liquefaction. %K genetic algorithms, genetic programming, Soil liquefaction, Capacity energy, Multi expression programming, Sand, Formulation %9 journal article %R doi:10.1016/j.gsf.2011.12.008 %U http://www.sciencedirect.com/science/article/pii/S167498711100137X %U http://dx.doi.org/doi:10.1016/j.gsf.2011.12.008 %P 541-555 %0 Book Section %T A Genetic Programming-Based Approach for the Performance Characteristics Assessment of Stabilized Soil %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %A Mollahasani, Ali %E Chiong, Raymond %E Weise, Thomas %E Michalewicz, Zbigniew %B Variants of Evolutionary Algorithms for Real-World Applications %D 2012 %I Springer %F books/sp/chiong2012/AlaviGM12 %X This chapter presents a variant of genetic programming, namely linear genetic programming (LGP), and a hybrid search algorithm coupling LGP and simulated annealing (SA), called LGP/SA, to predict the performance characteristics of stabilised soil. LGP and LGP/SA relate the unconfined compressive strength (UCS), maximum dry density (MDD), and optimum moisture content (OMC) metrics of stabilised soil to the properties of the natural soil as well as the types and quantities of stabilizing additives. Different sets of LGP and LGP/SA-based prediction models have been separately developed. The contributions of the parameters affecting UCS, MDD, and OMC are evaluated through a sensitivity analysis. A subsequent parametric analysis is carried out and the trends of the results are compared with previous studies. A comprehensive set of data obtained from the literature has been used for developing the models. Experimental results confirm that the accuracy of the proposed models is satisfactory. In particular, the LGP-based models are found to be more accurate than the LGP/SA-based models. %K genetic algorithms, genetic programming, Chemical stabilisation, Simulated annealing, Nonlinear modelling %R doi:10.1007/978-3-642-23424-8_11 %U http://dx.doi.org/doi:10.1007/978-3-642-23424-8_11 %P 343-376 %0 Book Section %T Linear and Tree-Based Genetic Programming for Solving Geotechnical Engineering Problems %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %A Mollahasani, Ali %A Bolouri Bazaz, Jafar %E Yang, Xin-She %E Gandomi, Amir Hossein %E Talatahari, Siamak %E Alavi, Amir Hossein %B Metaheuristics in Water, Geotechnical and Transport Engineering %D 2013 %I Elsevier %C Oxford %F Alavi:2013:MWGTE %X This chapter presents new approaches for solving geotechnical engineering problems using classical tree-based genetic programming (TGP) and linear genetic programming (LGP). TGP and LGP are symbolic optimisation techniques that create computer programs to solve a problem using the principle of Darwinian natural selection. Generally, they are supervised, machine-learning techniques that search a program space instead of a data space. Despite remarkable prediction capabilities of the TGP and LGP approaches, the contents of reported applications indicate that the progress in their development is marginal and not moving forward. The present study introduces a state-of-the-art examination of TGP and LGP applications in solving complex geotechnical engineering problems that are beyond the computational capability of traditional methods. In order to justify the capabilities of these techniques, they are systematically employed to formulate a typical geotechnical engineering problem. For this aim, effective angle of shearing resistance (phi) of soils is formulated in terms of the physical properties of soil. The validation of the TGP and LGP models is verified using several statistical criteria. The numerical example shows the superb accuracy, efficiency, and great potential of TGP and LGP. The models obtained using TGP and LGP can be used efficiently as quick checks on solutions developed by more time consuming and in-depth deterministic analyses. The current research directions and issues that need further attention in the future are discussed. Keywords Tree-based genetic programming, linear genetic programming geotechnical engineering, prediction %K genetic algorithms, genetic programming, Tree-based genetic programming, linear genetic programming, geotechnical engineering, prediction %R doi:10.1016/B978-0-12-398296-4.00012-X %U http://www.sciencedirect.com/science/article/pii/B978012398296400012X %U http://dx.doi.org/doi:10.1016/B978-0-12-398296-4.00012-X %P 289-310 %0 Journal Article %T Design equations for prediction of pressuremeter soil deformation moduli utilizing expression programming systems %A Alavi, Amir Hossein %A Gandomi, Amir Hossein %A Chahkandi Nejad, Hadi %A Mollahasani, Ali %A Rashed, Azadeh %J Neural Computing and Applications %D 2013 %8 nov %V 23 %N 6 %I Springer-Verlag %@ 0941-0643 %G English %F Alavi:2014:NCA %X Providing precise estimations of soil deformation modulus is very difficult due to its dependence on many factors. In this study, gene expression programming (GEP) and multi-expression programming (MEP) systems are presented to derive empirical equations for the prediction of the pressuremeter soil deformation modulus. The employed expression programming (EP) systems formulate the soil deformation modulus in terms of the soil physical properties. Selection of the best models is on the basis of developing and controlling several models with different combinations of the affecting parameters. The proposed EP-based models are established upon 114 pressure meter tests on different soil types conducted in this study. The generalisation capabilities of the models are verified using several statistical criteria. Contributions of the variables influencing the soil modulus are evaluated through a sensitivity analysis. The GEP and MEP approaches accurately characterise the soil deformation modulus resulting in a very good prediction performance. The result indicates that moisture content and soil dry unit weight can efficiently represent the initial state and consolidation history of soil for determining its modulus. %K genetic algorithms, genetic programming, gene expression programming, Soil deformation modulus, Expression programming techniques, Pressure meter test, Soil physical properties %9 journal article %R doi:10.1007/s00521-012-1144-6 %U http://link.springer.com/article/10.1007%2Fs00521-012-1144-6 %U http://dx.doi.org/doi:10.1007/s00521-012-1144-6 %P 1771-1786 %0 Journal Article %T New design equations for estimation of ultimate bearing capacity of shallow foundations resting on rock masses %A Alavi, Amir H. %A Sadrossadat, Ehsan %J Geoscience Frontiers %D 2014 %@ 1674-9871 %F Alavi:2014:GF %X Rock masses are commonly used as the underlying layer of important structures such as bridges, dams and transportation constructions. The success of a foundation design for such structures mainly depends on the accuracy of estimating the bearing capacity of rock beneath them. Several traditional numerical approaches are proposed for the estimation of the bearing capacity of foundations resting on rock masses to avoid performing elaborate and expensive experimental studies. Despite this fact, there still exists a serious need to develop more robust predictive models. This paper proposes new nonlinear prediction models for the ultimate bearing capacity of shallow foundations resting on non-fractured rock masses using a novel evolutionary computational approach, called linear genetic programming. A comprehensive set of rock socket, centrifuge rock socket, plate load and large-scaled footing load test results is used to develop the models. In order to verify the validity of the models, the sensitivity analysis is conducted and discussed. The results indicate that the proposed models accurately characterise the bearing capacity of shallow foundations. The correlation coefficients between the experimental and predicted bearing capacity values are equal to 0.95 and 0.96 for the best LGP models. Moreover, the derived models reach a notably better prediction performance than the traditional equations. %K genetic algorithms, genetic programming, Rock mass properties, Ultimate bearing capacity, Shallow foundation, Prediction, Evolutionary computation %9 journal article %R doi:10.1016/j.gsf.2014.12.005 %U http://www.sciencedirect.com/science/article/pii/S1674987114001625 %U http://dx.doi.org/doi:10.1016/j.gsf.2014.12.005 %0 Journal Article %T Progress of machine learning in geosciences: Preface %A Alavi, Amir H. %A Gandomi, Amir H. %A Lary, David J. %J Geoscience Frontiers %D 2016 %V 7 %N 1 %@ 1674-9871 %F Alavi:2016:GSF %O Editorial %K genetic algorithms, genetic programming %9 journal article %R doi:10.1016/j.gsf.2015.10.006 %U http://www.sciencedirect.com/science/article/pii/S1674987115001243 %U http://dx.doi.org/doi:10.1016/j.gsf.2015.10.006 %P 1-2 %0 Journal Article %T A new approach for modeling of flow number of asphalt mixtures %A Alavi, Amir H. %A Hasni, Hassene %A Zaabar, Imen %A Lajnef, Nizar %J Archives of Civil and Mechanical Engineering %D 2017 %V 17 %N 2 %@ 1644-9665 %F Alavi:2017:ACME %X Flow number of asphalt-aggregate mixtures is an explanatory parameter for the analysis of rutting potential of asphalt mixtures. In this study, a new model is proposed for the determination of flow number using a robust computational intelligence technique, called multi-gene genetic programming (MGGP). MGGP integrates genetic programming and classical regression to formulate the flow number of Marshall Specimens. A reliable experimental database is used to develop the proposed model. Different analyses are performed for the performance evaluation of the model. On the basis of a comparison study, the MGGP model performs superior to the models found in the literature. %K genetic algorithms, genetic programming, Asphalt mixture, Flow number, Marshall mix design %9 journal article %R doi:10.1016/j.acme.2016.06.004 %U http://www.sciencedirect.com/science/article/pii/S1644966516300814 %U http://dx.doi.org/doi:10.1016/j.acme.2016.06.004 %P 326-335 %0 Conference Proceedings %T Type-Constrained Genetic Programming for Rule-Base Definition in Fuzzy Logic Controllers %A Alba, Enrique %A Cotta, Carlos %A Troya, Jose M. %Y Koza, John R. %Y Goldberg, David E. %Y Fogel, David B. %Y Riolo, Rick L. %S Genetic Programming 1996: Proceedings of the First Annual Conference %D 1996 %8 28–31 jul %I MIT Press %C Stanford University, CA, USA %F alba:1996:tGPrdflc %K genetic algorithms, genetic programming %U http://cognet.mit.edu/sites/default/files/books/9780262315876/pdfs/9780262315876_chap31.pdf %P 255-260 %0 Conference Proceedings %T Entropic and Real-Time Analysis of the Search with Panmictic, Structured, and Parallel Distributed Genetic Algorithms %A Alba, Enrique %A Cotta, Carlos %A Troya, Jose M. %Y Banzhaf, Wolfgang %Y Daida, Jason %Y Eiben, Agoston E. %Y Garzon, Max H. %Y Honavar, Vasant %Y Jakiela, Mark %Y Smith, Robert E. %S Proceedings of the Genetic and Evolutionary Computation Conference %D 1999 %8 13 17 jul %V 1 %I Morgan Kaufmann %C Orlando, Florida, USA %@ 1-55860-611-4 %F alba:1999:ERASPSPDGA %K genetic algorithms and classifier systems, poster papers %U http://gpbib.cs.ucl.ac.uk/gecco1999/Ga-808.pdf %P 773 %0 Conference Proceedings %T Tackling epistasis with panmictic and structured genetic algorithms %A Alba, Enrique %A Troya, Jose M. %Y Brave, Scott %Y Wu, Annie S. %S Late Breaking Papers at the 1999 Genetic and Evolutionary Computation Conference %D 1999 %8 13 jul %C Orlando, Florida, USA %F alba:1999:T %K Genetic Algorithms, NK %P 1-7 %0 Journal Article %T Evolutionary Design of Fuzzy Logic Controllers Using Strongly-Typed GP %A Alba, Enrique %A Cotta, Carlos %A Troya, Jose M. %J Mathware & Soft Computing %D 1999 %V 6 %N 1 %F alba:1999:edflcSGP %X An evolutionary approach to the design of fuzzy logic controllers is presented in this paper. We propose the use of the genetic programming paradigm to evolve fuzzy rule-bases (internally represented as type-constrained syntactic trees). This model has been applied to the cart-centering problem, although it can be readily extended to other problems. The obtained results show that a good parameterization of the algorithm, and an appropriate evaluation function, can lead to near-optimal solutions. %K genetic algorithms, genetic programming, Type System, Fuzzy Logic Controller, Cart-Centering Problem %9 journal article %U http://docto-si.ugr.es/Mathware/v6n1/PS/7-alba.ps.gz %P 109-124 %0 Book %T Parallel Metaheuristics: A New Class of Algorithms %A Alba, Enrique %D 2005 %8 aug %I John Wiley & Sons %C NJ, USA %@ 0-471-67806-6 %F Alba05 %X This single reference on parallel metaheuristic presents modern and ongoing research information on using, designing, and analysing efficient models of parallel algorithms. Table of Contents Author Information Introduction. PART I: INTRODUCTION TO METAHEURISTICS AND PARALLELISM. 1. An Introduction to Metaheuristic Techniques. 2. Measuring the Performance of Parallel Metaheuristics. 3. New Technologies in Parallelism. 4. Metaheuristics and Parallelism. PART II: PARALLEL METAHEURISTIC MODELS. 5. Parallel Genetic Algorithms. 6. Spatially Structured Genetic Programming. 7. Parallel Evolution Strategies. 8. Parallel Ant Colony Algorithms. 9. Parallel Estimation of Distribution Algorithms. 10. Parallel Scatter Search. 11. Parallel Variable Neighbourhood Search. 12. Parallel Simulated Annealing. 13. Parallel Tabu Search. 14. Parallel GRASP. 15. Parallel Hybrid Metaheuristics. 16. Parallel Multi Objective. 17. Parallel Heterogeneous Metaheuristics. PART III: THEORY AND APPLICATIONS. 18. Theory of Parallel Genetic Algorithms. 19. Parallel Metaheuristics. 20. Parallel Metaheuristics in Telecommunications. 21. Bioinformatics and Parallel Metaheuristics. Index. %K genetic algorithms, genetic programming, book, text, general computer engineering %U https://www.amazon.com/Parallel-Metaheuristics-New-Class-Algorithms/dp/0471678066/ref=sr_1_1 %0 Conference Proceedings %T Optimizing Diabetes Predictive Modeling with Automated Decision Trees %A Albalushi, Muna %A Al Jassim, Rasha %A Jetly, Karan %A Al Khayari, Raya %A Al Maqbali, Hilal %S 2023 IEEE Smart World Congress (SWC) %D 2023 %8 aug %F Albalushi:2023:SWC %X This paper introduces Linear Genetic Programming for Optimising Decision Tree (LGP-OptTree), a novel form of Genetic Programming (GP) aimed at enhancing diabetes detection. LGP-OptTree is designed to optimise the attributes and hyperparameters of decision trees by using a unique genotype and phenotype structure. The proposed method is evaluated on the Pima dataset and compared with other techniques. By fine-tuning the attributes and hyperparameters of decision trees using LGP-OptTree, this study aims to improve the accuracy and efficacy of diabetes detection. A performance metric is used to determine the effectiveness of the proposed method with respect to other approaches. The contribution of this research lies in providing general healthcare professionals with a new approach for enhancing diabetes detection accuracy through decision trees. %K genetic algorithms, genetic programming, Measurement, Medical services, Predictive models, Prediction algorithms, Diabetes, Decision trees, Evolutionary Algorithm %R doi:10.1109/SWC57546.2023.10449077 %U http://dx.doi.org/doi:10.1109/SWC57546.2023.10449077 %0 Conference Proceedings %T Learning to Combine Spectral Indices with Genetic Programming %A Hernandez Albarracin, Juan Felipe %A dos Santos, Jefersson Alex %A da S. Torres, Ricardo %S 2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) %D 2016 %8 oct %F Albarracin:2016:SIBGRAPI %X This paper introduces a Genetic Programming-based method for band selection and combination, aiming to support remote sensing image classification tasks. Relying on ground-truth data, our method selects spectral bands and finds the arithmetic combination of those bands (i.e., spectral index) that best separates examples of different classes. Experimental results demonstrate that the proposed method is very effective in pixel-wise binary classification problems. %K genetic algorithms, genetic programming %R doi:10.1109/SIBGRAPI.2016.063 %U http://dx.doi.org/doi:10.1109/SIBGRAPI.2016.063 %P 408-415 %0 Journal Article %T A Soft Computing Approach for Selecting and Combining Spectral Bands %A Albarracin, Juan F. H. %A Oliveira, Rafael S. %A Hirota, Marina %A dos Santos, Jefersson A. %A da S. Torres, Ricardo %J Remote Sensing %D 2020 %V 12 %N 14 %@ 2072-4292 %F albarracin:2020:RS %X We introduce a soft computing approach for automatically selecting and combining indices from remote sensing multispectral images that can be used for classification tasks. The proposed approach is based on a Genetic-Programming (GP) framework, a technique successfully used in a wide variety of optimisation problems. Through GP, it is possible to learn indices that maximize the separability of samples from two different classes. Once the indices specialized for all the pairs of classes are obtained, they are used in pixelwise classification tasks. We used the GP-based solution to evaluate complex classification problems, such as those that are related to the discrimination of vegetation types within and between tropical biomes. Using time series defined in terms of the learnt spectral indices, we show that the GP framework leads to superior results than other indices that are used to discriminate and classify tropical biomes. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/rs12142267 %U https://www.mdpi.com/2072-4292/12/14/2267 %U http://dx.doi.org/doi:10.3390/rs12142267 %0 Conference Proceedings %T A Study of Semantic Geometric Crossover Operators in Regression Problems %A Albinati, Julio %A Pappa, Gisele L. %A Otero, Fernando E. B. %A Oliveira, Luiz Otavio V. B. %Y Johnson, Colin %Y Krawiec, Krzysztof %Y Moraglio, Alberto %Y O’Neill, Michael %S Semantic Methods in Genetic Programming %D 2014 %8 13 sep %C Ljubljana, Slovenia %F Albinati:2014:SMGP %O Workshop at Parallel Problem Solving from Nature 2014 conference %K genetic algorithms, genetic programming %U http://www.cs.put.poznan.pl/kkrawiec/smgp2014/uploads/Site/Albinati.pdf %0 Conference Proceedings %T The Effect of Distinct Geometric Semantic Crossover Operators in Regression Problems %A Albinati, Julio %A Pappa, Gisele L. %A Otero, Fernando E. B. %A Oliveira, Luiz Otavio V. B. %Y Machado, Penousal %Y Heywood, Malcolm I. %Y McDermott, James %Y Castelli, Mauro %Y Garcia-Sanchez, Pablo %Y Burelli, Paolo %Y Risi, Sebastian %Y Sim, Kevin %S 18th European Conference on Genetic Programming %S LNCS %D 2015 %8 August 10 apr %V 9025 %I Springer %C Copenhagen %F Albinati:2015:EuroGP %X This paper investigates the impact of geometric semantic crossover operators in a wide range of symbolic regression problems. First, it analyses the impact of using Manhattan and Euclidean distance geometric semantic crossovers in the learning process. Then, it proposes two strategies to numerically optimise the crossover mask based on mathematical properties of these operators, instead of simply generating them randomly. An experimental analysis comparing geometric semantic crossovers using Euclidean and Manhattan distances and the proposed strategies is performed in a test bed of twenty datasets. The results show that the use of different distance functions in the semantic geometric crossover has little impact on the test error, and that our optimized crossover masks yield slightly better results. For SGP practitioners, we suggest the use of the semantic crossover based on the Euclidean distance, as it achieved similar results to those obtained by more complex operators. %K genetic algorithms, genetic programming, Crossover, Crossover mask optimisation %R doi:10.1007/978-3-319-16501-1 %U http://dx.doi.org/doi:10.1007/978-3-319-16501-1 %P 3-15 %0 Journal Article %T Multi-Dimensional Regression Models for Predicting the Wall Thickness Distribution of Corrugated Pipes %A Albrecht, Hanny %A Roland, Wolfgang %A Fiebig, Christian %A Berger-Weber, Gerald Roman %J Polymers %D 2022 %V 14 %N 17 %@ 2073-4360 %F albrecht:2022:Polymers %X Corrugated pipes offer both higher stiffness and higher flexibility while simultaneously requiring less material than rigid pipes. Production rates of corrugated pipes have therefore increased significantly in recent years. Due to rising commodity prices, pipe manufacturers have been driven to produce corrugated pipes of high quality with reduced material input. To the best of our knowledge, corrugated pipe geometry and wall thickness distribution significantly influence pipe properties. Essential factors in optimising wall thickness distribution include adaptation of the mold block geometry and structure optimisation. To achieve these goals, a conventional approach would typically require numerous iterations over various pipe geometries, several mold block geometries, and then fabrication of pipes to be tested experimentally—an approach which is very time-consuming and costly. To address this issue, we developed multi-dimensional mathematical models that predict the wall thickness distribution in corrugated pipes as functions of the mold geometry by using symbolic regression based on genetic programming (GP). First, the blow molding problem was transformed into a dimensionless representation. Then, a screening study was performed to identify the most significant influencing parameters, which were subsequently varied within wide ranges as a basis for a comprehensive, numerically driven parametric design study. The data set obtained was used as input for data-driven modelling to derive novel regression models for predicting wall thickness distribution. Finally, model accuracy was confirmed by means of an error analysis that evaluated various statistical metrics. With our models, wall thickness distribution can now be predicted and subsequently used for structural analysis, thus enabling digital mold block design and optimising the wall thickness distribution. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/polym14173455 %U https://www.mdpi.com/2073-4360/14/17/3455 %U http://dx.doi.org/doi:10.3390/polym14173455 %P ArticleNo.3455 %0 Book Section %T Evolutionary Computation and Parallel Processing Applied to the Design of Multilayer Perceptrons %A Albuquerque, Ana Claudia M. L. %A Melo, Jorge D. %A Doria Neto, Adriao D. %E Nedjah, Nadia %E de Macedo Mourelle, Luiza %B Evolvable Machines: Theory & Practice %S Studies in Fuzziness and Soft Computing %D 2004 %V 161 %I Springer %C Berlin %@ 3-540-22905-1 %F Albuquerque:2004:EMTP %K genetic algorithms %U http://www.springeronline.com/sgw/cda/frontpage/0,11855,5-175-22-33980449-0,00.html %P 181-203 %0 Conference Proceedings %T On the Impact of the Representation on Fitness Landscapes %A Albuquerque, Paul %A Chopard, Bastien %A Mazza, Christian %A Tomassini, Marco %Y Poli, Riccardo %Y Banzhaf, Wolfgang %Y Langdon, William B. %Y Miller, Julian F. %Y Nordin, Peter %Y Fogarty, Terence C. %S Genetic Programming, Proceedings of EuroGP’2000 %S LNCS %D 2000 %8 15 16 apr %V 1802 %I Springer-Verlag %C Edinburgh %@ 3-540-67339-3 %F albuquerque:2000:irfl %X In this paper we study the role of program representation on the properties of a type of Genetic Programming (GP) algorithm. In a specific case, which we believe to be generic of standard GP, we show that the way individuals are coded is an essential concept which impacts the fitness landscape. We give evidence that the ruggedness of the landscape affects the behavior of the algorithm and we find that, below a critical population, whose size is representation-dependent, premature convergence occurs. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-540-46239-2_1 %U http://dx.doi.org/doi:10.1007/978-3-540-46239-2_1 %P 1-15 %0 Journal Article %T Predictive Models of Double-Vibropolishing in Bowl System Using Artificial Intelligence Methods %A Alcaraz, Joselito Yam II %A Ahluwalia, Kunal %A Yeo, Swee-Hock %J Journal of Manufacturing and Materials Processing %D 2019 %V 3 %N 1 %@ 2504-4494 %F alcaraz:2019:JMMP %X Vibratory finishing is a versatile and efficient surface finishing process widely used to finish components of various functionalities. Research efforts were focused in fundamental understanding of the process through analytical solutions and simulations. On the other hand, predictive modelling of surface roughness using computational intelligence (CI) methods are emerging in recent years, though CI methods have not been extensively applied yet to a new vibratory finishing method called double-vibropolishing. In this study, multi-variable regression, artificial neural networks, and genetic programming models were designed and trained with experimental data obtained from subjecting rectangular Ti-6Al-4V test coupons to double vibropolishing in a bowl system configuration. Model selection was done by comparing the mean-absolute percentage error and r-squared values from both training and testing datasets. Exponential regression was determined as the best model for the bowl double-vibropolishing system studied with a Test MAPE score of 6.1percent and a R-squared score of 0.99. A family of curves was generated using the exponential regression model as a potential tool in predicting surface roughness with time. %K genetic algorithms, genetic programming, vibratory finishing, double vibro-polishing, artificial intelligence, regression, neural network, ANN %9 journal article %R doi:10.3390/jmmp3010027 %U https://www.mdpi.com/2504-4494/3/1/27 %U http://dx.doi.org/doi:10.3390/jmmp3010027 %0 Journal Article %T Thiophene Stability in Photodynamic Therapy: A Mathematical Model Approach %A Alcazar, Jackson J. %J International Journal of Molecular Sciences %D 2024 %8 21 feb %V 25 %N 5 %@ 1422-0067 %F alcazar:2024:IJMS %O Special Issue Molecular Aspects of Photodynamic Therapy %X Thiophene-containing photosensitizers are gaining recognition for their role in photodynamic therapy (PDT). However, the inherent reactivity of the thiophene moiety toward singlet oxygen threatens the stability and efficiency of these photosensitizers. This study presents a novel mathematical model capable of predicting the reactivity of thiophene toward singlet oxygen in PDT, using Conceptual Density Functional Theory (CDFT) and genetic programming. The research combines advanced computational methods, including various DFT techniques and symbolic regression, and is validated with experimental data. The findings underscore the capacity of the model to classify photosensitizers based on their photodynamic efficiency and safety, particularly noting that photosensitizers with a constant rate 1000 times lower than that of unmodified thiophene retain their photodynamic performance without substantial singlet oxygen quenching. Additionally, the research offers insights into the impact of electronic effects on thiophene reactivity. Finally, this study significantly advances thiophene-based photosensitizer design, paving the way for therapeutic agents that achieve a desirable balance between efficiency and safety in PDT. %K genetic algorithms, genetic programming, safe PDT, efficient PDT, thiophene-containing photosensitiser, singlet oxygen, conceptual DFT %9 journal article %R doi:10.3390/ijms25052528 %U https://www.mdpi.com/1422-0067/25/5/2528 %U http://dx.doi.org/doi:10.3390/ijms25052528 %P ArticleNo.2528 %0 Conference Proceedings %T Evolving Monotone Conjunctions in Regimes Beyond Proved Convergence %A Alchirch, Pantia-Marina %A Diochnos, Dimitrios I. %A Papakonstantinopoulou, Katia %Y Medvet, Eric %Y Pappa, Gisele %Y Xue, Bing %S EuroGP 2022: Proceedings of the 25th European Conference on Genetic Programming %S LNCS %D 2022 %8 20 22 apr %V 13223 %I Springer Verlag %C Madrid, Spain %F Alchirch:2022:EuroGP %X Recently it was shown, using the typical mutation mechanism that is used in evolutionary algorithms, that monotone conjunctions are provably evolvable under a specific set of Bernoulli (p)n distributions. A natural question is whether this mutation mechanism allows convergence under other distributions as well. Our experiments indicate that the answer to this question is affirmative and, at the very least, this mechanism converges under Bernoulli (p)n distributions outside of the known proved regime. %K genetic algorithms, genetic programming: Poster, Evolvability, Monotone conjunctions, Distribution-specific learning, Bernoulli (p)**n distributions %R doi:10.1007/978-3-031-02056-8_15 %U http://dx.doi.org/doi:10.1007/978-3-031-02056-8_15 %P 228-244 %0 Conference Proceedings %T Lightweight Symbolic Regression with the Interaction-Transformation Representation %A Aldeia, Guilherme %A de Franca, Fabricio %Y Vellasco, Marley %S 2018 IEEE Congress on Evolutionary Computation (CEC) %D 2018 %8 August 13 jul %I IEEE %C Rio de Janeiro, Brazil %F Aldeia:2018:CEC %X Symbolic Regression techniques stand out from other regression analysis tools because of the possibility of generating powerful but yet simple expressions. These simple expressions may be useful in many practical situations in which the practitioner wants to interpret the obtained results, fine tune the model, or understand the generating phenomena. Despite this possibility, the current state-of-the-art algorithms for Symbolic Regression usually require a high computational budget while having little guarantees regarding the simplicity of the returned expressions. Recently, a new Data Structure representation for mathematical expressions, called Interaction-Transformation (IT), was introduced together with a search-based algorithm named SymTree that surpassed a subset of the recent Symbolic Regression algorithms and even some state-of-the-art nonlinear regression algorithms, while returning simple expressions as a result. This paper introduces a lightweight tool based on this algorithm, named Lab Assistant. This tool runs on the client-side of any compatible Internet browser with JavaScript. Alongside this tool, two algorithms using the IT representation are introduced. Some experiments are performed in order to show the potential of the Lab Assistant to help practitioners, professors, researchers and students willing to experiment with Symbolic Regression. The results showed that this tool is competent to find the correct expression for many well known Physics and Engineering relations within a reasonable average time frame of a few seconds. This tool opens up lots of possibilities in Symbolic Regression research for low-cost devices to be used in applications where a high-end computer is not available. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2018.8477951 %U http://dx.doi.org/doi:10.1109/CEC.2018.8477951 %0 Conference Proceedings %T A Parametric Study of Interaction-Transformation Evolutionary Algorithm for Symbolic Regression %A Aldeia, Guilherme %A de Franca, Fabricio %Y Jin, Yaochu %S 2020 IEEE Congress on Evolutionary Computation, CEC 2020 %D 2020 %8 19 24 jul %I IEEE Press %C internet %F Aldeia:2020:CEC %X The balance between approximation error and model complexity is an important trade-off for Symbolic Regression algorithms. This trade-off is achieved by means of specific operators for bloat control, modified operators, limits to the size of the generated expressions and multi-objective optimization. Recently, the representation Interaction-Transformation was introduced with the goal of limiting the search space to simpler expressions, thus avoiding bloating. This representation was used in the context of an Evolutionary Algorithm in order to find concise expressions resulting in small approximation errors competitive with the literature. Particular to this algorithm, two parameters control the complexity of the generated expression. This paper investigates the influence of those parameters w.r.t. the goodness-of-fit. Through some extensive experiments, we find that the maximum number of terms is more important to control goodness-of-fit but also that there is a limit to the extent that increasing its value renders any benefits. Second, the limit to the minimum and maximum value of the exponent has a smaller influence to the results and it can be set to a default value without impacting the final results. %K genetic algorithms, genetic programming %R doi:10.1109/CEC48606.2020.9185521 %U http://dx.doi.org/doi:10.1109/CEC48606.2020.9185521 %P paperid24027 %0 Conference Proceedings %T Measuring Feature Importance of Symbolic Regression Models Using Partial Effects %A Aldeia, Guilherme Seidyo Imai %A Olivetti de Franca, Fabricio %Y Chicano, Francisco %Y Tonda, Alberto %Y Krawiec, Krzysztof %Y Helbig, Marde %Y Cleghorn, Christopher W. %Y Wilson, Dennis G. %Y Yannakakis, Georgios %Y Paquete, Luis %Y Ochoa, Gabriela %Y Bacardit, Jaume %Y Gagne, Christian %Y Mostaghim, Sanaz %Y Jourdan, Laetitia %Y Schuetze, Oliver %Y Posik, Petr %Y Segura, Carlos %Y Tinos, Renato %Y Cotta, Carlos %Y Heywood, Malcolm %Y Zhang, Mengjie %Y Trujillo, Leonardo %Y Miikkulainen, Risto %Y Xue, Bing %Y Neumann, Aneta %Y Allmendinger, Richard %Y Ishikawa, Fuyuki %Y Medina-Bulo, Inmaculada %Y Neumann, Frank %Y Sutton, Andrew M. %S Proceedings of the 2021 Genetic and Evolutionary Computation Conference %S GECCO ’21 %D 2021 %8 jul 10 14 %I Association for Computing Machinery %C internet %F Aldeia:2021:GECCO %X In explainable AI, one aspect of a prediction’s explanation is to measure each predictor’s importance to the decision process.The importance can measure how much variation a predictor promotes locally or how much the predictor contributes to the deviation from a reference point (Shapley value). If we have the ground truth analytical model, we can calculate the former using the Partial Effect, calculated as the predictor’s partial derivative. Also, we can estimate the latter by calculating the average partial effect multiplied by the difference between the predictor and the reference value. Symbolic Regression is a gray-box model for regression problems that returns an analytical model approximating the input data. Although it is often associated with interpretability, few works explore this property. We will investigate the use of Partial Effect with the analytical models generated by the Interaction-Transformation Evolutionary Algorithm symbolic regressor (ITEA). We show that the regression models returned by ITEA coupled with Partial Effect provide the closest explanations to the ground truth and a close approximation to Shapley values. These results openup new opportunities to explain symbolic regression modelscompared to the approximations provided by model agnostic approaches. %K genetic algorithms, genetic programming, XAI, explainable AI, symbolic regression, interaction-transformation, Supervised learning, SHAP, Shapley value %R doi:10.1145/3449639.3459302 %U http://dx.doi.org/doi:10.1145/3449639.3459302 %P 750-758 %0 Conference Proceedings %T Interaction-Transformation Evolutionary Algorithm with coefficients optimization %A Aldeia, Guilherme Seidyo Imai %A Olivetti de Franca, Fabricio %Y Trautmann, Heike %Y Doerr, Carola %Y Moraglio, Alberto %Y Bartz-Beielstein, Thomas %Y Filipic, Bogdan %Y Gallagher, Marcus %Y Ong, Yew-Soon %Y Gupta, Abhishek %Y Kononova, Anna V. %Y Wang, Hao %Y Emmerich, Michael %Y Bosman, Peter A. N. %Y Zaharie, Daniela %Y Caraffini, Fabio %Y Dreo, Johann %Y Auger, Anne %Y Dietric, Konstantin %Y Dufosse, Paul %Y Glasmachers, Tobias %Y Hansen, Nikolaus %Y Mersmann, Olaf %Y Posik, Petr %Y Tusar, Tea %Y Brockhoff, Dimo %Y Eftimov, Tome %Y Kerschke, Pascal %Y Naujoks, Boris %Y Preuss, Mike %Y Volz, Vanessa %Y Derbel, Bilel %Y Li, Ke %Y Li, Xiaodong %Y Zapotecas, Saul %Y Zhang, Qingfu %Y Coletti, Mark %Y Schuman, Catherine (Katie) %Y Scott, Eric “Siggy” %Y Patton, Robert %Y Wiegand, Paul %Y Bassett, Jeffrey K. %Y Gunaratne, Chathika %Y Chugh, Tinkle %Y Allmendinger, Richard %Y Hakanen, Jussi %Y Tauritz, Daniel %Y Woodward, John %Y Lopez-Ibanez, Manuel %Y McCall, John %Y Bacardit, Jaume %Y Brownlee, Alexander %Y Cagnoni, Stefano %Y Iacca, Giovanni %Y Walker, David %Y Toutouh, Jamal %Y O’Reilly, UnaMay %Y Machado, Penousal %Y Correia, Joao %Y Nesmachnow, Sergio %Y Ceberio, Josu %Y Villanueva, Rafael %Y Hidalgo, Ignacio %Y Fernandez de Vega, Francisco %Y Paolo, Giuseppe %Y Coninx, Alex %Y Cully, Antoine %Y Gaier, Adam %Y Wagner, Stefan %Y Affenzeller, Michael %Y Bruce, Bobby R. %Y Nowack, Vesna %Y Blot, Aymeric %Y Winter, Emily %Y Langdon, William B. %Y Petke, Justyna %Y Fernandez Alzueta, Silvino %Y Valledor Pellicer, Pablo %Y Stuetzle, Thomas %Y Paetzel, David %Y Wagner, Alexander %Y Heider, Michael %Y Veerapen, Nadarajen %Y Malan, Katherine %Y Liefooghe, Arnaud %Y Verel, Sebastien %Y Ochoa, Gabriela %Y Omidvar, Mohammad Nabi %Y Sun, Yuan %Y Tarantino, Ernesto %Y Ivanoe, De Falco %Y Della Cioppa, Antonio %Y Umberto, Scafuri %Y Rieffel, John %Y Mouret, Jean-Baptiste %Y Doncieux, Stephane %Y Nikolaidis, Stefanos %Y Togelius, Julian %Y Fontaine, Matthew C. %Y Georgescu, Serban %Y Chicano, Francisco %Y Whitley, Darrell %Y Kyriienko, Oleksandr %Y Dahl, Denny %Y Shir, Ofer %Y Spector, Lee %Y Rahat, Alma %Y Everson, Richard %Y Fieldsend, Jonathan %Y Wang, Handing %Y Jin, Yaochu %Y Hemberg, Erik %Y Elsayed, Marwa A. %Y Kommenda, Michael %Y La Cava, William %Y Kronberger, Gabriel %Y Gustafson, Steven %S Proceedings of the 2022 Genetic and Evolutionary Computation Conference Companion %S GECCO ’22 %D 2022 %8 September 13 jul %I Association for Computing Machinery %C Boston, USA %F aldeia:2022:SymReg %X Symbolic Regression is the task of finding a mathematical expression to describe the relationship between one or more independent variables with a dependent variable. The search space can be vast and include any algebraic function; thus, finding optimal values for coefficients may not be a trivial task. The Interaction-Transformation representation alleviates this problem enforcing that the coefficients of the expression is part of a linear transformation, allowing the application of least squares. But this solution also limits the search space of the expressions. This paper proposes four different strategies to optimize the coefficients of the nonlinear part of the Interaction-Transformation representation. We benchmark the proposed strategies by applying the Interaction-Transformation Evolutionary Algorithm (ITEA) to six well-known data sets to evaluate four optimization heuristics combining linear and non-linear methods. The results show that optimizing the non-linear and linear coefficients separately was the best strategy to find better-performing expressions with a higher run-time and expression size. The non-linear optimization method alone was the worst-performing method. %K genetic algorithms, genetic programming, Representation of mathematical functions, symbolic regression, coefficient optimization, benchmark, evolutionary algorithm %R doi:10.1145/3520304.3533987 %U http://dx.doi.org/doi:10.1145/3520304.3533987 %P 2274-2281 %0 Journal Article %T Interpretability in symbolic regression: a benchmark of explanatory methods using the Feynman data set %A Aldeia, Guilherme Seidyo Imai %A Olivetti de Franca, Fabricio %J Genetic Programming and Evolvable Machines %D 2022 %8 sep %V 23 %N 3 %@ 1389-2576 %F Aldeia:2022:GPEM %O Special Issue: Highlights of Genetic Programming 2021 Events %X In some situations, the interpretability of the machine learning models plays a role as important as the model accuracy. Interpretability comes from the need to trust the prediction model, verify some of its properties, or even enforce them to improve fairness. Many model-agnostic explanatory methods exists to provide explanations for black-box models. In the regression task, the practitioner can use white-boxes or gray-boxes models to achieve more interpretable results, which is the case of symbolic regression. When using an explanatory method, and since interpretability lacks a rigorous definition, there is a need to evaluate and compare the quality and different explainers. This paper proposes a benchmark scheme to evaluate explanatory methods to explain regression models, mainly symbolic regression models. Experiments were performed using 100 physics equations with different interpretable and non-interpretable regression methods and popular explanation methods, evaluating the performance of the explainers performance with several explanation measures. In addition, we further analyzed four benchmarks from the GP community. The results have shown that Symbolic Regression models can be an interesting alternative to white-box and black-box models that is capable of returning accurate models with appropriate explanations. Regarding the explainers, we observed that Partial Effects and SHAP were the most robust explanation models, with Integrated Gradients being unstable only with tree-based models. This benchmark is publicly available for further experiments. %K genetic algorithms, genetic programming, Symbolic regression, Explanatory methods, Feature importance attribution, Benchmark %9 journal article %R doi:10.1007/s10710-022-09435-x %U http://dx.doi.org/doi:10.1007/s10710-022-09435-x %P 309-349 %0 Book Section %T Toward a Technique for Cooperative Network Design Using Evolutionary Methods %A Alderson, David %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 1999 %D 1999 %8 15 mar %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F alderson:1999:TTCNDUEM %K genetic algorithms %P 1-10 %0 Journal Article %T Evaluation of water quality indexes with novel machine learning and SHapley Additive ExPlanation (SHAP) approaches %A Aldrees, Ali %A Khan, Majid %A Taha, Abubakr Taha Bakheit %A Ali, Mujahid %J Journal of Water Process Engineering %D 2024 %V 58 %@ 2214-7144 %F ALDREES:2024:jwpe %X Water quality indexes (WQI) are pivotal in assessing aquatic systems. Conventional modeling approaches rely on extensive datasets with numerous unspecified inputs, leading to time-consuming WQI assessment procedures. Numerous studies have used machine learning (ML) methods for WQI analysis but often lack model interpretability. To address this issue, this study developed five interpretable predictive models, including two gene expression programming (GEP) models, two deep neural networks (DNN) models, and one optimizable Gaussian process regressor (OGPR) model for estimating electrical conductivity (EC) and total dissolved solids (TDS). For the model development, a total of 372 records on a monthly basis were collected in the Upper Indus River at two outlet stations. The efficacy and accuracy of the models were assessed using various statistical measures, such as correlation (R), mean square error (MAE), root mean square error (RMSE), and 5-fold cross-validation. The DNN2 model demonstrated outstanding performance compared to the other five models, exhibiting R-values closer to 1.0 for both EC and TDS. However, the genetic programming-based models, GEP1 and GEP2, exhibited comparatively lower accuracy in predicting the water quality indexes. The SHapely Additive exPlanation (SHAP) analysis revealed that bicarbonate, calcium, and sulphate jointly contribute approximately 78 percent to EC, while the combined presence of sodium, bicarbonate, calcium, and magnesium accounts for around 87 percent of TDS in water. Notably, the influence of pH and chloride was minimal on both water quality indexes. In conclusion, the study highlights the cost-effective and practical potential of predictive models for EC and TDS in assessing and monitoring river water quality %K genetic algorithms, genetic programming, Gene expression programming, Water quality indexes, ANN, Deep neural networks, Optimizable Gaussian process regressor, SHAP %9 journal article %R doi:10.1016/j.jwpe.2024.104789 %U https://www.sciencedirect.com/science/article/pii/S2214714424000199 %U http://dx.doi.org/doi:10.1016/j.jwpe.2024.104789 %P 104789 %0 Conference Proceedings %T A new framework for scalable genetic programming %A Aleb, Nassima %A Kechid, Samir %Y Gustafson, Steven %Y Vladislavleva, Ekaterina %S GECCO 2012 Symbolic regression and modeling workshop %D 2012 %8 July 11 jul %I ACM %C Philadelphia, Pennsylvania, USA %F Aleb:2012:GECCOcomp %X This paper presents a novel framework for scalable multi-objective genetic programming. We introduce a new program modeling aiming at facilitating programs’ creation, execution and improvement. The proposed modeling allows making symbolic executions in such a way to reduce drastically the time of programs’ executions and to allow well-founded programs recombination. %K genetic algorithms, genetic programming %R doi:10.1145/2330784.2330859 %U http://dx.doi.org/doi:10.1145/2330784.2330859 %P 487-492 %0 Conference Proceedings %T An interpolation based crossover operator for genetic programming %A Aleb, Nassima %A Kechid, Samir %Y Blum, Christian %Y Alba, Enrique %Y Bartz-Beielstein, Thomas %Y Loiacono, Daniele %Y Luna, Francisco %Y Mehnen, Joern %Y Ochoa, Gabriela %Y Preuss, Mike %Y Tantar, Emilia %Y Vanneschi, Leonardo %Y McClymont, Kent %Y Keedwell, Ed %Y Hart, Emma %Y Sim, Kevin %Y Gustafson, Steven %Y Vladislavleva, Ekaterina %Y Auger, Anne %Y Bischl, Bernd %Y Brockhoff, Dimo %Y Hansen, Nikolaus %Y Mersmann, Olaf %Y Posik, Petr %Y Trautmann, Heike %Y Iqbal, Muhammad %Y Shafi, Kamran %Y Urbanowicz, Ryan %Y Wagner, Stefan %Y Affenzeller, Michael %Y Walker, David %Y Everson, Richard %Y Fieldsend, Jonathan %Y Stonedahl, Forrest %Y Rand, William %Y Smith, Stephen L. %Y Cagnoni, Stefano %Y Patton, Robert M. %Y Pappa, Gisele L. %Y Woodward, John %Y Swan, Jerry %Y Krawiec, Krzysztof %Y Tantar, Alexandru-Adrian %Y Bosman, Peter A. N. %Y Vega-Rodriguez, Miguel %Y Chaves-Gonzalez, Jose M. %Y Gonzalez-Alvarez, David L. %Y Santander-Jimenez, Sergio %Y Spector, Lee %Y Keijzer, Maarten %Y Holladay, Kenneth %Y Tusar, Tea %Y Naujoks, Boris %S GECCO ’13 Companion: Proceeding of the fifteenth annual conference companion on Genetic and evolutionary computation conference companion %D 2013 %8 June 10 jul %I ACM %C Amsterdam, The Netherlands %F Aleb:2013:GECCOcomp %X This paper presents a new crossover operator for genetic programming. We exploit two concepts of formal methods: Weakest precondition and Craig interpolation, to perform semantically aware crossover. Weakest preconditions are used to locate faulty parts of a program and Craig interpolation is used to correct these ones. %K genetic algorithms, genetic programming %R doi:10.1145/2464576.2482689 %U http://dx.doi.org/doi:10.1145/2464576.2482689 %P 1107-1112 %0 Journal Article %T The use of evolutionary programming based on training examples for the generation of finite state machines for controlling objects with complex behavior %A Aleksandrov, A. V. %A Kazakov, S. V. %A Sergushichev, A. A. %A Tsarev, F. N. %A Shalyto, A. A. %J Journal of Computer and Systems Sciences International %D 2013 %8 may %V 52 %N 3 %I SP MAIK Nauka/Interperiodica %@ 1064-2307 %G English %F Aleksandrov:2013:JCSSI %X It is proposed to use evolutionary programming to generate finite state machines (FSMs) for controlling objects with complex behaviour. The well-know approach in which the FSM performance is evaluated by simulation, which is typically time consuming, is replaced with comparison of the object’s behaviour controlled by the FSM with the behaviour of this object controlled by a human. A feature of the proposed approach is that it makes it possible to deal with objects that have not only discrete but also continuous parameters. The use of this approach is illustrated by designing an FSM controlling a model aircraft executing a loop-the-loop manoeuvre. %K genetic algorithms, genetic programming, FSM %9 journal article %R doi:10.1134/S1064230713020020 %U http://dx.doi.org/doi:10.1134/S1064230713020020 %P 410-425 %0 Journal Article %T Evolving the Controller of Automated Steering of a Car in Slippery Road Conditions %A Alekseeva, Natalia %A Tanev, Ivan %A Shimohara, Katsunori %J Algorithms %D 2018 %8 jul %V 11 %N 7 %@ 1999-4893 %F Alekseeva:2018:Algorithms %O Special Issue Algorithms for PID Controller %X The most important characteristics of autonomous vehicles are their safety and their ability to adapt to various traffic situations and road conditions. In our research, we focused on the development of controllers for automated steering of a realistically simulated car in slippery road conditions. We comparatively investigated three implementations of such controllers: a proportional-derivative (PD) controller built in accordance with the canonical servo-control model of steering, a PID controller as an extension of the servo-control, and a controller designed heuristically via the most versatile evolutionary computing paradigm: genetic programming (GP). The experimental results suggest that the controller evolved via GP offers the best quality of control of the car in all of the tested slippery (rainy, snowy, and icy) road conditions. %K genetic algorithms, genetic programming, autonomous vehicles, automated steering, slippery road conditions, PID controllers %9 journal article %R doi:10.3390/a11070108 %U http://www.mdpi.com/1999-4893/11/7/108 %U http://dx.doi.org/doi:10.3390/a11070108 %P 108 %0 Conference Proceedings %T On the Emergence of Oscillations in the Evolved Autosteering of a Car on Slippery Roads %A Alekseeva, Natalia %A Tanev, Ivan %A Shimohara, Katsunori %S 2019 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM) %D 2019 %8 August 12 jul %C Hong Kong %F Alekseeva:2019:AIM %X One of the important features of autonomous vehicles is their versatility to various traffic situations and road conditions. We explore the feasibility of using genetic programming to develop an adequate auto-steering of a car in slippery road conditions. We also investigate an important emergent property of the best-evolved steering solutions - the steering oscillations - and discuss how these oscillations contribute to the better controllability of the sliding car. We present the limitations and the technical challenges of the real world implementation of steering oscillations. %K genetic algorithms, genetic programming %R doi:10.1109/AIM.2019.8868610 %U http://dx.doi.org/doi:10.1109/AIM.2019.8868610 %P 1371-1378 %0 Journal Article %T PD Steering Controller Utilizing the Predicted Position on Track for Autonomous Vehicles Driven on Slippery Roads %A Alekseeva, Natalia %A Tanev, Ivan %A Shimohara, Katsunori %J Algorithms %D 2020 %8 feb %V 13 %N 2 %F DBLP:journals/algorithms/AlekseevaTS20 %X Among the most important characteristics of autonomous vehicles are the safety and robustness in various traffic situations and road conditions. In this paper, we focus on the development and analysis of the extended version of the canonical proportional-derivative PD controllers that are known to provide a good quality of steering on non-slippery (dry) roads. However, on slippery roads, due to the poor yaw controllability of the vehicle (suffering from understeering and oversteering), the quality of control of such controllers deteriorates. The proposed predicted PD controller (PPD controller) overcomes the main drawback of PD controllers, namely, the reactiveness of their steering behavior. The latter implies that steering output is a direct result of the currently perceived lateral- and angular deviation of the vehicle from its intended, ideal trajectory, which is the center of the lane. This reactiveness, combined with the tardiness of the yaw control of the vehicle on slippery roads, results in a significant lag in the control loop that could not be compensated completely by the predictive (derivative) component of these controllers. In our approach, keeping the controller efforts at the same level as in PD controllers by avoiding (i) complex computations and (ii) adding additional variables, the PPD controller shows better quality of steering than that of the evolved (via genetic programming) models. %K genetic algorithms, genetic programming, autonomous vehicles, automated steering, slippery road conditions, PD controllers, predictive model %9 journal article %R doi:10.3390/a13020048 %U https://www.mdpi.com/1999-4893/13/2/48/pdf %U http://dx.doi.org/doi:10.3390/a13020048 %P id48 %0 Journal Article %T Modeling deformation modulus of a stratified sedimentary rock mass using neural network, fuzzy inference and genetic programming %A Alemdag, S. %A Gurocak, Z. %A Cevik, A. %A Cabalar, A. F. %A Gokceoglu, C. %J Engineering Geology %D 2016 %V 203 %@ 0013-7952 %F Alemdag:2016:EG %O Special Issue on Probabilistic and Soft Computing Methods for Engineering Geology %X This paper investigates a series of experimental results and numerical simulations employed to estimate the deformation modulus of a stratified rock mass. The deformation modulus of rock mass has a significant importance for some applications in engineering geology and geotechnical projects including foundation, slope, and tunnel designs. Deformation modulus of a rock mass can be determined using large scale in-situ tests. This large scale sophisticated in-situ testing equipments are sometimes difficult to install, plus time consuming to be employed in the field. Therefore, this study aims to estimate indirectly the deformation modulus values via empirical methods such as the neural network, neuro fuzzy and genetic programming approaches. A series of analyses have been developed for correlating various relationships between the deformation modulus of rock mass, rock mass rating, rock quality designation, uniaxial compressive strength, and elasticity modulus of intact rock parameters. The performance capacities of proposed models are assessed and found as quite satisfactory. At the completion of a comparative study on the accuracy of models, in the results, it is seen that overall genetic programming models yielded more precise results than neural network and neuro fuzzy models. %K genetic algorithms, genetic programming, Deformation modulus, Rock mass, Neural network, Neuro fuzzy %9 journal article %R doi:10.1016/j.enggeo.2015.12.002 %U http://www.sciencedirect.com/science/article/pii/S0013795215300971 %U http://dx.doi.org/doi:10.1016/j.enggeo.2015.12.002 %P 70-82 %0 Conference Proceedings %T Immediate transference of global improvements to all individuals in a population in Genetic Programming compared to Automatically Defined Functions for the EVEN-5 PARITY problem %A Aler, Ricardo %Y Banzhaf, Wolfgang %Y Poli, Riccardo %Y Schoenauer, Marc %Y Fogarty, Terence C. %S Proceedings of the First European Workshop on Genetic Programming %S LNCS %D 1998 %8 14 15 apr %V 1391 %I Springer-Verlag %C Paris %@ 3-540-64360-5 %F aler:1998:5parity %X Koza has shown how automatically defined functions (ADFs) can reduce computational effort in the GP paradigm. In Koza’s ADF, as well as in standard GP, an improvement in a part of a program (an ADF or a main body) can only be transferred via crossover. In this article, we consider whether it is a good idea to transfer immediately improvements found by a single individual to the whole population. A system that implements this idea has been proposed and tested for the EVEN-5-PARITY and EVEN-6-PARITY problems. Results are very encouraging: computational effort is reduced (compared to Koza’s ADFs) and the system seems to be less prone to early stagnation. Finally, our work suggests further research where less extreme approaches to our idea could be tested. %K genetic algorithms, genetic programming %R doi:10.1007/BFb0055928 %U http://dx.doi.org/doi:10.1007/BFb0055928 %P 60-70 %0 Conference Proceedings %T Evolved Heuristics for Planning %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %Y Porto, V. William %Y Saravanan, N. %Y Waagen, D. %Y Eiben, A. E. %S Evolutionary Programming VII: Proceedings of the Seventh Annual Conference on Evolutionary Programming %S LNCS %D 1998 %8 25 27 mar %V 1447 %I Springer-Verlag %C Mission Valley Marriott, San Diego, California, USA %@ 3-540-64891-7 %F aler:1998:ehp %K genetic algorithms, genetic programming %R doi:10.1007/BFb0040753 %U http://dx.doi.org/doi:10.1007/BFb0040753 %P 745-754 %0 Conference Proceedings %T Genetic Programming and Deductive-Inductive Learning: A Multistrategy Approach %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %Y Shavlik, Jude %S Proceedings of the Fifteenth International Conference on Machine Learning, ICML’98 %D 1998 %8 jul %I Morgan Kaufmann %C Madison, Wisconsin, USA %@ 1-55860-556-8 %F icml98-ricardo %X Genetic Programming (GP) is a machine learning technique that was not conceived to use domain knowledge for generating new candidate solutions. It has been shown that GP can benefit from domain knowledge obtained by other machine learning methods with more powerful heuristics. However, it is not obvious that a combination of GP and a knowledge intensive machine learning method can work better than the knowledge intensive method alone. In this paper we present a multistrategy approach where an already multistrategy approach (\sc hamlet combines analytical and inductive learning) and an evolutionary technique based on GP (EvoCK) are combined for the task of learning control rules for problem solving in planning. Results show that both methods complement each other, supplying to the other method what the other method lacks and obtaining better results than using each method alone. %K genetic algorithms, genetic programming, Learning in Planning, Multistrategy learning %U http://scalab.uc3m.es/~dborrajo/papers/icml98.ps.gz %P 10-18 %0 Thesis %T Programacion Genetica de Heuristicas para Planificacion %A Mur, Ricardo Aler %D 1999 %8 jul %C Spain %C Facultad de Informatica de la Universidad Politecnica de Madrid %F aler:thesis %X The aim of this thesis is to use and extend the machine learning genetic programming (GP) paradigm to learn control knowledge for domain independent planning. GP will be used as a standalone technique and as part of a multi-strategy system. Planning is the problem of finding a sequence of steps to transform an initial state in a final state. Finding a correct plan is NP-hard. A solution proposed by Artificial Intelligence is to augment a domain independent planner with control knowledge, to improve its efficiency. Machine learning techniques are used for that purpose. However, although a lot has been achieved, the domain independent planning problem has not been solved completely, therefore there is still room for research. The reason for using GP to learn planning control knowledge is twofold. First, it is intended for exploring the control knowledge space in a less biased way than other techniques. Besides, learning search control knowledge with GP will consider the planning system, the domain theory, planning search and efficiency measures in a global manner, all at the same time. Second, GP flexibility will be used to add useful biases and characteristics to another learning method that lacks them (that is, a multi-strategy GP based system). In the present work, Prodigy will be used as the base planner and Hamlet will be used as the learning system to which useful characteristics will be added through GP. In other words, GP will be used to solve some of Hamlet limitations by adding new biases/characteristics to Hamlet. In addition to the main goal, this thesis will design and experiment with methods to add background knowledge to a GP system, without modifying its basic algorithm. The first method seeds the initial population with individuals obtained by another method (Hamlet). Actually, this is the multi-strategy system discussed in the later paragraph. The second method uses a new genetic operator (instance based crossover) that is able to use instances/examples to bias its search, like other machine learning techniques. To test the validity of the methods proposed, extensive empirical and statistical validation will be carried out. %K genetic algorithms, genetic programming, Planning, Problem Solving, Rule Based System %9 Ph.D. thesis %U http://oa.upm.es/1101/1/10199907.pdf %0 Conference Proceedings %T GP fitness functions to evolve heuristics for planning %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %Y Middendorf, Martin %S Evolutionary Methods for AI Planning %D 2000 %8 August %C Las Vegas, Nevada, USA %F aler:2000:G %X There are several ways of applying Genetic Programming (GP) to STRIPS-like planning in the literature. In this paper we emphasise the use of a new one, based on learning heuristics for planning. In particular, we focus on the design of fitness functions for this task. We explore two alternatives (black and white box fitness functions) and present some empirical results %K genetic algorithms, genetic programming %U http://scalab.uc3m.es/~dborrajo/papers/gecco00.ps.gz %P 189-195 %0 Conference Proceedings %T Knowledge Representation Issues in Control Knowledge Learning %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %Y Langley, Pat %S Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000) %D 2000 %8 jun 29 jul 2 %I Morgan Kaufmann %C Stanford University, Standord, CA, USA %@ 1-55860-707-2 %G en %F oai:CiteSeerPSU:341634 %X Knowledge representation is a key issue for any machine learning task. There have already been many comparative studies about knowledge representation with respect to machine learning in classification tasks. However, apart from some work done on reinforcement learning techniques in relation to state representation, very few studies have concentrated on the effect of knowledge representation for machine learning applied to problem solving, and more specifically, to planning. In this paper, we present an experimental comparative study of the effect of changing the input representation of planning domain knowledge on control knowledge learning. We show results in two classical domains using three different machine learning systems, that have previously shown their effectiveness on learning planning control knowledge: a pure EBL mechanism, a combination of EBL and induction (HAMLET), and a Genetic Programming based system (EVOCK). %K genetic algorithms, genetic programming, EBL, HAMLET, EVOCK %U http://scalab.uc3m.es/~dborrajo/papers/icml00.ps.gz %P 1-8 %0 Conference Proceedings %T Grammars for Learning Control Knowledge with GP %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %S Proceedings of the 2001 Congress on Evolutionary Computation CEC2001 %D 2001 %8 27 30 may %I IEEE Press %C COEX, World Trade Center, 159 Samseong-dong, Gangnam-gu, Seoul, Korea %@ 0-7803-6658-1 %F aler:2001:glckg %X In standard GP there are no constraints on the structure to evolve: any combination of functions and terminals is valid. However, sometimes GP is used to evolve structures that must respect some constraints. Instead of ad-hoc mechanisms, grammars can be used to guarantee that individuals comply with the language restrictions. In addition, grammars permit great flexibility to define the search space. EVOCK (Evolution of Control Knowledge) is a GP based system that learns control rules for PRODIGY, an AI planning system. EVOCK uses a grammar to constrain individuals to PRODIGY 4.0 control rule syntax. The authors describe the grammar specific details of EVOCK. Also, the grammar approach flexibility has been used to extend the control rule language used by EVOCK in earlier work. Using this flexibility, tests were performed to determine whether using combinations of several types of control rules for planning was better than using only the standard select type. Experiments have been carried out in the blocksworld domain that show that using the combination of types of control rules does not get better individuals, but it produces good individuals more frequently %K genetic algorithms, genetic programming, computational linguistics, grammars, learning (artificial intelligence), search problems, AI planning system, EVOCK, Evolution of Control Knowledge, GP based system, PRODIGY, ad-hoc mechanisms, blocksworld domain, control knowledge learning, control rule language, control rule syntax, control rules, grammar approach flexibility, grammar specific, grammars, language restrictions, search space, standard GP, standard select type %R doi:10.1109/CEC.2001.934330 %U http://scalab.uc3m.es/~dborrajo/papers/cec01.ps.gz %U http://dx.doi.org/doi:10.1109/CEC.2001.934330 %P 1220-1227 %0 Journal Article %T Learning to Solve Planning Problems Efficiently by Means of Genetic Programming %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %J Evolutionary Computation %D 2001 %8 Winter %V 9 %N 4 %@ 1063-6560 %F aler:2001:ECJ %X Declarative problem solving, such as planning, poses interesting challenges for Genetic Programming (GP). There have been recent attempts to apply GP to planning that fit two approaches: (a) using GP to search in plan space or (b) to evolve a planner. In this article, we propose to evolve only the heuristics to make a particular planner more efficient. This approach is more feasible than (b) because it does not have to build a planner from scratch but can take advantage of already existing planning systems. It is also more efficient than (a) because once the heuristics have been evolved, they can be used to solve a whole class of different planning problems in a planning domain, instead of running GP for every new planning problem. Empirical results show that our approach (EVOCK) is able to evolve heuristics in two planning domains (the blocks world and the logistics domain) that improve PRODIGY4.0 performance. Additionally, we experiment with a new genetic operator Instance-Based Crossover that is able to use traces of the base planner as raw genetic material to be injected into the evolving population. %K genetic algorithms, genetic programming, genetic planning, evolving heuristics, planning, search. EvoCK, STGP, blocks world, logistics, Prodigy4.0, STRIPS, PDL40. %9 journal article %R doi:10.1162/10636560152642841 %U http://www.mitpressjournals.org/doi/pdf/10.1162/10636560152642841 %U http://dx.doi.org/doi:10.1162/10636560152642841 %P 387-420 %0 Journal Article %T Using genetic programming to learn and improve control knowledge %A Aler, Ricardo %A Borrajo, Daniel %A Isasi, Pedro %J Artificial Intelligence %D 2002 %8 oct %V 141 %N 1-2 %F aler:2002:AI %X The purpose of this article is to present a multi-strategy approach to learn heuristics for planning. This multi-strategy system, called HAMLET-EVOCK, combines a learning algorithm specialised in planning () and a genetic programming (GP) based system (: Evolution of Control Knowledge). Both systems are able to learn heuristics for planning on their own, but both of them have weaknesses. Based on previous experience and some experiments performed in this article, it is hypothesised that handicaps are due to its example-driven operators and not having a way to evaluate the usefulness of its control knowledge. It is also hypothesized that even if control knowledge is sometimes incorrect, it might be easily correctable. For this purpose, a GP-based stage is added, because of its complementary biases: GP genetic operators are not example-driven and it can use a fitness function to evaluate control knowledge. and are combined by seeding initial population with control knowledge. It is also useful for to start from a knowledge-rich population instead of a random one. By adding the GP stage to , the number of solved problems increases from 58% to 85% in the blocks world and from 50% to 87% in the logistics domain (0% to 38% and 0% to 42% for the hardest instances of problems considered). %K genetic algorithms, genetic programming, Speedup learning, Multi-strategy learning, Planning %9 journal article %R doi:10.1016/S0004-3702(02)00246-1 %U http://scalab.uc3m.es/~dborrajo/papers/aij-evock.ps.gz %U http://dx.doi.org/doi:10.1016/S0004-3702(02)00246-1 %P 29-56 %0 Conference Proceedings %T Cost-benefit Analysis of Using Heuristics in ACGP %A Aleshunas, John %A Janikow, Cezary %Y Smith, Alice E. %S Proceedings of the 2011 IEEE Congress on Evolutionary Computation %D 2011 %8 May 8 jun %I IEEE Press %C New Orleans, USA %@ 0-7803-8515-2 %F Aleshunas:2011:CAoUHiA %X Constrained Genetic Programming (CGP) is a method of searching the Genetic Programming search space non-uniformly, giving preferences to certain subspaces according to some heuristics. Adaptable CGP (ACGP) is a method for discovery of the heuristics. CGP and ACGP have previously demonstrated their capabilities using first-order heuristics: parent-child probabilities. Recently, the same advantage has been shown for second-order heuristics: parent- children probabilities. A natural question to ask is whether we can benefit from extending ACGP with deeper-order heuristics. This paper attempts to answer this question by performing cost-benefit analysis while simulating the higher- order heuristics environment. We show that this method cannot be extended beyond the current second or possibly third-order heuristics without a new method to deal with the sheer number of such deeper-order heuristics. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2011.5949749 %U http://dx.doi.org/doi:10.1109/CEC.2011.5949749 %P 1177-1183 %0 Conference Proceedings %T Constructing an Optimisation Phase Using Grammatical Evolution %A Alexander, B. J. %A Gratton, M. J. %Y Tyrrell, Andy %S 2009 IEEE Congress on Evolutionary Computation %D 2009 %8 18 21 may %I IEEE Press %C Trondheim, Norway %F Alexander:2009:cec %X Optimising compilers present their authors with an intractable design space. A substantial body of work has used heuristic search techniques to search this space for the purposes of adapting optimisers to their environment. To date, most of this work has focused on sequencing, tuning and guiding the actions of atomic hand-written optimisation phases. In this paper we explore the adaption of optimisers at a deeper level by demonstrating that it is feasible to automatically build a non-trivial optimisation phase, for a simple functional language, using Grammatical Evolution. We show that the individuals evolved compare well in performance to a handwritten optimisation phase on a range of benchmarks. We conclude with proposals of how this work and its applications can be extended. %K genetic algorithms, genetic programming, grammatical evolution, SBSE, evolutionary computation, functional languages, grammars, optimising compilers, search problems, atomic hand-written optimisation phases, heuristic search techniques, intractable design space %R doi:10.1109/CEC.2009.4983083 %U P395.pdf %U http://dx.doi.org/doi:10.1109/CEC.2009.4983083 %P 1209-1216 %0 Conference Proceedings %T Boosting Search for Recursive Functions Using Partial Call-Trees %A Alexander, Brad %A Zacher, Brad %Y Bartz-Beielstein, Thomas %Y Brank, Juergen %Y Smith, Jim %S 13th International Conference on Parallel Problem Solving from Nature %S Lecture Notes in Computer Science %D 2014 %8 13 17 sep %V 8672 %I Springer %C Ljubljana, Slovenia %F alexander2014boosting %X Recursive functions are a compact and expressive way to solve challenging problems in terms of local processing. These properties have made recursive functions a popular target for genetic programming. Unfortunately, the evolution of substantial recursive programs has proved difficult. One cause of this problem is the difficulty in evolving both correct base and recursive cases using just information derived from running test cases. In this work we describe a framework that exploits additional information in the form of partial call-trees. Such trees - a by-product of deriving input-output cases by hand - guides the search process by allowing the separate evolution of the recursive case. We show that the speed of evolution of recursive functions is significantly enhanced by the use of partial call-trees and demonstrate application of the technique in the derivation of functions for a suite of numerical functions. %K genetic algorithms, genetic programming, grammatical evolution, Recursion, Call-Tree, Adaptive Grammar %R doi:10.1007/978-3-319-10762-2_38 %U http://dx.doi.org/doi:10.1007/978-3-319-10762-2_38 %P 384-393 %0 Book Section %T Discussion on Automatic Fault Localisation and Repair %A Alexander, Bradley %E Mei, Hong %E Minku, Leandro %E Neumann, Frank %E Yao, Xin %B Computational Intelligence for Software Engineering %D 2014 %8 oct 20 23 %I National Institute of Informatics %C Japan %F Alexander:2014:shonan %O NII Shonan Meeting Report: No. 2014-13 %K genetic algorithms, genetic programming, genetic improvement, APR %U http://shonan.nii.ac.jp/seminar/reports/wp-content/uploads/sites/56/2015/01/No.2014-13.pdf %P 16-19 %0 Conference Proceedings %T Using Scaffolding with Partial Call-Trees to Improve Search %A Alexander, Brad %A Pyromallis, Connie %A Lorenzetti, George %A Zacher, Brad %Y Handl, Julia %Y Hart, Emma %Y Lewis, Peter R. %Y Lopez-Ibanez, Manuel %Y Ochoa, Gabriela %Y Paechter, Ben %S 14th International Conference on Parallel Problem Solving from Nature %S LNCS %D 2016 %8 17 21 sep %V 9921 %I Springer %C Edinburgh %F Alexander:2016:PPSN %X Recursive functions are an attractive target for genetic programming because they can express complex computation compactly. However, the need to simultaneously discover correct recursive and base cases in these functions is a major obstacle in the evolutionary search process. To overcome these obstacles two recent remedies have been proposed. The first is Scaffolding which permits the recursive case of a function to be evaluated independently of the base case. The second is Call- Tree-Guided Genetic Programming (CTGGP) which uses a partial call tree, supplied by the user, to separately evolve the parameter expressions for recursive calls. Used in isolation, both of these approaches have been shown to offer significant advantages in terms of search performance. In this work we investigate the impact of different combinations of these approaches. We find that, on our benchmarks, CTGGP significantly outperforms Scaffolding and that a combination CTGGP and Scaffolding appears to produce further improvements in worst-case performance. %K genetic algorithms, genetic programming, Grammatical evolution, Recursion %R doi:10.1007/978-3-319-45823-6_3 %U http://dx.doi.org/doi:10.1007/978-3-319-45823-6_3 %P 324-334 %0 Generic %T A Preliminary Exploration of Floating Point Grammatical Evolution %A Alexander, Brad %D 2018 %8 September %I arXiv %F Alexander:2018:arxiv %X Current GP frameworks are highly effective on a range of real and simulated benchmarks. However, due to the high dimensionality of the genotypes for GP, the task of visualising the fitness landscape for GP search can be difficult. This paper describes a new framework: Floating Point Grammatical Evolution (FP-GE) which uses a single floating point genotype to encode an individual program. This encoding permits easier visualisation of the fitness landscape arbitrary problems by providing a way to map fitness against a single dimension. The new framework also makes it trivially easy to apply continuous search algorithms, such as Differential Evolution, to the search problem. In this work, the FP-GE framework is tested against several regression problems, visualising the search landscape for these and comparing different search meta-heuristics. %K genetic algorithms, genetic programming, grammatical evolution %U http://arxiv.org/abs/1806.03455 %0 Conference Proceedings %T Temperature Forecasting in the Concept of Weather Derivatives: a Comparison between Wavelet Networks and Genetic Programming %A Alexandiris, Antonios K. %A Kampouridis, Michael %Y Iliadis, Lazaros S. %Y Papadopoulos, Harris %Y Jayne, Chrisina %S Proceedings of 14th International Conference on Engineering Applications of Neural Networks (EANN 2013), Part I %S Communications in Computer and Information Science %D 2013 %8 sep 13 16 %V 383 %I Springer %C Halkidiki, Greece %F conf/eann/AlexandirisK13 %X The purpose of this study is to develop a model that accurately describes the dynamics of the daily average temperature in the context of weather derivatives pricing. More precisely we compare two state of the art algorithms, namely wavelet networks and genetic programming against the classic linear approaches widely using in the contexts of temperature derivative pricing. The accuracy of the valuation process depends on the accuracy of the temperature forecasts. Our proposed models were evaluated and compared in-sample and out-of-sample in various locations. Our findings suggest that the proposed nonlinear methods significantly outperform the alternative linear models and can be used for accurate weather derivative pricing. %K genetic algorithms, genetic programming, weather derivatives, wavelet networks, temperature derivatives %R doi:10.1007/978-3-642-41013-0_2 %U http://dx.doi.org/10.1007/978-3-642-41013-0 %U http://dx.doi.org/doi:10.1007/978-3-642-41013-0_2 %P 12-21 %0 Journal Article %T A comparison of wavelet networks and genetic programming in the context of temperature derivatives %A Alexandridis, Antonis K. %A Kampouridis, Michael %A Cramer, Sam %J International Journal of Forecasting %D 2017 %V 33 %N 1 %@ 0169-2070 %F Alexandridis:2017:IJF %X The purpose of this study is to develop a model that describes the dynamics of the daily average temperature accurately in the context of weather derivatives pricing. More precisely, we compare two state-of-the-art machine learning algorithms, namely wavelet networks and genetic programming, with the classic linear approaches that are used widely in the pricing of temperature derivatives in the financial weather market, as well as with various machine learning benchmark models such as neural networks, radial basis functions and support vector regression. The accuracy of the valuation process depends on the accuracy of the temperature forecasts. Our proposed models are evaluated and compared, both in-sample and out-of-sample, in various locations where weather derivatives are traded. Furthermore, we expand our analysis by examining the stability of the forecasting models relative to the forecasting horizon. Our findings suggest that the proposed nonlinear methods outperform the alternative linear models significantly, with wavelet networks ranking first, and that they can be used for accurate weather derivative pricing in the weather market. %K genetic algorithms, genetic programming, Weather derivatives, Wavelet networks, Temperature derivatives, Modelling, Forecasting %9 journal article %R doi:10.1016/j.ijforecast.2016.07.002 %U http://www.sciencedirect.com/science/article/pii/S0169207016300711 %U http://dx.doi.org/doi:10.1016/j.ijforecast.2016.07.002 %P 21-47 %0 Thesis %T Optimisation of Time Domain Controllers for Supply Ships Using Genetic Algorithms and Genetic Programming %A Alfaro Cid, Maria Eva %D 2003 %8 oct %C Glasgow, UK %C The University of Glasgow %F Alfaro-Cid:thesis %X The use of genetic methods for the optimisation of propulsion and heading controllers for marine vessels is presented in this thesis. The first part of this work is a study of the optimisation, using Genetic Algorithms, of controller designs based on a number of different time-domain control methodologies such as PID, Sliding Mode, H? and Pole Placement. These control methodologies are used to provide the structure for propulsion and navigation controllers for a ship. Given the variety in the number of parameters to optimise and the controller structures, the Genetic Algorithm is tested in different control optimisation problems with different search spaces. This study presents how the Genetic Algorithm solves this minimisation problem by evolving controller parameters solutions that satisfactorily perform control duties while keeping actuator usage to a minimum. A variety of genetic operators are introduced and a comparison study is conducted to find the Genetic Algorithm scheme best suited to the parameter controller optimisation problem. The performance of the four control methodologies is also compared. A variation of Genetic Algorithms, the Structured Genetic Algorithm, is also used for the optimisation of the H? controller. The H? controller optimisation presents the difficulty that the optimisation focus is not on parameters but on transfer functions. Structured Genetic Algorithm incorporates hierarchy in the representation of solutions making it very suitable for structural optimisation. The H? optimisation problem has been found to be very appropriate for comparing the performance of Genetic Algorithms versus Structured Genetic Algorithm. During the second part of this work, the use of Genetic Programming to optimise the controller structure is assessed. Genetic Programming is used to evolve control strategies that, given as inputs the current and desired state of the propulsion and heading dynamics, generate the commanded forces required to manoeuvre the ship. Two Genetic Programming algorithms are implemented. The only difference between them is how they generate the numerical constants needed for the solution of the problem. The first approach uses a random generation of constants while the second approach uses a combination of Genetic Programming with Genetic Algorithms. Finally, the controllers optimised using genetic methods are evaluated through computer simulations and real manoeuvrability tests in a laboratory water basin facility. The robustness of each controller is analysed through the simulation of environmental disturbances. Also, optimisations in presence of disturbances are carried out so that the different controllers obtained can be compared. The particular vessels used in this study are two scale models of a supply ship called CyberShip I and CyberShip II. The results obtained illustrate the benefits of using Genetic Algorithms and Genetic Programming to optimise propulsion and navigation controllers for surface ships. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://casnew.iti.es/papers/ThesisEva.pdf %0 Conference Proceedings %T Clasificación de Senales de Electroencefalograma Usando Programación Genética %A Alfaro-Cid, Eva %A Esparcia-Alcázar, Anna %A Sharman, Ken %S Actas del IV Congreso Espanol sobre Metaheurísticas, Algoritmos Evolutivos y Bioinspirados (MAEB’05) %D 2005 %8 sep %C Granada, Spain %F alfespshar05 %X En este articulo presentamos una nueva manera de aplicar programacion genetica al problema de clasificacion de series temporales. Eneste caso las series de datos usadas son senalesde electroencefalograma. Se han implementado dos tipos de algoritmos de programaciongenetica: uno de ellos usa programacion distribuida mientras que el otro aplica una tecnica de muestreo aleatorio para evitar el problema de la sobreadaptacion. Los arboles resultantes obtienen porcentajes de aciertos en la clasificacion equivalentes a los que se obtienen usando metodos de clasifficacion tradicionales %K genetic algorithms, genetic programming %U http://www.iti.upv.es/cas/nade/data/maeb05vfinal.pdf %0 Conference Proceedings %T Evolution of a Strategy for Ship Guidance Using Two Implementations of Genetic Programming %A Alfaro-Cid, Eva %A McGookin, Euan William %A Murray-Smith, David James %Y Keijzer, Maarten %Y Tettamanzi, Andrea %Y Collet, Pierre %Y van Hemert, Jano I. %Y Tomassini, Marco %S Proceedings of the 8th European Conference on Genetic Programming %S Lecture Notes in Computer Science %D 2005 %8 30 mar 1 apr %V 3447 %I Springer %C Lausanne, Switzerland %@ 3-540-25436-6 %F eurogp:Alfaro-CidMM05 %X In this paper the implementation of Genetic Programming (GP) to optimise a controller structure for a supply ship is assessed. GP is used to evolve control strategies that, given the current and desired state of the propulsion and heading dynamics of a supply ship as inputs, generate the commanded forces required to manoeuvre the ship. The optimised controllers are evaluated through computer simulations and real manoeuvrability tests in a water basin laboratory. In order to deal with the issue of the generation of numerical constants, two kinds of GP algorithms are implemented. The first one chooses the constants necessary to create the controller structure by random generation . The second algorithm includes a Genetic Algorithms (GAs) technique for the optimisation of such constants. The results obtained illustrate the benefits of using GP to optimise propulsion and navigation controllers for ships. %K genetic algorithms, genetic programming: Poster %R doi:10.1007/978-3-540-31989-4_22 %U http://dx.doi.org/doi:10.1007/978-3-540-31989-4_22 %P 250-260 %0 Conference Proceedings %T Using distributed genetic programming to evolve classifiers for a brain computer interface %A Alfaro-Cid, Eva %A Esparcia-Alcázar, Anna %A Sharman, Ken %Y Verleysen, Michel %S ESANN’2006 proceedings - European Symposium on Artificial Neural Networks %D 2006 %8 26 28 apr %C Bruges, Belgium %@ 2-930307-06-4 %F conf/esann/Alfaro-CidES06 %X The objective of this paper is to illustrate the application of genetic programming to evolve classifiers for multi-channel time series data. The paper shows how high performance distributed genetic programming (GP) has been implemented for evolving classifiers. The particular application discussed herein is the classification of human electroencephalographic (EEG) signals for a brain-computer interface (BCI). The resulting classifying structures provide classification rates comparable to those obtained using traditional, human-designed, classification %K genetic algorithms, genetic programming %U http://www.dice.ucl.ac.be/Proceedings/esann/esannpdf/es2006-44.pdf %P 59-66 %0 Conference Proceedings %T Evolving a Learning Machine by Genetic Programming %A Alfaro-Cid, Eva %A Sharman, Ken %A Esparcia-Alcazar, Anna I. %Y Yen, Gary G. %Y Wang, Lipo %Y Bonissone, Piero %Y Lucas, Simon M. %S Proceedings of the 2006 IEEE Congress on Evolutionary Computation %D 2006 %8 16 21 jul %I IEEE Press %C Vancouver %@ 0-7803-9487-9 %F Alfaro-Cid:2006:CEC %X We describe a novel technique for evolving a machine that can learn. The machine is evolved using a Genetic Programming (GP) algorithm that incorporates in its function set what we have called a learning node. Such a node is tuned by a second optimisation algorithm (in this case Simulated Annealing), mimicking a natural learning process and providing the GP tree with added flexibility and adaptability. The result of the evolution is a system with a fixed structure but with some variable parameters. The system can then learn new tasks in new environments without undergoing further evolution. %K genetic algorithms, genetic programming, simulated annealing, function set, learning machine, learning node, optimization algorithm, simulated annealing %R doi:10.1109/CEC.2006.1688316 %U http://dx.doi.org/doi:10.1109/CEC.2006.1688316 %P 958-962 %0 Conference Proceedings %T Predicción de quiebra empresarial usando programación genética %A Alfaro Cid, Eva %A Sharman, Ken %A Esparcia Alcázar, Anna I. %Y Rodriguez, Francisco Almeida %Y Batista, Maria Belen Melian %Y Perez, Jose Andres Moreno %Y Vega, Jose Marcos Moreno %S Actas del V Congreso Español sobre Metaheurísticas, Algoritmos Evolutivos y Bioinspirados (MAEB’07) %D 2007 %8 Febrero %I La Laguna %C Tenerife, Spain %F alshaes2007a %K genetic algorithms, genetic programming %U https://dialnet.unirioja.es/servlet/articulo?codigo=4142085 %P 703-710 %0 Conference Proceedings %T Aprendizaje automático con programación genética %A Alfaro Cid, Eva %A Sharman, Ken %A Esparcia Alcázar, Anna I. %A Cuesta Cañada, Alberto %S Actas del V Congreso Español sobre Metaheurísticas, Algoritmos Evolutivos y Bioinspirados (MAEB’07) %D 2007 %8 Febrero %I La Laguna %C Tenerife, Spain %F alshaescu2007a %K genetic algorithms, genetic programming %U https://dialnet.unirioja.es/servlet/articulo?codigo=4148339 %P 819-826 %0 Conference Proceedings %T A genetic programming approach for bankruptcy prediction using a highly unbalanced database %A Alfaro-Cid, Eva %A Sharman, Ken %A Esparcia-Alcàzar, Anna I. %Y Giacobini, Mario %Y Brabazon, Anthony %Y Cagnoni, Stefano %Y Di Caro, Gianni A. %Y Drechsler, Rolf %Y Farooq, Muddassar %Y Fink, Andreas %Y Lutton, Evelyne %Y Machado, Penousal %Y Minner, Stefan %Y O’Neill, Michael %Y Romero, Juan %Y Rothlauf, Franz %Y Squillero, Giovanni %Y Takagi, Hideyuki %Y Uyar, A. Sima %Y Yang, Shengxiang %S Applications of Evolutionary Computing, EvoWorkshops2007: EvoCOMNET, EvoFIN, EvoIASP, EvoInteraction, EvoMUSART, EvoSTOC, EvoTransLog %S LNCS %D 2007 %8 November 13 apr %V 4448 %I Springer Verlag %C Valencia, Spain %F alfaro-cid:evows07 %X in this paper we present the application of a genetic programming algorithm to the problem of bankruptcy prediction. To carry out the research we have used a database of Spanish companies. The database has two important drawbacks: the number of bankrupt companies is very small when compared with the number of healthy ones (unbalanced data) and a considerable number of companies have missing data. For comparison purposes we have solved the same problem using a support vector machine. Genetic programming has achieved very satisfactory results, improving those obtained with the support vector machine. %K genetic algorithms, genetic programming, SVM %R doi:10.1007/978-3-540-71805-5_19 %U http://dx.doi.org/doi:10.1007/978-3-540-71805-5_19 %P 169-178 %0 Conference Proceedings %T A SOM and GP Tool for Reducing the Dimensionality of a Financial Distress Prediction Problem %A Alfaro-Cid, Eva %A Mora, Antonio Miguel %A Guervós, Juan Julián Merelo %A Esparcia-Alcázar, Anna %A Sharman, Ken %Y Giacobini, Mario %Y Brabazon, Anthony %Y Cagnoni, Stefano %Y Di Caro, Gianni %Y Drechsler, Rolf %Y Ekárt, Anikó %Y Esparcia-Alcázar, Anna %Y Farooq, Muddassar %Y Fink, Andreas %Y McCormack, Jon %Y O’Neill, Michael %Y Romero, Juan %Y Rothlauf, Franz %Y Squillero, Giovanni %Y Uyar, Sima %Y Yang, Shengxiang %S Proceedings of EvoCOMNET, EvoFIN, EvoHOT, EvoIASP, EvoMUSART, EvoNUM, EvoSTOC, and EvoTransLog, Applications of Evolutionary Computing, EvoWorkshops %S Lecture Notes in Computer Science %D 2008 %8 26 28 mar %V 4974 %I Springer %C Naples %F conf/evoW/Alfaro-CidMGES08 %K genetic algorithms, genetic programming %R doi:10.1007/978-3-540-78761-7_13 %U http://dx.doi.org/doi:10.1007/978-3-540-78761-7_13 %P 123-132 %0 Conference Proceedings %T Comparing Multiobjective Evolutionary Ensembles for Minimizing Type I and II Errors for Bankruptcy Prediction %A Alfaro-Cid, E. %A Castillo, P. A. %A Esparcia, A. %A Sharman, K. %A Merelo, J. J. %A Prieto, A. %A Laredo, J. L. J. %Y Wang, Jun %S 2008 IEEE World Congress on Computational Intelligence %D 2008 %8 January 6 jun %I IEEE Press %C Hong Kong %F Alfaro-Cid:2008:cec %X In many real world applications type I (false positive) and type II (false negative) errors have to be dealt with separately, which is a complex problem since an attempt to minimise one of them usually makes the other grow. In fact, a type of error can be more important than the other, and a trade-off that minimises the most important error type must be reached. In the case of the bankruptcy prediction problem the error type II is of greater importance, being unable to identify that a company is at risk causes problems to creditors and slows down the taking of measures that may solve the problem. Despite the importance of type II errors, most bankruptcy prediction methods take into account only the global classification error. In this paper we propose and compare two methods to optimise both error types in classification: artificial neural networks and function trees ensembles created through multiobjective Optimization. Since the multiobjective Optimization process produces a set of equally optimal results (Pareto front) the classification of the test patterns in both cases is based on the non-dominated solutions acting as an ensemble. The experiments prove that, although the best classification rates are obtained using the artificial neural network, the multiobjective genetic programming model is able to generate comparable results in the form of an analytical function. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2008.4631188 %U EC0649.pdf %U http://dx.doi.org/doi:10.1109/CEC.2008.4631188 %P 2902-2908 %0 Journal Article %T Genetic Programming for the Automatic Design of Controllers for a Surface Ship %A Alfaro-Cid, Eva %A McGookin, Euan W. %A Murray-Smith, David J. %A Fossen, Thor I. %J IEEE Transactions on Intelligent Transportation Systems %D 2008 %8 jun %V 9 %N 2 %@ 1524-9050 %F Alfaro-Cid:2008:ieeeITS %X In this paper, the implementation of genetic programming (GP) to design a controller structure is assessed. GP is used to evolve control strategies that, given the current and desired state of the propulsion and heading dynamics of a supply ship as inputs, generate the commanded forces required to maneuver the ship. The controllers created using GP are evaluated through computer simulations and real maneuverability tests in a laboratory water basin facility. The robustness of each controller is analyzed through the simulation of environmental disturbances. In addition, GP runs in the presence of disturbances are carried out so that the different controllers obtained can be compared. The particular vessel used in this paper is a scale model of a supply ship called CyberShip II. The results obtained illustrate the benefits of using GP for the automatic design of propulsion and navigation controllers for surface ships. %K genetic algorithms, genetic programming, control system synthesis, navigation, propulsion, ships CyberShip II, automatic design, controller structure, navigation controllers, propulsion controllers, supply ship, surface ship %9 journal article %R doi:10.1109/TITS.2008.922932 %U http://results.ref.ac.uk/Submissions/Output/2145080 %U http://dx.doi.org/doi:10.1109/TITS.2008.922932 %P 311-321 %0 Conference Proceedings %T Prune and Plant: A New Bloat Control Method for Genetic Programming %A Alfaro-Cid, Eva %A Esparcia-Alcazar, Anna %A Sharman, Ken %A Fernandez de Vega, Francisco %A Merelo, J. J. %S Eighth International Conference on Hybrid Intelligent Systems, HIS ’08 %D 2008 %8 sep %F Alfaro-Cid:2008:HIS %X This paper reports a comparison of several bloat control methods and also evaluates a new proposal for limiting the size of the individuals: a genetic operator called prune and plant. The aim of this work is to prove the adequacy of this new method. Since a preliminary study of the method has already shown promising results, we have performed a thorough study in a set of benchmark problems aiming at demonstrating the utility of the new approach. Prune and plant has obtained results that maintain the quality of the final solutions in terms of fitness while achieving a substantial reduction of the mean tree size in all four problem domains considered. In addition, in one of these problem domains prune and plant has demonstrated to be better in terms of fitness, size reduction and time consumption than any of the other bloat control techniques under comparison. %K genetic algorithms, genetic programming, bloat control method, genetic operator, prune and plant, time consumption, tree size reduction, mathematical operators, trees (mathematics) %R doi:10.1109/HIS.2008.127 %U http://dx.doi.org/doi:10.1109/HIS.2008.127 %P 31-35 %0 Book Section %T Strong Typing, Variable Reduction and Bloat Control for Solving the Bankruptcy Prediction Problem Using Genetic Programming %A Alfaro-Cid, Eva %A Cuesta-Canada, Alberto %A Sharman, Ken %A Esparcia-Alcazar, Anna %E Brabazon, Anthony %E O’Neill, Michael %B Natural Computing in Computational Finance %S Studies in Computational Intelligence %D 2008 %V 100 %I Springer %F series/sci/Alfaro-CidCSE08 %X In this chapter we present the application of a genetic programming (GP) algorithm to the problem of bankruptcy prediction. To carry out the research we have used a database that includes extensive information (not only economic) from the companies. In order to handle the different data types we have used Strongly Typed GP and variable reduction. Also, bloat control has been implemented to obtain comprehensible classification models. For comparison purposes we have solved the same problem using a support vector machine (SVM). GP has achieved very satisfactory results, improving those obtained with the SVM. %K genetic algorithms, genetic programming, STGP, SVM %R doi:10.1007/978-3-540-77477-8_9 %U http://dx.doi.org/doi:10.1007/978-3-540-77477-8_9 %P 161-185 %0 Conference Proceedings %T Modeling Pheromone Dispensers Using Genetic Programming %A Alfaro-Cid, Eva %A Esparcia-Alcázar, Anna I. %A Moya, Pilar %A Femenia-Ferrer, Beatriu %A Sharman, Ken %A Merelo, J. J. %Y Giacobini, Mario %Y Brabazon, Anthony %Y Cagnoni, Stefano %Y Caro, Gianni A. Di %Y Ekárt, Anikó %Y Esparcia-Alcázar, Anna %Y Farooq, Muddassar %Y Fink, Andreas %Y Machado, Penousal %Y McCormack, Jon %Y O’Neill, Michael %Y Neri, Ferrante %Y Preuss, Mike %Y Rothlauf, Franz %Y Tarantino, Ernesto %Y Yang, Shengxiang %S Applications of Evolutionary Computing, EvoWorkshops 2009: EvoCOMNET, EvoENVIRONMENT, EvoFIN, EvoGAMES, EvoHOT, EvoIASP, EvoINTERACTION, EvoMUSART, EvoNUM, EvoSTOC, EvoTRANSLOG %S Lecture Notes in Computer Science %D 2009 %8 apr 15 17 %V 5484 %I Springer %C Tubingen, Germany %F Alfaro-Cid:2009:evonum %X Mating disruption is an agricultural technique that intends to substitute the use of insecticides for pest control. This technique consists of the diffusion of large amounts of sexual pheromone, so that the males are confused and mating is disrupted. Pheromones are released using devices called dispensers. The speed of release is, generally, a function of time and atmospheric conditions such as temperature and humidity. One of the objectives in the design of the dispensers is to minimise the effect of atmospheric conditions in the performance of the dispenser. With this objective, the Centro de Ecologia Quimica Agricola (CEQA) has designed an experimental dispenser that aims to compete with the dispensers already in the market. The hypothesis we want to validate (and which is based on experimental results) is that the performance of the CEQA dispenser is independent of the atmospheric conditions, as opposed to the most widely used commercial dispenser, Isomate CPlus. This was done using a genetic programming (GP) algorithm. GP evolved functions able to describe the performance of both dispensers and that support the initial hypothesis. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-642-01129-0_73 %U http://dx.doi.org/doi:10.1007/978-3-642-01129-0_73 %P 635-644 %0 Conference Proceedings %T Multiobjective genetic programming approach for a smooth modeling of the release kinetics of a pheromone dispenser %A Alfaro-Cid, Eva %A Esparcia-Alcazar, Anna %A Moya, Pilar %A Merelo, J. J. %A Femenia-Ferrer, Beatriu %A Sharman, Ken %A Primo, Jaime %Y Esparcia, Anna I. %Y Chen, Ying-ping %Y Ochoa, Gabriela %Y Ozcan, Ender %Y Schoenauer, Marc %Y Auger, Anne %Y Beyer, Hans-Georg %Y Hansen, Nikolaus %Y Finck, Steffen %Y Ros, Raymond %Y Whitley, Darrell %Y Wilson, Garnett %Y Harding, Simon %Y Langdon, W. B. %Y Wong, Man Leung %Y Merkle, Laurence D. %Y Moore, Frank W. %Y Ficici, Sevan G. %Y Rand, William %Y Riolo, Rick %Y Kharma, Nawwaf %Y Buckley, William R. %Y Miller, Julian %Y Stanley, Kenneth %Y Bacardit, Jaume %Y Browne, Will %Y Drugowitsch, Jan %Y Beume, Nicola %Y Preuss, Mike %Y Smith, Stephen L. %Y Cagnoni, Stefano %Y DeLeo, Jim %Y Floares, Alexandru %Y Baughman, Aaron %Y Gustafson, Steven %Y Keijzer, Maarten %Y Kordon, Arthur %Y Congdon, Clare Bates %S GECCO-2009 Symbolic regression and modeling workshop (SRM) %D 2009 %8 August 12 jul %I ACM %C Montreal %F DBLP:conf/gecco/Alfaro-CidEMMFSP09 %X The accurate modeling of the release kinetics of pheromone dispensers is a matter or great importance for ensuring that the dispenser field-life covers the flight period of the pest and for optimizing the layout of dispensers in the treated area. A new experimental dispenser has been recently designed by researchers at the Instituto Agroforestal del Mediterraneo - Centro de Ecologia Quimica Agricola (CEQA) of the Universidad Politecnica de Valencia (Spain). The most challenging problem for the modeling of the release kinetics of this dispensers is the difficulty in obtaining experimental measurements for building the model. The procedure for obtaining these data is very costly, both time and money wise, therefore the available data across the whole season are scarce. In prior work we demonstrated the utility of using Genetic Programming (GP) for this particular problem. However, the models evolved by the GP algorithm tend to have discontinuities in those time ranges where there are not available measurements. In this work we propose the use of a multiobjective Genetic Programming for modeling the performance of the CEQA dispenser. We take two approaches, involving two and nine objectives respectively. In the first one, one of the objectives of the GP algorithm deals with how well the model fits the experimental data, while the second objective measures how ’smooth’ the model behaviour is. In the second approach we have as many objectives as data points and the aim is to predict each point separately using the remaining ones. The results obtained endorse the utility of this approach for those modeling problems characterized by the lack of experimental data. %K genetic algorithms, genetic programming %R doi:10.1145/1570256.1570309 %U http://dx.doi.org/doi:10.1145/1570256.1570309 %P 2225-2230 %0 Journal Article %T Bloat Control Operators and Diversity in Genetic Programming: A Comparative Study %A Alfaro-Cid, Eva %A Merelo, J. J. %A Fernandez de Vega, Francisco %A Esparcia-Alcazar, Anna I. %A Sharman, Ken %J Evolutionary Computation %D 2010 %8 Summer %V 18 %N 2 %@ 1063-6560 %F Alfaro-Cid:2010:EC %X This paper reports a comparison of several bloat control methods and also evaluates a recent proposal for limiting the size of the individuals: a genetic operator called prune and plant. The aim of this work is to test the adequacy of this method. Since a preliminary study of the method has already shown promising results, we have performed a thorough study in a set of benchmark problems aiming at demonstrating the utility of the new approach. Prune and plant has obtained results that maintain the quality of the final solutions in terms of fitness while achieving a substantial reduction of the mean tree size in all four problem domains considered. In addition, in one of these problem domains, prune and plant has demonstrated to be better in terms of fitness, size reduction, and time consumption than any of the other bloat control techniques under comparison. The experimental part of the study presents a comparison of performance in terms of phenotypic and genotypic diversity. This comparison study can provide the practitioner with some relevant clues as to which bloat control method is better suited to a particular problem and whether the advantage of a method does or does not derive from its influence on the genetic pool diversity. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1162/evco.2010.18.2.18206 %U http://dx.doi.org/doi:10.1162/evco.2010.18.2.18206 %P 305-332 %0 Journal Article %T Genetic programming and serial processing for time series classification %A Alfaro-Cid, Eva %A Sharman, Ken %A Esparcia-Alcazar, Anna I. %J Evolutionary Computation %D 2014 %8 Summer %V 22 %N 2 %@ 1063-6560 %F Alfaro-Cid:2014:EC %X This work describes an approach devised by the authors for time series classification. In our approach genetic programming is used in combination with a serial processing of data, where the last output is the result of the classification. The use of genetic programming for classification, although still a field where more research in needed, is not new. However, the application of genetic programming to classification tasks is normally done by considering the input data as a feature vector. That is, to the best of our knowledge, there are not examples in the genetic programming literature of approaches where the time series data are processed serially and the last output is considered as the classification result. The serial processing approach presented here fills a gap in the existing literature. This approach was tested in three different problems. Two of them are real world problems whose data were gathered for on-line or conference competitions. As there are published results of these two problems this gives us the chance of comparing the performance of our approach against top performing methods. The serial processing of data in combination with genetic programming obtained competitive results in both competitions, showing its potential for solving time series classification problems. The main advantage of our serial processing approach is that it can easily handle very large data sets. %K genetic algorithms, genetic programming, Classification, time series, serial data processing, real world applications %9 journal article %R doi:10.1162/EVCO_a_00110 %U http://dx.doi.org/doi:10.1162/EVCO_a_00110 %P 265-285 %0 Journal Article %T Book Review: Grammatical Evolution: Evolutionary Automatic Programming in an Arbitrary Language %A Alfonseca, Manuel %A Ortega, Alfonso %J Genetic Programming and Evolvable Machines %D 2004 %8 dec %V 5 %N 4 %@ 1389-2576 %F alfonseca:2004:GPEM %K genetic algorithms, genetic programming, grammatical evolution %9 journal article %R doi:10.1023/B:GENP.0000036057.27304.5b %U https://rdcu.be/dR8co %U http://dx.doi.org/doi:10.1023/B:GENP.0000036057.27304.5b %P 393 %0 Journal Article %T Evolving an ecology of mathematical expressions with grammatical evolution %A Alfonseca, Manuel %A Gil, Francisco Jose Soler %J Biosystems %D 2013 %V 111 %N 2 %F journals/biosystems/AlfonsecaG13 %K genetic algorithms, genetic programming, grammatical evolution %9 journal article %U http://dx.doi.org/10.1016/j.biosystems.2012.12.004 %P 111-119 %0 Journal Article %T Evolving a predator-prey ecosystem of mathematical expressions with grammatical evolution %A Alfonseca, Manuel %A Gil, Francisco Jose Soler %J Complexity %D 2015 %V 20 %N 3 %F journals/complexity/AlfonsecaG15 %K genetic algorithms, genetic programming, grammatical evolution %9 journal article %U http://dx.doi.org/10.1002/cplx.21507 %P 66-83 %0 Conference Proceedings %T Toward Human-Like Summaries Generated from Heterogeneous Software Artefacts %A Alghamdi, Mahfouth %A Treude, Christoph %A Wagner, Markus %Y Alexander, Brad %Y Haraldsson, Saemundur O. %Y Wagner, Markus %Y Woodward, John R. %S 7th edition of GI @ GECCO 2019 %D 2019 %8 jul 13 17 %I ACM %C Prague, Czech Republic %F Alghamdi:2019:GI7 %X Automatic text summarisation has drawn considerable interest in the field of software engineering. It can improve the efficiency of software developers, enhance the quality of products, and ensure timely delivery. In this paper, we present our initial work towards automatically generating human-like multi-document summaries from heterogeneous software artefacts. Our analysis of the text properties of 545 human-written summaries from 15 software engineering projects will ultimately guide heuristics searches in the automatic generation of human-like summaries. %K genetic algorithms, genetic programming, genetic improvement, SBSE, Heterogeneous software artefacts, extractive summarisation, human-like summaries %R doi:10.1145/3319619.3326814 %U https://arxiv.org/abs/1905.02258 %U http://dx.doi.org/doi:10.1145/3319619.3326814 %P 1701-1702 %0 Conference Proceedings %T Development of 2D curve-fitting genetic/gene-expression programming technique for efficient time-series financial forecasting %A Alghieth, Manal %A Yang, Yingjie %A Chiclana, Francisco %S 2015 International Symposium on Innovations in Intelligent SysTems and Applications (INISTA) %D 2015 %8 sep %F Alghieth:2015:INISTA %X Stock market prediction is of immense interest to trading companies and buyers due to high profit margins. Therefore, precise prediction of the measure of increase or decrease of stock prices also plays an important role in buying/selling activities. This research presents a specialised extension to the genetic algorithms (GA) known as the genetic programming (GP) and gene expression programming (GEP) to explore and investigate the outcome of the GEP criteria on the stock market price prediction. The research presented in this paper aims at the modelling and prediction of short-to-medium term stock value fluctuations in the market via genetically tuned stock market parameters. The technique uses hierarchically defined GP and GEP techniques to tune algebraic functions representing the fittest equation for stock market activities. The proposed methodology is evaluated against five well-known stock market companies with each having its own trading circumstances during the past 20+ years. The proposed GEP/GP methodologies were evaluated based on variable window/population sizes, selection methods, and Elitism, Rank and Roulette selection methods. The Elitism-based approach showed promising results with a low error-rate in the resultant pattern matching with an overall accuracy of 93.46percent for short-term 5-day and 92.105 for medium-term 56-day trading periods. %K genetic algorithms, genetic programming, gene expression programming %R doi:10.1109/INISTA.2015.7276734 %U http://dx.doi.org/doi:10.1109/INISTA.2015.7276734 %0 Conference Proceedings %T Development of a Genetic Programming-based GA Methodology for the Prediction of Short-to-Medium-term Stock Markets %A Alghieth, Manal %A Yang, Yingjie %A Chiclana, Francisco %Y Ong, Yew-Soon %S Proceedings of 2016 IEEE Congress on Evolutionary Computation (CEC 2016) %D 2016 %8 24 29 jul %I IEEE Press %C Vancouver %F Alghieth:2016:CEC %X This research presents a specialised extension to the genetic algorithms (GA) known as the genetic programming (GP) and gene expression programming (GEP) to explore and investigate the outcome of the GEP criteria on the stock market price prediction. The aim of this research is to model and predict short-to-medium term stock value fluctuations in the market via genetically tuned stock market parameters. The technology proposes a fractional adaptive mutation rate Elitism (GEPFAMR) technique to initiate a balance between varied mutation rates and between varied-fitness chromosomes, thereby improving prediction accuracy and fitness improvement rate. The methodology is evaluated against different dataset and selection methods and showed promising results with a low error-rate in the resultant pattern matching with an overall accuracy of 95.96percent for short-term 5-day and 95.35percent for medium-term 56-day trading periods. %K genetic algorithms, genetic programming, gene expressing programming, Stock market, Time series financial forecasting %R doi:10.1109/CEC.2016.7744083 %U https://www.dora.dmu.ac.uk/handle/2086/11896 %U http://dx.doi.org/doi:10.1109/CEC.2016.7744083 %P 2381-2388 %0 Journal Article %T Exergoeconomic analysis and optimization of a solar energy-based integrated system with oxy-combustion for combined power cycle and carbon capturing %A Al-Hamed, Khaled H. M. %A Dincer, Ibrahim %J Energy %D 2022 %V 250 %@ 0360-5442 %F ALHAMED:2022:energy %X This work presents a newly developed integrated system that produces multiple useful products, namely electricity, space cooling, freshwater, and ammonium bicarbonate. The two sources of energy for this integrated system are solar energy and natural gas. The natural gas is consumed in an oxy-combustion Brayton cycle to produce electricity, while the solar energy provides electric power to the carbon capturing unit to produce ammonium bicarbonate as a valuable chemical product to compensate for the operation costs of carbon capture. This integrated system is studied using the exergoeconomic analysis and the multi-objective optimization method of genetic programming and genetic algorithm to enhance the thermodynamic and economic aspects of this system. Applying such an analysis to this integrated system adds more understanding and knowledge on how effectively and efficiently this carbon capture system operates and whether or not it is financially viable to pursue this integrated system for further prototyping and concept demonstration. The results of this exergoeconomic analysis show that the production cost of ammonium bicarbonate per 1 kg in this integrated system is 0.0687 $ kg-1, and this is much lower than the market price. This means that producing ammonium bicarbonate as a way to capture carbon dioxide is feasible financially. Furthermore, the optimization results show that the overall exergy destruction rate and the overall unit cost of products are 86,000 kW and 5.19 times 10-3 $ kJ-1, respectively, when operated under optimum conditions %K genetic algorithms, genetic programming, Ammonia, Carbon capture, Energy, Exergoeconomic analysis, Gas turbine, Optimization %9 journal article %R doi:10.1016/j.energy.2022.123814 %U https://www.sciencedirect.com/science/article/pii/S0360544222007174 %U http://dx.doi.org/doi:10.1016/j.energy.2022.123814 %P 123814 %0 Conference Proceedings %T Evolving diverse Ms. Pac-Man playing agents using genetic programming %A Alhejali, Atif M. %A Lucas, Simon M. %S UK Workshop on Computational Intelligence (UKCI 2010) %D 2010 %8 August 10 sep %F Alhejali:2010:UKCI %X This paper uses genetic programming (GP) to evolve a variety of reactive agents for a simulated version of the classic arcade game Ms. Pac-Man. A diverse set of behaviours were evolved using the same GP setup in three different versions of the game. The results show that GP is able to evolve controllers that are well-matched to the game used for evolution and, in some cases, also generalise well to previously unseen mazes. For comparison purposes, we also designed a controller manually using the same function set as GP. GP was able to significantly outperform this hand-designed controller. The best evolved controllers are competitive with the best reactive controllers reported for this problem. %K genetic algorithms, genetic programming, Ms PacMan game, reactive agents, computer games, learning (artificial intelligence), software agents %R doi:10.1109/UKCI.2010.5625586 %U http://dx.doi.org/doi:10.1109/UKCI.2010.5625586 %P 1-6 %0 Conference Proceedings %T Using a Training Camp with Genetic Programming to Evolve Ms Pac-Man Agents %A Alhejali, Atif M. %A Lucas, Simon M. %S Proceedings of the 2011 IEEE Conference on Computational Intelligence and Games %D 2011 %8 31 aug 3 sep %I IEEE %C Seoul, South Korea %F Alhejali:2011:CIG %X This paper investigates using a training camp in conjunction with Genetic Programming in the evolution of Ms Pac-Man playing agents. We measure the amount of effort, time and resources required to run the training camp successfully. The approach is compared with standard GP. The results indicate that better and more stable performance can be achieved using the training camp method at the expense of greater manual effort in the design of the training scenarios. However, in addition to the better results, the training camp also provides more detailed insight into the strengths and weaknesses of each controller. %K genetic algorithms, genetic programming, Pac-Man, Evolving Controllers, Decomposition learning, Training camp %R doi:10.1109/CIG.2011.6031997 %U http://cilab.sejong.ac.kr/cig2011/proceedings/CIG2011/papers/paper31.pdf %U http://dx.doi.org/doi:10.1109/CIG.2011.6031997 %P 118-125 %0 Conference Proceedings %T Using genetic programming to evolve heuristics for a Monte Carlo Tree Search Ms Pac-Man agent %A Alhejali, Atif M. %A Lucas, Simon M. %S IEEE Conference on Computational Intelligence in Games (CIG 2013) %D 2013 %8 November 13 aug %C Niagara Falls, Canada %F Alhejali:2013:CIG %X Ms Pac-Man is one of the most challenging test beds in game artificial intelligence (AI). Genetic programming and Monte Carlo Tree Search (MCTS) have already been successful applied to several games including Pac-Man. In this paper, we use Monte Carlo Tree Search to create a Ms Pac-Man playing agent before using genetic programming to enhance its performance by evolving a new default policy to replace the random agent used in the simulations. The new agent with the evolved default policy was able to achieve an 18percent increase on its average score over the agent with random default policy. %K genetic algorithms, genetic programming, Monte Carlo methods, artificial intelligence, computer games, tree searching, Al, MCTS, Monte Carlo tree search Ms Pac-Man agent, evolved default policy, game artificial intelligence, random agent, random default policy, Equations, Games, Mathematical model, Monte Carlo methods, Sociology, Monte Carlo Tree Search, Pac-Man %R doi:10.1109/CIG.2013.6633639 %U http://dx.doi.org/doi:10.1109/CIG.2013.6633639 %0 Thesis %T Genetic Programming and the Evolution of Games Playing Agents %A Alhejali, Atif Mansour %D 2013 %C UK %C Computing and Electronic Systems, University of Essex %F Alhejali:thesis %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://www.essex.ac.uk/csee/news_and_seminars/newsEvent.aspx?e_id=5796 %0 Journal Article %T Evolutionary Algorithms and Theirs Use in the Design of Sequential Logic Circuits %A Ali, B. %A Almaini, A. E. A. %A Kalganova, T. %J Genetic Programming and Evolvable Machines %D 2004 %8 mar %V 5 %N 1 %@ 1389-2576 %F ali:2004:GPEM %X design synchronous sequential logic circuits with minimum number of logic gates is suggested. The proposed method consists of four main stages. The first stage is concerned with the use of genetic algorithms (GA) for the state assignment problem to compute optimal binary codes for each symbolic state and construct the state transition table of finite state machine (FSM). The second stage defines the subcircuits required to achieve the desired functionality. The third stage evaluates the subcircuits using extrinsic Evolvable Hardware (EHW). During the fourth stage, the final circuit is assembled. The obtained results compare favourably against those produced by manual methods and other methods based on heuristic techniques. %K genetic algorithms, evolvable hardware, sequential circuits, state assignment %9 journal article %R doi:10.1023/B:GENP.0000017009.11392.e2 %U http://dx.doi.org/doi:10.1023/B:GENP.0000017009.11392.e2 %0 Book Section %T Genetic Programming for Incentive-Based Design within a Cultural Algorithms Framework %A Ali, Mostafa Z. %A Reynolds, Robert G. %A Che, Xiangdong %E Riolo, Rick L. %E Soule, Terence %E Worzel, Bill %B Genetic Programming Theory and Practice VI %S Genetic and Evolutionary Computation %D 2008 %8 15 17 may %I Springer %C Ann Arbor %F Ali:2008:GPTP %K genetic algorithms, genetic programming %R doi:10.1007/978-0-387-87623-8_16 %U http://dx.doi.org/doi:10.1007/978-0-387-87623-8_16 %P 249-269 %0 Journal Article %T Difficult first strategy GP: an inexpensive sampling technique to improve the performance of genetic programming %A Ali, Muhammad Quamber %A Majeed, Hammad %J Evol. Intell. %D 2020 %V 13 %N 4 %F DBLP:journals/evi/AliM20 %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s12065-020-00355-2 %U https://doi.org/10.1007/s12065-020-00355-2 %U http://dx.doi.org/doi:10.1007/s12065-020-00355-2 %P 537-549 %0 Journal Article %T Multi-objective Lyapunov-based controller design for nonlinear systems via genetic programming %A Ali, Mir Masoud Ale %A Jamali, Ali %A Asgharnia, A. %A Ansari, R. %A Mallipeddi, Rammohan %J Neural Computing and Applications %D 2022 %8 jan %V 34 %N 2 %@ 0941-0643 %F DBLP:journals/nca/AliJAAM22 %X In system control, stability is considered the most important factor as unstable system is impractical or dangerous to use. Lyapunov direct method, one of the most useful tools in the stability analysis of nonlinear systems, enables the design of a controller by determining the region of attraction (ROA). However, the two main challenges posed are (1) it is hard to determine the scalar function referred to as Lyapunov function, and (2) the optimality of the designed controller is generally questionable. In this paper, multi-objective genetic programming (MOGP)-based framework is proposed to obtain both optimal Lyapunov and control functions at the same time. In other words, MOGP framework is employed to minimize several time-domain performances as well as the ROA radius to find the optimal Lyapunov and control functions. The proposed framework is tested in several nonlinear benchmark systems, and the control performance is compared with state-of-the-art algorithms. %K genetic algorithms, genetic programming, Lyapunov function, Stability, Region of attraction, Pareto %9 journal article %R doi:10.1007/s00521-021-06453-1 %U https://rdcu.be/dl3Cd %U http://dx.doi.org/doi:10.1007/s00521-021-06453-1 %P 1345-1357 %0 Journal Article %T Cotton yield prediction with Markov Chain Monte Carlo-based simulation model integrated with genetic programing algorithm: A new hybrid copula-driven approach %A Ali, Mumtaz %A Deo, Ravinesh C. %A Downs, Nathan J. %A Maraseni, Tek %J Agricultural and Forest Meteorology %D 2018 %V 263 %@ 0168-1923 %F ALI:2018:AFM %X Reliable data-driven models designed to accurately estimate cotton yield, an important agricultural commodity, can be adopted by farmers, agricultural system modelling experts and agricultural policy-makers in strategic decision-making processes. In this paper a hybrid genetic programing model integrated with the Markov Chain Monte Carlo (MCMC) based Copula technique is developed to incorporate climate-based inputs as the predictors of cotton yield, for selected study regions: Faisalabad %K genetic algorithms, genetic programming, Crop yield prediction, Cotton yield, Climate data, Markov Chain Monte Carlo based copula model %9 journal article %R doi:10.1016/j.agrformet.2018.09.002 %U http://www.sciencedirect.com/science/article/pii/S0168192318302971 %U http://dx.doi.org/doi:10.1016/j.agrformet.2018.09.002 %P 428-448 %0 Book Section %T Chapter 2 - Modeling wheat yield with data-intelligent algorithms: artificial neural network versus genetic programming and minimax probability machine regression %A Ali, Mumtaz %A Deo, Ravinesh C. %E Samui, Pijush %E Tien Bui, Dieu %E Chakraborty, Subrata %E Deo, Ravinesh C. %B Handbook of Probabilistic Models %D 2020 %I Butterworth-Heinemann %F ALI:2020:HPM %X In precision agriculture, data-intelligent algorithms applied for predicting wheat yield can generate crucial information about enhancing crop production and strategic decision-making. In this chapter, artificial neural network (ANN) model is trained with three neighboring station-based wheat yields to predict the yield for two nearby objective stations that share a common geographic boundary in the agricultural belt of Pakistan. A total of 2700 ANN models (with a combination of hidden neurons, training algorithm, and hidden transfer/output functions) are developed by trial-and-error method, attaining the lowest mean square error, in which the 90 best-ranked models for 3-layered neuronal network are used for wheat prediction. Models such as learning algorithms comprised of pure linear, tangent, and logarithmic sigmoid equations in hidden transfer/output functions, executed by Levenberg-Marquardt, scaled conjugate gradient, conjugate gradient with Powell-Beale restarts, Broyden-Fletcher-Goldfarb-Shanno quasi-Newton, Fletcher-Reeves update, one-step secant, conjugate gradient with Polak-Ribiere updates, gradient descent with adaptive learning, gradient descent with momentum, and gradient descent with momentum adaptive learning method are trained. For the predicted wheat yield at objective station 1 (i.e., Toba Taik Singh), the optimal architecture was 3-14-1 (input-hidden-output neurons) trained with the Levenberg-Marquardt algorithm and logarithmic sigmoid as activation and tangent sigmoid as output function, while at objective station 2 (i.e., Bakkar), the Levenberg-Marquardt algorithm provided the best architecture (3-20-1) with pure liner as activation and tangent sigmoid as output function. The results are benchmarked with those from minimax probability machine regression (MPMR) and genetic programming (GP) in accordance with statistical analysis of predicted yield based on correlations (r), Willmott’s index (WI), Nash-Sutcliffe coefficient (EV), root mean-squared error (RMSE), and mean absolute error (MAE). For objective station 1, the ANN model attained the r value of approximately 0.983, with WIapprox0.984 and EVapprox0.962, while the MPMR model attained rapprox0.957, WIapprox0.544, and EVapprox0.527, with the results attained by GP model, rapprox0.982, WIapprox0.980, and EVapprox0.955. For optimal ANN model, a relatively low value of RMSE approx 192.02kg/ha and MAE approx 162.75kg/ha was registered compared with the MPMR (RMSE approx 614.46kg/ha; MAE approx 431.29kg/ha) and GP model (RMSE approx 209.25kg/ha; MAE approx 182.84kg/ha). For both objective stations, ANN was found to be superior, as confirmed by a larger Legates-McCabe’s (LM) index used in conjunction with relative RMSE and MAE. Accordingly, it is averred that ANN is considered as a useful data-intelligent contrivance for predicting wheat yield by using nearest neighbor yield %K genetic algorithms, genetic programming, Agricultural precision, Artificial neural network, Minimax probability machine regression, Wheat yield model %R doi:10.1016/B978-0-12-816514-0.00002-3 %U http://www.sciencedirect.com/science/article/pii/B9780128165140000023 %U http://dx.doi.org/doi:10.1016/B978-0-12-816514-0.00002-3 %P 37-87 %0 Journal Article %T Can-Evo-Ens: Classifier stacking based evolutionary ensemble system for prediction of human breast cancer using amino acid sequences %A Ali, Safdar %A Majid, Abdul %J Journal of Biomedical Informatics %D 2015 %8 apr %V 54 %@ 1532-0464 %F Ali:2015:JBI %X The diagnostic of human breast cancer is an intricate process and specific indicators may produce negative results. In order to avoid misleading results, accurate and reliable diagnostic system for breast cancer is indispensable. Recently, several interesting machine-learning (ML) approaches are proposed for prediction of breast cancer. To this end, we developed a novel classifier stacking based evolutionary ensemble system Can-Evo-Ens for predicting amino acid sequences associated with breast cancer. In this paper, first, we selected four diverse-type of ML algorithms of Naive Bayes, K-Nearest Neighbour, Support Vector Machines, and Random Forest as base-level classifiers. These classifiers are trained individually in different feature spaces using physicochemical properties of amino acids. In order to exploit the decision spaces, the preliminary predictions of base-level classifiers are stacked. Genetic programming (GP) is then employed to develop a meta-classifier that optimal combine the predictions of the base classifiers. The most suitable threshold value of the best-evolved predictor is computed using Particle Swarm Optimisation technique. Our experiments have demonstrated the robustness of Can-Evo-Ens system for independent validation dataset. The proposed system has achieved the highest value of Area Under Curve (AUC) of ROC Curve of 99.95percent for cancer prediction. The comparative results revealed that proposed approach is better than individual ML approaches and conventional ensemble approaches of AdaBoostM1, Bagging, GentleBoost, and Random Subspace. It is expected that the proposed novel system would have a major impact on the fields of Biomedical, Genomics, Proteomics, Bioinformatics, and Drug Development. %K genetic algorithms, genetic programming, Breast cancer, Amino acids, Physicochemical properties, Stacking ensemble %9 journal article %R doi:10.1016/j.jbi.2015.01.004 %U http://www.sciencedirect.com/science/article/pii/S1532046415000064 %U http://dx.doi.org/doi:10.1016/j.jbi.2015.01.004 %P 256-269 %0 Thesis %T Intelligent Decision Making Ensemble Classification System for Breast Cancer Prediction %A Ali, Safdar %D 2015 %8 27 jul %C Nilore, Islamabad, Pakistan %C Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences %F Ali:thesis %X Breast cancer is a complex and heterogeneous disease which seriously impacts women’s health. The diagnostic of breast cancer is an intricate process. Therefore, an accurate and reliable prediction system for breast cancer is indispensable to avoid misleading results.In this regard, improved decision support systems are essential for breast cancer prediction. Consequently, this thesis focuses on the development of intelligent decision making systems using ensemble classification for the early prediction of breast cancer. Proteins of a breast tissue generally reflect the initial changes caused by successive genetic mutations, which may lead to cancer. In this research, such changes in protein sequences are exploited for the early diagnosis of breast cancer. It is found that substantial variation of Proline, Serine, Tyrosine, Cysteine, Arginine, and Asparagine amino acid molecules in cancerous proteins offer high discrimination for cancer diagnostic. Molecular descriptors derived from physicochemical properties of amino acids are used to transform primary protein sequences into feature spaces of amino acid composition (AAC), split amino acid composition (SAAC), pseudo amino acid composition-series (PseAAC-S), and pseudo amino acid composition-parallel (PseAAC-P). The research work in this thesis is divided in two phases. In the first phase, the basic framework is established to handle imbalanced dataset in order to enhance true prediction performance. In this phase, conventional individual learning algorithms are employed to develop different prediction systems. Firstly, in conjunction with oversampling based Mega-Trend-Diffusion (MTD) technique, individual prediction systems are developed. Secondly, homogeneous ensemble systems CanPro-IDSS and Can-CSCGnB are developed using MTD and cost-sensitive classifier (CSC) techniques, respectively. It is found that assimilation of MTD technique for the CanPro-IDSS system is superior than CSC based technique to handle imbalanced dataset of protein sequences. In this connection, a web based CanPro-IDSS cancer prediction system is also developed. Lastly, a novel heterogeneous ensemble system called IDMS-HBC is developed for breast cancer detection. The second phase of this research focuses on the exploitation of variation of amino acid molecules in cancerous protein sequences using physicochemical properties. In this phase, unlike traditional ensemble prediction approaches, the proposed IDM-PhyChm-Ens ensemble system is developed by combining the decision spaces of a specific classifier trained on different feature spaces. This intelligent ensemble system is constructed using diverse learning algorithms of Random Forest(RF), Support Vector Machines, K-Nearest Neighbor, and Naive Bayes (NB). It is observed that the combined spaces of SAAC+PseAAC-S and AAC+SAAC possess the best discrimination using ensemble-RF and ensemble-NB. Lastly, a novel classifier stacking based evolutionary ensemble system Can-Evo-Ens is also developed, whereby Genetic programming is used as the ensemble method. This study revealed that PseAAC-S feature space carries better discrimination power compared to AAC, SAAC, and PseAAC-P based feature extraction strategies. Intensive experiments are performed to evaluate the performance of the proposed intelligent decision making systems for cancer/non-cancer and breast/non-breast cancer datasets. The proposed approaches have demonstrated improvement over previous state-of-the-art approaches. The proposed systems maybe useful for academia, practitioners, and clinicians for the early diagnosis of breast cancer using protein sequences. Finally, it is expected that the findings of this research would have positive impact on diagnosis, prevention, treatment, and management of cancer %K genetic algorithms, genetic programming, Can-Evo-Ens %9 Ph.D. thesis %U http://faculty.pieas.edu.pk/abdulmajid/ %0 Journal Article %T A Systematic Review of the Application and Empirical Investigation of Search-Based Test-Case Generation %A Ali, Shaukat %A Briand, Lionel C. %A Hemmati, Hadi %A Panesar-Walawege, Rajwinder K. %J IEEE Transactions on Software Engineering %D 2010 %8 nov dec %V 36 %N 6 %@ 0098-5589 %F Ali:2010:ieeeTSE %X Metaheuristic search techniques have been extensively used to automate the process of generating test cases and thus providing solutions for a more cost-effective testing process. This approach to test automation, often coined as Search-based Software Testing (SBST), has been used for a wide variety of test case generation purposes. Since SBST techniques are heuristic by nature, they must be empirically investigated in terms of how costly and effective they are at reaching their test objectives and whether they scale up to realistic development artifacts. However, approaches to empirically study SBST techniques have shown wide variation in the literature. This paper presents the results of a systematic, comprehensive review that aims at characterising how empirical studies have been designed to investigate SBST cost-effectiveness and what empirical evidence is available in the literature regarding SBST cost-effectiveness and scalability. We also provide a framework that drives the data collection process of this systematic review and can be the starting point of guidelines on how SBST techniques can be empirically assessed. The intent is to aid future researchers doing empirical studies in SBST by providing an unbiased view of the body of empirical evidence and by guiding them in performing well designed empirical studies. %K genetic algorithms, genetic programming, SBSE %9 journal article %R doi:10.1109/TSE.2009.52 %U http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5210118&isnumber=4359463 %U http://dx.doi.org/doi:10.1109/TSE.2009.52 %P 742-762 %0 Conference Proceedings %T Critical analysis of swarm intelligence based routing protocols in adhoc and sensor wireless networks %A Ali, Zulfiqar %A Shahzad, Waseem %S International Conference on Computer Networks and Information Technology (ICCNIT 2011) %D 2011 %8 November 13 jul %C Abbottabad %F Ali:2011:ICCNIT %X There are various bio inspired and evolutionary approaches including genetic programming (GP), Neural Network, Evolutionary programming (EP), Particle Swarm Optimisation (PSO) and Ant Colony Optimisation (ACO) used for the routing protocols in ad hoc and sensor wireless networks. There are constraints involved in these protocols due to the mobility and non infrastructure nature of an ad hoc and sensor networks. We study in this research work a probabilistic performance evaluation frameworks and Swarm Intelligence approaches (PSO, ACO) for routing protocols. The performance evaluation metrics employed for wireless and ad hoc routing algorithms is routing overhead, route optimality, and energy consumption. This survey gives critical analysis of PSO and ACO based algorithms with other approaches applied for the optimisation of an ad hoc and wireless sensor network routing protocols. %K genetic algorithms, ACO, EP, PSO, adhoc network, ant colony optimisation, bioinspired approach, critical analysis, energy consumption, evolutionary approach, evolutionary programming, mobility nature, neural network, particle swarm optimisation, probabilistic performance evaluation framework, route optimality, routing overhead, routing protocol, swarm intelligence, wireless sensor network, evolutionary computation, mobile ad hoc networks, mobility management (mobile radio), particle swarm optimisation, performance evaluation, routing protocols, wireless sensor networks %R doi:10.1109/ICCNIT.2011.6020945 %U http://dx.doi.org/doi:10.1109/ICCNIT.2011.6020945 %P 287-292 %0 Conference Proceedings %T Miner for OACCR: Case of medical data analysis in knowledge discovery %A Ali, Samaher Hussein %S 6th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT 2012) %D 2012 %8 21 24 mar %C Sousse, Tunisia %F Ali:2012:SETIT %X Modern scientific data consist of huge datasets which gathered by a very large number of techniques and stored in much diversified and often incompatible data repositories as data of bioinformatics, geoinformatics, astroinformatics and Scientific World Wide Web. At the other hand, lack of reference data is very often responsible for poor performance of learning where one of the key problems in supervised learning is due to the insufficient size of the training dataset. Therefore, we try to suggest a new development a theoretically and practically valid tool for analysing small of sample data remains a critical and challenging issue for researches. This paper presents a methodology for Obtaining Accurate and Comprehensible Classification Rules (OACCR) of both small and huge datasets with the use of hybrid techniques represented by knowledge discovering. In this article the searching capability of a Genetic Programming Data Construction Method (GPDCM) has been exploited for automatically creating more visual samples from the original small dataset. Add to that, this paper attempts to developing Random Forest data mining algorithm to handle missing value problem. Then database which describes depending on their components were built by Principle Component Analysis (PCA), after that, association rule algorithm to the FP-Growth algorithm (FP-Tree) was used. At the last, TreeNet classifier determines the class under which each association rules belongs to was used. The proposed methodology provides fast, Accurate and comprehensible classification rules. Also, this methodology can be use to compression dataset in two dimensions (number of features, number of records). %K genetic algorithms, genetic programming, data mining, medical administrative data processing, OACCR, TreeNet classifier, astroinformatics, bioinformatics, data mining algorithm, datasets, genetic programming data construction method, geoinformatics, hybrid techniques, knowledge discovery, medical data analysis, obtaining accurate and comprehensible classification rules, principle component analysis, scientific World Wide Web, Algorithm design and analysis, Classification algorithms, Clustering algorithms, Data mining, Databases, Training, Vegetation, Adboosting, FP-Growth, GPDCM, PCA, Random Forest %R doi:10.1109/SETIT.2012.6482043 %U http://dx.doi.org/doi:10.1109/SETIT.2012.6482043 %P 962-975 %0 Conference Proceedings %T AutoGE: A Tool for Estimation of Grammatical Evolution Models %A Ali, Muhammad Sarmad %A Kshirsagar, Meghana %A Naredo, Enrique %A Ryan, Conor %Y Rocha, Ana Paula %Y Steels, Luc %Y van den Herik, H. Jaap %S Proceedings of the 13th International Conference on Agents and Artificial Intelligence, ICAART 2021 %D 2021 %8 feb 4 6 %V 2 %I SCITEPRESS %C Online %F conf/icaart/Ali0NR21 %K genetic algorithms, genetic programming, grammatical evolution %R doi:10.5220/0010393012741281 %U http://dx.doi.org/doi:10.5220/0010393012741281 %P 1274-1281 %0 Conference Proceedings %T Towards Automatic Grammatical Evolution for Real-world Symbolic Regression %A Ali, Muhammad Sarmad %A Kshirsagar, Meghana %A Naredo, Enrique %A Ryan, Conor %Y Baeck, Thomas %Y Wagner, Christian %Y Garibaldi, Jonathan M. %Y Lam, H. K. %Y Cottrell, Marie %Y Merelo, Juan Julian %Y Warwick, Kevin %S Proceedings of the 13th International Joint Conference on Computational Intelligence, IJCCI %D 2021 %8 oct 25 27 %I SCITEPRESS %C Online %F conf/ijcci/Ali0NR21 %X AutoGE (Automatic Grammatical Evolution) is a tool designed to aid users of GE for the automatic estimation of Grammatical Evolution (GE) parameters, a key one being the grammar. The tool comprises of a rich suite of algorithms to assist in fine tuning a BNF (Backus-Naur Form) grammar to make it adaptable across a wide range of problems. It primarily facilitates the identification of better grammar structures and the choice of function sets to enhance existing fitness scores at a lower computational overhead. we discuss and report experimental results for our Production Rule Pruning algorithm from AutoGE which employs a simple frequency-based approach for eliminating less useful productions. It captures the relationship between production rules and function sets involved in the problem domain to identify better grammar. The experimental study incorporates an extended function set and common grammar structures for grammar definition. Preliminary results based on ten popular real-world regression datasets demonstrate that the proposed algorithm not only identifies suitable grammar structures, but also prunes the grammar which results in shorter genome length for every problem, thus optimising memory usage. Despite using a fraction of budget in pruning, AutoGE was able to significantly enhance test scores for 3 problems. %K genetic algorithms, genetic programming, grammatical evolution, grammar pruning, effective genome length %R doi:10.5220/0010691500003063 %U http://dx.doi.org/doi:10.5220/0010691500003063 %P 68-78 %0 Conference Proceedings %T Automated Grammar-based Feature Selection in Symbolic Regression %A Ali, Muhammad Sarmad %A Kshirsagar, Meghana %A Naredo, Enrique %A Ryan, Conor %Y Rahat, Alma %Y Fieldsend, Jonathan %Y Wagner, Markus %Y Tari, Sara %Y Pillay, Nelishia %Y Moser, Irene %Y Aleti, Aldeida %Y Zamuda, Ales %Y Kheiri, Ahmed %Y Hemberg, Erik %Y Cleghorn, Christopher %Y Sun, Chao-li %Y Yannakakis, Georgios %Y Bredeche, Nicolas %Y Ochoa, Gabriela %Y Derbel, Bilel %Y Pappa, Gisele L. %Y Risi, Sebastian %Y Jourdan, Laetitia %Y Sato, Hiroyuki %Y Posik, Petr %Y Shir, Ofer %Y Tinos, Renato %Y Woodward, John %Y Heywood, Malcolm %Y Wanner, Elizabeth %Y Trujillo, Leonardo %Y Jakobovic, Domagoj %Y Miikkulainen, Risto %Y Xue, Bing %Y Neumann, Aneta %Y Allmendinger, Richard %Y Medina-Bulo, Inmaculada %Y Bechikh, Slim %Y Sutton, Andrew M. %Y Oliveto, Pietro Simone %S Proceedings of the 2022 Genetic and Evolutionary Computation Conference %S GECCO ’22 %D 2022 %8 September 13 jul %I Association for Computing Machinery %C Boston, USA %F ali:2022:GECCO %X With the growing popularity of machine learning (ML), regression problems in many domains are becoming increasingly high-dimensional. Identifying relevant features from a high-dimensional dataset still remains a significant challenge for building highly accurate machine learning models.Evolutionary feature selection has been used for high-dimensional symbolic regression using Genetic Programming (GP). While grammar based GP, especially Grammatical Evolution (GE), has been extensively used for symbolic regression, no systematic grammar-based feature selection approach exists. This work presents a grammar-based feature selection method, Production Ranking based Feature Selection (PRFS), and reports on the results of its application in symbolic regression.The main contribution of our work is to demonstrate that the proposed method can not only consistently select the most relevant features, but also significantly improves the generalization performance of GE when compared with several state-of-the-art ML-based feature selection methods. Experimental results on benchmark symbolic regression problems show that the generalization performance of GE using PRFS was significantly better than that of a state-of-the-art Random Forest based feature selection in three out of four problems, while in fourth problem the performance was the same. %K genetic algorithms, genetic programming, feature selection, symbolic regression, production ranking, grammatical evolution, grammar pruning %R doi:10.1145/3512290.3528852 %U http://dx.doi.org/doi:10.1145/3512290.3528852 %P 902-910 %0 Conference Proceedings %T Optimization of the Number and Placement of Routers in Wireless Mesh Networks %A Ali, Mohammed Sadeq Ali %A Cevik, Mesut %S 2022 International Conference on Artificial Intelligence of Things (ICAIoT) %D 2022 %8 dec %F Ali:2022:ICAIoT %X Wireless Mesh Networks (WMNs) are a new type of wireless network that has been growing in popularity. These networks consist of routers and clients. The routers are called mesh routers (MRs) and the clients are called mesh clients. WMNs have several advantages over traditional wireless networks, such as more reliable coverage and faster speeds. Many different types of algorithms can be used to determine the best placement for these routers, with some algorithms being better than others depending on the environment or situation. One algorithm is called a Genetic Algorithm (GA), which uses genetic programming to find an optimal solution for router placement. GA is used to find the best placement of the router so that it can provide the most coverage possible for a specific area GA or evolutionary algorithms are based on a biological theory known as Darwin’s theory. In evolutionary algorithms, it is since the information of the problem becomes chromosomes, and then the problem is solved by special problem-solving techniques in the evolutionary algorithm. The suggested method was implemented using the C++ programming environment and the NS2 software suite. Using a benchmark of produced instances, the experimental outcomes have been analysed. Variable sets of produced instances ranging in size from small to big have been explored. Consequently, several properties of WMNs, including the topological placement of mesh clients, have been recorded. %K genetic algorithms, genetic programming, Wireless networks, Wireless mesh networks, Evolutionary computation, Software, Reliability, Problem-solving, Internet of Things, IOT, routers, wireless network, WMNs %R doi:10.1109/ICAIoT57170.2022.10121861 %U http://dx.doi.org/doi:10.1109/ICAIoT57170.2022.10121861 %0 Journal Article %T Artificial intelligent techniques for prediction of rock strength and deformation properties - A review %A Ali, Mujahid %A Hin Lai, Sai %J Structures %D 2023 %V 55 %@ 2352-0124 %F ALI:2023:istruc %X In rock design projects, a number of mechanical properties are frequently employed, particularly unconfined compressive strength (UCS) and deformation (E). The researchers attempt to conduct an indirect investigation since direct measurement of UCS and E is time-consuming, expensive, and requires more expertise and methodologies. Recent and past studies investigate the UCS and E from rock index tests mainly P-wave velocity (Vp), slake durability index, Density, Shore hardness, Schmidt hammer Rebound number (Rn), unit weight, porosity (e) point load strength (Is(50)), and block punch strength index test as its economical and easy to use. The evaluation of these properties is the essential input into modern design methods that routinely adopt some form of numerical modeling, such as machine learning (ML), Artificial Neural Networking (ANN), finite element modeling (FEM), and finite difference methods. Besides, several researchers evaluate the correlation between the input parameters using statistical analysis tools before using them for intelligent techniques. The current study compared the results of laboratory tests, statistical analysis, and intelligent techniques for UCS and E estimation including ANN and adaptive neuro-fuzzy inference system (ANFIS), Genetic Programming (GP), Genetic Expression Programming (GEP), and hybrid models. Following the execution of the relevant models, numerous performance indicators, such as root mean squared error, coefficient of determination (R2), variance account for, and overall ranking, are reviewed to choose the best model and compare the acquired results. Based on the current review, it is concluded that the same rock types from different countries show different mechanical properties due to weathering, size, texture, mineral composition, and temperature. For instance, in the UCS of strong rock (granite) in Spain, ranges from 24 MPa to 278 MPa, whereas in Malaysian rocks, it shows 39 MPa to 212 MPa. On the other side, the coefficient of determination (R2) correlation for the UCS also varies from country to country; while using different modern techniques, the R2 values improved. Finally, recommendations on material properties and modern techniques have been suggested %K genetic algorithms, genetic programming, Deformation, Unconfined Compressive Strength (UCS), Intelligent techniques, ANN, Statistical analysis %9 journal article %R doi:10.1016/j.istruc.2023.06.131 %U https://www.sciencedirect.com/science/article/pii/S2352012423008901 %U http://dx.doi.org/doi:10.1016/j.istruc.2023.06.131 %P 1542-1555 %0 Conference Proceedings %T Symbolic method for deriving policy in reinforcement learning %A Alibekov, Eduard %A Kubalik, Jiri %A Babuska, Robert %S 2016 IEEE 55th Conference on Decision and Control (CDC) %D 2016 %8 dec %F Alibekov:2016:CDC %X This paper addresses the problem of deriving a policy from the value function in the context of reinforcement learning in continuous state and input spaces. We propose a novel method based on genetic programming to construct a symbolic function, which serves as a proxy to the value function and from which a continuous policy is derived. The symbolic proxy function is constructed such that it maximizes the number of correct choices of the control input for a set of selected states. Maximization methods can then be used to derive a control policy that performs better than the policy derived from the original approximate value function. The method was experimentally evaluated on two control problems with continuous spaces, pendulum swing-up and magnetic manipulation, and compared to a standard policy derivation method using the value function approximation. The results show that the proposed method and its variants outperform the standard method. %K genetic algorithms, genetic programming %R doi:10.1109/CDC.2016.7798684 %U http://dx.doi.org/doi:10.1109/CDC.2016.7798684 %P 2789-2795 %0 Thesis %T Symbolic Regression for Reinforcement Learning in Continuous Spaces %A Alibekov, Eduard %D 2021 %8 aug %C Czech Republic %C F3 Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University in Prague %F Alibekov:thesis %X Reinforcement Learning (RL) algorithms can optimally solve dynamic decision and control problems in engineering, economics, medicine, artificial intelligence, and other disciplines.However, state-of-the-art RL methods still have not solved the transition from a small set of discrete states to fully continuous spaces. They have to rely on numerical function approximators, such as radial basis functions or neural networks, to represent the value function or policy mappings. While these numerical approximators are well-developed, the choice of a suitable architecture is a difficult step that requires significant trial-and-error tuning. Moreover, numerical approximators frequently exhibit uncontrollable surface artifacts that damage the overall performance of the controlled system. Symbolic Regression (SR) is an evolutionary optimization technique that automatically, without human intervention, generates analytical expressions to fit numerical data. The method has gained attention in the scientific community not only for its ability to recover known physical laws, but also for suggesting yet unknown but physically plausible and interpretable relationships. Additionally, the analytical nature of the result approximators allows to unleash the full power of mathematical apparatus. This thesis aims to develop methods to integrate SR into RL in a fully continuous case. To accomplish this goal, the following original contributions to the field have been developed. (i) Introduction of policy derivation methods. Their main goal is to exploit the full potential of using continuous action spaces, contrary to the state-of-the-art discretised set of actions. (ii) Quasi-symbolic policy derivation (QSPD) algorithm, specifically designed to be used with a symbolic approximation of the value function. The goal of the proposed algorithm is to efficiently derive continuous policy out of symbolic approximator. The experimental evaluation indicated the superiority of QSPD over state-of-the-art methods. (iii) Design of a symbolic proxy-function concept. Such a function is successfully used to alleviate the negative impacts of approximation artifacts on policy derivation. (iv) Study on fitness criterion in the context of SR for RL. The analysis indicated a fundamental flaw with any other symmetric error functions, including commonly used mean squared error. Instead, a new error function procedure has been proposed alongside with a novel fitting procedure. The experimental evaluation indicated dramatic improvement of the approximation quality for both numerical and symbolic approximators. (v) Robust symbolic policy derivation (RSPD) algorithm, which adds an extra level of robustness against imperfections in symbolic approximators. The experimental evaluation demonstrated significant improvements in the reachability of the goal state. All these contributions are then combined into a single,efficient SR for RL (ESRL) framework. Such a framework is able to tackle high-dimensional, fully-continuous RL problems out-of-the-box. The proposed framework has been tested on three bench-marks: pendulum swing-up, magnetic manipulation, and high-dimensional drone strike benchmark. %K genetic algorithms, genetic programming, Single Node Genetic Programming, reinforcement learning, optimal control, function approximation,evolutionary optimization, symbolic regression, robotics, autonomous systems %9 Ph.D. thesis %U https://cyber.felk.cvut.cz/news/eduard-alibekov-defended-his-ph-d-thesis/ %0 Journal Article %T Prediction of the shear modulus of municipal solid waste (MSW): An application of machine learning techniques %A Alidoust, Pourya %A Keramati, Mohsen %A Hamidian, Pouria %A Amlashi, Amir Tavana %A Gharehveran, Mahsa Modiri %A Behnood, Ali %J Journal of Cleaner Production %D 2021 %V 303 %@ 0959-6526 %F ALIDOUST:2021:JCP %X The dynamic properties of Municipal Solid Waste (MSW) are site-specific and need to be evaluated separately in different regions. The laboratory-based evaluation of MSW has difficulties such as an unpleasant aroma or degradability of MSW, making the testing procedure unfavorable. Moreover, these evaluations are time- and cost-intensive, which may also require trained personnel to conduct the tests. To address this concern, alternatively, the shear modulus of MSW can be estimated through some predictive models. In this study, the shear modulus was evaluated using 153 cyclic triaxial tests. For this purpose, the effects of various factors, including the shear strain (ShS), age of the MSW (Age), percentage of plastic (POP), confining pressure (CP), unit weight (UW), and loading frequency (F) on the shear modulus of MSW were evaluated. The data obtained through laboratory experiments was then employed to model the dynamic response of MSW using four different machine learning techniques including Artificial Neural Networks (ANN), Multivariate Adaptive Regression Splines (MARS), Multi-Gene Genetic Programming (MGGP), and M5 model Tree (M5Tree). A comparison of the performance of developed models indicated that the ANN model outperformed the other models. More specifically, for ANN, MARS, MGGP, and M5Tree models, the corresponding values of R-squared equal to 0.9897, 0.9640, 0.9617, and 0.8482 for the training dataset, while the values for the testing dataset for ANN, MARS, MGGP, and M5Tree are 0.9812, 0.9551, 0.9574, and 0.8745. Furthermore, although the developed models using MARS and MGGP techniques resulted in more errors compared to the ANN technique, they were found to produce reliable predictions. To further compare the performance and efficiency of the developed models and study the effects of each input variable on the output variable (i.e., shear modulus), model validity, parametric study, and sensitivity analysis were performed %K genetic algorithms, genetic programming, Municipal solid waste, Cyclic triaxial test, Shear modulus, Artificial neural network (ANN), Multivariate adaptive regression splines (MARS), Multi-gene genetic programming (MGGP), M5 model tree (M5Tree) %9 journal article %R doi:10.1016/j.jclepro.2021.127053 %U https://www.sciencedirect.com/science/article/pii/S0959652621012725 %U http://dx.doi.org/doi:10.1016/j.jclepro.2021.127053 %P 127053 %0 Conference Proceedings %T Ant Colony Optimization, Genetic Programming and a hybrid approach for credit scoring: A comparative study %A Aliehyaei, R. %A Khan, S. %S 8th International Conference on Software, Knowledge, Information Management and Applications (SKIMA) %D 2014 %8 dec %F Aliehyaei:2014:SKIMA %X Credit scoring is a commonly used method for evaluating the risk involved in granting credits. Both Genetic Programming (GP) and Ant Colony Optimisation (ACO) have been investigated in the past as possible tools for credit scoring. This paper reports an investigation into the relative performances of GP, ACO and a new hybrid GP-ACO approach, which relies on the ACO technique to produce the initial populations for the GP technique. Performance of the hybrid approach has been compared with both the GP and ACO approaches using two well-known benchmark data sets. Experimental results demonstrate the dependence of GP and ACO classification accuracies on the input data set. For any given data set, the hybrid approach performs better than the worse of the other two methods. Results also show that use of ACO in the hybrid approach has only a limited impact in improving GP performance. %K genetic algorithms, genetic programming %R doi:10.1109/SKIMA.2014.7083391 %U http://dx.doi.org/doi:10.1109/SKIMA.2014.7083391 %0 Journal Article %T Sea water level forecasting using genetic programming and comparing the performance with Artificial Neural Networks %A Ghorbani, Mohammad Ali %A Khatibi, Rahman %A Aytek, Ali %A Makarynskyy, Oleg %A Shiri, Jalal %J Computer & Geosciences %D 2010 %V 36 %N 5 %@ 0098-3004 %F AliGhorbani2010620 %X Water level forecasting at various time intervals using records of past time series is of importance in water resources engineering and management. In the last 20 years, emerging approaches over the conventional harmonic analysis techniques are based on using Genetic Programming (GP) and Artificial Neural Networks (ANNs). In the present study, the GP is used to forecast sea level variations, three time steps ahead, for a set of time intervals comprising 12 h, 24 h, 5 day and 10 day time intervals using observed sea levels. The measurements from a single tide gauge at Hillarys Boat Harbour, Western Australia, were used to train and validate the employed GP for the period from December 1991 to December 2002. Statistical parameters, namely, the root mean square error, correlation coefficient and scatter index, are used to measure their performances. These were compared with a corresponding set of published results using an Artificial Neural Network model. The results show that both these artificial intelligence methodologies perform satisfactorily and may be considered as alternatives to the harmonic analysis. %K genetic algorithms, genetic programming, Sea-level variations, Forecasting, Artificial Neural Networks, Comparative studies %9 journal article %R doi:10.1016/j.cageo.2009.09.014 %U http://www.sciencedirect.com/science/article/B6V7D-4YCS020-1/2/514d629e145e62f37dbf599a1a7608a9 %U http://dx.doi.org/doi:10.1016/j.cageo.2009.09.014 %P 620-627 %0 Book Section %T Inter-Comparison of an Evolutionary Programming Model of Suspended Sediment Time-Series with Other Local Models %A Ghorbani, M. A. %A Khatibi, R. %A Asadi, H. %A Yousefi, P. %E Ventura, Sebastian %B Genetic Programming - New Approaches and Successful Applications %D 2012 %I InTech %F AliGhorbani:2012:GPnew %K genetic algorithms, genetic programming, Gene Expression Programming, GEP, ANN, MLR, Chaos %R doi:10.5772/47801 %U http://dx.doi.org/doi:10.5772/47801 %P 255-284 %0 Journal Article %T Predictive models of laminar flame speed in NH3/H2/O3/air mixtures using multi-gene genetic programming under varied fuelling conditions %A Ali Shah, Zubair %A Marseglia, G. %A De Giorgi, M. G. %J Fuel %D 2024 %V 368 %@ 0016-2361 %F ALISHAH:2024:fuel %X The primary aim of this study is to develop and validate a novel multi-gene genetic programming approach for accurately predicting Laminar Flame Speed (LFS) in ammonia (NH3)/hydrogen (H2)/air mixtures, a key aspect in the advancement of carbon-free fuel technologies. Ammonia, particularly when blended with hydrogen, presents significant potential as a carbon-free fuel due to its enhanced reactivity. This research not only investigates the effects of hydrogen concentration, initial temperature, and pressure on LFS and Ignition Delay Time (IDT) but also explores the impact of oxidizing agents like ozone (O3) in augmenting NH3 combustion. A modified reaction mechanism was implemented and validated through parametric analysis. Main findings demonstrate that IDT decreases with higher hydrogen concentrations, increased initial temperature, and initial pressure, although the influence of pressure decreases above 10 atm. Conversely, at lower temperatures (below 1200 K) and higher hydrogen concentrations (30 percent and 50 percent), the dominance of H2 chemistry can negatively impact initial pressure. LFS increases with higher temperature and hydrogen concentration, but decreases under elevated pressure, with its effect becoming negligible above 5 atm. An optimized equivalence ratio (?) range of 1.10 - 1.15 is identified for efficient combustion. Introducing ozone into the oxidizer notably improves LFS in NH3/H2/air mixtures, with the addition of 0.01 ozone mirroring the effect of a 10 percent hydrogen addition under normal conditions. The study’s fundamental contribution is the development of a multi-gene genetic algorithm, showcasing the correlation between predicted LFS values and actual values derived from chemkin simulations. The successful validation of this methodology across various case studies underscores its potential as a robust tool in zero-carbon combustion applications, marking a significant stride in the field %K genetic algorithms, genetic programming, NH, H, Laminar Flame Speed (LFS), Ignition Delay Time (IDT), Ozone (O), Multi-gene genetic programming %9 journal article %R doi:10.1016/j.fuel.2024.131652 %U https://www.sciencedirect.com/science/article/pii/S0016236124008007 %U http://dx.doi.org/doi:10.1016/j.fuel.2024.131652 %P 131652 %0 Conference Proceedings %T A Deep Learning Approach to Predicting Solutions in Streaming Optimisation Domains %A Alissa, Mohamad %A Sim, Kevin %A Hart, Emma %Y Coello Coello, Carlos Artemio %Y Aguirre, Arturo Hernandez %Y Uribe, Josu Ceberio %Y Fabre, Mario Garza %Y Toscano Pulido, Gregorio %Y Rodriguez-Vazquez, Katya %Y Wanner, Elizabeth %Y Veerapen, Nadarajen %Y Montes, Efren Mezura %Y Allmendinger, Richard %Y Marin, Hugo Terashima %Y Wagner, Markus %Y Bartz-Beielstein, Thomas %Y Filipic, Bogdan %Y Trautmann, Heike %Y Tang, Ke %Y Koza, John %Y Goodman, Erik %Y Langdon, William B. %Y Nicolau, Miguel %Y Zarges, Christine %Y Volz, Vanessa %Y Tusar, Tea %Y Naujoks, Boris %Y Bosman, Peter A. N. %Y Whitley, Darrell %Y Solnon, Christine %Y Helbig, Marde %Y Doncieux, Stephane %Y Wilson, Dennis G. %Y Fernandez de Vega, Francisco %Y Paquete, Luis %Y Chicano, Francisco %Y Xue, Bing %Y Bacardit, Jaume %Y Mostaghim, Sanaz %Y Fieldsend, Jonathan %Y Schuetze, Oliver %Y Arnold, Dirk %Y Ochoa, Gabriela %Y Segura, Carlos %Y Cotta, Carlos %Y Emmerich, Michael %Y Zhang, Mengjie %Y Purshouse, Robin %Y Ray, Tapabrata %Y Petke, Justyna %Y Ishikawa, Fuyuki %Y Lengler, Johannes %Y Neumann, Frank %S Proceedings of the 2020 Genetic and Evolutionary Computation Conference %S GECCO ’20 %D 2020 %8 jul 8 12 %I Association for Computing Machinery %C internet %F Alissa:2020:GECCO %X In the field of combinatorial optimisation, per-instance algorithm selection still remains a challenging problem, particularly with respect to streaming problems such as packing or scheduling. Typical approaches involve training a model to predict the best algorithm based on features extracted from the data, which is well known to be a difficult task and even more challenging with streaming data. We propose a radical approach that bypasses algorithm-selection altogether by training a Deep-Learning model using solutions obtained from a set of heuristic algorithms to directly predict a solution from the instance-data. To validate the concept, we conduct experiments using a packing problem in which items arrive in batches. Experiments conducted on six large datasets using batches of varying size show the model is able to accurately predict solutions, particularly with small batch sizes, and surprisingly in a small number of cases produces better solutions than any of the algorithms used to train the model. %K genetic algorithms, deep learning, algorithm selection problem, bin-packing %R doi:10.1145/3377930.3390224 %U https://doi.org/10.1145/3377930.3390224 %U http://dx.doi.org/doi:10.1145/3377930.3390224 %P 157-165 %0 Conference Proceedings %T A Neural Approach to Generation of Constructive Heuristics %A Alissa, Mohamad %A Sim, Kevin %A Hart, Emma %Y Ong, Yew-Soon %S 2021 IEEE Congress on Evolutionary Computation (CEC) %D 2021 %8 28 jun 1 jul %C Krakow, Poland %F Alissa:2021:CEC %X Both algorithm-selection methods and hyper-heuristic methods rely on a pool of complementary heuristics. Improving the pool with new heuristics can improve performance, however, designing new heuristics can be challenging. Methods such as genetic programming have proved successful in automating this process in the past. Typically, these make use of problem state-information and existing heuristics as components. Here we propose a novel neural approach for generating constructive heuristics, in which a neural network acts as a heuristic by generating decisions. We evaluate two architectures, an Encoder-Decoder LSTM and a Feed-Forward Neural Network. Both are trained using the decisions output from existing heuristics on a large set of instances. We consider streaming instances of bin-packing problems in a continual stream that must be packed immediately in strict order and using a limited number of resources. We show that the new heuristics generated are capable of solving a subset of instances better than the well-known heuristics forming the original pool, and hence the overall value of the pool is improved w.r.t. both Falkenauers performance metric and the number of bins used. %K genetic algorithms, genetic programming, Measurement, Navigation, Heuristic algorithms, Neural networks, ANN, Evolutionary computation, Dynamic scheduling, Automatic Heuristics Generation, Hyper-Heuristics, Encoder-Decoder LSTM, Streaming Bin-packing %R doi:10.1109/CEC45853.2021.9504989 %U http://dx.doi.org/doi:10.1109/CEC45853.2021.9504989 %P 1147-1154 %0 Conference Proceedings %T Firefly Programming For Symbolic Regression Problems %A Aliwi, Mohamed %A Aslan, Selcuk %A Demirci, Sercan %S 2020 28th Signal Processing and Communications Applications Conference (SIU) %D 2020 %8 oct %F Aliwi:2020:SIU %X Symbolic regression is the process of finding a mathematical formula that fits a specific set of data by searching in different mathematical expressions. This process requires great accuracy in order to reach the correct formula. In this paper, we will present a new method for solving symbolic regression problems based on the firefly algorithm. This method is called Firefly Programming (FP). The results of applying firefly programming algorithm to some symbolic regression benchmark problems will be compared to the results of Genetic Programming (GP) and Artificial Bee Colony Programming (ABCP) methods. %K genetic algorithms, genetic programming, Optimization, Statistics, Sociology, Linear programming, Brightness, firefly algorithm, symbolic regression, automatic programming %R doi:10.1109/SIU49456.2020.9302201 %U http://dx.doi.org/doi:10.1109/SIU49456.2020.9302201 %0 Conference Proceedings %T Kernel evolution for support vector classification %A Alizadeh, Mehrdad %A Ebadzadeh, Mohammad Mehdi %S IEEE Workshop on Evolving and Adaptive Intelligent Systems (EAIS 2011) %D 2011 %8 November 15 apr %C Paris %F Alizadeh:2011:EAIS %X Support vector machines (SVMs) have been used in a variety of classification tasks. SVMs undoubtedly are one of the most effective classifiers in several data mining applications. Determination of a kernel function and related parameters has been a bottleneck for this group of classifiers. In this paper a novel approach is proposed to use genetic programming (GP) to design domain-specific and optimal kernel functions for support vector classification (SVC) which automatically adjusts the parameters. Complex low dimensional mapping function is evolved using GP to construct an optimal linear and Gaussian kernel functions in new feature space. By using the principled kernel closure properties, these basic kernels are then used to evolve more optimal kernels. To evaluate the proposed method, benchmark datasets from UCI are applied. The result indicates that for some cases the proposed methods can find a more optimal solution than evolving known kernels. %K genetic algorithms, genetic programming, Gaussian kernel functions, automatic parameter adjustment, classification task, data mining application, domain-specific kernel functions, feature space, kernel evolution, low dimensional mapping function, optimal kernel functions, optimal linear functions, principled kernel closure properties, support vector classification, support vector machines, Gaussian processes, data mining, pattern classification, support vector machines %R doi:10.1109/EAIS.2011.5945924 %U http://dx.doi.org/doi:10.1109/EAIS.2011.5945924 %P 93-99 %0 Conference Proceedings %T Software effort estimation by tuning COOCMO model parameters using differential evolution %A Aljahdali, Sultan %A Sheta, Alaa F. %S 2010 IEEE/ACS International Conference on Computer Systems and Applications (AICCSA) %D 2010 %8 16 19 may %C Hammamet, Tunisia %F Aljahdali:2010:AICCSA %X Accurate estimation of software projects costs represents a challenge for many government organisations such as the Department of Defense (DOD) and NASA. Statistical models considerably used to assist in such a computation. There is still an urgent need on finding a mathematical model which can provide an accurate relationship between the software project effort/cost and the cost drivers. A powerful algorithm which can optimise such a relationship via tuning mathematical model parameters is urgently needed. In two new model structures to estimate the effort required for software projects using Genetic Algorithms (GAs) were proposed as a modification to the famous Constructive Cost Model (COCOMO). In this paper, we follow up on our previous work and present Differential Evolution (DE) as an alternative technique to estimate the COCOMO model parameters. The performance of the developed models were tested on NASA software project dataset provided in. The developed COCOMO-DE model was able to provide good estimation capabilities. %K genetic algorithms, genetic programming, sbse, COOCMO model parameter tuning, NASA software project dataset, constructive cost model, differential evolution, mathematical model, optimisation algorithm, software effort estimation, software projects cost estimation, statistical model, optimisation, software cost estimation %R doi:10.1109/AICCSA.2010.5586985 %U http://dx.doi.org/doi:10.1109/AICCSA.2010.5586985 %0 Journal Article %T Development of Software Reliability Growth Models for Industrial Applications Using Fuzzy Logic %A Aljahdali, Sultan %J Journal of Computer Science %D 2011 %V 7 %N 10 %I Science Publications %@ 15493636 %G eng %F Aljahdali:2011:Jcomputerscience %X Problem statement: The use of Software Reliability Growth Models (SRGM) plays a major role in monitoring progress, accurately predicting the number of faults in the software during both development and testing processes; define the release date of a software product, helps in allocating resources and estimating the cost for software maintenance. This leads to achieving the required reliability level of a software product. Approach: We investigated the use of fuzzy logic on building SRGM to estimate the expected software faults during testing process. Results: The proposed fuzzy model consists of a collection of linear sub-models, based on the Takagi-Sugeno technique and attached efficiently using fuzzy membership functions to represent the expected software faults as a function of historical measured faults. A data set provided by John Musa of bell telephone laboratories (i.e., real time control, military and operating system applications) was used to show the potential of using fuzzy logic in solving the software reliability modelling problem. Conclusion: The developed models provided high performance modelling capabilities. %K software reliability growth models (SRGM), takagi-sugeno technique, fuzzy logic (FL), artificial neural net-works (ANN), model structure, linear regression model, NASA space %9 journal article %R doi:10.3844/jcssp.2011.1574.1580 %U http://www.thescipub.com/pdf/10.3844/jcssp.2011.1574.1580 %U http://dx.doi.org/doi:10.3844/jcssp.2011.1574.1580 %P 1574-1580 %0 Journal Article %T Evolving Software Effort Estimation Models Using Multigene Symbolic Regression Genetic Programming %A Aljahdali, Sultan %A Sheta, Alaa %J International Journal of Advanced Research in Artificial Intelligence %D 2013 %V 2 %N 12 %I The Science and Information (SAI) Organization %G eng %F Aljahdali:2013:IJARAI %X Software has played an essential role in engineering, economic development, stock market growth and military applications. Mature software industry count on highly predictive software effort estimation models. Correct estimation of software effort lead to correct estimation of budget and development time. It also allows companies to develop appropriate time plan for marketing campaign. Now a day it became a great challenge to get these estimates due to the increasing number of attributes which affect the software development life cycle. Software cost estimation models should be able to provide sufficient confidence on its prediction capabilities. Recently, Computational Intelligence (CI) paradigms were explored to handle the software effort estimation problem with promising results. In this paper we evolve two new models for software effort estimation using Multigene Symbolic Regression Genetic Programming (GP). One model uses the Source Line Of Code (SLOC) as input variable to estimate the Effort (E); while the second model uses the Inputs, Outputs, Files, and User Enquiries to estimate the Function Point (FP). The proposed GP models show better estimation capabilities compared to other reported models in the literature. The validation results are accepted based Albrecht data set. %K genetic algorithms, genetic programming, SBSE %9 journal article %U http://thesai.org/Downloads/IJARAI/Volume2No12/Paper_7-Evolving_Software_Effort_Estimation_Models_Using.pdf %P 52-57 %0 Conference Proceedings %T Hate Speech Detection Using Genetic Programming %A Aljero, Mona Khalifa A. %A Dimililer, Nazife %S 2020 International Conference on Advanced Science and Engineering (ICOASE) %D 2020 %8 dec %F Aljero:2020:ICOASE %X There has been a steep increase in the use of social media in our everyday lives in recent years. Along with this, there has been an increase in hate speech disseminated on these platforms, due to the anonymity of the users as well as the ease of use. Social media platforms need to filter and prevent the spread of hate speech to protect their users and society. Due to the high traffic, automatic detection of hate speech is necessary. Hate speech detection is one of the most difficult classification challenges in text mining. Research in this domain focuses on the use of supervised machine learning approaches, such as support vector machine, logistic regression, convolutional neural network, and random forest. Ensemble techniques have also been employed. However, the performance of these approaches has not yet reached an acceptable level. In this paper, we propose the use of the Genetic Programming (GP) approach for binary classification of hate speech on social media platforms. Each individual in the GP framework represents a classifier that is evolved to optimize Fl-score. Experimental results show the effectiveness of our GP approach; the proposed approach outperforms the state-of-the-art using the same dataset HatEval. %K genetic algorithms, genetic programming %R doi:10.1109/ICOASE51841.2020.9436621 %U http://dx.doi.org/doi:10.1109/ICOASE51841.2020.9436621 %0 Journal Article %T Genetic Programming Approach to Detect Hate Speech in Social Media %A Aljero, Mona Khalifa A. %A Dimililer, Nazife %J IEEE Access %D 2021 %V 9 %@ 2169-3536 %F Aljero:2021:A %X Social media sites, which became central to our everyday lives, enable users to freely express their opinions, feelings, and ideas due to a certain level of depersonalization and anonymity they provide. If there is no control, these platforms may be used to propagate hate speech. In fact, in recent years, hate speech has increased on social media. Therefore, there is a need to monitor and prevent hate speech on these platforms. However, manual control is not feasible due to the high traffic of content production on social media sites. Moreover, the language used and the length of the messages provide a challenge when using classical machine learning approaches as prediction methods. This paper presents a genetic programming (GP) model for detecting hate speech where each chromosome represents a classifier employing a universal sentence encoder as a feature. A novel mutation technique that affects only the feature values in combination with the standard one-point mutation technique improved the performance of the GP model by enriching the offspring pool with alternative solutions. The proposed GP model outperformed all state-of-the-art systems for the four publicly available hate speech datasets. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1109/ACCESS.2021.3104535 %U http://dx.doi.org/doi:10.1109/ACCESS.2021.3104535 %P 115115-115125 %0 Journal Article %T Binary text classification using genetic programming with crossover-based oversampling for imbalanced datasets %A Aljero, Mona Khalifa A. %A Dimililer, Nazife %J Turkish J. Electr. Eng. Comput. Sci. %D 2023 %V 31 %N 1 %F DBLP:journals/elektrik/AljeroD23 %K genetic algorithms, genetic programming %9 journal article %R doi:10.55730/1300-0632.3978 %U https://doi.org/10.55730/1300-0632.3978 %U http://dx.doi.org/doi:10.55730/1300-0632.3978 %P 180-192 %0 Journal Article %T Ensemble Optimization for Invasive Ductal Carcinoma (IDC) Classification Using Differential Cartesian Genetic Programming %A Alkhaldi, Eid %A Salari, Ezzatollah %J IEEE Access %D 2022 %V 10 %@ 2169-3536 %F Alkhaldi:2022:IEEEAccess %X The high cost of acquiring annotated histological slides for breast specimens entails exploiting an ensemble of models appropriately trained on small datasets. Histological Image Classification ensembles strive to accurately detect abnormal tissues in the breast samples by determining the correlation between the predictions of its weak learners. Nonetheless, the state-of-the-art ensemble methods, such as boosting and bagging, count merely on manipulating the dataset and lack intelligent ensemble decision making. Furthermore, the methods mentioned above are short of the diversity of the weak models of the ensemble. Likewise, other commonly used voting strategies, such as weighted averaging, are limited to how the classifiers’ diversity and accuracy are balanced. Hence, In this paper, we assemble a Neural Network ensemble that integrates the models trained on small datasets by employing biologically-inspired methods. Our procedure is comprised of two stages. First, we train multiple heterogeneous pre-trained models on the benchmark Breast Histopathology Images for Invasive Ductal Carcinoma (IDC) classification dataset. In the second meta-training phase, we use the differential Cartesian Genetic Programming (dCGP) to generate a Neural Network that merges the trained models optimally. We compared our empirical outcomes with other state-of-the-art techniques. Our results demonstrate that improvising a Neural Network ensemble using Cartesian Genetic Programming transcended formerly published algorithms on slim datasets. %K genetic algorithms, genetic programming, Cartesian genetic programming %9 journal article %R doi:10.1109/ACCESS.2022.3228176 %U http://dx.doi.org/doi:10.1109/ACCESS.2022.3228176 %P 128790-128799 %0 Thesis %T Modelling pile capacity and load-settlement behaviour of piles embedded in sand & mixed soils using artificial intelligence %A Alkroosh, Iyad Salim Jabor %D 2011 %8 may %C Australia %C Curtin University, Faculty of Engineering and Computing, Department of Civil Engineering %G en %F Alkroosh:thesis %X This thesis presents the development of numerical models which are intended to be used to predict the bearing capacity and the load-settlement behaviour of pile foundations embedded in sand and mixed soils. Two artificial intelligence techniques, the gene expression programming (GEP) and the artificial neural networks (ANNs), are used to develop the models. The GEP is a developed version of genetic programming (GP). Initially, the GEP is used to model the bearing capacity of the bored piles, concrete driven piles and steel driven piles. The use of the GEP is extended to model the load-settlement behaviour of the piles but achieved limited success. Alternatively, the ANNs have been employed to model the load-settlement behaviour of the piles. The GEP and the ANNs are numerical modelling techniques that depend on input data to determine the structure of the model and its unknown parameters. The GEP tries to mimic the natural evolution of organisms and the ANNs tries to imitate the functions of human brain and nerve system. The two techniques have been applied in the field of geotechnical engineering and found successful in solving many problems. The data used for developing the GEP and ANN models are collected from the literature and comprise a total of 50 bored pile load tests and 58 driven pile load tests (28 concrete pile load tests and 30 steel pile load tests) as well as CPT data. The bored piles have different sizes and round shapes, with diameters ranging from 320 to 1800 mm and lengths from 6 to 27 m. The driven piles also have different sizes and shapes (i.e. circular, square and hexagonal), with diameters ranging from 250 to 660 mm and lengths from 8 to 36 m. All the information of case records in the data source is reviewed to ensure the reliability of used data. The variables that are believed to have significant effect on the bearing capacity of pile foundations are considered. They include pile diameter, embedded length, weighted average cone point resistance within tip influence zone and weighted average cone point resistance and weighted average sleeve friction along shaft. The sleeve friction values are not available in the bored piles data, so the weighted average sleeve friction along shaft is excluded from bored piles models. The models output is the pile capacity (interpreted failure load). Additional input variables are included for modelling the load-settlement behaviour of piles. They include settlement, settlement increment and current state of load settlement. The output is the next state of load-settlement. The data are randomly divided into two statistically consistent sets, training set for model calibration and an independent validation set for model performance verification. The predictive ability of the developed GEP model is examined via comparing the performance of the model in training and validation sets. Two performance measures are used: the mean and the coefficient of correlation. The performance of the model was also verified through conducting sensitivity analysis which aimed to determine the response of the model to the variations in the values of each input variables providing the other input variables are constant. The accuracy of the GEP model was evaluated further by comparing its performance with number of currently adopted traditional CPT-based methods. For this purpose, several ranking criteria are used and whichever method scores best is given rank 1. The GEP models, for bored and driven piles, have shown good performance in training and validation sets with high coefficient of correlation between measured and predicted values and low mean values. The results of sensitivity analysis have revealed an incremental relationship between each of the input variables and the output, pile capacity. This agrees with what is available in the geotechnical knowledge and experimental data. The results of comparison with CPT-based methods have shown that the GEP models perform well. %K genetic algorithms, genetic programming, gene expression programming, modelling pile capacity, load-settlement behaviour of piles, artificial intelligence, (GEP) and the artificial neural networks (ANNs), numerical modelling techniques %9 Ph.D. thesis %U http://espace.library.curtin.edu.au/Modelling.pdf %0 Book %T Modelling pile capacity & load-settlement behaviour from CPT data: For piles in sand and mixed soils using artificial intelligence %A Alkroosh, Iyad %D 2012 %8 23 may %I Lambert Academic Publishing %F Alkroosh:book %X This work involves the presentation of new approach attempted to predict the axial capacity and load-settlement behaviour of piles embedded in sand and mixed soils. Two artificial intelligence techniques including Gene Expression Programming (GEP) and Artificial Neural Networks (ANNs) have been used in the approach. The work begins with the definitions of the two techniques and explanation of their terminology and the theories which each of them is based on. The work also comprises extensive literature review of the proposed procedures for evaluating pile capacity and load settlement behaviour. The application of the artificial intelligence in the work begins with the use of the GEP for modelling the pile capacity. The modelling involves data collection, selection of input variables, data division, determination of setting parameters and GEP model selection and model formulation and validation. Two models are developed, a model for bored piles and two others for driven piles. In the second phase of this work, the artificial neural network used for modelling the load-settlement behaviour of the piles. %K genetic algorithms, genetic programming, Gene Expression Programming, ANN %U https://www.amazon.co.uk/Modelling-pile-capacity-load-settlement-behaviour/dp/3848436906 %0 Journal Article %T Predicting pile dynamic capacity via application of an evolutionary algorithm %A Alkroosh, I. %A Nikraz, H. %J Soils and Foundations %D 2014 %V 54 %N 2 %@ 0038-0806 %F Alkroosh:2014:SF %X This study presents the development of a new model obtained from the correlation of dynamic input and SPT data with pile capacity. An evolutionary algorithm, gene expression programming (GEP), was used for modelling the correlation. The data used for model development comprised 24 cases obtained from existing literature. The modelling was carried out by dividing the data into two sets: a training set for model calibration and a validation set for verifying the generalisation capability of the model. The performance of the model was evaluated by comparing its predictions of pile capacity with experimental data and with predictions of pile capacity by two commonly used traditional methods and the artificial neural networks (ANNs) model. It was found that the model performs well with a coefficient of determination, mean, standard deviation and probability density at 50percent equivalent to 0.94, 1.08, 0.14, and 1.05, respectively, for the training set, and 0.96, 0.95, 0.13, and 0.93, respectively, for the validation set. The low values of the calculated mean squared error and mean absolute error indicated that the model is accurate in predicting pile capacity. The results of comparison also showed that the model predicted pile capacity more accurately than traditional methods including the ANNs model. %K genetic algorithms, genetic programming, gene expression programming %9 journal article %R doi:10.1016/j.sandf.2014.02.013 %U http://www.sciencedirect.com/science/article/pii/S0038080614000213 %U http://dx.doi.org/doi:10.1016/j.sandf.2014.02.013 %P 233-242 %0 Journal Article %T High-throughput classification of yeast mutants for functional genomics using metabolic footprinting %A Allen, Jess %A Davey, Hazel M. %A Broadhurst, David %A Heald, Jim K. %A Rowland, Jem J. %A Oliver, Stephen G. %A Kell, Douglas B. %J Nature Biotechnology %D 2003 %8 jun %V 21 %N 6 %F Allen:2003:NB %X Many technologies have been developed to help explain the function of genes discovered by systematic genome sequencing. At present, transcriptome and proteome studies dominate large-scale functional analysis strategies. Yet the metabolome, because it is ’downstream’, should show greater effects of genetic or physiological changes and thus should be much closer to the phenotype of the organism. We earlier presented a functional analysis strategy that used metabolic fingerprinting to reveal the phenotype of silent mutations of yeast genes1. However, this is difficult to scale up for high-throughput screening. Here we present an alternative that has the required throughput (2 min per sample). This ’metabolic footprinting’ approach recognizes the significance of ’overflow metabolism’ in appropriate media. Measuring intracellular metabolites is time-consuming and subject to technical difficulties caused by the rapid turnover of intracellular metabolites and the need to quench metabolism and separate metabolites from the extracellular space. We therefore focused instead on direct, noninvasive, mass spectrometric monitoring of extracellular metabolites in spent culture medium. Metabolic footprinting can distinguish between different physiological states of wild-type yeast and between yeast single-gene deletion mutants even from related areas of metabolism. By using appropriate clustering and machine learning techniques, the latter based on genetic programming2-8, we show that metabolic footprinting is an effective method to classify ’unknown’ mutants by genetic defect. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1038/nbt823 %U http://dbkgroup.org/Papers/NatureBiotechnology21(692-696).pdf %U http://dx.doi.org/doi:10.1038/nbt823 %P 692-696 %0 Journal Article %T Discrimination of Modes of Action of Antifungal Substances by Use of Metabolic Footprinting %A Allen, Jess %A Davey, Hazel M. %A Broadhurst, David %A Rowland, Jem J. %A Oliver, Stephen G. %A Kell, Douglas B. %J Applied and Environmental Microbiology %D 2004 %8 oct %V 70 %N 10 %F Allen:2004:AEM %X Diploid cells of Saccharomyces cerevisiae were grown under controlled conditions with a Bioscreen instrument, which permitted the essentially continuous registration of their growth via optical density measurements. Some cultures were exposed to concentrations of a number of antifungal substances with different targets or modes of action (sterol biosynthesis, respiratory chain, amino acid synthesis, and the uncoupler). Culture supernatants were taken and analyzed for their metabolic footprints by using direct-injection mass spectrometry. Discriminant function analysis and hierarchical cluster analysis allowed these antifungal compounds to be distinguished and classified according to their modes of action. Genetic programming, a rule-evolving machine learning strategy, allowed respiratory inhibitors to be discriminated from others by using just two masses. Metabolic footprinting thus represents a rapid, convenient, and information-rich method for classifying the modes of action of antifungal substances. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1128/AEM.70.10.6157-6165.2004 %U http://dx.doi.org/doi:10.1128/AEM.70.10.6157-6165.2004 %P 6157-6165 %0 Conference Proceedings %T Evolving reusable 3D packing heuristics with genetic programming %A Allen, Sam %A Burke, Edmund K. %A Hyde, Matthew R. %A Kendall, Graham %Y Raidl, Guenther %Y Rothlauf, Franz %Y Squillero, Giovanni %Y Drechsler, Rolf %Y Stuetzle, Thomas %Y Birattari, Mauro %Y Congdon, Clare Bates %Y Middendorf, Martin %Y Blum, Christian %Y Cotta, Carlos %Y Bosman, Peter %Y Grahl, Joern %Y Knowles, Joshua %Y Corne, David %Y Beyer, Hans-Georg %Y Stanley, Ken %Y Miller, Julian F. %Y van Hemert, Jano %Y Lenaerts, Tom %Y Ebner, Marc %Y Bacardit, Jaume %Y O’Neill, Michael %Y Di Penta, Massimiliano %Y Doerr, Benjamin %Y Jansen, Thomas %Y Poli, Riccardo %Y Alba, Enrique %S GECCO ’09: Proceedings of the 11th Annual conference on Genetic and evolutionary computation %D 2009 %8 August 12 jul %I ACM %C Montreal %F DBLP:conf/gecco/AllenBHK09 %X This paper compares the quality of reusable heuristics designed by genetic programming (GP) to those designed by human programmers. The heuristics are designed for the three dimensional knapsack packing problem. Evolutionary computation has been employed many times to search for good quality solutions to such problems. However, actually designing heuristics with GP for this problem domain has never been investigated before. In contrast, the literature shows that it has taken years of experience by human analysts to design the very effective heuristic methods that currently exist. Hyper-heuristics search a space of heuristics, rather than directly searching a solution space. GP operates as a hyper-heuristic in this paper, because it searches the space of heuristics that can be constructed from a given set of components. We show that GP can design simple, yet effective, stand-alone constructive heuristics. While these heuristics do not represent the best in the literature, the fact that they are designed by evolutionary computation, and are human competitive, provides evidence that further improvements in this GP methodology could yield heuristics superior to those designed by humans. %K genetic algorithms, genetic programming %R doi:10.1145/1569901.1570029 %U http://dx.doi.org/doi:10.1145/1569901.1570029 %P 931-938 %0 Thesis %T Algorithms and data structures for three-dimensional packing %A Allen, Sam D. %D 2011 %8 jul %C UK %C School of Computer Science, University of Nottingham %F Allen:thesis %X Cutting and packing problems are increasingly prevalent in industry. A well used freight vehicle will save a business money when delivering goods, as well as reducing the environmental impact, when compared to sending out two lesser-used freight vehicles. A cutting machine that generates less wasted material will have a similar effect. Industry reliance on automating these processes and improving productivity is increasing year-on-year. This thesis presents a number of methods for generating high quality solutions for these cutting and packing challenges. It does so in a number of ways. A fast, efficient framework for heuristically generating solutions to large problems is presented, and a method of incrementally improving these solutions over time is implemented and shown to produce even higher packing. The results from these findings provide the best known results for 28 out of 35 problems from the literature. This framework is analysed and its effectiveness shown over a number of datasets, along with a discussion of its theoretical suitability for higher-dimensional packing problems. A way of automatically generating new heuristics for this framework that can be problem specific, and therefore highly tuned to a given dataset, is then demonstrated and shown to perform well when compared to the expert-designed packing heuristics. Finally some mathematical models which can guarantee the optimality of packings for small datasets are given, and the (in)effectiveness of these techniques discussed. The models are then strengthened and a novel model presented which can handle much larger problems under certain conditions. The thesis finishes with a discussion about the applicability of the different approaches taken to the real-world problems that motivate them. %K genetic algorithms, genetic programming, packing, shipment, business, operations research %9 Ph.D. thesis %U http://etheses.nottingham.ac.uk/2779/1/thesis_nicer.pdf %0 Book Section %T Content Diversity in Genetic Programming and its Correlation with Fitness %A Almal, A. %A Worzel, W. P. %A Wollesen, E. A. %A MacLean, C. D. %E Yu, Tina %E Riolo, Rick L. %E Worzel, Bill %B Genetic Programming Theory and Practice III %S Genetic Programming %D 2005 %8 December 14 may %V 9 %I Springer %C Ann Arbor %@ 0-387-28110-X %F Almal:2005:GPTP %X A technique used to visualise DNA sequences is adapted to visualize large numbers of individuals in a genetic programming population. This is used to examine how the content diversity of a population changes during evolution and how this correlates with changes in fitness. %K genetic algorithms, genetic programming, diversity, chaos game, fitness correlation, visualisation %R doi:10.1007/0-387-28111-8_12 %U http://dx.doi.org/doi:10.1007/0-387-28111-8_12 %P 177-190 %0 Conference Proceedings %T Using genetic programming to classify node positive patients in bladder cancer %A Almal, Arpit A. %A Mitra, Anirban P. %A Datar, Ram H. %A Lenehan, Peter F. %A Fry, David W. %A Cote, Richard J. %A Worzel, William P. %Y Keijzer, Maarten %Y Cattolico, Mike %Y Arnold, Dirk %Y Babovic, Vladan %Y Blum, Christian %Y Bosman, Peter %Y Butz, Martin V. %Y Coello Coello, Carlos %Y Dasgupta, Dipankar %Y Ficici, Sevan G. %Y Foster, James %Y Hernandez-Aguirre, Arturo %Y Hornby, Greg %Y Lipson, Hod %Y McMinn, Phil %Y Moore, Jason %Y Raidl, Guenther %Y Rothlauf, Franz %Y Ryan, Conor %Y Thierens, Dirk %S GECCO 2006: Proceedings of the 8th annual conference on Genetic and evolutionary computation %D 2006 %8 August 12 jul %V 1 %I ACM Press %C Seattle, Washington, USA %@ 1-59593-186-4 %F 1144040 %K genetic algorithms, genetic programming, Biological Applications, algorithms and similarity measures, bladder cancer, classification rules, classifier design and evaluation, concept learning and induction, feature design and evaluation, feature selection, machine learning, Nodal staging, pattern analysis, program synthesis, synthesis %R doi:10.1145/1143997.1144040 %U http://gpbib.cs.ucl.ac.uk/gecco2006/docs/p239.pdf %U http://dx.doi.org/doi:10.1145/1143997.1144040 %P 239-246 %0 Book Section %T Program Structure-Fitness Disconnect and Its Impact On Evolution In GP %A Almal, A. A. %A MacLean, C. D. %A Worzel, W. P. %E Riolo, Rick L. %E Soule, Terence %E Worzel, Bill %B Genetic Programming Theory and Practice V %S Genetic and Evolutionary Computation %D 2007 %8 17 19 may %I Springer %C Ann Arbor %F Almal:2007:GPTP %X Simple Genetic Programming (GP) is generally considered to lack the strong separation between genotype and phenotype found in natural evolution. In many cases, the genotype and the phenotype are considered identical in GP since the program representation does not undergo any modification prior to its encounter with ’environment’ in the form of inputs and a fitness function. However, this view overlooks a key fact: fitness in GP is determined without reference to the makeup of the individual programs but evolutionary changes occur in the structure and content of the individual without reference to its fitness. This creates a disconnect between ’genetic recombination’ and fitness similar to that in nature that can create unexpected effects during the evolution of a population and suggests an important dynamic that has not been thoroughly considered by the GP community. This paper describes some of the observed effects of this disconnect and studies some approaches for the estimating diversity of a population which could lead to a new way of modelling the dynamics of GP. We also speculate on the similarity of these effects and some recently studied aspects of natural evolution. %K genetic algorithms, genetic programming, phenotype, genotype, evolutionary dynamics, GP structure, GP content, speciation, population, fitness %R doi:10.1007/978-0-387-76308-8_9 %U http://dx.doi.org/doi:10.1007/978-0-387-76308-8_9 %P 143-158 %0 Book Section %T A Population Based Study of Evolutionary Dynamics in Genetic Programming %A Almal, A. A. %A MacLean, C. D. %A Worzel, W. P. %E Riolo, Rick L. %E Soule, Terence %E Worzel, Bill %B Genetic Programming Theory and Practice VI %S Genetic and Evolutionary Computation %D 2008 %8 15 17 may %I Springer %C Ann Arbor %F Almal:2008:GPTP %K genetic algorithms, genetic programming %R doi:10.1007/978-0-387-87623-8_2 %U http://dx.doi.org/doi:10.1007/978-0-387-87623-8_2 %P 19-29 %0 Conference Proceedings %T On the Detection of Community Smells using Genetic Programming-based Ensemble Classifier Chain %A Almarimi, Nuri %A Ouni, Ali %A Chouchen, Moataz %A Saidani, Islem %A Mkaouer, Mohamed Wiem %S 15th IEEE/ACM International Conference on Global Software Engineering (ICGSE) %D 2020 %8 26 jun %C internet %F almarimi2020community %X Community smells are symptoms of organizational and social issues within the software development community that often increase the project costs and impact software quality. Recent studies have identified a variety of community smells and defined them as suboptimal patterns connected to organizational-social structures in the software development community such as the lack of communication, coordination and collaboration. Recognizing the advantages of the early detection of potential community smells in a software project, we introduce a novel approach that learns from various community organizational and social practices to provide an automated support for detecting community smells. In particular, our approach learns from a set of interleaving organizational-social symptoms that characterize the existence of community smell instances in a software project. We build a multi-label learning model to detect 8 common types of community smells. We use the ensemble classifier chain (ECC) model that transforms multi-label problems into several single-label problems which are solved using genetic programming (GP) to find the optimal detection rules for each smell type. To evaluate the performance of our approach, we conducted an empirical study on a benchmark of 103 open source projects and 407 community smell instances. The statistical tests of our results show that our approach can detect the eight considered smell types with an average F-measure of 89percent achieving a better performance compared to different state-of-the-art techniques. Furthermore, we found that the most influential factors that best characterize community smells include the social network density and closeness centrality as well as the standard deviation of the number of developers per time zone and per community. %K genetic algorithms, genetic programming, SBSE, community smells, social debt, socio-technical factors, search-based software engineering, multi-label learning %R doi:10.1145/3372787.3390439 %U https://conf.researchr.org/details/icgse-2020/icgse-2020-research-papers/6/On-the-Detection-of-Community-Smells-using-Genetic-Programming-based-Ensemble-Classif %U http://dx.doi.org/doi:10.1145/3372787.3390439 %P 43-54 %0 Conference Proceedings %T On the Detection of Community Smells Using Genetic Programming-based Ensemble Classifier Chain %A Almarimi, Nuri %A Ouni, Ali %A Chouchen, Moataz %A Saidani, Islem %A Mkaouer, Mohamed Wiem %S 2020 ACM/IEEE 15th International Conference on Global Software Engineering (ICGSE) %D 2020 %8 may %F Almarimi:2020:ICGSE %X Community smells are symptoms of organizational and social issues within the software development community that often increase the project costs and impact software quality. Recent studies have identified a variety of community smells and defined them as sub-optimal patterns connected to organizational-social structures in the software development community such as the lack of communication, coordination and collaboration. Recognizing the advantages of the early detection of potential community smells in a software project, we introduce a novel approach that learns from various community organizational and social practices to provide an auto-mated support for detecting community smells. In particular, our approach learns from a set of interleaving organizational-social symptoms that characterise the existence of community smell in-stances in a software project. We build a multi-label learning model to detect 8 common types of community smells. We use the ensemble classifier chain (ECC) model that transforms multi-label problems into several single-label problems which are solved using genetic programming (GP) to find the optimal detection rules for each smell type. To evaluate the performance of our approach, we conducted an empirical study on a benchmark of 103 open source projects and 407 community smell instances. The statistical tests of our results show that our approach can detect the eight considered smell types with an average F-measure of 89percent achieving a better performance compared to different state-of-the-art techniques. Furthermore, we found that the most influential factors that best characterise community smells include the social network density and closeness centrality as well as the standard deviation of the number of developers per time zone and per community. %K genetic algorithms, genetic programming, Search-based software engineering, SBSE, Costs, Social networking (online), Standards organizations, Collaboration, Transforms, Software quality, Community smells, Social debt, Socio-technical factors, Multi-label learning %U http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=10148849 %P 43-54 %0 Conference Proceedings %T Genetic Programming with External Memory in Sequence Recall Tasks %A Al Masalma, Mihyar %A Heywood, Malcolm %Y Trautmann, Heike %Y Doerr, Carola %Y Moraglio, Alberto %Y Bartz-Beielstein, Thomas %Y Filipic, Bogdan %Y Gallagher, Marcus %Y Ong, Yew-Soon %Y Gupta, Abhishek %Y Kononova, Anna V. %Y Wang, Hao %Y Emmerich, Michael %Y Bosman, Peter A. N. %Y Zaharie, Daniela %Y Caraffini, Fabio %Y Dreo, Johann %Y Auger, Anne %Y Dietric, Konstantin %Y Dufosse, Paul %Y Glasmachers, Tobias %Y Hansen, Nikolaus %Y Mersmann, Olaf %Y Posik, Petr %Y Tusar, Tea %Y Brockhoff, Dimo %Y Eftimov, Tome %Y Kerschke, Pascal %Y Naujoks, Boris %Y Preuss, Mike %Y Volz, Vanessa %Y Derbel, Bilel %Y Li, Ke %Y Li, Xiaodong %Y Zapotecas, Saul %Y Zhang, Qingfu %Y Coletti, Mark %Y Schuman, Catherine (Katie) %Y Scott, Eric “Siggy” %Y Patton, Robert %Y Wiegand, Paul %Y Bassett, Jeffrey K. %Y Gunaratne, Chathika %Y Chugh, Tinkle %Y Allmendinger, Richard %Y Hakanen, Jussi %Y Tauritz, Daniel %Y Woodward, John %Y Lopez-Ibanez, Manuel %Y McCall, John %Y Bacardit, Jaume %Y Brownlee, Alexander %Y Cagnoni, Stefano %Y Iacca, Giovanni %Y Walker, David %Y Toutouh, Jamal %Y O’Reilly, UnaMay %Y Machado, Penousal %Y Correia, Joao %Y Nesmachnow, Sergio %Y Ceberio, Josu %Y Villanueva, Rafael %Y Hidalgo, Ignacio %Y Fernandez de Vega, Francisco %Y Paolo, Giuseppe %Y Coninx, Alex %Y Cully, Antoine %Y Gaier, Adam %Y Wagner, Stefan %Y Affenzeller, Michael %Y Bruce, Bobby R. %Y Nowack, Vesna %Y Blot, Aymeric %Y Winter, Emily %Y Langdon, William B. %Y Petke, Justyna %Y Fernandez Alzueta, Silvino %Y Valledor Pellicer, Pablo %Y Stuetzle, Thomas %Y Paetzel, David %Y Wagner, Alexander %Y Heider, Michael %Y Veerapen, Nadarajen %Y Malan, Katherine %Y Liefooghe, Arnaud %Y Verel, Sebastien %Y Ochoa, Gabriela %Y Omidvar, Mohammad Nabi %Y Sun, Yuan %Y Tarantino, Ernesto %Y Ivanoe, De Falco %Y Della Cioppa, Antonio %Y Umberto, Scafuri %Y Rieffel, John %Y Mouret, Jean-Baptiste %Y Doncieux, Stephane %Y Nikolaidis, Stefanos %Y Togelius, Julian %Y Fontaine, Matthew C. %Y Georgescu, Serban %Y Chicano, Francisco %Y Whitley, Darrell %Y Kyriienko, Oleksandr %Y Dahl, Denny %Y Shir, Ofer %Y Spector, Lee %Y Rahat, Alma %Y Everson, Richard %Y Fieldsend, Jonathan %Y Wang, Handing %Y Jin, Yaochu %Y Hemberg, Erik %Y Elsayed, Marwa A. %Y Kommenda, Michael %Y La Cava, William %Y Kronberger, Gabriel %Y Gustafson, Steven %S Proceedings of the 2022 Genetic and Evolutionary Computation Conference Companion %S GECCO ’22 %D 2022 %8 September 13 jul %I Association for Computing Machinery %C Boston, USA %F alMasalma:2022:GECCOcomp %X Partially observable tasks imply that a learning agent has to recall previous state in order to make a decision in the present. Recent research with neural networks have investigated both internal and external memory mechanisms for this purpose, as well as proposing benchmarks to measure their effectiveness. These developments motivate our investigation using genetic programming and an external linked list memory model. A thorough empirical evaluation using a scalable sequence recall benchmark establishes the underlying strength of the approach. In addition, we assess the impact of decisions made regarding the instruction set and characterize the sensitivity to noise / obfuscation in the definition of the benchmarks. Compared to neural solutions to these benchmarks, GP extends the state-of-the-art to greater task depths than previously possible. %K genetic algorithms, genetic programming, modularity, partially observable, external memory %R doi:10.1145/3520304.3528883 %U http://dx.doi.org/doi:10.1145/3520304.3528883 %P 518-521 %0 Journal Article %T Benchmarking ensemble genetic programming with a linked list external memory on scalable partially observable tasks %A Al Masalma, Mihyar %A Heywood, Malcolm %J Genetic Programming and Evolvable Machines %D 2022 %8 30 nov %V 23 %N Suppl 1 %@ 1389-2576 %F alMasalma:2023:GPEM %X Reactive learning agents cannot solve partially observable sequential decision-making tasks as they are limited to defining outcomes purely in terms of the observable state. However, augmenting reactive agents with external memory might provide a path for addressing this limitation. In this work, external memory takes the form of a linked list data structure that programs have to learn how to use. We identify conditions under which additional recurrent connectivity from program output to input is necessary for state disambiguation. Benchmarking against recent results from the neural network literature on three scalable partially observable sequential decision-making tasks demonstrates that the proposed approach scales much more effectively. Indeed, solutions are shown to generalize to far more difficult sequences than those experienced under training conditions. Moreover, recommendations are made regarding the instruction set and additional benchmarking is performed with input state values designed to explicitly disrupt the identification of useful states for later recall. The protected division operator appears to be particularly useful in developing simple solutions to all three tasks. %K genetic algorithms, genetic programming, External memory, Partial observability, Internal state, Ensembles, noop, pop_head, pop_tail, DIV %9 journal article %R doi:10.1007/s10710-022-09446-8 %U https://rdcu.be/daFLX %U http://dx.doi.org/doi:10.1007/s10710-022-09446-8 %P s1-s29 %0 Journal Article %T Remote Sensing Image Classification Using Genetic-Programming-Based Time Series Similarity Functions %A Almeida, Alexandre E. %A da S. Torres, Ricardo %J IEEE Geoscience and Remote Sensing Letters %D 2017 %8 sep %V 14 %N 9 %@ 1545-598X %F Almeida:2017:ieeeGRSL %X In several applications, the automatic identification of regions of interest in remote sensing images is based on the assessment of the similarity of associated time series, i.e., two regions are considered as belonging to the same class if the patterns found in their spectral information observed over time are somewhat similar. In this letter, we investigate the use of a genetic programming (GP) framework to discover an effective combination of time series similarity functions to be used in remote sensing classification tasks. Performed experiments in a Forest-Savanna classification scenario demonstrated that the GP framework yields effective results when compared with the use of traditional widely used similarity functions in isolation. %K genetic algorithms, genetic programming, remote sensing, time series similarity %9 journal article %R doi:10.1109/LGRS.2017.2719033 %U http://dx.doi.org/doi:10.1109/LGRS.2017.2719033 %P 1499-1503 %0 Journal Article %T Deriving vegetation indices for phenology analysis using genetic programming %A Almeida, Jurandy %A dos Santos, Jefersson A. %A Miranda, Waner O. %A Alberton, Bruna %A Morellato, Leonor Patricia C. %A da S. Torres, Ricardo %J Ecological Informatics %D 2015 %V 26, Part 3 %@ 1574-9541 %F Almeida:2015:EI %X Plant phenology studies recurrent plant life cycle events and is a key component for understanding the impact of climate change. To increase accuracy of observations, new technologies have been applied for phenological observation, and one of the most successful strategies relies on the use of digital cameras, which are used as multi-channel imaging sensors to estimate colour changes that are related to phenological events. We monitor leaf-changing patterns of a cerrado-savanna vegetation by taking daily digital images. We extract individual plant color information and correlate with leaf phenological changes. For that, several vegetation indices associated with plant species are exploited for both pattern analysis and knowledge extraction. In this paper, we present a novel approach for deriving appropriate vegetation indices from vegetation digital images. The proposed method is based on learning phenological patterns from plant species through a genetic programming framework. A comparative analysis of different vegetation indices is conducted and discussed. Experimental results show that our approach presents higher accuracy on characterising plant species phenology. %K genetic algorithms, genetic programming, Remote phenology, Digital cameras, Image analysis, Vegetation indices %9 journal article %R doi:10.1016/j.ecoinf.2015.01.003 %U http://www.sciencedirect.com/science/article/pii/S1574954115000114 %U http://dx.doi.org/doi:10.1016/j.ecoinf.2015.01.003 %P 61-69 %0 Conference Proceedings %T A Genetically Programmable Hybrid Virtual Reconfigurable Architecture for Image Filtering Applications %A Almeida, M. A. %A Pedrino, E. C. %A Nicoletti, M. C. %S 2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) %D 2016 %8 oct %F Almeida:2016:SIBGRAPI %X A new and efficient automatic hybrid method, called Hy-EH, based on Virtual Reconfigurable Architectures (VRAs) and implemented in Field Programmable Gate Arrays (FPGAs) is proposed, for a hardware-embedded construction of image filters. The method also encompass an evolutionary software system, which represents the chromosome as a bi-dimensional grid of function elements (FEs), entirely parametrised using the Verilog-HDL (Verilog Hardware Description Language), which is reconfigured using the MATLAB toolbox GPLAB, before its download into the FPGA. In the so-called intrinsic proposals, evolutionary processes take place internally to the hardware, in a pre-defined fixed way, in extrinsic proposals evolutionary processes happen externally to the hardware. The hybrid Hy-EH method, described in this paper allows for the intrinsic creation of a flexible-sized hardware, in an extrinsic way i.e., by means of an evolutionary process that happens externally to the hardware. Hy-EH is also a convenient choice as far as extrinsic methods are considered, since it does not depend on a proprietary solution for its implementation. A comparative analysis of using the Hy-EH versus an existing intrinsic proposal, in two well-known problems, has been conducted. Results show that by using Hy-EH there was little hardware complexity due to the optimised and more flexible use of shorter chromosomes. %K genetic algorithms, genetic programming %R doi:10.1109/SIBGRAPI.2016.029 %U http://dx.doi.org/doi:10.1109/SIBGRAPI.2016.029 %P 152-157 %0 Journal Article %T Hybrid Evolvable Hardware for automatic generation of image filters %A Almeida, M. A. %A Pedrino, E. C. %J Integrated Computer-Aided Engineering %D 2018 %V 25 %N 3 %@ 1069-2509 %F Almeida:2018:ICAE %X In this article, a new framework is proposed and implemented for automatic generation of image filters in reconfigurable hardware (FPGA), called H-EHW (Hybrid-Evolvable Hardware). This consists basically of two modules. The first (training module) is responsible for the automatic generation of solutions (filters). The second (fusion module) converts such solutions into hardware, thus creating a virtual and reconfigurable architecture for fast image processing. Monochromatic pairs of images are used for the system training and testing. Extensive tests show that there are several benefits of the proposed system when compared to other similar systems described in the literature, such as: reduced phenotype length (generated circuit), reduced reconfiguration time, greater hardware reconfiguration flexibility and no more need for the manipulation of the bitstream of the FPGA for circuit evolution (a problem often encountered in practice by designers). %K genetic algorithms, genetic programming, Evolvable Hardware, FPGA, virtual reconfigurable architecture %9 journal article %R doi:10.3233/ICA-180561 %U http://dx.doi.org/doi:10.3233/ICA-180561 %P 289-303 %0 Book Section %T Communicating Agents Developed with Genetic Programming %A Almgren, Magnus %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2000 %D 2000 %8 jun %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F almgren:2000:CADGP %K genetic algorithms, genetic programming %P 25-32 %0 Thesis %T Investigation of the performance of cracked steel members strengthened with carbon fibre reinforced polymers under impact loads %A Al-Mosawe, Alaa %D 2016 %C Melbourne, Australia %C Faculty of Science, Engineering and Technology, Swinburne University of Technology %F AlMosawe:thesis %X Generally, steel structures are subjected to different types of loadings during their life-time. Over time these structures sustain fewer loads than those for which they were designed. The reduction in structural capacity might occur as a result of various parameters, including aging, changes in use, increases in applied loads, and as a result of environmental effects causing corrosion. These structures need to be strengthened or repaired in order to be able to carry the different applied loads. Carbon fibre reinforced polymers (CFRPs) are a new method of strengthening, the use of which has grown in the last few decades. This method of strengthening has attracted structural engineers due to its ease of application, light weight and very high tensile strength. The bond between CFRP and steel members is the main issue in understanding the bond behaviour. This thesis presents the effect of impact loading on the bond behaviour of CFRP-steel double strap joints. The results of comprehensive experimental tests are presented in this project on the basis of testing large numbers of CFRP-steel double strap joints under both static and dynamic loadings. Another series of tests was conducted to investigate the mechanical properties of the composite material itself. The mechanical properties were investigated under different loading rates, starting from quasi-static loading at 2mm/min, to impact loadings of 201000mm/minute, 258000mm/minute and 300000mm/minute. The experimental results showed that loading rate has a significant effect on the material properties, and a significant increase was shown in tensile strength and modulus of elasticity. The results of another series of tests are presented in this thesis. A number of CFRP-steel double strap joints were prepared and tested under quasi-static loads. Three different types of CFRP modulus (low modulus 165 GPa, normal modulus 205GPa and ultra-high CFRP modulus 460 GPa) were used, to study the effect of CFRP modulus on the bond behaviour between steel and CFRP laminates. In order to investigate the effect of CFRP geometry on the bond properties, two different CFRP sections were used (20 by 1.4mm and 10 by 1.4mm). The results showed a significant influence on the bond strength, strain distribution along the bond, effective bond length and failure mode for specimens with different CFRP modulus. The results also showed that a small CFRP section is sensitive to any little movement. Further tests were also conducted on CFRP-steel double strap specimens with different CFRP moduli under high impact loading rates. The load rates used in this project were 201m per minute, 258m per minute and 300m/min. The aim of this test was to find the degree of joint enhancement under dynamic loadings compared to quasi-static loads. The results showed a significant increase in load-carrying capacity, and strain distribution along the bond. However, a significant decrease in the effective bond length under impact loads was observed compared to quasi-static testing. Different failure modes were shown compared to specimens tested under quasi-static loadings. Finite element analysis was conducted in this research to model the CFRP-steel double strap joint under both quasi-static and dynamic loads. The individual components of the joint (CFRP laminate, Araldite 420 adhesive and steel plates) were first modeled and analysed under the four loading rates. The CFRP-steel double strap joints were modelled using non-linear finite element analysis using the commercial software ABAQUS 6.13. The results showed good prediction of material properties and joint behaviour using non-linear finite element analysis, and the results of tensile joint strength, strain distribution along the bond, effective bond length and failure modes were close to those tested experimentally. This thesis also shows a new formulation of CFRP-steel double strap joints using genetic programming; the data from the experimental and numerical analysis were analysed using genetic programming software. Three different parameters were used: bond length, loading rate and the CFRP modulus. The outcomes of this analysis are showing an expression tree and a new equation to express the bond strength of these types of joints. The results are assumed to be used for the range of parameters used as input data in the programming. Finally, some suggestions on future work to continue the investigation of the bond behaviour between CFRP and steel in the double strap joints are provided. %K genetic algorithms, genetic programming, CFRP, Fe %9 Ph.D. thesis %U http://hdl.handle.net/1959.3/414765 %0 Journal Article %T Strength of Cfrp-steel double strap joints under impact loads using genetic programming %A Al-Mosawe, Alaa %A Kalfat, Robin %A Al-Mahaidi, Riadh %J Composite Structures %D 2017 %V 160 %@ 0263-8223 %F AlMosawe:2017:CS %X Carbon fibre reinforced polymers (CFRPs) are widely used by structural engineers to increase the strength of existing structures subjected to different loading actions. Existing steel structures are subjected to impact loadings due to the presence of new types of loads, and these structures need to be strengthened to sustain the new applied loads. Design guidelines for FRP-strengthened steel structures are not yet available, due to the lack of understanding of bond properties and bond strength. This paper presents the application of genetic programming (GP) to predict the bond strength of CFRP-steel double strap joints subjected to direct tension load. Extensive data from experimental tests and finite element modelling were used to develop a new joint strength formulation. The selected parameters which have a direct impact on the joint strength were: bond length, CFRP modulus and the loading rate. A wide range of loading rates and four CFRP moduli with different bond lengths were used. The prediction of the GP model was compared with the experimental values. The model has a high value of R squared, which indicates good accuracy of results. %K genetic algorithms, genetic programming, Carbon fibre, Genetic programing, Impact behaviour, Joint strength, CFRP-steel joint %9 journal article %R doi:10.1016/j.compstruct.2016.11.016 %U http://www.sciencedirect.com/science/article/pii/S0263822316317767 %U http://dx.doi.org/doi:10.1016/j.compstruct.2016.11.016 %P 1205-1211 %0 Conference Proceedings %T Classification of localized muscle fatigue with genetic programming on sEMG during isometric contraction %A Al-Mulla, M. R. %A Sepulveda, F. %A Colley, M. %A Kattan, A. %S Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2009 %D 2009 %8 February 6 sep %C Minneapolis, Minnesota, USA %F Al-Mulla:2009:EMBC %X Genetic programming is used to generate a solution that can classify localized muscle fatigue from filtered and rectified surface electromyography (sEMG). The GP has two classification phases, the GP training phase and a GP testing phase. In the training phase, the program evolved with multiple components. One component analyzes statistical features extracted from sEMG to chop the signal into blocks and label them using a fuzzy classifier into three classes: non-fatigue, transition-to-fatigue and fatigue. The blocks are then projected onto a two-dimensional Euclidean space via two further (evolved) program components. K-means clustering is then applied to group similar data blocks. Each cluster is then labeled according to its dominant members. The programs that achieve good classification are evolved. In the testing phase, it tests the signal using the evolved components, however without the use of a fuzzy classifier. As the results show the evolved program achieves good classification and it can be used on any unseen isometric sEMG signals to classify fatigue without requiring any further evolution. The GP was able to classify the signal into a meaningful sequence of non-fatigue -> transition-to-fatiguer -> fatigue. By identifying a transition-to fatigue state the GP can give a prediction of an oncoming fatigue. The genetic classifier gave promising results 83.17percent correct classification on average of all signals in the test set, especially considering that the GP is classifying muscle fatigue for ten different individuals. %K genetic algorithms, genetic programming, GP training phase, K-means clustering, fuzzy classifier, isometric contraction, isometric sEMG signal filtering, localized muscle fatigue classification, nonfatigue classifier, rectified surface electromyography, statistical feature extraction, transition-to-fatigue classifier, two-dimensional Euclidean space, biomechanics, electromyography, fatigue, feature extraction, filtering theory, fuzzy logic, medical signal processing, neurophysiology, pattern clustering, signal classification, statistical analysis %R doi:10.1109/IEMBS.2009.5335368 %U http://dx.doi.org/doi:10.1109/IEMBS.2009.5335368 %P 2633-2638 %0 Journal Article %T Evolved pseudo-wavelet function to optimally decompose sEMG for automated classification of localized muscle fatigue %A Al-Mulla, Mohamed R. %A Sepulveda, Francisco %A Colley, M. %J Medical Engineering and Physics %D 2011 %8 may %V 33 %N 4 %F Al-Mulla:2011:MEP %X The purpose of this study was to develop an algorithm for automated muscle fatigue detection in sports related scenarios. Surface electromyography (sEMG) of the biceps muscle was recorded from ten subjects performing semi-isometric (i.e., attempted isometric) contraction until fatigue. For training and testing purposes, the signals were labelled in two classes (Non-Fatigue and Fatigue), with the labelling being determined by a fuzzy classifier using elbow angle and its standard deviation as inputs. A genetic algorithm was used for evolving a pseudo-wavelet function for optimising the detection of muscle fatigue on any unseen sEMG signals. Tuning of the generalised evolved pseudo-wavelet function was based on the decomposition of twenty sEMG trials. After completing twenty independent pseudo-wavelet evolution runs, the best run was selected and then tested on ten previously unseen sEMG trials to measure the classification performance. Results show that an evolved pseudo-wavelet improved the classification of muscle fatigue between 7.31percent and 13.15percent when compared to other wavelet functions, giving an average correct classification of 88.41percent %K genetic algorithms, Localized muscle fatigue, sEMG, Wavelet analysis, matlab %9 journal article %R doi:10.1016/j.medengphy.2010.11.008 %U http://dx.doi.org/doi:10.1016/j.medengphy.2010.11.008 %P 411-417 %0 Conference Proceedings %T Genetic Improvement of Shoreline Evolution Forecasting Models %A Al Najar, Mahmoud %A Almar, Rafael %A Bergsma, Erwin W. J. %A Delvit, Jean-Marc %A Wilson, Dennis G. %Y Bruce, Bobby R. %Y Nowack, Vesna %Y Blot, Aymeric %Y Winter, Emily %Y Langdon, W. B. %Y Petke, Justyna %S GI @ GECCO 2022 %D 2022 %8 September %I Association for Computing Machinery %C Boston, USA %F AlNajar:2022:GI %X Coastal development and climate change are changing the geography of our coasts, while more and more people are moving towards the coasts. Recent advances in artificial intelligence allow for automatic analysis of observational data. Cartesian Genetic Programming (CGP) is a form of Genetic Programming (GP) that has been successfully used in a large variety of tasks including data-driven symbolic regression. We formulate the problem of shoreline evolution forecasting as a Genetic Improvement (GI) problem using CGP to encode and improve upon ShoreFor, an equilibrium shoreline prediction model, to study the effectiveness of CGP in GI in forecasting tasks. This work presents an empirical study of the sensitivity of CGP to a number of evolutionary configurations and constraints and compares the performances of the evolved models to the base ShoreFor model. %K genetic algorithms, genetic programming, genetic improvement, Cartesian Genetic Programming, RCGP, symbolic regression, forecasting, shoreline evolution, earth observation salellite, CGP-ShoreFor, ShorefFor, physical sciences, geography, coastal erosion, Tairua New Zealand, Mielke correlation coefficient %R doi:10.1145/3520304.3534041 %U http://www.cs.ucl.ac.uk/staff/W.Langdon/gecco2022/gi2022/papers/AlNajar_2022_GI.pdf %U http://dx.doi.org/doi:10.1145/3520304.3534041 %P 1916-1923 %0 Conference Proceedings %T Improving a Shoreline Forecasting Model with Symbolic Regression %A Al Najar, Mahmoud %A Almar, Rafael %A Bergsma, Erwin W. J. %A Delvit, Jean-Marc %A Wilson, Dennis G. %S ICLR 2023 Workshop on Tackling Climate Change with Machine Learning %D 2023 %8 April %C Kigali Rwanda %F alnajar2023improving %X Given the current context of climate change and the increasing population densities at coastal zones around the globe, there is an increasing need to be able to predict the development of our coasts. Recent advances in artificial intelligence allow for automatic analysis of observational data. Symbolic Regression (SR) is a type of Machine Learning algorithm that aims to find interpretable symbolic expressions that can explain relations in the data. In this work, we aim to study the problem of forecasting shoreline change using SR. We make use of Cartesian Genetic Programming (CGP) in order to encode and improve upon ShoreFor, a physical shoreline prediction model. During training, CGP individuals are evaluated and selected according to their predictive score at five different coastal sites. This work presents a comparison between a CGP-evolved model and the base ShoreFor model. In addition to evolution’s ability to produce well-performing models, it demonstrates the usefulness of SR as a research tool to gain insight into the behaviors of shorelines in various geographical zones. %K genetic algorithms, genetic programming, Cartesian Genetic Programming, Machine Learning, Interpretable ML, XAI, Symbolic Computation, Earth Observation & Monitoring, Extreme Weather, Ocean, Atmosphere, Hybrid Physical Models, Time-series Analysis %U https://www.climatechange.ai/papers/iclr2023/21 %0 Thesis %T Estimating Coastal Evolution with Machine Learning %A Al Najar, Mahmoud %D 2023 %8 30 nov %C France %C University of Toulouse %F AlNajar:thesis %O forthcoming %X Forecasting coastal evolution is a prerequisite for effective coastal management and has been a fundamental goal of coastal research for decades. However, coastal evolution is a complex process, and predicting its development through time remains challenging. The absence of representative datasets which accurately track the state and evolution of coastal systems greatly limits our ability to study these processes given different natural and anthropological scenarios. While traditional field surveys have been used extensively in the literature and have served as important assets in advancing our knowledge of these systems, the high operational costs of traditional field surveys limit their use to local and sparse spatio-temporal scales. Satellite-based Remote Sensing (RS) provides the opportunity for frequently monitoring the Earth at high temporal resolutions and scales, but requires the development of novel data processing methodologies for large streams of Earth Observation data. Machine Learning (ML) is a subfield of Artificial Intelligence which aims at constructing algorithms able to leverage large amounts of example data in order to automatically construct predictive models, and has been a critical component of many scientific advancements in recent years. This thesis examines the potential and capability of modern ML in two important problems in Coastal Science where ML remains unexplored. Deep Learning and Interpretable Machine Learning are applied to the problems of satellite-derived bathymetry and shoreline evolution modelling. The work demonstrates that ML is competitive with current physics-based baselines on both tasks, and shows the potential of ML in automating many of our large-scale coastal data analysis towards gaining a global understanding of coastal evolution. %K genetic algorithms, genetic programming, genetic improvement, Cartesian Genetic Programming, Deep Learning, Earth Observation, Shoreline forecasting, Bathymetry inversion %9 Ph.D. thesis %U https://www.isae-supaero.fr/IMG/pdf/annonce_soutenance_these_m_al_najar.pdf %0 Journal Article %T Physics-based models, surrogate models and experimental assessment of the vehicle-bridge interaction in braking conditions %A Aloisio, Angelo %A Contento, Alessandro %A Alaggio, Rocco %A Quaranta, Giuseppe %J Mechanical Systems and Signal Processing %D 2023 %V 194 %@ 0888-3270 %F ALOISIO:2023:ymssp %X The dynamics of roadway bridges crossed by vehicles moving at variable speed has attracted far less attention than that generated by vehicles travelling at constant velocity. Consequently, the role of some parameters and the combination thereof, as well as influence and accuracy of the modelling strategies, are not fully understood yet. Therefore, a large statistical analysis is performed in the present study to provide novel insights into the dynamic vehicle-bridge interaction (VBI) in braking conditions. To this end, an existing mid-span prestressed concrete bridge is selected as case study. First, several numerical simulations are performed considering alternative vehicle models (i.e., single and two degrees-of-freedom models) and different braking scenarios (i.e., soft and hard braking conditions, with both stationary and nonstationary road roughness models in case of soft braking). The statistical appraisal of the obtained results unfolds some effects of the dynamic VBI modelling in braking conditions that have not been reported in previous studies. Additionally, the use of machine learning techniques is explored for the first time to develop surrogate models able to predict the effect of the dynamic VBI in braking conditions efficiently. These surrogate models are then employed to obtain the fragility curve for the selected prestressed concrete bridge, where the attainment of the decompression moment is considered as relevant limit state. Whilst the derivation of the fragility curve using numerical simulations turned out to be almost unpractical using standard computational resources, the proposed approach that exploits surrogate models carried out via machine learning techniques was demonstrated accurate despite the dramatic reduction of the total elaboration time. Finally, the accuracy of the numerical (physics-based and surrogate) models is evaluated on a statistical basis through comparisons with experimental data %K genetic algorithms, genetic programming, Bouncing, Braking, Bridge, Fragility curve, Machine learning, Moving load, Neural network, ANN, Pitching, Roughness, Surrogate model, Vehicle-bridge interaction %9 journal article %R doi:10.1016/j.ymssp.2023.110276 %U https://www.sciencedirect.com/science/article/pii/S0888327023001838 %U http://dx.doi.org/doi:10.1016/j.ymssp.2023.110276 %P 110276 %0 Conference Proceedings %T Straight Line Programs: A New Linear Genetic Programming Approach %A Alonso, Cesar L. %A Puente, Jorge %A Montana, Jose Luis %S 20th IEEE International Conference on Tools with Artificial Intelligence, ICTAI ’08 %D 2008 %8 nov %V 2 %F Alonso:2008:ieeeICTAI %X Tree encodings of programs are well known for their representative power and are used very often in Genetic Programming. In this paper we experiment with a new data structure, named straight line program (slp), to represent computer programs. The main features of this structure are described and new recombination operators for GP related to slp’s are introduced. Experiments have been performed on symbolic regression problems. Results are encouraging and suggest that the GP approach based on slp’s consistently outperforms conventional GP based on tree structured representations. %K genetic algorithms, genetic programming, computer programs, data structure, linear genetic programming approach, program tree encoding, straight line programs, symbolic regression problems, linear programming, regression analysis, tree data structures %R doi:10.1109/ICTAI.2008.14 %U http://dx.doi.org/doi:10.1109/ICTAI.2008.14 %P 517-524 %0 Journal Article %T A new Linear Genetic Programming approach based on straight line programs: some Theoretical and Experimental Aspects %A Alonso, Cesar L. %A Montana, Jose Luis %A Puente, Jorge %A Borges, Cruz Enrique %J International Journal on Artificial Intelligence Tools %D 2009 %V 18 %N 5 %F Alonso:2009:IJAIT %X Tree encodings of programs are well known for their representative power and are used very often in Genetic Programming. In this paper we experiment with a new data structure, named straight line program (slp), to represent computer programs. The main features of this structure are described, new recombination operators for GP related to slp’s are introduced and a study of the Vapnik-Chervonenkis dimension of families of slp’s is done. Experiments have been performed on symbolic regression problems. Results are encouraging and suggest that the GP approach based on slp’s consistently outperforms conventional GP based on tree structured representations. %K genetic algorithms, genetic programming, slp, Vapnik-Chervonenkis dimension, VC %9 journal article %R doi:10.1142/S0218213009000391 %U http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.301.3133 %U http://dx.doi.org/doi:10.1142/S0218213009000391 %P 757-781 %0 Conference Proceedings %T Evolution Strategies for Constants Optimization in Genetic Programming %A Alonso, Cesar L. %A Montana, Jose Luis %A Borges, Cruz Enrique %S 21st International Conference on Tools with Artificial Intelligence, ICTAI ’09 %D 2009 %8 nov %F Alonso:2009:ICTAI %X Evolutionary computation methods have been used to solve several optimization and learning problems. This paper describes an application of evolutionary computation methods to constants optimization in genetic programming. A general evolution strategy technique is proposed for approximating the optimal constants in a computer program representing the solution of a symbolic regression problem. The new algorithm has been compared with a recent linear genetic programming approach based on straight-line programs. The experimental results show that the proposed algorithm improves such technique. %K genetic algorithms, genetic programming, computer program, constants optimization, evolutionary computation methods, learning problems, linear genetic programming approach, symbolic regression problem, regression analysis %R doi:10.1109/ICTAI.2009.35 %U http://dx.doi.org/doi:10.1109/ICTAI.2009.35 %P 703-707 %0 Conference Proceedings %T Model Complexity Control in Straight Line Program Genetic Programming %A Alonso, Cesar Luis %A Montana, Jose Luis %A Borges, Cruz Enrique %Y Rosa, Agostinho C. %Y Dourado, Antonio %Y Correia, Kurosh Madani %Y Filipe, Joaquim %Y Kacprzyk, Janusz %S Proceedings of the 5th International Joint Conference on Computational Intelligence, IJCCI 2013 %D 2013 %8 20 22 sep %I SciTePress %C Vilamoura, Algarve, Portugal %F conf/ijcci/AlonsoMB13 %X In this paper we propose a tool for controlling the complexity of Genetic Programming models. The tool is supported by the theory of Vapnik-Chervonekis dimension (VCD) and is combined with a novel representation of models named straight line program. Experimental results, implemented on conventional algebraic structures (such as polynomials) and real problems, show that the empirical risk, penalized by suitable upper bounds for the Vapnik-Chervonenkis dimension, gives a generalisation error smaller than the use of statistical conventional techniques such as Bayesian or Akaike information criteria. %K genetic algorithms, genetic programming %R doi:10.5220/0004554100250036 %U https://ijcci.scitevents.org/Abstract.aspx?idEvent=0fEvcjBHBM8= %U http://dx.doi.org/doi:10.5220/0004554100250036 %P 25-36 %0 Book Section %T Genetic Programming Model Regularization %A Alonso, Cesar L. %A Montana, Jose Luis %A Borges, Cruz Enrique %E Madani, Kurosh %E Dourado, Antonio %E Rosa, Agostinho %E Filipe, Joaquim %E Kacprzyk, Janusz %B Computational Intelligence %S Springer Professional Technik %D 2016 %I Springer %F Alonso:2016:CI %O Selected extended papers from the fifth International Joint Conference on Computational Intelligence (IJCCI 2013), held in Vilamoura, Algarve, Portugal, from 20 to 22 September 2013 %X We propose a tool for controlling the complexity of Genetic Programming models. The tool is supported by the theory of Vapnik-Chervonekis dimension (VCD) and is combined with a novel representation of models named straight line program. Experimental results, implemented on conventional algebraic structures (such as polynomials), show that the empirical risk, penalized by suitable upper bounds for the Vapnik-Chervonenkis dimension, gives a generalization error smaller than the use of statistical conventional techniques such as Bayesian or Akaike information criteria. %K genetic algorithms, genetic programming, VC dimension %R doi:10.1007/978-3-319-23392-5_6 %U https://www.springerprofessional.de/en/genetic-programming-model-regularization/6856568 %U http://dx.doi.org/doi:10.1007/978-3-319-23392-5_6 %P 105-120 %0 Conference Proceedings %T Modelling Medical Time Series Using Grammar-Guided Genetic Programming %A Alonso, Fernando %A Martinez, Loic %A Perez-Perez, Aurora %A Santamaria, Agustin %A Valente, Juan Pedro %Y Perner, Petra %S 8th Industrial Conference in Data Mining, Medical Applications, E-Commerce, Marketing and Theoretical Aspects, ICDM 2008 %S Lecture Notes in Computer Science %D 2008 %8 jul 16 18 %V 5077 %I Springer %C Leipzig, Germany %F conf/incdm/AlonsoMPSV08 %X The analysis of time series is extremely important in the field of medicine, because this is the format of many medical data types. Most of the approaches that address this problem are based on numerical algorithms that calculate distances, clusters, reference models, etc. However, a symbolic rather than numerical analysis is sometimes needed to search for the characteristics of time series. Symbolic information helps users to efficiently analyse and compare time series in the same or in a similar way as a domain expert would. This paper describes the definition of the symbolic domain, the process of converting numerical into symbolic time series and a distance for comparing symbolic temporal sequences. Then, the paper focuses on a method to create the symbolic reference model for a certain population using grammar-guided genetic programming. The work is applied to the isokinetics domain within an application called I4. %K genetic algorithms, genetic programming, Time series characterization, isokinetics, symbolic distance, information extraction, reference model, text mining %R doi:10.1007/978-3-540-70720-2_3 %U http://dx.doi.org/doi:10.1007/978-3-540-70720-2_3 %P 32-46 %0 Conference Proceedings %T GGGP-based method for modeling time series: operator selection, parameter optimization and expert evaluation %A Alonso, Fernando %A Martinez, Loic %A Santamaria, Agustin %A Perez, Aurora %A Valente, Juan Pedro %Y Branke, Juergen %Y Pelikan, Martin %Y Alba, Enrique %Y Arnold, Dirk V. %Y Bongard, Josh %Y Brabazon, Anthony %Y Butz, Martin V. %Y Clune, Jeff %Y Cohen, Myra %Y Deb, Kalyanmoy %Y Engelbrecht, Andries P. %Y Krasnogor, Natalio %Y Miller, Julian F. %Y O’Neill, Michael %Y Sastry, Kumara %Y Thierens, Dirk %Y van Hemert, Jano %Y Vanneschi, Leonardo %Y Witt, Carsten %S GECCO ’10: Proceedings of the 12th annual conference on Genetic and evolutionary computation %D 2010 %8 July 11 jul %I ACM %C Portland, Oregon, USA %F Alonso:2010:gecco %X This paper describes the theoretical and experimental analysis conducted to define the best values for the various operators and parameters of a grammar-guided genetic programming process for creating isokinetic reference models for top competition athletes. Isokinetics is a medical domain that studies the strength exerted by the patient joints (knee, ankle, etc.). We also present an evaluation of the resulting reference models comparing our results with the reference models output using other methods. %K genetic algorithms, genetic programming, grammar-guided genetic programming, Poster %R doi:10.1145/1830483.1830664 %U http://dx.doi.org/doi:10.1145/1830483.1830664 %P 989-990 %0 Journal Article %T Symbolic Regression Model for Predicting Compression Strength of Prismatic Masonry Columns Confined by FRP %A Alotaibi, Khalid Saqer %A Islam, A. B. M. Saiful %J Buildings %D 2023 %V 13 %N 2 %@ 2075-5309 %F alotaibi:2023:Buildings %X The use of Fiber Reinforced Polymer (FRP) materials for the external confinement of existing concrete or masonry members is now an established technical solution. Several studies in the scientific literature show how FRP wrapping can improve the mechanical properties of members. Though there are numerous methods for determining the compressive strength of FRP confined concrete, no generalised formulae are available because of the greater complexity and heterogeneity of FRP-confined masonry. There are two main objectives in this analytical study: (a) proposing an entirely new mathematical expression to estimate the compressive strength of FRP confined masonry columns using symbolic regression model approach which can outperform traditional regression models, and (b) evaluating existing formulas. Over 198 tests of FRP wrapped masonry were compiled in a database and used to train the model. Several formulations from the published literature and international guidelines have been compared against experimental data. It is observed that the proposed symbolic regression model shows excellent performance compared to the existing models. The model is easier, has no restriction and thereby it can be feasibly employed to foresee the behaviour of FRP confined masonry elements. The coefficient of determination for the proposed symbolic regression model is determined as 0.91. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/buildings13020509 %U https://www.mdpi.com/2075-5309/13/2/509 %U http://dx.doi.org/doi:10.3390/buildings13020509 %P ArticleNo.509 %0 Thesis %T Modelling the High-Frequency FX Market: An Agent-Based Approach %A Aloud, Monira Essa %D 2013 %8 apr %C United Kingdom %C Department of Computing and Electronic Systems, University of Essex %F MoniraAloud-Ph.D.Thesis %X In this thesis, we use an agent-based modelling (ABM) approach to model the trading activity in the Foreign Exchange (FX) market which is the most liquid financial market in the world. We first establish the statistical properties (stylised facts) of the trading activity in the FX market using a unique high-frequency dataset of anonymised individual traders’ historical transactions on an account level, spanning 2.25 years. To the best of our knowledge, this dataset is the biggest available high-frequency dataset of individual FX market traders’ historical transactions. We then construct an agentbased FX market (ABFXM) which features a number of distinguishing elements including zero-intelligence directional-change event (ZI-DCT0) trading agents and asynchronous trading-time windows. The individual agents are characterised by different levels of wealth, trading time windows, different profit objectives and risk appetites and initial activation conditions. Using the identified stylized facts as a benchmark, we evaluate the trading activity reproduced from the ABFXM and we establish that this resembles to a satisfactory level the trading activity of the real FX market. In the course of this thesis, we study in depth the constructed ABFXM. We focus on performing a systematic exploration of the constituent elements of the ABFXM and their impact on the dynamics of the FX market behaviour. In particular, our study explores and identifies the essential elements under which the stylised facts of the FX market trading activity are exhibited in the ABFXM. Our study suggests that the key elements are the ZI-DCT0 agents, heterogeneity which has been embedded in our model in different ways, asynchronous trading time windows, initial activation conditions and the generation of limit orders. We also show that the dynamics of the market trading activity depend on the number of agents one considers. We explore the emergence of the stylised facts in the trading activity when the ABFXM is populated with agents with three different strategies: a variation of the zero-intelligence with a constraint (ZI-CV) strategy; the ZI-DCT0 strategy; and a genetic programming-based (GP) strategy. Our results show that the ZI-DCT0 agents best reproduce and explain the stylised facts observed in the FX market transactions data. Our study suggests that some the observed stylised facts could be the result of introducing a threshold which triggers the agents to respond to fixed periodic patterns in the price time series. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://fac.ksu.edu.sa/sites/default/files/MoniraAloud-Ph.D.Thesis.pdf %0 Journal Article %T Modeling the High-Frequency FX Market: An Agent-Based Approach %A Aloud, Monira %A Fasli, Maria %A Tsang, Edward %A Dupuis, Alexander %A Olsen, Richard %J Computational Intelligence %D 2017 %8 nov %V 33 %N 4 %@ 1467-8640 %F aloud:2017:coin %X The development of computational intelligence-based strategies for electronic markets has been the focus of intense research. To be able to design efficient and effective automated trading strategies, one first needs to understand the workings of the market, the strategies that traders use, and their interactions as well as the patterns emerging as a result of these interactions. In this article, we develop an agent-based model of the foreign exchange (FX) market, which is the market for the buying and selling of currencies. Our agent-based model of the FX market comprises heterogeneous trading agents that employ a strategy that identifies and responds to periodic patterns in the price time series. We use the agent-based model of the FX market to undertake a systematic exploration of its constituent elements and their impact on the stylised facts (statistical patterns) of transactions data. This enables us to identify a set of sufficient conditions that result in the emergence of the stylized facts similarly to the real market data, and formulate a model that closely approximates the stylized facts. We use a unique high-frequency data set of historical transactions data that enables us to run multiple simulation runs and validate our approach and draw comparisons and conclusions for each market setting. %K genetic algorithms, genetic programming, agent-based modeling, agent-based simulation, electronic markets, FX markets, stylized facts. %9 journal article %R doi:10.1111/coin.12114 %U http://repository.essex.ac.uk/18823/ %U http://dx.doi.org/doi:10.1111/coin.12114 %P 771-825 %0 Journal Article %T Book Review: Lee Spector $\bullet$ Automatic Quantum Computer Programming: A Genetic Programming Approach. Kluwer Academic Publishers (2004). ISBN 1-4020-7894-3. 100. 153 pp. %A Al-Rabadi, Anas N. %J The Computer Journal %D 2006 %8 jan %V 49 %N 1 %@ 0010-4620 %F Al-Rabadi:2006:EPB %K genetic algorithms, genetic programming %9 journal article %R doi:10.1093/comjnl/bxh134 %U http://comjnl.oxfordjournals.org/cgi/content/full/49/1/129 %U http://dx.doi.org/doi:10.1093/comjnl/bxh134 %P 129-130 %0 Conference Proceedings %T A smart agent to trade and predict foreign exchange market %A Alrefaie, Mohamed Taher %A Hamouda, Alaa-Aldine %A Ramadan, Rabie %S IEEE Symposium on Computational Intelligence for Engineering Solutions (CIES 2013) %D 2013 %8 apr %F Alrefaie:2013:CIES %X Foreign Exchange market is a worldwide market to exchange currencies with 3.98 trillion US dollars daily turnover. With such a massive turnover, probability of profit is very high; however, trading in such massive market needs high knowledge, skills and full commitment in order to achieve high profit. The purpose of this work is to design a smart agent that 1) acquire Foreign Exchange market prices, 2) pre-processes it, 3) predicts future trend using Genetic Programming approach and Adaptive Neuro-fuzzy Inference System and 4) makes a buy/sell decision to maximise profitability with no human supervision. %K genetic algorithms, genetic programming, foreign exchange trading, probability, US dollars daily turnover, adaptive neuro-fuzzy inference system, foreign exchange market, genetic programming approach, probability, smart agent, Companies, Fluctuations, Market research, Prediction algorithms, Predictive models, Profitability, ANFI, Agent, Forex, NSGA-II, Prediction %R doi:10.1109/CIES.2013.6611741 %U http://dx.doi.org/doi:10.1109/CIES.2013.6611741 %P 141-148 %0 Journal Article %T Utilization of magnetic water in cementitious adhesive for near-surface mounted CFRP strengthening system %A Al-Safy, Rawaa %A Al-Mosawe, Alaa %A Al-Mahaidi, Riadh %J Construction and Building Materials %D 2019 %V 197 %@ 0950-0618 %F ALSAFY:2019:CBM %X Cement-based adhesive (CBA) is used as a bonding agent in Carbon Fibre Reinforced Polymer (CFRP) applications as an alternative to epoxy-based adhesive due to the drawbacks of the epoxy system under severe service conditions which negatively affect the bond between the CFRP and strengthened elements. This paper reports the results of, an investigation carried out to develop two types of CBA using magnetized water (MW) for mixing and curing. Two magnetic devices (MD-I and MD-II), with different magnetic field strengths (9000 and 6000 Gauss) respectively, were employed for water magnetization. Different water flows with different water circulation times in the magnetizer were used for each device. Compressive and splitting tensile strength tests of the magnetized CBA (MCBA) were conducted for different curing periods (3. 7, 14, 21 and 28a days) using MW. It was found that MW treatment increases the strength of CBA. The highest strength was obtained for MCBA samples when MD-I was used at a low flow rate (Fa =a 0.1a m3/hr) for 15 mins of circulation time (T). The latter was found to positively affect MCBA properties when T was increased from 15a min to 60a mins. Prediction of the compressive and tensile strength values are also studied in this paper using genetic programming, the models showed good correlation with the experimental data %K genetic algorithms, genetic programming, Magnetic water, Cement-based adhesive, NSM, CFRP, Concrete, GP modelling %9 journal article %R doi:10.1016/j.conbuildmat.2018.11.219 %U http://www.sciencedirect.com/science/article/pii/S0950061818329143 %U http://dx.doi.org/doi:10.1016/j.conbuildmat.2018.11.219 %P 474-488 %0 Conference Proceedings %T Automatic feature extraction and image classification using genetic programming %A Al-Sahaf, Harith %A Neshatian, Kourosh %A Zhang, Mengjie %S 5th International Conference on Automation, Robotics and Applications (ICARA 2011) %D 2011 %8 June 8 dec %C Wellington, New Zealand %F Al-Sahaf:2011:ICARA %X In this paper, we propose a multilayer domain-independent GP-based approach to feature extraction and image classification. We propose two different structures for the system and compare the results with a baseline approach in which domain-specific pre-extracted features are used for classification. In the baseline approach, human/domain expert intervention is required to perform the task of feature extraction. The proposed approach, however, extracts (evolves) features and generates classifiers all automatically in one loop. The experiments are conducted on four image data sets. The results show that the proposed approach can achieve better performance compared to the baseline while removing the human from the loop. %K genetic algorithms, genetic programming, feature extraction, human-domain expert intervention, image classification, multilayer domain-independent GP-based approach, feature extraction, image classification %R doi:10.1109/ICARA.2011.6144874 %U http://dx.doi.org/doi:10.1109/ICARA.2011.6144874 %P 157-162 %0 Conference Proceedings %T Extracting Image Features for Classification By Two-Tier Genetic Programming %A Al-Sahaf, Harith %A Song, Andy %A Neshatian, Kourosh %A Zhang, Mengjie %Y Li, Xiaodong %S Proceedings of the 2012 IEEE Congress on Evolutionary Computation %D 2012 %8 October 15 jun %C Brisbane, Australia %@ 0-7803-8515-2 %F Al-Sahaf:2012:CEC %X Image classification is a complex but important task especially in the areas of machine vision and image analysis such as remote sensing and face recognition. One of the challenges in image classification is finding an optimal set of features for a particular task because the choice of features has direct impact on the classification performance. However the goodness of a feature is highly problem dependent and often domain knowledge is required. To address these issues we introduce a Genetic Programming (GP) based image classification method, Two-Tier GP, which directly operates on raw pixels rather than features. The first tier in a classifier is for automatically defining features based on raw image input, while the second tier makes decision. Compared to conventional feature based image classification methods, Two-Tier GP achieved better accuracies on a range of different tasks. Furthermore by using the features defined by the first tier of these Two-Tier GP classifiers, conventional classification methods obtained higher accuracies than classifying on manually designed features. Analysis on evolved Two-Tier image classifiers shows that there are genuine features captured in the programs and the mechanism of achieving high accuracy can be revealed. The Two-Tier GP method has clear advantages in image classification, such as high accuracy, good interpretability and the removal of explicit feature extraction process. %K genetic algorithms, genetic programming, Evolutionary Computer Vision %R doi:10.1109/CEC.2012.6256412 %U http://dx.doi.org/doi:10.1109/CEC.2012.6256412 %P 1630-1637 %0 Journal Article %T Two-Tier genetic programming: towards raw pixel-based image classification %A Al-Sahaf, Harith %A Song, Andy %A Neshatian, Kourosh %A Zhang, Mengjie %J Expert Systems with Applications %D 2012 %V 39 %N 16 %@ 0957-4174 %F AlSahaf2012 %X Classifying images is of great importance in machine vision and image analysis applications such as object recognition and face detection. Conventional methods build classifiers based on certain types of image features instead of raw pixels because the dimensionality of raw inputs is often too large. Determining an optimal set of features for a particular task is usually the focus of conventional image classification methods. In this study we propose a Genetic Programming (GP) method by which raw images can be directly fed as the classification inputs. It is named as Two-Tier GP as every classifier evolved by it has two tiers, the other for computing features based on raw pixel input, one for making decisions. Relevant features are expected to be self-constructed by GP along the evolutionary process. This method is compared with feature based image classification by GP and another GP method which also aims to automatically extract image features. Four different classification tasks are used in the comparison, and the results show that the highest accuracies are achieved by Two-Tier GP. Further analysis on the evolved solutions reveals that there are genuine features formulated by the evolved solutions which can classify target images accurately. %K genetic algorithms, genetic programming, Evolutionary computation, Feature extraction, Feature selection, Image classification %9 journal article %R doi:10.1016/j.eswa.2012.02.123 %U http://www.sciencedirect.com/science/article/pii/S0957417412003867 %U http://dx.doi.org/doi:10.1016/j.eswa.2012.02.123 %P 12291-12301 %0 Conference Proceedings %T Hybridisation of Genetic Programming and Nearest Neighbour for Classification %A Al-Sahaf, Harith %A Song, Andy %A Zhang, Mengjie %Y de la Fraga, Luis Gerardo %S 2013 IEEE Conference on Evolutionary Computation %D 2013 %8 jun 20 23 %V 1 %C Cancun, Mexico %F Al-Sahaf:2013:CEC %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2013.6557889 %U http://dx.doi.org/doi:10.1109/CEC.2013.6557889 %P 2650-2657 %0 Conference Proceedings %T Binary image classification using genetic programming based on local binary patterns %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %S 28th International Conference of Image and Vision Computing New Zealand (IVCNZ 2013) %D 2013 %8 nov %I IEEE Press %C Wellington %F Al-Sahaf:2013:IVCNZ %X Image classification represents an important task in machine learning and computer vision. To capture features covering a diversity of different objects, it has been observed that a sufficient number of learning instances are required to efficiently estimate the models’ parameter values. In this paper, we propose a genetic programming (GP) based method for the problem of binary image classification that uses a single instance per class to evolve a classifier. The method uses local binary patterns (LBP) as an image descriptor, support vector machine (SVM) as a classifier, and a one-way analysis of variance (ANOVA) as an analyser. Furthermore, a multi-objective fitness function is designed to detect distinct and informative regions of the images, and measure the goodness of the wrapped classifiers. The performance of the proposed method has been evaluated on six data sets and compared to the performances of both GP based (Two-tier GP and conventional GP) and non-GP (Naive Bayes, Support Vector Machines and hybrid Naive Bayes/Decision Trees) methods. The results show that a comparable or significantly better performance has been achieved by the proposed method over all methods on all of the data sets considered. %K genetic algorithms, genetic programming, computer vision, image classification, learning (artificial intelligence), statistical analysis, ANOVA, GP based methods, LBP, SVM, binary image classification, computer vision, image descriptor, learning instances, local binary patterns, machine learning, nonGP methods, one-way analysis of variance, support vector machine, wrapped classifiers, Accuracy, Analysis of variance, Feature extraction, Histograms, Support vector machines, Training, Vectors %R doi:10.1109/IVCNZ.2013.6727019 %U http://dx.doi.org/doi:10.1109/IVCNZ.2013.6727019 %P 220-225 %0 Conference Proceedings %T A One-Shot Learning Approach to Image Classification Using Genetic Programming %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %Y Cranefield, Stephen %Y Nayak, Abhaya %S Proceedings of the 26th Australasian Joint Conference on Artificial Intelligence (AI2013) %S LNAI %D 2013 %8 January 6 dec %V 8272 %I Springer %C Dunedin, New Zealand %F Al-Sahaf:2013:AI %X In machine learning, it is common to require a large number of instances to train a model for classification. In many cases, it is hard or expensive to acquire a large number of instances. In this paper, we propose a novel genetic programming (GP) based method to the problem of automatic image classification via adopting a one-shot learning approach. The proposed method relies on the combination of GP and Local Binary Patterns (LBP) techniques to detect a predefined number of informative regions that aim at maximising the between-class scatter and minimising the within-class scatter. Moreover, the proposed method uses only two instances of each class to evolve a classifier. To test the effectiveness of the proposed method, four different texture data sets are used and the performance is compared against two other GP-based methods namely Conventional GP and Two-tier GP. The experiments revealed that the proposed method outperforms these two methods on all the data sets. Moreover, a better performance has been achieved by Naive Bayes, Support Vector Machine, and Decision Trees (J48) methods when extracted features by the proposed method have been used compared to the use of domain-specific and Two-tier GP extracted features. %K genetic algorithms, genetic programming, Local Binary Patterns, Image Classification, One-shot Learning %R doi:10.1007/978-3-319-03680-9_13 %U http://dx.doi.org/10.1007/978-3-319-03680-9_13 %U http://dx.doi.org/doi:10.1007/978-3-319-03680-9_13 %P 110-122 %0 Conference Proceedings %T Genetic Programming Evolved Filters from a Small Number of Instances for Multiclass Texture Classification %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %Y Cree, Michael J. %Y Streeter, Lee V. %Y Perrone, John %Y Mayo, Michael %Y Blake, Anthony M. %S Proceedings of the 29th International Conference on Image and Vision Computing New Zealand, IVCNZ 2014 %D 2014 %8 nov 19 21 %I ACM %C Hamilton, New Zealand %F conf/ivcnz/Al-SahafZJ14 %X Texture classification is an essential task in pattern recognition and computer vision. In this paper, a novel genetic programming (GP) based method is proposed for the task of multiclass texture classification. The proposed method evolves a set of filters using only two instances per class. Moreover, the evolved program operates directly on the raw pixel values and does not require human intervention to perform feature selection and extraction. Two well-known and widely used data sets are used in this study to evaluate the performance of the proposed method. The performance of the new method is compared to that of two GP-based methods using the raw pixel values, and six non-GP methods using three different sets of domain-specific features. The results show that the proposed method has significantly outperformed the other methods on both data sets. %K genetic algorithms, genetic programming, Multiclass classification, Textures %R doi:10.1145/2683405.2683418 %U http://dl.acm.org/citation.cfm?id=2683405 %U http://dx.doi.org/doi:10.1145/2683405.2683418 %P 84-89 %0 Conference Proceedings %T Genetic Programming for Multiclass Texture Classification Using a Small Number of Instances %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %Y Dick, Grant %Y Browne, Will N. %Y Whigham, Peter A. %Y Zhang, Mengjie %Y Bui, Lam Thu %Y Ishibuchi, Hisao %Y Jin, Yaochu %Y Li, Xiaodong %Y Shi, Yuhui %Y Singh, Pramod %Y Tan, Kay Chen %Y Tang, Ke %S Simulated Evolution and Learning - 10th International Conference, SEAL 2014, Dunedin, New Zealand, December 15-18, 2014. Proceedings %S Lecture Notes in Computer Science %D 2014 %V 8886 %I Springer %F conf/seal/Al-SahafZJ14 %K genetic algorithms, genetic programming %U http://dx.doi.org/10.1007/978-3-319-13563-2 %P 335-346 %0 Conference Proceedings %T Image Descriptor: A Genetic Programming Approach to Multiclass Texture Classification %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %A Verma, Brijesh %Y Murata, Yadahiko %S Proceedings of 2015 IEEE Congress on Evolutionary Computation (CEC 2015) %D 2015 %8 25 28 may %I IEEE Press %C Sendai, Japan %F Al-Sahaf:2015:CEC %X Texture classification is an essential task in computer vision that aims at grouping instances that have a similar repetitive pattern into one group. Detecting texture primitives can be used to discriminate between materials of different types. The process of detecting prominent features from the texture instances represents a cornerstone step in texture classification. Moreover, building a good model using a few training instances is difficult. In this study, a genetic programming (GP) descriptor is proposed for the task of multiclass texture classification. The proposed method synthesises a set of mathematical formulas relying on the raw pixel values and a sliding window of a predetermined size. Furthermore, only two instances per class are used to automatically evolve a descriptor that has the potential to effectively discriminate between instances of different textures using a simple instance-based classifier to perform the classification task. The performance of the proposed approach is examined using two widely-used data sets, and compared with two GP-based and nine well-known non-GP methods. Furthermore, three hand-crafted domain-expert designed feature extraction methods have been used with the non-GP methods to examine the effectiveness of the proposed method. The results show that the proposed method has significantly outperformed all these other methods on both data sets, and the new method evolves a descriptor that is capable of achieving significantly better performance compared to hand-crafted features. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2015.7257190 %U http://dx.doi.org/doi:10.1109/CEC.2015.7257190 %P 2460-2467 %0 Conference Proceedings %T Evolutionary Image Descriptor: A Dynamic Genetic Programming Representation for Feature Extraction %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %Y Silva, Sara %Y Esparcia-Alcazar, Anna I. %Y Lopez-Ibanez, Manuel %Y Mostaghim, Sanaz %Y Timmis, Jon %Y Zarges, Christine %Y Correia, Luis %Y Soule, Terence %Y Giacobini, Mario %Y Urbanowicz, Ryan %Y Akimoto, Youhei %Y Glasmachers, Tobias %Y Fernandez de Vega, Francisco %Y Hoover, Amy %Y Larranaga, Pedro %Y Soto, Marta %Y Cotta, Carlos %Y Pereira, Francisco B. %Y Handl, Julia %Y Koutnik, Jan %Y Gaspar-Cunha, Antonio %Y Trautmann, Heike %Y Mouret, Jean-Baptiste %Y Risi, Sebastian %Y Costa, Ernesto %Y Schuetze, Oliver %Y Krawiec, Krzysztof %Y Moraglio, Alberto %Y Miller, Julian F. %Y Widera, Pawel %Y Cagnoni, Stefano %Y Merelo, J. J. %Y Hart, Emma %Y Trujillo, Leonardo %Y Kessentini, Marouane %Y Ochoa, Gabriela %Y Chicano, Francisco %Y Doerr, Carola %S GECCO ’15: Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation %D 2015 %8 November 15 jul %I ACM %C Madrid, Spain %F Al-Sahaf:2015:GECCO %X Texture classification aims at categorising instances that have a similar repetitive pattern. In computer vision, texture classification represents a fundamental element in a wide variety of applications, which can be performed by detecting texture primitives of the different classes. Using image descriptors to detect prominent features has been widely adopted in computer vision. Building an effective descriptor becomes more challenging when there are only a few labelled instances. This paper proposes a new Genetic Programming (GP) representation for evolving an image descriptor that operates directly on the raw pixel values and uses only two instances per class. The new method synthesises a set of mathematical formulas that are used to generate the feature vector, and the classification is then performed using a simple instance-based classifier. Determining the length of the feature vector is automatically handled by the new method. Two GP and nine well-known non-GP methods are compared on two texture image data sets for texture classification in order to test the effectiveness of the proposed method. The proposed method is also compared to three hand-crafted descriptors namely domain-independent features, local binary patterns, and Haralick texture features. The results show that the proposed method has superior performance over the competitive methods. %K genetic algorithms, genetic programming %R doi:10.1145/2739480.2754661 %U http://doi.acm.org/10.1145/2739480.2754661 %U http://dx.doi.org/doi:10.1145/2739480.2754661 %P 975-982 %0 Journal Article %T Binary Image Classification: A Genetic Programming Approach to the Problem of Limited Training Instances %A Al-Sahaf, Harith %A Zhang, Mengjie %A Johnston, Mark %J Evolutionary Computation %D 2016 %8 Spring %V 24 %N 1 %@ 1063-6560 %F Al-Sahaf:2015:EC %X In the Computer Vision and Pattern Recognition fields, image classification represents an important, yet difficult, task to perform. The remarkable ability of the human visual system, which relies on only one or a few instances to learn a completely new class or an object of a class, is a challenge to build effective computer models to replicate this ability. Recently, we have proposed two Genetic Programming (GP) based methods, One-shot GP and Compound-GP, that aim to evolve a program for the task of binary classification in images. The two methods are designed to use only one or a few instances per class to evolve the model. In this study, we investigate these two methods in terms of performance, robustness, and complexity of the evolved programs. Ten data sets that vary in difficulty have been used to evaluate these two methods. We also compare them with two other GP and six non-GP methods. The results show that One-shot GP and Compound-GP outperform or achieve comparable results to other competitor methods. Moreover, the features extracted by these two methods improve the performance of other classifiers with handcrafted features and those extracted by a recently developed GP-based method in most cases %K genetic algorithms, genetic programming, Local Binary Patterns, One-shot Learning, Image Classification %9 journal article %R doi:10.1162/EVCO_a_00146 %U http://dx.doi.org/doi:10.1162/EVCO_a_00146 %P 143-182 %0 Journal Article %T Automatically Evolving Rotation-invariant Texture Image Descriptors by Genetic Programming %A Al-Sahaf, Harith %A Al-Sahaf, Ausama %A Xue, Bing %A Johnston, Mark %A Zhang, Mengjie %J IEEE Transactions on Evolutionary Computation %D 2017 %8 feb %V 21 %N 1 %F Al-Sahaf:2016:ieeeTEC %9 journal article %R doi:10.1109/TEVC.2016.2577548 %U http://dx.doi.org/doi:10.1109/TEVC.2016.2577548 %P 83-101 %0 Journal Article %T Automatically Evolving Rotation-Invariant Texture Image Descriptors by Genetic Programming %A Al-Sahaf, Harith %A Al-Sahaf, Ausama %A Xue, Bing %A Johnston, Mark %A Zhang, Mengjie %J IEEE Transactions on Evolutionary Computation %D 2017 %8 feb %V 21 %N 1 %@ 1089-778X %F Al-Sahaf:2017a:ieeeTEC %X In computer vision, training a model that performs classification effectively is highly dependent on the extracted features, and the number of training instances. Conventionally, feature detection and extraction are performed by a domain expert who, in many cases, is expensive to employ and hard to find. Therefore, image descriptors have emerged to automate these tasks. However, designing an image descriptor still requires domain-expert intervention. Moreover, the majority of machine learning algorithms require a large number of training examples to perform well. However, labelled data is not always available or easy to acquire, and dealing with a large dataset can dramatically slow down the training process. In this paper, we propose a novel genetic programming-based method that automatically synthesises a descriptor using only two training instances per class. The proposed method combines arithmetic operators to evolve a model that takes an image and generates a feature vector. The performance of the proposed method is assessed using six datasets for texture classification with different degrees of rotation and is compared with seven domain-expert designed descriptors. The results show that the proposed method is robust to rotation and has significantly outperformed, or achieved a comparable performance to, the baseline methods. %K genetic algorithms, genetic programming, Classification, feature extraction, image descriptor, keypoint detection %9 journal article %R doi:10.1109/TEVC.2016.2577548 %U http://dx.doi.org/doi:10.1109/TEVC.2016.2577548 %P 83-101 %0 Conference Proceedings %T Evolving Texture Image Descriptors Using a Multitree Genetic Programming Representation %A Al-Sahaf, Harith %A Xue, Bing %A Zhang, Mengjie %S Proceedings of the Genetic and Evolutionary Computation Conference Companion %S GECCO ’17 %D 2017 %8 15 19 jul %I ACM %C Berlin, Germany %F Al-Sahaf:2017:GECCO %X Image descriptors play very important roles in a wide range of applications in computer vision and pattern recognition. In this paper, a multitree genetic programming method to automatically evolve image descriptors for multiclass texture image classification task is proposed. Instead of using domain knowledge, the proposed method uses only a few instances of each class to automatically identify a set of features that are distinctive between the instances of different classes. The results on seven texture classification datasets show significant, or comparable, performance has been achieved by the proposed method compared with the baseline method and six state-of-the-art methods. %K genetic algorithms, genetic programming, multiclass classification, multitree, textures %R doi:10.1145/3067695.3076039 %U http://doi.acm.org/10.1145/3067695.3076039 %U http://dx.doi.org/doi:10.1145/3067695.3076039 %P 219-220 %0 Conference Proceedings %T A Multitree Genetic Programming Representation for Automatically Evolving Texture Image Descriptors %A Al-Sahaf, Harith %A Xue, Bing %A Zhang, Mengjie %Y Shi, Yuhui %Y Tan, Kay Chen %Y Zhang, Mengjie %Y Tang, Ke %Y Li, Xiaodong %Y Zhang, Qingfu %Y Tan, Ying %Y Middendorf, Martin %Y Jin, Yaochu %S Proceedings of the 11th International Conference on Simulated Evolution and Learning, SEAL 2017 %S Lecture Notes in Computer Science %D 2017 %8 nov 10 13 %V 10593 %I Springer %C Shenzhen, China %F conf/seal/Al-SahafXZ17 %X Image descriptors are very important components in computer vision and pattern recognition that play critical roles in a wide range of applications. The main task of an image descriptor is to automatically detect micro-patterns in an image and generate a feature vector. A domain expert is often needed to undertake the process of developing an image descriptor. However, such an expert, in many cases, is difficult to find or expensive to employ. In this paper, a multitree genetic programming representation is adopted to automatically evolve image descriptors. Unlike existing hand-crafted image descriptors, the proposed method does not rely on predetermined features, instead, it automatically identifies a set of features using a few instances of each class. The performance of the proposed method is assessed using seven benchmark texture classification datasets and compared to seven state-of-the-art methods. The results show that the new method has significantly outperformed its counterpart methods in most cases. %K genetic algorithms, genetic programming, Multitree, Image classification, Feature extraction %R doi:10.1007/978-3-319-68759-9_41 %U http://dx.doi.org/doi:10.1007/978-3-319-68759-9_41 %P 499-511 %0 Journal Article %T Keypoints Detection and Feature Extraction: A Dynamic Genetic Programming Approach for Evolving Rotation-invariant Texture Image Descriptors %A Al-Sahaf, Harith %A Zhang, Mengjie %A Al-Sahaf, Ausama %A Johnston, Mark %J IEEE Transactions on Evolutionary Computation %D 2017 %8 dec %V 21 %N 6 %@ 1089-778X %F Al-Sahaf:2017:ieeeTEC %X The goodness of the features extracted from the instances and the number of training instances are two key components in machine learning, and building an effective model is largely affected by these two factors. Acquiring a large number of training instances is very expensive in some situations such as in the medical domain. Designing a good feature set, on the other hand, is very hard and often requires domain expertise. In computer vision, image descriptors have emerged to automate feature detection and extraction; however, domain-expert intervention is typically needed to develop these descriptors. The aim of this paper is to use Genetic Programming to automatically construct a rotation-invariant image descriptor by synthesising a set of formulae using simple arithmetic operators and first-order statistics, and determining the length of the feature vector simultaneously using only two instances per class. Using seven texture classification image datasets, the performance of the proposed method is evaluated and compared against eight domain-expert hand-crafted image descriptors. Quantitatively, the proposed method has significantly outperformed, or achieved comparable performance to, the competitor methods. Qualitatively, the analysis shows that the descriptors evolved by the proposed method can be interpreted. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1109/TEVC.2017.2685639 %U http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7885048 %U http://dx.doi.org/doi:10.1109/TEVC.2017.2685639 %P 825-844 %0 Thesis %T Genetic Programming for Automatically Synthesising Robust Image Descriptors with A Small Number of Instances %A Al-Sahaf, Harith %D 2017 %C New Zealand %C School of Engineering and Computer Science, Victoria University of Wellington %F Al-Sahaf:thesis %X Image classification is a core task in many applications of computer vision, including object detection and recognition. It aims at analysing the visual content and automatically categorising a set of images into different groups. Performing image classification can largely be affected by the features used to perform this task. Extracting features from images is a challenging task due to the large search space size and practical requirements such as domain knowledge and human intervention. Human intervention is usually needed to identify a good set of keypoints (regions of interest), design a set of features to be extracted from those keypoints such as lines and corners, and develop a way to extract those features. Automating these tasks has great potential to dramatically decrease the time and cost, and may potentially improve the performance of the classification task. There are two well-recognised approaches in the literature to automate the processes of identifying keypoints and extracting image features. Designing a set of domain-independent features is the first approach, where the focus is on dividing the image into a number of predefined regions and extracting features from those regions. The second approach is synthesising a function or a set of functions to form an image descriptor that aims at automatically detecting a set of keypoints such as lines and corners, and performing feature extraction. Although employing image descriptors is more effective and very popular in the literature, designing those descriptors is a difficult task that in most cases requires domain-expert intervention. The overall goal of this thesis is to develop a new domain independent Genetic Programming (GP) approach to image classification by using GP to evolve programs that are capable of automatically detecting diverse and informative keypoints, designing a set of features, and performing feature extraction using only a small number of training instances to facilitate image classification, and are robust to different image changes such as illumination and rotation. This thesis focuses on incorporating a variety of simple arithmetic operators and first-order statistics (mid-level features) into the evolutionary process and on representation of GP to evolve programs that are robust to image changes for image classification. This thesis proposes methods for domain-independent binary classification in images using GP to automatically identify regions within an image that have the potential to improve classification while considering the limitation of having a small training set. Experimental results show that in over 67percent of cases the new methods significantly outperform the use of existing hand-crafted features and features automatically detected by other methods. This thesis proposes the first GP approach for automatically evolving an illumination-invariant dense image descriptor that detects automatically designed keypoints, and performs feature extraction using only a few instances of each class. The experimental results show improvement of 86percent on average compared to two GP-based methods, and can significantly outperform domain-expert hand-crafted descriptors in more than 89percent of the cases. This thesis also considers rotation variation of images and proposes a method for automatically evolving rotation-invariant image descriptors through integrating a set of first-order statistics as terminals. Compared to hand-crafted descriptors, the experimental results reveal that the proposed method has significantly better performance in more than 83percent of the cases. This thesis proposes a new GP representation that allows the system to automatically choose the length of the feature vector side-by-side with evolving an image descriptor. Automatically determining the length of the feature vector helps to reduce the number of the parameters to be set. The results show that this method has evolved descriptors with a very small feature vector which yet still significantly outperform the competitive methods in more than 91percent of the cases. This thesis proposes a method for transfer learning by model in GP, where an image descriptor evolved on instances of a related problem (source domain) is applied directly to solve a problem being tackled (target domain). The results show that the new method evolves image descriptors that have better generalisability compared to hand-crafted image descriptors. Those automatically evolved descriptors show positive influence on classifying the target domain datasets in more than 56percent of the cases. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://hdl.handle.net/10063/6177 %0 Journal Article %T Automatically Evolving Texture Image Descriptors using the Multi-tree Representation in Genetic Programming using Few Instances %A Al-Sahaf, Harith %A Al-Sahaf, Ausama %A Xue, Bing %A Zhang, Mengjie %J Evolutionary Computation %D 2021 %8 Fall %V 29 %N 3 %@ 1063-6560 %F Al-Sahaf:EC %X The performance of image classification is highly dependent on the quality of the extracted features that are used to build a model. Designing such features usually requires prior knowledge of the domain and is often undertaken by a domain expert who, if available, is very costly to employ. Automating the process of designing such features can largely reduce the cost and efforts associated with this task. Image descriptors, such as local binary patterns, have emerged in computer vision, and aim at detecting keypoints, for example, corners, line-segments, and shapes, in an image and extracting features from those key points. In this article, genetic programming (GP) is used to automatically evolve an image descriptor using only two instances per class by using a multitree program representation. The automatically evolved descriptor operates directly on the raw pixel values of an image and generates the corresponding feature vector. Seven well-known datasets were adapted to the few-shot setting and used to assess the performance of the proposed method and compared against six handcrafted and one evolutionary computation-based image descriptor as well as three convolutional neural network (CNN) based methods. The experimental results show that the new method has significantly outperformed the competitor image descriptors and CNN-based methods. Furthermore, different patterns have been identified from analysing the evolved programs. %K genetic algorithms, genetic programming, ANN, image descriptor, multi-tree, image classification, feature extraction %9 journal article %R doi:10.1162/evco_a_00284 %U https://doi.org/10.1162/evco_a_00284 %U http://dx.doi.org/doi:10.1162/evco_a_00284 %P 331-366 %0 Conference Proceedings %T Automated Re-invention of a Previously Patented Optical Lens System Using Genetic Programming %A Al-Sakran, Sameer H. %A Koza, John R. %A Jones, Lee W. %Y Keijzer, Maarten %Y Tettamanzi, Andrea %Y Collet, Pierre %Y van Hemert, Jano I. %Y Tomassini, Marco %S Proceedings of the 8th European Conference on Genetic Programming %S Lecture Notes in Computer Science %D 2005 %8 30 mar 1 apr %V 3447 %I Springer %C Lausanne, Switzerland %@ 3-540-25436-6 %F eurogp:Al-SakranKJ05 %X The three dozen or so known instances of human-competitive designs produced by genetic programming for antennas, mechanical systems, circuits, and controllers raise the question of whether the genetic programming can be extended to the design of complex structures from other fields. This paper discusses efforts to apply genetic programming to the automated design of optical lens systems. The paper can be read from two different perspectives. First, broadly, it chronicles the step-by-step process by which the authors approached the problem of applying genetic programming to a domain that was new to them. Second, more narrowly, it describes the use of genetic programming to re-create the complete design for the previously patented Tackaberry-Muller optical lens system. Genetic programming accomplished this ’from scratch’ without starting from a pre-specified number of lens and a pre-specified layout and without starting from a pre-existing good design. The genetically evolved design for the Tackaberry-Muller lens system is an example, in the field of optical design, of a human-competitive result produced by genetic programming. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-540-31989-4_3 %U http://dx.doi.org/doi:10.1007/978-3-540-31989-4_3 %P 25-37 %0 Conference Proceedings %T Genetic Programming Testing Model %A Al Sallami, Nada M. A. %Y Ao, S. I. %Y Gelman, Len %Y Hukins, David WL %Y Hunter, Andrew %Y Korsunsky, A. M. %S Proceedings of the World Congress on Engineering (WCE’12) %S Lecture Notes in Engineering and Computer Science %D 2012 %8 jul 4 6 %I Newswood Limited %C London, UK %F Al_Sallami:2012:wce %X Software testing requires the use of a model to guide such efforts as test selection and test verification. In this case, testers are performing model-based testing. This paper introduces model-based testing and discusses its tasks in general terms with proposed finite state models. These FSMs depend on software’s semantic rather than its structure, , it use input-output specification and trajectory information to evolve and test general software. Finally, we close with a discussion of how our model-based testing can be used with genetic programming test generator. %K genetic algorithms, genetic programming, SBSE, model-based testing, test generator, finite state machine %U http://www.iaeng.org/publication/WCE2012/WCE2012_pp737-741.pdf %P 737-741 %0 Journal Article %T A new 3D molecular structure representation using quantum topology with application to structure-property relationships %A Alsberg, Bjorn K. %A Marchand-Geneste, Nathalie %A King, Ross D. %J Chemometrics and Intelligent Laboratory Systems %D 2000 %8 29 dec %V 54 %N 2 %@ 0169-7439 %F Alsberg:2000:CILS %X We present a new 3D molecular structure representation based on Richard F.W. Bader’s quantum topological atoms in molecules (AIM) theory for use in quantitative structure-property/activity relationship (QSPR/QSAR) modelling. Central to this structure representation using quantum topology (StruQT) are critical points located on the electron density distribution of the molecules. Other gradient fields such as the Laplacian of the electron density distribution can also be used. The type of critical point of particular interest is the bond critical point (BCP) which is here characterised by using the following three parameters: electron density [rho], the Laplacian [nabla]2[rho] and the ellipticity [epsi]. This representation has the advantage that there is no need to probe a large number of lattice points in 3D space to capture the important parts of the 3D electronic structure as is necessary in, e.g. comparative field analysis (CoMFA). We tested the new structure representation by predicting the wavelength of the lowest UV transition for a system of 18 anthocyanidins. Different quantitative structure-property relationship (QSPR) models are constructed using several chemometric/machine learning methods such as standard partial least squares regression (PLS), truncated PLS variable selection, genetic algorithm-based variable selection and genetic programming (GP). These models identified bonds that either take part in decreasing or increasing the dominant excitation wavelength. The models also correctly emphasised on the involvement of the conjugated [pi] system for predicting the wavelength through flagging the BCP ellipticity parameters as important for this particular data set. %K genetic algorithms, genetic programming, Structure representation using quantum topology, StruQT, Quantitative structure-activity relationships, QSAR, Quantitative structure-property relationships, QSPR, Atoms in molecules, AIM, Quantum chemistry, Bader theory, Multivariate analysis, Partial least squares regression, 3D structure representation, Variable selection %9 journal article %R doi:10.1016/S0169-7439(00)00101-5 %U http://dx.doi.org/doi:10.1016/S0169-7439(00)00101-5 %P 75-91 %0 Conference Proceedings %T Deploying Search Based Software Engineering with Sapienz at Facebook %A Alshahwan, Nadia %A Gao, Xinbo %A Harman, Mark %A Jia, Yue %A Mao, Ke %A Mols, Alexander %A Tei, Taijin %A Zorin, Ilya %Y Colanzi, Thelma Elita %Y McMinn, Phil %S SSBSE 2018 %S LNCS %D 2018 %8 August 9 sep %V 11036 %I Springer %C Montpellier, France %F Alshahwan:2018:SSBSE %X We describe the deployment of the Sapienz Search Based Software Engineering (SBSE) testing system. Sapienz has been deployed in production at Facebook since September 2017 to design test cases, localise and triage crashes to developers and to monitor their fixes. Since then, running in fully continuous integration within Facebook’s production development process, Sapienz has been testing Facebook’s Android app, which consists of millions of lines of code and is used daily by hundreds of millions of people around the globe. We continue to build on the Sapienz infrastructure, extending it to provide other software engineering services, applying it to other apps and platforms, and hope this will yield further industrial interest in and uptake of SBSE (and hybridisations of SBSE) as a result. %K genetic algorithms, genetic programming, genetic improvement, SBSE %R doi:10.1007/978-3-319-99241-9_1 %U https://discovery.ucl.ac.uk/id/eprint/10060107/ %U http://dx.doi.org/doi:10.1007/978-3-319-99241-9_1 %P 3-45 %0 Conference Proceedings %T Industrial experience of Genetic Improvement in Facebook %A Alshahwan, Nadia %Y Petke, Justyna %Y Tan, Shin Hwei %Y Langdon, William B. %Y Weimer, Westley %S GI-2019, ICSE workshops proceedings %D 2019 %8 28 may %I IEEE %C Montreal %F Alshahwan:2019:GI %O Invited Keynote %X Facebook recently had their first experience with Genetic Improvement (GI) by developing and deploying the automated bug fixing tool SapFix. The experience was successful resulting in landed fixes but also very educational. This paper will briefly outline some of the challenges for GI that were highlighted by this experience as well as a look at future directions in the area of mobile apps. %K genetic algorithms, genetic programming, genetic improvement, APR %R doi:10.1109/GI.2019.00010 %U https://doi.org/10.1109/GI.2019.00010 %U http://dx.doi.org/doi:10.1109/GI.2019.00010 %P 1 %0 Conference Proceedings %T Software Testing Research Challenges: An Industrial Perspective %A Alshahwan, Nadia %A Harman, Mark %A Marginean, Alexandru %Y Sampath, Sreedevi %S 16th IEEE International Conference on Software Testing, Verification and Validation (ICST 2023) %D 2023 %8 16 20 apr %C Dublin, Ireland %F Alshahwan:2023:ICST %O Keynote %X There have been rapid recent developments in automated software test design, repair and program improvement. Advances in artificial intelligence also have great potential impact to tackle software testing research problems. we highlight open research problems and challenges from an industrial perspective. This perspective draws on our experience at Meta Platforms, which has been actively involved in software testing research and development for approximately a decade. As we set out here, there are many exciting opportunities for software testing research to achieve the widest and deepest impact on software practice. With this overview of the research landscape from an industrial perspective, we aim to stimulate further interest in the deployment of software testing research. We hope to be able to collaborate with the scientific community on some of these research challenges. %K genetic algorithms, genetic programming, Genetic Improvement, SBSE, Automated Software Engineering, Software Testing, Automated Program Repair, APR, Artificial Intelligence, AI, Automated Remediation, regression testing %R doi:10.1109/ICST57152.2023.00008 %U https://research.facebook.com/file/1235985840680898/Software-Testing-Research-Challenges--An-Industrial-Perspective.pdf %U http://dx.doi.org/doi:10.1109/ICST57152.2023.00008 %P 1-10 %0 Generic %T Assured LLM-Based Software Engineering %A Alshahwan, Nadia %A Harman, Mark %A Harper, Inna %A Marginean, Alexandru %A Sengupta, Shubho %A Wang, Eddy %D 2024 %8 June %I arXiv %C Lisbon, Portugal %F alshahwan2024assured %O InteNSE 2024 Keynote %X How can we use Large Language Models (LLMs) to improve code independently of a human, while ensuring that the improved code - does not regress the properties of the original code? - improves the original in a verifiable and measurable way? To address this question, we advocate Assured LLM-Based Software Engineering; a generate-and-test approach, inspired by Genetic Improvement. Assured LLMSE applies a series of semantic filters that discard code that fails to meet these twin guarantees. This overcomes the potential problem of LLM’s propensity to hallucinate. It allows us to generate code using LLMs, independently of any human. The human plays the role only of final code reviewer, as they would do with code generated by other human engineers. This paper is an outline of the content of the keynote by Mark Harman at the International Workshop on Interpretability, Robustness, and Benchmarking in Neural Software Engineering, Monday 15th April 2024, Lisbon, Portugal %K genetic algorithms, genetic programming, genetic improvement, ANN, LLMSE, SBSE, prompt search space, facebook, automatic test oracle, refactoring, APR, searchable prompting language %U https://arxiv.org/abs/2402.04380 %0 Journal Article %T Automated Unit Test Improvement using Large Language Models at Meta %A Alshahwan, Nadia %A Chheda, Jubin %A Finogenova, Anastasia %A Gokkaya, Beliz %A Harman, Mark %A Harper, Inna %A Marginean, Alexandru %A Sengupta, Shubho %A Wang, Eddy %D 2024 %8 jul 15 19 %I ACM %C Porto de Galinhas, Brazil %F Alshahwan:2024:FSEcomp %X This paper describes Meta TestGen-LLM tool, which uses LLMs to automatically improve existing human-written tests. TestGen-LLM verifies that its generated test classes successfully clear a set of filters that assure measurable improvement over the original test suite, thereby eliminating problems due to LLM hallucination. We describe the deployment of TestGen-LLM at Meta test-a-thons for the Instagram and Facebook platforms. In an evaluation on Reels and Stories products for Instagram, 75percent of TestGen-LLMs test cases built correctly, 57percent passed reliably, and 25percent increased coverage. During Meta Instagram and Facebook test-a-thons, it improved 11.5percent of all classes to which it was applied, with 73percent of its recommendations being accepted for production deployment by Meta software engineers. We believe this is the first report on industrial scale deployment of LLM-generated code backed by such assurances of code improvement. %K TestGen-LLM, Automated Test Generation, Genetic Improvement, LLMs, Large Language Models, ANN, SBSE, Unit Testing %9 journal article %R doi:10.1145/3663529.3663839 %U https://doi.org/10.1145/3663529.3663839 %U http://dx.doi.org/doi:10.1145/3663529.3663839 %P 185-196 %0 Conference Proceedings %T Classifying SSH encrypted traffic with minimum packet header features using genetic programming %A Alshammari, Riyad %A Lichodzijewski, Peter %A Heywood, Malcolm I. %A Zincir-Heywood, A. Nur %Y Esparcia, Anna I. %Y Chen, Ying-ping %Y Ochoa, Gabriela %Y Ozcan, Ender %Y Schoenauer, Marc %Y Auger, Anne %Y Beyer, Hans-Georg %Y Hansen, Nikolaus %Y Finck, Steffen %Y Ros, Raymond %Y Whitley, Darrell %Y Wilson, Garnett %Y Harding, Simon %Y Langdon, W. B. %Y Wong, Man Leung %Y Merkle, Laurence D. %Y Moore, Frank W. %Y Ficici, Sevan G. %Y Rand, William %Y Riolo, Rick %Y Kharma, Nawwaf %Y Buckley, William R. %Y Miller, Julian %Y Stanley, Kenneth %Y Bacardit, Jaume %Y Browne, Will %Y Drugowitsch, Jan %Y Beume, Nicola %Y Preuss, Mike %Y Smith, Stephen L. %Y Cagnoni, Stefano %Y DeLeo, Jim %Y Floares, Alexandru %Y Baughman, Aaron %Y Gustafson, Steven %Y Keijzer, Maarten %Y Kordon, Arthur %Y Congdon, Clare Bates %S GECCO-2009 Defense applications of computational intelligence workshop %D 2009 %8 August 12 jul %I ACM %C Montreal %F DBLP:conf/gecco/AlshammariLHZ09 %X The classification of Encrypted Traffic, namely Secure Shell (SSH), on the fly from network TCP traffic represents a particularly challenging application domain for machine learning. Solutions should ideally be both simple - therefore efficient to deploy - and accurate. Recent advances to team based Genetic Programming provide the opportunity to decompose the original problem into a subset of classifiers with non-overlapping behaviors, in effect providing further insight into the problem domain and increasing the throughput of solutions. Thus, in this work we have investigated the identification of SSH encrypted traffic based on packet header features without using IP addresses, port numbers and payload data. Evaluation of C4.5 and AdaBoost - representing current best practice - against the Symbiotic Bid-based (SBB) paradigm of team-based Genetic Programming (GP) under data sets common and independent from the training condition indicates that SBB based GP solutions are capable of providing simpler solutions without sacrificing accuracy. %K genetic algorithms, genetic programming %R doi:10.1145/1570256.1570358 %U http://dx.doi.org/doi:10.1145/1570256.1570358 %P 2539-2546 %0 Conference Proceedings %T Unveiling Skype encrypted tunnels using GP %A Alshammari, Riyad %A Zincir-Heywood, A. Nur %S IEEE Congress on Evolutionary Computation (CEC 2010) %D 2010 %8 18 23 jul %I IEEE Press %C Barcelona, Spain %F Alshammari:2010:cec %X The classification of Encrypted Traffic, namely Skype, from network traffic represents a particularly challenging problem. Solutions should ideally be both simple -therefore efficient to deploy -and accurate. Recent advances to team-based Genetic Programming provide the opportunity to decompose the original problem into a subset of classifiers with non-overlapping behaviours. Thus, in this work we have investigated the identification of Skype encrypted traffic using Symbiotic Bid-Based (SBB) paradigm of team based Genetic Programming (GP) found on flow features without using IP addresses, port numbers and payload data. Evaluation of SBB-GP against C4.5 and AdaBoost -representing current best practice -indicates that SBB-GP solutions are capable of providing simpler solutions in terms number of features used and the complexity of the solution/model without sacrificing accuracy. %K genetic algorithms, genetic programming %R doi:10.1109/CEC.2010.5586288 %U http://dx.doi.org/doi:10.1109/CEC.2010.5586288 %0 Conference Proceedings %T An investigation on the identification of VoIP traffic: Case study on Gtalk and Skype %A Alshammari, Riyad %A Zincir-Heywood, A. Nur %S 2010 International Conference on Network and Service Management (CNSM) %D 2010 %8 25 29 oct %F Alshammari:2010:CNSM %X The classification of encrypted traffic on the fly from network traces represents a particularly challenging application domain. Recent advances in machine learning provide the opportunity to decompose the original problem into a subset of classifiers with non-overlapping behaviours, in effect providing further insight into the problem domain. Thus, the objective of this work is to classify VoIP encrypted traffic, where Gtalk and Skype applications are taken as good representatives. To this end, three different machine learning based approaches, namely, C4.5, AdaBoost and Genetic Programming (GP), are evaluated under data sets common and independent from the training condition. In this case, flow based features are employed without using the IP addresses, source/destination ports and payload information. Results indicate that C4.5 based machine learning approach has the best performance. %K genetic algorithms, genetic programming, AdaBoost, C4.5, Gtalk, IP address, Skype, VoIP encrypted traffic, machine learning, source/destination port, Internet telephony, learning (artificial intelligence), telecommunication traffic %R doi:10.1109/CNSM.2010.5691210 %U http://dx.doi.org/doi:10.1109/CNSM.2010.5691210 %P 310-313 %0 Conference Proceedings %T Is Machine Learning losing the battle to produce transportable signatures against VoIP traffic? %A Alshammari, Riyad %A Zincir-Heywood, A. Nur %Y Smith, Alice E. %S Proceedings of the 2011 IEEE Congress on Evolutionary Computation %D 2011 %8 May 8 jun %I IEEE Press %C New Orleans, USA %@ 0-7803-8515-2 %F Alshammari:2011:IMLltbtptsaVt %X Traffic classification becomes more challenging since the traditional techniques such as port numbers or deep packet inspection are ineffective against voice over IP (VoIP) applications, which uses non-standard ports and encryption. Statistical information based on network layer with the use of machine learning (ML) can achieve high classification accuracy and produce transportable signatures. However, the ability of ML to find transportable signatures depends mainly on the training data sets. In this paper, we explore the importance of sampling training data sets for the ML algorithms, specifically Genetic Programming, C5.0, Naive Bayesian and AdaBoost, to find transportable signatures. To this end, we employed two techniques for sampling network training data sets, namely random sampling and consecutive sampling. Results show that random sampling and 90-minute consecutive sampling have the best performance in terms of accuracy using C5.0 and SBB, respectively. In terms of complexity, the size of C5.0 solutions increases as the training size increases, whereas SBB finds simpler solutions. %K genetic algorithms, genetic programming, AdaBoost, C5.0, VoIP traffic classification, consecutive sampling, machine learning, naive Bayesian, random sampling, transportable signatures, voice over IP, Bayes methods, Internet telephony, learning (artificial intelligence), telecommunication security, telecommunication traffic %R doi:10.1109/CEC.2011.5949799 %U http://dx.doi.org/doi:10.1109/CEC.2011.5949799 %P 1542-1549 %0 Journal Article %T Identification of VoIP encrypted traffic using a machine learning approach %A Alshammari, Riyad %A Zincir-Heywood, A. Nur %J Journal of King Saud University - Computer and Information Sciences %D 2015 %V 27 %N 1 %@ 1319-1578 %F Alshammari:2015:JKSUCIS %X We investigate the performance of three different machine learning algorithms, namely C5.0, AdaBoost and Genetic programming (GP), to generate robust classifiers for identifying VoIP encrypted traffic. To this end, a novel approach (Alshammari and Zincir-Heywood, 2011) based on machine learning is employed to generate robust signatures for classifying VoIP encrypted traffic. We apply statistical calculation on network flows to extract a feature set without including payload information, and information based on the source and destination of ports number and IP addresses. Our results show that finding and employing the most suitable sampling and machine learning technique can improve the performance of classifying VoIP significantly. %K genetic algorithms, genetic programming, Machine learning, Encrypted traffic, Robustness, Network signatures %9 journal article %R doi:10.1016/j.jksuci.2014.03.013 %U http://www.sciencedirect.com/science/article/pii/S1319157814000561 %U http://dx.doi.org/doi:10.1016/j.jksuci.2014.03.013 %P 77-92 %0 Journal Article %T Prediction of heat load in district heating systems by Support Vector Machine with Firefly searching algorithm %A Al-Shammari, Eiman Tamah %A Keivani, Afram %A Shamshirband, Shahaboddin %A Mostafaeipour, Ali %A Yee, Por Lip %A Petkovic, Dalibor %A Ch, Sudheer %J Energy %D 2016 %V 95 %@ 0360-5442 %F AlShammari:2016:Energy %X District heating systems operation can be improved by control strategies. One of the options is the introduction of predictive control model. Predictive models of heat load can be applied to improve district heating system performances. In this article, short-term multistep-ahead predictive models of heat load for consumers connected to district heating system were developed using SVMs (Support Vector Machines) with FFA (Firefly Algorithm). Firefly algorithm was used to optimize SVM parameters. Seven SVM-FFA predictive models for different time horizons were developed. Obtained results of the SVM-FFA models were compared with GP (genetic programming), ANNs (artificial neural networks), and SVMs models with grid search algorithm. The experimental results show that the developed SVM-FFA models can be used with certainty for further work on formulating novel model predictive strategies in district heating systems. %K genetic algorithms, genetic programming, District heating systems, Heat load, Estimation, Prediction, Support Vector Machines, Firefly algorithm %9 journal article %R doi:10.1016/j.energy.2015.11.079 %U http://www.sciencedirect.com/science/article/pii/S0360544215016424 %U http://dx.doi.org/doi:10.1016/j.energy.2015.11.079 %P 266-273 %0 Journal Article %T Machine learning-based analysis of occupant-centric aspects: Critical elements in the energy consumption of residential buildings %A Alsharif, Rashed %A Arashpour, Mehrdad %A Golafshani, Emadaldin Mohammadi %A Hosseini, M. Reza %A Chang, Victor %A Zhou, Jenny %J Journal of Building Engineering %D 2022 %V 46 %@ 2352-7102 %F ALSHARIF:2022:JBE %X The housing sector consumes a significant amount of energy worldwide, which is mainly attributed to operating energy systems for the provision of thermally comfortable indoor environments. Although the literature in this field has focused on investigating critical factors in energy consumption, only a few studies have conducted a quantitative sensitivity analysis for thermal occupant factors (TOF) (i.e., metabolic rate and clothing level). Therefore, this paper introduces a framework for testing the criticality of TOF with a cross-comparison against building-related factors, considering the constraint of occupant thermal comfort. Using a building energy simulation model, the energy consumption of a case study is simulated, and building energy model alternatives are generated. The scope includes TOF and building envelope factors, with an established orthogonal experimental design. A popular branch of machine learning (ML) called linear genetic programming (LGP) is used to analyse the generated data from the experiment. Finally, a sensitivity analysis is conducted using the developed LGP model to determine and rank the criticality of the considered factors. The findings reveal that occupants’ metabolic rate and clothing level have relevancy factors of -0.48 and -0.38 respectively, which ranked them 2nd and 3rd against building envelope factors for achieving energy-efficient comfortable houses. This research contributes to the literature by introducing a framework that couples orthogonal experiment design with ML techniques to quantify the criticality of TOF and rank them against building-envelope factors %K genetic algorithms, genetic programming, Artificial Intelligence, Energy simulation, Metabolic rate, Predicted mean vote (PMV), Sustainability %9 journal article %R doi:10.1016/j.jobe.2021.103846 %U https://www.sciencedirect.com/science/article/pii/S2352710221017046 %U http://dx.doi.org/doi:10.1016/j.jobe.2021.103846 %P 103846 %0 Journal Article %T Field-Based Prediction Models for Stop Penalty in Traffic Signal Timing Optimization %A Alshayeb, Suhaib %A Stevanovic, Aleksandar %A Park, B. Brian %J Energies %D 2021 %V 14 %N 21 %@ 1996-1073 %F alshayeb:2021:Energies %X Transportation agencies optimise signals to improve safety, mobility, and the environment. One commonly used objective function to optimise signals is the Performance Index (PI), a linear combination of delays and stops that can be balanced to minimise fuel consumption (FC). The critical component of the PI is the stop penalty K, which expresses an FC stop equivalency estimated in seconds of pure delay. This study applies vehicular trajectory and FC data collected in the field, for a large fleet of modern vehicles, to compute the K-factor. The tested vehicles were classified into seven homogenous groups by using the k-prototype algorithm. Furthermore, multigene genetic programming (MGGP) is used to develop prediction models for the K-factor. The proposed K-factor models are expressed as functions of various parameters that impact its value, including vehicle type, cruising speed, road gradient, driving behaviour, idling FC, and the deceleration duration. A parametric analysis is carried out to check the developed models’ quality in capturing the individual impact of the included parameters on the K-factor. The developed models showed an excellent performance in estimating the K-factor under multiple conditions. Future research shall evaluate the findings by using field-based K-values in optimising signals to reduce FC. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/en14217431 %U https://www.mdpi.com/1996-1073/14/21/7431 %U http://dx.doi.org/doi:10.3390/en14217431 %0 Conference Proceedings %T Off-line Parameter Tuning for Guided Local Search Using Genetic Programming %A Alsheddy, Abdullah %A Kampouridis, Michael %Y Li, Xiaodong %S Proceedings of the 2012 IEEE Congress on Evolutionary Computation %D 2012 %8 October 15 jun %C Brisbane, Australia %@ 0-7803-8515-2 %F Alsheddy:2012:CEC %X Guided Local Search (GLS), which is a simple meta-heuristic with many successful applications, has lambda as the only parameter to tune. There has been no attempt to automatically tune this parameter, resulting in a parameterless GLS. Such a result is a very practical objective to facilitate the use of meta-heuristics for end-users (e.g. practitioners and researchers). In this paper, we propose a novel parameter tuning approach by using Genetic Programming (GP). GP is employed to evolve an optimal formula that GLS can use to dynamically compute lambda as a function of instance-dependent characteristics. Computational experiments on the travelling salesman problem demonstrate the feasibility and effectiveness of this approach, producing parameterless formulae with which the performance of GLS is competitive (if not better) than the standard GLS. %K genetic algorithms, genetic programming, Heuristics, metaheuristics and hyper-heuristics %R doi:10.1109/CEC.2012.6256155 %U http://dx.doi.org/doi:10.1109/CEC.2012.6256155 %P 112-116 %0 Conference Proceedings %T The Influence of the Picking Times of the Components in Time and Space Assembly Line Balancing Problems: An Approach with Evolutionary Algorithms %A Alsina, Emanuel F. %A Capodieci, Nicola %A Cabri, Giacomo %A Regattieri, Alberto %S 2015 IEEE Symposium Series on Computational Intelligence %D 2015 %8 dec %F Alsina:2015:ieeeSSCI %X The balancing of assembly lines is one of the most studied industrial problems, both in academic and practical fields. The workable application of the solutions passes through a reliable simplification of the real-world assembly line systems. Time and space assembly line balancing problems consider a realistic versions of the assembly lines, involving the optimisation of the entire line cycle time, the number of stations to install, and the area of these stations. Components, necessary to complete the assembly tasks, have different picking times depending on the area where they are allocated. The implementation in the real world of a line balanced disregarding the distribution of the tasks which use unwieldy components can result unfeasible. The aim of this paper is to present a method which balances the line in terms of time and space, hence optimises the allocation of the components using an evolutionary approach. In particular, a method which combines the bin packing problem with a genetic algorithm and a genetic programming is presented. The proposed method can be able to find different solutions to the line balancing problem and then evolve they in order to optimise the allocation of the components in certain areas in the workstation. %K genetic algorithms, genetic programming %R doi:10.1109/SSCI.2015.148 %U http://dx.doi.org/doi:10.1109/SSCI.2015.148 %P 1021-1028 %0 Thesis %T Models for the prediction and management of complex systems in industrial and dynamic environments %A Alsina, Emanuel Federico %D 2016 %C Italy %C Universita degli studi di Modena e Reggio Emilia %F Alsina:thesis %X The world in which we live is becoming more and more complex. Modelling the reality means to create simplifications and abstractions of that, in order to figure out what is going on in this modern and complex world in which we live. Nowadays, models have become crucial to make better decisions. Models help us to be clearer thinkers, and to understand how to transform data in useful information. There are too many data out there, models take these data and structure them into information, and then into knowledge. Two main topics are discussed in this work: (1) how to model complex systems, and (2) how to make predictions within complex systems, in industrial and dynamic environments. The purpose of this thesis is to present a series of models developed to support the decision makers in the complexity management. The first topic is addressed presenting some models concerning the balancing of assembly lines, machine degradation in production lines, operation schedule, and the positing of cranes in automated warehousing. In particular, concerning the assembly lines, two bio-inspired models which optimize the global picking time of the components considering their physical allocation are presented. Moreover, the use of a multi-agent model able to simultaneously consider different factors that affect machines in a production line is analysed. This approach takes into account the ageing and the degradation of the machines, the repairs, the replacement, and the preventive maintenance activities. Furthermore, in order to present how to manage the complexity intrinsic into the operations scheduling, a model inspired by the behaviour of an ant colony is showed. Finally, another multi-agent model is showed, which is able to find the optimal dwell point in automated storage retrieval systems exploiting an idea deriving from force-fields. After that, an entire chapter is dedicated to the prediction in complex systems. Prediction in industrial and dynamic environments is a challenge that professionals and academics have to face more and more. Some models able to capture non-linear relationships between temporal events are presented. These models are applied to different fields, from the reliability of mechanical and electrical components, to renewable energy. In the final analysis, models able to predict the users behaviors within online social communities are introduced. In these cases, various machine learning approaches (such as artificial neural networks, logistic regressions, and random trees) are detailed. This thesis want to be an inspiration for those people which have to manage the complexity in industrial and dynamic environments, showing examples and results, in order to explain how to make this world a little more understandable. %K genetic algorithms, genetic programming %9 Ph.D. thesis %U https://morethesis.unimore.it/theses/available/etd-11262015-110057/ %0 Conference Proceedings %T Feature selection and classification in genetic programming: Application to haptic-based biometric data %A Alsulaiman, Fawaz A. %A Sakr, Nizar %A Valdes, Julio J. %A El Saddik, Abdulmotaleb %A Georganas, Nicolas D. %S IEEE Symposium on Computational Intelligence for Security and Defense Applications, CISDA 2009 %D 2009 %8 jul %F Alsulaiman:2009:ieeeCISDA %X In this paper, a study is conducted in order to explore the use of genetic programming, in particular gene expression programming (GEP), in finding analytic functions that can behave as classifiers in high-dimensional haptic feature spaces. More importantly, the determined explicit functions are used in discovering minimal knowledge-preserving subsets of features from very high dimensional haptic datasets, thus acting as general dimensionality reducers. This approach is applied to the haptic-based biometrics problem; namely, in user identity verification. GEP models are initially generated using the original haptic biometric datatset, which is imbalanced in terms of the number of representative instances of each class. This procedure was repeated while considering an under-sampled (balanced) version of the datasets. The results demonstrated that for all datasets, whether imbalanced or under-sampled, a certain number (on average) of perfect classification models were determined. In addition, using GEP, great feature reduction was achieved as the generated analytic functions (classifiers) exploited only a small fraction of the available features. %K genetic algorithms, genetic programming, gene expression programming, analytic function, dimensionality reducers, feature selection, haptic dataset, haptic-based biometric data, haptic-based biometrics problem, high-dimensional haptic feature space, perfect classification model, feature extraction, haptic interfaces, pattern classification %R doi:10.1109/CISDA.2009.5356540 %U http://dx.doi.org/doi:10.1109/CISDA.2009.5356540 %P 1-7 %0 Conference Proceedings %T Identity verification based on haptic handwritten signatures: Genetic programming with unbalanced data %A Alsulaiman, Fawaz A. %A Valdes, Julio J. %A El Saddik, Abdulmotaleb %S Computational Intelligence for Security and Defence Applications (CISDA), 2012 IEEE Symposium on %D 2012 %F Alsulaiman:2012:CISDA %X In this paper, haptic-based handwritten signature verification using Genetic Programming (GP) classification is presented. The relevance of different haptic data types (e.g., force, position, torque, and orientation) in user identity verification is investigated. In particular, several fitness functions are used and their comparative performance is investigated. They take into account the unbalance dataset problem (large disparities within the class distribution), which is present in identity verification scenarios. GP classifiers using such fitness functions compare favourably with classical methods. In addition, they lead to simple equations using a much smaller number of attributes. It was found that collectively, haptic features were approximately as equally important as visual features from the point of view of their contribution to the identity verification process. %K genetic algorithms, genetic programming, handwriting recognition, haptic interfaces, image classification, GP classification, GP classifiers, fitness functions, genetic programming classification, haptic data types, haptic features, haptic-based handwritten signature verification, unbalance dataset problem, user identity verification, visual features, Biological cells, Biometrics, Force, Gene expression, Haptic interfaces, Vectors %R doi:10.1109/CISDA.2012.6291531 %U http://dx.doi.org/doi:10.1109/CISDA.2012.6291531 %0 Journal Article %T Identity verification based on handwritten signatures with haptic information using genetic programming %A Alsulaiman, Fawaz A. %A Sakr, Nizar %A Valdes, Julio J. %A El-Saddik, Abdulmotaleb %J ACM Transactions on Multimedia Computing, Communications, and Applications %D 2013 %8 may %V 9 %N 2 %I ACM %@ 1551-6857 %F journals/tomccap/AlsulaimanSVE13 %X In this article, haptic-based handwritten signature verification using Genetic Programming (GP) classification is presented. A comparison of GP-based classification with classical classifiers including support vector machine, k-nearest neighbours, naive Bayes, and random forest is conducted. In addition, the use of GP in discovering small knowledge-preserving subsets of features in high-dimensional datasets of haptic-based signatures is investigated and several approaches are explored. Subsets of features extracted from GP-generated models (analytic functions) are also exploited to determine the importance and relevance of different haptic data types (e.g., force, position, torque, and orientation) in user identity verification. The results revealed that GP classifiers compare favourably with the classical methods and use a much fewer number of attributes (with simple function sets). %K genetic algorithms, genetic programming, Biometrics, Haptics, classification, user verification %9 journal article %R doi:10.1145/2457450.2457453 %U http://doi.acm.org/http://dx.doi.org/10.1145/2457450.2457453 %U http://dx.doi.org/doi:10.1145/2457450.2457453 %P 11:1-11:21 %0 Conference Proceedings %T Identity verification based on haptic handwritten Signature: Novel fitness functions for GP framework %A Alsulaiman, Fawaz A. %A Valdes, Julio J. %A El Saddik, Abdulmotaleb %S IEEE International Symposium on Haptic Audio Visual Environments and Games (HAVE 2013) %D 2013 %8 oct %F Alsulaiman:2013:HAVE %X Fitness functions are the evaluation measures driving evolutionary processes towards solutions. In this paper, three fitness functions are proposed for solving the unbalanced dataset problem in Haptic-based handwritten signatures using genetic programming (GP). The use of these specifically designed fitness functions produced simpler analytical expressions than those obtained with currently available fitness measures, while keeping comparable classification accuracy. The functions introduced in this paper capture explicitly the nature of unbalanced data, exhibit better dimensionality reduction and have better False Rejection Rate. %K genetic algorithms, genetic programming, handwriting recognition, haptic interfaces, GP framework, evolutionary processes, false rejection rate, haptic based handwritten signatures, identity verification, novel fitness functions, Accuracy, Educational institutions, Evolutionary computation, Gene expression, Haptic interfaces, Programming %R doi:10.1109/HAVE.2013.6679618 %U http://dx.doi.org/doi:10.1109/HAVE.2013.6679618 %P 98-102 %0 Thesis %T Towards a Continuous User Authentication Using Haptic Information %A Alsulaiman, Fawaz Abdulaziz A. %D 2013 %C Canada %C School of Electrical Engineering and Computer Science, University of Ottawa %F Alsulaiman_Fawaz_Abdulaziz_A_2013_thesis %X With the advancement in multimedia systems and the increased interest in haptics to be used in interpersonal communication systems, where users can see, show, hear, tell, touch and be touched, mouse and keyboard are no longer dominant input devices. Touch, speech and vision will soon be the main methods of human computer interaction. Moreover, as interpersonal communication usage increases, the need for securing user authentication grows. In this research, we examine a user’s identification and verification based on haptic information. We divide our research into three main steps. The first step is to examine a pre-defined task, namely a handwritten signature with haptic information. The user target in this task is to mimic the legitimate signature in order to be verified. As a second step, we consider the user’s identification and verification based on user drawings. The user target is predefined, however there are no restrictions imposed on the order or on the level of details required for the drawing. Lastly, we examine the feasibility and possibility of distinguishing users based on their haptic interaction through an interpersonal communication system. In this third step, there are no restrictions on user movements, however a free movement to touch the remote party is expected. In order to achieve our goal, many classification and feature reduction techniques have been discovered and some new ones were proposed. Moreover, in this work we use evolutionary computing in user verification and identification. Analysis of haptic features and their significance on distinguishing users is hence examined. The results show a use of visual features by Genetic Programming (GP) towards identity verification, with a probability equal to 50percent while the remaining haptic features were used with a probability of approximately 50percent. Moreover, with a handwritten signature application, a verification success rate of 97.93percent with False Acceptance Rate (FAR) of 1.28percent and 11.54percent False Rejection Rate (FRR) is achieved with the use of genetic programming enhanced with the random over sampled data set. In addition, with a totally free user movement in a haptic-enabled interpersonal communication system, an identification success rate of 83.3percent is achieved when random forest classifier is used. %K genetic algorithms, genetic programming, User Authentication, Identity Verification, User Identification, Haptics, Haptic-enabled Interpersonal Communication System %9 Ph.D. thesis %U https://ruor.uottawa.ca/bitstream/10393/23946/3/Alsulaiman_Fawaz_Abdulaziz_A_2013_thesis.pdf %0 Journal Article %T Similarity of Amyloid Protein Motif using an Hybrid Intelligent System %A Altamiranda, J. %A Aguilar, J. %A Delamarche, C. %J IEEE Latin America Transactions (Revista IEEE America Latina) %D 2011 %8 sep %V 9 %N 5 %@ 1548-0992 %F Altamiranda:2011:ieeeLAT %O In Spanish %X The main objective of this research is to define and develop a comparison method of regular expressions, and apply it to amyloid proteins. In general, the biological problem that we study is concerning the search for similarities between non-homologous protein families, using regular expressions, with the goal of discover and identify specific regions conserved in the protein sequence, and in this way determine that proteins have a common origin. From the computer point of view, the problem consists of comparison of protein motifs expressed using regular expressions. A motif is a small region in a previously characterised protein, with a functional or structural significance in the protein sequence. In this work we proposed a hybrid method of motifs comparison based on the Genetic Programming, to generate the populations derived from every regular expression under comparison, and the Backpropagation Artificial Neural Network, for the comparison between them. The method of motifs comparison is tested using the database AMYPdb, and it allows discover possible similarities between amyloid families. %K genetic algorithms, genetic programming, AMYPdb database, amyloid protein motif, backpropagation artificial neural network, biological problem, hybrid intelligent system, nonhomologous protein family, protein sequence, regular expression, backpropagation, biology computing, neural nets, proteins %9 journal article %R doi:10.1109/TLA.2011.6030978 %U http://dx.doi.org/doi:10.1109/TLA.2011.6030978 %P 700-710 %0 Conference Proceedings %T Comparison and fusion model in protein motifs %A Altamiranda, Junior %A Aguilar, Jose %A Delamarche, Chistian %S XXXIX Latin American Computing Conference (CLEI 2013) %D 2013 %8 July 11 oct %I IEEE %C Naiguata %F Altamiranda:2013:CLEI %X Motifs are useful in biology to highlight the nucleotides/amino-acids that are involved in structure, function, regulation and evolution, or to infer homology between genes/proteins. PROSITE is a strategy to model protein motifs as Regular Expressions and Position Frequency Matrices. Multiple tools have been proposed to discover biological motifs, but not for the case of the motifs comparison problem, which is NP-Complete due to flexibility and independence at each position. In this paper we present a formal model to compare two protein motifs based on the Genetic Programming to generate the population of sequences derived from every regular expression under comparison and on a Neural Network Backpropagation to calculate a motif similarity score as fitness function. Additionally, we present a fusion formal method for two similar motifs based on the Ant Colony Optimisation technique. The comparison and fusion method was tested using amyloid protein motifs. %K genetic algorithms, genetic programming, Bioinformatics, Neural Network, ANN, ACO, Ant Colony Optimization %R doi:10.1109/CLEI.2013.6670618 %U http://dx.doi.org/doi:10.1109/CLEI.2013.6670618 %0 Book Section %T The Evolution of Evolvability in Genetic Programming %A Altenberg, Lee %E Kinnear, Jr., Kenneth E. %B Advances in Genetic Programming %D 1994 %I MIT Press %F kinnear:altenberg %X The notion of “evolvability” — the ability of a population to produce variants fitter than any yet existing — is developed as it applies to genetic algorithms. A theoretical analysis of the dynamics of genetic programming predicts the existence of a novel, emergent selection phenomenon: the evolution of evolvability. This is produced by the proliferation, within programs, of blocks of code that have a higher chance of increasing fitness when added to programs. Selection can then come to mold the variational aspects of the way evolved programs are represented. A model of code proliferation within programs is analyzed to illustrate this effect. The mathematical and conceptual framework includes: the definition of evolvability as a measure of performance for genetic algorithms; application of Price’s Covariance and Selection Theorem to show how the fitness function, representation, and genetic operators must interact to produce evolvability — namely, that genetic operators produce offspring with fitnesses specifically correlated with their parent’s fitnesses; how blocks of code emerge as a new level of replicator, proliferating as a function of their “constructional fitness”, which is distinct from their schema fitness; and how programs may change from innovative code to conservative code as the populations mature. Several new selection techniques and genetic operators are proposed in order to give better control over the evolution of evolvability and improved evolutionary performance. Copyright 1996 Lee Altenberg %K genetic algorithms, genetic programming %R doi:10.7551/mitpress/1108.003.0009 %U http://dynamics.org/~altenber/PAPERS/EEGP/ %U http://dx.doi.org/doi:10.7551/mitpress/1108.003.0009 %P 47-74 %0 Conference Proceedings %T Evolving better representations through selective genome growth %A Altenberg, Lee %S Proceedings of the 1st IEEE Conference on Evolutionary Computation %D 1994 %8 27 29 jun %V 1 %I IEEE %C Orlando, Florida, USA %F Altenberg:1994EBR %X The choice of how to represent the search space for a genetic algorithm (GA) is critical to the GA’s performance. Representations are usually engineered by hand and fixed for the duration of the GA run. Here a new method is described in which the degrees of freedom of the representation — i.e. the genes – are increased incrementally. The phenotypic effects of the new genes are randomly drawn from a space of different functional effects. Only those genes that initially increase fitness are kept. The genotype-phenotype map that results from this selection during the constructional of the genome allows better adaptation. This effect is illustrated with the NK landscape model. The resulting genotype-phenotype maps are much less epistatic than generic maps would be. They have extremely low values of “K” — the number of fitness components affected by each gene. Moreover, these maps are exquisitely tuned to the specifics of the random fitness functions, and achieve fitnesses many standard deviations above generic NK landscapes with the same \gp maps. The evolved maps create adaptive landscapes that are much smoother than generic NK landscapes ever are. Thus a caveat should be made when making arguments about the applicability of generic properties of complex systems to evolved systems. This method may help to solve the problem of choice of representations in genetic algorithms. Copyright 1996 Lee Altenberg %K genetic algorithms, genetic programming %U http://dynamics.org/~altenber/PAPERS/EBR/ %P 182-187 %0 Conference Proceedings %T Emergent phenomena in genetic programming %A Altenberg, Lee %Y Sebald, Anthony V. %Y Fogel, Lawrence J. %S Evolutionary Programming — Proceedings of the Third Annual Conference %D 1994 %8 24 26 feb %I World Scientific Publishing %C San Diego, CA, USA %@ 981-02-1810-9 %F Altenberg:1994EPIGP %X Evolutionary computation systems exhibit various emergent phenomena, primary of which is adaptation. In genetic programming, because of the indeterminate nature of the representation, the evolution of both recombination distributions and representations can emerge from the population dynamics. A review of ideas on these phenomena is presented, including theory on the evolution of evolvability through differential proliferation of subexpressions within programs. An analysis is given of a model of genetic programming dynamics that is supportive of the “Soft Brood Selection” conjecture, which was proposed as a means to counteract the emergence of highly conservative code, and instead favor highly evolvable code. Copyright 1996 Lee Altenberg %K genetic algorithms, genetic programming %U http://dynamics.org/~altenber/PAPERS/EPIGP/ %P 233-241 %0 Conference Proceedings %T The Schema Theorem and Price’s Theorem %A Altenberg, Lee %Y Whitley, L. Darrell %Y Vose, Michael D. %S Foundations of Genetic Algorithms 3 %D 1994 %8 31 jul –2 aug %I Morgan Kaufmann %C Estes Park, Colorado, USA %@ 1-55860-356-5 %F Altenberg:1995STPT %O Published 1995 %X Holland’s Schema Theorem is widely taken to be the foundation for explanations of the power of genetic algorithms (GAs). Yet some dissent has been expressed as to its implications. Here, dissenting arguments are reviewed and elaborated upon, explaining why the Schema Theorem has no implications for how well a GA is performing. Interpretations of the Schema Theorem have implicitly assumed that a correlation exists between parent and offspring fitnesses, and this assumption is made explicit in results based on Price’s Covariance and Selection Theorem. Schemata do not play a part in the performance theorems derived for representations and operators in general. However, schemata re-emerge when recombination operators are used. Using Geiringer’s recombination distribution representation of recombination operators, a “missing” schema theorem is derived which makes explicit the intuition for when a GA should perform well. Finally, the method of “adaptive landscape” analysis is examined and counterexamples offered to the commonly used correlation statistic. Instead, an alternative statistic—the transmission function in the fitness domain— is proposed as the optimal statistic for estimating GA performance from limited samples. Copyright 1996 Lee Altenberg %K genetic algorithms, genetic programming %R doi:10.1016/B978-1-55860-356-1.50006-6 %U http://dynamics.org/~altenber/PAPERS/STPT/ %U http://dx.doi.org/doi:10.1016/B978-1-55860-356-1.50006-6 %P 23-49 %0 Book Section %T Genome growth and the evolution of the genotype-phenotype map %A Altenberg, Lee %E Banzhaf, Wolfgang %E Eeckman, Frank H. %B Evolution as a Computational Process %S Lecture Notes in Computer Science %D 1992 %8 jul %V 899 %I Springer-Verlag %C Monterey, California, USA %F Altenberg:1995GGEGPM %X The evolution of new genes is distinct from evolution through allelic substitution in that new genes bring with them new degrees of freedom for genetic variability. Selection in the evolution of new genes can therefore act to sculpt the dimensions of variability in the genome. This “constructional” selection effect is an evolutionary mechanism, in addition to genetic modification, that can affect the variational properties of the genome and its evolvability. One consequence is a form of genic selection: genes with large potential for generating new useful genes when duplicated ought to proliferate in the genome, rendering it ever more capable of generating adaptive variants. A second consequence is that alleles of new genes whose creation produced a selective advantage may be more likely to also produce a selective advantage, provided that gene creation and allelic variation have correlated phenotypic effects. A fitness distribution model is analyzed which demonstrates these two effects quantitatively. These are effects that select on the nature of the genotype-phenotype map. New genes that perturb numerous functions under stabilizing selection, i.e. with high pleiotropy, are unlikely to be advantageous. Therefore, genes coming into the genome ought to exhibit low pleiotropy during their creation. If subsequent offspring genes also have low pleiotropy, then genic selection can occur. If subsequent allelic variation also has low pleiotropy, then that too should have a higher chance of not being deleterious. The effects on pleiotropy are illustrated with two model genotype-phenotype maps: Wagner’s linear quantitative-genetic model with Gaussian selection, and Kauffman’s “NK” adaptive landscape model. Constructional selection is compared with other processes and ideas about the evolution of constraints, evolvability, and the genotype-phenotype map. Empirical phenomena such as dissociability in development, morphological integration, and exon shuffling are discussed in the context of this evolutionary process. Copyright 1996 Lee Altenberg %K genetic algorithms, genetic programming %R doi:10.1007/3-540-59046-3_11 %U http://dynamics.org/~altenber/PAPERS/GGEGPM/ %U http://dx.doi.org/doi:10.1007/3-540-59046-3_11 %P 205-259 %0 Unpublished Work %T Selection, generalized transmission, and the evolution of modifier genes. II. Modifier polymorphisms %A Altenberg, Lee %A Feldman, Marcus W. %D 1995 %F Altenberg:and:Feldman:1995SGTEMG2 %O In preparation %9 unpublished %U ftp://ftp.mhpcc.edu/pub/incoming/altenberg/LeeSGTEMG2MP.ps.Z %0 Book Section %T Modularity in Evolution: Some Low-Level Questions %A Altenberg, Lee %E Rasskin-Gutman, Diego %E Callebaut, Werner %B Modularity: Understanding the Development and Evolution of Complex Natural Systems %D 2005 %8 jun %I MIT Press %C Cambridge, MA, USA %@ 0-262-03326-7 %F Altenberg:2004:MESLLQ %X Intuitive notions about the advantages of modularity for evolvability run into the problem of how we parse the organism into traits. In order to resolve the question of multiplicity, there needs to be a way to get the human observer out of the way, and define modularity in terms of physical processes. I will offer two candidate ideas towards this resolution: the dimensionality of phenotypic variation, and the causal screening off of phenotypic variables by other phenotypic variables. With this framework, the evolutionary advantages that have been attributed to modularity do not derive from modularity per se. Rather, they require that there be an ’alignment’ between the spaces of phenotypic variation, and the selection gradients that are available to the organism. Modularity may facilitate such alignment, but it is not sufficient; the appropriate phenotype-fitness map in conjunction with the genotype-phenotype map is also necessary for evolvability. Conclusion I have endeavoured in this essay to delve into some of the low-level conceptual issues associated with the idea of modularity in the genotype-phenotype map. My main proposal is that the evolutionary advantages that have been attributed to modularity do not derive from modularity per se. Rather, they require that there be an ’alignment’ between the spaces of phenotypic variation, and the selection gradients that are available to the organism. Modularity in the genotype-phenotype map may make such an alignment more readily attained, but it is not sufficient; the appropriate phenotype-fitness map in conjunction with the genotype-phenotype map is also necessary for evolvability. %K genetic algorithms, genetic programming %U http://dynamics.org/Altenberg/FILES/LeeMESLLQ.pdf %P 99-128 %0 Book Section %T Open Problems in the Spectral Analysis of Evolutionary Dynamics %A Altenberg, Lee %E Menon, Anil %B Frontiers of Evolutionary Computation %S Genetic Algorithms And Evolutionary Computation Series %D 2004 %V 11 %I Kluwer Academic Publishers %C Boston, MA, USA %@ 1-4020-7524-3 %F Altenberg:2004:OPSAED %X For broad classes of selection and genetic operators, the dynamics of evolution can be completely characterised by the spectra of the operators that define the dynamics, in both infinite and finite populations. These classes include generalised mutation, frequency-independent selection, uniparental inheritance. Several open questions exist regarding these spectra: 1. For a given fitness function, what genetic operators and operator intensities are optimal for finding the fittest genotype? The concept of rapid first hitting time, an analog of Sinclair’s rapidly mixing Markov chains, is examined. 2. What is the relationship between the spectra of deterministic infinite population models, and the spectra of the Markov processes derived from them in the case of finite populations? 3. Karlin proved a fundamental relationship between selection, rates of transformation under genetic operators, and the consequent asymptotic mean fitness of the population. Developed to analyse the stability of polymorphisms in subdivided populations, the theorem has been applied to unify the reduction principle for self-adaptation, and has other applications as well. Many other problems could be solved if it were generalised to account for the interaction of different genetic operators. Can Karlin’s theorem on operator intensity be extended to account for mixed genetic operators? %K genetic algorithms, genetic programming %R doi:10.1007/1-4020-7782-3_4 %U http://dynamics.org/Altenberg/FILES/LeeOPSAED.pdf %U http://dx.doi.org/doi:10.1007/1-4020-7782-3_4 %P 73-102 %0 Journal Article %T Evolvability Suppression to Stabilize Far-Sighted Adaptations %A Altenberg, Lee %J Artificial Life %D 2005 %8 Fall %V 11 %N 3 %@ 1064-5462 %F altenberg:2004:ESSFSA %X The opportunistic character of adaptation through natural selection can lead to ‘evolutionary pathologies’—situations in which traits evolve that promote the extinction of the population. Such pathologies include imprudent predation and other forms of habitat over-exploitation or the ‘tragedy of the commons’, adaptation to temporally unreliable resources, cheating and other antisocial behaviour, infectious pathogen carrier states, parthenogenesis, and cancer, an intra-organismal evolutionary pathology. It is known that hierarchical population dynamics can protect a population from invasion by pathological genes. Can it also alter the genotype so as to prevent the generation of such genes in the first place, i.e. suppress the evolvability of evolutionary pathologies? A model is constructed in which one locus controls the expression of the pathological trait, and a series of modifier loci exist which can prevent the expression of this trait. It is found that multiple ‘evolvability checkpoint’ genes can evolve to prevent the generation of variants that cause evolutionary pathologies. The consequences of this finding are discussed. %K genetic algorithms %9 journal article %R doi:10.1162/106454605774270633 %U http://dx.doi.org/doi:10.1162/106454605774270633 %P 427-443 %0 Journal Article %T Mathematics awaits: commentary on ”Genetic Programming and Emergence” by Wolfgang Banzhaf %A Altenberg, Lee %J Genetic Programming and Evolvable Machines %D 2014 %8 mar %V 15 %N 1 %@ 1389-2576 %F Altenberg:2014:GPEM %X Banzhaf provides a portal to the subject of emergence, noting contentious concepts while not getting sucked into fruitless debate. Banzhaf refutes arguments against downward causation much as Samuel Johnson kicks a stone to refute Berkeley by pointing to concrete examples in genetic programming, such as the growth of repetitive patterns within programs. Repetitive patterns are theoretically predicted to emerge from the evolution of evolvability and robustness under subtree exchange. Selection and genetic operators are co-equal creators of these emergent phenomena. GP systems entirely formal, and thus their emergent phenomena are essentially mathematical. The emergence of Lagrangian distributions for tree shapes under subtree exchange, for example, gives a glimpse of the possibilities for mathematical understanding of emergence in GP. The mathematics underlying emergence in genetic programming should be pursued with vigour. %K genetic algorithms, genetic programming, Evolvability, Robustness, Subtree exchange, Mathematics, Matrix theory, Lagrange distribution %9 journal article %R doi:10.1007/s10710-013-9198-5 %U http://dx.doi.org/doi:10.1007/s10710-013-9198-5 %P 87-89 %0 Journal Article %T Evolvability and robustness in artificial evolving systems: three perturbations %A Altenberg, Lee %J Genetic Programming and Evolvable Machines %D 2014 %8 sep %V 15 %N 3 %@ 1389-2576 %F Altenberg:2014:GPEMb %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s10710-014-9223-3 %U http://dx.doi.org/doi:10.1007/s10710-014-9223-3 %P 275-280 %0 Book Section %T Evolutionary Computation %A Altenberg, Lee %E Kliman, Richard M. %B The Encyclopedia of Evolutionary Biology %D 2016 %V 2 %I Academic Press %C Oxford, UK %F Altenberg:2016:EC %X Evolutionary computation is a method of solving engineering problems using algorithms that mimic Darwinian natural selection and Mendelian genetics, applied especially to optimization problems that are difficult to solve from first principles. Earliest beginnings were in the 1950s, and by the mid-1990s it had developed as an academic field with its own journals, conferences, and faculty. Several phenomena discovered in evolutionary biology were also discovered in parallel in evolutionary computation, including the evolvability problem, genetic modification, constructive neutral evolution, and genetic robustness. The related field of artificial life focuses on computational systems in which replication, natural selection, and ecological interactions are all emergent. %K genetic algorithms, genetic programming, Crossover, Encoding, Evolutionary algorithm, Evolvability, Genetic algorithm, Genetic operator, No free lunch theorems, Objective function, Optimization, Representation, Search space, Selection operator, Simulated annealing %R doi:10.1016/B978-0-12-800049-6.00307-3 %U https://www.sciencedirect.com/science/article/pii/B9780128000496003073 %U http://dx.doi.org/doi:10.1016/B978-0-12-800049-6.00307-3 %P 40-47 %0 Journal Article %T Probing the axioms of evolutionary algorithm design: Commentary on “On the mapping of genotype to phenotype in evolutionary algorithms” by Peter A. Whigham, Grant Dick, and James Maclaurin %A Altenberg, Lee %J Genetic Programming and Evolvable Machines %D 2017 %8 sep %V 18 %N 3 %@ 1389-2576 %F Altenberg:2017:GPEM %O Special Peer Commentary on Mapping of Genotype to Phenotype in Evolutionary Algorithms %X Properties such as continuity, locality, and modularity may seem necessary when designing representations and variation operators for evolutionary algorithms, but a closer look at what happens when evolutionary algorithms perform well reveals counterexamples to such schemes. Moreover, these variational properties can themselves evolve in sufficiently complex open-ended systems. These properties of evolutionary algorithms remain very much open questions. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s10710-017-9290-3 %U http://dx.doi.org/doi:10.1007/s10710-017-9290-3 %P 363-367 %0 Journal Article %T Automatic Generation and Evaluation of Recombination Games. Doctoral Dissertation by Cameron Browne, Review %A Althoefer, Ingo %J ICGA Journal %D 2010 %V 33 %N 4 %F Althoefer:2010:ICGA %K genetic algorithms, genetic programming %9 journal article %U https://chessprogramming.wikispaces.com/ICGA+Journal %0 Journal Article %T Prediction models for marshall mix parameters using bio-inspired genetic programming and deep machine learning approaches: A comparative study %A Althoey, Fadi %A Akhter, Muhammad Naveed %A Nagra, Zohaib Sattar %A Awan, Hamad Hassan %A Alanazi, Fayez %A Khan, Mohsin Ali %A Javed, Muhammad Faisal %A Eldin, Sayed M. %A Ozkilic, Yasin Onuralp %J Case Studies in Construction Materials %D 2023 %V 18 %@ 2214-5095 %F ALTHOEY:2023:cscm %X This research study uses four machine learning techniques, i.e., Multi Expression programming (MEP), Artificial Neural Network (ANN), Adaptive Neuro-Fuzzy Inference System (ANFIS), and Ensemble Decision Tree Bagging (DT-Bagging) for the development of new and advanced models for prediction of Marshall Stability (MS), and Marshall Flow (MF) of asphalt mixes. A comprehensive and detailed database of 343 data points was established for both MS and MF. The predicting variables were chosen among the four most influential, and easy-to-determine parameters. The models were trained, tested, validated, and the outcomes of the newly developed models were compared with actual outcomes. The root squared error (RSE), Nash-Sutcliffe efficiency (NSE), mean absolute error (MAE), root mean square error (RMSE), relative root mean square error (RRMSE), regression coefficient (R2), and correlation coefficient (R), were all used to evaluate the performance of models. The sensitivity analysis (SA) revealed that in the case of MS, the rising order of input significance was bulk specific gravity of compacted aggregate, Gmb (38.56 percent) > Percentage of Aggregates, Ps (19.84 percent) > Bulk Specific Gravity of Aggregate, Gsb (19.43 percent) > maximum specific gravity paving mix, Gmm (7.62 percent), while in case of MF the order followed was: Ps (36.93 percent) > Gsb (14.11 percent) > Gmb (10.85 percent) > Gmm (10.19 percent). The outcomes of parametric analysis (PA) consistency of results in relation to previous research findings. The DT-Bagging model outperformed all other models with values of 0.971 and 0.980 (R), 16.88 and 0.24 (MAE), 28.27 and 0.36 (RMSE), 0.069 and 0.041 (RSE), 0.020 and 0.032 (RRMSE), 0.010 and 0.016 (PI), 0.931 and 0.959 (NSE), for MS and MF, respectively. The results of the comparison analysis showed that ANN, ANFIS, MEP, and DT-Bagging are all effective and reliable approaches for the estimation of MS and MF. The MEP-derived mathematical expressions represent the novelty of MEP and are relatively simple and reliable. Roverall values for MS and MF were in the order of DT-Bagging >MEP >ANFIS >ANN with all values exceeding the permitted range of 0.80 for both MS and MF. Hence, all the modeling approaches showed higher performance, possessed high generalization and predication capabilities, and assess the relative significance of input parameters in the prediction of MS and MF. Hence, the findings of this research study would assist in the safer, faster, and sustainable prediction of MS and MF, from the standpoint of resources and time required to perform the Marshall tests %K genetic algorithms, genetic programming, Marshall Mix Parameter, Deep Learning, Prediction models, Asphalt, Bio-Inspired models %9 journal article %R doi:10.1016/j.cscm.2022.e01774 %U https://www.sciencedirect.com/science/article/pii/S2214509522009068 %U http://dx.doi.org/doi:10.1016/j.cscm.2022.e01774 %P e01774 %0 Journal Article %T Evolutionary data-modelling of an innovative low reflective vertical quay %A Altomare, C. %A Gironella, X. %A Laucelli, D. %J Journal of Hydroinformatics %D 2013 %8 January %V 15 %N 3 %F Altomare:2013:JoH %X Vertical walls are commonly used as berthing structures. However, conventional vertical quays may have serious technical and environmental problems, as they reflect almost all the energy of the incident waves, thus affecting operational conditions and structural strength. These drawbacks can be overcome by the use of low reflective structures, but for some instances no theoretical equations exist to determine the relationship between the reflection coefficient and parameters that affect the structural response. Therefore, this study tries to fill this gap by examining the wave reflection of an absorbing gravity wall by means of evolutionary polynomial regression, a hybrid evolutionary modelling paradigm that combines the best features of conventional numerical regression and genetic programming. The method implements a multi-modelling approach in which a multi-objective genetic algorithm is used to get optimal models in terms of parsimony of mathematical expressions and fitting to data. A database of physical laboratory observations is used to predict the reflection as a function of a set of variables that characterize wave conditions and structure features. The proposed modelling paradigm proved to be a useful tool for data analysis and is able to find feasible explicit models featured by an appreciable generalization performance. %K genetic algorithms, genetic programming, data-mining, evolutionary polynomial regression, low reflective vertical quay, wave reflection %9 journal article %R doi:10.2166/hydro.2012.219 %U https://iwaponline.com/jh/article-pdf/15/3/763/387059/763.pdf %U http://dx.doi.org/doi:10.2166/hydro.2012.219 %P 763-779 %0 Journal Article %T Determination of Semi-Empirical Models for Mean Wave Overtopping Using an Evolutionary Polynomial Paradigm %A Altomare, Corrado %A Laucelli, Daniele B. %A Mase, Hajime %A Gironella, Xavi %J Journal of Marine Science and Engineering %D 2020 %V 8 %N 8 %@ 2077-1312 %F altomare:2020:JMSE %X The present work employs the so-called Evolutionary Polynomial Regression (EPR) algorithm to build up a formula for the assessment of mean wave overtopping discharge for smooth sea dikes and vertical walls. EPR is a data-mining tool that combines and integrates numerical regression and genetic programming. This technique is here employed to dig into the relationship between the mean discharge and main hydraulic and structural parameters that characterise the problem under study. The parameters are chosen based on the existing and most used semi-empirical formulas for wave overtopping assessment. Besides the structural freeboard or local wave height, the unified models highlight the importance of local water depth and wave period in combination with foreshore slope and dike slope on the overtopping phenomena, which are combined in a unique parameter that is defined either as equivalent or imaginary slope. The obtained models aim to represent a trade-off between accuracy and parsimony. The final formula is simple but can be employed for a preliminary assessment of overtopping rates, covering the full range of dike slopes, from mild to vertical walls, and of water depths from the shoreline to deep water, including structures with emergent toes. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/jmse8080570 %U https://www.mdpi.com/2077-1312/8/8/570 %U http://dx.doi.org/doi:10.3390/jmse8080570 %0 Conference Proceedings %T Combining different meta-heuristics to improve the predictability of a Financial Forecasting algorithm %A Aluko, Babatunde %A Smonou, Dafni %A Kampouridis, Michael %A Tsang, Edward %S IEEE Conference on Computational Intelligence for Financial Engineering Economics (CIFEr 2104) %D 2014 %8 27 28 mar %F Aluko:2014:CIFEr %X Hyper-heuristics have successfully been applied to a vast number of search and optimisation problems. One of the novelties of hyper-heuristics is the fact that they manage and automate the meta-heuristic’s selection process. In this paper, we implemented and analysed a hyper-heuristic framework on three meta-heuristics namely Simulated Annealing, Tabu Search, and Guided Local Search, which had successfully been applied in the past to a Financial Forecasting algorithm called EDDIE. EDDIE uses Genetic Programming to extract and learn from historical data in order to predict future financial market movements. Results show that the algorithm’s effectiveness has been improved, thus making the combination of meta-heuristics under a hyper-heuristic framework an effective Financial Forecasting approach. %K genetic algorithms, genetic programming %R doi:10.1109/CIFEr.2014.6924092 %U http://dx.doi.org/doi:10.1109/CIFEr.2014.6924092 %P 333-340 %0 Journal Article %T Multi-objective optimization of an engine mount design by means of memetic genetic programming and a local exploration approach %A Alvarado-Iniesta, Alejandro %A Guillen-Anaya, Luis Gonzalo %A Rodriguez-Picon, Luis Alberto %A Neco-Caberta, Raul %J Journal of Intelligent Manufacturing %D 2020 %8 jan %V 31 %F alvarado-iniesta:JoIM %X the optimization of an engine mount design from a multi-objective. Our methodology is divided into three phases: phase one focuses on data collection through computer simulations. The objectives considered during the analyses are: total mass, first natural frequency and maximum von Mises stress. In phase two, a surrogate model by means of genetic programming is generated for each one of the objectives. Moreover, a local search procedure is incorporated into the overall genetic programming algorithm for improving its performance. Finally, in phase three, instead of steering the search to finding the approximate Pareto front, a local exploration approach based on a change in the weight space is used to lead a search into user defined directions turning the decision making more intuitive. %K genetic algorithms, genetic programming, Structural optimization, Multi-objective optimization, Finite element analysis, Decision making %9 journal article %R doi:10.1007/s10845-018-1432-9 %U http://link.springer.com/article/10.1007/s10845-018-1432-9 %U http://dx.doi.org/doi:10.1007/s10845-018-1432-9 %P 19-32 %0 Journal Article %T Multi-objective optimization of an aluminum torch brazing process by means of genetic programming and R-NSGA-II %A Alvarado-Iniesta, Alejandro %A Tlapa-Mendoza, Diego A. %A Limon-Romero, Jorge %A Mendez-Gonzalez, Luis C. %J The International Journal of Advanced Manufacturing Technology %D 2017 %V 91 %N 9 - 12 %F alvarado-iniesta:2017:IJAMT %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s00170-017-0102-y %U http://link.springer.com/article/10.1007/s00170-017-0102-y %U http://dx.doi.org/doi:10.1007/s00170-017-0102-y %0 Journal Article %T Forecasting front displacements with a satellite based ocean forecasting (SOFT) system %A Alvarez, A. %A Orfila, Alejandro %A Basterretxea, G. %A Tintore, J. %A Vizoso, G. %A Fornes, A. %J Journal of Marine Systems %D 2007 %8 mar %V 65 %N 1-4 %F Alvarez:2007:JMS %O Marine Environmental Monitoring and Prediction - Selected papers from the 36th International Liege Colloquium on Ocean Dynamics %X Relatively long term time series of satellite data are nowadays available. These spatiotemporal time series of satellite observations can be employed to build empirical models, called satellite based ocean forecasting (SOFT) systems, to forecast certain aspects of future ocean states. The forecast skill of SOFT systems predicting the sea surface temperature (SST) at sub-basin spatial scale (from hundreds to thousand kilometres), has been extensively explored in previous works. Thus, these works were mostly focused on predicting large scale patterns spatially stationary. At spatial scales smaller than sub-basin (from tens to hundred kilometres), spatiotemporal variability is more complex and propagating structures are frequently present. In this case, traditional SOFT systems based on Empirical Orthogonal Function (EOF) decompositions could not be optimal prediction systems. Instead, SOFT systems based on Complex Empirical Orthogonal Functions (CEOFs) are, a priori, better candidates to resolve these cases. In this work we study and compare the performance of an EOF and CEOF based SOFT systems forecasting the SST at weekly time scales of a propagating mesoscale structure. The SOFT system was implemented in an area of the Northern Balearic Sea (Western Mediterranean Sea) where a moving frontal structure is recurrently observed. Predictions from both SOFT systems are compared with observations and with the predictions obtained from persistence models. Results indicate that the implemented SOFT systems are superior in terms of predictability to persistence. No substantial differences have been found between the EOF and CEOF-SOFT systems. %K genetic algorithms, genetic programming, Satellite data, Ocean prediction, Front evolution %9 journal article %R doi:10.1016/j.jmarsys.2005.11.017 %U http://dx.doi.org/doi:10.1016/j.jmarsys.2005.11.017 %P 299-313 %0 Book Section %T Standard Versus Micro-Genetic Algorithms for Seismic Trace Inversion %A Alvarez, Gabriel %E Koza, John R. %B Genetic Algorithms and Genetic Programming at Stanford 2003 %D 2003 %8 April %I Stanford Bookstore %C Stanford, California, 94305-3079 USA %F alvarez:2003:SVMASTI %K genetic algorithms %P 1-10 %0 Conference Proceedings %T Human-inspired Scaling in Learning Classifier Systems: Case Study on the n-bit Multiplexer Problem Set %A Alvarez, Isidro M. %A Browne, Will N. %A Zhang, Mengjie %Y Friedrich, Tobias %Y Neumann, Frank %Y Sutton, Andrew M. %Y Middendorf, Martin %Y Li, Xiaodong %Y Hart, Emma %Y Zhang, Mengjie %Y Akimoto, Youhei %Y Bosman, Peter A. N. %Y Soule, Terry %Y Miikkulainen, Risto %Y Loiacono, Daniele %Y Togelius, Julian %Y Lopez-Ibanez, Manuel %Y Hoos, Holger %Y Handl, Julia %Y Gomez, Faustino %Y Fonseca, Carlos M. %Y Trautmann, Heike %Y Moraglio, Alberto %Y Punch, William F. %Y Krawiec, Krzysztof %Y Vasicek, Zdenek %Y Jansen, Thomas %Y Smith, Jim %Y Ludwig, Simone %Y Merelo, J. J. %Y Naujoks, Boris %Y Alba, Enrique %Y Ochoa, Gabriela %Y Poulding, Simon %Y Sudholt, Dirk %Y Koetzing, Timo %S GECCO ’16: Proceedings of the 2016 Annual Conference on Genetic and Evolutionary Computation %D 2016 %8 20 24 jul %I ACM %C Denver, USA %F Alvarez:2016:GECCO %X Learning classifier systems (LCSs) originated from artificial cognitive systems research, but migrated such that LCS became powerful classification techniques. Modern LCSs can be used to extract building blocks of knowledge in order to solve more difficult problems in the same or a related domain. The past work showed that the reuse of knowledge through the adoption of code fragments, GP-like sub-trees, into the XCS learning classifier system framework could provide advances in scaling. However, unless the pattern underlying the complete domain can be described by the selected LCS representation of the problem, a limit of scaling will eventually be reached. This is due to LCSs divide and conquer approach rule-based solutions, which entails an increasing number of rules (subclauses) to describe a problem as it scales. Inspired by human problem solving abilities, the novel work in this paper seeks to reuse learned knowledge and learned functionality to scale to complex problems by transferring them from simpler problems. Progress is demonstrated on the benchmark Multiplexer (Mux) domain, albeit the developed approach is applicable to other scalable domains. The fundamental axioms necessary for learning are proposed. The methods for transfer learning in LCSs are developed. Also, learning is recast as a decomposition into a series of sub-problems. Results show that from a conventional tabula rasa, with only a vague notion of what subordinate problems might be relevant, it is possible to learn a general solution to any n-bit Mux problem for the first time. This is verified by tests on the 264, 521 and 1034 bit Mux problems. %K genetic algorithms, genetic programming %R doi:10.1145/2908812.2908813 %U http://dx.doi.org/doi:10.1145/2908812.2908813 %P 429-436 %0 Conference Proceedings %T Application of Genetic Programming to the Choice of a Structure of Global Approximations %A Alvarez, Luis F. %A Toropov, Vassili V. %Y Koza, John R. %S Late Breaking Papers at the Genetic Programming 1998 Conference %D 1998 %8 22 25 jul %I Stanford University Bookstore %C University of Wisconsin, Madison, Wisconsin, USA %F alvarez:1998: %K genetic algorithms, genetic programming %P 1 %0 Conference Proceedings %T Approximation model building using genetic programming methodology: applications %A Alvarez, Luis F. %A Toropov, Vassili V. %A Hughes, David C. %A Ashour, Ashraf F. %Y Baranger, Thouraya %Y van Keulen, Fred %S Second ISSMO/AIAA Internet Conference on Approximations and Fast Reanalysis in Engineering Optimization %D 2000 %8 25 may 2 jun %G en %F oai:CiteSeerPSU:512359 %X Genetic Programming methodology is used for the creation of approximation functions obtained by the response surface methodology. Two important aspects of the problems are addressed: the choice of the plan of experiment and the model tuning using the least-squares response surface fitting. Several examples show the applications of the technique to problems where the values of response functions are obtained either by numerical simulation or laboratory experimentation. %K genetic algorithms, genetic programming %U http://www-tm.wbmt.tudelft.nl/~wbtmavk/2aro_conf/Toropov/Fred4.pdf %0 Thesis %T Design Optimization based on Genetic Programming %A Alvarez, L. F. %D 2000 %C UK %C Department of Civil and Environmental Engineering, University of Bradford %F Alvarez:thesis %X This thesis addresses two problems arising in many real-life design optimization applications: the high computational cost of function evaluations and the presence of numerical noise in the function values. The response surface methodology is used to construct approximations of the original model. A major difficulty in building highly accurate response surfaces is the selection of the structure of an approximation function. A methodology has been developed for the approximation model building using genetic programming. It is implemented in a computer code introducing two new features: the use of design sensitivity information when available, and the allocation and evaluation of tuning parameters in separation from the evolutionary process. A combination of a genetic algorithm and a gradient-based algorithm is used for tuning of the approximation functions. The problem of the choice of a design of experiments in the response surface methodology has been reviewed and a space-filling plan adopted. The developed methodology and software have been applied to design optimization problems with numerically simulated and experimental responses, demonstrating their considerable potential. The applications cover the approximation of a response function obtained by a finite element model for the detection of damage in steel frames, the creation of an empirical model for the prediction of the shear strength in concrete deep beams and a multicriteria optimization of the process of calcination of Roman cement. %K genetic algorithms, genetic programming, Design Optimization, Response Surface Methodology %9 Ph.D. thesis %U http://www.brad.ac.uk/staff/vtoropov/burgeon/thesis_luis/abstract.pdf %0 Journal Article %T Forecasting exchange rates using genetic algorithms %A Alvarez-Diaz, Marcos %A Alvarez, Alberto %J Applied Economics Letters %D 2003 %8 apr %V 10 %N 6 %F Alvarez-Diaz:2003:ael %X A novel approach is employed to investigate the predictability of weekly data on the euro/dollar, British pound/dollar, Deutsch mark/dollar, Japanese yen/dollar, French franc/dollar and Canadian dollar/dollar exchange rates. A functional search procedure based on the Darwinian theories of natural evolution and survival, called genetic algorithms (hereinafter GA), was used to find an analytical function that best approximates the time variability of the studied exchange rates. In all cases, the mathematical models found by the GA predict slightly better than the random walk model. The models are heavily dominated by a linear relationship with the most recent past value, while contributions from nonlinear terms to the total forecasting performance are rather small. In consequence, the results agree with previous works establishing explicitly that nonlinear nature of exchange rates cannot be exploited to substantially improve forecasting. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1080/13504850210158250 %U http://dx.doi.org/doi:10.1080/13504850210158250 %P 319-322 %0 Journal Article %T Genetic multi-model composite forecast for non-linear prediction of exchange rates %A Alvarez-Diaz, Marcos %A Alvarez, Alberto %J Empirical Economics %D 2005 %8 oct %V 30 %N 3 %@ 0377-7332 %F Alvarez-Diaz:2005:EE %X The existence of non-linear deterministic structures in the dynamics of exchange rates has already been amply demonstrated. In this paper, we attempt to exploit these non-linear structures employing forecasting techniques, such as Genetic Programming and Neural Networks, in the specific case of the Yen/US$ and Pound Sterling/US$ exchange rates. Forecasts obtained from genetic programming and neural networks are then genetically fused to verify whether synergy provides an improvement in the predictions. Our analysis considers both point predictions and the anticipating of either depreciations or appreciations. %K genetic algorithms, genetic programming, Composite-forecast or data-fusion, neural networks, ANN, exchange-rate forecasting %9 journal article %R doi:10.1007/s00181-005-0249-5 %U http://dx.doi.org/doi:10.1007/s00181-005-0249-5 %P 643-663 %0 Journal Article %T Using Genetic Algorithms to Estimate and Validate Bioeconomic Models: The Case of the Ibero-atlantic Sardine Fishery %A Alvarez-Diaz, Marcos %A Dominquez-Torreiro, Marcos %J Journal of Bioeconomics %D 2006 %8 apr %V 8 %N 1 %@ 1387-6996 %F Alvarez-Diaz:2006:jbe %X The Neo-classical approach to fisheries management is based on designing and applying bioeconomic models. Traditionally, the basic bioeconomic models have used pre-established non-linear functional forms (logistic, Cobb-Douglas) in order to try to reflect the dynamics of the renewable resources under study. This assumption might cause misspecification problems and, in consequence, a loss of predictive ability. In this work we intend to verify if there is a bias motivated by employing the said non-linear parametric perspective. For this purpose, we employ a novel non-linear and non-parametric prediction method, called Genetic Algorithms, and we compare its results with those obtained from the traditional methods. %K genetic algorithms, genetic programming, bioeconomic modeling, linear and non-linear forecasting %9 journal article %R doi:10.1007/s10818-005-0494-x %U http://dx.doi.org/doi:10.1007/s10818-005-0494-x %P 55-65 %0 Thesis %T Exchange rates forecasting using nonparametric methods %A Alvarez-Diaz, Marcos %D 2006 %C New York, NY, USA %C Columbia University %F Marcos_Alvarez-Diaz:thesis %X The existence of non-linear deterministic structures in the dynamics of exchange rates has already been amply demonstrated in the literature. With my research, I try to explain if we can exploit these non-linear structures in order to improve our predictive ability and, secondly, if we can use these predictions to generate profitable strategies in the Foreign Exchange Market. To this purpose, I employ different nonparametric forecasting methods such as Nearest Neighbours, Genetic Programming, Artificial Neural Networks, Data-Fusion or an Evolutionary Neural Network. My analysis will be centre on the specific case of the Yen/US$ and Pound Sterling/US$ exchange rates and it considers both point predictions and the anticipating of either depreciations or appreciations. My results reveal a slight forecasting ability for one-period-ahead which is lost when more periods ahead are considered, and my trading strategy obtains above-normal profits. However, when transaction costs are incorporated, the profits practically disappear or become negative %K genetic algorithms, genetic programming %9 Ph.D. thesis %U http://search.proquest.com/docview/305345652 %0 Journal Article %T The quality of institutions: A genetic programming approach %A Alvarez-Diaz, Marcos %A Caballero Miguez, Gonzalo %J Economic Modelling %D 2008 %V 25 %N 1 %@ 0264-9993 %F AlvarezDiaz2008161 %X The new institutional economics has studied the determinants of the quality of institutions. Traditionally, the majority of the empirical literature has adopted a parametric and linear approach. These forms impose ad hoc functional structures, sometimes introducing relationships between variables that are forced and misleading. This paper analyses the determinants of the quality of institutions using a non-parametric and non-linear approach. Specifically, we employ a Genetic Program (GP) to study the functional relation between the quality of institutions and a set of historical, economical, geographical, religious and social variables. Besides this, we compare the obtained results with those employing a parametric perspective (Ordinary Least Square Regression). Following the empirical results of our application, we can conclude that the parametric perspective adopted in previous papers about institutional quality could be accurate. %K genetic algorithms, genetic programming, Quality of institutions, Institutional determinants, Non-parametric perspective %9 journal article %R doi:10.1016/j.econmod.2007.05.001 %U http://www.sciencedirect.com/science/article/B6VB1-4P0VD80-1/2/c0bb8da3af64aa1ea6b0a4f90e4790b0 %U http://dx.doi.org/doi:10.1016/j.econmod.2007.05.001 %P 161-169 %0 Report %T The institutional determinants of CO2 emissions: A computational modelling approach using Artificial Neural Networks and Genetic Programming %A Alvarez-Diaz, Marcos %A Caballero Miguez, Gonzalo %A Solino, Mario %D 2008 %8 jul %N 401 %I Fundacion de las Cajas de Ahorros %C Madrid %F Alvarez-Diaz:funcas401 %K genetic algorithms, genetic programming, ANN %9 FUNCAS Working Paper %U https://dialnet.unirioja.es/ejemplar/212749 %0 Journal Article %T Forecasting tourist arrivals to Balearic Islands using genetic programming %A Alvarez-Diaz, Marcos %A Mateu-Sbert, Josep %A Rossello-Nadal, Jaume %J International Journal of Computational Economics and Econometrics %D 2009 %8 nov 06 %V 1 %N 1 %I Inderscience Publishers %@ 1757-1189 %F Alvarez-Diaz:2009:IJCEE %X Traditionally, univariate time-series models have largely dominated forecasting for international tourism demand. In this paper, the ability of a genetic program (GP) to predict monthly tourist arrivals from UK and Germany to Balearic Islands, Spain is explored. GP has already been employed satisfactorily in different scientific areas, including economics. The technique shows different advantages regarding to other forecasting methods. Firstly, it does not assume a priori a rigid functional form of the model. Secondly, it is more robust and easy-to-use than other non-parametric methods. Finally, it provides explicitly a mathematical equation which allows a simple ad hoc interpretation of the results. Comparing the performance of the proposed technique against other method commonly used in tourism forecasting (no-change model, moving average and ARIMA), the empirical results reveal that GP can be a valuable tool in this field. %K genetic algorithms, genetic programming, tourism forecasting, Diebold-Mariano test, tourist arrivals, Balearic Islands, UK, United Kingdom, Germany, Spain %9 journal article %R doi:10.1504/IJCEE.2009.029153 %U http://www.inderscience.com/link.php?id=29153 %U http://dx.doi.org/doi:10.1504/IJCEE.2009.029153 %P 64-75 %0 Journal Article %T On dichotomous choice contingent valuation data analysis: Semiparametric methods and Genetic Programming %A Alvarez Diaz, Marcos %A Gomez, Manuel Gonzalez %A Saavedra Gonzalez, Angeles %A De Una Alvarez, Jacobo %J Journal of Forest Economics %D 2010 %8 apr %V 16 %N 2 %@ 1104-6899 %F AlvarezDiaz2009 %X The aim of this paper is twofold. Firstly, we introduce a novel semi-parametric technique called Genetic Programming to estimate and explain the willingness to pay to maintain environmental conditions of a specific natural park in Spain. To the authors’ knowledge, this is the first time in which Genetic Programming is employed in contingent valuation. Secondly, we investigate the existence of bias due to the functional rigidity of the traditional parametric techniques commonly employed in a contingent valuation problem. We applied standard parametric methods (logit and probit) and compared with results obtained using semi parametric methods (a proportional hazard model and a genetic program). The parametric and semiparametric methods give similar results in terms of the variables finally chosen in the model. Therefore, the results confirm the internal validity of our contingent valuation exercise. %K genetic algorithms, genetic programming, Dichotomous choice contingent valuation, Genetic program, Parametric techniques, Proportional hazard model %9 journal article %R doi:10.1016/j.jfe.2009.02.002 %U http://dx.doi.org/doi:10.1016/j.jfe.2009.02.002 %P 145-156 %0 Journal Article %T Forecasting exchange rates using local regression %A Alvarez-Diaz, Marcos %A Alvarez, Alberto %J Applied Economics Letters %D 2010 %8 mar %V 17 %N 5 %@ 1350-4851 %F Alvarez-Diaz:2010:AEL %X In this article we use a generalisation of the standard nearest neighbours, called local regression (LR), to study the predictability of the yen/US dollar and pound sterling/US dollar exchange rates. We also compare our results with those previously obtained with global methods such as neural networks, genetic programming, data fusion and evolutionary neural networks. We want to verify if we can generalise to the exchange rate forecasting problem the belief that local methods beat global ones. %K genetic algorithms, genetic programming, local search %9 journal article %R doi:10.1080/13504850801987217 %U http://hdl.handle.net/10261/54902 %U http://dx.doi.org/doi:10.1080/13504850801987217 %P 509-514 %0 Journal Article %T Speculative strategies in the foreign exchange market based on genetic programming predictions %A Alvarez Diaz, Marcos %J Applied Financial Economics %D 2010 %8 mar %V 20 %N 6 %F Alvarez-Diaz:2010:AFE %X In this article, we investigate the out-of-sample forecasting ability of a Genetic Program (GP) to approach the dynamic evolution of the yen/US dollar and British pound/US dollar exchange rates, and verify whether the method can beat the random walk model. Later on, we use the predicted values to generate a trading rule and we check the possibility of obtaining extraordinary profits in the foreign exchange market. Our results reveal a slight forecasting ability for one-period-ahead, which is lost when more periods ahead are considered. On the other hand, our trading strategy obtains above-normal profits. However, when transaction costs are incorporated, the profits practically disappear or become negative. %K genetic algorithms, genetic programming %9 journal article %R doi:10.1080/09603100903459782 %U http://dx.doi.org/doi:10.1080/09603100903459782 %P 465-476 %0 Journal Article %T The institutional determinants of CO2 emissions: a computational modeling approach using Artificial Neural Networks and Genetic Programming %A Alvarez-Diaz, Marcos %A Caballero-Miguez, Gonzalo %A Solino, Mario %J Environmetrics %D 2011 %8 feb %V 22 %N 1 %F Alvarez-Diaz:2011:EM %X Understanding the complex process of climate change implies the knowledge of all possible determinants of CO2 emissions. This paper studies the influence of several institutional determinants on CO2 emissions, clarifying which variables are relevant to explain this influence. For this aim, Genetic Programming and Artificial Neural Networks are used to find an optimal functional relationship between the CO2 emissions and a set of historical, economic, geographical, religious, and social variables, which are considered as a good approximation to the institutional quality of a country. Besides this, the paper compares the results using these computational methods with that employing a more traditional parametric perspective: ordinary least squares regression (OLS). Following the empirical results of the cross-country application, this paper generates new evidence on the binomial institutions and CO2 emissions. Specifically, all methods conclude a significant influence of ethnolinguistic fractionalization (ETHF) on CO2 emissions. %K genetic algorithms, genetic programming, artificial neural networks, ANN, computational methods, CO2 emissions, institutional determinants %9 journal article %R doi:10.1002/env.1025 %U https://doi.org/10.1002/env.1025 %U http://dx.doi.org/doi:10.1002/env.1025 %P 42-49 %0 Journal Article %T Forecasting International Tourism Demand Using a Non-Linear Autoregressive Neural Network and Genetic Programming %A Alvarez-Diaz, Marcos %A Gonzalez-Gomez, Manuel %A Otero-Giraldez, Maria Soledad %J Forecasting %D 2019 %V 1 %N 1 %@ 2571-9394 %F Alvarez-Diaz:2019:Forecasting %O Special Issue Applications of Forecasting by Hybrid Artificial Intelligent Technologies %X This study explores the forecasting ability of two powerful non-linear computational methods: artificial neural networks and genetic programming. We use as a case of study the monthly international tourism demand in Spain, approximated by the number of tourist arrivals and of overnight stays. The forecasting results reveal that non-linear methods achieve slightly better predictions than those obtained by a traditional forecasting technique, the seasonal autoregressive integrated moving average (SARIMA) approach. This slight forecasting improvement was close to being statistically significant. Forecasters must judge whether the high cost of implementing these computational methods is worthwhile. %K genetic algorithms, genetic programming, ANN, international tourism demand forecasting, artificial neural networks, SARIMA, spain %9 journal article %R doi:10.3390/forecast1010007 %U https://www.mdpi.com/2571-9394/1/1/7/ %U http://dx.doi.org/doi:10.3390/forecast1010007 %P 90-106 %0 Journal Article %T Is it possible to accurately forecast the evolution of Brent crude oil prices? An answer based on parametric and nonparametric forecasting methods %A Alvarez-Diaz, Marcos %J Empirical Economics %D 2020 %8 sep %V 59 %F Alvarez-Diaz:2020:EE %X Can we accurately predict the Brent oil price? If so, which forecasting method can provide the most accurate forecasts? To unravel these questions, we aim at predicting the weekly Brent oil price growth rate by using several forecasting methods that are based on different approaches. Basically, we assess and compare the out-of-sample performances of linear parametric models (the ARIMA, the ARFIMA and the autoregressive model), a nonlinear parametric model (the GARCH-in-Mean model) and different nonparametric data-driven methods (a nonlinear autoregressive artificial neural network, genetic programming and the nearest-neighbor method). The results obtained show that (1) all methods are capable of predicting accurately both the value and the directional change in the Brent oil price, (2) there are no significant forecasting differences among the methods and (3) the volatility of the series could be an important factor to enhance our predictive ability. %K genetic algorithms, genetic programming, ANN, KNN, oil price, Forecasting, ARIMA, M-GARCH, Neural networks, Nearest-neighbour method %9 journal article %R doi:10.1007/s00181-019-01665-w %U http://dx.doi.org/doi:10.1007/s00181-019-01665-w %P 1285-1305 %0 Generic %T Optimizing Neural Architecture Search using Limited GPU Time in a Dynamic Search Space: A Gene Expression Programming Approach %A Alves, Jeovane Honorio %A de Oliveira, Lucas Ferrari %D 2020 %I arXiv %F journals/corr/abs-2005-07669 %K genetic algorithms, genetic programming, gene expression programming, GPU %U https://arxiv.org/abs/2005.07669 %0 Generic %T Implementing Genetic Algorithms on Arduino Micro-Controllers %A Alves, Nuno %D 2010 %8 feb 09 %F oai:arXiv.org:1002.2012 %X Since their conception in 1975, Genetic Algorithms have been an extremely popular approach to find exact or approximate solutions to optimisation and search problems. Over the last years there has been an enhanced interest in the field with related techniques, such as grammatical evolution, being developed. Unfortunately, work on developing genetic optimisations for low-end embedded architectures hasn’t embraced the same enthusiasm. This short paper tackles that situation by demonstrating how genetic algorithms can be implemented in Arduino Duemilanove, a 16 MHz open-source micro-controller, with limited computation power and storage resources. As part of this short paper, the libraries used in this implementation are released into the public domain under a GPL license. %K genetic algorithms, computer science, neural and evolutionary computing %U http://arxiv.org/abs/1002.2012 %0 Journal Article %T Prediction of biodiesel physico-chemical properties from its fatty acid composition using genetic programming %A Alviso, Dario %A Artana, Guillermo %A Duriez, Thomas %J Fuel %D 2020 %V 264 %@ 0016-2361 %F ALVISO:2020:Fuel %X This paper presents regression analysis of biodiesel physico-chemical properties as a function of fatty acid composition using an experimental database. The study is done by using 48 edible and non-edible oils-based biodiesel available data. Regression equations are presented as a function of fatty acid composition (saturated and unsaturated methyl esters). The physico-chemical properties studied are kinematic viscosity, flash point, cloud point, pour point (PP), cold filter plugging point, cetane (CN) and iodine numbers. The regression technique chosen to carry out this work is genetic programming (GP). Unlike multiple linear regression (MLR) strategies available in literature, GP provides generic, non-parametric regression among variables. For all properties analyzed, the performance of the regression is systematically better for GP than MLR. Indeed, the RSME related to the experimental database is lower for GP models, from approx3percent for CN to approx55percent for PP, in comparison to the best MLR model for each property. Particularly, most GP regression models reproduce correctly the dependence of properties on the saturated and unsaturated methyl esters %K genetic algorithms, genetic programming, Biodiesel, Fatty acid, Properties, Regression analysis %9 journal article %R doi:10.1016/j.fuel.2019.116844 %U http://www.sciencedirect.com/science/article/pii/S0016236119321982 %U http://dx.doi.org/doi:10.1016/j.fuel.2019.116844 %P 116844 %0 Journal Article %T Regressions of the dielectric constant and speed of sound of vegetable oils from their composition and temperature using genetic programming %A Alviso, Dario %A Zarate, Cristhian %A Artana, Guillermo %A Duriez, Thomas %J Journal of Food Composition and Analysis %D 2021 %V 104 %@ 0889-1575 %F ALVISO:2021:JFCA %X The dielectric constant (DC) and speed of sound (SoS) have been measured in many studies on vegetable oils (VOs). These measurements can be applied for quality control, for the detection of contaminants, and in works related to heated and frying VOs. There are several hundreds of VOs with potential use in the food industry, and for most of them, the DC and SoS values are not yet available. This paper proposes regression models of the DC and SoS of VOs as a function of their composition (saturated and unsaturated fatty acids) and the temperature. A regression study was conducted using available experimental databases including a total of 57 and 56 data in the range of 20-50 degreeC for the DC and SoS, respectively. The equations are obtained using genetic programming (GP). The goal is to minimize the mean absolute error (MAE) between the values of the measured and predicted DC and SoS for several VOs. The resulting GP regression equations reproduce correctly the dependencies of the DC and SoS of VOs on the saturated and unsaturated fatty acids. The validation of these equations is carried out by comparing their results to those of the experimental databases. The MAE values of the regression equations concerning the databases for DC and SoS of VOs are 0.02 and 1.0 m/s, respectively %K genetic algorithms, genetic programming, Vegetable oils, Regression, Dielectric constant, Speed of sound, Fatty acid %9 journal article %R doi:10.1016/j.jfca.2021.104175 %U https://www.sciencedirect.com/science/article/pii/S0889157521003756 %U http://dx.doi.org/doi:10.1016/j.jfca.2021.104175 %P 104175 %0 Journal Article %T Modeling of vegetable oils cloud point, pour point, cetane number and iodine number from their composition using genetic programming %A Alviso, Dario %A Zarate, Cristhian %A Duriez, Thomas %J Fuel %D 2021 %V 284 %@ 0016-2361 %F ALVISO:2021:Fuel %X Vegetable oils (VOs) are composed of 90-98percent of triglycerides, i.e. esters composed of three fatty acids and glycerol, and small amounts of mono- and di-glycerides. Due to their physico-chemical properties, VOs have been considered for uses especially in large ships, in stationary engines and low and medium speed diesel engines, in pure form or in blends with fuel oil, diesel, biodiesel and alcohols. There are about 350 VOs with potential as fuel sources, and for most of them, physico-chemical properties values have not yet been measured. In this context, regression models using only VOs fatty acid composition are very useful. In the present paper, regression analysis of VOs cloud point (CP), pour point (PP), cetane number (CN) and iodine number (IN) as a function of saturated and unsaturated fatty acids is conducted. The study is done by using 4 experimental databases including 88 different data of VOs. Concerning the regression technique, genetic programming (GP) has been chosen. The cost function of GP is defined to minimize the Mean Absolute Error (MAE) between experimental and predicted values of each property. The resulting GP models consisting of terms including saturated and unsaturated fatty acids reproduce correctly the dependencies of all four properties on those acids. And they are validated by showing that their results are in good agreement to the experimental databases. In fact, MAE values of the proposed models with respect to the databases for CP, PP, CN and IN are lower than 4.51 degreeC, 4.54 degreeC, 3.64 and 8.01, respectively %K genetic algorithms, genetic programming, Vegetable oils, Fatty acid, Cetane number, Iodine number, Cloud point, Pour point %9 journal article %R doi:10.1016/j.fuel.2020.119026 %U https://www.sciencedirect.com/science/article/pii/S0016236120320226 %U http://dx.doi.org/doi:10.1016/j.fuel.2020.119026 %P 119026 %0 Journal Article %T Evolution of Software Reliability Growth Models: A Comparison of Auto-Regression and Genetic Programming Models %A Alweshah, Mohammed %A Ahmed, Walid %A Aldabbas, Hamza %J International Journal of Computer Applications %D 2015 %8 sep %V 125 %N 3 %I Foundation of Computer Science (FCS), NY, USA %C New York, USA %@ 0975-8887 %F Alweshah:2015:IJCA %X Building reliability growth models to predict software reliability and identify and remove errors is both a necessity and a challenge for software testing engineers and project managers. Being able to predict the number of faults in software helps significantly in determining the software release date and in effectively managing project resources. Most of the growth models consider two or three parameters to estimate the accumulated faults in the testing process. Interest in using evolutionary computation to solve prediction and modeling problems has grown in recent years. In this paper, we explore the use of genetic programming (GP) as a tool to help in building growth models that can accurately predict the number of faults in software early on in the testing process. The proposed GP model is based on a recursive relation derived from the history of measured faults. The developed model is tested on real-time control, military, and operating system applications. The dataset was developed by John Musa of Bell Telephone Laboratories. The results of a comparison of the GP model with the traditional and simpler auto-regression model are presented. %K genetic algorithms, genetic programming %9 journal article %R doi:10.5120/ijca2015905864 %U https://www.ijcaonline.org/archives/volume125/number3/22413-2015905864 %U http://dx.doi.org/doi:10.5120/ijca2015905864 %P 20-25 %0 Conference Proceedings %T Applying Cartesian Genetic Programming to Evolve Rules for Intrusion Detection System %A Alyasiri, Hasanen %A Clark, John A. %A Kudenko, Daniel %Y Sabourin, Christophe %Y Guervos, Juan Julian Merelo %Y Linares-Barranco, Alejandro %Y Madani, Kurosh %Y Warwick, Kevin %S Proceedings of the 10th International Joint Conference on Computational Intelligence, IJCCI 2018 %D 2018 %8 sep 18 20 %I SciTePress %C Seville, Spain %F DBLP:conf/ijcci/AlyasiriCK18 %X With cyber-attacks becoming a regular feature in daily business and attackers continuously evolving their techniques, we are witnessing ever more sophisticated and targeted threats. Various artificial intelligence algorithms have been deployed to analyse such incidents. Extracting knowledge allows the discovery of new attack methods, intrusion scenarios, and attackers objectives and strategies, all of which can help distinguish attacks from legitimate behaviour. Among those algorithms, Evolutionary Computation (EC) techniques have seen significant application. Research has shown it is possible to use EC methods to construct IDS detection rules. we show how Cartesian Genetic Programming (CGP) can construct the behaviour rule upon which an intrusion detection will be able to make decisions regarding the nature of the activity observed in the system. The CGP framework evolves human readable solutions that provide an explanation of the logic behind its evolved decisions. Experiments are conducted on up-to-date cybersecurity datasets and compared with state of the art paradigms. We also introduce ensemble learning paradigm, indicating how CGP can be used as stacking technique to improve the learning performance. %K genetic algorithms, genetic programming, Cartesian Genetic Programming, Intrusion Detection System, Stacking Ensemble %R doi:10.5220/0006925901760183 %U https://www.scitepress.org/Papers/2018/69259/69259.pdf %U http://dx.doi.org/doi:10.5220/0006925901760183 %P 176-183 %0 Thesis %T Developing Efficient and Effective Intrusion Detection System using Evolutionary Computation %A Alyasiri, Hasanen %D 2018 %8 nov %C UK %C Computer Science, University of York %F Hasanen_Thesis_2018 %X The internet and computer networks have become an essential tool in distributed computing organisations especially because they enable the collaboration between components of heterogeneous systems. The efficiency and flexibility of online services have attracted many applications, but as they have grown in popularity so have the numbers of attacks on them. Thus, security teams must deal with numerous threats where the threat landscape is continuously evolving. The traditional security solutions are by no means enough to create a secure environment, intrusion detection systems (IDSs), which observe system works and detect intrusions, are usually used to complement other defense techniques. However, threats are becoming more sophisticated, with attackers using new attack methods or modifying existing ones. Furthermore, building an effective and efficient IDS is a challenging research problem due to the environment resource restrictions and its constant evolution. To mitigate these problems, we propose to use machine learning techniques to assist with the IDS building effort. In this thesis, Evolutionary Computation (EC) algorithms are empirically investigated for synthesising intrusion detection programs. EC can construct programs for raising intrusion alerts automatically. One novel proposed approach, i.e. Cartesian Genetic Programming, has proved particularly effective. We also used an ensemble-learning paradigm, in which EC algorithms were used as a meta-learning method to produce detectors. The latter is more fully worked out than the former and has proved a significant success. An efficient IDS should always take into account the resource restrictions of the deployed systems. Memory usage and processing speed are critical requirements. We apply a multi-objective approach to find trade-offs among intrusion detection capability and resource consumption of programs and optimise these objectives simultaneously. High complexity and the large size of detectors are identified as general issues with the current approaches. The multi-objective approach is used to evolve Pareto fronts for detectors that aim to maintain the simplicity of the generated patterns. We also investigate the potential application of these algorithms to detect unknown attacks. %K genetic algorithms, genetic programming, Cartesian Genetic Programming %9 Ph.D. thesis %U http://etheses.whiterose.ac.uk/id/eprint/23699 %0 Conference Proceedings %T Evolving Rules for Detecting Cross-Site Scripting Attacks Using Genetic Programming %A Alyasiri, Hasanen %Y Anbar, Mohammed %Y Abdullah, Nibras %Y Manickam, Selvakumar %S 2nd International Conference on Advances in Cyber Security, ACeS 2020 %S Communications in Computer and Information Science %D 2020 %8 dec 8 9 %V 1347 %I Springer %C Penang, Malaysia %F alyasiri2020evolving %O Revised Selected Papers %X Web services are now a critical element of many of our day-to-day activities. Their applications are one of the fastest-growing industries around. The security issues related to these services are a major concern to their providers and are directly relevant to the everyday lives of system users. Cross-Site Scripting (XSS) is a standout amongst common web application security attacks. Protection against XSS injection attacks needs more work. Machine learning has considerable potential to provide protection in this critical domain. In this article, we show how genetic programming can be used to evolve detection rules for XSS attacks. We conducted our experiments on a publicly available and up-to-date dataset. The experimental results showed that the proposed method is an effective countermeasure against XSS attacks. We then investigated the computational cost of the detection rules. The best-evolved rule has a processing time of 177.87 ms and consumes memory of 8600 bytes. %K genetic algorithms, genetic programming %R doi:10.1007/978-981-33-6835-4_42 %U https://link.springer.com/chapter/10.1007/978-981-33-6835-4_42 %U http://dx.doi.org/doi:10.1007/978-981-33-6835-4_42 %P 642-656 %0 Conference Proceedings %T Grammatical Evolution for Detecting Cyberattacks in Internet of Things Environments %A Alyasiri, Hasanen %A Clark, John A. %A Malik, Ali %A de Frein, Ruairi %S 2021 International Conference on Computer Communications and Networks (ICCCN) %D 2021 %8 19 22 jul %I IEEE %C Athens, Greece %F alyasiri2021grammatical %X The Internet of Things (IoT) is revolutionising nearly every aspect of modern life, playing an ever greater role in both industrial and domestic sectors. The increasing frequency of cyber-incidents is a consequence of the pervasiveness of IoT. Threats are becoming more sophisticated, with attackers using new attacks or modifying existing ones. Security teams must deal with a diverse and complex threat landscape that is constantly evolving. Traditional security solutions cannot protect such systems adequately and so researchers have begun to use Machine Learning algorithms to discover effective defense systems. we investigate how one approach from the domain of evolutionary computation, grammatical evolution, can be used to identify cyberattacks in IoT environments. The experiments were conducted on up-to-date datasets and compared with state-of-the-art algorithms. The potential application of evolutionary computation-based approaches to detect unknown attacks is also examined and discussed. %K genetic algorithms, genetic programming, Grammatical Evolution %R doi:10.1109/ICCCN52240.2021.9522283 %U https://ieeexplore.ieee.org/abstract/document/9522283 %U http://dx.doi.org/doi:10.1109/ICCCN52240.2021.9522283 %0 Journal Article %T Ryan J. Urbanowicz, and Will N. Browne: Introduction to learning classifier systems Springer, 2017, 123 pp, ISBN 978-3-662-55007-6 %A Amandi, Analia %J Genetic Programming and Evolvable Machines %D 2018 %8 dec %V 19 %N 4 %@ 1389-2576 %F Amandi:2018:GPEM %O Book review %K genetic algorithms, LCS %9 journal article %R doi:10.1007/s10710-018-9322-7 %U http://dx.doi.org/doi:10.1007/s10710-018-9322-7 %P 569-570 %0 Journal Article %T Modeling viscosity of CO2 at high temperature and pressure conditions %A Amar, Menad Nait %A Ghriga, Mohammed Abdelfetah %A Ouaer, Hocine %A Seghier, Mohamed El Amine Ben %A Pham, Binh Thai %A Andersen, Pal Ostebo %J Journal of Natural Gas Science and Engineering %D 2020 %8 may %V 77 %I HAL CCSD; Elsevier %@ 1875-5100 %G en %F Amar:2020:jNGSE %X The present work aims at applying Machine Learning approaches to predict CO2 viscosity at different thermodynamical conditions. Various data-driven techniques including multilayer perceptron (MLP), gene expression programming (GEP) and group method of data handling (GMDH) were implemented using 1124 experimental points covering temperature from 220 to 673 K and pressure from 0.1 to 7960 MPa. Viscosity was modelled as function of temperature and density measured at the stated conditions. Four backpropagation-based techniques were considered in the MLP training phase; Levenberg-Marquardt (LM), bayesian regularization (BR), scaled conjugate gradient (SCG) and resilient backpropagation (RB). MLP-LM was the most fit of the proposed models with an overall root mean square error (RMSE) of 0.0012 mPa s and coefficient of determination (R2) of 0.9999. A comparison showed that our MLP-LM model outperformed the best preexisting Machine Learning CO2 viscosity models, and that our GEP correlation was superior to preexisting explicit correlations. %K genetic algorithms, genetic programming, gene expression programming, ANN, carbon dioxide, correlations, data-driven, GEP, MLP, viscosity, chemical sciences/polymers, material chemistry, physical chemistry %9 journal article %R doi:10.1016/j.jngse.2020.103271 %U https://hal.archives-ouvertes.fr/hal-02534736 %U http://dx.doi.org/doi:10.1016/j.jngse.2020.103271 %P 103271 %0 Journal Article %T Machine learning and molecular descriptors enable rational solvent selection in asymmetric catalysis %A Amar, Yehia %A Schweidtmann, Artur M. %A Deutsch, Paul %A Cao, Liwei %A Lapkin, Alexei %J Chemical Science %D 2019 %8 jul %V 10 %N 27 %I Royal Society of Chemistry %F Amar:2019:ChemSci %O Edge Article %X Rational solvent selection remains a significant challenge in process development. Here we describe a hybrid mechanistic-machine learning approach, geared towards automated process development workflow. A library of 459 solvents was used, for which 12 conventional molecular descriptors, two reaction-specific descriptors, and additional descriptors based on screening charge density, were calculated. Gaussian process surrogate models were trained on experimental data from a Rh(CO)2(acac)/Josiphos catalysed asymmetric hydrogenation of a chiral alpha-beta unsaturated gamma-lactam. With two simultaneous objectives: high conversion and high diastereomeric excess, the multi-objective algorithm, trained on the initial dataset of 25 solvents, has identified solvents leading to better reaction outcomes. In addition to being a powerful design of experiments (DoE) methodology, the resulting Gaussian process surrogate model for conversion is, in statistical terms, predictive, with a cross-validation correlation coefficient of 0.84. After identifying promising solvents, the composition of solvent mixtures and optimal reaction temperature were found using a black-box Bayesian optimisation. We then demonstrated the application of a new genetic programming approach to select an appropriate machine learning model for a specific physical system, which should allow the transition of the overall process development workflow into the future robotic laboratories. %K genetic algorithms, genetic programming, TPOT, gamultiobj, matlab, GP surrogate models, in silico modeling %9 journal article %R doi:10.1039/C9SC01844A %U https://pubs.rsc.org/en/content/articlepdf/2019/sc/c9sc01844a %U http://dx.doi.org/doi:10.1039/C9SC01844A %P 6697-6706 %0 Thesis %T Accelerating process development of complex chemical reactions %A Amar, Yehia %D 2019 %C UK %C Department of Chemical Engineering and Biotechnology, University of Cambridge %F Amar:thesis %X Process development of new complex reactions in the pharmaceutical and fine chemicals industries is challenging, and expensive. The field is beginning to see a bridging between fundamental first-principles investigations, and use of data-driven statistical methods, such as machine learning. Nonetheless, process development and optimisation in these industries is mostly driven by trial-and-error, and experience. Approaches that move beyond these are limited to the well-developed optimisation of continuous variables, and often do not yield physical insights. This thesis describes several new methods developed to address research questions related to this challenge. First, we investigated whether using physical knowledge could aid statistics-guided self-optimisation of a C-H activation reaction, in which the optimisation variables were continuous. We then considered algorithmic treatment of the more challenging discrete variables, focusing on solvents. We parametrised a library of 459 solvents with physically meaningful molecular descriptors. Our case study was a homogeneous Rh-catalysed asymmetric hydrogenation to produce a chiral gamma-lactam, with conversion and diastereoselectivity as objectives. We adapted a state-of-the-art multi-objective machine learning algorithm, based on Gaussian processes, to use the descriptors as inputs, and to create a surrogate model for each objective. The aim of the algorithm was to determine a set of Pareto solutions with a minimum experimental budget, whilst simultaneously addressing model uncertainty. We found that descriptors are a valuable tool for Design of Experiments, and can produce predictive and interpretable surrogate models. Subsequently, a physical investigation of this reaction led to the discovery of an efficient catalyst-ligand system, which we studied by operando NMR, and identified a parameterised kinetic model. Turning the focus then to ligands for asymmetric hydrogenation, we calculated versatile empirical descriptors based on the similarity of atomic environments, for 102 chiral ligands, to predict diastereoselectivity. Whilst the model fit was good, it failed to accurately predict the performance of an unseen ligand family, due to analogue bias. Physical knowledge has then guided the selection of symmetrised physico-chemical descriptors. This produced more accurate predictive models for diastereoselectivity, including for an unseen ligand family. The contribution of this thesis is a development of novel and effective workflows and methodologies for process development. These open the door for process chemists to save time and resources, freeing them up from routine work, to focus instead on creatively designing new chemistry for future real-world applications. %K molecular descriptors, design of experiments, asymmetric hydrogenation, machine learning, process development %9 Ph.D. thesis %R doi:10.17863/CAM.35535 %U https://www.repository.cam.ac.uk/handle/1810/288220 %U http://dx.doi.org/doi:10.17863/CAM.35535 %0 Conference Proceedings %T Benchmarking Genetic Programming in a Multi-action Reinforcement Learning Locomotion Task %A Amaral, Ryan %A Ianta, Alexandru %A Bayer, Caleidgh %A Smith, Robert %A Heywood, Malcolm %Y Trautmann, Heike %Y Doerr, Carola %Y Moraglio, Alberto %Y Bartz-Beielstein, Thomas %Y Filipic, Bogdan %Y Gallagher, Marcus %Y Ong, Yew-Soon %Y Gupta, Abhishek %Y Kononova, Anna V. %Y Wang, Hao %Y Emmerich, Michael %Y Bosman, Peter A. N. %Y Zaharie, Daniela %Y Caraffini, Fabio %Y Dreo, Johann %Y Auger, Anne %Y Dietric, Konstantin %Y Dufosse, Paul %Y Glasmachers, Tobias %Y Hansen, Nikolaus %Y Mersmann, Olaf %Y Posik, Petr %Y Tusar, Tea %Y Brockhoff, Dimo %Y Eftimov, Tome %Y Kerschke, Pascal %Y Naujoks, Boris %Y Preuss, Mike %Y Volz, Vanessa %Y Derbel, Bilel %Y Li, Ke %Y Li, Xiaodong %Y Zapotecas, Saul %Y Zhang, Qingfu %Y Coletti, Mark %Y Schuman, Catherine (Katie) %Y Scott, Eric “Siggy” %Y Patton, Robert %Y Wiegand, Paul %Y Bassett, Jeffrey K. %Y Gunaratne, Chathika %Y Chugh, Tinkle %Y Allmendinger, Richard %Y Hakanen, Jussi %Y Tauritz, Daniel %Y Woodward, John %Y Lopez-Ibanez, Manuel %Y McCall, John %Y Bacardit, Jaume %Y Brownlee, Alexander %Y Cagnoni, Stefano %Y Iacca, Giovanni %Y Walker, David %Y Toutouh, Jamal %Y O’Reilly, UnaMay %Y Machado, Penousal %Y Correia, Joao %Y Nesmachnow, Sergio %Y Ceberio, Josu %Y Villanueva, Rafael %Y Hidalgo, Ignacio %Y Fernandez de Vega, Francisco %Y Paolo, Giuseppe %Y Coninx, Alex %Y Cully, Antoine %Y Gaier, Adam %Y Wagner, Stefan %Y Affenzeller, Michael %Y Bruce, Bobby R. %Y Nowack, Vesna %Y Blot, Aymeric %Y Winter, Emily %Y Langdon, William B. %Y Petke, Justyna %Y Fernandez Alzueta, Silvino %Y Valledor Pellicer, Pablo %Y Stuetzle, Thomas %Y Paetzel, David %Y Wagner, Alexander %Y Heider, Michael %Y Veerapen, Nadarajen %Y Malan, Katherine %Y Liefooghe, Arnaud %Y Verel, Sebastien %Y Ochoa, Gabriela %Y Omidvar, Mohammad Nabi %Y Sun, Yuan %Y Tarantino, Ernesto %Y Ivanoe, De Falco %Y Della Cioppa, Antonio %Y Umberto, Scafuri %Y Rieffel, John %Y Mouret, Jean-Baptiste %Y Doncieux, Stephane %Y Nikolaidis, Stefanos %Y Togelius, Julian %Y Fontaine, Matthew C. %Y Georgescu, Serban %Y Chicano, Francisco %Y Whitley, Darrell %Y Kyriienko, Oleksandr %Y Dahl, Denny %Y Shir, Ofer %Y Spector, Lee %Y Rahat, Alma %Y Everson, Richard %Y Fieldsend, Jonathan %Y Wang, Handing %Y Jin, Yaochu %Y Hemberg, Erik %Y Elsayed, Marwa A. %Y Kommenda, Michael %Y La Cava, William %Y Kronberger, Gabriel %Y Gustafson, Steven %S Proceedings of the 2022 Genetic and Evolutionary Computation Conference Companion %S GECCO ’22 %D 2022 %8 September 13 jul %I Association for Computing Machinery %C Boston, USA %F amaral:2022:GECCOcomp %X Reinforcement learning (RL) requires an agent to interact with an environment to maximize the cumulative rather than the immediate reward. Recently, there as been a significant growth in the availability of scalable RL tasks, e.g. OpenAI gym. However, most benchmarking studies concentrate on RL solutions based on some form of deep learning. In this work, we benchmark a family of linear genetic programming based approaches to the 2-d biped walker problem. The biped walker is an example of a RL environment described in terms of a multi-dimensional, real-valued 24-d input and 4-d action space. Specific recommendations are made regarding mechanisms to adopt that are able to consistently produce solutions, in this case using transfer from periodic restarts. %K genetic algorithms, genetic programming, real-valued actions, continuous control, reinforcement learning %R doi:10.1145/3520304.3528766 %U http://dx.doi.org/doi:10.1145/3520304.3528766 %P 522-525 %0 Conference Proceedings %T An Evolutionary Approach to Complex System Regulation Using Grammatical Evolution %A Amarteifio, Saoirse %A O’Neill, Michael %Y Pollack, Jordan %Y Bedau, Mark %Y Husbands, Phil %Y Ikegami, Takashi %Y Watson, Richard A. %S Artificial Life XI Ninth International Conference on the Simulation and Synthesis of Living Systems %D 2004 %8 December 15 sep %I The MIT Press %C Boston, Massachusetts %@ 0-262-66183-7 %F amarteifio:2004:AL %X Motivated by difficulties in engineering adaptive distributed systems, we consider a method to evolve cooperation in swarms to model dynamical systems. We consider an information processing swarm model that we find to be useful in studying control methods for adaptive distributed systems and attempt to evolve systems that form consistent patterns through the interaction of constituent agents or particles. This model considers artificial ants as walking sensors in an information-rich environment. Grammatical Evolution is combined with this swarming model as we evolve an ant’s response to information. The fitness of the swarm depends on information processing by individual ants, which should lead to appropriate macroscopic spatial and/or temporal patterns. We discuss three primary issues, which are tractability, representation and fitness evaluation of dynamical systems and show how Grammatical Evolution supports a promising approach to addressing these concerns %K genetic algorithms, genetic programming, grammatical evolution %R doi:10.7551/mitpress/1429.003.0093 %U http://ncra.ucd.ie/papers/alife2004.pdf %U http://dx.doi.org/doi:10.7551/mitpress/1429.003.0093 %P 551-556 %0 Conference Proceedings %T Coevolving Antibodies with a Rich Representation of Grammatical Evolution %A Amarteifio, Saoirse %A O’Neill, Michael %Y Corne, David %Y Michalewicz, Zbigniew %Y Dorigo, Marco %Y Eiben, Gusz %Y Fogel, David %Y Fonseca, Carlos %Y Greenwood, Garrison %Y Chen, Tan Kay %Y Raidl, Guenther %Y Zalzala, Ali %Y Lucas, Simon %Y Paechter, Ben %Y Willies, Jennifier %Y Guervos, Juan J. Merelo %Y Eberbach, Eugene %Y McKay, Bob %Y Channon, Alastair %Y Tiwari, Ashutosh %Y Volkert, L. Gwenn %Y Ashlock, Dan %Y Schoenauer, Marc %S Proceedings of the 2005 IEEE Congress on Evolutionary Computation %D 2005 %8 February 5 sep %V 1 %I IEEE Press %C Edinburgh, UK %@ 0-7803-9363-5 %F amarteifio:2005:CEC %X A number of natural anticipatory systems employ dual processes of feature definition and feature exploitation. Presented here, a coevolutionary dual process model based on the immune system, considers the effect of coevolving complementary templates to bias feature selection and recombination. This work considers the issue of module exploitation in evolutionary algorithms. Our approach is characterised by the use of rich representations in grammatical evolution. %K genetic algorithms, genetic programming, grammatical evolution, genotype-phenotype mapping %R doi:10.1109/CEC.2005.1554779 %U http://dx.doi.org/doi:10.1109/CEC.2005.1554779 %P 904-911 %0 Thesis %T Interpreting a Genotype-Phenotype Map with Rich Representations in XMLGE %A Amarteifio, Saoirse %D 2005 %C University of Limerick, Ireland %C University of Limerick %G en %F amarteifio:2005:IAGPMWRRIX %X A novel XML implementation of Grammatical Evolution is developed. This has a number of interesting features such as the use of XSLT for genetic operators and the use of reflection to build an object tree from an XML expression tree. This framework is designed to be used for remote or local evaluation of evolved program structures and provides a number of abstraction layers for program evaluation and evolution. A dynamical swarm system is evolved as a special-case function induction problem to illustrate the application of XMLGE. Particle behaviours are evolved to optimise colony performance. A dual process evolutionary algorithm based on the immune system using rich representations is developed. A dual process feature detection and feature integration model is described and the performance shown on benchmark GP problems. An adaptive feature detection method uses coevolving XPath antibodies to take selective interest in primary structures. Grammars are used to generate reciprocal binding structures (antibodies) given any primary domain grammar. A codon compression algorithm is developed which shows performance improvements on symbolic regression and multiplexer problems. The algorithm is based on questions about the information content of a genome. This also exploits information from the rich representation of XMLGE. %K genetic algorithms, genetic programming, grammatical evolution, xml %9 Master of Science in Computer Science %9 Masters thesis %U http://ncra.ucd.ie/downloads/pub/SaoirseMScThesis.pdf %0 Journal Article %T Electricity consumption forecasting models for administration buildings of the UK higher education sector %A Amber, K. P. %A Aslam, M. W. %A Hussain, S. K. %J Energy and Buildings %D 2015 %V 90 %@ 0378-7788 %F Amber:2015:EB %X Electricity consumption in the administration buildings of a typical higher education campus in the UK accounts for 26percent of the campus annual electricity consumption. A reliable forecast of electricity consumption helps energy managers in numerous ways such as in preparing future energy budgets and setting up energy consumption targets. In this paper, we developed two models, a multiple regression (MR) model and a genetic programming (GP) model to forecast daily electricity consumption of an administration building located at the Southwark campus of London South Bank University in London. Both models integrate five important independent variables, i.e. ambient temperature, solar radiation, relative humidity, wind speed and weekday index. Daily values of these variables were collected from year 2007 to year 2013. The data sets from year 2007 to 2012 are used for training the models while 2013 data set is used for testing the models. The predicted test results for both the models are analysed and compared with actual electricity consumption. At the end, some conclusions are drawn about the performance of both models regarding their forecasting capabilities. The results demonstrate that the GP model performs better with a Total Absolute Error (TAE) of 6percent compared to TAE of 7percent for MR model. %K genetic algorithms, genetic programming, Electricity forecasting, Administration buildings, Multiple regression %9 journal article %R doi:10.1016/j.enbuild.2015.01.008 %U http://www.sciencedirect.com/science/article/pii/S0378778815000110 %U http://dx.doi.org/doi:10.1016/j.enbuild.2015.01.008 %P 127-136 %0 Journal Article %T Intelligent techniques for forecasting electricity consumption of buildings %A Amber, K. P. %A Ahmad, R. %A Aslam, M. W. %A Kousar, A. %A Usman, M. %A Khan, M. S. %J Energy %D 2018 %V 157 %@ 0360-5442 %F AMBER:2018:Energy %X The increasing trend in building sector’s energy demand calls for reliable and robust energy consumption forecasting models. This study aims to compare prediction capabilities of five different intelligent system techniques by forecasting electricity consumption of an administration building located in London, United Kingdom. These five techniques are; Multiple Regression (MR), Genetic Programming (GP), Artificial Neural Network (ANN), Deep Neural Network (DNN) and Support Vector Machine (SVM). The prediction models are developed based on five years of observed data of five different parameters such as solar radiation, temperature, wind speed, humidity and weekday index. Weekday index is an important parameter introduced to differentiate between working and non-working days. First four years data is used for training the models and to obtain prediction data for fifth year. Finally, the predicted electricity consumption of all models is compared with actual consumption of fifth year. Results demonstrate that ANN performs better than all other four techniques with a Mean Absolute Percentage Error (MAPE) of 6percent whereas MR, GP, SVM and DNN have MAPE of 8.5percent, 8.7percent, 9percent and 11percent, respectively. The applicability of this study could span to other building categories and will help energy management teams to forecast energy consumption of various buildings %K genetic algorithms, genetic programming, Electricity forecasting, ANN, DNN, GP, MR, SVM %9 journal article %R doi:10.1016/j.energy.2018.05.155 %U http://www.sciencedirect.com/science/article/pii/S036054421830999X %U http://dx.doi.org/doi:10.1016/j.energy.2018.05.155 %P 886-893 %0 Conference Proceedings %T GPStar4: A Flexible Framework for Experimenting with Genetic Programming %A Amblard, Julien %A Filman, Robert %A Kopito, Gabriel %Y Kalkreuth, Roman %Y Baeck, Thomas %Y Wilson, Dennis G. %Y Kaufmann, Paul %Y Sotto, Leo Francoso Dal Piccol %Y Aktinson, Timothy %S Graph-based Genetic Programming %S GECCO ’23 %D 2023 %8 15 19 jul %I Association for Computing Machinery %C Lisbon, Portugal %F amblard:2023:GGP %X GPStar4 is a flexible workbench for experimenting with population algorithms. It is a framework that defines a genetic cycle, with inflection points for implementing an algorithm’s specific behaviors; it also provides a variety of implementations for these inflection points. A user of the system can select from the provided implementations and customize the places where alternative behavior is desired, or even create their own implementations. Components interact through a context mechanism that enables both mutable and immutable information sharing, type checking, computed defaults and event listeners.Interesting predefined components included in GPStar4 are implementations for classical tree-based expression structures; acyclic multigraphs with named ports, type systems for flat, hierarchical and attribute types, recursively defined populations using both subpopulation and build-from-parts semantics, and numeric and multi-objective fitnesses. Key enabling technologies for this flexibility include context mechanisms, choosers, and a variety of caches.GPStar4 can be run as an API library for other applications, as a command-line application, or as a stand-alone application with its own GUI. %K genetic algorithms, genetic programming, experimental framework, directed acyclic graph representations, population algorithms %R doi:10.1145/3583133.3596369 %U http://dx.doi.org/doi:10.1145/3583133.3596369 %P 1910-1915 %0 Journal Article %T Investigation of shear strength correlations and reliability assessments of sandwich structures by kriging method %A Ameryan, Ala %A Ghalehnovi, Mansour %A Rashki, Mohsen %J Composite Structures %D 2020 %V 253 %@ 0263-8223 %F AMERYAN:2020:CS %X Steel-concrete-steel (SCS) sandwich composite structure with corrugated-strip connectors (CSC) is the promising structure which is applied in offshore and building structures. The behavior prediction of shear connections is of major importance in SCS structures. The present study evaluated the existing shear strength correlations of SCS sandwich structures exploiting experimental data and Finite Element Analysis (FEA). The considered system is a double steel skin sandwich structure with CSC (DSCS). Due to the limitation of the literature regarding CSC development, some new correlations were proposed and studied relying on several FEA results through the Genetic Programming method. The accuracy of the estimated shear strength predicted by the existing and proposed equations was evaluated using the FEA data and push-out test results. The FE models were verified through experimental data. Moreover, the correlations were investigated based on reliability assessment due to the high importance of the reliability analysis of such structures. Given that high accuracy in estimating the shear strength fails to necessarily lead to acceptable results in structural reliability analysis, the reliability of the existing and proposed equations was evaluated using the Kriging model by considering experimental data. This meta-model could predict accurate values with a limited number of initial training samples %K genetic algorithms, genetic programming, Structural reliability, Kriging, Sandwich structures, Finite element, Experimental data, Failure probability %9 journal article %R doi:10.1016/j.compstruct.2020.112782 %U http://www.sciencedirect.com/science/article/pii/S0263822320327082 %U http://dx.doi.org/doi:10.1016/j.compstruct.2020.112782 %P 112782 %0 Journal Article %T Investigating the Bond Strength of FRP Rebars in Concrete under High Temperature Using Gene-Expression Programming Model %A Amin, Muhammad Nasir %A Iqbal, Mudassir %A Althoey, Fadi %A Khan, Kaffayatullah %A Faraz, Muhammad Iftikhar %A Qadir, Muhammad Ghulam %A Alabdullah, Anas Abdulalim %A Ajwad, Ali %J Polymers %D 2022 %V 14 %N 15 %@ 2073-4360 %F amin:2022:Polymers %X In recent times, the use of fibre-reinforced plastic (FRP) has increased in reinforcing concrete structures. The bond strength of FRP rebars is one of the most significant parameters for characterising the overall efficacy of the concrete structures reinforced with FRP. However, in cases of elevated temperature, the bond of FRP-reinforced concrete can deteriorate depending on a number of factors, including the type of FRP bars used, its diameter, surface form, anchorage length, concrete strength, and cover thickness. Hence, accurate quantification of FRP rebars in concrete is of paramount importance, especially at high temperatures. In this study, an artificial intelligence (AI)-based genetic-expression programming (GEP) method was used to predict the bond strength of FRP rebars in concrete at high temperatures. In order to predict the bond strength, we used failure mode temperature, fibre type, bar surface, bar diameter, anchorage length, compressive strength, and cover-to-diameter ratio as input parameters. The experimental dataset of 146 tests at various elevated temperatures were established for training and validating the model. A total of 70percent of the data was used for training the model and remaining 30percent was used for validation. Various statistical indices such as correlation coefficient (R), the mean absolute error (MAE), and the root-mean-square error (RMSE) were used to assess the predictive veracity of the GEP model. After the trials, the optimum hyperparameters were 150, 8, and 4 as number of chromosomes, head size and number of genes, respectively. Different genetic factors, such as the number of chromosomes, the size of the head, and the number of genes, were evaluated in eleven separate trials. The results as obtained from the rigorous statistical analysis and parametric study show that the developed GEP model is robust and can predict the bond strength of FRP rebars in concrete under high temperature with reasonable accuracy (i.e., R, RMSE and MAE 0.941, 2.087, and 1.620, and 0.935, 2.370, and 2.046, respectively, for training and validation). More importantly, based on the FRP properties, the model has been translated into traceable mathematical formulation for easy calculations. %K genetic algorithms, genetic programming, gene expression programming %9 journal article %R doi:10.3390/polym14152992 %U https://www.mdpi.com/2073-4360/14/15/2992 %U http://dx.doi.org/doi:10.3390/polym14152992 %P ArticleNo.2992 %0 Journal Article %T Prediction of Rapid Chloride Penetration Resistance to Assess the Influence of Affecting Variables on Metakaolin-Based Concrete Using Gene Expression Programming %A Amin, Muhammad Nasir %A Raheel, Muhammad %A Iqbal, Mudassir %A Khan, Kaffayatullah %A Qadir, Muhammad Ghulam %A Jalal, Fazal E. %A Alabdullah, Anas Abdulalim %A Ajwad, Ali %A Al-Faiad, Majdi Adel %A Abu-Arab, Abdullah Mohammad %J Materials %D 2022 %V 15 %N 19 %@ 1996-1944 %F amin:2022:Materials %X The useful life of a concrete structure is highly dependent upon its durability, which enables it to withstand the harsh environmental conditions. Resistance of a concrete specimen to rapid chloride ion penetration (RCP) is one of the tests to indirectly measure its durability. The central aim of this study was to investigate the influence of different variables, such as, age, amount of binder, fine aggregate, coarse aggregate, water to binder ratio, metakaolin content and the compressive strength of concrete on the RCP resistance using a genetic programming approach. The number of chromosomes (Nc), genes (Ng) and, the head size (Hs) of the gene expression programming (GEP) model were varied to study their influence on the predicted RCP values. The performance of all the GEP models was assessed using a variety of performance indices, i.e., R2, RMSE and comparison of regression slopes. The optimal GEP model (Model T3) was obtained when the Nc = 100, Hs = 8 and Ng = 3. This model exhibits an R2 of 0.89 and 0.92 in the training and testing phases, respectively. The regression slope analysis revealed that the predicted values are in good agreement with the experimental values, as evident from their higher R2 values. Similarly, parametric analysis was also conducted for the best performing Model T3. The analysis showed that the amount of binder, compressive strength and age of the sample enhanced the RCP resistance of the concrete specimens. Among the different input variables, the RCP resistance sharply increased during initial stages of curing (28-d), thus validating the model results. %K genetic algorithms, genetic programming, gene expression programming %9 journal article %R doi:10.3390/ma15196959 %U https://www.mdpi.com/1996-1944/15/19/6959 %U http://dx.doi.org/doi:10.3390/ma15196959 %P ArticleNo.6959 %0 Journal Article %T Rule-centred genetic programming (RCGP): an imperialist competitive approach %A Amini, Seyed Mohammad Hossein Hosseini %A Abdollahi, Mohammad %A Haeri, Maryam Amir %J Appl. Intell. %D 2020 %V 50 %N 8 %F DBLP:journals/apin/AminiAH20 %K genetic algorithms, genetic programming %9 journal article %R doi:10.1007/s10489-019-01601-6 %U https://doi.org/10.1007/s10489-019-01601-6 %U http://dx.doi.org/doi:10.1007/s10489-019-01601-6 %P 2589-2609 %0 Journal Article %T A robust predictive model for base shear of steel frame structures using a hybrid genetic programming and simulated annealing method %A Aminian, Pejman %A Javid, Mohamad Reza %A Asghari, Abazar %A Gandomi, Amir Hossein %A Arab Esmaeili, Milad %J Neural Computing and Applications %D 2011 %V 20 %N 8 %I Springer %@ 0941-0643 %F journals/nca/AminianJAGE11 %X This study presents a new empirical model to estimate the base shear of plane steel structures subjected to earthquake load using a hybrid method integrating genetic programming (GP) and simulated annealing (SA), called GP/SA. The base shear of steel frames was formulated in terms of the number of bays, number of storey, soil type, and situation of braced or unbraced. A classical GP model was developed to benchmark the GP/SA model. The comprehensive database used for the development of the correlations was obtained from finite element analysis. A parametric analysis was carried out to evaluate the sensitivity of the base shear to the variation of the influencing parameters. The GP/SA and classical GP correlations provide a better prediction performance than the widely used UBC code and a neural network-based model found in the literature. The developed correlations may be used as quick checks on solutions developed by deterministic analyses. %K genetic algorithms, genetic programming, base shear, steel frame structures, simulated annealing, nonlinear modelling %9 journal article %R doi:10.1007/s00521-011-0689-0 %U http://dx.doi.org/doi:10.1007/s00521-011-0689-0 %P 1321-1332 %0 Journal Article %T New design equations for assessment of load carrying capacity of castellated steel beams: a machine learning approach %A Aminian, Pejman %A Niroomand, Hadi %A Gandomi, Amir Hossein %A Alavi, Amir Hossein %A Arab Esmaeili, Milad %J Neural Computing and Applications %D 2013 %8 jul %V 23 %N 1 %I Springer %@ 0941-0643 %G English %F Aminian:2013:NCA %X This paper presents an innovative machine learning approach for the formulation of load carrying capacity of castellated steel beams (CSB). New design equations were developed to predict the load carrying capacity of CSB using linear genetic programming (LGP), and an integrated search algorithm of genetic programming and simulated annealing, called GSA. The load capacity was formulated in terms of the geometrical and mechanical properties of the castellated beams. An extensive trial study was carried out to select the most relevant input variables for the LGP and GSA models. A comprehensive database was gathered from the literature to develop the models. The generalisation capabilities of the models were verified via several criteria. The sensitivity of the failure load of CSB to the influencing variables was examined and discussed. The employed machine learning systems were found to be effective methods for evaluating the failure load of CSB. The prediction performance of the optimal LGP model was found to be better than that of the GSA model. %K genetic algorithms, genetic programming, Linear genetic programming, Castellated beam, Load carrying capacity, Simulated annealing, Formulation %9 journal article %R doi:10.1007/s00521-012-1138-4 %U http://link.springer.com/article/10.1007%2Fs00521-012-1138-4 %U http://dx.doi.org/doi:10.1007/s00521-012-1138-4 %P 119-131 %0 Book Section %T Statistical Genetic Programming: The Role of Diversity %A Amir Haeri, Maryam %A Ebadzadeh, Mohammad Mehdi %A Folino, Gianluigi %E Snasel, Vaclav %E Kroemer, Pavel %E Koeppen, Mario %E Schaefer, Gerald %B Soft Computing in Industrial Applications %S Advances in Intelligent Systems and Computing %D 2014 %8 21 nov %V 223 %I Springer %G English %F AmirHaeri:wsc17 %O Proceedings of the 17th Online World Conference on Soft Computing in Industrial Applications %X In this chapter, a new GP-based algorithm is proposed. The algorithm, named SGP (Statistical GP), exploits statistical information, i.e. mean, variance and correlation-based operators, in order to improve the GP performance. SGP incorporates new genetic operators, i.e. Correlation Based Mutation, Correlation Based Crossover, and Variance Based Editing, to drive the search process towards fitter and shorter solutions. Furthermore, this work investigates the correlation between diversity and fitness in SGP, both in terms of phenotypic and genotypic diversity. First experiments conducted on four symbolic regression problems illustrate the goodness of the approach and permits to verify the different behaviour of SGP in comparison with standard GP from the point of view of the diversity and its correlation with the fitness. %K genetic algorithms, genetic programming %R doi:10.1007/978-3-319-00930-8_4 %U http://dx.doi.org/10.1007/978-3-319-00930-8_4 %U http://dx.doi.org/doi:10.1007/978-3-319-00930-8_4 %P 37-48 %0 Journal Article %T Improving GP generalization: a variance-based layered learning approach %A Amir Haeri, Maryam %A Ebadzadeh, Mohammad Mehdi %A Folino, Gianluigi %J Genetic Programming and Evolvable Machines %D 2015 %8 mar %V 16 %N 1 %@ 1389-2576 %F AmirHaeri:2014:GPEM %X This paper introduces a new method that improves the generalisation ability of genetic programming (GP) for symbolic regression problems, named variance-based layered learning GP. In this approach, several datasets, called primitive training sets, are derived from the original training data. They are generated from less complex to more complex, for a suitable complexity measure. The last primitive dataset is still less complex than the original training set. The approach decomposes the evolution process into several hierarchical layers. The first layer of the evolution starts using the least complex (smoothest) primitive training set. In the next layers, more complex primitive sets are given to the GP engine. Finally, the original training data is given to the algorithm. We use the variance of the output values of a function as a measure of the functional complexity. This measure is used in order to generate smoother training data, and controlling the functional complexity of the solutions to reduce the overfitting. The experiments, conducted on four real-world and three artificial symbolic regression problems, demonstrate that the approach enhances the generalization ability of the GP, and reduces the complexity of the obtained solutions. %K genetic algorithms, genetic programming, VBLL-GP, Generalisation, Layered learning, Over fitting, Variance %9 journal article %R doi:10.1007/s10710-014-9220-6 %U http://dx.doi.org/doi:10.1007/s10710-014-9220-6 %P 27-55 %0 Journal Article %T Statistical genetic programming for symbolic regression %A Haeri, Maryam Amir %A Ebadzadeh, Mohammad Mehdi %A Folino, Gianluigi %J Applied Soft Computing %D 2017 %8 nov %V 60 %F journals/asc/HaeriEF17 %X In this paper, a new genetic programming (GP) algorithm for symbolic regression problems is proposed. The algorithm, named statistical genetic programming (SGP), uses statistical information (such as variance, mean and correlation coefficient) to improve GP. To this end, we define well-structured trees as a tree with the following property: nodes which are closer to the root have a higher correlation with the target. It is shown experimentally that on average, the trees with structures closer to well-structured trees are smaller than other trees. SGP biases the search process to find solutions whose structures are closer to a well-structured tree. For this purpose, it extends the terminal set by some small well-structured subtrees, and starts the search process in a search space that is limited to semi-well-structured trees (i.e., trees with at least one well-structured subtree). Moreover, SGP incorporates new genetic operators, i.e., correlation-based mutation and correlation-based crossover, which use the correlation between outputs of each subtree and the targets, to improve the functionality. Furthermore, we suggest a variance-based editing operator which reduces the size of the trees. SGP uses the new operators to explore the search space in a way that it obtains more accurate and smaller solutions in less time. SGP is tested on several symbolic regression benchmarks. The results show that it increases the evolution rate, the accuracy of the solutions, and the generalization ability, and decreases the rate of code growth. %K genetic algorithms, genetic programming, Symbolic regression, Well-structured subtree, Semi-well-structured tree, Well-structuredness measure, Correlation coefficient %9 journal article %R doi:10.1016/j.asoc.2017.06.050 %U http://dx.doi.org/doi:10.1016/j.asoc.2017.06.050 %P 447-469 %0 Journal Article %T Ground motion prediction equations (GMPEs) for elastic response spectra in the Iranian plateau using Gene Expression Programming (GEP) %A Amiri, Gholamreza Ghodrati %A Amiri, Mohamad Shamekhi %A Tabrizian, Zahra %J Journal of Intelligent and Fuzzy Systems %D 2014 %V 26 %N 6 %F journals/jifs/AmiriAT14 %X This paper proposes ground-motion prediction equations (GMPEs) for the horizontal component of earthquake in Iranian plateau. These equations present the velocity and acceleration response spectra at 5percent damping ratio as continuous period functions, within range of 0.1 to 4 seconds. So far many equations have been presented and the recent suggested proportions are functions of several parameters. In this research, due to easy usage and lack of information in Iran, only the magnitude of earthquake, the distance between earthquake source and the location and the ground type are used as important factors. Iranian plateau is divided into two zones: Alborz-Central Iran and Zagros, each of which is divided into rock and soil region according to the ground type. Regarding the fact that the occurred and reported earthquakes in Iran are shallow, surface wave magnitude (Ms) is used in this study. Moreover, hypocentral distance is considered as distance between the earthquake source and the location. To obtain the velocity and acceleration response spectra, a Gene Expression Programming (GEP) algorithm is used which uses no constant regression model and the model is acquired smartly as a continuous period function. The consequences show a consistency with high proportionality coefficient among the observed and anticipated results %K genetic algorithms, genetic programming, gene expression programming %9 journal article %R doi:10.3233/IFS-130950 %U http://dx.doi.org/10.3233/IFS-130950 %U http://dx.doi.org/doi:10.3233/IFS-130950 %P 2825-2839 %0 Journal Article %T Modeling intermolecular potential of He-F2 dimer from symmetry-adapted perturbation theory using multi-gene genetic programming %A Amiri, Mohammad %A Eftekhari, Mahdi %A Dehestani, Maryam %A Tajaddini, Azita %J Scientia Iranica %D 2013 %V 20 %N 3 %@ 1026-3098 %F Amiri:2013:SI %X Any molecular dynamical calculation requires a precise knowledge of interaction potential as an input. In an appropriate form, such that the potential, with respect to the coordinates, can be evaluated easily and accurately at arbitrary geometries (in our study parameters for geometry are R and theta), a good potential energy expression can offer the exact intermolecular behaviour of systems. There are many methods to create mathematical expressions for the potential energy. In this study for the first time, we used the Multi-gene Genetic Programming (MGGP) method to generate a potential energy model for the He-F2 system. The MGGP method is one of the most powerful methods used for non-linear regression problems. A dataset of size 714 created by the SAPT 2008 program is used to generate models of MGGP. The results obtained show the power of MGGP for producing an efficient nonlinear regression model, in terms of accuracy and complexity. %K genetic algorithms, genetic programming, Potential energy, SAPT, MGGP, Lennard-Jones potential, GPTIPS, Matlab %9 journal article %R doi:10.1016/j.scient.2012.12.040 %U https://core.ac.uk/download/pdf/81997689.pdf %U http://dx.doi.org/doi:10.1016/j.scient.2012.12.040 %P 543-548 %0 Journal Article %T Evaluating the synergic effect of waste rubber powder and recycled concrete aggregate on mechanical properties and durability of concrete %A Amiri, Mostafa %A Hatami, Farzad %A Golafshani, Emadaldin Mohammadi %J Case Studies in Construction Materials %D 2021 %V 15 %@ 2214-5095 %F AMIRI:2021:CSCM %X The use of waste materials in the concrete mixture can help human beings to preserve the environment and achieve environmentally-friendly concrete. In this study, the influences of simultaneous replacements of cement by waste rubber powder (WRP) and coarse aggregate by recycled concrete aggregate (RCA) on the mechanical properties and durability of concrete were investigated experimentally. To do so, concrete specimens containing the WRP with the replacement ratios of percent, 2.5 percent, and 5 percent by weight of cement, and the RCA with the replacement levels of percent, 25 percent, and 50 percent of coarse aggregate were prepared. Moreover, different water to binder ratios and binder content were used. Mechanical properties of the concrete specimens consisting of compressive, flexural, and tensile strengths and the durability test of rapid chloride migration test (RCMT) were carried out at different ages. It was observed that the mechanical properties of concrete decrease by raising the proportions of recycled materials in all replacement ratios. Because of the negative effects of the WRP and RCA on, respectively, the cement matrix and the interfacial transition zone, the reduction of the mechanical properties are higher for the concrete specimens with both recycled materials. In the case of durability, the migration rate of chloride ions in concrete reduces by increasing the WRP rates due to the blockage of micro-pores connections. However, adding the RCA has a negative effect on the durability performance of concrete. Finally, four equations were proposed and evaluated for the compressive, tensile, flexural strength reduction and durability factors of concrete containing the WRP and RCA using the genetic programming %K genetic algorithms, genetic programming, Waste rubber powder, Recycled concrete aggregate, Green concrete, Mechanical properties, Durability %9 journal article %R doi:10.1016/j.cscm.2021.e00639 %U https://www.sciencedirect.com/science/article/pii/S2214509521001546 %U http://dx.doi.org/doi:10.1016/j.cscm.2021.e00639 %P e00639 %0 Journal Article %T Genetic programming application in predicting fluid loss severity %A Amish, Mohamed %A Etta-Agbor, Eta %J Results in Engineering %D 2023 %V 20 %@ 2590-1230 %F AMISH:2023:rineng %X Numerous wells worldwide encounter significant, costly, and time-consuming lost circulation issues during drilling or while deploying tubulars across naturally fractured or induced fractured formations. This can potentially lead to formation damage, wellbore instability, and even blowouts. Effectively addressing this problem and restoring fluid circulation becomes crucial to curbing non-productive time and overall operational expenses. Although numerous methods have been introduced, a universally accepted industry solution for predicting lost circulation remains absent due to the complex interplay of various factors influencing its severity. Anticipating the onset of circulation loss is imperative to mitigate its impacts, minimise costs, and reduce risks to personnel and the environment. In this study, an innovative machine learning approach employing multigene genetic algorithms is used to analyse a dataset of 16,970 drilling datasets from 61 wells within the Marun oil field, located in Iran, where severe loss of circulation occurred. Geological characteristics, operational drilling parameters, and the properties of the drilling fluid were all considered. The dataset encompasses 19 parameters, of which seven are chosen as inputs for predicting lost circulation incidents. These inputs are then employed to construct a predictive model, employing an 85:15 training-to-test data ratio. To assess the model’s performance, unseen datasets are used. The novelty of this study lies in the proposed model’s consideration of a concise set of relevant input parameters, particularly real-time surface drilling parameters that are easily accessible for every well. The model attains a remarkable level of prediction accuracy for fluid loss, as indicated by various performance indices. The results indicate a mean absolute error of 1.33, a root mean square error of 2.58, and a coefficient of determination of 0.968. The suggested prediction model is optimised not only for data reduction but also for universal prediction and compatibility with other existing platforms. Moreover, it aids drilling engineers in implementing suitable mitigation strategies and designing optimal values for key operational surface parameters, both prior to and during drilling operations %K genetic algorithms, genetic programming, Lost circulation, Machine learning, Multigene genetic algorithms, Drilling. non-productive time %9 journal article %R doi:10.1016/j.rineng.2023.101464 %U https://www.sciencedirect.com/science/article/pii/S2590123023005911 %U http://dx.doi.org/doi:10.1016/j.rineng.2023.101464 %P 101464 %0 Journal Article %T Shape Quantization and Recognition with Randomized Trees %A Amit, Yali %A Geman, Donald %J Neural Computation %D 1997 %8 oct %V 9 %N 7 %F nc:Amit+Geman:1997 %X We explore a new approach to shape recognition based on a virtually infinite family of binary features (queries) of the image data, designed to accommodate prior information about shape invariance and regularity. Each query corresponds to a spatial arrangement of several local topographic codes (or tags), which are in themselves too primitive and common to be informative about shape. All the discriminating power derives from relative angles and distances among the tags. The important attributes of the queries are a natural partial ordering corresponding to increasing structure and complexity; semi-invariance, meaning that most shapes of a given class will answer the same way to two queries that are successive in the ordering; and stability, since the queries are not based on distinguished points and substructures. No classifier based on the full feature set can be evaluated, and it is impossible to determine a priori which arrangements are informative. Our approach is to select informative features and build tree classifiers at the same time by inductive learning. In effect, each tree provides an approximation to the full posterior where the features chosen depend on the branch that is traversed. Due to the number and nature of the queries, standard decision tree construction based on a fixed-length feature vector is not feasible. Instead we entertain only a small random sample of queries at each node, constrain their complexity to increase with tree depth, and grow multiple trees. The terminal nodes are labelled by estimates of the corresponding posterior distribution over shape classes. An image is classified by sending it down every tree and aggregating the resulting distributions. The method is applied to classifying handwritten digits and synthetic linear and nonlinear deformations of three hundred LATeX symbols. State-of-the-art error rates are achieved on the National Institute of Standards and Technology database of digits. The principal goal of the experiments on LATeX symbols is to analyse invariance, generalisation error and related issues, and a comparison with artificial neural networks methods is presented in this context. %9 journal article %P 1545-1588 %0 Journal Article %T Multi-agent architecture for Multiaobjective optimization of Flexible Neural Tree %A Ammar, Marwa %A Bouaziz, Souhir %A Alimi, Adel M. %A Abraham, Ajith %J Neurocomputing %D 2016 %V 214 %@ 0925-2312 %F Ammar:2016:Neurocomputing %X In this paper, a multi-agent system is introduced to parallelize the Flexible Beta Basis Function Neural Network (FBBFNT)’ training as a response to the time cost challenge. Different agents are formed; a Structure Agent is designed for the FBBFNT structure optimization and a variable set of Parameter Agents is used for the FBBFNT parameter optimization. The main objectives of the FBBFNT learning process were the accuracy and the structure complexity. With the proposed multi-agent system, the main purpose is to reach a good balance between these objectives. For that, a multi-objective context was adopted which based on Pareto dominance. The agents use two algorithms: the Pareto dominance Extended Genetic Programming (PEGP) and the Pareto Multi-Dimensional Particle Swarm Optimization ( PMD _ PSO ) algorithms for the structure and parameter optimization, respectively. The proposed system is called Pareto Multi-Agent Flexible Neural Tree ( PMA _ FNT ). To assess the effectiveness of PMA _ FNT , four benchmark real datasets of classification are tested. The results compared with some classifiers published in the literature. %K genetic algorithms, genetic programming, Flexible Neural Tree, Multi-agent architecture, Multi-objective optimization, Evolutionary Computation algorithms, Negotiation, Classification %9 journal article %R doi:10.1016/j.neucom.2016.06.019 %U http://www.sciencedirect.com/science/article/pii/S0925231216306579 %U http://dx.doi.org/doi:10.1016/j.neucom.2016.06.019 %P 307-316 %0 Journal Article %T Introducing artificial evolution into peer-to-peer networks with the distributed remodeling framework %A Amoretti, Michele %J Genetic Programming and Evolvable Machines %D 2013 %8 jun %V 14 %N 2 %@ 1389-2576 %F Amoretti:2013:GPEM %X A peer-to-peer (P2P) network is a complex system whose elements (peer nodes, or simply peers) cooperate to implement scalable distributed services. From a general point of view, the activities of a P2P system are consequences of external inputs coming from the environment, and of the internal feedback among nodes. The reaction of a peer to direct or indirect inputs from the environment is dictated by its functional structure, which is usually defined in terms of static rules (protocols) shared among peers. The introduction of artificial evolution mechanisms may improve the efficiency of P2P networks, with respect to resource consumption, while preserving high performance in response to the environmental needs. In this paper, we propose the distributed remodelling framework (DRF), a general approach for the design of efficient environment-driven peer-to-peer networks. As a case study, we consider an ultra-large-scale storage and computing system whose nodes perform lookups for resources provided by other nodes, to cope with task execution requests that cannot be fulfilled locally. Thanks to the DRF, workload modifications trigger reconfigurations at the level of single peers, from which global system adaptation emerges without centralised control. %K genetic algorithms, Peer-to-peer network, Artificial evolution, Complex adaptive system %9 journal article %R doi:10.1007/s10710-013-9182-0 %U http://dx.doi.org/doi:10.1007/s10710-013-9182-0 %P 127-153 %0 Conference Proceedings %T DNA Simulation of Boolean Circuits %A Amos, Martyn %A Dunne, Paul E. %A Gibbons, Alan %Y Koza, John R. %Y Banzhaf, Wolfgang %Y Chellapilla, Kumar %Y Deb, Kalyanmoy %Y Dorigo, Marco %Y Fogel, David B. %Y Garzon, Max H. %Y Goldberg, David E. %Y Iba, Hitoshi %Y Riolo, Rick %S Genetic Programming 1998: Proceedings of the Third Annual Conference %D 1998 %8 22 25 jul %I Morgan Kaufmann %C University of Wisconsin, Madison, Wisconsin, USA %@ 1-55860-548-7 %F amos:1998:DNAsbc %K DNA Computing %P 679-683 %0 Conference Proceedings %T Automatic generation of Lyapunov function using Genetic programming approach %A Amte, A. Y. %A Kate, P. S. %S 2015 International Conference on Energy Systems and Applications %D 2015 %8 oct %F Amte:2015:ICESA %X The paper introduces a novel approach for the automated generation of a Lyapunov function for the analysis of a given dynamic system using genetic programming (GP). Genetic programming is a branch of Genetic algorithm. It introduces the concept of GP for the automation of Lyapunov function in MATLAB used for various optimisation techniques. A Lyapunov function method used for transient stability assessment is discussed and hence discussion followed by the establishment of domain of attraction of stable equilibrium point. The results obtained by MATLAB coding for the generation of Lyapunov function of single machine infinite bus system is related by considering a ball rolling on the inner surface of a bowl which depicted in edition of Power System Analysis and Control. %K genetic algorithms, genetic programming %R doi:10.1109/ICESA.2015.7503454 %U http://dx.doi.org/doi:10.1109/ICESA.2015.7503454 %P 771-775 %0 Conference Proceedings %T PyGGI: Python General framework for Genetic Improvement %A An, Gabin %A Kim, Jinhan %A Lee, Seongmin %A Yoo, Shin %S Proceedings of Korea Software Congress %S KSC 2017 %D 2017 %8 20 22 dec %C Busan, South Korea %F An2017aa %X We present Python General Framework for Genetic Improvement (PYGGI, pronounced pigi), a lightweight general framework for Genetic Improvement (GI). It is designed to be a simple and easy to configure GI tool for multiple programming languages such as Java, C, or Python. Through two case studies, we show that PYGGI can modify source code of a given program either to improve non-functional properties or to automatically repair functional faults. %K genetic algorithms, genetic programming, Genetic Improvement %U https://coinse.kaist.ac.kr/publications/pdfs/An2017aa.pdf %P 536-538 %0 Conference Proceedings %T Comparing Line and AST Granularity Level for Program Repair using PyGGI %A An, Gabin %A Kim, Jinhan %A Yoo, Shin %Y Petke, Justyna %Y Stolee, Kathryn %Y Langdon, William B. %Y Weimer, Westley %S GI-2018, ICSE workshops proceedings %D 2018 %8 February %I ACM %C Gothenburg, Sweden %F An:2018:GI %X PyGGI is a lightweight Python framework that can be used to implement generic Genetic Improvement algorithms at the API level. The original version of PyGGI only provided lexical modifications, i.e., modifications of the source code at the physical line granularity level. This paper introduces new extensions to PyGGI that enables syntactic modifications for Python code, i.e., modifications that operates at the AST granularity level. Taking advantage of the new extensions, we also present a case study that compares the lexical and syntactic search granularity level for automated program repair, using ten seeded faults in a real world open source Python project. The results show that search landscapes at the AST granularity level are more effective (i.e. eventually more likely to produce plausible patches) due to the smaller sizes of ingredient spaces (i.e., the space from which we search for the material to build a patch), but may require longer time for search because the larger number of syntactically intact candidates leads to more fitness evaluations. %K genetic algorithms, genetic programming, genetic improvement, APR, SBSE %R doi:10.1145/3194810.3194814 %U http://www.cs.ucl.ac.uk/staff/W.Langdon/icse2018/gi2018/papers/An_2018_GI.pdf %U http://dx.doi.org/doi:10.1145/3194810.3194814 %P 19-26 %0 Journal Article %T Genetic Improvement Workshop at ICSE 2018 %A An, Gabin %J SIGEVOlution %D 2018 %8 dec %V 11 %N 4 %F An:2018:sigevolution %X In Gothenburg, on 2nd June 2018, the fourth edition of Genetic Improvement (GI) Workshop was co-located with this year’s ICSE (International Conference on Software Engineering), the biggest and probably the most prestigious software engineering conference... %K genetic algorithms, genetic programming, genetic improvement %9 journal article %R doi:10.1145/3302542.3302544 %U http://www.sigevolution.org/issues/SIGEVOlution1104.pdf %U http://dx.doi.org/doi:10.1145/3302542.3302544 %P 11-13 %0 Conference Proceedings %T PyGGI 2.0: Language Independent Genetic Improvement Framework %A An, Gabin %A Blot, Aymeric %A Petke, Justyna %A Yoo, Shin %Y Apel, Sven %Y Russo, Alessandra %S Proceedings of the 27th Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering ESEC/FSE 2019) %D 2019 %8 aug 26–30 %I ACM %C Tallinn, Estonia %F an:2019:fse %X PyGGI is a research tool for Genetic Improvement (GI), that is designed to be versatile and easy to use. We present version 2.0 of PyGGI, the main feature of which is an XML-based intermediate program representation. It allows users to easily define GI operators and algorithms that can be reused with multiple target languages. Using the new version of PyGGI, we present two case studies. First, we conduct an Automated Program Repair (APR) experiment with the QuixBugs benchmark, one that contains defective programs in both Python and Java. Second, we replicate an existing work on runtime improvement through program specialisation for the MiniSAT satisfiability solver. PyGGI 2.0 was able to generate a patch for a bug not previously fixed by any APR tool. It was also able to achieve 14percent runtime improvement in the case of MiniSAT. The presented results show the applicability and the expressiveness of the new version of PyGGI. A video of the tool demo is at: https://youtu.be/PxRUdlRDS40 %K genetic algorithms, genetic programming, Genetic Improvement, APR, SBSE, XML, srcML, Python %R doi:10.1145/3338906.3341184 %U http://www.cs.ucl.ac.uk/staff/a.blot/files/an_2019_fse.pdf %U http://dx.doi.org/doi:10.1145/3338906.3341184 %P 1100-1104 %0 Conference Proceedings %T "13th International Workshop on Genetic Improvement %F an:2024:GI %0 Journal Article %D 2023 %8 16 apr %I ACM %C Lisbon %F 2024"a %X The GI workshops continue to bring together researchers from across the world to exchange ideas about using optimisation techniques, particularly evolutionary computation, such as genetic programming, to improve existing software. Contents: \citeYoo:2024:GI \citeBlot:2024:GI \citeBaxter:2024:GI \citecallan:2024:GI \citeCraine:2024:GI \citelangdon:2024:GI \citeNemeth:2024:GI \citeSarmiento:2024:GI See also \citelangdon:2024:SEN Published: 08 August 2024 %K genetic algorithms, genetic programming, Genetic Improvement %9 journal article %R doi:10.1145/3643692 %U http://geneticimprovementofsoftware.com/events/icse2024 %U http://dx.doi.org/doi:10.1145/3643692 %0 Thesis %T Synergizing Fault Localization and Continuous Integration to Streamline Bug Resolution in Large-Scale Software Systems %A An, Gabin %D 2024 %8 April %C Daejeon, Korea %C Korea Advanced Institute of Science and Technology %F An:thesis %X This thesis explores the synergistic interaction between Continuous Integration (CI) and Fault Localization (FL) within software development, aiming to enhance the efficiency and effectiveness of the bug resolution process. CI plays a critical role as developers frequently merge code changes into a central repository, followed by automated builds and tests to quickly detect bugs and maintain a unified code-base that supports effective collaboration for large-scale software systems. FL is an automated debugging technique designed to precisely detect the locations of bugs within the codebase, reducing the burden on developers. While CI and FL each aim to streamline software development and maintenance independently, their potential for interaction has not been fully explored. This research suggests that leveraging historical CI data can enable more effective application of FL, and that FL can improve the bug resolution process within CI systems. The thesis comprises three studies: identifying common root causes of test failures using diverse information sources available in the CI environment, efficiently identifying bug-inducing commits using FL and code change histories, and developing an explainable FL technique using large language models. Each study addresses specific challenges and provides novel solutions to simplify the debugging and maintenance stages of software development. The proposed solutions are empirically evaluated and thoroughly compared against their baselines using real-world open-source software and large-scale industry software %K SBSE, fault localization, SBFL, AutoFL, FONTE, Defects4J, BugsInPy, continuous integration, bug resolution, debugging, bug assignment, buginducing commit, large language model, LLN, ANN, GPT, SAP HANA, BIC %9 Ph.D. thesis %0 Conference Proceedings %T Adaptive user similarity measures for recommender systems: A genetic programming approach %A Anand, Deepa %A Bharadwaj, K. K. %S 3rd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010) %D 2010 %8 September 11 jul %V 8 %F Anand:2010:ICCSIT %X Recommender systems signify the shift from the paradigm of searching for items to discovering items and have been employed by an increasing number of e-commerce sites for matching users to their preferences. Collaborative Filtering is a popular recommendation technique which exploits the past user-item interactions to determine user similarity. The preferences of such similar users are leveraged to offer suggestions to the active user. Even though several techniques for similarity assessment have been suggested in literature, no technique has been proven to be optimal under all contexts/data conditions. Hence, we propose a two-stage process to assess user similarity, the first is to learn the optimal transformation function to convert the raw ratings data to preference data by employing genetic programming, and the second is to use the preference values, so derived, to compute user similarity. The application of such learnt user bias gives rise to adaptive similarity measures, i.e. similarity estimates that are dataset dependent and hence expected to work best under any data environment. We demonstrate the superiority of our proposed technique by contrasting it to traditional similarity estimation techniques on four different datasets representing varied data environments. %K genetic algorithms, genetic programming, adaptive user similarity measure, collaborative filtering, data environment, item discovery, item searching, optimal transformation function, preference value, raw ratings data, recommender system, similarity assessment, similarity estimation, user-item interaction, groupware, information filtering, recommender systems %R doi:10.1109/ICCSIT.2010.5563737 %U http://dx.doi.org/doi:10.1109/ICCSIT.2010.5563737 %P 121-125 %0 Thesis %T Enhancing Accuracy of Recommender Systems through various approaches to Local and Global Similarity Measures %A Anand, Deepa %D 2011 %8 jul %C New Delhi, India %C Computer and System Sciences, Jawaharlal Nehru University %F Anand:thesis %X Web 2.0 represents a paradigm shift in the way that internet is consumed. Users’ role has evolved from that of passive consumers of content to active prosumers, implying a plethora of information sources and an ever increasing ocean of content. Collaborative Recommender systems have thus emerged as Web 2.0 personalisation tools which aid users in grappling with the overload of information by allowing the discovery of content in contrast to plain search popularised by prior web technologies. To this end Collaborative filtering (CF) exploit the preferences of users who have liked similar items in the past to help a user to identify interesting products and services. The success of CF algorithms, however, is hugely dependent on the technique designed to determine the set of users whose opinion is sought. Traditionally user closeness is assessed by matching their preferences on a set of common experiences that both share. The challenge with this kind of computation is the overabundance of available content to be experienced, at the user’s disposal, thus rendering the user-preference space very sparse. The similarity so computed is thus unstable for user pairs sharing a small set of experiences and is in fact incomputable for most user pairs due to a lack of expressed common preferences. To remedy the sparsity problems we propose methods to enrich the set of user connections obtained using measures such as Pearson Correlation Coefficient (PCC) and Cosine Similarity (COS). We achieve this by leveraging on explicit trust elicitation and trust transitivity. When interacting with anonymous users online, in the absence of physical cues apparent in our daily life, trust provides a reliable measure of quality and guides the user decision process on whether or not to interact with an entity. These trust statements in addition to identifying malicious users also enhance user connectivity by establishing links between pairs of users whose closeness cannot be determined through preference data. In addition transitivity of trust can also be leveraged to further expand the set of neighbours to collaborate with. We first explore a bifurcated view of trust: functional and referral trust i.e. trust in an entity to recommend items and the trust in an entity to recommend recommenders and propose models to quantify referral trust. Such a referral-functional trust framework leads to more meaningful derivation of trust through transitivity resulting in better quality recommendations. Though trust has been extensively used in literature to support the CF process, distrust information has been explored very little in this context. We thus propose a tri-component computation of trust and distrust using preference, functional trust and referral trust in order to densify the network of user interconnections. To maintain a balance between increased coverage and the quality of recommendations, however, we quantify risk measures for each trust and distrust relationship so derived and prune the network to retain high quality relationships thus ensuring good connections formed between users through transitivity of trust and distrust. In the absence of supplemental information such as trust/distrust to provide extra knowledge about user links the local similarity connections can be harnessed to deem a pair of users similar if they are share preferences with the same set of users thus estimating the global similarity between user pairs. We investigate the effectiveness of various graph based global or indirect similarity computation schemes in enhancing the user or item neighborhood thus bettering the quality and number of recommendations obtained. %K genetic algorithms, genetic programming, recommender systems %9 Ph.D. thesis %U http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/papers/Anand_thesis.pdf %0 Journal Article %T Feature Extraction for Collaborative Filtering: A Genetic Programming Approach %A Anand, Deepa %J International Journal of Computer Science Issues %D 2012 %8 sep %V 9 %N 5 %I IJCSI Press %@ 16940784 %G eng %F Anand:2012:IJCSI %X Collaborative filtering systems offer customised recommendations to users by exploiting the interrelationships between users and items. Users are assessed for their similarity in tastes and items preferred by similar users are offered as recommendations. However scalability and scarcity of data are the two major bottlenecks to effective recommendations. With web based RS typically having users in order of millions, timely recommendations pose a major challenge. Sparsity of ratings data also affects the quality of suggestions. To alleviate these problems we propose a genetic programming approach to feature extraction by employing GP to convert from user-item space to user-feature preference space where the feature space is much smaller than the item space. The advantage of this approach lies in the reduction of sparse high dimensional preference information into a compact and dense low dimensional preference data. The features are constructed using GP and the individuals are evolved to generate the most discriminative set of features. We compare our approach to content based feature extraction approach and demonstrate the effectiveness of the GP approach in generating the optimal feature set. %K genetic algorithms, genetic programming, Recommender Systems, Collaborative Filtering, Feature Extraction %9 journal article %U http://www.ijcsi.org/contents.php?volume=9&&issue=5 %P 348-354 %0 Journal Article %T GenClass: A parallel tool for data classification based on Grammatical Evolution %A Anastasopoulos, Nikolaos %A Tsoulos, Ioannis G. %A Tzallas, Alexandros %J SoftwareX %D 2021 %V 16 %@ 2352-7110 %F ANASTASOPOULOS:2021:SoftwareX %X A genetic programming tool is proposed here for data classification. The tool is based on Grammatical Evolution technique and it is designed to exploit multicore computing systems using the OpenMP library. The tool constructs classification programs in a C-like programming language in order to classify the input data, producing simple if-else rules and upon termination the tool produces the classification rules in C and Python files. Also, the user can use his own Backus Normal Form (BNF) grammar through a command line option. The tool is tested on a wide range of classification problems and the produced results are compared against traditional techniques for data classification, yielding very promising results %K genetic algorithms, genetic programming, Data classification, Grammatical evolution, Stochastic methods %9 journal article %R doi:10.1016/j.softx.2021.100830 %U https://www.sciencedirect.com/science/article/pii/S2352711021001199 %U http://dx.doi.org/doi:10.1016/j.softx.2021.100830 %P 100830 %0 Generic %T Estimation of Gas Turbine Shaft Torque and Fuel Flow of a CODLAG Propulsion System Using Genetic Programming Algorithm %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Car, Zlatan %D 2020 %8 July %I arXiv %F DBLP:journals/corr/abs-2012-03527 %X he publicly available dataset of condition based maintenance of combined diesel-electric and gas (CODLAG) propulsion system for ships has been used to obtain symbolic expressions which could estimate gas turbine shaft torque and fuel flow using genetic programming (GP) algorithm. The entire dataset consists of 11934 samples that was divided into training and testing portions of dataset in an 80:20 ratio. The training dataset used to train the GP algorithm to obtain symbolic expressions for gas turbine shaft torque and fuel flow estimation consisted of 9548 samples. The best symbolic expressions obtained for gas turbine shaft torque and fuel flow estimation were obtained based on their R2 score generated as a result of the application of the testing portion of the dataset on the aforementioned symbolic expressions. The testing portion of the dataset consisted of 2386 samples. The three best symbolic expressions obtained for gas turbine shaft torque estimation generated R2 scores of 0.999201, 0.999296, and 0.999374, respectively. The three best symbolic expressions obtained for fuel flow estimation generated R2 scores of 0.995495, 0.996465, and 0.996487, respectively. %K genetic algorithms, genetic programming, Artificial Intelligence, Combined Diesel-Electric and Gas Propulsion System, Genetic Programming Algorithm, Gas Turbine Shaft Torque Estimation, Fuel Flow Estimation %U https://arxiv.org/abs/2012.03527 %0 Journal Article %T Estimation of COVID-19 epidemic curves using genetic programming algorithm %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Mrzljak, Vedran %A Car, Zlatan %J Health Informatics J. %D 2021 %V 27 %N 1 %F DBLP:journals/hij/AndelicSLMC21 %K genetic algorithms, genetic programming %9 journal article %R doi:10.1177/1460458220976728 %U https://doi.org/10.1177/1460458220976728 %U http://dx.doi.org/doi:10.1177/1460458220976728 %P 146045822097672 %0 Journal Article %T Estimation of COVID-19 Epidemiology Curve of the United States Using Genetic Programming Algorithm %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Jurilj, Zdravko %A Sustersic, Tijana %A Blagojevic, Andela %A Protic, Alen %A Cabov, Tomislav %A Filipovic, Nenad %A Car, Zlatan %J International Journal of Environmental Research and Public Health %D 2021 %V 18 %N 3 %@ 1660-4601 %F andelic:2021:IJERPH %X Estimation of the epidemiology curve for the COVID-19 pandemic can be a very computationally challenging task. Thus far, there have been some implementations of artificial intelligence (AI) methods applied to develop epidemiology curve for a specific country. However, most applied AI methods generated models that are almost impossible to translate into a mathematical equation. In this paper, the AI method called genetic programming (GP) algorithm is used to develop a symbolic expression (mathematical equation) which can be used for the estimation of the epidemiology curve for the entire U.S. with high accuracy. The GP algorithm is used on the publicly available dataset that contains the number of confirmed, deceased and recovered patients for each U.S. state to obtain the symbolic expression for the estimation of the number of the aforementioned patient groups. The dataset consists of the latitude and longitude of the central location for each state and the number of patients in each of the goal groups for each day in the period of 22 January 2020–3 December 2020. The obtained symbolic expressions for each state are summed up to obtain symbolic expressions for estimation of each of the patient groups (confirmed, deceased and recovered). These symbolic expressions are combined to obtain the symbolic expression for the estimation of the epidemiology curve for the entire U.S. The obtained symbolic expressions for the estimation of the number of confirmed, deceased and recovered patients for each state achieved R2 score in the ranges 0.9406–0.9992, 0.9404–0.9998 and 0.9797–0.99955, respectively. These equations are summed up to formulate symbolic expressions for the estimation of the number of confirmed, deceased and recovered patients for the entire U.S. with achieved R2 score of 0.9992, 0.9997 and 0.9996, respectively. Using these symbolic expressions, the equation for the estimation of the epidemiology curve for the entire U.S. is formulated which achieved R2 score of 0.9933. Investigation showed that GP algorithm can produce symbolic expressions for the estimation of the number of confirmed, recovered and deceased patients as well as the epidemiology curve not only for the states but for the entire U.S. with very high accuracy. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/ijerph18030959 %U https://www.mdpi.com/1660-4601/18/3/959 %U http://dx.doi.org/doi:10.3390/ijerph18030959 %0 Journal Article %T Use of Genetic Programming for the Estimation of CODLAG Propulsion System Parameters %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Poljak, Igor %A Mrzljak, Vedran %A Car, Zlatan %J Journal of Marine Science and Engineering %D 2021 %V 9 %N 6 %@ 2077-1312 %F andelic:2021:JMSE %X In this paper, the publicly available dataset for the Combined Diesel-Electric and Gas (CODLAG) propulsion system was used to obtain symbolic expressions for estimation of fuel flow, ship speed, starboard propeller torque, port propeller torque, and total propeller torque using genetic programming (GP) algorithm. The dataset consists of 11,934 samples that were divided into training and testing portions in an 80:20 ratio. The training portion of the dataset which consisted of 9548 samples was used to train the GP algorithm to obtain symbolic expressions for estimation of fuel flow, ship speed, starboard propeller, port propeller, and total propeller torque, respectively. After the symbolic expressions were obtained the testing portion of the dataset which consisted of 2386 samples was used to measure estimation performance in terms of coefficient of correlation (R2) and Mean Absolute Error (MAE) metric, respectively. Based on the estimation performance in each case three best symbolic expressions were selected with and without decay state coefficients. From the conducted investigation, the highest R2 and lowest MAE values were achieved with symbolic expressions for the estimation of fuel flow, ship speed, starboard propeller torque, port propeller torque, and total propeller torque without decay state coefficients while symbolic expressions with decay state coefficients have slightly lower estimation performance. %K genetic algorithms, genetic programming %9 journal article %R doi:10.3390/jmse9060612 %U https://www.mdpi.com/2077-1312/9/6/612 %U http://dx.doi.org/doi:10.3390/jmse9060612 %0 Conference Proceedings %T Utilization of Genetic Programming for Estimation of Molecular Structures Ground State Energies %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Glucina, Matko %A Musulin, Jelena %A Stifanic, Daniel %A Car, Zlatan %S 1st Serbian International Conference on Applied Artificial Intelligence %D 2022 %8 may 19 20 %I Springer %C Kragujevac, Serbia %F Andelic:2022:SICAAI %X GP to predict ground-state energies of molecules made up of C, H, N, O, P, and S (CHONPS) atoms. The GP was trained and tested on a publicly available dataset which consist of 16242 molecules where ground state energies were computed using the density functional theory (DFT). The optimal parameters of GP were chosen using the random parameter search method. After multiple GP executions, the best symbolic expression was chosen using a coefficient of determination (R), mean absolute error (MAE), and root mean square error (RMSE). The best symbolic expression achieved R, MAE, and RMSE of 0.9434, 0.48, and 0.86, respectively. %K genetic algorithms, genetic programming, CHNOPS dataset, ground state energies %U http://aai2022.kg.ac.rs/wp-content/uploads/upload/AAI_2022_Papers.zip %0 Journal Article %T Detection of Malicious Websites Using Symbolic Classifier %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Glucina, Matko %J Future Internet %D 2022 %8 nov %V 14 %N 12 %I MDPI %@ 1999-5903 %F Andelic:2022:FI %X Malicious websites are web locations that attempt to install malware, which is the general term for anything that will cause problems in computer operation, gather confidential information, or gain total control over the computer. a novel approach is proposed which consists of the implementation of the genetic programming symbolic classifier (GPSC) algorithm on a publicly available dataset to obtain a simple symbolic expression (mathematical equation) which could detect malicious websites with high classification accuracy. Due to a large imbalance of classes in the initial dataset, several data sampling methods (random under-sampling/oversampling, ADASYN, SMOTE, BorderlineSMOTE, and KmeansSMOTE) were used to balance the dataset classes. For this investigation, the hyper-parameter search method was developed to find the combination of GPSC hyperparameters with which high classification accuracy could be achieved. The first investigation was conducted using GPSC with a random hyperparameter search method and each dataset variation was divided on a train and test dataset in a ratio of 70:30. To evaluate each symbolic expression, the performance of each symbolic expression was measured on the train and test dataset and the mean and standard deviation values of accuracy (ACC), AUC, precision, recall and f1-score were obtained. The second investigation was also conducted using GPSC with the random hyperparameter search method; however, 70percent, i.e., the train dataset, was used to perform 5-fold cross-validation. If the mean accuracy, AUC, precision, recall, and f1-score values were above 0.97 then final training and testing (train/test 70:30) were performed with GPSC with the same randomly chosen hyperparameters used in a 5-fold cross-validation process and the final mean and standard deviation values of the aforementioned evaluation methods were obtained. In both investigations, the best symbolic expression was obtained in the case where the dataset balanced with the KMeansSMOTE method was used for training and testing. The best symbolic expression obtained using GPSC with the random hyperparameter search method and classic train釦est procedure (70:30) on a dataset balanced with the KMeansSMOTE method achieved values of %K genetic algorithms, genetic programming, malicious websites, oversampling methods, symbolic classifier, undersampling methods %9 journal article %R doi:10.3390/fi14120358 %U https://www.mdpi.com/1999-5903/14/12/358 %U http://dx.doi.org/doi:10.3390/fi14120358 %P Articleno358 %0 Journal Article %T The Development of Symbolic Expressions for Fire Detection with Symbolic Classifier Using Sensor Fusion Data %A Andelic, Nikola %A Baressi Segota, Sandi %A Lorencin, Ivan %A Car, Zlatan %J Sensors %D 2022 %8 dec %V 23 %N 1 %I MDPI %@ 1424-8220 %F Andelic:2022:Sensors %X Fire is usually detected with fire detection systems that are used to sense one or more products resulting from the fire such as smoke, heat, infrared, ultraviolet light radiation, or gas. Smoke detectors are mostly used in residential areas while fire alarm systems (heat, smoke, flame, and fire gas detectors) are used in commercial, industrial and municipal areas. However, in addition to smoke, heat, infrared, ultraviolet light radiation, or gas, other parameters could indicate a fire, such as air temperature, air pressure, and humidity, among others. Collecting these parameters requires the development of a sensor fusion system. However, with such a system, it is necessary to develop a simple system based on artificial intelligence (AI) that will be able to detect fire with high accuracy using the information collected from the sensor fusion system. The novelty of this paper is to show the procedure of how a simple AI system can be created in form of symbolic expression obtained with a genetic programming symbolic classifier (GPSC) algorithm and can be used as an additional tool to detect fire with high classification accuracy. Since the investigation is based on an initially imbalanced and publicly available dataset (high number of samples classified as 1-Fire Alarm and small number of samples 0-No Fire Alarm), the idea is to implement various balancing methods such as random undersampling/oversampling, Near Miss-1, ADASYN, SMOTE, and Borderline SMOTE. The obtained balanced datasets were used in GPSC with random hyperparameter search combined with 5-fold cross-validation to obtain symbolic expressions that could detect fire with high classification accuracy. For this in