Skip to main content

Advertisement

Log in

Multi-objective breast cancer classification by using multi-expression programming

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Despite many years of research, breast cancer detection is still a difficult, but very important problem to be solved. An automatic diagnosis system could establish whether a mammography presents tumours or belongs to a healthy patient and could offer, in this way, a second opinion to a radiologist that tries to establish a diagnosis. We therefore propose a system that could contribute to lowering both the costs and the work of an imaging diagnosis centre of breast cancer and in addition to increase the trust level in that diagnosis. We present a multi-objective evolutionary approach based on Multi-Expression Programming—a linear Genetic Programming method—that could classify a mammogram starting from a raw image of the breast. The processed images are represented through Histogram of Oriented Gradients and Kernel Descriptors since these image features have been reported as being very efficient in the image recognition scientific community and they have not been applied to mammograms before. Numerical experiments are performed on freely available datasets consisting of normal and abnormal film-based and digital mammograms and show the efficiency of the proposed decision support system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Asuncion A, Newman D (2007) UCI machine learning repository

  2. Banzhaf W (1993) Genetic programming for pedestrians. In: Forrest S (ed) Proceedings of the fifth international conference on genetic algorithms (ICGA’93). Morgan Kaufmann, San Mateo, p 628

  3. Barros RC, Basgalupp MP, de Carvalho ACPLF, Freitas AA (2012) A survey of evolutionary algorithms for decision-tree induction. IEEE Trans Syst Man Cybern Part C 42(3):291–312

    Article  Google Scholar 

  4. Basgalupp MP, de Carvalho ACPLF, Barros RC, Ruiz DD, Freitas AA (2009) Lexicographic multi-objective evolutionary induction of decision trees. IJBIC 1(1/2):105–117

    Article  Google Scholar 

  5. Bhowan U, Johnston M, Zhang M (2012) Developing new fitness functions in genetic programming for classification with unbalanced data. IEEE Trans Syst Man Cybern Part B: Cybern 42(2):406–421

    Article  Google Scholar 

  6. Bhowan U, Johnston M, Zhang M, Yao X (2013) Evolving diverse ensembles using genetic programming for classification with unbalanced data. IEEE Trans Evol Comput 17(3):368–386

    Article  Google Scholar 

  7. Bhowan U, Zhang M, Johnston M (2009) Genetic programming for image classification with unbalanced data. In: Proceeding of the 24th international conference image and vision computing New Zealand, IVCNZ ’09 (Wellington, Nov. 23–25). IEEE, pp 316–321

  8. Bhowan U, Zhang M, Johnston M (2009) Multi-objective genetic programming for classification with unbalanced data. In: Nicholson AE, Li X (eds) AI 2009: advances in artificial intelligence, 22nd Australasian joint conference, Melbourne, Australia, December 1–4, 2009. Proceedings. Vol 5866 of Lecture Notes in Computer Science. Springer, pp 370–380

  9. Bo L, Ren X, Fox D (2010) Kernel descriptors for visual recognition. In: Lafferty JD, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A (eds) NIPS. Curran Associates, Inc, pp 244–252

  10. Bot MCJ Improving induction of linear classification trees with genetic programming. In: Whitley D, Goldberg D, Cantu-Paz E, Spector L, Parmee I, Beyer H-G (eds) Proceedings of the genetic and evolutionary computation conference (GECCO-2000) (Las Vegas, Nevada, USA, 10–12 July 2000). Morgan Kaufmann, pp 403–410

  11. Corne D, Knowles JD, Oates MJ (2000) The pareto envelope-based selection algorithm for multi-objective optimisation. In: Schoenauer M, Deb K, Rudolph G, Yao X, Lutton E, Guervos JJM, Schwefel H-P (eds) Parallel problem solving from nature – PPSN VI (6th PPSN’2000). Vol 1917 of Lecture Notes in Computer Science (LNCS). Springer, New York, pp 839–848

    Google Scholar 

  12. Cortes C, Mohri M (2003) AUC optimization vs. error rate minimization. In: Thrun S, Saul LK, Schölkopf B (eds) NIPS. MIT Press, Cambridge

    Google Scholar 

  13. Crepeau RL Genetic evolution of machine language software. In: Rosca JP (ed) Proceedings of the workshop on genetic programming: from theory to real-world applications (Tahoe City, California, USA, 9 July 1995), pp 121–134

  14. Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: ECCV, pp II: 428–441

  15. Deb K, Pratap A, Agarwal S, Meyarivan TA (2002) Fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE-EC 6, pp 182–197

  16. Eggermont J, Kok JN, Kosters WA (2004) Genetic programming for data classification: partitioning the search space. In: Proceedings of the 2004 symposium on applied computing (ACM SAC’04) (Nicosia, Cyprus, 14–17 Mar), pp 1001–1005

  17. Eklund SE (2002) A massively parallel GP engine in VLSI. In: Fogel DB, El-Sharkawi MA, Yao X, Greenwood G, Iba H, Marrow P, Shackleton M (eds) Proceedings of the 2002 congress on evolutionary computation CEC2002. IEEE Press, Piscataway, pp 629–633

    Google Scholar 

  18. Espejo PG, Ventura S, Herrera F (2010) A survey on the application of genetic programming to classification. IEEE Trans Syst Man Cybern Part C: Appl Rev 40(2):121–144

    Article  Google Scholar 

  19. Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin D, Forman D, Bray F (2013) Globocan 2012 v1.0, cancer incidence and mortality worldwide: Iarc cancerbase no. 11

  20. Gotzsche P, Nielsen M (2011) Screening for breast cancer with mammography. The Cochrane Library

  21. Khare V, Yao X, Deb K (2003) Performance scaling of multi-objective evolutionary algorithms. In: Fonseca CM, Fleming PJ, Zitzler E, Deb K, Thiele L (eds) Evolutionary multi-criterion optimization. Second international conference, EMO 2003 (Faro, Portugal, Apr.). Lecture Notes in Computer Science, vol 2632. Springer, pp 376–390

  22. Kim D (2004) Structural risk minimization on decision trees using an evolutionary multiobjective optimization. In: M. Keijzer U-M O’Reilly, Lucas SM, Costa E, Soule T (eds) Genetic programming 7th European conference, EuroGP 2004, Proceedings (Coimbra, Portugal, 5–7 Apr), vol 3003 of LNCS. Springer, pp 338– 348

  23. Koza J, Poli R (2005) Genetic programming. In: Burke EK, Kendall G (eds) Search methodologies: introductory tutorials in optimization and decision support techniques, ch. 5. Springer, Berlin

  24. Levesque J-C, Durand A, Gagné C, Sabourin R (2012) Multi-objective evolutionary optimization for generating ensembles of classifiers in the ROC space. In: Soule T, Moore JH (eds) Genetic and evolutionary computation conference, GECCO ’12, Philadelphia, PA, USA, July 7–11, 2012. ACM, pp 879– 886

  25. Moura D C, Guevara-López MÁ (2013) An evaluation of image descriptors combined with clinical data for breast cancer diagnosis. Int J Comput Assist Radiol Surg 8(4):561– 574

    Article  Google Scholar 

  26. Nelson H, Tyne K, Naik A, Bougatsos C, Chan B, Humphrey L (2009) Screening for breast cancer: systematic evidence review update for the us preventive services task force. Ann Intern Med 10(151):727

    Article  Google Scholar 

  27. Nordin P (1994) A compiling genetic programming system that directly manipulates the machine code. In: Kinnear KE Jr (ed) Advances in genetic programming, ch. 14. MIT Press, Cambridge, pp 311–332

  28. Oltean M, Grosan C (2003) Evolving evolutionary algorithms using multi expression programming. In: Banzhaf W, Christaller T, Dittrich P, Kim JT, Ziegler J (eds) Proceedings of European conference on artificial life: advances in artificial life. Vol 2801 of Lecture Notes in Artificial Intelligence. Springer, Berlin, pp 651–658

    Google Scholar 

  29. Openshaw S, Turton I (1994) Building new spatial interaction models using genetic programming. In: Fogarty TC (ed) Evolutionary computing, AISB workshop

  30. Papagelis A, Kalles D (2001) Breeding decision trees using evolutionary techniques. In: Proc. 18th international conf. on machine learning. Morgan Kaufmann, San Francisco, pp 393–400

  31. Parrott D, Li X, Ciesielski V (2005) Multi-objective techniques in genetic programming for evolving classifiers. In: 2005 IEEE congress on evolutionary computation (CEC’2005) (Edinburgh, Scotland, Sept.), vol 2. IEEE Service Center, pp 1141–1148

  32. Perkis T (1994) Stack-based genetic programming. In: International conference on evolutionary computation, pp 148–153

  33. Poli R, Langdon WB, McPhee NF (2008) A field guide to genetic programming. Published via http://lulu.com and freely available at http://www.gp-field-guide.org.uk

  34. Ramos-Pollán R, Guevara-López MÁ, Ortega CS, Díaz-Herrero G, Franco-Valiente JM, del Solar MR, de Posada NG, Vaz MAP, Loureiro J, Ramos I (2012) Discovering mammography-based machine learning classifiers for breast cancer diagnosis. J Med Syst 36(4):2259–2269

    Article  Google Scholar 

  35. Schölkopf B (2000) The kernel trick for distances. In: Leen TK, Dietterich TG, Tresp V (eds) NIPS. MIT Press, Cambridge, pp 301–307

    Google Scholar 

  36. Srinivas N, Deb K (1995) Multiobjective optimization using nondominated sorting in genetic algorithms. Evol Comput 2(3):221–248

    Article  Google Scholar 

  37. Suckling J, Parker J, Dance DR, Astley S, Hutt I, Boggis CRM, Ricketts I, Stamatakis E, Cerneaz N, Kok S-L, Taylor P, Betal D, Savage J (1994) The Mammographic Image Analysis Society digital mammogram database. In: Proceedings of the 2nd international workshop on digital mammography (York, England, July), pp 375–378

  38. Tabar L, Vitak B, Chen T, Yen A, Cohen A, Tot T, Chiu S, Chen S, Fann J, Rosell J, Fohlin H, Smith R, Duffy S (2011) Swedish two-county trial: impact of mammographic screening on breast cancer mortality during 3 decades. Radiology 3(260):658–663

    Article  Google Scholar 

  39. Tanigawa T, Zhao Q (2000) A study on efficient generation of decision trees using genetic programming. In: Whitley D, Goldberg D, Cantu-Paz E, Spector L, Parmee I, Beyer H-G (eds) Proceedings of the genetic and evolutionary computation conference (GECCO-2000)(Las Vegas, Nevada, USA, 10–12 July). Morgan Kaufmann, pp 1047–1052

  40. Tsakonas A, Dounias G (2002) Hierarchical classification trees using type-constrained genetic programming. In: Intelligent systems, 2002. Proceedings 2002 First International IEEE Symposium, vol 2, pp 50–54

  41. Wang P, Tang K, Weise T, Tsang EPK, Yao X (2014) Multiobjective genetic programming for maximizing ROC performance. Neurocomputing 125:102–118

    Article  Google Scholar 

  42. Zhao H (2007) A multi-objective genetic programming approach to developing pareto optimal decision trees. Decis Support Syst 43(3):809–826

    Article  Google Scholar 

  43. Zitzler E, Laumanns M (2001) SPEA2: improving the strength pareto evolutionary algorithm elektronische daten

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anca Andreica.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dioşan, L., Andreica, A. Multi-objective breast cancer classification by using multi-expression programming. Appl Intell 43, 499–511 (2015). https://doi.org/10.1007/s10489-015-0668-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-015-0668-8

Keywords

Navigation