Skip to main content

Advertisement

Log in

Knowledge Discovery with Genetic Programming for Providing Feedback to Courseware Authors

  • Published:
User Modeling and User-Adapted Interaction Aims and scope Submit manuscript

Abstract

We introduce a methodology to improve Adaptive Systems for Web-Based Education. This methodology uses evolutionary algorithms as a data mining method for discovering interesting relationships in students’ usage data. Such knowledge may be very useful for teachers and course authors to select the most appropriate modifications to improve the effectiveness of the course. We use Grammar-Based Genetic Programming (GBGP) with multi-objective optimization techniques to discover prediction rules. We present a specific data mining tool that can help non-experts in data mining carry out the complete rule discovery process, and demonstrate its utility by applying it to an adaptive Linux course that we developed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Agrawal, R., Imielinski, T., Swami, A. (1993). Mining Association Rules between Sets of Items in Large Databases. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data. Washington, DC, pp. 207–216.

  • Araujo, D. L. A., Lopes, H. S., Freitas, A. A. (1999). A Parallel Genetic Algorithm for Rule Discovery in Large Databases. Proceedings of Conference on IEEE Systems and Cybernetics. Tokyo. pp. 940–945

  • Arruabarrena, R. Prez, T.A. Lpez-Cuadrado, J. and Gutirrez, J. (2002). On Evaluating Adaptive Systems for Education. Second Conference AH 2002. Adaptive Hypermedia and Adaptive Web-Based Systems. Nicosia, Cyprus. pp. 363–367.

  • Askira-Gelman, I. (1998). Knowledge Discovery: Comprehensibility of the Results. Hawaii International Conference on System Sciences. Kohala Coast, HI, pp. 247–255.

  • Bayardo, R. J. and Agrawal, R. (1999). Mining the most Interesting Rules. Fifth Conference ACM on Knowledge Discovery and Data Mining SIGKDD, San Diego, CA, USA. pp. 145–154.

  • Bojarczuk, C. C. Lopes, H. S., and Freitas, A. A. (2001). Constrained-Syntax Genetic Programming for Knowledge Discovery in Medical Databases. 10th International Symposium on System Modeling. Zakopane, Poland.

  • Brusilovsky, P. (1998). Adaptive Educational Systems on the World-Wide-Web: A Review of Available Technologies. Proceedings of the Workshop “www-Base Tutoring” at the 4th International Conference on Intelligent Tutoring Systems (ITS 98), San Antonio, TX.

  • Brusilovsky, P. (2001). Adaptive Educational Hypermedia. Proceedings of Tenth International PEG Conference, Tampere, Finland, pp. 8–12.

  • Carro, R.M., Pulido, E. and Rodriguez, P. (1999). Desinging Adaptive Web-based Courses with TANGOW. Conference Computers in Education, Chiba, Japan, pp. 697–704.

  • J. Cendrowska (1987) ArticleTitlePRISM: an algorithm for inducing modular rules Journal of Man-Machine Studies 27 349–370

    Google Scholar 

  • Coello, C. A., Veldhuizen, D. A., and Lamount, G. B. (2002). Evolutionary Algorithms for Solving Multi-Objective Problems. Dordrecht, Netherlands: Kluwer.

  • M. Delgado D. Sanchez M.J. Martin-Bautista M.A. Vila (2001) ArticleTitleMining association Rules with Improved Semantics in Medical Databases Artificial Intelligence in Medicine 21 241–245

    Google Scholar 

  • V. Dhar D. Chou F. Provost (2000) ArticleTitleDiscovering Interesting Patterns for Investment Decision Making with GLOWER Data Mining and Knowledge Discovery 4 251–280

    Google Scholar 

  • Dougherty J., Kohavi M., Sahami, M. (1995). Supervised and unsupervised discretization of continuous features. International Conference on Machine Learning Tahoe City.CA, pp. 194–202.

  • C. Darwin (1859) On the Origin of Species by Means of Natural Selection John Murray London

    Google Scholar 

  • K. Deb (2001) Multi-Objective Optimization Using Evolutionary Algorithms John Wiley & Sons USA

    Google Scholar 

  • De Bra, P. Wu, H. Aerst, A. and Houben, G.: 2000. Adaptation Control in Adaptive Hypermedia Systems. In: Proceedings of International Conference on Adaptive Hypermedia and Adaptive Web-based Systems. Trento, Italy, pp. 250–259.

  • De Bra, P., Aerts, A., Berden, B., De Lange, B., Rousseau, B., Santic, T., Smits, D., and Stash, N.: 2003. AHA! The Adaptive Hypermedia Architecture. In: Proceedings of the ACM Hypertext Conference, , Nottingham, UK pp. 81–84.

  • De Castro, C. and Romero, C.: 2002. HAMTUTOR. Autor Tool to Develop Adaptive Multimedia Courses. In: World Conference on E-Learning in Corporate, Government, Healthcare, Higher Education, Montreal, Canada, pp. 2575–2576

  • P. Dubois (2003) MySQL EditionNumber2 Sams USA

    Google Scholar 

  • Fonseca, C.M., and Fleming, P.J.: 1993. Genetic Algorithms for Multiobjective Optimization: Formulation, Discussion and Generalization. In: Proceedings 5rd International Conference on Genetic Algorithms, San Mateo, California, pp. 416–423.

  • Freitas, A.. A.: 1997. A Genetic Programming Framework for Two Data Mining Tasks: Classification and Generalized Rule Induction. In: Conference on Genetic Programming, CA, USA, pp. 96–101.

  • A. A. Freitas (2000) ArticleTitleUnderstanding the Crucial Differences Between Classification and Discovery of Association Rules ACM SIGKDD Explorations 2 IssueID1 65–69

    Google Scholar 

  • Freitas, A.. A. (2001). A Survey of Evolutionary Algorithms for Data Mining and Knowledge Discovery. In: A. Ghosh and S. Tsutsui (eds.): Advances in Evolutionary Computation. Springer-Verlag, pp. 819–845

  • A. A. Freitas (2002) Data Mining and Knowledge Discovery with Evolutionary Algorithms Springer-Verlag, Berlin Heidelberg, New York

    Google Scholar 

  • Gilbert, R. G., Goodacre, R., Shann, B., Kell, D. B., Taylor, J. and Rowland, J. J.: 1998, Genetic Programming-Based Variable Selection for High-Dimensional Data. In: Proceedings 3rd Conference Genetic Programming, San Francisco, CA, USA, pp. 109–115.

  • A. Ghosh B. Nath (2004) ArticleTitleMulti-objective rule mining using genetic algorithms Information Sciences 163 123–133

    Google Scholar 

  • J. Han L. Lakshamanan T. Ng. Raymond (1999) ArticleTitleConstraint-based, multidimensional data mining IEEE Computer 32 IssueID8 46–50

    Google Scholar 

  • Heift, T. and Nicholson, D. (2000). Enhanced server logs for intelligent, adaptive web-based systems. Technical Report. Institute for Semantic Information Processing. Universitt Osmabrk.

  • Herin, D., Sala, M. and Pompidor, P. 2002. Evaluating and Revising Courses from Web Resources Educational. In: International Conference on Intelligent Tutoring Systems. Biarritz, France, San Sebastian, Spain, pp. 208–218.

  • Hipp, J., Gntzer, U. and Nakhaeizadeh, G.: 2000. Mining Association Rules: Deriving a Superior Algorithm by Analyzing Today’s Approaches. In: European Symposium Data Mining and Knowledge Discovery. Lyon, France, pp. 13–16.

  • Jaroszewicz, S. and Simovici, D.: 2002. Pruning Redundant Association Rules Using Maximum Entropy Principle. In: Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining. Taipei, Taiwan, pp. 135–147.

  • W. Klösgen J. M. Zytkow (2002) Handbook of Data Mining and Knowledge Discovery Oxford University Press New York, NY

    Google Scholar 

  • J. R. Koza (1992) Genetic Programming: on the Programming of Computers by Means of Natural Selection MIT Press Cambridge, Massachusetts

    Google Scholar 

  • Lavrac, N., Flach, P. and Zupan, B.: 1999, Rule Evaluation Measures: A Unifying View. In: International Workshop on Inductive Logic Programming. Springer-Verlag, pp. 174–185.

  • Liu, B. and Hsu, W.: 1996. Post-Analysis of Learned Rules. In: Proceedings of National Conference on Artificial Intelligence. Portland, Oregon, USA, pp. 828–834.

  • B. Liu W. Hsu S. Chen Y. Ma (2000) ArticleTitleAnalyzing the Subjective Interestingness of Association Rules IEEE Intelligent Systems 15 IssueID5 47–55

    Google Scholar 

  • Liu, J. J. and Kwok, J. T.: 2000. An Extended Genetic Rule Induction Algorithm. In: Proceedings of the Congress on Evolutionary Computation. La Jolla, California, USA, pp. 458–463.

  • Z. Michalski (1998) Genetic Algorithms + Data Structures = Evolution Program Springer New York

    Google Scholar 

  • Minaei-Bidgoli, B. and Punch III, W. F.: 2003. Using Genetic Algorithms for Data Mining Optimization in an Educational Web-based System. In: Genetic and Evolutionary Computation Conference. Chicago, Illinois, USA, pp. 2252–2263.

  • S. Mitra S. K. Pal P. Mitra (2001) ArticleTitleData mining in soft computing framework: a survey IEEE Transaction on Neural Networks. 13 IssueID1 3–14

    Google Scholar 

  • Noda, E., Freitas, A. and Lopes, H. S.: 1999. Discovering Interesting Prediction Rules with a Genetic Algorithm. In: Proceedings of the Congress on Evolutionary Computation. Washington DC, USA, pp. 1322–1329.

  • P. Nordin W. Banzhaf R. E. Keller F. D. Francone (1998) Genetic Programming: An Introduction Morgan Kaufmann San Francisco, CA, USA

    Google Scholar 

  • Ortigosa, A. and Carro, R.M.: 2002. Asistiendo el Proceso de Mejora Continua de Cursos Adaptativos. In: III Congreso Int. de Interaccin Persona-Ordenador, Granada, pp. 246–250.

  • Padmanabhan, B. and Tuzhilin, A.: 2000. Small is Beautiful: Discovering the Minimal Set of Unexpected Patterns. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining. Boston, MA, USA, pp. 54–64.

  • Pahl, C. and Donnellan, D.: 2002. Data Mining Technology for the Evaluation of Web-based Teaching and Learning Systems. In: Proceedings of the Congress E-learning, Montreal, Canada.

  • Piatesky-Shapiro, G. and Matheus, J.: 1994. The Interestingness of Deviations. In: AAAI Workshop on Knowledge Discovery in Databases. Seattle, Washington, pp. 25–36.

  • D. Pierrakos G. Paliouras C. Papatheodorou C. D. Spyropoulos (2003) ArticleTitleWeb Usage Mining as a Tool for Personalization: A Survey User Modeling and User-Adapted Interaction 12 IssueID4 311–371

    Google Scholar 

  • Quilan, J. R.: 1987. Generating Production rules from decision trees. In. Proceedings of IJCAI, , pp. 304–307.

  • Ratle, A. and Sebag, M.: 2000. Genetic Programming and Domain Knowledge: Beyond the Limitations of Grammar Guided Machine Discovery. In. Proceedings of 6th Conference on Parallel problem solving for nature. Paris, France, pp. 211–220.

  • Romero, C., De Bra, P., Ventura, S. and De Castro, C.: 2002. Using Knowledge Level with AHA! For Discovering Interesting Relationship. In: Proceedings of the AACE ELearn’2002. Montreal, Canada, pp. 2721–2722.

  • Romero, C., Ventura, S., De Bra, P. and De Castro, C. (2002). Discovering Prediction Rules in AHA! Courses. In: 9th International Conference on User Modeling. Johnstown, PA, USA, pp. 25–34.

  • Ryu, T. W. and Eick, C. F.: 1996. Discovering Commonalities of a set of Objects Using Genetic Programming. Proceedings of Genetic Programming Conference.

  • Sarawagi, S., Thomas, S. and Agrawal, R.: 1998. Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications. In: Conference on Management of Data. Seattle, Washington, pp. 343–354.

  • E. Shortliffe B. Buchanan (1975) ArticleTitleA model of inexact reasoning in medicine Mathematical Biosciences 23 351–379

    Google Scholar 

  • P. Smythe R. M. Goodman (1992) ArticleTitleAn information theory approach to rule induction from databases IEEE Knowledge and Data Engineering 4 IssueID4 301–316

    Google Scholar 

  • Silberschatz A. (1995). On Subjective Measures of Interestingness in Knowledge. In: Proceedings of the Knowledge Discovery and Data Mining. Montreal, Canada, pp. 275–281.

  • A. Silverstein R. BrinS. Motwani (1998) ArticleTitleBeyond market baskets : generalizing association rules to dependence rules Data Mining and Knowledge Discoverty 2 29–68

    Google Scholar 

  • M. Spiliopoulou (2000) ArticleTitleWeb usage mining for web site evaluation Communicacion of the ACM 43 IssueID8 127–134

    Google Scholar 

  • N. Srinivas K. Deb (1994) ArticleTitleMultiobjective optimization using nondominated sorting in genetic algorithms Evolutionary Computation 2 IssueID3 217–249

    Google Scholar 

  • J. Srivastava R. Cooley M. Deshpnade P. Tan (2000) ArticleTitleWeb usage mining: discovery and applications of usage pattern from web data ACM SIGKDD 1 IssueID2 12–23

    Google Scholar 

  • Tan P. Kumar, V. and Srivastava, J.: 2002. Selecting the right Interestingness Measures for Association Patterns. In: Proceedings of the 8th International Conference on Knowledge Discovery and Data Mining. Edmonton, Canada, pp. 32–41.

  • Tang, T. and McCalla, G.: 2002. Student Modeling for a Web-based Learning Environment: a Data Mining Approach. In: Proceedings of the Conference on Artificial Intelligence AAAI, . Edmonton, Alberta, Canada, pp. 967–968.

  • Ventura, S., Ortiz, D. and Hervz, C.: 2002. Jclec: Una biblioteca de clases java para computacin evolutiva. In: I Congreso Espaol de Algoritmos Evolutivos y Bioinspirados, Merida, pp. 23–30.

  • Wang F. (2002). On Using Data-Mining Technology for Browsing Log File Anlisis in Asynchronous Learning Environment. In: Conference on Educational Multimedia, Hypermedia and Telecommunications. Denver, Colorado, pp. 2005–2006.

  • Whigham P. A. (1995). Gramatically-Based Genetic Programing. In: Proceedings of the Workshop on Genetic Programming. California, pp. 33–41.

  • Williams G. J.: 1999. Evolutionary Hot Spots Data Mining. An Architecture for Exploring for Interesting Discoveries. In: Conference on Knowledge Discovery and Data Mining. Beijing, China, pp. 184–193.

  • I. H. Witten E. Frank (2000) Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations Morgan Kaufmann San Francisco, CA

    Google Scholar 

  • M. L. Wong K. S. Leung (2000) Data Mining using Grammar Based Genetic Programming and Applications Kluwer Norwell, MA, USA

    Google Scholar 

  • Yao, Y. and Zhong, N.: 1999. An Analysis of Quantitative Measures Associated with Rules. In: Proceedings of PAKDD’99, pp. 479–488.

  • Yu, P., Own, C. and Lin, L.: 2001. On Learning Behavior Analysis of Web Based Interactive Environment. In: Proceedings ICCEE, Oslo/Bergen Norway.

  • Zaïane, O.R. and Luo, J.: 2001. Towards Evaluating Learners’ Behaviour in a Web-Based Distance Learning Environment. In: Proceedings of the IEEE International Conference on Advanced Learning Technologies. Madison, Wisconsin, pp. 357–360.

  • Zaïane, O.R.: 2002. Building a Recommender Agent for e-Learning Systems. In: Proceedings of the International Conference on Computers in Education, Auckland, New Zealand, pp. 55–59.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cristóbal Romero.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Romero, C., Ventura, S. & Bra, P.D. Knowledge Discovery with Genetic Programming for Providing Feedback to Courseware Authors. User Model User-Adap Inter 14, 425–464 (2004). https://doi.org/10.1007/s11257-004-7961-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11257-004-7961-2

Keywords

Navigation