A comparative analysis of data mining methods in predicting NCAA bowl outcomes

Dursun Delen, Douglas Cogdell, Nihat Kasap

Research output: Contribution to journalArticle

28 Scopus citations

Abstract

Predicting the outcome of a college football game is an interesting and challenging problem. Most previous studies have concentrated on ranking the bowl-eligible teams according to their perceived strengths, and using these rankings to predict the winner of a specific bowl game. In this study, using eight years of data and three popular data mining techniques (namely artificial neural networks, decision trees and support vector machines), we have developed both classification- and regression-type models in order to assess the predictive abilities of different methodologies (classification versus regression-based classification) and techniques. In the end, the results showed that the classification-type models predict the game outcomes better than regression-based classification models, and of the three classification techniques, decision trees produced the best results, with better than an 85% prediction accuracy on the 10-fold holdout sample. The sensitivity analysis on trained models revealed that the non-conference team winning percentage and average margin of victory are the two most important variables among the 28 that were used in this study.

Original languageEnglish
Pages (from-to)543-552
Number of pages10
JournalInternational Journal of Forecasting
Volume28
Issue number2
DOIs
StatePublished - 1 Apr 2012
Externally publishedYes

Keywords

  • Classification
  • College football
  • Knowledge discovery
  • Machine learning
  • Prediction
  • Regression

Fingerprint Dive into the research topics of 'A comparative analysis of data mining methods in predicting NCAA bowl outcomes'. Together they form a unique fingerprint.

  • Cite this