School of Computing

Module details

CO832 Data Mining and Knowledge Discovery (15 credits)

Syllabus

  • Introduction to data mining:
    • The motivation for data mining.
    • The nature of induction, in the context of data mining.
    • Basic concepts of data mining and knowledge discovery.
  • Discovery of association rules:
    • Main concepts and objective of this task.
    • A standard algorithm for discovering association rules.
  • Classification and related prediction tasks:
    • Main concepts and objective of this task.
    • Classification algorithms of several paradigms, such as decision tree, rule induction, instance-based learning, and naïve Bayes.
    • Discussion on the strengths and weaknesses of these kinds of algorithm.
    • An overview of other data mining tasks involving prediction.
  • Clustering:
    • Main concepts and objective of this task.
    • Classical algorithms for clustering.
  • Overview of the knowledge discovery process:
    • An overview of the iterative process of knowledge discovery, including not only the data mining step, but also pre-processing and post-processing steps.
    • General discussion of data quality issues in the context of data mining.
  • The pre-processing step of the knowledge discovery process:
    • Attribute selection, discretization and attribute construction as data pre-processing tasks for data mining.
    • Discussion of some algorithms for performing these pre-processing tasks.
  • Post-processing of discovered knowledge and Rule Interestingness:
    • Methods for refining discovered knowledge, for instance selecting the most interesting rules, out of the entire set of discovered rules.
    • Review of some rule interestingness measures.
  • Text mining and web mining:
    • Main concepts and objective of text mining and web mining, as extensions of data mining to the more difficult problem of mining texts and web pages.

Note

This web page provides advance information about a module due to run in the coming academic year. We believe the details are accurate at the time of writing but they may be subject to change.

School of Computing, University of Kent, Canterbury, Kent, CT2 7NF

Enquiries: +44 (0)1227 824180 or contact us.

Last Updated: 08/04/2011 15:43