One or more upper-level undergraduate courses in algorithms and data structures, and a basic course on probability and statistics. This is a first course on data mining, and no prior knowledge of data mining or machine learning is assumed. Homework assignments will require programming in Java, although C++ can sometimes be substituted.
There is no single textbook for the course. For each topic, we will list the relevant chapters from various books and papers, and we will make sure that you can access those chapters. In addition, students will scribe two-page summaries of the lectures given in class.
Chapters from the following textbooks could be useful:
A. K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice Hall, 1988.
These are old slides that are made available as is. Most of the instruction will take place on the board, and the material covered may differ substantially from what is in these slides. Also, remember that it is not very useful to study from slides.