CS579 Machine Learning: Assignment #3.

Due: at the beginning of the lecture on Tuesday, March 8.

Problem 1 In this problem, you will use the WEKA system to analyze two data sets. You will apply three learning algorithms to each data set and compare their performance.

Problem 2 Exercise 4 from section 9.8 of the book (p.195)

Problem 3 To reduce the cost of nearest neighbor algorithm a data structure called kd-tree may be used. A kd-tree is similar to a decision tree, except that we split using the median value along the dimension having the highest variance, and points are stored in every internal node (the leaves are empty). The following figure shows an example of kd-tree and how it splits the x,y plane.

Propose an algorithm for finding the nearest neighbor of a point given a kd-tree.