ML: Lecture 9

It is important to test an algorithm on data that was not used to train it.
Cross-validation: divide the data into n disjoint subsets (folds), train on n-1 of them and test on the remaining one, rotating so that each fold is used for testing exactly once. It is generally better to train on more data. For 10 folds, train on 9/10 of the data and test on the other 1/10. Repeat this 10 times (once per fold), and then repeat the whole procedure 10 times with different folds, for 100 runs in total.
General rules: stratification (keeping the class proportions roughly the same in every fold) generally improves the results, and 10 folds is often a good choice.
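A minimal sketch of how the folds can be built, assuming instances are referred to by index. The name `k_fold_indices` and the `seed` parameter are made up for illustration, and this version is not stratified.

```python
import random

def k_fold_indices(n_instances, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for k-fold cross-validation."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index, so fold sizes differ by at most 1.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        # Train on all folds except the one held out for testing.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx

# 100 instances, 10 folds: each split trains on 90 and tests on 10.
splits = list(k_fold_indices(100, k=10))
```

Calling it again with a different `seed` gives different folds, which is one way to do the "repeat 10 times" step.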
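Bootstrap is resampling with replacement: given n instances, draw a random sample of n instances with replacement and train on it. The sample contains about 63% of the distinct instances (some of them more than once); use the other 37%, which were never drawn, for testing.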
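The 63% figure comes from the chance that a given instance is drawn at least once in n draws: 1 - (1 - 1/n)^n, which approaches 1 - 1/e ≈ 0.632 for large n. A small sketch to check this empirically (the function name `bootstrap_split` and the `seed` parameter are made up for illustration):

```python
import random

def bootstrap_split(n_instances, seed=0):
    """Draw n indices with replacement; indices never drawn form the test set."""
    rng = random.Random(seed)
    train_idx = [rng.randrange(n_instances) for _ in range(n_instances)]
    chosen = set(train_idx)
    test_idx = [i for i in range(n_instances) if i not in chosen]
    return train_idx, test_idx

n = 10_000
train_idx, test_idx = bootstrap_split(n)
# Fraction of distinct instances that ended up in the training sample.
unique_fraction = len(set(train_idx)) / n
print(f"distinct instances in training sample: {unique_fraction:.3f}")
```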
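Leave-one-out is cross-validation with n folds: each instance is used as the test set once, and the other n-1 instances are used for training.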
Sometimes there are three data sets: training, validation (for tuning parameters), and testing.

How would you test if two different algorithms produce significantly different results?
What is a paired t-test and what is the difference between that and a standard t-test?
```
         d_avg
  t =  -----------
        √(σ² / n)
```
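A rough sketch of the statistic above for two algorithms evaluated on the same n test folds. It assumes d_avg is the mean of the per-fold differences and σ² is their sample variance (dividing by n-1); the function name and the accuracy values are hypothetical. The pairing is what separates this from a standard two-sample t-test: the test is run on the differences, not on two independent groups of scores.

```python
import math
import statistics

def paired_t_statistic(scores_a, scores_b):
    """t = d_avg / sqrt(var_d / n), computed on the per-fold differences."""
    if len(scores_a) != len(scores_b):
        raise ValueError("paired test needs one score per fold for each algorithm")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    d_avg = statistics.mean(diffs)
    var_d = statistics.variance(diffs)  # sample variance (assumption: n-1 denominator)
    # Note: fails with ZeroDivisionError if all differences are identical.
    return d_avg / math.sqrt(var_d / n)

# Hypothetical accuracies of two algorithms on the same 5 folds.
acc_a = [0.81, 0.79, 0.84, 0.80, 0.83]
acc_b = [0.78, 0.77, 0.80, 0.79, 0.80]
t = paired_t_statistic(acc_a, acc_b)
print(f"t = {t:.3f} with {len(acc_a) - 1} degrees of freedom")
```

The resulting t is compared against a t-distribution with n-1 degrees of freedom to decide whether the difference is significant.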

What does it mean to solve the equation above?

    f(x, y) = (1, 5)
    (x, y) = f^-1(1, 5)