Thoughts on “A few useful things to know about machine learning”

Some thoughts on a good paper that gives intuition about machine learning approaches

http://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf
http://dl.acm.org/citation.cfm?id=2347755

In particular, the paper gives good intuition about:

– overfitting (e.g. how it relates to multiple testing and to the bias vs. variance trade-off; see the first sketch after this list)
– the curse of dimensionality (in high dimensions all neighbors look roughly the same; see the second sketch below)
– the limited practical usefulness of theoretical guarantees
– how very different decision frontiers can yield the same predictions
– ensembles (which greatly reduce variance without increasing bias much; see the third sketch below)
– ensembles vs. Bayesian model averaging (which, in effect, just selects the single best model)
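
As a small aside on the overfitting point, here is a minimal sketch (my own illustration, not from the paper, assuming NumPy is available): with pure-noise labels and enough candidate features, the single "best" feature on the training set looks predictive purely by chance, which is the multiple-testing view of overfitting.

```python
import numpy as np

rng = np.random.default_rng(0)
n_train, n_test, n_features = 100, 10_000, 1_000

# Labels and features are all independent coin flips -- there is nothing to learn.
y_train = rng.integers(0, 2, n_train)
y_test = rng.integers(0, 2, n_test)
X_train = rng.integers(0, 2, (n_train, n_features))
X_test = rng.integers(0, 2, (n_test, n_features))

# "Test" every feature: how well does it match the training labels?
train_acc = (X_train == y_train[:, None]).mean(axis=0)
best = np.argmax(train_acc)

print(f"best feature, training accuracy: {train_acc[best]:.2f}")  # well above 0.5
print(f"same feature, test accuracy:     {(X_test[:, best] == y_test).mean():.2f}")  # ~0.5
```

The gap between the two numbers is pure variance: the apparent skill comes entirely from searching over many candidate features on a finite sample.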
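
On the curse of dimensionality, a quick simulation (again my own sketch, assuming NumPy) shows why "all neighbors look the same" in high dimensions: the ratio of the nearest to the farthest distance from a query point drifts toward 1 as the dimension grows.

```python
import numpy as np

rng = np.random.default_rng(0)
n_points = 1_000

for d in (2, 10, 100, 1_000):
    X = rng.random((n_points, d))  # points uniform in the d-dimensional unit cube
    q = rng.random(d)              # a query point
    dists = np.linalg.norm(X - q, axis=1)
    print(f"d={d:>4}: nearest/farthest distance ratio = {dists.min() / dists.max():.2f}")
```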
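
For the ensembles point, a sketch of bagging (my own illustration, assuming NumPy and scikit-learn): refitting a single deep tree on fresh noisy samples gives wildly varying predictions, while averaging many bootstrapped trees shrinks that variance a lot and leaves the already-low bias roughly where it was.

```python
import numpy as np
from sklearn.ensemble import BaggingRegressor
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
x_grid = np.linspace(0, 1, 200).reshape(-1, 1)
f = lambda x: np.sin(4 * np.pi * x).ravel()  # true regression function

def predictions(model, n_repeats=50):
    """Refit `model` on fresh noisy training sets and collect its predictions."""
    preds = []
    for _ in range(n_repeats):
        X = rng.random((50, 1))
        y = f(X) + rng.normal(0, 0.3, 50)
        preds.append(model.fit(X, y).predict(x_grid))
    return np.array(preds)

single = predictions(DecisionTreeRegressor())
bagged = predictions(BaggingRegressor(DecisionTreeRegressor(), n_estimators=50))

# Variance: spread of predictions across training sets.
# Bias^2: gap between the average prediction and the true function.
for name, p in (("single tree", single), ("bagged trees", bagged)):
    print(f"{name:>12}: variance={p.var(axis=0).mean():.3f}  "
          f"bias^2={((p.mean(axis=0) - f(x_grid)) ** 2).mean():.3f}")
```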
