Now that I have some athlete similarity code up and running, let’s take it for a spin, shall we?
The basic idea is to pick a skier (Beckie Scott in this case) and then mine my results database for skiers who’ve had similar careers. This is a fairly complicated task with a lot of steps. You can refer to my previous post for more details on the methodology. The important things to remember at the moment, though, are:
– My measure of similarity looks at every result in overlapping age ranges
– This is an inherently noisy process; we can expect some bad matches
– This is not a 100% automated process; we should expect to have to make some judgements along the way about when two skiers can reasonably be thought of as “similar”.
– Distance and sprint racing will be treated separately.
– Athletes that are “most similar” to Beckie Scott might still be very different in an absolute sense.
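To make the points above a bit more concrete, here is a rough sketch of what comparing two careers over their overlapping ages, one discipline at a time, could look like. This is not the code behind this post: the function names, the `(age, discipline, points)` tuple layout, and the mean absolute difference are all assumptions made for illustration, and averaging per age is a simplification of looking at every individual result. The numbers are invented.

```python
from collections import defaultdict
from statistics import mean


def mean_by_age(results, discipline):
    """Average points per age for one discipline.

    `results` is a list of (age, discipline, points) tuples; this layout
    is a hypothetical stand-in for a real results database.
    """
    by_age = defaultdict(list)
    for age, disc, points in results:
        if disc == discipline:
            by_age[age].append(points)
    return {age: mean(vals) for age, vals in by_age.items()}


def career_distance(a, b, discipline):
    """Mean absolute difference in per-age averages over shared ages.

    Smaller means more similar. Returns None when the two skiers have
    no ages in common for this discipline, so nothing can be compared.
    """
    pa = mean_by_age(a, discipline)
    pb = mean_by_age(b, discipline)
    shared = set(pa) & set(pb)
    if not shared:
        return None
    return mean(abs(pa[age] - pb[age]) for age in shared)


# Invented example data, not real results.
target = [(24, "distance", 40.0), (25, "distance", 30.0), (25, "sprint", 50.0)]
other = [(25, "distance", 36.0), (26, "distance", 20.0)]

distance_score = career_distance(target, other, "distance")
sprint_score = career_distance(target, other, "sprint")
```

Only age 25 is shared here, so the distance comparison rests on a single season, which is exactly the kind of thin overlap that produces noisy matches and calls for some manual judgement. The sprint comparison has no overlap at all and comes back as `None`.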