皇家华人 Grad Steps Up to Help U.S. Olympic Committee Find Future Speed Skaters
February 16, 2022
- Author
- Jay Pfeifer
When the Winter Olympics kicked off in Beijing, China, Owen Bezick 鈥21 tuned in from his apartment in 皇家华人, laptop computer at the ready. An unlikely expert in Olympic long-track speed skating, Bezick studied the sport during his senior year in the hope of helping the (USOPC) enhance their talent pipeline.
Working with Tim Chartier, the Joseph R. Morton Professor of Mathematics and Computer Science, last spring, Bezick developed an analytic tool aimed at identifying promising young skaters.
The Question
The roots of this project stretch back a couple of years when Chartier and Dan Webb, lead analytics consultant for the USOPC, met at a conference at the Fields Institute, an international hub of mathematical research in Toronto. Meeting there is kind of like two Michelin-starred chefs meeting at Le Cordon Bleu鈥攁nd Webb and Chartier immediately started cooking.
鈥淭here were a handful of professors that mentioned they were working on sourcing projects for student work,鈥 Webb said. 鈥淎nd Tim was one of the ones that we moved forward with.鈥
Webb and the USOPC staff were hoping to capitalize on their strengths in data analysis to offset a deficiency in Olympic sports that have comparatively low participation in the United States.
Take speed skating, for example: The pipeline of young speed skaters in the U.S. is much smaller than in perennial medal-winner Nordic and Scandinavian countries. But that doesn鈥檛 mean the U.S. doesn鈥檛 have elite athletes. Some of the nation鈥檚 most-decorated Olympians are speed skaters. Eric Heiden won five golds in 1976 and 1980. Bonnie Blair, the most decorated female U.S. winter Olympian, won five golds and a bronze as she competed in four straight Olympics (1984-92). Shani Davis won gold in 2006 and 2010, and Erin Jackson just claimed a gold in the 500 meter event in Beijing.
But finding those promising skaters is a challenge. If the USOPC could provide a tool to help identify them earlier鈥攚hen they are still junior athletes鈥攖he speedskating organization could concentrate on developing them into medal-winners.
鈥淲e've been curious what we can learn from the junior data,鈥 Webb said. 鈥淲e wanted to see if there鈥檚 anything there that we can use to project future elite performance.鈥
And they wanted Chartier and a 皇家华人 student to help answer that question.
Chartier, who helped found , the student sports analytics group, recognized the unique opportunity. So, he asked his colleagues in the Mathematics and Computer Science Department whether they knew a student who could handle the work and impress the USOPC.
Bezick鈥檚 name came back over and over again.
Already, he鈥檇 developed a reputation as an expert in data visualization and analysis.
Bezick had worked on interactive maps and for the and developed 皇家华人鈥檚 Covid-19 dashboard that helped the community navigate the return to campus in fall 2020.
He also did some work through the program at the , where he built data dashboards for a healthcare consultant. That work went so well that Bezick co-founded a software company called GatherWare before graduation. During his senior year, he and his partner鈥檚 pitch won the top prize in the 2021 Venture Fund competition, a $25,000 equity investment. He now works out of the Hurt Hub, building the new company.
Bezick brought more than just skill to the project however. He brought passion. Despite growing up in south Florida, Bezick had spent a lot of time on the ice as a skater and hockey player. Hockey was a family hobby. In fact, his younger brother is currently playing professional minor-league hockey in Amarillo, Texas.
When Chartier reached out, Bezick jumped at the chance.
鈥淚 was interested right away,鈥 Bezick said. 鈥淚t鈥檚 not too often you get to work with a group like the USOPC.鈥
The Problem
With the 皇家华人 crew on board, the USOPC sent over about a half-million rows of data to digest.
鈥淲e gave them pretty much every international speed skating competition for the last 15 or 20 years,鈥 Webb said.
Despite the scale of the data set, it didn鈥檛 include some of the most obvious-sounding competitive information. Basic building blocks like skaters鈥 race times were omitted.
鈥淲e actually didn't provide Owen a lot of speed skating-specific data,鈥 Webb said. 鈥淲e kept it very general with the idea that if there was something interesting that we learned, we hoped we could apply it to other sports or events as well.鈥
What the USOPC gave Bezick instead was a data set built on performance ratings that the organization uses in a number of Olympic events. If Bezick鈥檚 project was successful, it could be ported over to other sports.
But first: Bezick rolled up his sleeves.
鈥淚n a class project, we would get a very clean data set that has everything you need to make a nice, pretty model,鈥 he said. 鈥淏ut in this setting, there's a lot of things that arise.鈥
The data was comprehensive but contained a host of overlaps or missing items. Some data stretched back to 2003 while the rest dated to 2008. Some senior athletes didn鈥檛 compete in junior events. Bezick had to make small adjustments by hand to get the data ready for analysis.
鈥淎 lot of people say they want to get into data analysis until they realize the majority of the time is just really gritty, dirty, data-engineering work that no one likes doing,鈥 Bezick said.
Bezick then used an algorithm to divide senior-level competitors into five groups. The top-tier athletes were truly elite: Olympic medalists or high finishers. The second tier might have qualified for the Olympics, while the third, fourth and fifth never rose to distinction. Then, Bezick matched those competitors with their junior-level results and built a model based on their trajectories; trying to find patterns of success in the junior level that corresponded to the winning-est athletes.
Once he developed that model, he could turn it loose on current junior-level athletes to see which of today鈥檚 young skaters showed the same performance as the previous Olympic champions.
Then, Bezick added a signature flourish: interactive graphs.
鈥淥wen answered the questions we provided and then he went above and beyond and took it in his own direction as well,鈥 Webb said. 鈥淗e built out an app that let us investigate the data and look for other kinds of correlations or patterns that we hadn't been able to before.
鈥淚t was a little bit outside of what we had asked for,鈥 Webb said, 鈥渂ut it was a super-creative way to answer the questions.鈥
Bezick鈥檚 research didn鈥檛 uncover a hidden generation of future American medalist speed skaters, unfortunately.
鈥淭here was a little too much uncertainty in the data to get really specific answers,鈥 Webb said.
But in research, getting any kind of answer is valuable. Bezick鈥檚 work will guide further USOPC research in new directions.
The experience also showed Bezick that data science could take him anywhere鈥揻rom founding a business to scouting future Olympians.
鈥淵ou're not confined to any sort of single field or any sort of place, because really you can take it as far as you want,鈥 he said. 鈥淭here's data for everything.鈥