AI EXPRESS - Hot Deal 4 VCs instabooks.co
  • AI
    AI think tank calls GPT-4 a risk to public safety

    AI think tank calls GPT-4 a risk to public safety

    Skillprint launches science-backed platform to match players with the right skill-based games

    Skillprint launches science-backed platform to match players with the right skill-based games

    Got It AI’s ELMAR challenges GPT-4 and LLaMa, scores well on hallucination benchmarks

    Got It AI’s ELMAR challenges GPT-4 and LLaMa, scores well on hallucination benchmarks

    Don't be fooled by AI washing: 3 questions to ask before you invest

    5 ways machine learning must evolve in a difficult 2023

    OpenAI's GPT-4 violates FTC rules, argues AI policy group

    OpenAI’s GPT-4 violates FTC rules, argues AI policy group

    Google advances AlloyDB, BigQuery at Data Cloud and AI Summit

    Google advances AlloyDB, BigQuery at Data Cloud and AI Summit

  • ML
    Recommend top trending items to your users using the new Amazon Personalize recipe

    Recommend top trending items to your users using the new Amazon Personalize recipe

    Snapper provides machine learning-assisted labeling for pixel-perfect image object detection

    Snapper provides machine learning-assisted labeling for pixel-perfect image object detection

    Achieve effective business outcomes with no-code machine learning using Amazon SageMaker Canvas

    Achieve effective business outcomes with no-code machine learning using Amazon SageMaker Canvas

    HAYAT HOLDING uses Amazon SageMaker to increase product quality and optimize manufacturing output, saving $300,000 annually

    HAYAT HOLDING uses Amazon SageMaker to increase product quality and optimize manufacturing output, saving $300,000 annually

    Enable predictive maintenance for line of business users with Amazon Lookout for Equipment

    Enable predictive maintenance for line of business users with Amazon Lookout for Equipment

    Build custom code libraries for your Amazon SageMaker Data Wrangler Flows using AWS Code Commit

    Build custom code libraries for your Amazon SageMaker Data Wrangler Flows using AWS Code Commit

    Access Snowflake data using OAuth-based authentication in Amazon SageMaker Data Wrangler

    Access Snowflake data using OAuth-based authentication in Amazon SageMaker Data Wrangler

    Enable fully homomorphic encryption with Amazon SageMaker endpoints for secure, real-time inferencing

    Enable fully homomorphic encryption with Amazon SageMaker endpoints for secure, real-time inferencing

    Will ChatGPT help retire me as Software Engineer anytime soon? – The Official Blog of BigML.com

    Will ChatGPT help retire me as Software Engineer anytime soon? –

  • NLP
    ChatGPT, Large Language Models and NLP – a clinical perspective

    ChatGPT, Large Language Models and NLP – a clinical perspective

    What could ChatGPT mean for Medical Affairs?

    What could ChatGPT mean for Medical Affairs?

    Want to Improve Clinical Care? Embrace Precision Medicine Through Deep Phenotyping

    Want to Improve Clinical Care? Embrace Precision Medicine Through Deep Phenotyping

    Presight AI and G42 Healthcare sign an MOU

    Presight AI and G42 Healthcare sign an MOU

    Meet Sketch: An AI code Writing Assistant For Pandas

    Meet Sketch: An AI code Writing Assistant For Pandas

    Exploring The Dark Side Of OpenAI's GPT Chatbot

    Exploring The Dark Side Of OpenAI’s GPT Chatbot

    OpenAI launches tool to catch AI-generated text

    OpenAI launches tool to catch AI-generated text

    Year end report, 1 May 2021- 30 April 2022.

    U.S. Consumer Spending Starts to Sputter; Labor Report to Give Fed Look at Whether Rate Increases Are Cooling Rapid Wage Growth

    Meet ETCIO SEA Transformative CIOs 2022 Winner Edmund Situmorang, CIOSEA News, ETCIO SEA

    Meet ETCIO SEA Transformative CIOs 2022 Winner Edmund Situmorang, CIOSEA News, ETCIO SEA

  • Vision
    Data2Vec: Self-supervised general framework

    Data2Vec: Self-supervised general framework

    NVIDIA Metropolis Ecosystem Grows With Advanced Development Tools to Accelerate Vision AI

    NVIDIA Metropolis Ecosystem Grows With Advanced Development Tools to Accelerate Vision AI

    Low Code and No Code Platforms for AI and Computer Vision

    Low Code and No Code Platforms for AI and Computer Vision

    Computer Vision Model Performance Evaluation (Guide 2023)

    Computer Vision Model Performance Evaluation (Guide 2023)

    PepsiCo Leads in AI-Powered Automation With KoiVision Platform

    PepsiCo Leads in AI-Powered Automation With KoiVision Platform

    USB3 & GigE Frame Grabbers for Machine Vision

    USB3 & GigE Frame Grabbers for Machine Vision

    Active Learning in Computer Vision - Complete 2023 Guide

    Active Learning in Computer Vision – Complete 2023 Guide

    Ensembling Neural Network Models With Tensorflow

    Ensembling Neural Network Models With Tensorflow

    Autoencoder in Computer Vision - Complete 2023 Guide

    Autoencoder in Computer Vision – Complete 2023 Guide

  • Robotics
    Keys to using ROS 2 & other frameworks for medical robots

    Keys to using ROS 2 & other frameworks for medical robots

    Watch Bill Gates take a ride in a Wayve AV

    Watch Bill Gates take a ride in a Wayve AV

    Researchers taught a quadruped to use its legs for manipulation

    Researchers taught a quadruped to use its legs for manipulation

    Times Microwave Systems launches coaxial cable for robotics

    Times Microwave Systems launches coaxial cable for robotics

    neubility robot on the sidewalk.

    Sidewalk delivery robot company Neubility secures $2.42M investment

    Gecko Robotics expands work with U.S. Navy

    Gecko Robotics expands work with U.S. Navy

    German robotics industry to grow 9% in 2023

    German robotics industry to grow 9% in 2023

    head shot of larry sweet.

    ARM Institute hires Larry Sweet as Director of Engineering

    Destaco launches end-of-arm tooling line for cobots

    Destaco launches end-of-arm tooling line for cobots

  • RPA
    What is IT Process Automation? Use Cases, Benefits, and Challenges in 2023

    What is IT Process Automation? Use Cases, Benefits, and Challenges in 2023

    Benefits of Automated Claims Processing in Insurance Industry

    Benefits of Automated Claims Processing in Insurance Industry

    ChatGPT and RPA Join Force to Create a New Tech-Revolution

    ChatGPT and RPA Join Force to Create a New Tech-Revolution

    How does RPA in Accounts Payable Enhance Data Accuracy?

    How does RPA in Accounts Payable Enhance Data Accuracy?

    10 Best Use Cases to Automate using RPA in 2023

    10 Best Use Cases to Automate using RPA in 2023

    How will RPA Improve the Employee Onboarding Process?

    How will RPA Improve the Employee Onboarding Process?

    Key 2023 Banking Automation Trends / Blogs / Perficient

    Key 2023 Banking Automation Trends / Blogs / Perficient

    AI-Driven Omnichannel is the Future of Insurance Industry

    AI-Driven Omnichannel is the Future of Insurance Industry

    Avoid Patient Queues with Automated Query Resolution

    Avoid Patient Queues with Automated Query Resolution

  • Gaming
    God of War Ragnarok had a banner debut week at UK retail

    God of War Ragnarok had a banner debut week at UK retail

    A Little To The Left Review (Switch eShop)

    A Little To The Left Review (Switch eShop)

    Horizon Call of the Mountain will release alongside PlayStation VR2 in February

    Horizon Call of the Mountain will release alongside PlayStation VR2 in February

    Sonic Frontiers has Dreamcast-era jank and pop-in galore - but I can't stop playing it

    Sonic Frontiers has Dreamcast-era jank and pop-in galore – but I can’t stop playing it

    Incredible November Xbox Game Pass addition makes all other games obsolete

    Incredible November Xbox Game Pass addition makes all other games obsolete

    Free Monster Hunter DLC For Sonic Frontiers Now Available On Switch

    Free Monster Hunter DLC For Sonic Frontiers Now Available On Switch

    Somerville review: the most beautiful game I’ve ever played

    Somerville review: the most beautiful game I’ve ever played

    Microsoft Flight Sim boss confirms more crossover content like Halo's Pelican and Top Gun Maverick

    Microsoft Flight Sim boss confirms more crossover content like Halo’s Pelican and Top Gun Maverick

    The Game Awards nominations are in, with God of War Ragnarok up for 10 of them

    The Game Awards nominations are in, with God of War Ragnarok up for 10 of them

  • Investment
    Travelport

    Travelport Receives $200M Investment

    Pulse Industrial

    Pulse Industrial Raises New Funding Round

    Horizon Quantum Computing

    Horizon Quantum Computing Raises USD 18.1M in Series A Funding

    PxE Holographic Imaging Raises $5.4M in Seed Funding

    PxE Holographic Imaging Raises $5.4M in Seed Funding

    Ledger

    Ledger Closes €100M Series C Extension Round

    personal finance

    3 Reliable Ways to Generate Some Income for Investment

    trading

    Index Futures Trading Receives First Ever Crypto Market Deployment on Bitget Exchange

    BioCorteX

    BioCorteX Raises $5M in Seed Funding

    Hirebotics Receives Investment From Sverica Capital Management

    Hirebotics Receives Investment From Sverica Capital Management

  • More
    • Data analytics
    • Apps
    • No Code
    • Cloud
    • Quantum Computing
    • Security
    • AR & VR
    • Esports
    • IOT
    • Smart Home
    • Smart City
    • Crypto Currency
    • Blockchain
    • Reviews
    • Video
No Result
View All Result
AI EXPRESS - Hot Deal 4 VCs instabooks.co
No Result
View All Result
Home Machine Learning

Predict football punt and kickoff return yards with fat-tailed distribution using GluonTS

by
February 2, 2023
in Machine Learning
0
Predict football punt and kickoff return yards with fat-tailed distribution using GluonTS
0
SHARES
3
VIEWS
Share on FacebookShare on Twitter

At present, the NFL is continuous their journey to extend the variety of statistics offered by the Next Gen Stats Platform to all 32 groups and followers alike. With superior analytics derived from machine studying (ML), the NFL is creating new methods to quantify soccer, and to supply followers with the instruments wanted to extend their information of the video games inside the sport of soccer. For the 2022 season, the NFL aimed to leverage player-tracking information  and new superior analytics methods to better understand special teams.

The aim of the venture was to foretell what number of yards a returner would acquire on a punt or kickoff play. One of many challenges when constructing predictive fashions for punt and kickoff returns is the supply of very uncommon occasions — akin to touchdowns — which have important significance within the dynamics of a sport. A knowledge distribution with fats tails is frequent in real-world purposes, the place uncommon occasions have important affect on the general efficiency of the fashions. Utilizing a strong methodology to precisely mannequin distribution over excessive occasions is essential for higher total efficiency.

On this submit, we reveal how one can use Spliced Binned-Pareto distribution applied in GluonTS to robustly mannequin such fat-tailed distributions.

We first describe the dataset used. Subsequent, we current the information preprocessing and different transformation strategies utilized to the dataset. We then clarify the main points of the ML methodology and mannequin coaching procedures. Lastly, we current the mannequin efficiency outcomes.

Dataset

On this submit, we used two datasets to construct separate fashions for punt and kickoff returns. The participant monitoring information accommodates the participant’s place, route, acceleration, and extra (in x,y coordinates). There are round 3,000 and 4,000 performs from 4 NFL seasons (2018–2021) for punt and kickoff performs, respectively. As well as, there are only a few punt and kickoff-related touchdowns within the datasets—solely 0.23% and 0.8%, respectively. The info distribution for punt and kickoff are totally different. For instance, the true yardage distribution for kickoff and punts are comparable however shifted, as proven within the following determine.

Information preprocessing and have engineering

First, the monitoring information was filtered for simply the information associated to punts and kickoff returns. The participant information was used to derive options for mannequin growth:

  • X – Participant place alongside the lengthy axis of the sphere
  • Y – Participant place alongside the quick axis of the sphere
  • S – Velocity in yards/second; changed by Dis*10 to make it extra correct (Dis is the gap up to now 0.1 seconds)
  • Dir – Angle of participant movement (levels)

From the previous information, every play was remodeled into 10X11X14 of knowledge with 10 offensive gamers (excluding the ball service), 11 defenders, and 14 derived options:

  • sX – x velocity of a participant
  • sY – y velocity of a participant
  • s – Velocity of a participant
  • aX – x acceleration of a participant
  • aY – y acceleration of a participant
  • relX – x distance of participant relative to ball service
  • relY – y distance of participant relative to ball service
  • relSx – x velocity of participant relative to ball service
  • relSy – y velocity of participant relative to ball service
  • relDist – Euclidean distance of participant relative to ball service
  • oppX – x distance of offense participant relative to protection participant
  • oppY – y distance of offense participant relative to protection participant
  • oppSx –x velocity of offense participant relative to protection participant
  • oppSy – y velocity of offense participant relative to protection participant

To enhance the information and account for the precise and left positions, the X and Y place values have been additionally mirrored to account for the precise and left area positions. The info preprocessing and have engineering was tailored from the winner of the NFL Big Data Bowl competitors on Kaggle.

ML methodology and mannequin coaching

As a result of we’re excited about all doable outcomes from the play, together with the likelihood of a landing, we are able to’t merely predict the common yards gained as a regression downside. We have to predict the complete likelihood distribution of all doable yard features, so we framed the issue as a probabilistic prediction.

One technique to implement probabilistic predictions is to assign the yards gained to a number of bins (akin to lower than 0, from 0–1, from 1–2, …, from 14–15, greater than 15) and predict the bin as a classification downside. The draw back of this method is that we would like small bins to have a excessive definition image of the distribution, however small bins imply fewer information factors per bin and our distribution, particularly the tails, could also be poorly estimated and irregular.

See also  What do the Short-Term Technicals Predict for Scala (XLA) Sunday?

One other technique to implement probabilistic predictions is to mannequin the output as a steady likelihood distribution with a restricted variety of parameters (for instance, a Gaussian or Gamma distribution) and predict the parameters. This method provides a really excessive definition and common image of the distribution, however is just too inflexible to suit the true distribution of yards gained, which is multi-modal and heavy tailed.

To get the very best of each strategies, we use Spliced Binned-Pareto distribution (SBP), which has bins for the middle of the distribution the place a whole lot of information is out there, and Generalized Pareto distribution (GPD) at each ends, the place uncommon however necessary occasions can occur, like a landing. The GPD has two parameters: one for scale and one for tail heaviness, as seen within the following graph (supply: Wikipedia).

By splicing the GPD with the binned distribution (see the next left graph) on either side, we get hold of the next SBP on the precise. The decrease and higher thresholds the place splicing is completed are hyperparameters.

Binned and SPB distributions

As a baseline, we used the mannequin that received our NFL Big Data Bowl competitors on Kaggle. This mannequin makes use of CNN layers to extract options from the ready information, and predicts the end result as a “1 yard per bin” classification downside. For our mannequin, we stored the characteristic extraction layers from the baseline and solely modified the final layer to output SBP parameters as an alternative of chances for every bin, as proven within the following determine (picture edited from the submit 1st place solution The Zoo).

Model Architecture

We used the SBP distribution offered by GluonTS. GluonTS is a Python package deal for probabilistic time collection modeling, however the SBP distribution just isn’t particular to time collection, and we have been in a position to repurpose it for regression. For extra info on how one can use GluonTS SBP, see the next demo notebook.

Fashions have been skilled and cross-validated on the 2018, 2019, and 2020 seasons and examined on the 2021 season. To keep away from leakage throughout cross-validation, we grouped all performs from the identical sport into the identical fold.

For analysis, we stored the metric used within the Kaggle competitors, the continuous ranked probability score (CRPS), which might be seen as an alternative choice to the log-likelihood that’s extra strong to outliers. We additionally used the Pearson correlation coefficient and the RMSE as normal and interpretable accuracy metrics. Moreover, we seemed on the likelihood of a landing and likelihood plots to guage calibration.

The mannequin was skilled on the CRPS loss utilizing Stochastic Weight Averaging and early stopping.

To cope with the irregularity of the binned a part of the output distributions, we used two methods:

  • A smoothness penalty proportional to the squared distinction between two consecutive bins
  • Ensembling fashions skilled throughout cross-validation

Mannequin efficiency outcomes

For every dataset, we carried out a grid search over the next choices:

  • Probabilistic fashions
    • Baseline was one likelihood per yard
    • SBP was one likelihood per yard within the middle, generalized SBP within the tails
  • Distribution smoothing
    • No smoothing (smoothness penalty = 0)
    • Smoothness penalty = 5
    • Smoothness penalty = 10
  • Coaching and inference process
    • 10 folds cross-validation and ensemble inference (k10)
    • Coaching on prepare and validation information for 10 epochs or 20 epochs

Then we seemed on the metrics for the highest 5 fashions sorted by CRPS (decrease is best).

For kickoff information, the SBP mannequin barely over-performs by way of CRPS however extra importantly it estimates the landing likelihood higher (true likelihood is 0.80% within the check set). We see that the very best fashions use 10 folds ensembling (k10) and no smoothness penalty, as proven within the following desk.

Coaching Mannequin Smoothness CRPS RMSE CORR % P(landing)%
k10 SBP 4.071 9.641 47.15 0.78
k10 Baseline 4.074 9.62 47.585 0.306
k10 Baseline 5 4.075 9.626 47.43 0.274
k10 SBP 5 4.079 9.656 46.977 0.682
k10 Baseline 10 4.08 9.621 47.519 0.265

The next plot of the noticed frequencies and predicted chances signifies a great calibration of our greatest mannequin, with an RMSE of 0.27 between the 2 distributions. Be aware the occurrences of excessive yardage (for instance, 100) that happen within the tail of the true (blue) empirical distribution, whose chances are extra capturable by the SBP than the baseline methodology.

See also  Deepspatial Announces Revenue-generating Contract with Leading FMCG (fast-moving consumer goods) Company in South Asia to Optimize its Distribution Network

Kickoff observed frequencies and predicted probability distribution

For punt information, the baseline outperforms the SBP, maybe as a result of the tails of maximum yardage have fewer realizations. Subsequently, it’s a greater trade-off to seize the modality between 0–10 yards peaks; and opposite to kickoff information, the very best mannequin makes use of a smoothness penalty. The next desk summarizes our findings.

Coaching Mannequin Smoothness CRPS RMSE CORR % P(landing)%
k10 Baseline 5 3.961 8.313 35.227 0.547
k10 Baseline 3.972 8.346 34.227 0.579
k10 Baseline 10 3.978 8.351 34.079 0.555
k10 SBP 5 3.981 8.342 34.971 0.723
k10 SBP 3.991 8.378 33.437 0.677

The next plot of noticed frequencies (in blue) and predicted chances for the 2 finest punt fashions signifies that the non-smoothed mannequin (in orange) is barely higher calibrated than the smoothed mannequin (in inexperienced) and could also be a better option total.

Punt true and predicted probabilities

Conclusion

On this submit, we confirmed how one can construct predictive fashions with fat-tailed information distribution. We used Spliced Binned-Pareto distribution, applied in GluonTS, which might robustly mannequin such fat-tailed distributions. We used this system to construct fashions for punt and kickoff returns. We will apply this resolution to comparable use instances the place there are only a few occasions within the information, however these occasions have important affect on the general efficiency of the fashions.

If you want assist with accelerating the usage of ML in your services and products, please contact the Amazon ML Options Lab program.


Concerning the Authors

Tesfagabir Meharizghi is a Information Scientist on the Amazon ML Options Lab the place he helps AWS clients throughout varied industries akin to healthcare and life sciences, manufacturing, automotive, and sports activities and media, speed up their use of machine studying and AWS cloud companies to resolve their enterprise challenges.

Marc van Oudheusden is a Senior Information Scientist with the Amazon ML Options Lab staff at Amazon Internet Providers. He works with AWS clients to resolve enterprise issues with synthetic intelligence and machine studying. Exterior of labor it’s possible you’ll discover him on the seaside, taking part in along with his kids, browsing or kitesurfing.

Panpan Xu is a Senior Utilized Scientist and Supervisor with the Amazon ML Options Lab at AWS. She is engaged on analysis and growth of Machine Studying algorithms for high-impact buyer purposes in a wide range of industrial verticals to speed up their AI and cloud adoption. Her analysis curiosity contains mannequin interpretability, causal evaluation, human-in-the-loop AI and interactive information visualization.

Kyeong Hoon (Jonathan) Jung is a senior software program engineer on the Nationwide Soccer League. He has been with the Subsequent Gen Stats staff for the final seven years serving to to construct out the platform from streaming the uncooked information, constructing out microservices to course of the information, to constructing API’s that exposes the processed information. He has collaborated with the Amazon Machine Studying Options Lab in offering clear information for them to work with in addition to offering area information concerning the information itself. Exterior of labor, he enjoys biking in Los Angeles and climbing within the Sierras.

Michael Chi is a Senior Director of Know-how overseeing Subsequent Gen Stats and Information Engineering on the Nationwide Soccer League. He has a level in Arithmetic and Pc Science from the College of Illinois at Urbana Champaign. Michael first joined the NFL in 2007 and has primarily targeted on know-how and platforms for soccer statistics. In his spare time, he enjoys spending time along with his household outside.

  Mike Band is a Senior Supervisor of Analysis and Analytics for Subsequent Gen Stats on the Nationwide Soccer League. Since becoming a member of the staff in 2018, he has been chargeable for ideation, growth, and communication of key stats and insights derived from player-tracking information for followers, NFL broadcast companions, and the 32 golf equipment alike. Mike brings a wealth of information and expertise to the staff with a grasp’s diploma in analytics from the College of Chicago, a bachelor’s diploma in sport administration from the College of Florida, and expertise in each the scouting division of the Minnesota Vikings and the recruiting division of Florida Gator Soccer.

Source link

Tags: distributionfattailedfootballGluonTSKickoffPredictpuntReturnyards
Previous Post

Novel Raises £5M in Growth Funding 

Next Post

Women with prepregnancy migraine had a greater risk of complications

Next Post
Image showing a woman with migraine

Women with prepregnancy migraine had a greater risk of complications

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Newsletter

Popular Stories

  • Wordle on New York Times

    Today’s Wordle marks the start of a new era for the game – here’s why

    0 shares
    Share 0 Tweet 0
  • iOS 16.4 is rolling out now – here are 7 ways it’ll boost your iPhone

    0 shares
    Share 0 Tweet 0
  • Increasing your daily magnesium intake prevents dementia

    0 shares
    Share 0 Tweet 0
  • Beginner’s Guide for Streaming TV

    0 shares
    Share 0 Tweet 0
  • Twitter’s blue-check doomsday date is set and it’s no April Fool’s joke

    0 shares
    Share 0 Tweet 0

ML Jobs

View 115 ML Jobs at Tesla

View 165 ML Jobs at Nvidia

View 105 ML Jobs at Google

View 135 ML Jobs at Amamzon

View 131 ML Jobs at IBM

View 95 ML Jobs at Microsoft

View 205 ML Jobs at Meta

View 192 ML Jobs at Intel

Accounting and Finance Hub

Raised Seed, Series A, B, C Funding Round

Get a Free Insurance Quote

Try Our Accounting Service

AI EXPRESS – Hot Deal 4 VCs instabooks.co

AI EXPRESS is a news site that covers the latest developments in Artificial Intelligence, Data Analytics, ML & DL, Algorithms, RPA, NLP, Robotics, Smart Homes & Cities, Cloud & Quantum Computing, AR & VR and Blockchains

Categories

  • AI
  • Ai videos
  • Apps
  • AR & VR
  • Blockchain
  • Cloud
  • Computer Vision
  • Crypto Currency
  • Data analytics
  • Esports
  • Gaming
  • Gaming Videos
  • Investment
  • IOT
  • Iot Videos
  • Low Code No Code
  • Machine Learning
  • NLP
  • Quantum Computing
  • Robotics
  • Robotics Videos
  • RPA
  • Security
  • Smart City
  • Smart Home

Quick Links

  • Reviews
  • Deals
  • Best
  • AI Jobs
  • AI Events
  • AI Directory
  • Industries

© 2021 Aiexpress.io - All rights reserved.

  • Contact
  • Privacy Policy
  • Terms & Conditions

No Result
View All Result
  • AI
  • ML
  • NLP
  • Vision
  • Robotics
  • RPA
  • Gaming
  • Investment
  • More
    • Data analytics
    • Apps
    • No Code
    • Cloud
    • Quantum Computing
    • Security
    • AR & VR
    • Esports
    • IOT
    • Smart Home
    • Smart City
    • Crypto Currency
    • Blockchain
    • Reviews
    • Video

© 2021 Aiexpress.io - All rights reserved.