Tillämpad artificiell intelligens på Utexpo

På den här sidan har deltagarna på Utexpo sammanfattat sina projekt. Här kan du upptäcka och läsa om spännande projekt från programmet Tillämpad artificiell intelligens.

AI vs. DDoS – A Comparative Study of Attack Detection Models in 5G Infrastructure

Participant: Lina Larsson.

This thesis explores the use of AI-based models to detect and predict Distributed Denial of Service (DDoS) attacks in 5G networks. The project compares three model categories – traditional supervised models, anomaly-based models, and time-based models – evaluating their accuracy and ability to reduce false positives and negatives. Models were trained and tested using the 5GAD dataset. Explainable AI tools like SHAP and LIME were used to interpret model decisions and improve transparency. The results show that traditional models, especially the Decision Tree classifier, performed the best. The study highlights that combining high-performing detection models with interpretability is key to building trustworthy cybersecurity systems for critical 5G infrastructure. This work contributes to the development of more secure, reliable and transparent AI systems that can proactively protect essential digital services.

An Experimental Study on ObjectTracking

Deltagare: Mahmoud Alshaikh.
Samarbetspartner: ROADVIEW.

Detta projekt undersöker hur robusta olika 3D-objektspårningsalgoritmer är under svåra väderförhållanden, med fokus på snöiga scenarier som påverkar autonoma fordon. Fyra metoder jämförs: Kalmanfilter, Utökat Kalmanfilter, Partikelfilter och ByteTrack (en djupinlärningsbaserad algoritm). Alla testas med LiDAR-data från CADC-datamängden (snötäckta vägar) och unScenes-datamängden (bra väder).

Projektet använder standardmått som HOTA, IDF1, AMOTA och AMOTP för att utvärdera spårningsnoggrannhet och stabilitet. Resultaten visar att alla algoritmer presterar sämre i snöiga förhållanden, men att ByteTrack är mest robust. Djupinlärningsmetoder visar sig vara mer tillförlitliga i ogynnsamma miljöer.

Projektets syfte är att bidra till utvecklingen av säkrare perceptionssystem för autonoma fordon i vinterklimat. Studien är en del av ROADVIEW-projektet och finansieras av EU.

Imbalance Data and Transfer Learning in Music Genre Classification

Participant: Caj Morgan Norlén.

This project investigates the performance and effects of varying degrees of data imbalance and the role of transfer learning in music genre classification (MGC), using a relatively small, imbalanced private music dataset labeled by domain experts. We refine the prominent deep learning model Audio Spectrogram Transformer (AST), under these constraints. Our results show that the refined model performs exceptionally well in the task of MGC, outperforming results of previous studies. Notably, we observe that different levels of class imbalance have minimal effect on performance, contrasting previous research based on non-transformer models which reports improvements of minority class predictions through balancing techniques. These findings suggest that refining a pre-trained AST model on small and imbalanced datasets, can still yield prominent results. Furthermore, we suggest that the AST model may be suitable as a foundational model for a broader range of music and audio related tasks, both in research and practical applications.

Improving Rare Disease Prediction with Specialized Loss Functions

Deltagare: Alexander Reevcrona.

Automatiserad medicinsk kodning syftar till att tilldela standardiserade ICD-9-koder till klinisk text, men stora klass-obalanser gör att modeller ofta missar sällsynta diagnoser. I mitt examensarbete utvärderar jag två specialiserade förlustfunktioner, Focal Loss och Asymmetric Loss, med PLM-ICD-arkitekturen . Resultaten visar att Asymmetric Loss höjer Macro-F1 med 0,023 jämfört med traditionell Binary Cross-Entropy – samtidigt som Micro-F1 hålls intakt. Arbetet visar därmed att riktad förlustfunktionsdesign är ett enkelt och effektivt sätt att minska gapet mellan vanliga och sällsynta koder.

Non-Basketball Features in the NBA: Machine Learning and SHAP Analysis of Player Contracts and Team Priorities

Participant: Emil Falk.

The NBA, a major basketball league with 30 teams in the US and Canada, generates significant revenue. This study examines how off-court factors, like Instagram followers and player loyalty, impact NBA player contracts. Using Machine Learning models, it compared predictions based on basketball-only data versus combined basketball and non-basketball data. The Explainable AI tool SHAP analyzed how high-value versus low-value NBA teams prioritize non-basketball features. Results showed that including non-basketball features improved most Machine Learning models’ predictions, but not Deep Learning models. SHAP revealed that high-value teams pay more for all features, especially for All-Star players, compared to low-value teams. While SHAP was effective here, its general reliability wasn’t proven. The study underscores the importance of non-basketball factors in NBA salaries and the value of explainable AI in salary prediction.

Premier League Predictions: Leveraging AI and Feature Engineering

Participant: Christoffer Karlsson and Caio Araujo Rossmann Leite.

Our project focuses on predicting match outcomes in the English Premier League using artificial intelligence and feature engineering. By collecting and processing historical match data, we developed a machine learning model and compared it to a baseline model. Key features include an Elo-inspired team rating system, rolling performance metrics and creating a play style features. We applied statistical methods to evaluate model accuracy and to understand what factors impact match results the most.

Time-Series Healthcare Data Imputation using Deep Learning

Deltagare: Mikael Weege.

Detta projektet undersöker hur olika modeller, särskilt djupinlärningsbaserade, kan rekonstruera saknade värden i ett sjukvårdsdataset. Tre traditionella metoder och fyra djupinlärningsmodeller testades på PhysioNet 2012 ICU-datasetet, med simulerade nivåer av saknad data på 10 %, 30 %, 50 % och 80 %.

Imputationsprestanda utvärderades med tre felmått: medelabsolutfel, medelkvadratfel och medelrelativt fel, samt även körtid. Därefter användes den bäst imputerade datan i en klassificeringsuppgift där en Random Forest-modell förutsade patientdödlighet. Klassificeringen utvärderades med träffsäkerhet, ROC AUC och F1-makrovärde.

GRIN var den modell som gav lägst imputationsfel vid alla nivåer av saknad data, och presterade bättre än både traditionella och andra djupinlärningsbaserade modeller. Trots detta gav inte GRIN bättre klassificeringsresultat än medelvärdesimputering. Slutsatsen är att hög imputationsnoggrannhet inte nödvändigtvis leder till bättre prediktiv prestanda, vilket kan bero på överanpassning eller skevheter i de imputerade värdena.

uppdaterad

2025-06-02

Kontakt

Katarina Tran

Dela

Kontakt

Utexporådet
utexpo@hh.se