Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • apa.csl
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
End-to-End Data Quality-Driven Framework for Machine Learningin Production Environment
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013).ORCID iD: 0000-0003-0683-2783
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013).ORCID iD: 0000-0001-9051-7609
(English)In: Heliyon, E-ISSN 2405-8440Article in journal (Refereed) Submitted
Abstract [en]

This paper introduces a novel end-to-end framework that efficiently inte-grates data quality assessment with machine learning (ML) model operationsin real-time production environments. While existing approaches treat dataquality assessment and ML systems as isolated processes, our framework ad-dresses the critical gap between theoretical methods and practical implemen-tation by combining dynamic drift detection, adaptive data quality metrics,and MLOps into a cohesive, lightweight system. The key innovation liesin its operational efficiency, enabling real-time, quality-driven ML decision-making with minimal computational overhead. We validate the frameworkin a steel manufacturing company’s Electroslag Remelting (ESR) vacuumpumping process, demonstrating a 12% improvement in model performance(R² = 94%) and a fourfold reduction in prediction latency. By exploringthe impact of data quality acceptability thresholds, we provide actionableinsights into balancing data quality standards and predictive performancein industrial applications. This framework represents a significant advance-ment in MLOps, offering a robust solution for time-sensitive, data-drivendecision-making in dynamic industrial environments.

National Category
Artificial Intelligence
Identifiers
URN: urn:nbn:se:kau:diva-103578OAI: oai:DiVA.org:kau-103578DiVA, id: diva2:1944998
Available from: 2025-03-17 Created: 2025-03-17 Last updated: 2025-03-17
In thesis
1. Machine Learning in Motion: Engineering Self-Adaptive Systems
Open this publication in new window or tab >>Machine Learning in Motion: Engineering Self-Adaptive Systems
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

In the rapidly evolving landscape of artificial intelligence (AI) and machine learning (ML), transitioning from theoretical models to robust, adaptive production systems remains a significant challenge in the digital age. As data-driven methodologies revolutionize problem solving across industries, a critical gap exists between research achievements and reliable real-world deployment. Key obstacles include concept drift and data quality issues that arise from unpredictable changes in data distributions and operational complexities that systematically affect model performance, reliability, and efficiency. This thesis addresses these challenges by introducing an overarching framework for adaptive ML systems that operate reliably in dynamic, real-world environments, incorporating innovative methodologies for dynamic drift detection and real-time data quality assessment bounded by robust Machine Learning Operations (MLOps) strategies. These integrated components enable the creation of production-grade ML systems that can efficiently adapt to shifts in data distributions and assess data quality in real-time, ensuring stable and reliable performance in dynamic environments. The proposed approaches are validated through real-world use cases, demonstrating significant improvements in predictive accuracy and operational efficiency. By deploying these adaptive systems in industrial contexts, the thesis highlights their potential to deliver reliable, high-performance ML solutions tailored to the demands of complex time-sensitive applications. This work offers concrete solutions for translating theoretical advances into practical applications, contributing to developing robust and scalable ML systems for real-world deployment.

Abstract [en]

In the era of AI-driven innovation, ensuring the reliability and adaptability of machine learning (ML) systems in dynamic environments remains a fundamental challenge. This thesis addresses key obstacles such as concept drift and data quality, which affect model performance in real-world applications. By integrating novel drift detection mechanisms, real-time data quality assessment, and robust MLOps strategies, this work proposes a comprehensive framework for self-adaptive ML systems. These solutions enable industrial ML deployments to maintain predictive accuracy, efficiency, and resilience against evolving data conditions. Through real-world case studies, this research demonstrates significant improvements in operational stability and decision-making, bridging the gap between theoretical advancements and practical implementation. By tackling these challenges, this thesis contributes to the development of scalable, trustworthy, and high-performance ML solutions tailored for industrial and mission-critical applications.

Place, publisher, year, edition, pages
Karlstad: Karlstads universitet, 2025. p. 30
Series
Karlstad University Studies, ISSN 1403-8099 ; 2025:14
Keywords
machine learning, concept drift, data quality, adaptive learning, MLOps, data-driven development, performance robustness
National Category
Artificial Intelligence Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:kau:diva-103579 (URN)10.59217/rjrp8643 (DOI)978-91-7867-560-9 (ISBN)978-91-7867-561-6 (ISBN)
Public defence
2025-04-25, Eva Eriksson lecture hall, 21A342, Karlstad, 09:00 (English)
Opponent
Supervisors
Available from: 2025-04-07 Created: 2025-03-17 Last updated: 2025-04-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Bayram, FirasAhmed, Bestoun S.

Search in DiVA

By author/editor
Bayram, FirasAhmed, Bestoun S.
By organisation
Department of Mathematics and Computer Science (from 2013)
In the same journal
Heliyon
Artificial Intelligence

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 44 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • apa.csl
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf