Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • apa.csl
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards Robust and Adaptive Machine Learning: A Fresh Perspective on Evaluation and Adaptation Methodologies in Non-Stationary Environments
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013).ORCID iD: 0000-0003-0683-2783
2023 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Machine learning (ML) has become ubiquitous in various disciplines and applications, serving as a powerful tool for developing predictive models to analyze diverse variables of interest. With the advent of the digital era, the proliferation of data has presented numerous opportunities for growth and expansion across various domains. However, along with these opportunities, there is a unique set of challenges that arises due to the dynamic and ever-changing nature of data. These challenges include concept drift, which refers to shifting data distributions over time, and other data-related issues that can be framed as learning problems. Traditional static models are inadequate in handling these issues, underscoring the need for novel approaches to enhance the performance robustness and reliability of ML models to effectively navigate the inherent non-stationarity in the online world. The field of concept drift is characterized by several intricate aspects that challenge learning algorithms, including the analysis of model performance, which requires evaluating and understanding how the ML model's predictive capability is affected by different problem settings. Additionally, determining the magnitude of drift necessary for change detection is an indispensable task, as it involves identifying substantial shifts in data distributions. Moreover, the integration of adaptive methodologies is essential for updating ML models in response to data dynamics, enabling them to maintain their effectiveness and reliability in evolving environments. In light of the significance and complexity of the topic, this dissertation offers a fresh perspective on the performance robustness and adaptivity of ML models in non-stationary environments. The main contributions of this research include exploring and organizing the literature, analyzing the performance of ML models in the presence of different types of drift, and proposing innovative methodologies for drift detection and adaptation that solve real-world problems. By addressing these challenges, this research paves the way for the development of more robust and adaptive ML solutions capable of thriving in dynamic and evolving data landscapes.

Abstract [en]

Machine learning (ML) is widely used in various disciplines as a powerful tool for developing predictive models to analyze diverse variables. In the digital era, the abundance of data has created growth opportunities, but it also brings challenges due to the dynamic nature of data. One of these challenges is concept drift, the shifting data distributions over time. Consequently, traditional static models are inadequate for handling these challenges in the online world. Concept drift, with its intricate aspects, presents a challenge for learning algorithms. Analyzing model performance and detecting substantial shifts in data distributions are crucial for integrating adaptive methodologies to update ML models in response to data dynamics, maintaining effectiveness and reliability in evolving environments. In this dissertation, a fresh perspective is offered on the robustness and adaptivity of ML models in non-stationary environments. This research explores and organizes existing literature, analyzes ML model performance in the presence of drift, and proposes innovative methodologies for detecting and adapting to drift in real-world problems. The aim is to develop more robust and adaptive ML solutions capable of thriving in dynamic and evolving data landscapes.

Place, publisher, year, edition, pages
Karlstad: Karlstads universitet, 2023. , p. 24
Series
Karlstad University Studies, ISSN 1403-8099 ; 2023:23
Keywords [en]
machine learning, concept drift, covariate shift, performance robustness, evaluation methodology, adaptive learning
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kau:diva-95645ISBN: 978-91-7867-390-2 (print)ISBN: 978-91-7867-391-9 (electronic)OAI: oai:DiVA.org:kau-95645DiVA, id: diva2:1773449
Presentation
2023-10-06, Sjöström Lecture Hall, 1B309, Karlstad University, Karlstad, 09:00 (English)
Opponent
Supervisors
Available from: 2023-09-15 Created: 2023-06-22 Last updated: 2023-09-15Bibliographically approved
List of papers
1. From concept drift to model degradation: An overview on performance-aware drift detectors
Open this publication in new window or tab >>From concept drift to model degradation: An overview on performance-aware drift detectors
2022 (English)In: Knowledge-Based Systems, ISSN 0950-7051, E-ISSN 1872-7409, Vol. 245, article id 108632Article, review/survey (Refereed) [Artistic work] Published
Abstract [en]

The dynamicity of real-world systems poses a significant challenge to deployed predictive machine learning (ML) models. Changes in the system on which the ML model has been trained may lead to performance degradation during the system’s life cycle. Recent advances that study non-stationary environments have mainly focused on identifying and addressing such changes caused by a phenomenon called concept drift. Different terms have been used in the literature to refer to the same type of concept drift and the same term for various types. This lack of unified terminology is set out to create confusion on distinguishing between different concept drift variants. In this paper, we start by grouping concept drift types by their mathematical definitions and survey the different terms used in the literature to build a consolidated taxonomy of the field. We also review and classify performance-based concept drift detection methods proposed in the last decade. These methods utilize the predictive model’s performance degradation to signal substantial changes in the systems. The classification is outlined in a hierarchical diagram to provide an orderly navigation between the methods. We present a comprehensive analysis of the main attributes and strategies for tracking and evaluating the model’s performance in the predictive system. The paper concludes by discussing open research challenges and possible research directions.

National Category
Computer and Information Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:kau:diva-89786 (URN)10.1016/j.knosys.2022.108632 (DOI)000795145800005 ()2-s2.0-85127829086 (Scopus ID)
Projects
AIDA
Available from: 2022-05-16 Created: 2022-05-16 Last updated: 2023-06-22Bibliographically approved
2. A domain-region based evaluation of ML performance robustness to covariate shift
Open this publication in new window or tab >>A domain-region based evaluation of ML performance robustness to covariate shift
2023 (English)In: Neural Computing & Applications, ISSN 0941-0643, E-ISSN 1433-3058Article in journal (Refereed) Epub ahead of print
Abstract [en]

Most machine learning methods assume that the input data distribution is the same in the training and testing phases.However, in practice, this stationarity is usually not met and the distribution of inputs differs, leading to unexpectedperformance of the learned model in deployment. The issue in which the training and test data inputs follow differentprobability distributions while the input–output relationship remains unchanged is referred to as covariate shift. In thispaper, the performance of conventional machine learning models was experimentally evaluated in the presence of covariateshift. Furthermore, a region-based evaluation was performed by decomposing the domain of probability density function ofthe input data to assess the classifier’s performance per domain region. Distributional changes were simulated in a twodimensional classification problem. Subsequently, a higher four-dimensional experiments were conducted. Based on theexperimental analysis, the Random Forests algorithm is the most robust classifier in the two-dimensional case, showing thelowest degradation rate for accuracy and F1-score metrics, with a range between 0.1% and 2.08%. Moreover, the resultsreveal that in higher-dimensional experiments, the performance of the models is predominantly influenced by the complexity of the classification function, leading to degradation rates exceeding 25% in most cases. It is also concluded that themodels exhibit high bias toward the region with high density in the input space domain of the training samples.

Place, publisher, year, edition, pages
Springer Science+Business Media B.V., 2023
Keywords
Degradation, Function evaluation, Input output programs, Machine learning, Probability distributions, Classifier evaluation, Concept drifts, Covariate shifts, Input datas, Machine-learning, Model degradations, Performance, Region-based, Robust machine learning, Two-dimensional, Probability density function
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:kau:diva-94905 (URN)10.1007/s00521-023-08622-w (DOI)000988193400003 ()2-s2.0-85159332578 (Scopus ID)
Funder
Knowledge Foundation, 20200067Karlstad University
Available from: 2023-05-29 Created: 2023-05-29 Last updated: 2023-06-22Bibliographically approved
3. DA-LSTM: A dynamic drift-adaptive learning framework for interval load forecasting with LSTM networks
Open this publication in new window or tab >>DA-LSTM: A dynamic drift-adaptive learning framework for interval load forecasting with LSTM networks
Show others...
2023 (English)In: Engineering applications of artificial intelligence, ISSN 0952-1976, E-ISSN 1873-6769, Vol. 123, article id 106480Article in journal (Refereed) Published
Abstract [en]

Load forecasting is a crucial topic in energy management systems (EMS) due to its vital role in optimizing energy scheduling and enabling more flexible and intelligent power grid systems. As a result, these systems allow power utility companies to respond promptly to demands in the electricity market. Deep learning (DL) models have been commonly employed in load forecasting problems supported by adaptation mechanisms to cope with the changing pattern of consumption by customers, known as concept drift. A drift magnitude threshold should be defined to design change detection methods to identify drifts. While the drift magnitude in load forecasting problems can vary significantly over time, existing literature often assumes a fixed drift magnitude threshold, which should be dynamically adjusted rather than fixed during system evolution. To address this gap, in this paper, we propose a dynamic drift-adaptive Long Short-Term Memory (DA-LSTM) framework that can improve the performance of load forecasting models without requiring a drift threshold setting. We integrate several strategies into the framework based on active and passive adaptation approaches. To evaluate DA-LSTM in real-life settings, we thoroughly analyze the proposed framework and deploy it in a real-world problem through a cloud-based environment. Efficiency is evaluated in terms of the prediction performance of each approach and computational cost. The experiments show performance improvements on multiple evaluation metrics achieved by our framework compared to baseline methods from the literature. Finally, we present a trade-off analysis between prediction performance and computational costs.

Place, publisher, year, edition, pages
Elsevier, 2023
Keywords
Concept drift Change-point detection Dynamic drift adaptation Adaptive LSTM Interval load forecasting
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:kau:diva-95022 (URN)10.1016/j.engappai.2023.106480 (DOI)001013639800001 ()2-s2.0-85160615665 (Scopus ID)
Funder
Knowledge Foundation, 20200067Swedish Energy Agency, 50246-1; 52693-1
Available from: 2023-06-02 Created: 2023-06-02 Last updated: 2023-07-06Bibliographically approved
4. A Drift Handling Approach for Self-Adaptive ML Software in Scalable Industrial Processes
Open this publication in new window or tab >>A Drift Handling Approach for Self-Adaptive ML Software in Scalable Industrial Processes
2022 (English)In: Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering / [ed] Mario Aehnelt and Thomas Kirste, Association for Computing Machinery (ACM), 2022, p. 1-5, article id 129Conference paper, Published paper (Refereed)
Abstract [en]

Most industrial processes in real-world manufacturing applications are characterized by the scalability property, which requires an automated strategy to self-adapt machine learning (ML) software systems to the new conditions. In this paper, we investigate an Electroslag Remelting (ESR) use case process from the Uddeholms AB steel company. The use case involves predicting the minimum pressure value for a vacuum pumping event. Taking into account the long time required to collect new records and efficiently integrate the new machines with the built ML software system. Additionally, to accommodate the changes and satisfy the non-functional requirement of the software system, namely adaptability, we propose an automated and adaptive approach based on a drift handling technique called importance weighting. The aim is to address the problem of adding a new furnace to production and enable the adaptability attribute of the ML software. The overall results demonstrate the improvements in ML software performance achieved by implementing the proposed approach over the classical non-adaptive approach. 

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2022
Keywords
Application programs, Automation, Electroslag remelting, Adaptive approach, Automated industrial process, Changing environment, Concept drifts, Electroslag remelting, Industrial processs, Machine learning software, Machine learning software adaptability, Software adaptability, Software-systems, Remelting
National Category
Software Engineering
Research subject
Computer Science
Identifiers
urn:nbn:se:kau:diva-93618 (URN)10.1145/3551349.3559495 (DOI)2-s2.0-85146949434 (Scopus ID)978-1-4503-9475-8 (ISBN)
Conference
37th IEEE/ACM International Conference on Automated Software Engineering, Rochester, USA, October 10-14, 2022.
Funder
Knowledge Foundation, 20200067
Available from: 2023-02-13 Created: 2023-02-13 Last updated: 2023-06-22Bibliographically approved

Open Access in DiVA

KAPPAN(1062 kB)43 downloads
File information
File name FULLTEXT02.pdfFile size 1062 kBChecksum SHA-512
71558d9a1057e7e72dfe4e9be0856199b30cafe574bc2ec968e951bf7becd601b350484dab779a0e151dac08c7d6804ea4ce6f44ad6be481df9679ae87f765b1
Type fulltextMimetype application/pdf

Authority records

Bayram, Firas

Search in DiVA

By author/editor
Bayram, Firas
By organisation
Department of Mathematics and Computer Science (from 2013)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 43 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 159 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • apa.csl
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf