Fault-Tolerant Cost-Efficient Scheduling for Energy and Deadline-Constrained IoT Workflows in Edge-Cloud Continuum
2025 (English)In: IEEE Transactions on Services Computing, E-ISSN 1939-1374, Vol. 18, no 5, p. 2892-2903Article in journal (Refereed) Published
Abstract [en]
Edge computing brings computation closer to Internet-of-Things (IoT) data sources, reducing latency but increasing energy consumption and susceptibility to node failures. The cloud platform provides extensive computational capabilities, but comes with significant costs and communication delays due to network congestion. The edge-cloud continuum strategically combines these approaches to mitigate their individual drawbacks. However, effectively scheduling IoT workflows to minimize costs while adhering to strict requirements for latency, energy efficiency, and reliability remains a major challenge in real-time IoT applications. To address these challenges, we propose the Reliable Energy-constrained Cost-aware Real-time (RECR) algorithm for optimizing IoT workflow scheduling across the edge-cloud continuum. RECR minimizes monetary costs and enhances reliability while adhering to strict energy and deadline constraints. We also introduce RECR-D, a fault-tolerant extension that employs adaptive task duplication to manage transient and permanent failures, with reliability rigorously modeled using Continuous-Time Markov Chains (CTMCs) to integrate dynamic failure behavior. Extensive simulations demonstrate that RECR reduces workflow monetary costs by approximately 21% and improves deadline adherence by 37% compared to state-of-the-art algorithms. Furthermore, RECR-D improves compliance with reliability and energy constraints by 27% and by up to 208%, respectively, highlighting its robust performance in dynamic, failure-prone environments. These contributions significantly advance workflow management for IoT applications, proving crucial for real-time traffic control and video analytics in smart cities, ensuring timely processing and lower costs. They are also vital for remote patient monitoring and medical imaging analysis in healthcare, improving reliability and meeting deadlines for patient safety.
Place, publisher, year, edition, pages
IEEE, 2025. Vol. 18, no 5, p. 2892-2903
Keywords [en]
Computational cost, Constrained optimization, Continuous time systems, Costs, Edge computing, Fault tolerance, Green computing, Markov processes, Medical imaging, Scheduling algorithms, Traffic congestion, Workflow management, Cost-aware, Edge, Edge clouds, Energy-constrained, Fault-tolerant, Latency, Real- time, Reliable energy, Work-flows, Workflow scheduling, Clouds
National Category
Computer Sciences Computer Systems
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kau:diva-106747DOI: 10.1109/TSC.2025.3599497ISI: 001591693600029Scopus ID: 2-s2.0-105013296423OAI: oai:DiVA.org:kau-106747DiVA, id: diva2:1994798
2025-09-032025-09-032026-02-12Bibliographically approved