AI Service Deployment and Resource Allocation Optimization Based on Human-Like Networking Architecture

IEEE Internet of Things Journal(2024)

引用 0|浏览11
暂无评分
摘要
In the forthcoming sixth-generation (6G) era, edge-network-cloud collaboration is needed to support artificial intelligence as a service (AIaaS) with a strong demand for computing power. However, how to guarantee the Quality of AI Service (QoAIS) and utilize the edge-network-cloud collaboration to enhance the performance of AI service is a big challenge. In this paper, we propose an AI service management and network resource scheduling architecture based on human-like networking. Considering the Quality of Service (QoS) requirements and AI tasks, we propose a joint AI agent placement with deep neural network (DNN) deployment and dynamic bandwidth resource allocation algorithm (JAAPD-D). JAAPD-D is proposed to solve the short-term and long-term joint resource allocation problem which includes communication, computation, and memory resources in the network. We adjust the agent placement, DNN deployment, and schedule routing path to ensure effective service transmission in the long time interval and dynamically allocate bandwidth resources in the short time interval. We use Lyapunov optimization to ensure the system stability of the whole network, meet the QoS requirements of various services, and minimize the average end-to-end delay of services. Simulation results show that JAAPD-D outperforms existing algorithms in terms of delay, traffic accepted rate, network system throughput, and cost.
更多
查看译文
关键词
Artificial Intelligence as a Service,Quality of AI Service,DNN Inference,Human-like Networking,Resource Management,Lyapunov Optimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要