1. What tasks are performed in cloud-edge-terminal IoT networks?
In cloud-edge-terminal IoT networks, various dynamic scheduling tasks are performed. These tasks include radio resource management, data gathering, and wireless power transfer. Each edge in the network carries out one or more of these tasks, denoted by the set L. The tasks are represented as ( ) L, and the edges involved in a specific task are defined as N ( ) = { : ( ) = }. The network also considers the maximum bandwidth, memory resource, and computing resource available in the cloud server for performing these tasks. Additionally, the system model allows for the extension of scenarios where an edge carries out multiple tasks by conceptualizing the edge as a collection of distinct virtual edges, each representing an individual task.
read more
2. What is the state information vector of IoT device M?
The state information vector of IoT device M in time slot is defined as = ( , ,1 , . . ., , , ( ( ) ) ), where , , represents the th state information of IoT device in time slot, and ( ) is the number of types of state information for task . This vector captures the current conditions and status of the IoT device, which is crucial for effective scheduling decisions. It includes various parameters such as queue length, channel conditions, and other relevant factors that influence the scheduling process. By considering this state information, edges can make informed decisions to optimize the scheduling tasks and achieve the desired goals.
read more
3. How does FRL aggregate DNNs in IoT network?
FRL aggregates DNNs in IoT network by effectively aggregating the local DNNs (i.e., the local policies) from all edges conducting the task over multiple time slots in each round. The central parameters of the DNN for task at the cloud server are denoted by , and the local parameters of the DNN at edge are denoted by w. The cloud server broadcasts the central parameters, , for task in round to the edges in N ( ). Then, in round, each edge substitutes its local parameters, w , with . After this substitution, each edge trains its local parameters using its local experiences. These trained parameters are then uploaded to the cloud server. The cloud server updates its central parameters for task by aggregating the received parameters from edges in N ( ), using EQUATION. This process repeats for multiple rounds, effectively aggregating the DNNs and solving the problem in (6).
read more
4. What are the key challenges in collaborative policy learning?
The key challenges in collaborative policy learning include limited cloud resources for collaborative policy learning on multiple tasks, and the inapplicability of conventional policy structures. Limited cloud resources require careful task selection to maximize participation while maintaining fairness. The inapplicability of conventional policy structures arises from varying dynamics due to different numbers of IoT devices and system uncertainties, leading to different state and action spaces and transition probabilities. This necessitates a policy structure with generalization capability to collaboratively learn a central policy for all tasks. These challenges will be addressed in Section IV-A and IV-B, respectively.
read more