VLC and D2D heterogeneous network optimization: a reinforcement learning approach based on equilibrium problems with equilibrium constraints
