Il full-text non è disponibile per scelta dell'autore.
      
        (
Contatta l'autore)
      
    
  
    
  
  
    
      Abstract
      The problem addressed in this thesis is to use swarm agents to find the optimal placement to reach optimal localization performance of a target node in a wireless sensor network scenario. Localization can be based on simply received signal strength (RSSI) and trilateration. To measure the accuracy of the localization process, geometric dilution of precision (GDOP) has been used. Trilateration is performed by mobile anchors that, in this work, will be supposed to be drones. Three anchors are used. The anchors are free to move in an environment represented by a grid. Each drone can assume a grid cell as location. To move from a cell to another there are five actions allowed. Each agent can move one cell square north, south, east, west or remain in its current position, if possible. Localization is performed on a target node arbitrarily positioned in the environment. Each time drones make a move, a reward is awarded to them depending on the estimated distance from the target and the GDOP. This allows drones to determine whether or not the action taken in a particular cell was valid. Three different algorithms have been proposed and implemented. The first one called 'Multi agent Q-learning' is used in small gridworld. Each executable action in a cell is assigned a certain value, called q-value, indicating how much that action is useful to reach the final goal. The tested scenarios include both environments with and without obstacles. A deep reinforcement learning approach was used to shift the problem even to larger environments. Thanks to the use of neural networks, an algorithm called  'actor-critic' has been implemented. The action will be chosen over a distribution of probabilities. Finally, the two algorithms have been united in a hybrid technique that allows trilateration to be performed even on mobile targets.
     
    
      Abstract
      The problem addressed in this thesis is to use swarm agents to find the optimal placement to reach optimal localization performance of a target node in a wireless sensor network scenario. Localization can be based on simply received signal strength (RSSI) and trilateration. To measure the accuracy of the localization process, geometric dilution of precision (GDOP) has been used. Trilateration is performed by mobile anchors that, in this work, will be supposed to be drones. Three anchors are used. The anchors are free to move in an environment represented by a grid. Each drone can assume a grid cell as location. To move from a cell to another there are five actions allowed. Each agent can move one cell square north, south, east, west or remain in its current position, if possible. Localization is performed on a target node arbitrarily positioned in the environment. Each time drones make a move, a reward is awarded to them depending on the estimated distance from the target and the GDOP. This allows drones to determine whether or not the action taken in a particular cell was valid. Three different algorithms have been proposed and implemented. The first one called 'Multi agent Q-learning' is used in small gridworld. Each executable action in a cell is assigned a certain value, called q-value, indicating how much that action is useful to reach the final goal. The tested scenarios include both environments with and without obstacles. A deep reinforcement learning approach was used to shift the problem even to larger environments. Thanks to the use of neural networks, an algorithm called  'actor-critic' has been implemented. The action will be chosen over a distribution of probabilities. Finally, the two algorithms have been united in a hybrid technique that allows trilateration to be performed even on mobile targets.
     
  
  
    
    
      Tipologia del documento
      Tesi di laurea
(Laurea magistrale)
      
      
      
      
        
      
        
          Autore della tesi
          Bertozzi, Enrico
          
        
      
        
          Relatore della tesi
          
          
        
      
        
          Correlatore della tesi
          
          
        
      
        
          Scuola
          
          
        
      
        
          Corso di studio
          
          
        
      
        
      
        
      
        
          Ordinamento Cds
          DM270
          
        
      
        
          Parole chiave
          Reinforcement Learning,Localization,Target tracking,Neural Network,MATLAB
          
        
      
        
          Data di discussione della Tesi
          12 Marzo 2020
          
        
      
      URI
      
      
     
   
  
    Altri metadati
    
      Tipologia del documento
      Tesi di laurea
(NON SPECIFICATO)
      
      
      
      
        
      
        
          Autore della tesi
          Bertozzi, Enrico
          
        
      
        
          Relatore della tesi
          
          
        
      
        
          Correlatore della tesi
          
          
        
      
        
          Scuola
          
          
        
      
        
          Corso di studio
          
          
        
      
        
      
        
      
        
          Ordinamento Cds
          DM270
          
        
      
        
          Parole chiave
          Reinforcement Learning,Localization,Target tracking,Neural Network,MATLAB
          
        
      
        
          Data di discussione della Tesi
          12 Marzo 2020
          
        
      
      URI
      
      
     
   
  
  
  
  
  
  
    
      Gestione del documento: 
      
        