FGV Digital Repository

Desenvolvimento de estratégias e fenômenos em dinâmicas de jogos de múltiplos agentes (Development of strategies and phenomena in multi-agent game dynamics)

View/Open
Undergraduate thesis (Trabalho de conclusão de curso) - Giovanni Almeida Argento de Amorim (8.721Mb)
Date
2020-11
Author
Amorim, Giovanni Almeida Argento de
Advisor
Coelho, Flávio Codeço
Abstract
Recent developments in Reinforcement Learning (RL) methods focus on models that can learn good policies in non-stationary environments, such as multi-agent games, where agents must learn how to react to changes in other agents’ strategies or in the environment. Progress has been made by studying not only how a single agent can develop its policy, but also how a population of agents can evolve from initial distributions to stable states of strategies. Evolutionary Game Theory (EGT) is a theoretical framework that combines mathematical and economic knowledge from game theory with inspiration from biological evolution to study how individuals in a population interact dynamically in an environment. In this paper, we first introduce EGT concepts and show how they can be applied to understanding a population’s learning dynamics in the context of RL. We then link those concepts to learning algorithms and study how the behaviour of those methods can be inferred from their connections with evolutionary dynamics. Finally, we study and evaluate a recently proposed algorithm derived from a policy gradient model and EGT dynamics, and discuss next steps.
URI
https://hdl.handle.net/10438/30458
Collections
  • FGV EMAp - Trabalhos de Conclusão de Curso [45]
Knowledge Areas
Mathematics
Subject
Game theory
Strategic games (Mathematics)
Keyword
Evolutionary game theory
Reinforcement learning
Multi-agent
Learning dynamics
Replicator dynamics
Neural replicator dynamics

DSpace software copyright © 2002-2016  DuraSpace