<rdf:RDF xmlns:rdf="http://www.openarchives.org/OAI/2.0/rdf/" xmlns:ow="http://www.ontoweb.org/ontology/1#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ds="http://dspace.org/ds/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/rdf/ http://www.openarchives.org/OAI/2.0/rdf.xsd">
   <ow:Publication rdf:about="oai:digibug.ugr.es:10481/77969">
      <dc:title>A repeated imitation model with dependence between stages: decision strategies and rewards</dc:title>
      <dc:creator>Villacorta Iglesias, Pablo José</dc:creator>
      <dc:creator>Pelta Mochcovsky, David Alejandro</dc:creator>
      <dc:subject>Adversarial decision making</dc:subject>
      <dc:subject>Imitation</dc:subject>
      <dc:subject>Strategies</dc:subject>
      <dc:subject>State dependence</dc:subject>
      <dc:subject>Reward</dc:subject>
      <dc:subject>Inteligencia artificial</dc:subject>
      <dc:subject>Artificial intelligence</dc:subject>
      <dc:description>Adversarial decision making is aimed at determining strategies to anticipate the behavior of an opponent trying to learn from&#xd;
our actions. One defense is to make decisions intended to confuse the opponent, although our rewards can be diminished.&#xd;
This idea has already been captured in an adversarial model introduced in a previous work, in which two agents separately&#xd;
issue responses to an unknown sequence of external inputs. Each agent’s reward depends on the current input and the&#xd;
responses of both agents. In this contribution, (a) we extend the original model by establishing stochastic dependence&#xd;
between an agent’s responses and the next input of the sequence, and (b) we study the design of time varying decision&#xd;
strategies for the extended model. The strategies obtained are compared against static strategies from theoretical and&#xd;
empirical points of view. The results show that time varying strategies outperform static ones.</dc:description>
      <dc:date>2022-11-15T08:00:03Z</dc:date>
      <dc:date>2022-11-15T08:00:03Z</dc:date>
      <dc:date>2015-09-30</dc:date>
      <dc:type>journal article</dc:type>
      <dc:identifier>Villacorta,P. &amp; Pelta,D.(2015).A repeated imitation model with dependence between stages: Decision strategies and rewards. International Journal of Applied Mathematics and Computer Science,25(3) 617-630. [https://doi.org/10.1515/amcs-2015-0045]</dc:identifier>
      <dc:identifier>https://hdl.handle.net/10481/77969</dc:identifier>
      <dc:identifier>10.1515/amcs-2015-0045</dc:identifier>
      <dc:language>eng</dc:language>
      <dc:rights>http://creativecommons.org/licenses/by-nc-nd/4.0/</dc:rights>
      <dc:rights>open access</dc:rights>
      <dc:rights>Attribution-NonCommercial-NoDerivatives 4.0 Internacional</dc:rights>
      <dc:publisher>Sciendo</dc:publisher>
   </ow:Publication>
</rdf:RDF>