• français 
    • español
    • English
    • français
  • FacebookPinterestTwitter
  • español
  • English
  • français
Voir le document 
  •   Accueil de DIGIBUG
  • 1.-Investigación
  • Departamentos, Grupos de Investigación e Institutos
  • Departamento de Ciencias de la Computación e Inteligencia Artificial
  • DCCIA - Artículos
  • Voir le document
  •   Accueil de DIGIBUG
  • 1.-Investigación
  • Departamentos, Grupos de Investigación e Institutos
  • Departamento de Ciencias de la Computación e Inteligencia Artificial
  • DCCIA - Artículos
  • Voir le document
JavaScript is disabled for your browser. Some features of this site may not work without it.

Forgetting as a way to avoid deception in a repeated imitation game

[PDF] self_archived_jaamas_2013.pdf (1.154Mo)
Identificadores
URI: https://hdl.handle.net/10481/86080
DOI: 10.1007/s10458-012-9205-x
Exportar
RISRefworksMendeleyBibtex
Estadísticas
Statistiques d'usage de visualisation
Metadatos
Afficher la notice complète
Auteur
Villacorta Iglesias, Pablo José; Pelta Mochcovsky, David Alejandro; Lamata Jiménez, María Teresa
Editorial
Springer Nature
Materia
adversarial reasoning
 
decision making
 
adversarial decision making
 
strategies
 
imitation
 
repeated game
 
game
 
Date
2013-11
Referencia bibliográfica
Villacorta, P.J., Pelta, D.A. & Lamata, M.T. Forgetting as a way to avoid deception in a repeated imitation game. Auton Agent Multi-Agent Syst 27, 329–354 (2013). https://doi.org/10.1007/s10458-012-9205-x
Patrocinador
Grupo de investigación TIC-169: Modelos de Decisión y Optimización (MODO)
Résumé
Adversarial decision making is aimed at determining optimal decision strategies to deal with an adaptive opponent. A clear example of such situation is the repeated imitation game presented here. Two agents compete in an adversarial model where one agent wants to learn how to imitate the actions taken by the other agent by means of the observation and memorization of the past actions. One defense against this adversary is to make decisions that are intended to confuse him. To achieve this, randomized strategies that change along time for one of the agents are proposed and their performance is analysed from both a theoretical and empirical point of view. We also study the ability of the imitator to avoid deception and adapt to a new behaviour by forgetting the oldest observations. The results confirm that wrong assumptions about the imitator’s behaviour lead to dramatic losses due to a failure in causing deception.
Colecciones
  • DCCIA - Artículos

Mon compte

Ouvrir une sessionS'inscrire

Parcourir

Tout DIGIBUGCommunautés et CollectionsPar date de publicationAuteursTitresSujetsFinanciaciónPerfil de autor UGRCette collectionPar date de publicationAuteursTitresSujetsFinanciación

Statistiques

Statistiques d'usage de visualisation

Servicios

Pasos para autoarchivoAyudaLicencias Creative CommonsSHERPA/RoMEODulcinea Biblioteca UniversitariaNos puedes encontrar a través deCondiciones legales

Contactez-nous | Faire parvenir un commentaire