Mostrar el registro sencillo del ítem

dc.contributor.authorShamsolmoali, Pourya
dc.contributor.authorZareapoor, Masoumeh
dc.contributor.authorDas, Swagatam
dc.contributor.authorGranger, Eric
dc.contributor.authorGarcía López, Salvador 
dc.date.accessioned2024-05-22T07:40:17Z
dc.date.available2024-05-22T07:40:17Z
dc.date.issued2023-10-24
dc.identifier.citationPublished version: P. Shamsolmoali, M. Zareapoor, S. Das, E. Granger and S. García, "Hybrid Gromov–Wasserstein Embedding for Capsule Learning," in IEEE Transactions on Neural Networks and Learning Systems, doi: 10.1109/TNNLS.2023.3348657es_ES
dc.identifier.urihttps://hdl.handle.net/10481/91950
dc.description.abstractCapsule networks (CapsNets) aim to parse images into a hierarchy of objects, parts, and their relations using a twostep process involving part-whole transformation and hierarchical component routing. However, this hierarchical relationship modeling is computationally expensive, which has limited the wider use of CapsNet despite its potential advantages. The current state of CapsNet models primarily focuses on comparing their performance with capsule baselines, falling short of achieving the same level of proficiency as deep CNN variants in intricate tasks. To address this limitation, we present an efficient approach for learning capsules that surpasses canonical baseline models and even demonstrates superior performance compared to highperforming convolution models. Our contribution can be outlined in two aspects: firstly, we introduce a group of subcapsules onto which an input vector is projected. Subsequently, we present the Hybrid Gromov-Wasserstein framework, which initially quantifies the dissimilarity between the input and the components modeled by the subcapsules, followed by determining their alignment degree through optimal transport. This innovative mechanism capitalizes on new insights into defining alignment between the input and subcapsules, based on the similarity of their respective component distributions. This approach enhances CapsNets’ capacity to learn from intricate, high-dimensional data while retaining their interpretability and hierarchical structure. Our proposed model offers two distinct advantages: (i) its lightweight nature facilitates the application of capsules to more intricate vision tasks, including object detection; (ii) it outperforms baseline approaches in these demanding tasks. Our empirical findings illustrate that Hybrid Gromov-Wasserstein Capsules (HGWCapsules) exhibit enhanced robustness against affine transformations, scale effectively to larger datasets, and surpass CNN and CapsNet models across various vision tasks.es_ES
dc.description.sponsorshipQueens University Startup under Project D8203EECes_ES
dc.language.isoenges_ES
dc.publisherInstitute of Electrical and Electronics Engineerses_ES
dc.rightsAtribución-NoComercial-CompartirIgual 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/*
dc.subjectCapsule Networkses_ES
dc.subjectOptimal Transportes_ES
dc.subjectWasserstein Distanceses_ES
dc.titleHybrid Gromov-Wasserstein Embedding for Capsule Learninges_ES
dc.typejournal articlees_ES
dc.rights.accessRightsopen accesses_ES
dc.identifier.doi10.1109/TNNLS.2023.3348657
dc.type.hasVersionSMURes_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución-NoComercial-CompartirIgual 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial-CompartirIgual 4.0 Internacional