<rdf:RDF xmlns:rdf="http://www.openarchives.org/OAI/2.0/rdf/" xmlns:ow="http://www.ontoweb.org/ontology/1#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ds="http://dspace.org/ds/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/rdf/ http://www.openarchives.org/OAI/2.0/rdf.xsd">
   <ow:Publication rdf:about="oai:digibug.ugr.es:10481/79149">
      <dc:title>New applications of models based on imprecise probabilities within data mining</dc:title>
      <dc:creator>Moral García, Serafín</dc:creator>
      <dc:contributor>Abellán Mulero, Joaquín</dc:contributor>
      <dc:contributor>Mantas Ruiz, Carlos Javier</dc:contributor>
      <dc:contributor>Universidad de Granada. Programa de Doctorado en Tecnologías de la Información y de la Comunicación</dc:contributor>
      <dc:description>When we have information about a finite set of possible alternatives provided&#xd;
by an expert or dataset, a mathematical model is needed to represent&#xd;
such information. In some cases, a unique probability distribution is not appropriate&#xd;
for this purpose because the available information is not sufficient.&#xd;
For this reason, several mathematical theories and models based on imprecise&#xd;
probabilities have been developed in the literature. In this thesis work, we analyze&#xd;
the relations between some imprecise probability theories and study the&#xd;
properties of some models based on imprecise probabilities. When imprecise&#xd;
probability theories and models arise, tools for quantifying the uncertaintybased&#xd;
information in such theories and models, usually called uncertainty&#xd;
measures, are needed. In this thesis work, we analyze the properties of some&#xd;
existing uncertainty measures in theories based on imprecise probabilities and&#xd;
propose uncertainty measures in imprecise probability theories and models&#xd;
that present some advantages over the existing ones.&#xd;
Situations in which it is necessary to represent the information provided&#xd;
by a dataset about a finite set of possible alternatives arise in classification, an&#xd;
essential task within Data Mining. This well-known task consists of predicting,&#xd;
for a given instance described via a set of attributes, the value of a variable&#xd;
under study, known as the class variable. In classification, it is often needed to&#xd;
quantify the uncertainty-based information about the class variable. For this&#xd;
purpose, classical probability theory (PT) has been employed for many years.&#xd;
In the last years, classification algorithms that represent the information about&#xd;
the class variable via imprecise probability models have been developed. Via&#xd;
experimental studies, it has been shown that classification methods based on&#xd;
imprecise probabilities significantly outperform the ones that utilize PT when&#xd;
data contain errors.&#xd;
When classifying an instance, classifiers tend to predict a single value of the&#xd;
class variable. Nonetheless, in some cases, there is not enough information&#xd;
available for a classifier to point out a single class value. In these situations, it&#xd;
is more logical that classifiers predict a set of class values instead of a single&#xd;
value of the class variable. This is known as Imprecise Classification.&#xd;
Classification algorithms (including Imprecise Classification) often aim to&#xd;
minimize the number of instances erroneously classified. This would be optimal&#xd;
if all classification errors had the same importance. Nevertheless, in practical applications, different classification errors usually lead to different costs.&#xd;
For this reason, classifiers that take the misclassification costs into account,&#xd;
also known as cost-sensitive classifiers, have been developed in the literature.&#xd;
Traditional classification (including Imprecise Classification) assumes that&#xd;
each instance has a single value of a class variable. However, in some domains,&#xd;
this task does not fit well because an instance may belong to multiple labels&#xd;
simultaneously. In these domains, the Multi-Label Classification task (MLC)&#xd;
is more suitable than traditional classification. MLC aims to predict the set of&#xd;
labels associated with a given instance described via an attribute set. Most of&#xd;
the MLC methods proposed so far represent the information provided by an&#xd;
MLC dataset about the set of labels via classical PT.&#xd;
In this thesis work, we develop new classification algorithms based on imprecise&#xd;
probability models, including Imprecise Classification, cost-sensitive&#xd;
Imprecise Classification, and MLC, that present some advantages and obtain&#xd;
better experimental results than the ones of the state-of-the-art.</dc:description>
      <dc:description>En esta tesis seguimos la línea de investigación de teorías y modelos de&#xd;
probabilidades imprecisas y medidas de incertidumbre con probabilidades&#xd;
imprecisas. También proponemos nuevos métodos de clasificación basados en&#xd;
probabilidades imprecisas que obtienen mejor rendimiento que los del estado&#xd;
del arte.</dc:description>
      <dc:date>2023-01-19T12:21:57Z</dc:date>
      <dc:date>2023-01-19T12:21:57Z</dc:date>
      <dc:date>2023</dc:date>
      <dc:date>2022-12-21</dc:date>
      <dc:type>doctoral thesis</dc:type>
      <dc:identifier>Moral García, Serafín. New applications of models based on imprecise probabilities within data mining. Granada: Universidad de Granada, 2023. [https://hdl.handle.net/10481/79149]</dc:identifier>
      <dc:identifier>9788411176279</dc:identifier>
      <dc:identifier>https://hdl.handle.net/10481/79149</dc:identifier>
      <dc:language>eng</dc:language>
      <dc:rights>http://creativecommons.org/licenses/by-nc-nd/4.0/</dc:rights>
      <dc:rights>open access</dc:rights>
      <dc:rights>Attribution-NonCommercial-NoDerivatives 4.0 Internacional</dc:rights>
      <dc:publisher>Universidad de Granada</dc:publisher>
   </ow:Publication>
</rdf:RDF>