<rdf:RDF xmlns:rdf="http://www.openarchives.org/OAI/2.0/rdf/" xmlns:ow="http://www.ontoweb.org/ontology/1#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ds="http://dspace.org/ds/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/rdf/ http://www.openarchives.org/OAI/2.0/rdf.xsd">
   <ow:Publication rdf:about="oai:digibug.ugr.es:10481/110902">
      <dc:title>Inferring Gender from Author Names with Local LLMs: A Multi-Model Evaluation</dc:title>
      <dc:creator>Herrero Solana, Víctor</dc:creator>
      <dc:creator>González-Salmón, Elvira</dc:creator>
      <dc:creator>Robinson García, Nicolás</dc:creator>
      <dc:subject>Generative AI</dc:subject>
      <dc:subject>Local Large Language Models</dc:subject>
      <dc:subject>Gender Assignment Algorithms</dc:subject>
      <dc:description>Gender identification of researchers is a common practice in scientometric studies examining inequalities in science. The most widely used approach relies on inferring gender from author names using commercial APIs or name-gender dictionaries, which often lack transparency and reproducibility. This study explores the use of local open-weight Large Language Models (LLMs) as an alternative for name-based gender classification. We evaluate 25 models from seven leading families (Llama, Gemma, Phi, Mistral, Qwen, DeepSeek, and Yi), ranging from 270 million to 70 billion parameters, using a reference dataset of nearly 200,000 names across 195 countries extracted from Wikidata. Results show that top-performing models achieve F1-Scores above 0.93 for both gender categories, positioning local LLMs as a viable, cost-effective, and reproducible alternative to proprietary tools. A critical performance threshold emerges at approximately 7 billion parameters, above which all models achieve acceptable results, with diminishing returns beyond 12-14 billion. All models exhibit systematic gender bias, showing higher precision for men and higher recall for women, indicating a tendency to classify ambiguous names as male. Mistral-Nemo-12b emerges as the optimal choice, balancing accuracy, computational efficiency, and gender equity.</dc:description>
      <dc:date>2026-02-12T07:44:51Z</dc:date>
      <dc:date>2026-02-12T07:44:51Z</dc:date>
      <dc:date>2026</dc:date>
      <dc:type>journal article</dc:type>
      <dc:identifier>Herrero Solana, V.; González-Salmón, E. y Robinson García, N. (2026). Inferring Gender from Author Names with Local LLMs: A Multi-Model Evaluation.</dc:identifier>
      <dc:identifier>https://hdl.handle.net/10481/110902</dc:identifier>
      <dc:identifier>https://zenodo.org/records/18610104</dc:identifier>
      <dc:language>eng</dc:language>
      <dc:rights>http://creativecommons.org/licenses/by-nc-nd/3.0/</dc:rights>
      <dc:rights>open access</dc:rights>
      <dc:rights>Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License</dc:rights>
   </ow:Publication>
</rdf:RDF>