Identifying Sustainability in Public Tendering
Version
Published
Date Issued
2026-02-07
Author(s)
Bern University of Applied Sciences
Bern University of Applied Sciences
Ellendorff, Tilia
University of Zurich
Bern University of Applied Sciences
Bern University of Applied Sciences
Binder, Judith
Bern University of Applied Sciences
Bern University of Applied Sciences
Editor(s)
Cantini, Riccardo
University of Calabria
Ferragina, Luca
University of Calabria
Longo, Davide Mario
University of Calabria
Nikiforva, Anastasija
University of Palermo
Nisticò, Simona
University of Calabria
Scarcello, Francesco
Shahbazian, Reza
University of Palermo
Thakur, Dipanwita
University of Calabria
Trubitsyna, Irina
University of Calabria
Varricchio, Giovanna
University of Calabria
Type
Conference Paper
Language
English
Abstract
Public procurement serves as a significant lever for promoting sustainability, yet effectively assessing the integration of sustainability criteria within diverse and heterogeneous tender documents remains a challenge. This paper presents a Natural Language Processing (NLP) pipeline for automatically identifying sustainability criteria in Swiss public procurement documents written in German. To assess sustainability, we compiled four catalogs of official Sustainable Procurement Criteria (SPCs): three domain-specific (transport, food, furniture) and one domain-independent. Each call for tenders (CFT) document was segmented into sentences and encoded using a pre-trained sentence transformer. We then computed cosine similarity scores between each sentence and all SPCs, storing the top match from both the general and the domain-specific catalog, if applicable. While similarity scores were generally high for a majority of sentences, a preliminary manual inspection suggested that only matches with a score of 0.98 or higher tended to reflect meaningful alignment. To validate this threshold, two human experts independently reviewed 100 randomly sampled sentence-criterion pairs above this threshold. To explore whether this expert validation process could be scaled, we also prompted three different Large Language Models (LLMs) to assess the same samples, classifying each pair as a correct or incorrect match based on a majority vote. Our evaluation suggests that a similarity threshold of 0.98 is useful for reducing noise and identifying relevant sustainability criteria. LLM-based validation shows potential as a scalable alternative to human annotation, although performance varies between models. While Gemini 2.0 achieved substantial agreement with the expert judgments in terms of Fleiss’ Kappa (𝜅 = 0.754), other models demonstrated weaker alignment.
Journal or Serie
CEUR Workshop Proceedings
Series/Report No.
CEUR Workshop Proceedings; Vol-4165
ISSN
1613-0073
Publisher URL
Organization
Sponsors
Swiss National Science Foundation
Volume
4165
Project(s)
Swiss National Science Foundation (SNSF), Project 10000100: How environmental and social public procurement affects sustainability transformation in the public and private sector: The role of the revision of the public procurement law in Switzerland
Conference
2nd Workshop on Green-Aware Artificial Intelligence (Green-Aware AI@ECAI 2025)
Publisher
CEUR-WS.org
Submitter
Rolshoven, Luca Sven
Citation apa
Rolshoven, L. S., Matoshi, V., Ellendorff, T., Hostettler, S., Meili, R., Binder, J., & Stürmer, M. (2026). Identifying Sustainability in Public Tendering. In R. Cantini, L. Ferragina, D. M. Longo, A. Nikiforva, S. Nisticò, F. Scarcello, R. Shahbazian, D. Thakur, I. Trubitsyna, & G. Varricchio (Eds.), CEUR Workshop Proceedings (Vol. 4165, pp. 11–19). CEUR-WS.org. https://doi.org/10.24451/arbor.13428
Contact us
File(s)![Thumbnail Image]()
Loading...
open access
Name
short6.pdf
License
Attribution 4.0 International
Version
published
Size
1.26 MB
Format
Adobe PDF
Checksum (MD5)
2d6639863d071d7b06d0c202ea5e9d6e
