Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland
Version
Published
Date Issued
2025-11
Author(s)
Bern University of Applied Sciences
Rasiah, Vishvaksenan
University of Bern
Brügger Bose, Srinanda
University of Fribourg
Bern University of Applied Sciences
Bern University of Applied Sciences
Bern University of Applied Sciences
Niklaus, Joel
University of Bern
Editor(s)
Christodoulopoulos, Christos
Information Commissioner’s Office
Chakraborty, Tanmoy
Indian Institute of Technology Delhi
Rose, Carolyn
Carnegie Mellon University
Peng, Violet
University of California, Los Angeles
Type
Conference Paper
Language
English
Abstract
Legal research depends on headnotes: concise summaries that help lawyers quickly identify relevant cases. Yet, many court decisions lack them due to the high cost of manual annotation. To address this gap, we introduce the Swiss Landmark Decisions Summarization (SLDS) dataset containing 20K rulings from the Swiss Federal Supreme Court, each with headnotes in German, French, and Italian. SLDS has the potential to significantly improve access to legal information and transform legal research in Switzerland. We fine-tune open models (Qwen2.5, Llama 3.2, Phi-3.5) and compare them to larger general-purpose and reasoning-tuned LLMs, including GPT-4o, Claude 3.5 Sonnet, and the open-source DeepSeek R1. Using an LLM-as-a-Judge framework, we find that fine-tuned models perform well in terms of lexical similarity, while larger models generate more legally accurate and coherent summaries. Interestingly, reasoning-focused models show no consistent benefit, suggesting that factual precision is more important than deep reasoning in this task. We release SLDS under a CC BY 4.0 license to support future research in cross-lingual legal summarization.
Publisher DOI
Journal or Serie
Findings of the Association for Computational Linguistics: EMNLP 2025
Series/Report No.
Findings of ACL; EMNLP 2025
Publisher URL
Organization
Bern University of Applied Sciences
University of Bern
University of Zurich
University of Fribourg
Conference
The 2025 Conference on Empirical Methods in Natural Language Processing
Citation
Luca Rolshoven, Vishvaksenan Rasiah, Srinanda Brügger Bose, Sarah Hostettler, Lara Burkhalter, Matthias Stürmer, and Joel Niklaus. 2025. Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 15382–15411, Suzhou, China. Association for Computational Linguistics.
Publisher
Association for Computational Linguistics
Submitter
Rolshoven, Luca Sven
Citation apa
Rolshoven, L. S., Rasiah, V., Brügger Bose, S., Hostettler, S., Burkhalter, L., Stürmer, M., & Niklaus, J. (2025). Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland. In C. Christodoulopoulos, T. Chakraborty, C. Rose, & V. Peng (Eds.), Findings of the Association for Computational Linguistics: EMNLP 2025 (pp. 15382–15411). Association for Computational Linguistics. https://doi.org/10.24451/arbor.13208
Contact us
File(s)![Thumbnail Image]()
Loading...
open access
Name
2025.findings-emnlp.832.pdf
License
Attribution 4.0 International
Version
published
Size
466.91 KB
Format
Adobe PDF
Checksum (MD5)
7060ff11bcd665974f54dfaeddc78c8e
