Repository logo
  • English
  • Deutsch
  • Français
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. CRIS
  3. Publication
  4. Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling
 

Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling

URI
https://arbor.bfh.ch/handle/arbor/44656
Version
Published
Date Issued
2024
Author(s)
Stern Ronja
Kawamura Ken
Stürmer, Matthias  
Chalkidis Ilias
Niklaus, Joël  
Type
Working Paper
Language
English
Subjects

cs.CL

cs.AI

cs.LG

68T50

I.2; I.7

Abstract
Predicting case criticality helps legal professionals in the court system manage large volumes of case law. This paper introduces the Criticality Prediction dataset, a new resource for evaluating the potential influence of Swiss Federal Supreme Court decisions on future jurisprudence. Unlike existing approaches that rely on resource-intensive manual annotations, we semi-automatically derive labels leading to
a much larger dataset than otherwise possible. Our dataset features a two-tier labeling system: (1) the LD-Label, which identifies cases published as Leading Decisions (LD), and (2) the Citation-Label, which ranks cases by their citation frequency and recency. This allows for a more nuanced evaluation of case importance. We evaluate several multilingual models, including fine-tuned variants and large language models, and find that fine-tuned models consistently outperform zero-shot baselines, demonstrating the need for task-specific adaptation. Our contributions include the introduction of this task and the release of a multilingual
dataset to the research community.
Subjects
BJ Ethics
DOI
https://doi.org/10.24451/dspace/11453
Publisher DOI
10.48550/arXiv.2410.13460
Publisher URL
https://arxiv.org/abs/2410.13460
Related URL
https://arxiv.org/abs/2410.13460
Organization
Wirtschaft  
Digital Sustainability Lab  
Institut Public Sector Transformation (IPST)  
Publisher
Ithaca, NY
Submitter
Wilhelm, Seraina
Citation apa
Stern Ronja, Kawamura Ken, Stürmer, M., Chalkidis Ilias, & Niklaus, J. (2024). Breaking the Manual Annotation Bottleneck: Creating a Comprehensive  Legal Case Criticality Dataset through Semi-Automated Labeling. Ithaca, NY. https://doi.org/10.24451/dspace/11453
File(s)
Loading...
Thumbnail Image
Download

open access

Name

Breaking the Manual Annotation Bottleneck Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling.pdf

License
Attribution 4.0 International
Size

536.73 KB

Format

Adobe PDF

Checksum (MD5)

2deb03f60404c47e285b2a367a000f2b

About ARBOR

Built with DSpace-CRIS software - System hosted and mantained by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Our institution