Automated Extraction of Semantic Information from German Legal Documents

Bernhard Waltl; Jörg Landthaler; Elena Scepankova; Florian Matthes; Thomas Geiger; Christoph Stocker; Christian Schneider

Jusletter IT

Automated Extraction of Semantic Information from German Legal Documents

Autoren/Autorinnen: Bernhard Waltl / Jörg Landthaler / Elena Scepankova / Florian Matthes / Thomas Geiger / Christoph Stocker / Christian Schneider
Kategorie: Beiträge
Region: Deutschland
Rechtsgebiete: Recht und Sprache
Sammlung: Tagungsband IRIS 2017, Top 10 – Peer Reviewed Jury LexisNexis Best Paper Award des IRIS2017
Zitiervorschlag: Bernhard Waltl / Jörg Landthaler / Elena Scepankova / Florian Matthes / Thomas Geiger / Christoph Stocker / Christian Schneider, Automated Extraction of Semantic Information from German Legal Documents, in: Jusletter IT 23. Februar 2017

Based on a collaborative data science environment, and a large document corpus (> 130'000 documents from German tax law) we demonstrate the extraction of semantic information. This paper shows the potential of rule-based text analysis to automatically extract semantic information, such as the year of dispute in cases. Additionally, it demonstrates the extraction of legal definitions in laws and the usage of terms in a defining context. Based on an iterative and interdisciplinary process, legal experts, software engineers, and data scientists evaluate and continuously refine the model used for the computer-supported extraction.

Inhaltsverzeichnis

1. Introduction
2. Related work
3. Research Method
3.1. Data Science Environment
3.2. Legal Document Corpus
4. Structuring Unstructured Information in the German Tax Law
4.1. Rule-based text annotation as an interdisciplinary task
4.2. Extracting (meta-)data from legal documents
4.2.1. Approach
4.2.2. Evaluation
4.2.3. Critical reflection
4.3. Determining legal definitions and defining contexts
4.3.1. Approach
4.3.2. Critical reflection
5. Conclusion and Outlook
6. References

0 Kommentare

Es gibt noch keine Kommentare

Ihr Kommentar zu diesem Beitrag

AbonnentInnen dieser Zeitschrift können sich an der Diskussion beteiligen. Bitte loggen Sie sich ein, um Kommentare verfassen zu können.

Kontakt

Jusletter IT

Automated Extraction of Semantic Information from German Legal Documents

Inhaltsverzeichnis

0 Kommentare

Es gibt noch keine Kommentare

News@Weblaw

Per§onalia

Editions Weblaw

Services

Weblaw Inside

Competence

LegalTech

DocEngine

Loggen Sie sich bitte ein!

Angemeldet als

Lawsearch

Kontakt

Jusletter IT

Automated Extraction of Semantic Information from German Legal Documents

Inhaltsverzeichnis

0 Kommentare

Es gibt noch keine Kommentare

News@Weblaw

Per§onalia

Editions Weblaw

Services

Weblaw Inside

Competence

LegalTech

DocEngine

Community

Verlag

Weiterbildung

Produkte

Hotlinks