Literature by the same author
plus at Google Scholar

Bibliografische Daten exportieren
 

Organising Knowledge from Text: Prompt-Based Triple Extraction and Graph Enrichment with Large Language Models

Title data

Nandini, Durgesh ; Blöthner, Simon ; Larch, Mario ; Schönfeld, Mirco:
Organising Knowledge from Text: Prompt-Based Triple Extraction and Graph Enrichment with Large Language Models.
In: Balke, Wolf-Tilo ; Golub, Koraljka ; Manolopoulos, Yannis ; Stefanidis, Kostas ; Zhang, Zheying (ed.): Linking Theory and Practice of Digital Libraries : 29th International Conference on Theory and Practice of Digital Libraries, TPDL 2025, Tampere, Finland, September 23–26, 2025, Proceedings. - Cham : Springer , 2025 . - pp. 53-70 . - (Lecture Notes in Computer Science ; 16097 )
ISBN 978-3-032-05409-8
DOI: https://doi.org/10.1007/978-3-032-05409-8_5

Official URL: Volltext

Related URLs

Project information

Project title:
Project's official title
Project's id
Berücksichtigung von kontextuellen Faktoren und strukturellen Gegebenheiten in einem dynamischen Rahmen (KONECO)
16DKWN095

Project financing: Bundesministerium für Bildung, Forschung, Technologie und Raumfahrt (BMFTR)

Abstract in another language

Creating and organising structured knowledge from unstructured text is a fundamental problem in computer science, with applications ranging from search and reasoning to prediction and recommendation systems. Knowledge Graphs (KGs) offer a powerful abstraction for organising information as entities and their semantic relationships, but they depend on high-quality, domain-specific relational data. Extracting this data remains particularly challenging when source texts are lengthy, formal, and context-dependent. This work presents a framework combining prompt-engineered Large Language Models (LLMs) with knowledge graph embedding methods to support automatic knowledge creation and integration. We explore different prompting strategies, zero-shot, one-shot, few-shot, and prompts with negative examples, for extracting subject–predicate–object triples from complex legal texts. The extracted triples are used to build a supplemental KG, which is merged with an existing domain-specific graph and embedded using TransE for downstream relation prediction tasks. As a use case, we focus on the trade domain and use Regional Trade Agreements (RTAs) as the primary source of textual data. RTAs are challenging documents due to their legally formal and often ambiguous language, and they require domain knowledge to be interpreted effectively. Yet, they encode rich contextual information about international economic relations. Our results show that LLM-extracted triples meaningfully enhance the semantic structure of the KG and improve predictive performance. We further identify which categories of predicates contribute most to these gains.

Further data

Item Type: Article in a book
Refereed: Yes
Keywords: Large Language Models; Knowledge Graphs; Triples Extraction; Contextual Information; Regional Trade Agreements; Trade Volume Prediction
Institutions of the University: Faculties > Faculty of Law, Business and Economics > Department of Economics > Chair Economics VI - Empirical Economic Research > Chair Economics VI - Empirical Economic Research - Univ.-Prof. Dr. Mario Larch
Faculties > Faculty of Languages and Literature > Juniorprofessur Datenmodellierung und interdisziplinäre Wissensgenerierung > Juniorprofessur Datenmodellierung und interdisziplinäre Wissensgenerierung - Juniorprof. Dr. Mirco Schönfeld
Research Institutions > Central research institutes > Research Center for AI in Science and Society
Result of work at the UBT: Yes
DDC Subjects: 000 Computer Science, information, general works > 004 Computer science
300 Social sciences > 330 Economics
Date Deposited: 14 Oct 2025 08:23
Last Modified: 14 Oct 2025 08:23
URI: https://eref.uni-bayreuth.de/id/eprint/94906