Willkommen auf den GitHub Pages des Lehrstuhls für Informatik 6 (Datenmanagement) an der Universität Erlangen-Nürnberg.
Hier finden Sie Materialien zu unseren Lehrveranstaltungen und Projekten.
Lehrveranstaltungen
-
Kursübersicht:
- Dozent: Dominik Probst, M.Sc.
- Umfang: 5 ECTS
- Turnus: Jedes Sommersemester
- Sprache: Englisch
- Struktur: Besteht aus Vorlesung und Übung
- Studiengänge: u.a. Informatik, International Information Systems, Data Science
- Inhalt: Grundlagen der Datenwissenschaften. Behandelt unter anderem Frequent Pattern Analysis, Klassifikation, Clustering und Outlier Erkennung
Projekte & Publikationen
-
Graph-based QSS: A Graph-based Approach to Quantifying Semantic Similarity for Automated Linear SQL Grading Live Demo Paper (BTW 2025)
Autoren: Leo Köberlein, Dominik Probst, Richard Lenz
Abstract:
Determining the Quantified Semantic Similarity (QSS) between database queries is a critical challenge with broad applications, from query log analysis to automated SQL skill assessment. Traditional methods often rely solely on syntactic comparisons or are limited to checking for semantic equivalence. This paper introduces Graph-based QSS, a novel graph-based approach to measure the semantic dissimilarity between SQL queries. Queries are represented as nodes in an implicit graph, while the transitions between nodes are called edits, which are weighted by semantic dissimilarity. We employ shortest path algorithms to identify the lowest-cost edit sequence between two given queries, thereby defining a quantifiable measure of semantic distance. An empirical study of our prototype suggests that our method provides more accurate and comprehensible grading compared to existing techniques. Moreover, the results indicate that our approach comes close to the quality of manual grading, making it a robust tool for diverse database query comparison tasks.