2026 |
@inproceedings{Kontogiannis2026,
  title     = {Efficient swap regret minimization in combinatorial bandits},
  author    = {Kontogiannis, Andreas and Pollatos, Vasilis and Mertikopoulos, Panayotis and Panageas, Ioannis},
  url       = {https://arxiv.org/pdf/2602.02087},
  year      = {2026},
  date      = {2026-05-02},
  booktitle = {Twenty-Ninth Annual Conference on Artificial Intelligence and Statistics ({AISTATS} 2026)},
  abstract  = {This paper addresses the problem of designing efficient no-swap regret algorithms for combinatorial bandits, where the number of actions N is exponentially large in the dimensionality of the problem. In this setting, designing efficient no-swap regret translates to sublinear — in horizon T — swap regret with polylogarithmic dependence on N. In contrast to the weaker notion of external regret minimization – a problem which is fairly well understood in the literature – achieving no-swap regret with a polylogarithmic dependence on N has remained elusive in combinatorial bandits. Our paper resolves this challenge, by introducing a no-swap-regret learning algorithm with regret that scales polylogarithmically in N and is tight for the class of combinatorial bandits. To ground our results, we also demonstrate how to implement the proposed algorithm efficiently — that is, with a per-iteration complexity that also scales polylogarithmically in N — across a wide range of well-studied applications.},
  pubstate  = {published},
  tppubtype = {conference}
}
@article{Bouchouras2026,
  title     = {Leveraging {LLMs} for Collaborative Ontology Engineering in {Parkinson} Disease Monitoring and Alerting},
  author    = {Bouchouras, Georgios and Doumanas, Dimitrios and Soularidis, Andreas and Kotis, Konstantinos and Vouros, George},
  url       = {https://www.mdpi.com/2673-2688/7/4/139},
  doi       = {10.3390/ai7040139},
  issn      = {2673-2688},
  year      = {2026},
  date      = {2026-04-14},
  journal   = {AI},
  volume    = {7},
  number    = {4},
  pages     = {139},
  abstract  = {Ontology engineering plays a critical role in clinical decision support systems for Parkinson’s Disease (PD) monitoring and alerting. While Large Language Models (LLMs) have shown promise in knowledge modeling tasks, their effectiveness in autonomously constructing comprehensive ontologies for complex clinical domains remains unclear. This study investigates four ontology engineering methodologies for PD monitoring and alerting: One-shot (OS) prompting, Decomposed Sequential Prompting (DSP), X-HCOME, and SimX-HCOME+. Multiple LLMs were evaluated across these methodologies. Generated ontologies were assessed against a reference PD ontology using structural evaluation metrics focused on classes and object properties. Expert review was additionally conducted to analyze knowledge extensions beyond the gold standard. LLMs were able to autonomously generate syntactically valid and semantically meaningful ontologies using OS and DSP prompting; however, these ontologies exhibited limited conceptual coverage. Incorporating human expertise through X-HCOME significantly improved ontology completeness and evaluation metrics. Expert review further validated clinically relevant concepts absent from the reference ontology. SimX-HCOME+ demonstrated that iterative, supervised collaboration supports ontology refinement, although challenges persisted in natural language-to-rule formalization. The findings suggest that LLMs are more effective as collaborative assistants rather than standalone ontology engineers in the PD domain. Structured human–LLM collaboration is associated with improved ontology coverage and facilitates the identification of potential knowledge extensions in clinical monitoring applications. While the present evaluation focuses primarily on structural ontology elements, the proposed methodologies provide useful insights for LLM-assisted ontology engineering in complex healthcare domains.},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Doumanas2026b,
  title     = {Unbiasing {Greek}: In-Context Learning Strategies for Gender Bias Identification and Mitigation for Legal Documents and Job Ads},
  author    = {Doumanas, Dimitrios and Soularidis, Andreas and Zafeiropoulos, Nikolaos and Chatzistamatis, Stamatis and Tsekouras, George E. and El Saer, Andreas and Nathanailidis, Chrisaphis and Kotis, Konstantinos},
  url       = {https://www.mdpi.com/2078-2489/17/4/342},
  doi       = {10.3390/info17040342},
  year      = {2026},
  date      = {2026-04-02},
  journal   = {Information},
  volume    = {17},
  number    = {4},
  pages     = {342},
  abstract  = {Gender bias embedded in legal and professional texts perpetuates systemic inequality, yet research on bias identification and mitigation remains largely confined to English. Morphologically rich languages such as Greek, where grammatical gender pervades nouns, adjectives, pronouns, and participles, present unique challenges that existing approaches fail to address. This paper elaborates on a systematic methodology primarily focusing on identifying and mitigating gender bias in Greek-language job advertisements and legal documents. To accomplish that task, we define a taxonomy of nine gender bias rules tailored to the linguistic properties of Greek and construct domain-specific annotated datasets comprising 90 expert-curated few-shot examples across both textual domains. Using these resources, we employ XML-structured prompt engineering with in-context learning (ICL) and systematically compare three classes of models: (i) commercial large language models (LLMs), namely Claude Sonnet 4.5 and GPT-5.2, (ii) two open-weight small language models (SLMs), Mistral Small (24B) and Ministral (14B), and (iii) Llama Krikri (8B), a Greek-native language model built on Llama 3.1 and fine-tuned on high-quality Greek corpora. For each input text, the system identifies biased expressions, maps them to specific bias rules, provides explanations, and generates a fully corrected inclusive version. Our experiments reveal substantial performance disparities across model scales and linguistic specialization, with LLMs demonstrating superior contextual reasoning and SLMs exhibiting systematic over-correction and grammatical errors in Greek morphology. We further introduce a critical meta-rule addressing gender agreement with named entities to prevent spurious corrections in legal texts referencing identified individuals. The findings highlight the importance of model scale, language-specific adaptation, and carefully designed prompting strategies for bias mitigation in underrepresented languages.},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Kontogiannis2026b,
  title      = {The computational complexity of avoiding strict saddle points in constrained optimization},
  author     = {Kontogiannis, Andreas and Panageas, Ioannis and Pollatos, Vasilis},
  url        = {https://arxiv.org/abs/2604.02285},
  doi        = {10.48550/arXiv.2604.02285},
  eprint     = {2604.02285},
  eprinttype = {arXiv},
  year       = {2026},
  date       = {2026-04-02},
  journal    = {arXiv},
  abstract   = {While first-order stationary points (FOSPs) are the traditional targets of non-convex optimization, they often correspond to undesirable strict saddle points. To circumvent this, attention has shifted towards second-order stationary points (SOSPs). In unconstrained settings, finding approximate SOSPs is PLS-complete (Kontogiannis et al.), matching the complexity of finding unconstrained FOSPs (Hollender and Zampetakis). However, the complexity of finding SOSPs in constrained settings remained notoriously unclear and was highlighted as an important open question by both aforementioned works. Under one strict definition, even verifying whether a point is an approximate SOSP is NP-hard (Murty and Kabadi). Under another widely adopted, relaxed definition where non-negative curvature is required only along the null space of the active constraints, the problem lies in TFNP, and algorithms with O(poly(1/epsilon)) running times have been proposed (Lu et al.). In this work, we settle the complexity of constrained SOSP by proving that computing an epsilon-approximate SOSP under the tractable definition is PLS-complete. We demonstrate that our result holds even in the 2D unit square [0,1]^2, and remarkably, even when stationary points are isolated at a distance of Omega(1) from the domain’s boundary. Our result establishes a fundamental barrier: unless PLS is a subset of PPAD (implying PLS = CLS), no deterministic, iterative algorithm with an efficient, continuous update rule can exist for finding approximate SOSPs. This contrasts with the constrained first-order counterpart, for which Fearnley et al. showed that finding an approximate KKT point is CLS-complete. Finally, our result yields the first problem defined in a compact domain to be shown PLS-complete beyond the canonical Real-LocalOpt (Daskalakis and Papadimitriou).},
  pubstate   = {published},
  tppubtype  = {article}
}
@article{Santipantakis2026,
  title     = {Semantic Data Transformation, {FAIRification} and Provenance for Data Spaces},
  author    = {Santipantakis, Georgios M. and Doulkeridis, Christos and Brimos, Petros},
  url       = {https://www.sciencedirect.com/science/article/pii/S2352340926002283},
  doi       = {10.1016/j.dib.2026.112675},
  issn      = {2352-3409},
  year      = {2026},
  date      = {2026-03-10},
  journal   = {Data in Brief},
  volume    = {66},
  pages     = {112675},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Kenteris2026,
  title     = {The Convergence of Federated Learning, Knowledge Graphs, and Large Language Models for Language Learning: A Scoping Review},
  author    = {Kenteris, Michael and Kotis, Konstantinos},
  url       = {https://www.mdpi.com/2076-3417/16/5/2611},
  doi       = {10.3390/app16052611},
  year      = {2026},
  date      = {2026-03-09},
  journal   = {Applied Sciences},
  volume    = {16},
  number    = {5},
  pages     = {2611},
  abstract  = {Large Language Models (LLMs) in Intelligent Computer-Assisted Language Learning enable highly personalized learning, yet raise significant challenges related to pedagogical grounding, data privacy, and instructional validity. Although Knowledge Graphs (KGs) and Federated Learning (FL) can mitigate these issues in isolation, evidence on systematic FL–KG–LLM integration for educational language learning remains limited. This scoping review maps the FL–KG–LLM convergence landscape. Following PRISMA-ScR guidelines, we searched six databases and screened 51 papers (2019–2025) using automated extraction. Our findings indicate limited convergence: no papers integrate all three domains, and 58.8% of approaches remain confined to isolated technological silos. Reporting is also uneven across the corpus, with an average “Not Reported” (NR) rate of 84.5%, most notably for privacy mechanisms (92.2%), validation metrics (90.2%), and Common European Framework of Reference for Languages (CEFR) alignment (88.2%). Domain-specific analysis reveals two distinct patterns: inter-domain gaps (disciplinary silos resulting in expected CEFR absence in single-domain papers) and intra-domain gaps (failure to report domain-critical variables, including 100% parameter NR in FL studies, 86.7% validation NR in KG studies, and 100% CEFR NR in convergence papers). Taken together, these gaps suggest that pedagogical grounding is treated as optional rather than structural. We therefore identify two pillars of pedagogical grounding: a Grounding Pillar, which constrains LLM outputs via Knowledge Graph rules, and a Validation Pillar, which concerns how authoritative frameworks (e.g., CEFR) are mapped onto Knowledge Graph schemas and evaluated. The near-universal absence of CEFR alignment and validation reporting suggests that this second pillar is currently missing, which we term the Integrity Gap—a systematic disconnection between technological innovation and pedagogical grounding in Intelligent Computer-Assisted Language Learning. By reframing the problem as upstream control and validation, this review informs the design of user-facing automated systems where trust, transparency, and human oversight are critical.},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Alevizos2026,
  title     = {Online spatial reasoning for complex event recognition},
  author    = {Alevizos, Elias and Santipantakis, Georgios M. and Doulkeridis, Christos and Artikis, Alexander},
  url       = {https://link.springer.com/article/10.1007/s10707-026-00569-z},
  doi       = {10.1007/s10707-026-00569-z},
  year      = {2026},
  date      = {2026-03-03},
  journal   = {GeoInformatica},
  volume    = {30},
  number    = {1},
  pages     = {9},
  abstract  = {Complex Event Recognition (CER) systems have the ability to process streams of events by detecting event patterns with minimal latency. Typically, these patterns have a temporal structure, often resembling the sequential structure of regular expressions. A pattern advances to the next state by checking various conditions on the current and possibly previous events of the stream. CER systems are very efficient in tracking all the possible paths that a pattern may follow and report when a path is complete and a complex event must be reported. In some cases, the conditions that need to be checked may be spatial. For example, in maritime situational awareness, a condition may need to check whether a vessel is close to any other vessel. Such conditions are not easily expressed directly as regular expressions. For such spatio-temporal tasks, there exist dedicated modules which can evaluate this type of conditions efficiently. Thus, we can integrate such a spatio-temporal module within a CER system in order to take advantage of both worlds: the CER engine can accommodate and process complex regular expressions and delegate the evaluation of expensive spatio-temporal tasks to a dedicated module whenever it needs to. We present an approach towards such an integration. We describe how a CER engine, based on symbolic automata, can cooperate with a spatio-temporal link discovery (stLD) module such that the former can leverage the spatio-temporal capabilities of the latter. This cooperation can take place in an online manner rendering the whole system suitable for real-time processing of event streams. We discuss two different communication schemes between the CER engine and the spatio-temporal module and explore when each one should be preferred. We provide a theoretical estimation of the predicted performance of the system under each communication scheme. Our extensive experimental evaluation confirms most of our theoretical predictions.},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Papadopoulos2026,
  title      = {Learning to maintain safety through expert demonstrations in settings with unknown constraints: A {Q}-learning perspective},
  author     = {Papadopoulos, George and Vouros, George A.},
  url        = {https://arxiv.org/abs/2602.23816},
  doi        = {10.48550/arXiv.2602.23816},
  eprint     = {2602.23816},
  eprinttype = {arXiv},
  year       = {2026},
  date       = {2026-02-27},
  journal    = {arXiv},
  abstract   = {Given a set of trajectories demonstrating the execution of a task safely in a constrained MDP with observable rewards but with unknown constraints and non-observable costs, we aim to find a policy that maximizes the likelihood of demonstrated trajectories trading the balance between being conservative and increasing significantly the likelihood of high-rewarding trajectories but with potentially unsafe steps. Having these objectives, we aim towards learning a policy that maximizes the probability of the most $promising$ trajectories with respect to the demonstrations. In so doing, we formulate the “promise” of individual state-action pairs in terms of $Q$ values, which depend on task-specific rewards as well as on the assessment of states’ safety, mixing expectations in terms of rewards and safety. This entails a safe Q-learning perspective of the inverse learning problem under constraints: The devised Safe $Q$ Inverse Constrained Reinforcement Learning (SafeQIL) algorithm is compared to state-of-the-art inverse constraint reinforcement learning algorithms to a set of challenging benchmark tasks, showing its merits.},
  pubstate   = {published},
  tppubtype  = {article}
}
@article{Doumanas2026,
  title     = {{ReaDS-KG}: An {LLM}-Knowledge Graph Framework for Reasoned Decision Support in Dynamic Safety-Critical Domains},
  author    = {Doumanas, Dimitrios and Kotis, Konstantinos},
  url       = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176826793.34811491/v1},
  doi       = {10.36227/techrxiv.176826793.34811491/v1},
  year      = {2026},
  date      = {2026-01-13},
  journal   = {TechRxiv},
  abstract  = {Safety-critical domains such as military operations, border security, and search-and-rescue must operate under uncertainty, severe time pressure, and continuously changing conditions. In these settings, decision-support systems must not only provide accurate recommendations but also make the underlying reasoning explicit and auditable. This paper introduces ReaDS-KG (Reasoned Decision Support over Knowledge Graphs), an LLM-Knowledge Graph framework that delivers reasoned rather than purely predictive support. ReaDS-KG represents domain knowledge, assets, constraints, and causal dependencies in an ontology-driven knowledge graph, and uses a large language model to (i) translate natural-language questions into Cypher queries, (ii) orchestrate graph-based reasoning over causal structures, and (iii) return narrative answers with explicit justifications grounded in the graph. The framework follows a five-stage pipeline: ontology design, data-to-KG transformation, causal enrichment, LLM-mediated querying, and scenario-based evaluation. To demonstrate its applicability, we instantiate ReaDS-KG in a synthetic brigade-level operational scenario and pose twenty decision-oriented questions, covering feasibility, mobility, sustainment, command-and-control robustness, and risk. We then compare an LLM+KG agent powered by ReaDS-KG to ten active-duty officers using an eight-dimensional scoring rubric. The agent achieves decision-support quality comparable to field-grade officers and clearly above junior officers, while responding at machine response speed and providing transparent reasoning chains. These results suggest that ReaDS-KG can function as a quasi-expert, explainable staff assistant in dynamic safety-critical domains, and the architecture is readily transferable to other safety-critical settings that share similar uncertainty and causal-reasoning requirements, such as border management and disaster response.},
  pubstate  = {published},
  tppubtype = {article}
}
2025 |
Andreas Soularidis, Dimitrios Doumanas, Konstantinos Kotis, George A Vouros Automating agentic collaborative ontology engineering with role-playing simulation of LLM-powered agents and RAG technology Journal Article The Knowledge Engineering Review, 40, pp. e10, 2025. @article{Soularidis2025, title = {Automating agentic collaborative ontology engineering with role-playing simulation of LLM-powered agents and RAG technology}, author = {Andreas Soularidis, Dimitrios Doumanas, Konstantinos Kotis, George A Vouros}, doi = {https://doi.org/10.1017/S026988892510009X}, year = {2025}, date = {2025-12-19}, journal = {The Knowledge Engineering Review}, volume = {40}, pages = {e10}, abstract = {Motivated by the astonishing capabilities of large language models (LLMs) in text-generation, reasoning, and simulation of complex human behaviors, in this paper, we propose a novel multi-component LLM-based framework, namely LLM4ACOE, that fully automates the collaborative ontology engineering (COE) process using role-playing simulation of LLM agents and retrieval augmented generation (RAG) technology. The proposed solution enhances the LLM-powered role-playing simulation with RAG ‘feeding’ the LLM with three different types of external knowledge. This knowledge corresponds to the knowledge required by each of the COE roles (agents), using a component-based framework, as follows: (a) domain-specific data-centric documents, (b) OWL documentation, and (c) ReAct guidelines. The aforementioned components are evaluated in combination, with the aim of investigating their impact on the quality of generated ontologies. 
The aim of this work is twofold, (a) to identify the capacity of LLM-based agents to generate acceptable (by human-experts) ontologies through agentic collaborative ontology engineering (ACOE) role-playing simulation, at specific levels of acceptance (accuracy, validity, and expressiveness of ontologies) without human intervention and (b) to investigate whether and/or to what extent the selected RAG components affect the quality of the generated ontologies. The evaluation of this novel approach is performed using ChatGPT-o in the domain of search and rescue (SAR) missions. To assess the generated ontologies, quantitative and qualitative measures are employed, focusing on coverage, expressiveness, structure, and human involvement.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Motivated by the astonishing capabilities of large language models (LLMs) in text-generation, reasoning, and simulation of complex human behaviors, in this paper, we propose a novel multi-component LLM-based framework, namely LLM4ACOE, that fully automates the collaborative ontology engineering (COE) process using role-playing simulation of LLM agents and retrieval augmented generation (RAG) technology. The proposed solution enhances the LLM-powered role-playing simulation with RAG ‘feeding’ the LLM with three different types of external knowledge. This knowledge corresponds to the knowledge required by each of the COE roles (agents), using a component-based framework, as follows: (a) domain-specific data-centric documents, (b) OWL documentation, and (c) ReAct guidelines. The aforementioned components are evaluated in combination, with the aim of investigating their impact on the quality of generated ontologies. 
The aim of this work is twofold, (a) to identify the capacity of LLM-based agents to generate acceptable (by human-experts) ontologies through agentic collaborative ontology engineering (ACOE) role-playing simulation, at specific levels of acceptance (accuracy, validity, and expressiveness of ontologies) without human intervention and (b) to investigate whether and/or to what extent the selected RAG components affect the quality of the generated ontologies. The evaluation of this novel approach is performed using ChatGPT-o in the domain of search and rescue (SAR) missions. To assess the generated ontologies, quantitative and qualitative measures are employed, focusing on coverage, expressiveness, structure, and human involvement. |
Apostolos Glenis, George Vouros Scalable Univariate and Multivariate Time-Series Classifiers with Deep Learning Methods Exploiting Symbolic Representations Journal Article Computers, 14 (12), pp. 563, 2025. @article{Glenis2025, title = {Scalable Univariate and Multivariate Time-Series Classifiers with Deep Learning Methods Exploiting Symbolic Representations}, author = {Apostolos Glenis, George Vouros}, url = {https://www.mdpi.com/2073-431X/14/12/563}, doi = {https://doi.org/10.3390/computers14120563}, year = {2025}, date = {2025-12-17}, journal = {Computers}, volume = {14}, number = {12}, pages = {563}, abstract = {Time-series classification (TSC) is an important task across sciences. Symbolic representations (especially SFA) are very effective at combating noise. In this paper, we employ symbolic representations to create state-of-the-art time-series classifiers, with the aim to advance scalability without sacrificing accuracy. First, we create a graph representation of the time series based on SFA words. We use this representation together with graph kernels and an SVM classifier to create a scalable time-series classifier. Next, we use the graph representation together with a Graph Convolutional Neural Network to test how it fares against state-of-the-art time-series classifiers. Additionally, we devised deep neural networks exploiting the SFA representation, inspired by the text classification domain, to study how they fare against state-of-the-art classifiers. The proposed deep learning classifiers have been adapted and evaluated for the multivariate time-series case and also against state-of-the-art time-series classification algorithms based on symbolic representations.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Time-series classification (TSC) is an important task across sciences. Symbolic representations (especially SFA) are very effective at combating noise. 
In this paper, we employ symbolic representations to create state-of-the-art time-series classifiers, with the aim to advance scalability without sacrificing accuracy. First, we create a graph representation of the time series based on SFA words. We use this representation together with graph kernels and an SVM classifier to create a scalable time-series classifier. Next, we use the graph representation together with a Graph Convolutional Neural Network to test how it fares against state-of-the-art time-series classifiers. Additionally, we devised deep neural networks exploiting the SFA representation, inspired by the text classification domain, to study how they fare against state-of-the-art classifiers. The proposed deep learning classifiers have been adapted and evaluated for the multivariate time-series case and also against state-of-the-art time-series classification algorithms based on symbolic representations. |
Georgios Bouchouras, Dimitrios Doumanas, Andreas Soularidis, Konstantinos Kotis, George A Vouros Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting Journal Article arXiv, 2025. @article{Bouchouras2025, title = {Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting}, author = {Georgios Bouchouras, Dimitrios Doumanas, Andreas Soularidis, Konstantinos Kotis, George A Vouros}, url = {https://arxiv.org/pdf/2512.14288}, doi = {https://doi.org/10.48550/arXiv.2512.14288}, year = {2025}, date = {2025-12-16}, journal = {arXiv}, abstract = {This paper explores the integration of Large Language Models (LLMs) in the engineering of a Parkinson’s Disease (PD) monitoring and alerting ontology through four key methodologies: One Shot (OS) prompt techniques, Chain of Thought (CoT) prompts, X-HCOME, and SimX-HCOME+. The primary objective is to determine whether LLMs alone can create comprehensive ontologies and, if not, whether human-LLM collaboration can achieve this goal. Consequently, the paper assesses the effectiveness of LLMs in automated ontology development and the enhancement achieved through human-LLM collaboration. Initial ontology generation was performed using One Shot (OS) and Chain of Thought (CoT) prompts, demonstrating the capability of LLMs to autonomously construct ontologies for PD monitoring and alerting. However, these outputs were not comprehensive and required substantial human refinement to enhance their completeness and accuracy. X-HCOME, a hybrid ontology engineering approach that combines human expertise with LLM capabilities, showed significant improvements in ontology comprehensiveness. This methodology resulted in ontologies that are very similar to those constructed by experts. 
Further experimentation with SimX-HCOME+, another hybrid methodology emphasizing continuous human supervision and iterative refinement, highlighted the importance of ongoing human involvement. This approach led to the creation of more comprehensive and accurate ontologies. Overall, the paper underscores the potential of human-LLM collaboration in advancing ontology engineering, particularly in complex domains like PD. The results suggest promising directions for future research, including the development of specialized GPT models for ontology construction.}, keywords = {}, pubstate = {published}, tppubtype = {article} } This paper explores the integration of Large Language Models (LLMs) in the engineering of a Parkinson’s Disease (PD) monitoring and alerting ontology through four key methodologies: One Shot (OS) prompt techniques, Chain of Thought (CoT) prompts, X-HCOME, and SimX-HCOME+. The primary objective is to determine whether LLMs alone can create comprehensive ontologies and, if not, whether human-LLM collaboration can achieve this goal. Consequently, the paper assesses the effectiveness of LLMs in automated ontology development and the enhancement achieved through human-LLM collaboration. Initial ontology generation was performed using One Shot (OS) and Chain of Thought (CoT) prompts, demonstrating the capability of LLMs to autonomously construct ontologies for PD monitoring and alerting. However, these outputs were not comprehensive and required substantial human refinement to enhance their completeness and accuracy. X-HCOME, a hybrid ontology engineering approach that combines human expertise with LLM capabilities, showed significant improvements in ontology comprehensiveness. This methodology resulted in ontologies that are very similar to those constructed by experts. 
Further experimentation with SimX-HCOME+, another hybrid methodology emphasizing continuous human supervision and iterative refinement, highlighted the importance of ongoing human involvement. This approach led to the creation of more comprehensive and accurate ontologies. Overall, the paper underscores the potential of human-LLM collaboration in advancing ontology engineering, particularly in complex domains like PD. The results suggest promising directions for future research, including the development of specialized GPT models for ontology construction. |
Asimina Dimara, Konstantinos Kotis, Stamatis Chatzistamatis, Nikolaos Evangeliou, Chrysaphis Nathanailidis, George E Tsekouras Towards Effective Data Process Pipelines for Legal NLP in English and Non-English Languages: A Greek Case Study Conference Computing, Communications and IoT Applications (ComComAp), 2025, ISBN: 979-8-3315-9143-4. @conference{Dimara2025b, title = {Towards Effective Data Process Pipelines for Legal NLP in English and Non-English Languages: A Greek Case Study}, author = {Asimina Dimara, Konstantinos Kotis, Stamatis Chatzistamatis, Nikolaos Evangeliou, Chrysaphis Nathanailidis, George E Tsekouras}, url = {https://ieeexplore.ieee.org/abstract/document/11353184}, doi = {https://doi.org/10.1109/ComComAp68359.2025.11353184}, isbn = {979-8-3315-9143-4}, year = {2025}, date = {2025-12-14}, booktitle = {Computing, Communications and IoT Applications (ComComAp)}, abstract = {Natural Language Processing (NLP) pipelines form the backbone of legal artificial intelligence applications, yet most existing tools are designed for English corpora and perform poorly when transferred to morphologically rich, non-English languages. This paper investigates these limitations through a comparative study of English and Greek legal texts. It is shown that English-centric pipelines exhibit systematic errors in preprocessing (tokenization, lemmatization, stop-word removal) and fail to capture legal semantics in embeddings, resulting in degraded downstream performance. To address these issues, a generalized framework is proposed that introduces language-specific preprocessing, curated legal resources, and multilingual embeddings fine-tuned on legal corpora. A case study demonstrates how adapted tools substantially improve similarity scores and classification accuracy in Greek legal texts, while highlighting persistent challenges such as grammatical gender bias. 
The findings underscore the need for fairness-aware, language-specific NLP pipelines to support robust and inclusive legal AI across diverse jurisdictions.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Natural Language Processing (NLP) pipelines form the backbone of legal artificial intelligence applications, yet most existing tools are designed for English corpora and perform poorly when transferred to morphologically rich, non-English languages. This paper investigates these limitations through a comparative study of English and Greek legal texts. It is shown that English-centric pipelines exhibit systematic errors in preprocessing (tokenization, lemmatization, stop-word removal) and fail to capture legal semantics in embeddings, resulting in degraded downstream performance. To address these issues, a generalized framework is proposed that introduces language-specific preprocessing, curated legal resources, and multilingual embeddings fine-tuned on legal corpora. A case study demonstrates how adapted tools substantially improve similarity scores and classification accuracy in Greek legal texts, while highlighting persistent challenges such as grammatical gender bias. The findings underscore the need for fairness-aware, language-specific NLP pipelines to support robust and inclusive legal AI across diverse jurisdictions. |
Alexandros Karakikes, Konstantinos Kotis AI-Assisted OSINT/SOCMINT for Safeguarding Borders: A Systematic Review Journal Article Information, 16 (12), pp. 1095, 2025, ISSN: 2078-2489. @article{Karakikes2025, title = {AI-Assisted OSINT/SOCMINT for Safeguarding Borders: A Systematic Review}, author = {Alexandros Karakikes, Konstantinos Kotis}, url = {https://www.mdpi.com/2078-2489/16/12/1095}, doi = {https://doi.org/10.3390/info16121095}, issn = {2078-2489}, year = {2025}, date = {2025-12-10}, journal = {Information}, volume = {16}, number = {12}, pages = {1095}, abstract = {In the highly volatile realm of global security, the necessity for leading-edge and effectual border resilience tactics has never been more imperative. This PRISMA 2020 guided systematic literature review (SLR) examines the intersection of artificial intelligence (AI), open-source intelligence (OSINT), and social media intelligence (SOCMINT) for enhancing border protection. Our systematic investigation across major databases (IEEE Xplore, Scopus, SpringerLink, MDPI, ACM) and grey literature sources yielded 3932 initial records and, after screening and eligibility assessment, 73 studies and reports from acknowledged organizations, contributing to the evidence synthesis. Three research questions (RQ1–RQ3) were addressed concerning the following: (a) the effectiveness and application of AI in OSINT/SOCMINT for border protection, its (b) data, technical, and operational limitations, and its (c) ethical, legal, and societal implications (GELSI). Evidence matrices summarize the findings, while narrative syntheses underline and thematically group the extracted insights. Results indicate that AI techniques—fluctuating from machine learning (ML) and natural language processing (NLP) to computer vision and emerging large language models (LLMs)—produce quantifiable improvements in forecasting irregular migration, detecting human trafficking, and supporting multimodal intelligence fusion. 
However, limitations include misinformation, data bias, adversarial vulnerabilities, governance deficits, and sandbox-to-production gaps. Ethical and societal concerns highlight risks of surveillance overreach, discrimination, and insufficient oversight, among others. To our knowledge, this is the first SLR at this intersection. We conclude that, AI-assisted OSINT/SOCMINT presents transformative potential for border protection requiring, nonetheless, balanced governance, robust validation, and future research on LLM/agentic AI, human–AI teaming, and oversight mechanisms.}, keywords = {}, pubstate = {published}, tppubtype = {article} } In the highly volatile realm of global security, the necessity for leading-edge and effectual border resilience tactics has never been more imperative. This PRISMA 2020 guided systematic literature review (SLR) examines the intersection of artificial intelligence (AI), open-source intelligence (OSINT), and social media intelligence (SOCMINT) for enhancing border protection. Our systematic investigation across major databases (IEEE Xplore, Scopus, SpringerLink, MDPI, ACM) and grey literature sources yielded 3932 initial records and, after screening and eligibility assessment, 73 studies and reports from acknowledged organizations, contributing to the evidence synthesis. Three research questions (RQ1–RQ3) were addressed concerning the following: (a) the effectiveness and application of AI in OSINT/SOCMINT for border protection, its (b) data, technical, and operational limitations, and its (c) ethical, legal, and societal implications (GELSI). Evidence matrices summarize the findings, while narrative syntheses underline and thematically group the extracted insights. 
Results indicate that AI techniques—fluctuating from machine learning (ML) and natural language processing (NLP) to computer vision and emerging large language models (LLMs)—produce quantifiable improvements in forecasting irregular migration, detecting human trafficking, and supporting multimodal intelligence fusion. However, limitations include misinformation, data bias, adversarial vulnerabilities, governance deficits, and sandbox-to-production gaps. Ethical and societal concerns highlight risks of surveillance overreach, discrimination, and insufficient oversight, among others. To our knowledge, this is the first SLR at this intersection. We conclude that AI-assisted OSINT/SOCMINT presents transformative potential for border protection requiring, nonetheless, balanced governance, robust validation, and future research on LLM/agentic AI, human–AI teaming, and oversight mechanisms. |
Dimitris Kostadimas, Vlasios Kasapakis, Konstantinos Kotis Exploiting VR, AIoT and Semantics Towards an Adaptive Virtual Museum Conference 20th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP), 2025, ISBN: 979-8-3315-8704-8. @conference{Kostadimas2025b, title = {Exploiting VR, AIoT and Semantics Towards an Adaptive Virtual Museum}, author = {Dimitris Kostadimas, Vlasios Kasapakis, Konstantinos Kotis}, url = {https://ieeexplore.ieee.org/abstract/document/11309793}, doi = {https://doi.org/10.1109/SMAP66932.2025.00034}, isbn = {979-8-3315-8704-8}, year = {2025}, date = {2025-11-27}, booktitle = {20th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)}, pages = {157-162}, abstract = {Museums have long been spaces of wonder and discovery, but as technology evolves, so do the ways we engage with these cultural treasures. The design of adaptive virtual environments becomes essential to maintaining user interest and relevance. In this paper, an adaptive virtual museum system is proposed that explores the use of virtual reality (VR), artificial intelligence (AI), Internet of Things (IoT) as well as semantics to personalize and optimize virtual exhibition experiences. Based on the results of our previous research conducted regarding the possible combination of VR, AI and IoT (AIoT) for the design of innovative intelligent systems in different domains, our current work proposes a novel way to integrate all these technologies within the domain of cultural heritage (CH), a combination that remains relatively underexplored. The proposed framework, which is currently a work in progress, introduces new ways of modeling museums’ visitor behavior and preferences (mainly by using head-mounted displays (HMDs)) in a VR environment to dynamically adapt exhibition layouts, as well as to provide personalized content through a digital twin (DT) of a real museum. 
A key focus lies in intelligent user profiling and route/layout optimization to enhance visitor engagement and provide rich content through integration of Large Language Models (LLM). Although implementation is ongoing, this paper describes the conceptual design, core objectives, and anticipated impact on the broader scope of adaptive multimedia applications and personalized cultural experiences.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Museums have long been spaces of wonder and discovery, but as technology evolves, so do the ways we engage with these cultural treasures. The design of adaptive virtual environments becomes essential to maintaining user interest and relevance. In this paper, an adaptive virtual museum system is proposed that explores the use of virtual reality (VR), artificial intelligence (AI), Internet of Things (IoT) as well as semantics to personalize and optimize virtual exhibition experiences. Based on the results of our previous research conducted regarding the possible combination of VR, AI and IoT (AIoT) for the design of innovative intelligent systems in different domains, our current work proposes a novel way to integrate all these technologies within the domain of cultural heritage (CH), a combination that remains relatively underexplored. The proposed framework, which is currently a work in progress, introduces new ways of modeling museums’ visitor behavior and preferences (mainly by using head-mounted displays (HMDs)) in a VR environment to dynamically adapt exhibition layouts, as well as to provide personalized content through a digital twin (DT) of a real museum. A key focus lies in intelligent user profiling and route/layout optimization to enhance visitor engagement and provide rich content through integration of Large Language Models (LLM). 
Although implementation is ongoing, this paper describes the conceptual design, core objectives, and anticipated impact on the broader scope of adaptive multimedia applications and personalized cultural experiences. |
Andreas Sideras Konstantinos Bougiatiotis, Elias Zavitsanos Georgios Paliouras George Vouros A Multimodal Alignment-Based Anomaly Detection Method for Bankruptcy Prediction Conference Proceedings of the 6th ACM International Conference on AI in Finance, 2025, ISBN: 9798400722202. @conference{Sideras2025, title = {A Multimodal Alignment-Based Anomaly Detection Method for Bankruptcy Prediction}, author = {Andreas Sideras, Konstantinos Bougiatiotis, Elias Zavitsanos, Georgios Paliouras, George Vouros}, url = {https://dl.acm.org/doi/full/10.1145/3768292.3770380}, doi = {https://doi.org/10.1145/3768292.3770380}, isbn = {9798400722202}, year = {2025}, date = {2025-11-15}, booktitle = {Proceedings of the 6th ACM International Conference on AI in Finance}, pages = {53-61}, abstract = {We present a novel anomaly detection method for next-year bankruptcy prediction, utilizing a combination of financial figures and textual content from annual reports. Our approach, MABAD, learns a shared representation space where non-bankrupt firms share position and orientation. Samples that deviate from this pattern are assigned a higher anomaly score. The proposed method is tailored for highly imbalanced scenarios and is robust to heterogeneous, incomplete, and potentially contradictory inputs. We demonstrate that MABAD consistently outperforms a range of strong baselines, and we also curate and release a new publicly available multisource dataset to foster further research in the domain.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } We present a novel anomaly detection method for next-year bankruptcy prediction, utilizing a combination of financial figures and textual content from annual reports. Our approach, MABAD, learns a shared representation space where non-bankrupt firms share position and orientation. Samples that deviate from this pattern are assigned a higher anomaly score. 
The proposed method is tailored for highly imbalanced scenarios and is robust to heterogeneous, incomplete, and potentially contradictory inputs. We demonstrate that MABAD consistently outperforms a range of strong baselines, and we also curate and release a new publicly available multisource dataset to foster further research in the domain. |
Elias Zavitsanos Konstantinos Bougiatiotis, Andreas Sideras Georgios Paliouras Positive-Unlabeled Learning for Financial Misstatement Detection under Realistic Constraints Conference ICAIF ’25: Proceedings of the 6th ACM International Conference on AI in Finance, 2025, ISBN: 9798400722202. @conference{Zavitsanos2025, title = {Positive-Unlabeled Learning for Financial Misstatement Detection under Realistic Constraints}, author = {Elias Zavitsanos, Konstantinos Bougiatiotis, Andreas Sideras, Georgios Paliouras}, url = {https://dl.acm.org/doi/full/10.1145/3768292.3770366 https://dl.acm.org/doi/epdf/10.1145/3768292.3770366}, doi = {https://doi.org/10.1145/3768292.3770366}, isbn = {9798400722202}, year = {2025}, date = {2025-11-15}, booktitle = {ICAIF ’25: Proceedings of the 6th ACM International Conference on AI in Finance}, pages = {864-872}, abstract = {Detecting financial misstatements is critical for market integrity but remains challenging due to class imbalance, delayed discovery, and limited labeled data. We propose a novel Positive-Unlabeled (PU) learning framework that models the detection task under realistic constraints, where only a small subset of misstatements is known at training time. Our approach integrates unlabeled data into training, preserves temporal structure, and accounts for extreme imbalance. We construct and release a benchmark dataset reflecting these characteristics and evaluate several PU learning methods against recent baselines. Results show that PU-based models consistently outperform supervised approaches, highlighting their suitability for real-world misstatement detection.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Detecting financial misstatements is critical for market integrity but remains challenging due to class imbalance, delayed discovery, and limited labeled data. 
We propose a novel Positive-Unlabeled (PU) learning framework that models the detection task under realistic constraints, where only a small subset of misstatements is known at training time. Our approach integrates unlabeled data into training, preserves temporal structure, and accounts for extreme imbalance. We construct and release a benchmark dataset reflecting these characteristics and evaluate several PU learning methods against recent baselines. Results show that PU-based models consistently outperform supervised approaches, highlighting their suitability for real-world misstatement detection. |
Dimitrios Doumanas Andreas Soularidis, Konstantinos Kotis Causal Reasoning and Large Language Models for Military Decision-Making: Rethinking the Command Structures in the Era of Generative AI Journal Article AI, 7 (1), pp. 14, 2025, ISSN: 2673-2688. @article{Doumanas2025e, title = {Causal Reasoning and Large Language Models for Military Decision-Making: Rethinking the Command Structures in the Era of Generative AI}, author = {Dimitrios Doumanas, Andreas Soularidis, Konstantinos Kotis}, url = {https://www.mdpi.com/2673-2688/7/1/14}, doi = {https://doi.org/10.3390/ai7010014}, issn = {2673-2688}, year = {2025}, date = {2025-10-24}, journal = {AI}, volume = {7}, number = {1}, pages = {14}, abstract = {Military decision-making is inherently complex and highly critical, requiring commanders to assess multiple variables in real-time, anticipate second-order effects, and adapt strategies based on continuously evolving battlefield conditions. Traditional approaches rely on domain expertise, experience, and intuition, often supported by decision-support systems designed by military experts. With the rapid advancement of Large Language Models (LLMs) such as ChatGPT, Claude, and DeepSeek, a new research question emerges: can LLMs perform causal reasoning at a level that could meaningfully replace human decision-makers, or should they remain human-led decision-support tools in high-stakes environments? This paper explores the causal reasoning capabilities of LLMs for operational and strategic military decisions. Unlike conventional AI models that rely primarily on correlation-based predictions, LLMs are now able to engage in multi-perspective reasoning, intervention analysis, and scenario-based assessments. We introduce a structured empirical evaluation framework to assess LLM performance through 10 de-identified real-world-inspired battle scenarios, ensuring models reason over provided inputs rather than memorized data. 
Critically, LLM outputs are systematically compared against a human expert baseline, composed of military officers across multiple ranks and years of operational experience. The evaluation focuses on precision, recall, causal reasoning depth, adaptability, and decision soundness. Our findings provide a rigorous comparative assessment of whether carefully prompted LLMs can assist, complement, or approach expert-level performance in military planning. While fully autonomous AI-led command remains premature, the results suggest that LLMs can offer valuable support in complex decision processes when integrated as part of hybrid human-AI decision-support frameworks. Since our evaluation directly tests this capability, this paradigm shift raises fundamental question: Is there a possibility to fully replace high-ranking officers/commanders in leading critical military operations, or should AI-driven tools remain as decision-support systems enhancing human-driven battlefield strategies?}, keywords = {}, pubstate = {published}, tppubtype = {article} } Military decision-making is inherently complex and highly critical, requiring commanders to assess multiple variables in real-time, anticipate second-order effects, and adapt strategies based on continuously evolving battlefield conditions. Traditional approaches rely on domain expertise, experience, and intuition, often supported by decision-support systems designed by military experts. With the rapid advancement of Large Language Models (LLMs) such as ChatGPT, Claude, and DeepSeek, a new research question emerges: can LLMs perform causal reasoning at a level that could meaningfully replace human decision-makers, or should they remain human-led decision-support tools in high-stakes environments? This paper explores the causal reasoning capabilities of LLMs for operational and strategic military decisions. 
Unlike conventional AI models that rely primarily on correlation-based predictions, LLMs are now able to engage in multi-perspective reasoning, intervention analysis, and scenario-based assessments. We introduce a structured empirical evaluation framework to assess LLM performance through 10 de-identified real-world-inspired battle scenarios, ensuring models reason over provided inputs rather than memorized data. Critically, LLM outputs are systematically compared against a human expert baseline, composed of military officers across multiple ranks and years of operational experience. The evaluation focuses on precision, recall, causal reasoning depth, adaptability, and decision soundness. Our findings provide a rigorous comparative assessment of whether carefully prompted LLMs can assist, complement, or approach expert-level performance in military planning. While fully autonomous AI-led command remains premature, the results suggest that LLMs can offer valuable support in complex decision processes when integrated as part of hybrid human-AI decision-support frameworks. Since our evaluation directly tests this capability, this paradigm shift raises a fundamental question: Is there a possibility to fully replace high-ranking officers/commanders in leading critical military operations, or should AI-driven tools remain as decision-support systems enhancing human-driven battlefield strategies? |
Theodore Tranos Nikolaos Fesakis, Thomas Vasileiou Sotirios Christopoulos Georgio Loukos Maria Koutsoupidou 2025 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT Europe), IEEE, 2025, ISBN: 979-8-3315-2503-3. @conference{Tranos2025, title = {AI-Based Energy Forecasting at Different Distribution Grid Levels to Support Baseline Definition and DSO Participation in LFMs}, author = {Theodore Tranos, Nikolaos Fesakis, Thomas Vasileiou, Sotirios Christopoulos, Georgio Loukos, Maria Koutsoupidou}, url = {https://ieeexplore.ieee.org/abstract/document/11305676}, doi = {https://doi.org/10.1109/ISGTEurope64741.2025.11305676}, isbn = {979-8-3315-2503-3}, year = {2025}, date = {2025-10-20}, booktitle = {2025 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT Europe)}, pages = {1-5}, publisher = {IEEE}, abstract = {A crucial aspect of Local Flexibility Markets (LFMs) is the definition of a baseline for energy production and demand forecasting, which serves as a reference for validating and compensating flexibility services. In this study, we explore the application of machine learning techniques, specifically Long Short-Term Memory (LSTM) networks, to establish accurate baselines for consumers and producers connected to the LV grid. The LSTM models leverage real historical demand and generation data from DSO smart meters in Mesogeia, Greece, combined with weather variables such as temperature and cloud coverage, to enhance forecasting accuracy. 
Our goal is to evaluate forecasting accuracy at the individual participant level and compare it with the accuracy obtained from forecasting on aggregated consumption/production data within a specific grid segment or using data from the secondary substation to which the participants are connected.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } A crucial aspect of Local Flexibility Markets (LFMs) is the definition of a baseline for energy production and demand forecasting, which serves as a reference for validating and compensating flexibility services. In this study, we explore the application of machine learning techniques, specifically Long Short-Term Memory (LSTM) networks, to establish accurate baselines for consumers and producers connected to the LV grid. The LSTM models leverage real historical demand and generation data from DSO smart meters in Mesogeia, Greece, combined with weather variables such as temperature and cloud coverage, to enhance forecasting accuracy. Our goal is to evaluate forecasting accuracy at the individual participant level and compare it with the accuracy obtained from forecasting on aggregated consumption/production data within a specific grid segment or using data from the secondary substation to which the participants are connected. |
Asimina Dimara Konstantinos Kotis, Alexios Papaioannou Stamatis Chatzistamatis Nikolaos Evangeliou Chrysaphis Nathanailidis George Tsekouras 5th International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), 2025, ISBN: 979-8-3315-3556-8. @conference{Dimara2025, title = {Data Collection, Organization, and Privacy-Preserving Preparation for Edge-Based LLMs in Legal Text Analytics}, author = {Asimina Dimara, Konstantinos Kotis, Alexios Papaioannou, Stamatis Chatzistamatis, Nikolaos Evangeliou, Chrysaphis Nathanailidis, George Tsekouras}, url = {https://ieeexplore.ieee.org/abstract/document/11277858}, doi = {https://doi.org/10.1109/ICECCME64568.2025.11277858}, isbn = {979-8-3315-3556-8}, year = {2025}, date = {2025-10-16}, booktitle = {5th International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)}, abstract = {Providing fairness and privacy in automated legal text processing is an essential issue, especially with the increasing usage of Large Language Models (LLMs), in sensitive public sector applications. This paper presents a modular edge native domain-specific architecture for legal document processing that avoids cloud infrastructure and external APIs. The system combines local ingestion, semantic embedding, and retrieval-augmented generation to empower autonomous agents for applications such as bias detection and clause summarization. Inference is done exclusively on-device by a 4-bit quantized LLaMA model run by CPU-only runtimes. Tested on the CLEAR-Bias benchmark, the system gets 92% prompt relevance and 90% output coherence, inference latency below 6.5 s, and memory usage below 5.5 GB. 
These findings validate the effectiveness of privacy-preserving, regulation-conforming legal NLP in constrained environments.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Providing fairness and privacy in automated legal text processing is an essential issue, especially with the increasing usage of Large Language Models (LLMs), in sensitive public sector applications. This paper presents a modular edge native domain-specific architecture for legal document processing that avoids cloud infrastructure and external APIs. The system combines local ingestion, semantic embedding, and retrieval-augmented generation to empower autonomous agents for applications such as bias detection and clause summarization. Inference is done exclusively on-device by a 4-bit quantized LLaMA model run by CPU-only runtimes. Tested on the CLEAR-Bias benchmark, the system gets 92% prompt relevance and 90% output coherence, inference latency below 6.5 s, and memory usage below 5.5 GB. These findings validate the effectiveness of privacy-preserving, regulation-conforming legal NLP in constrained environments. |
Eleftherios Efkleidis Stefanou Pavlos Bitilis, Georgios Bouchouras Konstantinos Kotis Collecting, Integrating and Processing IoT Sensor Data on Edge Devices for PD Monitoring: A Scoping Review Journal Article Applied Sciences, 15 (19), pp. 10541, 2025, ISSN: 2076-3417. @article{Stefanou2025b, title = {Collecting, Integrating and Processing IoT Sensor Data on Edge Devices for PD Monitoring: A Scoping Review}, author = {Eleftherios Efkleidis Stefanou, Pavlos Bitilis, Georgios Bouchouras, Konstantinos Kotis}, url = {https://www.mdpi.com/2076-3417/15/19/10541}, doi = {https://doi.org/10.3390/app151910541}, issn = {2076-3417}, year = {2025}, date = {2025-09-29}, journal = {Applied Sciences}, volume = {15}, number = {19}, pages = {10541}, abstract = {Bradykinesia and tremor are critical motor symptoms in diagnosing and monitoring Parkinson’s disease (PD), a progressive neurodegenerative disorder. The integration of IoT sensors, smartwatch technology, and edge computing has facilitated real-time collection, processing, and analysis of data related to these impairments, enabling continuous monitoring of PD beyond traditional clinical settings. This survey provides a comprehensive review of recent technological advancements in data collection from wearable IoT sensors and its semantic integration and processing on edge devices, emphasizing methods optimized for efficient and low-latency processing. Additionally, this survey explores AI-driven techniques for detecting and analyzing bradykinesia and tremor symptoms on edge devices. By leveraging localized computation on edge devices, these approaches facilitate energy efficiency, data privacy, and scalability, making them suitable for deployment in real environments. This paper also examines related open-source tools and datasets, assessing their roles in improving reproducibility and integration into these environments. 
Furthermore, key challenges, including variability in real environments, model generalization, and computational constraints, are discussed, along with potential strategies to enhance detection accuracy and system robustness. By bridging the gap between sensor data collection and integration, and AI-based detection of bradykinesia and tremor on edge devices, this survey intends to contribute to the development of efficient, scalable, and privacy-preserving healthcare solutions for continuous PD monitoring.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Bradykinesia and tremor are critical motor symptoms in diagnosing and monitoring Parkinson’s disease (PD), a progressive neurodegenerative disorder. The integration of IoT sensors, smartwatch technology, and edge computing has facilitated real-time collection, processing, and analysis of data related to these impairments, enabling continuous monitoring of PD beyond traditional clinical settings. This survey provides a comprehensive review of recent technological advancements in data collection from wearable IoT sensors and its semantic integration and processing on edge devices, emphasizing methods optimized for efficient and low-latency processing. Additionally, this survey explores AI-driven techniques for detecting and analyzing bradykinesia and tremor symptoms on edge devices. By leveraging localized computation on edge devices, these approaches facilitate energy efficiency, data privacy, and scalability, making them suitable for deployment in real environments. This paper also examines related open-source tools and datasets, assessing their roles in improving reproducibility and integration into these environments. Furthermore, key challenges, including variability in real environments, model generalization, and computational constraints, are discussed, along with potential strategies to enhance detection accuracy and system robustness. 
By bridging the gap between sensor data collection and integration, and AI-based detection of bradykinesia and tremor on edge devices, this survey intends to contribute to the development of efficient, scalable, and privacy-preserving healthcare solutions for continuous PD monitoring. |
Andreas Kontogiannis Vasilis Pollatos, Gabriele Farina Panayotis Mertikopoulos Ioannis Panageas The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025 poster), 2025. @conference{Kontogiannis2025b, title = {Efficient kernelized learning in polyhedral games beyond full-information: From Colonel Blotto to congestion games}, author = {Andreas Kontogiannis, Vasilis Pollatos, Gabriele Farina, Panayotis Mertikopoulos, Ioannis Panageas}, url = {https://openreview.net/attachment?id=FUBaZDMOFj&name=pdf https://arxiv.org/pdf/2509.20919}, doi = {https://doi.org/10.48550/arXiv.2509.20919}, year = {2025}, date = {2025-09-25}, booktitle = {The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025 poster)}, journal = {The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025 poster)}, abstract = {We examine the problem of efficiently learning coarse correlated equilibria (CCE) in polyhedral games, that is, normal-form games with an exponentially large number of actions per player and an underlying combinatorial structure—such as the classic Colonel Blotto game or congestion games. Achieving computational efficiency in this setting requires learning algorithms whose regret and per-iteration complexity scale at most polylogarithmically with the size of the players’ action sets. This challenge has recently been addressed in the full-information setting, primarily through the use of kernelization; however, in the more realistic partial information setting, the situation is much more challenging, and existing approaches result in suboptimal and impractical runtime complexity to learn CCE. We address this gap via a novel kernelization-based framework for payoff-based learning in polyhedral games, which we then apply to certain key classes of polyhedral games—namely Colonel Blotto, graphic matroid and network congestion games. 
In so doing, we obtain a range of computationally efficient payoff-based learning algorithms which significantly improve upon prior work in terms of the runtime for learning CCE.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } We examine the problem of efficiently learning coarse correlated equilibria (CCE) in polyhedral games, that is, normal-form games with an exponentially large number of actions per player and an underlying combinatorial structure—such as the classic Colonel Blotto game or congestion games. Achieving computational efficiency in this setting requires learning algorithms whose regret and per-iteration complexity scale at most polylogarithmically with the size of the players’ action sets. This challenge has recently been addressed in the full-information setting, primarily through the use of kernelization; however, in the more realistic partial information setting, the situation is much more challenging, and existing approaches result in suboptimal and impractical runtime complexity to learn CCE. We address this gap via a novel kernelization-based framework for payoff-based learning in polyhedral games, which we then apply to certain key classes of polyhedral games—namely Colonel Blotto, graphic matroid and network congestion games. In so doing, we obtain a range of computationally efficient payoff-based learning algorithms which significantly improve upon prior work in terms of the runtime for learning CCE. |
Adam Koletis Pavlos Bitilis, Georgios Bouchouras Konstantinos Kotis Information, 16 (9), pp. 820, 2025, ISSN: 2078-2489. @article{Koletis2025, title = {A Comparative Analysis of Parkinson’s Disease Diagnosis Approaches Using Drawing-Based Datasets: Utilizing Large Language Models, Machine Learning, and Fuzzy Ontologies}, author = {Adam Koletis, Pavlos Bitilis, Georgios Bouchouras, Konstantinos Kotis}, url = {https://www.mdpi.com/2078-2489/16/9/820}, doi = {https://doi.org/10.3390/info16090820}, issn = {2078-2489}, year = {2025}, date = {2025-09-22}, journal = {Information}, volume = {16}, number = {9}, pages = {820}, abstract = {Parkinson’s disease (PD) is a progressive neurodegenerative disorder that impairs motor function, often causing tremors and difficulty with movement control. A promising diagnostic method involves analyzing hand-drawn patterns, such as spirals and waves, which show characteristic distortions in individuals with PD. This study compares three computational approaches for classifying individuals as Parkinsonian or healthy based on drawing-derived features: (1) Large Language Models (LLMs), (2) traditional machine learning (ML) algorithms, and (3) a fuzzy ontology-based method using fuzzy sets and Fuzzy-OWL2. Each method offers unique strengths: LLMs leverage pre-trained knowledge for subtle pattern detection, ML algorithms excel in feature extraction and predictive accuracy, and fuzzy ontologies provide interpretable, logic-based reasoning under uncertainty. Using three structured handwriting datasets of varying complexity, we assessed performance in terms of accuracy, interpretability, and generalization. Among the approaches, the fuzzy ontology-based method showed the strongest performance on complex tasks, achieving a high F1-score, while ML models demonstrated strong generalization and LLMs offered a reliable, interpretable baseline. 
These findings suggest that combining symbolic and statistical AI may improve drawing-based PD diagnosis.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Parkinson’s disease (PD) is a progressive neurodegenerative disorder that impairs motor function, often causing tremors and difficulty with movement control. A promising diagnostic method involves analyzing hand-drawn patterns, such as spirals and waves, which show characteristic distortions in individuals with PD. This study compares three computational approaches for classifying individuals as Parkinsonian or healthy based on drawing-derived features: (1) Large Language Models (LLMs), (2) traditional machine learning (ML) algorithms, and (3) a fuzzy ontology-based method using fuzzy sets and Fuzzy-OWL2. Each method offers unique strengths: LLMs leverage pre-trained knowledge for subtle pattern detection, ML algorithms excel in feature extraction and predictive accuracy, and fuzzy ontologies provide interpretable, logic-based reasoning under uncertainty. Using three structured handwriting datasets of varying complexity, we assessed performance in terms of accuracy, interpretability, and generalization. Among the approaches, the fuzzy ontology-based method showed the strongest performance on complex tasks, achieving a high F1-score, while ML models demonstrated strong generalization and LLMs offered a reliable, interpretable baseline. These findings suggest that combining symbolic and statistical AI may improve drawing-based PD diagnosis. |
Theocharis Kravaris, George A Vouros Transferable aircraft trajectory prediction with generative deep imitation learning Journal Article International Journal of Data Science and Analytics, 20 (3), pp. 1977-1999, 2025. @article{Kravaris2025, title = {Transferable aircraft trajectory prediction with generative deep imitation learning}, author = {Theocharis Kravaris, George A Vouros}, url = {https://link.springer.com/article/10.1007/s41060-024-00574-1}, doi = {https://doi.org/10.1007/s41060-024-00574-1}, year = {2025}, date = {2025-09-01}, journal = {International Journal of Data Science and Analytics}, volume = {20}, number = {3}, pages = {1977-1999}, abstract = {Trajectory-oriented transformations to air traffic management operations require high fidelity aircraft trajectory prediction capabilities. Data-driven trajectory prediction approaches provide promising results, albeit with important limitations that hinder seriously the efficient and effective deployment of trajectory prediction methods: They need abundant training effort with a large amount of training samples and require training distinct models for different origin–destination (OD) airport pairs. In this paper, we address the problem of building transferable trajectory prediction models, casting the prediction problem as a transferable imitation task, introducing a novel formulation which (a) provides the capability to utilize trained models, in new OD pairs, offering a warm starting for computationally efficient training, and (b) improves the efficacy of data-driven trajectory prediction. The proposed approach provides very accurate results for large look-ahead time predictions, even if transferable models have been trained with few samples.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Trajectory-oriented transformations to air traffic management operations require high fidelity aircraft trajectory prediction capabilities. 
Data-driven trajectory prediction approaches provide promising results, albeit with important limitations that hinder seriously the efficient and effective deployment of trajectory prediction methods: They need abundant training effort with a large amount of training samples and require training distinct models for different origin–destination (OD) airport pairs. In this paper, we address the problem of building transferable trajectory prediction models, casting the prediction problem as a transferable imitation task, introducing a novel formulation which (a) provides the capability to utilize trained models, in new OD pairs, offering a warm starting for computationally efficient training, and (b) improves the efficacy of data-driven trajectory prediction. The proposed approach provides very accurate results for large look-ahead time predictions, even if transferable models have been trained with few samples. |
Dimitrios Doumanas Alexandros Karakikes, Andreas Soularidis Efstathios Mainas Konstantinos Kotis Emerging Threat Vectors: How Malicious Actors Exploit LLMs to Undermine Border Security Journal Article AI, 6 (9), pp. 232, 2025, ISSN: 2673-2688. @article{Doumanas2025d, title = {Emerging Threat Vectors: How Malicious Actors Exploit LLMs to Undermine Border Security}, author = {Dimitrios Doumanas, Alexandros Karakikes, Andreas Soularidis, Efstathios Mainas, Konstantinos Kotis}, url = {https://www.mdpi.com/2673-2688/6/9/232}, doi = {https://doi.org/10.3390/ai6090232}, issn = {2673-2688}, year = {2025}, date = {2025-09-01}, journal = {AI}, volume = {6}, number = {9}, pages = {232}, abstract = {The rapid proliferation of Large Language Models (LLMs) has democratized access to advanced generative capabilities while raising urgent concerns about misuse in sensitive security domains. Border security, in particular, represents a high-risk environment where malicious actors may exploit LLMs for document forgery, synthetic identity creation, logistics planning, or disinformation campaigns. Existing studies often highlight such risks in theory, yet few provide systematic empirical evidence of how state-of-the-art LLMs can be exploited. This paper introduces the Silent Adversary Framework (SAF), a structured pipeline that models the sequential stages by which obfuscated prompts can covertly bypass safeguards. We evaluate ten high-risk scenarios using five leading models—GPT-4o, Claude 3.7 Sonnet, Gemini 2.5 Flash, Grok 3, and Runway Gen-2—and assess outputs through three standardized metrics: Bypass Success Rate (BSR), Output Realism Score (ORS), and Operational Risk Level (ORL). Results reveal that, while all models exhibited some susceptibility, vulnerabilities were heterogeneous. Claude showed greater resistance in chemistry-related prompts, whereas GPT-4o and Gemini generated highly realistic outputs in identity fraud and logistics optimization tasks. 
Document forgery attempts produced only partially successful templates that lacked critical security features. These findings highlight the uneven distribution of risks across models and domains. By combining a reproducible adversarial framework with empirical testing, this study advances the evidence base on LLM misuse and provides actionable insights for policymakers and border security agencies, underscoring the need for stronger safeguards and oversight in the deployment of generative AI.}, keywords = {}, pubstate = {published}, tppubtype = {article} } The rapid proliferation of Large Language Models (LLMs) has democratized access to advanced generative capabilities while raising urgent concerns about misuse in sensitive security domains. Border security, in particular, represents a high-risk environment where malicious actors may exploit LLMs for document forgery, synthetic identity creation, logistics planning, or disinformation campaigns. Existing studies often highlight such risks in theory, yet few provide systematic empirical evidence of how state-of-the-art LLMs can be exploited. This paper introduces the Silent Adversary Framework (SAF), a structured pipeline that models the sequential stages by which obfuscated prompts can covertly bypass safeguards. We evaluate ten high-risk scenarios using five leading models—GPT-4o, Claude 3.7 Sonnet, Gemini 2.5 Flash, Grok 3, and Runway Gen-2—and assess outputs through three standardized metrics: Bypass Success Rate (BSR), Output Realism Score (ORS), and Operational Risk Level (ORL). Results reveal that, while all models exhibited some susceptibility, vulnerabilities were heterogeneous. Claude showed greater resistance in chemistry-related prompts, whereas GPT-4o and Gemini generated highly realistic outputs in identity fraud and logistics optimization tasks. Document forgery attempts produced only partially successful templates that lacked critical security features. 
These findings highlight the uneven distribution of risks across models and domains. By combining a reproducible adversarial framework with empirical testing, this study advances the evidence base on LLM misuse and provides actionable insights for policymakers and border security agencies, underscoring the need for stronger safeguards and oversight in the deployment of generative AI. |
Konstantinos Kotis Eleni Angoura, Eleni-Ioanna Lyngri Emerging technologies in smart libraries for visually impaired people: challenges and design considerations Journal Article ACM Journal on Computing and Cultural Heritage, 18 (3), pp. 1-37, 2025, ISSN: 1556-4673. @article{Kotis2025, title = {Emerging technologies in smart libraries for visually impaired people: challenges and design considerations}, author = {Konstantinos Kotis, Eleni Angoura, Eleni-Ioanna Lyngri}, url = {https://dl.acm.org/doi/full/10.1145/3727965}, doi = {https://doi.org/10.1145/3727965}, issn = {1556-4673}, year = {2025}, date = {2025-07-24}, journal = {ACM Journal on Computing and Cultural Heritage}, volume = {18}, number = {3}, pages = {1-37}, abstract = {Emerging technologies are transforming cultural spaces in a variety of ways, presenting opportunities and challenges. Autonomous robots, eXtended Reality, AI, Digital Twins, and Internet of Things are only a few examples of such technologies, with accessibility and inclusivity of people to these technologies to be considered key challenges. In general, the use of emerging technologies in cultural spaces presents exciting opportunities for enhancing visitors’ experience and engaging new participants. However, it is important to also consider the inclusion ability of people with special needs and to ensure that these emerging technologies are used in an accessible-to-all and inclusive way. 
The aim of this article is to review the state-of-the-art and current trends in approaches that use emerging technologies in the domain of smart libraries designed to include visually impaired people in a common innovative way for the whole community of visitors, discuss open issues and challenges identified in such a cultural environment/case, and propose a novel approach based on specific design considerations of the specific domain.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Emerging technologies are transforming cultural spaces in a variety of ways, presenting opportunities and challenges. Autonomous robots, eXtended Reality, AI, Digital Twins, and Internet of Things are only a few examples of such technologies, with accessibility and inclusivity of people to these technologies to be considered key challenges. In general, the use of emerging technologies in cultural spaces presents exciting opportunities for enhancing visitors’ experience and engaging new participants. However, it is important to also consider the inclusion ability of people with special needs and to ensure that these emerging technologies are used in an accessible-to-all and inclusive way. The aim of this article is to review the state-of-the-art and current trends in approaches that use emerging technologies in the domain of smart libraries designed to include visually impaired people in a common innovative way for the whole community of visitors, discuss open issues and challenges identified in such a cultural environment/case, and propose a novel approach based on specific design considerations of the specific domain. |
Sotiris Angelis Joana Pinho, Athanasia Sykiotou Dimitar Markov Stamatis Chatzistamatis Stamatis Spirou George Tsekouras Konstantinos Kotis RRAO: An Ontology for the Representation of Reoffending Risk Assessment Knowledge Conference 16th International Conference on Information, Intelligence, Systems & Applications (IISA), 2025, ISBN: 979-8-3315-5636-5. @conference{Angelis2025, title = {RRAO: An Ontology for the Representation of Reoffending Risk Assessment Knowledge}, author = {Sotiris Angelis, Joana Pinho, Athanasia Sykiotou, Dimitar Markov, Stamatis Chatzistamatis, Stamatis Spirou, George Tsekouras, Konstantinos Kotis}, doi = {https://doi.org/10.1109/IISA66859.2025.11311249}, isbn = {979-8-3315-5636-5}, year = {2025}, date = {2025-07-10}, booktitle = {16th International Conference on Information, Intelligence, Systems & Applications (IISA)}, pages = {1-9}, abstract = {Judicial decision making related to parole, sentencing, rehabilitation, reintegration, and public safety is often supported by the assessment of the risk of reoffending. AI prediction systems can introduce bias in the analysis of reoffending risk-related data. Several studies criticize the fairness of such AI systems. This paper presents the Reoffending Risk Assessment Ontology (RRAO) which aims to provide a comprehensive representation of reoffending risk and recidivism knowledge integrated into ontology-based AI systems. RRAO is engineered following the X-HCOME ontology engineering (OE) methodology, which provides a hybrid bottom-up (data driven), top-down (expert knowledge) OE approach, including tasks designed to assess and mitigate bias at the schema level. 
By developing a bias-free risk assessment ontology, our objective is to enhance the fairness of AI-driven ontology-based reoffending risk prediction systems, ultimately contributing to more fair and effective criminal justice practices.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Judicial decision making related to parole, sentencing, rehabilitation, reintegration, and public safety is often supported by the assessment of the risk of reoffending. AI prediction systems can introduce bias in the analysis of reoffending risk-related data. Several studies criticize the fairness of such AI systems. This paper presents the Reoffending Risk Assessment Ontology (RRAO) which aims to provide a comprehensive representation of reoffending risk and recidivism knowledge integrated into ontology-based AI systems. RRAO is engineered following the X-HCOME ontology engineering (OE) methodology, which provides a hybrid bottom-up (data driven), top-down (expert knowledge) OE approach, including tasks designed to assess and mitigate bias at the schema level. By developing a bias-free risk assessment ontology, our objective is to enhance the fairness of AI-driven ontology-based reoffending risk prediction systems, ultimately contributing to more fair and effective criminal justice practices. |
Eleftherios-Efkleidis Stefanou Pavlos Bitilis, Konstantinos Kotis Current Status, Trends and Challenges in AI-Based Bradykinesia and Tremor Detection on Edge Devices Conference 16th International Conference on Information, Intelligence, Systems & Applications (IISA), 2025, ISBN: 979-8-3315-5636-5. @conference{Stefanou2025, title = {Current Status, Trends and Challenges in AI-Based Bradykinesia and Tremor Detection on Edge Devices}, author = {Eleftherios-Efkleidis Stefanou, Pavlos Bitilis, Konstantinos Kotis}, url = {https://ieeexplore.ieee.org/abstract/document/11311304}, doi = {https://doi.org/10.1109/IISA66859.2025.11311304}, isbn = {979-8-3315-5636-5}, year = {2025}, date = {2025-07-10}, booktitle = {16th International Conference on Information, Intelligence, Systems & Applications (IISA)}, pages = {1-4}, abstract = {Bradykinesia and tremor are pivotal indicators in diagnosing and managing Parkinson’s disease (PD), a progressive neurodegenerative disorder. Advances in wearable sensor technologies and AI methods have enabled real-time monitoring of these motor impairments, facilitating continuous assessment outside traditional clinical settings. This short paper focuses on recent advancements in bradykinesia and tremor detection using machine learning (ML) and deep learning (DL) techniques, while also exploring their applicability on edge devices. By leveraging inertial data, these techniques enhance the detection and analysis of movement patterns associated with PD. The paper emphasizes techniques optimized for edge deployment, which enable localized data processing, reduce latency, and enhance privacy. In addition, open-source tools and datasets are highlighted for their role in improving reproducibility and supporting system integration efforts. 
Finally, challenges such as variability in real-world conditions are discussed, along with opportunities for enhancing wearable-based healthcare systems through accurate and reliable motion pattern recognition on edge platforms.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Bradykinesia and tremor are pivotal indicators in diagnosing and managing Parkinson’s disease (PD), a progressive neurodegenerative disorder. Advances in wearable sensor technologies and AI methods have enabled real-time monitoring of these motor impairments, facilitating continuous assessment outside traditional clinical settings. This short paper focuses on recent advancements in bradykinesia and tremor detection using machine learning (ML) and deep learning (DL) techniques, while also exploring their applicability on edge devices. By leveraging inertial data, these techniques enhance the detection and analysis of movement patterns associated with PD. The paper emphasizes techniques optimized for edge deployment, which enable localized data processing, reduce latency, and enhance privacy. In addition, open-source tools and datasets are highlighted for their role in improving reproducibility and supporting system integration efforts. Finally, challenges such as variability in real-world conditions are discussed, along with opportunities for enhancing wearable-based healthcare systems through accurate and reliable motion pattern recognition on edge platforms. |
George Papadopoulos, George A. Vouros Learning safe, constrained policies via imitation learning: Connection to Probabilistic Inference and a Naive Algorithm Journal Article arXiv, 2025. @article{Papadopoulos2025b, title = {Learning safe, constrained policies via imitation learning: Connection to Probabilistic Inference and a Naive Algorithm}, author = {George Papadopoulos and George A. Vouros}, url = {https://arxiv.org/pdf/2507.06780}, doi = {https://doi.org/10.48550/arXiv.2507.06780}, year = {2025}, date = {2025-07-09}, journal = {arXiv}, abstract = {This article introduces an imitation learning method for learning maximum entropy policies that comply with constraints demonstrated by expert trajectories executing a task. The formulation of the method takes advantage of results connecting performance to bounds for the KL-divergence between demonstrated and learned policies, and its objective is rigorously justified through a connection to a probabilistic inference framework for reinforcement learning, incorporating the reinforcement learning objective and the objective to abide by constraints in an entropy maximization setting. The proposed algorithm optimizes the learning objective with dual gradient descent, supporting effective and stable training. Experiments show that the proposed method can learn effective policy models for constraints-abiding behaviour, in settings with multiple constraints of different types, accommodating different modalities of demonstrated behaviour, and with abilities to generalize.}, keywords = {}, pubstate = {published}, tppubtype = {article} } This article introduces an imitation learning method for learning maximum entropy policies that comply with constraints demonstrated by expert trajectories executing a task. 
The formulation of the method takes advantage of results connecting performance to bounds for the KL-divergence between demonstrated and learned policies, and its objective is rigorously justified through a connection to a probabilistic inference framework for reinforcement learning, incorporating the reinforcement learning objective and the objective to abide by constraints in an entropy maximization setting. The proposed algorithm optimizes the learning objective with dual gradient descent, supporting effective and stable training. Experiments show that the proposed method can learn effective policy models for constraints-abiding behaviour, in settings with multiple constraints of different types, accommodating different modalities of demonstrated behaviour, and with abilities to generalize. |
Piyabhum Chaysri Theodoros Tranos, George Papadopoulos George A. Vouros Konstantinos Blekas Efficient Autonomous Marine Vessel Navigation with Safe Deep Reinforcement Learning Conference 2025 Symposium on Maritime Informatics and Robotics (MARIS), 2025. @conference{Chaysri2025, title = {Efficient Autonomous Marine Vessel Navigation with Safe Deep Reinforcement Learning}, author = {Piyabhum Chaysri and Theodoros Tranos and George Papadopoulos and George A. Vouros and Konstantinos Blekas}, doi = {https://doi.org/10.1109/MARIS64137.2025.11139786}, year = {2025}, date = {2025-06-26}, booktitle = {2025 Symposium on Maritime Informatics and Robotics (MARIS)}, abstract = {The rise of automation and self-driving systems brings a strong focus on safety-centric decision-making, especially in complex environments with large degree of uncertainty where unpredictable interactions occur at high frequency. In this study we address the challenge of safe and efficient maritime navigation by proposing a safe Deep Reinforcement Learning scheme for training Unmanned Surface Vehicle (USV) agents. Our approach leverages the Lagrangian relaxation framework to effectively handle safety constraints, ensuring that the learned navigation policies balance goal achievement with obstacle avoidance. We address each type of static and moving obstacle separately with the aim of achieving more effective management of their impact on safe navigation. This enables the design of a more advanced constraint-aware optimization framework, enhancing USV’s ability to navigate complex maritime environment, adapt to changing traffic conditions and maintain a minimal risk of collision. Experiments were conducted in a simulated environment tailored to match realistic weather and traffic density conditions. 
The simulation results highlight the potential of the proposed method in developing advanced USV navigation policies that achieve high accuracy and enhanced safety.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } The rise of automation and self-driving systems brings a strong focus on safety-centric decision-making, especially in complex environments with large degree of uncertainty where unpredictable interactions occur at high frequency. In this study we address the challenge of safe and efficient maritime navigation by proposing a safe Deep Reinforcement Learning scheme for training Unmanned Surface Vehicle (USV) agents. Our approach leverages the Lagrangian relaxation framework to effectively handle safety constraints, ensuring that the learned navigation policies balance goal achievement with obstacle avoidance. We address each type of static and moving obstacle separately with the aim of achieving more effective management of their impact on safe navigation. This enables the design of a more advanced constraint-aware optimization framework, enhancing USV’s ability to navigate complex maritime environment, adapt to changing traffic conditions and maintain a minimal risk of collision. Experiments were conducted in a simulated environment tailored to match realistic weather and traffic density conditions. The simulation results highlight the potential of the proposed method in developing advanced USV navigation policies that achieve high accuracy and enhanced safety. |
Dimitrios Doumanas Efthalia Ntalouka, Costas Vassilakis Manolis Wallace Konstantinos Kotis Stitching History into Semantics: LLM-Supported Knowledge Graph Engineering for 19th-Century Greek Bookbinding Journal Article Machine Learning and Knowledge Extraction, 7 (3), pp. 59, 2025, ISSN: 2504-4990. @article{Doumanas2025c, title = {Stitching History into Semantics: LLM-Supported Knowledge Graph Engineering for 19th-Century Greek Bookbinding}, author = {Dimitrios Doumanas, Efthalia Ntalouka, Costas Vassilakis, Manolis Wallace, Konstantinos Kotis}, url = {https://www.mdpi.com/2504-4990/7/3/59}, doi = {https://doi.org/10.3390/make7030059}, issn = {2504-4990}, year = {2025}, date = {2025-06-24}, journal = {Machine Learning and Knowledge Extraction}, volume = {7}, number = {3}, pages = {59}, abstract = {Preserving cultural heritage can be efficiently supported by structured and semantic representation of historical artifacts. Bookbinding, a critical aspect of book history, provides valuable insights into past craftsmanship, material use, and conservation practices. However, existing bibliographic records often lack the depth needed to analyze bookbinding techniques, provenance, and preservation status. This paper presents a proof-of-concept system that explores how Large Language Models (LLMs) can support knowledge graph engineering within the context of 19th-century Greek bookbinding (1830–1900), and as a result, generate a domain-specific ontology and a knowledge graph. Our ontology encapsulates materials, binding techniques, artistic styles, and conservation history, integrating metadata standards like MARC and Dublin Core to ensure interoperability with existing library and archival systems. To validate its effectiveness, we construct a Neo4j knowledge graph, based on the generated ontology and utilize Cypher Queries—including LLM-generated queries—to extract insights about bookbinding practices and trends. 
This study also explores how semantic reasoning over the knowledge graph can identify historical binding patterns, assess book conservation needs, and infer relationships between bookbinding workshops. Unlike previous bibliographic ontologies, our approach provides a comprehensive, semantically rich representation of bookbinding history, methods and techniques, supporting scholars, conservators, and cultural heritage institutions. By demonstrating how LLMs can assist in ontology/KG creation and query generation, we introduce and evaluate a semi-automated pipeline as a methodological demonstration for studying historical bookbinding, contributing to digital humanities, book conservation, and cultural informatics. Finally, the proposed approach can be used in other domains, thus, being generally applicable in knowledge engineering.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Preserving cultural heritage can be efficiently supported by structured and semantic representation of historical artifacts. Bookbinding, a critical aspect of book history, provides valuable insights into past craftsmanship, material use, and conservation practices. However, existing bibliographic records often lack the depth needed to analyze bookbinding techniques, provenance, and preservation status. This paper presents a proof-of-concept system that explores how Large Language Models (LLMs) can support knowledge graph engineering within the context of 19th-century Greek bookbinding (1830–1900), and as a result, generate a domain-specific ontology and a knowledge graph. Our ontology encapsulates materials, binding techniques, artistic styles, and conservation history, integrating metadata standards like MARC and Dublin Core to ensure interoperability with existing library and archival systems. 
To validate its effectiveness, we construct a Neo4j knowledge graph, based on the generated ontology and utilize Cypher Queries—including LLM-generated queries—to extract insights about bookbinding practices and trends. This study also explores how semantic reasoning over the knowledge graph can identify historical binding patterns, assess book conservation needs, and infer relationships between bookbinding workshops. Unlike previous bibliographic ontologies, our approach provides a comprehensive, semantically rich representation of bookbinding history, methods and techniques, supporting scholars, conservators, and cultural heritage institutions. By demonstrating how LLMs can assist in ontology/KG creation and query generation, we introduce and evaluate a semi-automated pipeline as a methodological demonstration for studying historical bookbinding, contributing to digital humanities, book conservation, and cultural informatics. Finally, the proposed approach can be used in other domains, thus, being generally applicable in knowledge engineering. |
Myrto Stogia Asimina Dimara, Alexios Papaioannou Christos-Nikolaos Anagnostopoulos Konstantinos Kotis Stelios Krinidis The Role of IoT and 3D Modeling in Shaping Industry 5.0 Conference IFIP International Conference on Artificial Intelligence Applications and Innovations, 2025, ISBN: 978-3-031-97313-0. @conference{Stogia2025, title = {The Role of IoT and 3D Modeling in Shaping Industry 5.0}, author = {Myrto Stogia, Asimina Dimara, Alexios Papaioannou, Christos-Nikolaos Anagnostopoulos, Konstantinos Kotis, Stelios Krinidis}, url = {https://link.springer.com/chapter/10.1007/978-3-031-97313-0_27}, doi = {https://doi.org/10.1007/978-3-031-97313-0_27}, isbn = {978-3-031-97313-0}, year = {2025}, date = {2025-06-23}, booktitle = {IFIP International Conference on Artificial Intelligence Applications and Innovations}, pages = {353-366}, abstract = {The shift from Industry 4.0 to Industry 5.0 represents a significant transformation in industrial ecosystems, prioritizing human-machine collaboration, sustainability, and ethical Artificial Intelligence (AI). This paper provides a concise overview of the crucial role played by the Internet of Things (IoT) in advancing Digital Twin (DT) technology, particularly in improving three-dimensional modeling capabilities. IoT-driven DTs facilitate adaptive, efficient, and sustainable industrial operations by integrating real-time data, utilizing predictive analytics, and supporting smart manufacturing. Unlike Industry 4.0, which focuses on automation and cyber-physical systems, Industry 5.0 reintroduces human intelligence to ensure that technological progress aligns with ethical, social, and environmental considerations. This survey examines challenges such as scalability, interoperability, energy efficiency, and cybersecurity while exploring innovations like cognitive DTs, 5G-powered IoT networks, and AI-driven decision-making. 
Additionally, it highlights key technological advancements, including edge computing, neuro-symbolic and conversational AI, blockchain for secure data management, and eco-friendly IoT solutions, paving the way for a resilient, human-centric industrial future.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } The shift from Industry 4.0 to Industry 5.0 represents a significant transformation in industrial ecosystems, prioritizing human-machine collaboration, sustainability, and ethical Artificial Intelligence (AI). This paper provides a concise overview of the crucial role played by the Internet of Things (IoT) in advancing Digital Twin (DT) technology, particularly in improving three-dimensional modeling capabilities. IoT-driven DTs facilitate adaptive, efficient, and sustainable industrial operations by integrating real-time data, utilizing predictive analytics, and supporting smart manufacturing. Unlike Industry 4.0, which focuses on automation and cyber-physical systems, Industry 5.0 reintroduces human intelligence to ensure that technological progress aligns with ethical, social, and environmental considerations. This survey examines challenges such as scalability, interoperability, energy efficiency, and cybersecurity while exploring innovations like cognitive DTs, 5G-powered IoT networks, and AI-driven decision-making. Additionally, it highlights key technological advancements, including edge computing, neuro-symbolic and conversational AI, blockchain for secure data management, and eco-friendly IoT solutions, paving the way for a resilient, human-centric industrial future. |
George Giannakopoulos Andreas Sideras, Konstantinos Stamatakis Nikolaos Melanitis NAVMAT: An AI-supported naval failures knowledge management system Journal Article Expert Systems with Applications, 277 , pp. 127117, 2025. @article{Giannakopoulos2025, title = {NAVMAT: An AI-supported naval failures knowledge management system}, author = {George Giannakopoulos, Andreas Sideras, Konstantinos Stamatakis, Nikolaos Melanitis}, doi = {https://doi.org/10.1016/j.eswa.2025.127117}, year = {2025}, date = {2025-06-05}, journal = {Expert Systems with Applications}, volume = {277}, pages = {127117}, abstract = {We present “NAVMAT”, an intelligent, multilingual knowledge management platform designed to record and categorize material failure incidents reported in naval operations. This paper provides an overview of the platform, identifying its key software components and highlighting the information retrieval approach used to support user workflows. The platform primarily facilitates real-time, multilingual search and intelligent indexing, streamlining the incident management process while offering valuable insights from past incidents and knowledge resources. To achieve this, it employs a customized natural language processing pipeline integrated with a carefully engineered ontology. The ontology, regularly updated by domain experts, enriches the retrieval mechanism by instilling domain specific knowledge. This approach aims to reduce the significant variability in specialized terminology by promoting convergence towards a unified vocabulary.}, keywords = {}, pubstate = {published}, tppubtype = {article} } We present “NAVMAT”, an intelligent, multilingual knowledge management platform designed to record and categorize material failure incidents reported in naval operations. This paper provides an overview of the platform, identifying its key software components and highlighting the information retrieval approach used to support user workflows. 
The platform primarily facilitates real-time, multilingual search and intelligent indexing, streamlining the incident management process while offering valuable insights from past incidents and knowledge resources. To achieve this, it employs a customized natural language processing pipeline integrated with a carefully engineered ontology. The ontology, regularly updated by domain experts, enriches the retrieval mechanism by instilling domain specific knowledge. This approach aims to reduce the significant variability in specialized terminology by promoting convergence towards a unified vocabulary. |
Foteini Oikonomou Eleftherios Bailis, Sotiris Bentos Stamatis Chatzistamatis Marianna Tzortzi Konstantinos Kotis Stamatis Spirou George E. Tsekouras Towards Fair Recidivism Prediction: Addressing Bias in Machine Learning for the Greek Prison System Conference 5th International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), 2025, ISBN: 979-8-3315-3297-0. @conference{Oikonomou2025, title = {Towards Fair Recidivism Prediction: Addressing Bias in Machine Learning for the Greek Prison System}, author = {Foteini Oikonomou and Eleftherios Bailis and Sotiris Bentos and Stamatis Chatzistamatis and Marianna Tzortzi and Konstantinos Kotis and Stamatis Spirou and George E. Tsekouras}, url = {https://ieeexplore.ieee.org/abstract/document/11008007}, doi = {https://doi.org/10.1109/IRASET64571.2025.11008007}, isbn = {979-8-3315-3297-0}, year = {2025}, date = {2025-05-15}, booktitle = {5th International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)}, pages = {1-7}, abstract = {Recidivism prediction has become an essential tool in criminal justice systems, aiding decision-making in areas such as sentencing, parole, and rehabilitation. Machine learning (ML) algorithms have been widely employed to improve the accuracy of recidivism risk assessments. However, concerns about fairness and algorithmic bias have been raised, particularly in high-stakes applications. This study focuses on the Greek prison system, utilizing a dataset from Greek prisons to analyze and mitigate biases in ML-based recidivism predictions. The study primarily investigates the impact of age as a sensitive attribute and employs fairness-aware optimization techniques to reduce disparities in predictive outcomes. By incorporating fairness constraints into the training process, we demonstrate that balancing fairness and accuracy is possible. 
The results indicate that implementing fairness-aware ML models can significantly reduce bias, particularly against younger offenders, while maintaining acceptable predictive performance. Our findings contribute to ongoing discussions on the ethical application of AI in criminal justice and highlight the necessity of fairness-aware methodologies for equitable decision-making.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Recidivism prediction has become an essential tool in criminal justice systems, aiding decision-making in areas such as sentencing, parole, and rehabilitation. Machine learning (ML) algorithms have been widely employed to improve the accuracy of recidivism risk assessments. However, concerns about fairness and algorithmic bias have been raised, particularly in high-stakes applications. This study focuses on the Greek prison system, utilizing a dataset from Greek prisons to analyze and mitigate biases in ML-based recidivism predictions. The study primarily investigates the impact of age as a sensitive attribute and employs fairness-aware optimization techniques to reduce disparities in predictive outcomes. By incorporating fairness constraints into the training process, we demonstrate that balancing fairness and accuracy is possible. The results indicate that implementing fairness-aware ML models can significantly reduce bias, particularly against younger offenders, while maintaining acceptable predictive performance. Our findings contribute to ongoing discussions on the ethical application of AI in criminal justice and highlight the necessity of fairness-aware methodologies for equitable decision-making. |
Andreas Kontogiannis Konstantinos Papathanasiou, Yi Shen Giorgos Stamou Michael M. Zavlanos George Vouros Enhancing cooperative multi-agent reinforcement learning with state modelling and adversarial exploration Journal Article arXiv, 2025. @article{Kontogiannis2025, title = {Enhancing cooperative multi-agent reinforcement learning with state modelling and adversarial exploration}, author = {Andreas Kontogiannis and Konstantinos Papathanasiou and Yi Shen and Giorgos Stamou and Michael M. Zavlanos and George Vouros}, url = {https://arxiv.org/pdf/2505.05262}, doi = {https://doi.org/10.48550/arXiv.2505.05262}, year = {2025}, date = {2025-05-08}, journal = {arXiv}, abstract = {Learning to cooperate in distributed partially observable environments with no communication abilities poses significant challenges for multi-agent deep reinforcement learning (MARL). This paper addresses key concerns in this domain, focusing on inferring state representations from individual agent observations and leveraging these representations to enhance agents’ exploration and collaborative task execution policies. To this end, we propose a novel state modelling framework for cooperative MARL, where agents infer meaningful belief representations of the non-observable state, with respect to optimizing their own policies, while filtering redundant and less informative joint state information. Building upon this framework, we propose the MARL SMPE algorithm. In SMPE, agents enhance their own policy’s discriminative abilities under partial observability, explicitly by incorporating their beliefs into the policy network, and implicitly by adopting an adversarial type of exploration policies which encourages agents to discover novel, high-value states while improving the discriminative abilities of others. 
Experimentally, we show that SMPE outperforms state-of-the-art MARL algorithms in complex fully cooperative tasks from the MPE, LBF, and RWARE benchmarks.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Learning to cooperate in distributed partially observable environments with no communication abilities poses significant challenges for multi-agent deep reinforcement learning (MARL). This paper addresses key concerns in this domain, focusing on inferring state representations from individual agent observations and leveraging these representations to enhance agents’ exploration and collaborative task execution policies. To this end, we propose a novel state modelling framework for cooperative MARL, where agents infer meaningful belief representations of the non-observable state, with respect to optimizing their own policies, while filtering redundant and less informative joint state information. Building upon this framework, we propose the MARL SMPE algorithm. In SMPE, agents enhance their own policy’s discriminative abilities under partial observability, explicitly by incorporating their beliefs into the policy network, and implicitly by adopting an adversarial type of exploration policies which encourages agents to discover novel, high-value states while improving the discriminative abilities of others. Experimentally, we show that SMPE outperforms state-of-the-art MARL algorithms in complex fully cooperative tasks from the MPE, LBF, and RWARE benchmarks. |
Dimitris Kostadimas Vlasios Kasapakis, Konstantinos Kotis A systematic review on the combination of VR, IoT and AI technologies, and their integration in applications Journal Article Future Internet, 17 (4), pp. 163, 2025, ISSN: 1999-5903. @article{Kostadimas2025, title = {A systematic review on the combination of VR, IoT and AI technologies, and their integration in applications}, author = {Dimitris Kostadimas, Vlasios Kasapakis, Konstantinos Kotis}, url = {https://www.mdpi.com/1999-5903/17/4/163}, doi = {https://doi.org/10.3390/fi17040163}, issn = {1999-5903}, year = {2025}, date = {2025-04-07}, journal = {Future Internet}, volume = {17}, number = {4}, pages = {163}, abstract = {The convergence of Virtual Reality (VR), Artificial Intelligence (AI), and the Internet of Things (IoT) offers transformative potential across numerous sectors. However, existing studies often examine these technologies independently or in limited pairings, which overlooks the synergistic possibilities of their combined usage. This systematic review adheres to the PRISMA guidelines in order to critically analyze peer-reviewed literature from highly recognized academic databases related to the intersection of VR, AI, and IoT, and identify application domains, methodologies, tools, and key challenges. By focusing on real-life implementations and working prototypes, this review highlights state-of-the-art advancements and uncovers gaps that hinder practical adoption, such as data collection issues, interoperability barriers, and user experience challenges. The findings reveal that digital twins (DTs), AIoT systems, and immersive XR environments are promising as emerging technologies (ET), but require further development to achieve scalability and real-world impact, while in certain fields a limited amount of research is conducted until now. 
This review bridges theory and practice, providing a targeted foundation for future interdisciplinary research aimed at advancing practical, scalable solutions across domains such as healthcare, smart cities, industry, education, cultural heritage, and beyond. The study found that the integration of VR, AI, and IoT holds significant potential across various domains, with DTs, IoT systems, and immersive XR environments showing promising applications, but challenges such as data interoperability, user experience limitations, and scalability barriers hinder widespread adoption.}, keywords = {}, pubstate = {published}, tppubtype = {article} } The convergence of Virtual Reality (VR), Artificial Intelligence (AI), and the Internet of Things (IoT) offers transformative potential across numerous sectors. However, existing studies often examine these technologies independently or in limited pairings, which overlooks the synergistic possibilities of their combined usage. This systematic review adheres to the PRISMA guidelines in order to critically analyze peer-reviewed literature from highly recognized academic databases related to the intersection of VR, AI, and IoT, and identify application domains, methodologies, tools, and key challenges. By focusing on real-life implementations and working prototypes, this review highlights state-of-the-art advancements and uncovers gaps that hinder practical adoption, such as data collection issues, interoperability barriers, and user experience challenges. The findings reveal that digital twins (DTs), AIoT systems, and immersive XR environments are promising as emerging technologies (ET), but require further development to achieve scalability and real-world impact, while in certain fields a limited amount of research is conducted until now. 
This review bridges theory and practice, providing a targeted foundation for future interdisciplinary research aimed at advancing practical, scalable solutions across domains such as healthcare, smart cities, industry, education, cultural heritage, and beyond. The study found that the integration of VR, AI, and IoT holds significant potential across various domains, with DTs, IoT systems, and immersive XR environments showing promising applications, but challenges such as data interoperability, user experience limitations, and scalability barriers hinder widespread adoption. |
Georgios Bouchouras Georgios Sofianidis, Konstantinos Kotis Predicting freezing of gait in parkinson’s disease: A machine-learning-based approach in on and off medication states Journal Article Journal of Clinical Medicine, 14 (6), pp. 2120, 2025, ISSN: 2077-0383. @article{Bouchouras2025c, title = {Predicting freezing of gait in parkinson’s disease: A machine-learning-based approach in on and off medication states}, author = {Georgios Bouchouras, Georgios Sofianidis, Konstantinos Kotis}, url = {https://www.mdpi.com/2077-0383/14/6/2120}, doi = {https://doi.org/10.3390/jcm14062120}, issn = {2077-0383}, year = {2025}, date = {2025-03-20}, journal = {Journal of Clinical Medicine}, volume = {14}, number = {6}, pages = {2120}, abstract = {Freezing of gait (FoG) is a debilitating motor symptom of Parkinson’s disease (PD), characterized by sudden episodes where patients struggle to initiate or sustain movement, often describing a sensation of their feet being “glued to the ground.” This study investigates the potential of machine-learning (ML) models to predict FoG severity in PD patients, focusing on the influence of dopaminergic medication by comparing gait parameters in ON and OFF medication states. Methods: Specifically, this study employed spatiotemporal gait features to develop a predictive model for FoG severity, leveraging a random forest regressor to identify the most influential gait parameters associated with this in each medication state. The results indicate that the model achieved higher predictive performance in the OFF-medication condition (R² = 0.82, MAE = 2.25, MSE = 15.23) compared to the ON-medication condition (R² = 0.52, MAE = 4.16, MSE = 42.00). Results: These findings suggest that dopaminergic treatment alters gait dynamics, potentially reducing the reliability of FoG predictions when patients are medicated. Feature importance analysis revealed distinct gait characteristics associated with FoG severity across medication states. 
In the OFF condition, step length parameters, particularly left step length mean, were the most dominant predictors, alongside swing time and stride width, indicating the role of spatial and temporal gait control in FoG severity without medication. In contrast, under the ON medication condition, stride width and gait speed emerged as the most influential predictors, followed by stepping frequency, reflecting how medication influences stability and movement rhythm. Conclusions: These findings highlight the need for predictive models that account for medication-induced gait variability, ensuring more reliable FoG detection. By integrating spatiotemporal gait analysis and ML-based prediction, this study contributes to the development of personalized intervention strategies for PD patients experiencing FoG episodes.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Freezing of gait (FoG) is a debilitating motor symptom of Parkinson’s disease (PD), characterized by sudden episodes where patients struggle to initiate or sustain movement, often describing a sensation of their feet being “glued to the ground.” This study investigates the potential of machine-learning (ML) models to predict FoG severity in PD patients, focusing on the influence of dopaminergic medication by comparing gait parameters in ON and OFF medication states. Methods: Specifically, this study employed spatiotemporal gait features to develop a predictive model for FoG severity, leveraging a random forest regressor to identify the most influential gait parameters associated with this in each medication state. The results indicate that the model achieved higher predictive performance in the OFF-medication condition (R² = 0.82, MAE = 2.25, MSE = 15.23) compared to the ON-medication condition (R² = 0.52, MAE = 4.16, MSE = 42.00). 
Results: These findings suggest that dopaminergic treatment alters gait dynamics, potentially reducing the reliability of FoG predictions when patients are medicated. Feature importance analysis revealed distinct gait characteristics associated with FoG severity across medication states. In the OFF condition, step length parameters, particularly left step length mean, were the most dominant predictors, alongside swing time and stride width, indicating the role of spatial and temporal gait control in FoG severity without medication. In contrast, under the ON medication condition, stride width and gait speed emerged as the most influential predictors, followed by stepping frequency, reflecting how medication influences stability and movement rhythm. Conclusions: These findings highlight the need for predictive models that account for medication-induced gait variability, ensuring more reliable FoG detection. By integrating spatiotemporal gait analysis and ML-based prediction, this study contributes to the development of personalized intervention strategies for PD patients experiencing FoG episodes. |
Despoina P Kiouri Georgios C Batsis, Thomas Mavromoustakos Alessandro Giuliani Christos Chasapis T Structure-Based Modeling of the Gut Bacteria–Host Interactome Through Statistical Analysis of Domain–Domain Associations Using Machine Learning Journal Article BioTech, 14 (1), pp. 13, 2025. @article{Kiouri2025c, title = {Structure-Based Modeling of the Gut Bacteria–Host Interactome Through Statistical Analysis of Domain–Domain Associations Using Machine Learning}, author = {Despoina P Kiouri, Georgios C Batsis, Thomas Mavromoustakos, Alessandro Giuliani, Christos T Chasapis}, url = {https://www.mdpi.com/2673-6284/14/1/13/pdf?version=1740479495}, doi = {https://doi.org/10.3390/biotech14010013}, year = {2025}, date = {2025-02-25}, journal = {BioTech}, volume = {14}, number = {1}, pages = {13}, abstract = {The gut microbiome, a complex ecosystem of microorganisms, plays a pivotal role in human health and disease. The gut microbiome’s influence extends beyond the digestive system to various organs, and its imbalance is linked to a wide range of diseases, including cancer and neurodevelopmental, inflammatory, metabolic, cardiovascular, autoimmune, and psychiatric diseases. Despite its significance, the interactions between gut bacteria and human proteins remain understudied, with less than 20,000 experimentally validated protein interactions between the host and any bacteria species. This study addresses this knowledge gap by predicting a protein–protein interaction network between gut bacterial and human proteins. Using statistical associations between Pfam domains, a comprehensive dataset of over one million experimentally validated pan-bacterial–human protein interactions, as well as inter- and intra-species protein interactions from various organisms, were used for the development of a machine learning-based prediction method to uncover key regulatory molecules in this dynamic system. 
This study’s findings contribute to the understanding of the intricate gut microbiome–host relationship and pave the way for future experimental validation and therapeutic strategies targeting the gut microbiome interplay.}, keywords = {}, pubstate = {published}, tppubtype = {article} } The gut microbiome, a complex ecosystem of microorganisms, plays a pivotal role in human health and disease. The gut microbiome’s influence extends beyond the digestive system to various organs, and its imbalance is linked to a wide range of diseases, including cancer and neurodevelopmental, inflammatory, metabolic, cardiovascular, autoimmune, and psychiatric diseases. Despite its significance, the interactions between gut bacteria and human proteins remain understudied, with less than 20,000 experimentally validated protein interactions between the host and any bacteria species. This study addresses this knowledge gap by predicting a protein–protein interaction network between gut bacterial and human proteins. Using statistical associations between Pfam domains, a comprehensive dataset of over one million experimentally validated pan-bacterial–human protein interactions, as well as inter- and intra-species protein interactions from various organisms, were used for the development of a machine learning-based prediction method to uncover key regulatory molecules in this dynamic system. This study’s findings contribute to the understanding of the intricate gut microbiome–host relationship and pave the way for future experimental validation and therapeutic strategies targeting the gut microbiome interplay. |
Natalia Koliou, George Vouros Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics Journal Article arXiv, 2025. @article{Koliou2025, title = {Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics}, author = {Natalia Koliou, George Vouros}, url = {https://arxiv.org/pdf/2502.14724}, doi = {https://doi.org/10.48550/arXiv.2502.14724}, year = {2025}, date = {2025-02-20}, journal = {arXiv}, abstract = {Game-theoretic solution concepts, such as the Nash equilibrium, have been key to finding stable joint actions in multi-player games. However, it has been shown that the dynamics of agents’ interactions, even in simple two-player games with few strategies, are incapable of reaching Nash equilibria, exhibiting complex and unpredictable behavior. Instead, evolutionary approaches can describe the long-term persistence of strategies and filter out transient ones, accounting for the long-term dynamics of agents’ interactions. Our goal is to identify agents’ joint strategies that result in stable behavior, being resistant to changes, while also accounting for agents’ payoffs, in dynamic games. Towards this goal, and building on previous results, this paper proposes transforming dynamic games into their empirical forms by considering agents’ strategies instead of agents’ actions, and applying the evolutionary methodology -Rank to evaluate and rank strategy profiles according to their long-term dynamics. This methodology not only allows us to identify joint strategies that are strong through agents’ long-term interactions, but also provides a descriptive, transparent framework regarding the high ranking of these strategies. Experiments report on agents that aim to collaboratively solve a stochastic version of the graph coloring problem. We consider different styles of play as strategies to define the empirical game, and train policies realizing these strategies, using the DQN algorithm. 
Then we run simulations to generate the payoff matrix required by {$\alpha$}-Rank to rank joint strategies.}
Dimitrios Doumanas Andreas Soularidis, Dimitris Spiliotopoulos Costas Vassilakis Konstantinos Kotis Fine-tuning large language models for ontology engineering: A comparative analysis of GPT-4 and Mistral Journal Article Applied Sciences, 15 (4), pp. 2146, 2025, ISSN: 2076-3417. @article{Doumanas2025b, title = {Fine-tuning large language models for ontology engineering: A comparative analysis of GPT-4 and Mistral}, author = {Dimitrios Doumanas, Andreas Soularidis, Dimitris Spiliotopoulos, Costas Vassilakis, Konstantinos Kotis}, doi = {https://doi.org/10.3390/app15042146}, issn = {2076-3417}, year = {2025}, date = {2025-02-18}, journal = {Applied Sciences}, volume = {15}, number = {4}, pages = {2146}, abstract = {Ontology engineering (OE) plays a critical role in modeling and managing structured knowledge across various domains. This study examines the performance of fine-tuned large language models (LLMs), specifically GPT-4 and Mistral 7B, in efficiently automating OE tasks. Foundational OE textbooks are used as the basis for dataset creation and for feeding the LLMs. The methodology involved segmenting texts into manageable chapters, generating question–answer pairs, and translating visual elements into description logic to curate fine-tuned datasets in JSONL format. This research aims to enhance the models’ abilities to generate domain-specific ontologies, with hypotheses asserting that fine-tuned LLMs would outperform base models, and that domain-specific datasets would significantly improve their performance. Comparative experiments revealed that GPT-4 demonstrated superior accuracy and adherence to ontology syntax, albeit with higher computational costs. Conversely, Mistral 7B excelled in speed and cost efficiency but struggled with domain-specific tasks, often generating outputs that lacked syntactical precision and relevance. 
The presented results highlight the necessity of integrating domain-specific datasets to improve contextual understanding and practical utility in specialized applications, such as Search and Rescue (SAR) missions in wildfire incidents. Both models, despite their limitations, exhibited potential in understanding OE principles. However, their performance underscored the importance of aligning training data with domain-specific knowledge to emulate human expertise effectively. This study, based on and extending our previous work on the topic, concludes that fine-tuned LLMs with targeted datasets enhance their utility in OE, offering insights into improving future models for domain-specific applications. The findings advocate further exploration of hybrid solutions to balance accuracy and efficiency.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Ontology engineering (OE) plays a critical role in modeling and managing structured knowledge across various domains. This study examines the performance of fine-tuned large language models (LLMs), specifically GPT-4 and Mistral 7B, in efficiently automating OE tasks. Foundational OE textbooks are used as the basis for dataset creation and for feeding the LLMs. The methodology involved segmenting texts into manageable chapters, generating question–answer pairs, and translating visual elements into description logic to curate fine-tuned datasets in JSONL format. This research aims to enhance the models’ abilities to generate domain-specific ontologies, with hypotheses asserting that fine-tuned LLMs would outperform base models, and that domain-specific datasets would significantly improve their performance. Comparative experiments revealed that GPT-4 demonstrated superior accuracy and adherence to ontology syntax, albeit with higher computational costs. 
Conversely, Mistral 7B excelled in speed and cost efficiency but struggled with domain-specific tasks, often generating outputs that lacked syntactical precision and relevance. The presented results highlight the necessity of integrating domain-specific datasets to improve contextual understanding and practical utility in specialized applications, such as Search and Rescue (SAR) missions in wildfire incidents. Both models, despite their limitations, exhibited potential in understanding OE principles. However, their performance underscored the importance of aligning training data with domain-specific knowledge to emulate human expertise effectively. This study, based on and extending our previous work on the topic, concludes that fine-tuned LLMs with targeted datasets enhance their utility in OE, offering insights into improving future models for domain-specific applications. The findings advocate further exploration of hybrid solutions to balance accuracy and efficiency. |
Christos Spatharis Konstantinos Blekas, George Santipantakis George Vouros Modular and Multimodal Generative Adversarial Imitation Learning for Modeling Flight Trajectories Journal Article Journal of Air Transportation, 33 (3), pp. 188-204, 2025. @article{Spatharis2025, title = {Modular and Multimodal Generative Adversarial Imitation Learning for Modeling Flight Trajectories}, author = {Christos Spatharis, Konstantinos Blekas, George Santipantakis, George Vouros}, doi = {https://doi.org/10.2514/1.D0396}, year = {2025}, date = {2025-02-17}, journal = {Journal of Air Transportation}, volume = {33}, number = {3}, pages = {188-204}, abstract = {We aim to imitate the execution of modular tasks by exploiting unsegmented trajectories that demonstrate the execution of these tasks. This is challenging since the execution of tasks follows different modes (i.e., patterns of behavior), which may exist in various mixtures within subtasks, and the identification of trajectories’ modules (i.e., subtrajectories executing subtasks) may not be easy. This paper addresses the modularity of trajectories in conjunction with multimodality toward imitating the execution of aircraft trajectories. It proposes an imitation learning framework for the aircraft trajectory prediction problem, which segments demonstrated aircraft trajectories into subtrajectories corresponding to flight phases. This facilitates disentangling modes and learning a mixture of policies per flight phase. While trajectories are segmented using domain-specific rules, a mixture of policies per flight phase is learned by a generative multimodal imitation learning method. This modular approach enables accurate prediction of both modes and subtrajectories, which finally results in predicting the evolution of the aircraft state across the whole trajectory in a compositional way. 
Experiments using a real-world dataset of long flights show the potential of the proposed framework to disentangle multimodal trajectories in real-world settings and predict trajectories with high accuracy, in comparison to methods that do not exploit subtrajectories.}, keywords = {}, pubstate = {published}, tppubtype = {article} } We aim to imitate the execution of modular tasks by exploiting unsegmented trajectories that demonstrate the execution of these tasks. This is challenging since the execution of tasks follows different modes (i.e., patterns of behavior), which may exist in various mixtures within subtasks, and the identification of trajectories’ modules (i.e., subtrajectories executing subtasks) may not be easy. This paper addresses the modularity of trajectories in conjunction with multimodality toward imitating the execution of aircraft trajectories. It proposes an imitation learning framework for the aircraft trajectory prediction problem, which segments demonstrated aircraft trajectories into subtrajectories corresponding to flight phases. This facilitates disentangling modes and learning a mixture of policies per flight phase. While trajectories are segmented using domain-specific rules, a mixture of policies per flight phase is learned by a generative multimodal imitation learning method. This modular approach enables accurate prediction of both modes and subtrajectories, which finally results in predicting the evolution of the aircraft state across the whole trajectory in a compositional way. Experiments using a real-world dataset of long flights show the potential of the proposed framework to disentangle multimodal trajectories in real-world settings and predict trajectories with high accuracy, in comparison to methods that do not exploit subtrajectories. |
Despoina P Kiouri Georgios C Batsis, Christos Chasapis T Structure-Based Deep Learning Framework for Modeling Human–Gut Bacterial Protein Interactions Journal Article Proteomes, 13 (1), pp. 10, 2025. @article{Kiouri2025b, title = {Structure-Based Deep Learning Framework for Modeling Human–Gut Bacterial Protein Interactions}, author = {Despoina P Kiouri, Georgios C Batsis, Christos T Chasapis}, url = {https://www.mdpi.com/2227-7382/13/1/10/pdf?version=1739790880}, doi = {https://doi.org/10.3390/proteomes13010010}, year = {2025}, date = {2025-02-17}, journal = {Proteomes}, volume = {13}, number = {1}, pages = {10}, abstract = {The interaction network between the human host proteins and the proteins of the gut bacteria is essential for the establishment of human health, and its dysregulation directly contributes to disease development. Despite its great importance, experimental data on protein–protein interactions (PPIs) between these species are sparse due to experimental limitations. Methods: This study presents a deep learning-based framework for predicting PPIs between human and gut bacterial proteins using structural data. The framework leverages graph-based protein representations and variational autoencoders (VAEs) to extract structural embeddings from protein graphs, which are then fused through a Bi-directional Cross-Attention module to predict interactions. The model addresses common challenges in PPI datasets, such as class imbalance, using focal loss to emphasize harder-to-classify samples. Results: The results demonstrated that this framework exhibits robust performance, with high precision and recall across validation and test datasets, underscoring its generalizability. By incorporating proteoforms in the analysis, the model accounts for the structural complexity within proteomes, making predictions biologically relevant. 
Conclusions: These findings offer a scalable tool for investigating the interactions between the host and the gut microbiota, potentially yielding new treatment targets and diagnostics for disorders linked to the microbiome.}, keywords = {}, pubstate = {published}, tppubtype = {article} } The interaction network between the human host proteins and the proteins of the gut bacteria is essential for the establishment of human health, and its dysregulation directly contributes to disease development. Despite its great importance, experimental data on protein–protein interactions (PPIs) between these species are sparse due to experimental limitations. Methods: This study presents a deep learning-based framework for predicting PPIs between human and gut bacterial proteins using structural data. The framework leverages graph-based protein representations and variational autoencoders (VAEs) to extract structural embeddings from protein graphs, which are then fused through a Bi-directional Cross-Attention module to predict interactions. The model addresses common challenges in PPI datasets, such as class imbalance, using focal loss to emphasize harder-to-classify samples. Results: The results demonstrated that this framework exhibits robust performance, with high precision and recall across validation and test datasets, underscoring its generalizability. By incorporating proteoforms in the analysis, the model accounts for the structural complexity within proteomes, making predictions biologically relevant. Conclusions: These findings offer a scalable tool for investigating the interactions between the host and the gut microbiota, potentially yielding new treatment targets and diagnostics for disorders linked to the microbiome. |
George Papadopoulos Andreas Kontogiannis, Foteini Papadopoulou Chaido Poulianou Ioannis Koumentis George Vouros An extended benchmarking of multi-agent reinforcement learning algorithms in complex fully cooperative tasks Journal Article arXiv, 2025. @article{Papadopoulos2025, title = {An extended benchmarking of multi-agent reinforcement learning algorithms in complex fully cooperative tasks}, author = {George Papadopoulos, Andreas Kontogiannis, Foteini Papadopoulou, Chaido Poulianou, Ioannis Koumentis, George Vouros}, url = {https://arxiv.org/pdf/2502.04773}, doi = {https://doi.org/10.48550/arXiv.2502.04773}, year = {2025}, date = {2025-02-07}, journal = {arXiv}, abstract = {Multi-Agent Reinforcement Learning (MARL) has recently emerged as a significant area of research. However, MARL evaluation often lacks systematic diversity, hindering a comprehensive understanding of algorithms’ capabilities. In particular, cooperative MARL algorithms are predominantly evaluated on benchmarks such as SMAC and GRF, which primarily feature team game scenarios without assessing adequately various aspects of agents’ capabilities required in fully cooperative real-world tasks such as multi-robot cooperation and warehouse, resource management, search and rescue, and human-AI cooperation. Moreover, MARL algorithms are mainly evaluated on low dimensional state spaces, and thus their performance on high-dimensional (e.g., image) observations is not well-studied. To fill this gap, this paper highlights the crucial need for expanding systematic evaluation across a wider array of existing benchmarks. To this end, we conduct extensive evaluation and comparisons of well-known MARL algorithms on complex fully cooperative benchmarks, including tasks with images as agents’ observations. Interestingly, our analysis shows that many algorithms, hailed as state-of-the-art on SMAC and GRF, may underperform standard MARL baselines on fully cooperative benchmarks. 
Finally, towards more systematic and better evaluation of cooperative MARL algorithms, we have open-sourced PyMARLzoo+, an extension of the widely used (E)PyMARL libraries, which addresses an open challenge from [49], facilitating seamless integration and support with all benchmarks of PettingZoo, as well as Overcooked, PressurePlate, Capture Target and Box Pushing.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Multi-Agent Reinforcement Learning (MARL) has recently emerged as a significant area of research. However, MARL evaluation often lacks systematic diversity, hindering a comprehensive understanding of algorithms’ capabilities. In particular, cooperative MARL algorithms are predominantly evaluated on benchmarks such as SMAC and GRF, which primarily feature team game scenarios without assessing adequately various aspects of agents’ capabilities required in fully cooperative real-world tasks such as multi-robot cooperation and warehouse, resource management, search and rescue, and human-AI cooperation. Moreover, MARL algorithms are mainly evaluated on low dimensional state spaces, and thus their performance on high-dimensional (e.g., image) observations is not well-studied. To fill this gap, this paper highlights the crucial need for expanding systematic evaluation across a wider array of existing benchmarks. To this end, we conduct extensive evaluation and comparisons of well-known MARL algorithms on complex fully cooperative benchmarks, including tasks with images as agents’ observations. Interestingly, our analysis shows that many algorithms, hailed as state-of-the-art on SMAC and GRF, may underperform standard MARL baselines on fully cooperative benchmarks. 
Finally, towards more systematic and better evaluation of cooperative MARL algorithms, we have open-sourced PyMARLzoo+, an extension of the widely used (E)PyMARL libraries, which addresses an open challenge from [49], facilitating seamless integration and support with all benchmarks of PettingZoo, as well as Overcooked, PressurePlate, Capture Target and Box Pushing. |
Fotis Assimakopoulos Costas Vassilakis, Dionisis Margaris Konstantinos Kotis Dimitris Spiliotopoulos AI and related technologies in the fields of smart agriculture: A review Journal Article Information, 16 (2), pp. 100, 2025. @article{Assimakopoulos2025, title = {AI and related technologies in the fields of smart agriculture: A review}, author = {Fotis Assimakopoulos, Costas Vassilakis, Dionisis Margaris, Konstantinos Kotis, Dimitris Spiliotopoulos}, url = {https://www.mdpi.com/2078-2489/16/2/100/pdf?version=1738918054}, doi = {https://doi.org/10.3390/info16020100}, year = {2025}, date = {2025-02-02}, journal = {Information}, volume = {16}, number = {2}, pages = {100}, abstract = {The integration of cutting-edge technologies—such as the Internet of Things (IoT), artificial intelligence (AI), machine learning (ML), and various emerging technologies—is revolutionizing agricultural practices, enhancing productivity, sustainability, and efficiency. The objective of this study is to review the literature regarding the development and evolution of AI as well as other emerging technologies in the various fields of Agriculture as they are developed and transformed by integrating the above technologies. The areas examined in this study are open field smart farming, vertical and indoor farming, zero waste agriculture, precision livestock farming, smart greenhouses, and regenerative agriculture. This paper links current research, technological innovations, and case studies to present a comprehensive review of these emerging technologies being developed in the context of smart agriculture, for the benefit of farmers and consumers in general. 
By exploring practical applications and future perspectives, this work aims to provide valuable insights to address global food security challenges, minimize environmental impacts, and support sustainable development goals through the application of new technologies.}, keywords = {}, pubstate = {published}, tppubtype = {article} } The integration of cutting-edge technologies—such as the Internet of Things (IoT), artificial intelligence (AI), machine learning (ML), and various emerging technologies—is revolutionizing agricultural practices, enhancing productivity, sustainability, and efficiency. The objective of this study is to review the literature regarding the development and evolution of AI as well as other emerging technologies in the various fields of Agriculture as they are developed and transformed by integrating the above technologies. The areas examined in this study are open field smart farming, vertical and indoor farming, zero waste agriculture, precision livestock farming, smart greenhouses, and regenerative agriculture. This paper links current research, technological innovations, and case studies to present a comprehensive review of these emerging technologies being developed in the context of smart agriculture, for the benefit of farmers and consumers in general. By exploring practical applications and future perspectives, this work aims to provide valuable insights to address global food security challenges, minimize environmental impacts, and support sustainable development goals through the application of new technologies. |
Dimitrios Doumanas Georgios Bouchouras, Andreas Soularidis Konstantinos Kotis George Vouros From human-to LLM-centered collaborative ontology engineering Journal Article Applied Ontology, 19 (4), pp. 334-367, 2025. @article{Doumanas2025, title = {From human-to LLM-centered collaborative ontology engineering}, author = {Dimitrios Doumanas, Georgios Bouchouras, Andreas Soularidis, Konstantinos Kotis, George Vouros}, doi = {https://doi.org/10.1177/15705838241305067}, year = {2025}, date = {2025-01-31}, journal = {Applied Ontology}, volume = {19}, number = {4}, pages = {334-367}, abstract = {In the continuously evolving landscape of knowledge engineering, the symbiosis and teaming of humans and machines emerge as a pivotal new domain. This article explores the multifaceted realms of human and machine collaborative ontology engineering (OE). The goal of the presented work is to explore the potential of Large Language Models (LLMs) to speed up and automate the processes of collaborative OE, experimenting with different levels of LLM involvement. The proposed approach is based on a human-centered approach, that is, the HCOME approach to collaborative OE, and follows a process of exploring the declining involvement of humans and the parallel increase of LLM involvement, concluding at a level of automation where the OE is exclusively performed by LLMs. This experimentation is organized based on a series of human/LLM collaboration levels (a spectrum of OE), each one aligned to a specific OE methodology, that is, Level-0 HCOME (Human), Level-1 X-HCOME (Human and LLMs), Level-2 SimX-HCOME (LLMs and Human), and Level-3 Sim-HCOME (LLMs). The evaluation of these methodologies (one per level) is performed by measuring the similarity of the generated ontologies against “reference” ontologies (precision, recall, and F1-score of reference-to-LLM-generated ontological mappings). 
The results presented in this paper demonstrate that while LLMs significantly expedite the OE process, the accuracy and completeness of the resulting ontologies are notably enhanced by maintaining a high level of human involvement. This study is expected to contribute to a deeper understanding of evolving dynamics in LLM-based/enhanced OE, paving the way for future advancements toward more effective collaborative OE frameworks.}, keywords = {}, pubstate = {published}, tppubtype = {article} } In the continuously evolving landscape of knowledge engineering, the symbiosis and teaming of humans and machines emerge as a pivotal new domain. This article explores the multifaceted realms of human and machine collaborative ontology engineering (OE). The goal of the presented work is to explore the potential of Large Language Models (LLMs) to speed up and automate the processes of collaborative OE, experimenting with different levels of LLM involvement. The proposed approach is based on a human-centered approach, that is, the HCOME approach to collaborative OE, and follows a process of exploring the declining involvement of humans and the parallel increase of LLM involvement, concluding at a level of automation where the OE is exclusively performed by LLMs. This experimentation is organized based on a series of human/LLM collaboration levels (a spectrum of OE), each one aligned to a specific OE methodology, that is, Level-0 HCOME (Human), Level-1 X-HCOME (Human and LLMs), Level-2 SimX-HCOME (LLMs and Human), and Level-3 Sim-HCOME (LLMs). The evaluation of these methodologies (one per level) is performed by measuring the similarity of the generated ontologies against “reference” ontologies (precision, recall, and F1-score of reference-to-LLM-generated ontological mappings). 
The results presented in this paper demonstrate that while LLMs significantly expedite the OE process, the accuracy and completeness of the resulting ontologies are notably enhanced by maintaining a high level of human involvement. This study is expected to contribute to a deeper understanding of evolving dynamics in LLM-based/enhanced OE, paving the way for future advancements toward more effective collaborative OE frameworks. |
Despoina P Kiouri Georgios C Batsis, Christos Chasapis T Structure-based approaches for protein–protein interaction prediction using machine learning and deep learning Journal Article Biomolecules, 15 (1), pp. 141, 2025. @article{Kiouri2025, title = {Structure-based approaches for protein–protein interaction prediction using machine learning and deep learning}, author = {Despoina P Kiouri, Georgios C Batsis, Christos T Chasapis}, url = {https://www.mdpi.com/2218-273X/15/1/141/pdf?version=1737097468}, doi = {https://doi.org/10.3390/biom15010141}, year = {2025}, date = {2025-01-17}, journal = {Biomolecules}, volume = {15}, number = {1}, pages = {141}, abstract = {Protein–Protein Interaction (PPI) prediction plays a pivotal role in understanding cellular processes and uncovering molecular mechanisms underlying health and disease. Structure-based PPI prediction has emerged as a robust alternative to sequence-based methods, offering greater biological accuracy by integrating three-dimensional spatial and biochemical features. This work summarizes the recent advances in computational approaches leveraging protein structure information for PPI prediction, focusing on machine learning (ML) and deep learning (DL) techniques. These methods not only improve predictive accuracy but also provide insights into functional sites, such as binding and catalytic residues. However, challenges such as limited high-resolution structural data and the need for effective negative sampling persist. Through the integration of experimental and computational tools, structure-based prediction paves the way for comprehensive proteomic network analysis, holding promise for advancements in drug discovery, biomarker identification, and personalized medicine. 
Future directions include enhancing scalability and dataset reliability to expand these approaches across diverse proteomes.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Protein–Protein Interaction (PPI) prediction plays a pivotal role in understanding cellular processes and uncovering molecular mechanisms underlying health and disease. Structure-based PPI prediction has emerged as a robust alternative to sequence-based methods, offering greater biological accuracy by integrating three-dimensional spatial and biochemical features. This work summarizes the recent advances in computational approaches leveraging protein structure information for PPI prediction, focusing on machine learning (ML) and deep learning (DL) techniques. These methods not only improve predictive accuracy but also provide insights into functional sites, such as binding and catalytic residues. However, challenges such as limited high-resolution structural data and the need for effective negative sampling persist. Through the integration of experimental and computational tools, structure-based prediction paves the way for comprehensive proteomic network analysis, holding promise for advancements in drug discovery, biomarker identification, and personalized medicine. Future directions include enhancing scalability and dataset reliability to expand these approaches across diverse proteomes. |
Georgios Bouchouras, Konstantinos Kotis Algorithms, 18 (1), pp. 34, 2025. @article{Bouchouras2025b, title = {Integrating artificial intelligence, internet of things, and sensor-based technologies: a systematic review of methodologies in autism Spectrum disorder detection}, author = {Georgios Bouchouras, Konstantinos Kotis}, url = {https://www.mdpi.com/1999-4893/18/1/34}, doi = {https://doi.org/10.3390/a18010034}, year = {2025}, date = {2025-01-09}, journal = {Algorithms}, volume = {18}, number = {1}, pages = {34}, abstract = {This paper presents a systematic review of the emerging applications of artificial intelligence (AI), Internet of Things (IoT), and sensor-based technologies in the diagnosis of autism spectrum disorder (ASD). The integration of these technologies has led to promising advances in identifying unique behavioral, physiological, and neuroanatomical markers associated with ASD. Through an examination of recent studies, we explore how technologies such as wearable sensors, eye-tracking systems, virtual reality environments, neuroimaging, and microbiome analysis contribute to a holistic approach to ASD diagnostics. The analysis reveals how these technologies facilitate non-invasive, real-time assessments across diverse settings, enhancing both diagnostic accuracy and accessibility. The findings underscore the transformative potential of AI, IoT, and sensor-based driven tools in providing personalized and continuous ASD detection, advocating for data-driven approaches that extend beyond traditional methodologies. Ultimately, this review emphasizes the role of technology in improving ASD diagnostic processes, paving the way for targeted and individualized assessments.}, keywords = {}, pubstate = {published}, tppubtype = {article} } This paper presents a systematic review of the emerging applications of artificial intelligence (AI), Internet of Things (IoT), and sensor-based technologies in the diagnosis of autism spectrum disorder (ASD). 
The integration of these technologies has led to promising advances in identifying unique behavioral, physiological, and neuroanatomical markers associated with ASD. Through an examination of recent studies, we explore how technologies such as wearable sensors, eye-tracking systems, virtual reality environments, neuroimaging, and microbiome analysis contribute to a holistic approach to ASD diagnostics. The analysis reveals how these technologies facilitate non-invasive, real-time assessments across diverse settings, enhancing both diagnostic accuracy and accessibility. The findings underscore the transformative potential of AI, IoT, and sensor-based driven tools in providing personalized and continuous ASD detection, advocating for data-driven approaches that extend beyond traditional methodologies. Ultimately, this review emphasizes the role of technology in improving ASD diagnostic processes, paving the way for targeted and individualized assessments. |
2024 |
Georgios Bouchouras Georgios Sofianidis, Konstantinos Kotis Temporal Anomaly Detection in Attention-Deficit/Hyperactivity Disorder Using Recurrent Neural Networks Journal Article Cureus, 16 (12), 2024. @article{Bouchouras2024b, title = {Temporal Anomaly Detection in Attention-Deficit/Hyperactivity Disorder Using Recurrent Neural Networks}, author = {Georgios Bouchouras, Georgios Sofianidis, Konstantinos Kotis}, url = {https://assets.cureus.com/uploads/original_article/pdf/324320/20250127-834463-ohpyph.pdf}, doi = {https://doi.org/10.7759/cureus.76496}, year = {2024}, date = {2024-12-27}, journal = {Cureus}, volume = {16}, number = {12}, abstract = {Attention-deficit/hyperactivity disorder (ADHD) is a prevalent neurodevelopmental condition marked by movement hyperactivity, often persisting into adulthood. Understanding the movement patterns associated with ADHD is crucial for improving diagnostic precision and tailoring interventions. This study leverages the HYPERAKTIV dataset, which includes high-resolution temporal data on motor activity from people diagnosed with ADHD. We used the isolation forest algorithm to detect anomalies in activity data, followed by the development of a recurrent neural network (RNN) model to predict these anomalies over time. The RNN model demonstrated high predictive accuracy, with a mean accuracy of 0.953 and a mean loss of 0.124 for participants with ADHD. These findings suggest that machine learning techniques, particularly RNNs, can effectively identify and predict anomalies in temporal motor activity data, offering objective insights into ADHD-related movement behaviors. This approach is promising for informing personalized interventions and improving clinical decision-making in the management of ADHD.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Attention-deficit/hyperactivity disorder (ADHD) is a prevalent neurodevelopmental condition marked by movement hyperactivity, often persisting into adulthood. 
Understanding the movement patterns associated with ADHD is crucial for improving diagnostic precision and tailoring interventions. This study leverages the HYPERAKTIV dataset, which includes high-resolution temporal data on motor activity from people diagnosed with ADHD. We used the isolation forest algorithm to detect anomalies in activity data, followed by the development of a recurrent neural network (RNN) model to predict these anomalies over time. The RNN model demonstrated high predictive accuracy, with a mean accuracy of 0.953 and a mean loss of 0.124 for participants with ADHD. These findings suggest that machine learning techniques, particularly RNNs, can effectively identify and predict anomalies in temporal motor activity data, offering objective insights into ADHD-related movement behaviors. This approach is promising for informing personalized interventions and improving clinical decision-making in the management of ADHD. |
Davide Ferraris Konstantinos Kotis, Christos Kalloniatis Enhancing TrUStAPIS Methodology in the Web of Things with LLM-generated IoT Trust Semantics Conference The 2024 International Conference on Information and Communications Security (ICICS 2024), 2024. @conference{Ferraris2024, title = {Enhancing TrUStAPIS Methodology in the Web of Things with LLM-generated IoT Trust Semantics}, author = {Davide Ferraris, Konstantinos Kotis, Christos Kalloniatis}, url = {https://link.springer.com/chapter/10.1007/978-981-97-8798-2_7}, doi = {https://doi.org/10.1007/978-981-97-8798-2_7}, year = {2024}, date = {2024-12-25}, booktitle = {The 2024 International Conference on Information and Communications Security (ICICS 2024)}, pages = {125-144}, abstract = {In the Internet of Things (IoT) there are ecosystems where their physical ’smart’ entities virtually interact with each other. Often, this interaction occurs among unknown entities, making trust an essential requirement to overcome uncertainty in several aspects of this interaction. However, trust is a complex concept, and incorporating it in IoT is still a challenging topic. For this reason, it is highly significant to specify and model trust in early stages of the System Development Life Cycle (SDLC) of IoT-integrated systems, thus enhancing the aforementioned task. TrUStAPIS is a requirements engineering methodology recently introduced for incorporating trust requirements during IoT-based system design. The scope of this paper is to provide an extension of TrUStAPIS by introducing IoT trust semantics compatible with the W3C Web of Things (WoT) recommendations generated with the assistance of Large Language Models (LLMs). Taking advantage of LLMs as a tool for integrating and refining existing methodologies, in this paper we present our work towards a revision of the TrUStAPIS methodology. 
In this work, we contribute a new conceptual model and a refined JSON-LD ontology that takes into account IoT trust semantics, providing eventually a valuable tool for software engineers to design and model IoT-based systems and services.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } In the Internet of Things (IoT) there are ecosystems where their physical ’smart’ entities virtually interact with each other. Often, this interaction occurs among unknown entities, making trust an essential requirement to overcome uncertainty in several aspects of this interaction. However, trust is a complex concept, and incorporating it in IoT is still a challenging topic. For this reason, it is highly significant to specify and model trust in early stages of the System Development Life Cycle (SDLC) of IoT-integrated systems, thus enhancing the aforementioned task. TrUStAPIS is a requirements engineering methodology recently introduced for incorporating trust requirements during IoT-based system design. The scope of this paper is to provide an extension of TrUStAPIS by introducing IoT trust semantics compatible with the W3C Web of Things (WoT) recommendations generated with the assistance of Large Language Models (LLMs). Taking advantage of LLMs as a tool for integrating and refining existing methodologies, in this paper we present our work towards a revision of the TrUStAPIS methodology. In this work, we contribute a new conceptual model and a refined JSON-LD ontology that takes into account IoT trust semantics, providing eventually a valuable tool for software engineers to design and model IoT-based systems and services. |
Andreas Soularidis Konstantinos Kotis, Myriam Lamolle Zakaria Mejdoul Gaëlle Lortal George Vouros LLM-Assisted Generation of SWRL Rules from Natural Language Conference 2024 International Conference on AI x Data and Knowledge Engineering (AIxDKE), 2024, ISBN: 979-8-3315-1704-5. @conference{Soularidis2024b, title = {LLM-Assisted Generation of SWRL Rules from Natural Language}, author = {Andreas Soularidis, Konstantinos Kotis, Myriam Lamolle, Zakaria Mejdoul, Gaëlle Lortal, George Vouros}, doi = {https://doi.org/10.1109/AIxDKE63520.2024.00008}, isbn = {979-8-3315-1704-5}, year = {2024}, date = {2024-12-11}, booktitle = {2024 International Conference on AI x Data and Knowledge Engineering (AIxDKE)}, abstract = {Recently, Large Language Models (LLMs) have attracted great attention due to their remarkable performance in human-like text generation and reasoning skills (although their memory and hallucination problems still remain key issues to tackle more efficiently). LLMs have been applied to various application domains, including Knowledge Graph (KG) generation, question and answering over KGs and text-to-SPARQL translation. In this work, we investigate the capabilities of LLMs in text-to-SWRL translation, i.e., translation of Natural Language (NL) rules into Semantic Web Rule Language (SWRL) rules, put in the context of an industrial Ontology Engineering (OE) environment called GLUON, presenting our first experimental results. The aim of this work is to identify the level of automation that is adequate for the LLM to generate well-formed SWRL rules, towards the development of an LLM-based framework, as a plugin to the GLUON OE environment. In this direction we leverage and combine the reasoning capabilities of GPT-4o model, the Retrieval-Augmented Generation (RAG) technology, and prompt engineering. 
We employ quantitative and qualitative metrics to evaluate the generated SWRL rules, focusing on the correct syntax and the level of human intervention.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Recently, Large Language Models (LLMs) have attracted great attention due to their remarkable performance in human-like text generation and reasoning skills (although their memory and hallucination problems still remain key issues to tackle more efficiently). LLMs have been applied to various application domains, including Knowledge Graph (KG) generation, question and answering over KGs and text-to-SPARQL translation. In this work, we investigate the capabilities of LLMs in text-to-SWRL translation, i.e., translation of Natural Language (NL) rules into Semantic Web Rule Language (SWRL) rules, put in the context of an industrial Ontology Engineering (OE) environment called GLUON, presenting our first experimental results. The aim of this work is to identify the level of automation that is adequate for the LLM to generate well-formed SWRL rules, towards the development of an LLM-based framework, as a plugin to the GLUON OE environment. In this direction we leverage and combine the reasoning capabilities of GPT-4o model, the Retrieval-Augmented Generation (RAG) technology, and prompt engineering. We employ quantitative and qualitative metrics to evaluate the generated SWRL rules, focusing on the correct syntax and the level of human intervention. |
| 1. | Andreas Kontogiannis Vasilis Pollatos, Panayotis Mertikopoulos Ioannis Panageas : Efficient swap regret minimization in combinatorial bandits. Twenty-Ninth Annual Conference on Artificial Intelligence and Statistics (AISTATS 2026), 2026. (Type: Conference | Abstract | Links | BibTeX) @conference{Kontogiannis2026, title = {Efficient swap regret minimization in combinatorial bandits}, author = {Andreas Kontogiannis, Vasilis Pollatos, Panayotis Mertikopoulos, Ioannis Panageas}, url = {https://arxiv.org/pdf/2602.02087}, year = {2026}, date = {2026-05-02}, booktitle = {Twenty-Ninth Annual Conference on Artificial Intelligence and Statistics (AISTATS 2026)}, abstract = {This paper addresses the problem of designing efficient no-swap regret algorithms for combinatorial bandits, where the number of actions N is exponentially large in the dimensionality of the problem. In this setting, designing efficient no-swap regret translates to sublinear — in horizon T — swap regret with polylogarithmic dependence on N. In contrast to the weaker notion of external regret minimization – a problem which is fairly well understood in the literature – achieving no-swap regret with a polylogarithmic dependence on N has remained elusive in combinatorial bandits. Our paper resolves this challenge, by introducing a no-swap-regret learning algorithm with regret that scales polylogarithmically in N and is tight for the class of combinatorial bandits. To ground our results, we also demonstrate how to implement the proposed algorithm efficiently — that is, with a per-iteration complexity that also scales polylogarithmically in N — across a wide range of well-studied applications.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } This paper addresses the problem of designing efficient no-swap regret algorithms for combinatorial bandits, where the number of actions N is exponentially large in the dimensionality of the problem. 
In this setting, designing efficient no-swap regret translates to sublinear — in horizon T — swap regret with polylogarithmic dependence on N. In contrast to the weaker notion of external regret minimization – a problem which is fairly well understood in the literature – achieving no-swap regret with a polylogarithmic dependence on N has remained elusive in combinatorial bandits. Our paper resolves this challenge, by introducing a no-swap-regret learning algorithm with regret that scales polylogarithmically in N and is tight for the class of combinatorial bandits. To ground our results, we also demonstrate how to implement the proposed algorithm efficiently — that is, with a per-iteration complexity that also scales polylogarithmically in N — across a wide range of well-studied applications. |
| 2. | Georgios Bouchouras Dimitrios Doumanas, Andreas Soularidis Konstantinos Kotis George Vouros : Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting. In: AI, 7 (4), pp. 139, 2026, ISSN: 2673-2688. (Type: Journal Article | Abstract | Links | BibTeX) @article{Bouchouras2026, title = {Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting}, author = {Georgios Bouchouras, Dimitrios Doumanas, Andreas Soularidis, Konstantinos Kotis, George Vouros}, url = {https://www.mdpi.com/2673-2688/7/4/139}, doi = {https://doi.org/10.3390/ai7040139}, issn = {2673-2688}, year = {2026}, date = {2026-04-14}, journal = {AI}, volume = {7}, number = {4}, pages = {139}, abstract = {Ontology engineering plays a critical role in clinical decision support systems for Parkinson’s Disease (PD) monitoring and alerting. While Large Language Models (LLMs) have shown promise in knowledge modeling tasks, their effectiveness in autonomously constructing comprehensive ontologies for complex clinical domains remains unclear. This study investigates four ontology engineering methodologies for PD monitoring and alerting: One-shot (OS) prompting, Decomposed Sequential Prompting (DSP), X-HCOME, and SimX-HCOME+. Multiple LLMs were evaluated across these methodologies. Generated ontologies were assessed against a reference PD ontology using structural evaluation metrics focused on classes and object properties. Expert review was additionally conducted to analyze knowledge extensions beyond the gold standard. LLMs were able to autonomously generate syntactically valid and semantically meaningful ontologies using OS and DSP prompting; however, these ontologies exhibited limited conceptual coverage. Incorporating human expertise through X-HCOME significantly improved ontology completeness and evaluation metrics. Expert review further validated clinically relevant concepts absent from the reference ontology. 
SimX-HCOME+ demonstrated that iterative, supervised collaboration supports ontology refinement, although challenges persisted in natural language-to-rule formalization. The findings suggest that LLMs are more effective as collaborative assistants rather than standalone ontology engineers in the PD domain. Structured human–LLM collaboration is associated with improved ontology coverage and facilitates the identification of potential knowledge extensions in clinical monitoring applications. While the present evaluation focuses primarily on structural ontology elements, the proposed methodologies provide useful insights for LLM-assisted ontology engineering in complex healthcare domains.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Ontology engineering plays a critical role in clinical decision support systems for Parkinson’s Disease (PD) monitoring and alerting. While Large Language Models (LLMs) have shown promise in knowledge modeling tasks, their effectiveness in autonomously constructing comprehensive ontologies for complex clinical domains remains unclear. This study investigates four ontology engineering methodologies for PD monitoring and alerting: One-shot (OS) prompting, Decomposed Sequential Prompting (DSP), X-HCOME, and SimX-HCOME+. Multiple LLMs were evaluated across these methodologies. Generated ontologies were assessed against a reference PD ontology using structural evaluation metrics focused on classes and object properties. Expert review was additionally conducted to analyze knowledge extensions beyond the gold standard. LLMs were able to autonomously generate syntactically valid and semantically meaningful ontologies using OS and DSP prompting; however, these ontologies exhibited limited conceptual coverage. Incorporating human expertise through X-HCOME significantly improved ontology completeness and evaluation metrics. Expert review further validated clinically relevant concepts absent from the reference ontology. 
SimX-HCOME+ demonstrated that iterative, supervised collaboration supports ontology refinement, although challenges persisted in natural language-to-rule formalization. The findings suggest that LLMs are more effective as collaborative assistants rather than standalone ontology engineers in the PD domain. Structured human–LLM collaboration is associated with improved ontology coverage and facilitates the identification of potential knowledge extensions in clinical monitoring applications. While the present evaluation focuses primarily on structural ontology elements, the proposed methodologies provide useful insights for LLM-assisted ontology engineering in complex healthcare domains. |
| 3. | Dimitrios Doumanas Andreas Soularidis, Nikolaos Zafeiropoulos Stamatis Chatzistamatis George Tsekouras Andreas El Saer Chrisaphis Nathanailidis Konstantinos Kotis E: Unbiasing Greek: In-Context Learning Strategies for Gender Bias Identification and Mitigation for Legal Documents and Job Ads. In: Information, 17 (4), pp. 342, 2026. (Type: Journal Article | Abstract | Links | BibTeX) @article{Doumanas2026b, title = {Unbiasing Greek: In-Context Learning Strategies for Gender Bias Identification and Mitigation for Legal Documents and Job Ads}, author = {Dimitrios Doumanas, Andreas Soularidis, Nikolaos Zafeiropoulos, Stamatis Chatzistamatis, George E Tsekouras, Andreas El Saer, Chrisaphis Nathanailidis, Konstantinos Kotis}, url = {https://www.mdpi.com/2078-2489/17/4/342}, doi = {https://doi.org/10.3390/info17040342}, year = {2026}, date = {2026-04-02}, journal = {Information}, volume = {17}, number = {4}, pages = {342}, abstract = {Gender bias embedded in legal and professional texts perpetuates systemic inequality, yet research on bias identification and mitigation remains largely confined to English. Morphologically rich languages such as Greek, where grammatical gender pervades nouns, adjectives, pronouns, and participles, present unique challenges that existing approaches fail to address. This paper elaborates on a systematic methodology primarily focusing on identifying and mitigating gender bias in Greek-language job advertisements and legal documents. To accomplish that task, we define a taxonomy of nine gender bias rules tailored to the linguistic properties of Greek and construct domain-specific annotated datasets comprising 90 expert-curated few-shot examples across both textual domains. 
Using these resources, we employ XML-structured prompt engineering with in-context learning (ICL)and systematically compare three classes of models: (i) commercial large language models (LLMs), namely Claude Sonnet 4.5 and GPT-5.2, (ii) two open-weight small language models (SLMs), Mistral Small (24B) and Ministral (14B), and (iii) Llama Krikri (8B), a Greek-native language model built on Llama 3.1 and fine-tuned on high-quality Greek corpora. For each input text, the system identifies biased expressions, maps them to specific bias rules, provides explanations, and generates a fully corrected inclusive version. Our experiments reveal substantial performance disparities across model scales and linguistic specialization, with LLMs demonstrating superior contextual reasoning and SLMs exhibiting systematic over-correction and grammatical errors in Greek morphology. We further introduce a critical meta-rule addressing gender agreement with named entities to prevent spurious corrections in legal texts referencing identified individuals. The findings highlight the importance of model scale, language-specific adaptation, and carefully designed prompting strategies for bias mitigation in underrepresented languages.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Gender bias embedded in legal and professional texts perpetuates systemic inequality, yet research on bias identification and mitigation remains largely confined to English. Morphologically rich languages such as Greek, where grammatical gender pervades nouns, adjectives, pronouns, and participles, present unique challenges that existing approaches fail to address. This paper elaborates on a systematic methodology primarily focusing on identifying and mitigating gender bias in Greek-language job advertisements and legal documents. 
To accomplish that task, we define a taxonomy of nine gender bias rules tailored to the linguistic properties of Greek and construct domain-specific annotated datasets comprising 90 expert-curated few-shot examples across both textual domains. Using these resources, we employ XML-structured prompt engineering with in-context learning (ICL) and systematically compare three classes of models: (i) commercial large language models (LLMs), namely Claude Sonnet 4.5 and GPT-5.2, (ii) two open-weight small language models (SLMs), Mistral Small (24B) and Ministral (14B), and (iii) Llama Krikri (8B), a Greek-native language model built on Llama 3.1 and fine-tuned on high-quality Greek corpora. For each input text, the system identifies biased expressions, maps them to specific bias rules, provides explanations, and generates a fully corrected inclusive version. Our experiments reveal substantial performance disparities across model scales and linguistic specialization, with LLMs demonstrating superior contextual reasoning and SLMs exhibiting systematic over-correction and grammatical errors in Greek morphology. We further introduce a critical meta-rule addressing gender agreement with named entities to prevent spurious corrections in legal texts referencing identified individuals. The findings highlight the importance of model scale, language-specific adaptation, and carefully designed prompting strategies for bias mitigation in underrepresented languages. |
| 4. | Andreas Kontogiannis Ioannis Panageas, Vasilis Pollatos : The computational complexity of avoiding strict saddle points in constrained optimization. In: arXiv, 2026. (Type: Journal Article | Abstract | Links | BibTeX) @article{Kontogiannis2026b, title = {The computational complexity of avoiding strict saddle points in constrained optimization}, author = {Andreas Kontogiannis, Ioannis Panageas, Vasilis Pollatos}, url = {https://arxiv.org/abs/2604.02285}, doi = {https://doi.org/10.48550/arXiv.2604.02285}, year = {2026}, date = {2026-04-02}, journal = {arXiv}, abstract = {While first-order stationary points (FOSPs) are the traditional targets of non-convex optimization, they often correspond to undesirable strict saddle points. To circumvent this, attention has shifted towards second-order stationary points (SOSPs). In unconstrained settings, finding approximate SOSPs is PLS-complete (Kontogiannis et al.), matching the complexity of finding unconstrained FOSPs (Hollender and Zampetakis). However, the complexity of finding SOSPs in constrained settings remained notoriously unclear and was highlighted as an important open question by both aforementioned works. Under one strict definition, even verifying whether a point is an approximate SOSP is NP-hard (Murty and Kabadi). Under another widely adopted, relaxed definition where non-negative curvature is required only along the null space of the active constraints, the problem lies in TFNP, and algorithms with O(poly(1/epsilon)) running times have been proposed (Lu et al.). In this work, we settle the complexity of constrained SOSP by proving that computing an epsilon-approximate SOSP under the tractable definition is PLS-complete. We demonstrate that our result holds even in the 2D unit square [0,1]^2, and remarkably, even when stationary points are isolated at a distance of Omega(1) from the domain’s boundary. 
Our result establishes a fundamental barrier: unless PLS is a subset of PPAD (implying PLS = CLS), no deterministic, iterative algorithm with an efficient, continuous update rule can exist for finding approximate SOSPs. This contrasts with the constrained first-order counterpart, for which Fearnley et al. showed that finding an approximate KKT point is CLS-complete. Finally, our result yields the first problem defined in a compact domain to be shown PLS-complete beyond the canonical Real-LocalOpt (Daskalakis and Papadimitriou).”}, keywords = {}, pubstate = {published}, tppubtype = {article} } While first-order stationary points (FOSPs) are the traditional targets of non-convex optimization, they often correspond to undesirable strict saddle points. To circumvent this, attention has shifted towards second-order stationary points (SOSPs). In unconstrained settings, finding approximate SOSPs is PLS-complete (Kontogiannis et al.), matching the complexity of finding unconstrained FOSPs (Hollender and Zampetakis). However, the complexity of finding SOSPs in constrained settings remained notoriously unclear and was highlighted as an important open question by both aforementioned works. Under one strict definition, even verifying whether a point is an approximate SOSP is NP-hard (Murty and Kabadi). Under another widely adopted, relaxed definition where non-negative curvature is required only along the null space of the active constraints, the problem lies in TFNP, and algorithms with O(poly(1/epsilon)) running times have been proposed (Lu et al.). In this work, we settle the complexity of constrained SOSP by proving that computing an epsilon-approximate SOSP under the tractable definition is PLS-complete. We demonstrate that our result holds even in the 2D unit square [0,1]^2, and remarkably, even when stationary points are isolated at a distance of Omega(1) from the domain’s boundary. 
Our result establishes a fundamental barrier: unless PLS is a subset of PPAD (implying PLS = CLS), no deterministic, iterative algorithm with an efficient, continuous update rule can exist for finding approximate SOSPs. This contrasts with the constrained first-order counterpart, for which Fearnley et al. showed that finding an approximate KKT point is CLS-complete. Finally, our result yields the first problem defined in a compact domain to be shown PLS-complete beyond the canonical Real-LocalOpt (Daskalakis and Papadimitriou). |
| 5. | Georgios M Santipantakis Christos Doulkeridis, Petros Brimos : Semantic Data Transformation, FAIRification and Provenance for Data Spaces. In: Data in Brief, 66 , pp. 112675, 2026, ISSN: 2352-3409. (Type: Journal Article | Links | BibTeX) @article{Santipantakis2026, title = {Semantic Data Transformation, FAIRification and Provenance for Data Spaces}, author = {Georgios M Santipantakis, Christos Doulkeridis, Petros Brimos}, url = {https://www.sciencedirect.com/science/article/pii/S2352340926002283}, doi = {https://doi.org/10.1016/j.dib.2026.112675}, issn = {2352-3409}, year = {2026}, date = {2026-03-10}, journal = {Data in Brief}, volume = {66}, pages = {112675}, keywords = {}, pubstate = {published}, tppubtype = {article} } |
| 6. | Michael Kenteris, Konstantinos Kotis : The Convergence of Federated Learning, Knowledge Graphs, and Large Language Models for Language Learning: A Scoping Review. In: Applied Sciences, 16 (5), pp. 2611, 2026. (Type: Journal Article | Abstract | Links | BibTeX) @article{Kenteris2026, title = {The Convergence of Federated Learning, Knowledge Graphs, and Large Language Models for Language Learning: A Scoping Review}, author = {Michael Kenteris, Konstantinos Kotis}, url = {https://www.mdpi.com/2076-3417/16/5/2611}, doi = {https://doi.org/10.3390/app16052611}, year = {2026}, date = {2026-03-09}, journal = {Applied Sciences}, volume = {16}, number = {5}, pages = {2611}, abstract = {Large Language Models (LLMs) in Intelligent Computer-Assisted Language Learning enable highly personalized learning, yet raise significant challenges related to pedagogical grounding, data privacy, and instructional validity. Although Knowledge Graphs (KGs) and Federated Learning (FL) can mitigate these issues in isolation, evidence on systematic FL–KG–LLM integration for educational language learning remains limited. This scoping review maps the FL–KG–LLM convergence landscape. Following PRISMA-ScR guidelines, we searched six databases and screened 51 papers (2019–2025) using automated extraction. Our findings indicate limited convergence: no papers integrate all three domains, and 58.8% of approaches remain confined to isolated technological silos. Reporting is also uneven across the corpus, with an average “Not Reported” (NR) rate of 84.5%, most notably for privacy mechanisms (92.2%), validation metrics (90.2%), and Common European Framework of Reference for Languages (CEFR) alignment (88.2%). 
Domain-specific analysis reveals two distinct patterns: inter-domain gaps (disciplinary silos resulting in expected CEFR absence in single-domain papers) and intra-domain gaps (failure to report domain-critical variables, including 100% parameter NR in FL studies, 86.7% validation NR in KG studies, and 100% CEFR NR in convergence papers). Taken together, these gaps suggest that pedagogical grounding is treated as optional rather than structural. We therefore identify two pillars of pedagogical grounding: a Grounding Pillar, which constrains LLM outputs via Knowledge Graph rules, and a Validation Pillar, which concerns how authoritative frameworks (e.g., CEFR) are mapped onto Knowledge Graph schemas and evaluated. The near-universal absence of CEFR alignment and validation reporting suggests that this second pillar is currently missing, which we term the Integrity Gap—a systematic disconnection between technological innovation and pedagogical grounding inin Intelligent Computer-Assisted Language Learning. By reframing the problem as upstream control and validation, this review informs the design of user-facing automated systems where trust, transparency, and human oversight are critical.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Large Language Models (LLMs) in Intelligent Computer-Assisted Language Learning enable highly personalized learning, yet raise significant challenges related to pedagogical grounding, data privacy, and instructional validity. Although Knowledge Graphs (KGs) and Federated Learning (FL) can mitigate these issues in isolation, evidence on systematic FL–KG–LLM integration for educational language learning remains limited. This scoping review maps the FL–KG–LLM convergence landscape. Following PRISMA-ScR guidelines, we searched six databases and screened 51 papers (2019–2025) using automated extraction. 
Our findings indicate limited convergence: no papers integrate all three domains, and 58.8% of approaches remain confined to isolated technological silos. Reporting is also uneven across the corpus, with an average “Not Reported” (NR) rate of 84.5%, most notably for privacy mechanisms (92.2%), validation metrics (90.2%), and Common European Framework of Reference for Languages (CEFR) alignment (88.2%). Domain-specific analysis reveals two distinct patterns: inter-domain gaps (disciplinary silos resulting in expected CEFR absence in single-domain papers) and intra-domain gaps (failure to report domain-critical variables, including 100% parameter NR in FL studies, 86.7% validation NR in KG studies, and 100% CEFR NR in convergence papers). Taken together, these gaps suggest that pedagogical grounding is treated as optional rather than structural. We therefore identify two pillars of pedagogical grounding: a Grounding Pillar, which constrains LLM outputs via Knowledge Graph rules, and a Validation Pillar, which concerns how authoritative frameworks (e.g., CEFR) are mapped onto Knowledge Graph schemas and evaluated. The near-universal absence of CEFR alignment and validation reporting suggests that this second pillar is currently missing, which we term the Integrity Gap—a systematic disconnection between technological innovation and pedagogical grounding in Intelligent Computer-Assisted Language Learning. By reframing the problem as upstream control and validation, this review informs the design of user-facing automated systems where trust, transparency, and human oversight are critical. |
| 7. | Elias Alevizos Georgios M Santipantakis, Christos Doulkeridis Alexander Artikis : Online spatial reasoning for complex event recognition. In: GeoInformatica, 30 (1), pp. 9, 2026. (Type: Journal Article | Abstract | Links | BibTeX) @article{Alevizos2026, title = {Online spatial reasoning for complex event recognition}, author = {Elias Alevizos, Georgios M Santipantakis, Christos Doulkeridis, Alexander Artikis}, url = {https://link.springer.com/article/10.1007/s10707-026-00569-z}, doi = {https://doi.org/10.1007/s10707-026-00569-z}, year = {2026}, date = {2026-03-03}, journal = {GeoInformatica}, volume = {30}, number = {1}, pages = {9}, abstract = {Complex Event Recognition (CER) systems have the ability to process streams of events by detecting event patterns with minimal latency. Typically, these patterns have a temporal structure, often resembling the sequential structure of regular expressions. A pattern advances to the next state by checking various conditions on the current and possibly previous events of the stream. CER systems are very efficient in tracking all the possible paths that a pattern may follow and report when a path is complete and a complex event must be reported. In some cases, the conditions that need to be checked may be spatial. For example, in maritime situational awareness, a condition may need to check whether a vessel is close to any other vessel. Such conditions are not easily expressed directly as regular expressions. For such spatio-temporal tasks, there exist dedicated modules which can evaluate this type of conditions efficiently. Thus, we can integrate such a spatio-temporal module within a CER system in order to take advantage of both worlds: the CER engine can accommodate and process complex regular expressions and delegate the evaluation of expensive spatio-temporal tasks to a dedicated module whenever it needs to. We present an approach towards such an integration. 
We describe how a CER engine, based on symbolic automata, can cooperate with a spatio-temporal link discovery (stLD) module such that the former can leverage the spatio-temporal capabilities of the latter. This cooperation can take place in an online manner rendering the whole system suitable for real-time processing of event streams. We discuss two different communication schemes between the CER engine and the spatio-temporal module and explore when each one should be preferred. We provide a theoretical estimation of the predicted performance of the system under each communication scheme. Our extensive experimental evaluation confirms most of our theoretical predictions.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Complex Event Recognition (CER) systems have the ability to process streams of events by detecting event patterns with minimal latency. Typically, these patterns have a temporal structure, often resembling the sequential structure of regular expressions. A pattern advances to the next state by checking various conditions on the current and possibly previous events of the stream. CER systems are very efficient in tracking all the possible paths that a pattern may follow and report when a path is complete and a complex event must be reported. In some cases, the conditions that need to be checked may be spatial. For example, in maritime situational awareness, a condition may need to check whether a vessel is close to any other vessel. Such conditions are not easily expressed directly as regular expressions. For such spatio-temporal tasks, there exist dedicated modules which can evaluate this type of conditions efficiently. Thus, we can integrate such a spatio-temporal module within a CER system in order to take advantage of both worlds: the CER engine can accommodate and process complex regular expressions and delegate the evaluation of expensive spatio-temporal tasks to a dedicated module whenever it needs to. 
We present an approach towards such an integration. We describe how a CER engine, based on symbolic automata, can cooperate with a spatio-temporal link discovery (stLD) module such that the former can leverage the spatio-temporal capabilities of the latter. This cooperation can take place in an online manner rendering the whole system suitable for real-time processing of event streams. We discuss two different communication schemes between the CER engine and the spatio-temporal module and explore when each one should be preferred. We provide a theoretical estimation of the predicted performance of the system under each communication scheme. Our extensive experimental evaluation confirms most of our theoretical predictions. |
| 8. | George Papadopoulos, George Vouros A: Learning to maintain safety through expert demonstrations in settings with unknown constraints: A Q-learning perspective. In: arXiv, 2026. (Type: Journal Article | Abstract | Links | BibTeX) @article{Papadopoulos2026, title = {Learning to maintain safety through expert demonstrations in settings with unknown constraints: A Q-learning perspective}, author = {George Papadopoulos, George A Vouros}, url = {https://arxiv.org/abs/2602.23816 https://arxiv.org/pdf/2602.23816}, doi = {https://doi.org/10.48550/arXiv.2602.23816}, year = {2026}, date = {2026-02-27}, journal = {arXiv}, abstract = {Given a set of trajectories demonstrating the execution of a task safely in a constrained MDP with observable rewards but with unknown constraints and non-observable costs, we aim to find a policy that maximizes the likelihood of demonstrated trajectories trading the balance between being conservative and increasing significantly the likelihood of high-rewarding trajectories but with potentially unsafe steps. Having these objectives, we aim towards learning a policy that maximizes the probability of the most $promising$ trajectories with respect to the demonstrations. In so doing, we formulate the “promise” of individual state-action pairs in terms of $Q$ values, which depend on task-specific rewards as well as on the assessment of states’ safety, mixing expectations in terms of rewards and safety. 
This entails a safe Q-learning perspective of the inverse learning problem under constraints: The devised Safe $Q$ Inverse Constrained Reinforcement Learning (SafeQIL) algorithm is compared to state-of-the-art inverse constraint reinforcement learning algorithms to a set of challenging benchmark tasks, showing its merits.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Given a set of trajectories demonstrating the execution of a task safely in a constrained MDP with observable rewards but with unknown constraints and non-observable costs, we aim to find a policy that maximizes the likelihood of demonstrated trajectories trading the balance between being conservative and increasing significantly the likelihood of high-rewarding trajectories but with potentially unsafe steps. Having these objectives, we aim towards learning a policy that maximizes the probability of the most $promising$ trajectories with respect to the demonstrations. In so doing, we formulate the “promise” of individual state-action pairs in terms of $Q$ values, which depend on task-specific rewards as well as on the assessment of states’ safety, mixing expectations in terms of rewards and safety. This entails a safe Q-learning perspective of the inverse learning problem under constraints: The devised Safe $Q$ Inverse Constrained Reinforcement Learning (SafeQIL) algorithm is compared to state-of-the-art inverse constraint reinforcement learning algorithms to a set of challenging benchmark tasks, showing its merits. |
| 9. | Dimitrios Doumanas, Konstantinos Kotis : ReaDS-KG: An LLM-Knowledge Graph Framework for Reasoned Decision Support in Dynamic Safety-Critical Domains. In: TechRxiv, 2026. (Type: Journal Article | Abstract | Links | BibTeX) @article{Doumanas2026, title = {ReaDS-KG: An LLM-Knowledge Graph Framework for Reasoned Decision Support in Dynamic Safety-Critical Domains}, author = {Dimitrios Doumanas, Konstantinos Kotis}, url = {https://www.techrxiv.org/doi/full/10.36227/techrxiv.176826793.34811491/v1}, doi = {https://doi.org/10.36227/techrxiv.176826793.34811491/v1}, year = {2026}, date = {2026-01-13}, journal = {TechRxiv}, abstract = {Safety-critical domains such as military operations, border security, and search-and-rescue must operate under uncertainty, severe time pressure, and continuously changing conditions. In these settings, decision-support systems must not only provide accurate recommendations but also make the underlying reasoning explicit and auditable. This paper introduces ReaDS-KG (Reasoned Decision Support over Knowledge Graphs), an LLM-Knowledge Graph framework that delivers reasoned rather than purely predictive support. ReaDS-KG represents domain knowledge, assets, constraints, and causal dependencies in an ontology-driven knowledge graph, and uses a large language model to (i) translate natural-language questions into Cypher queries, (ii) orchestrate graph-based reasoning over causal structures, and (iii) return narrative answers with explicit justifications grounded in the graph. The framework follows a five-stage pipeline: ontology design, data-to-KG transformation, causal enrichment, LLM-mediated querying, and scenariobased evaluation. To demonstrate its applicability, we instantiate ReaDS-KG in a synthetic brigade-level operational scenario and pose twenty decision-oriented questions, covering feasibility, mobility, sustainment, command-and-control robustness, and risk. 
We then compare an LLM+KG agent powered by ReaDS-KG to ten active-duty officers using an eight-dimensional scoring rubric. The agent achieves decision-support quality comparable to field-grade officers and clearly above junior officers, while responding at machine response speed and providing transparent reasoning chains. These results suggest that ReaDS-KG can function as a quasi-expert, explainable staff assistant in dynamic safety-critical domains, and the architecture is readily transferable to other safety-critical settings that share similar uncertainty and causal-reasoning requirements, such as border management and disaster response.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Safety-critical domains such as military operations, border security, and search-and-rescue must operate under uncertainty, severe time pressure, and continuously changing conditions. In these settings, decision-support systems must not only provide accurate recommendations but also make the underlying reasoning explicit and auditable. This paper introduces ReaDS-KG (Reasoned Decision Support over Knowledge Graphs), an LLM-Knowledge Graph framework that delivers reasoned rather than purely predictive support. ReaDS-KG represents domain knowledge, assets, constraints, and causal dependencies in an ontology-driven knowledge graph, and uses a large language model to (i) translate natural-language questions into Cypher queries, (ii) orchestrate graph-based reasoning over causal structures, and (iii) return narrative answers with explicit justifications grounded in the graph. The framework follows a five-stage pipeline: ontology design, data-to-KG transformation, causal enrichment, LLM-mediated querying, and scenariobased evaluation. To demonstrate its applicability, we instantiate ReaDS-KG in a synthetic brigade-level operational scenario and pose twenty decision-oriented questions, covering feasibility, mobility, sustainment, command-and-control robustness, and risk. 
We then compare an LLM+KG agent powered by ReaDS-KG to ten active-duty officers using an eight-dimensional scoring rubric. The agent achieves decision-support quality comparable to field-grade officers and clearly above junior officers, while responding at machine response speed and providing transparent reasoning chains. These results suggest that ReaDS-KG can function as a quasi-expert, explainable staff assistant in dynamic safety-critical domains, and the architecture is readily transferable to other safety-critical settings that share similar uncertainty and causal-reasoning requirements, such as border management and disaster response. |
| 10. | Andreas Soularidis Dimitrios Doumanas, Konstantinos Kotis George Vouros A: Automating agentic collaborative ontology engineering with role-playing simulation of LLM-powered agents and RAG technology. In: The Knowledge Engineering Review, 40 , pp. e10, 2025. (Type: Journal Article | Abstract | Links | BibTeX) @article{Soularidis2025, title = {Automating agentic collaborative ontology engineering with role-playing simulation of LLM-powered agents and RAG technology}, author = {Andreas Soularidis, Dimitrios Doumanas, Konstantinos Kotis, George A Vouros}, doi = {https://doi.org/10.1017/S026988892510009X}, year = {2025}, date = {2025-12-19}, journal = {The Knowledge Engineering Review}, volume = {40}, pages = {e10}, abstract = {Motivated by the astonishing capabilities of large language models (LLMs) in text-generation, reasoning, and simulation of complex human behaviors, in this paper, we propose a novel multi-component LLM-based framework, namely LLM4ACOE, that fully automates the collaborative ontology engineering (COE) process using role-playing simulation of LLM agents and retrieval augmented generation (RAG) technology. The proposed solution enhances the LLM-powered role-playing simulation with RAG ‘feeding’ the LLM with three different types of external knowledge. This knowledge corresponds to the knowledge required by each of the COE roles (agents), using a component-based framework, as follows: (a) domain-specific data-centric documents, (b) OWL documentation, and (c) ReAct guidelines. The aforementioned components are evaluated in combination, with the aim of investigating their impact on the quality of generated ontologies. 
The aim of this work is twofold, (a) to identify the capacity of LLM-based agents to generate acceptable (by human-experts) ontologies through agentic collaborative ontology engineering (ACOE) role-playing simulation, at specific levels of acceptance (accuracy, validity, and expressiveness of ontologies) without human intervention and (b) to investigate whether and/or to what extent the selected RAG components affect the quality of the generated ontologies. The evaluation of this novel approach is performed using ChatGPT-o in the domain of search and rescue (SAR) missions. To assess the generated ontologies, quantitative and qualitative measures are employed, focusing on coverage, expressiveness, structure, and human involvement.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Motivated by the astonishing capabilities of large language models (LLMs) in text-generation, reasoning, and simulation of complex human behaviors, in this paper, we propose a novel multi-component LLM-based framework, namely LLM4ACOE, that fully automates the collaborative ontology engineering (COE) process using role-playing simulation of LLM agents and retrieval augmented generation (RAG) technology. The proposed solution enhances the LLM-powered role-playing simulation with RAG ‘feeding’ the LLM with three different types of external knowledge. This knowledge corresponds to the knowledge required by each of the COE roles (agents), using a component-based framework, as follows: (a) domain-specific data-centric documents, (b) OWL documentation, and (c) ReAct guidelines. The aforementioned components are evaluated in combination, with the aim of investigating their impact on the quality of generated ontologies. 
The aim of this work is twofold: (a) to identify the capacity of LLM-based agents to generate acceptable (by human-experts) ontologies through agentic collaborative ontology engineering (ACOE) role-playing simulation, at specific levels of acceptance (accuracy, validity, and expressiveness of ontologies) without human intervention and (b) to investigate whether and/or to what extent the selected RAG components affect the quality of the generated ontologies. The evaluation of this novel approach is performed using ChatGPT-o in the domain of search and rescue (SAR) missions. To assess the generated ontologies, quantitative and qualitative measures are employed, focusing on coverage, expressiveness, structure, and human involvement. |
| 11. | Apostolos Glenis, George Vouros : Scalable Univariate and Multivariate Time-Series Classifiers with Deep Learning Methods Exploiting Symbolic Representations. In: Computers, 14 (12), pp. 563, 2025. (Type: Journal Article | Abstract | Links | BibTeX) @article{Glenis2025, title = {Scalable Univariate and Multivariate Time-Series Classifiers with Deep Learning Methods Exploiting Symbolic Representations}, author = {Apostolos Glenis, George Vouros}, url = {https://www.mdpi.com/2073-431X/14/12/563}, doi = {https://doi.org/10.3390/computers14120563}, year = {2025}, date = {2025-12-17}, journal = {Computers}, volume = {14}, number = {12}, pages = {563}, abstract = {Time-series classification (TSC) is an important task across sciences. Symbolic representations (especially SFA) are very effective at combating noise. In this paper, we employ symbolic representations to create state-of-the-art time-series classifiers, with the aim to advance scalability without sacrificing accuracy. First, we create a graph representation of the time series based on SFA words. We use this representation together with graph kernels and an SVM classifier to create a scalable time-series classifier. Next, we use the graph representation together with a Graph Convolutional Neural Network to test how it fares against state-of-the-art time-series classifiers. Additionally, we devised deep neural networks exploiting the SFA representation, inspired by the text classification domain, to study how they fare against state-of-the-art classifiers. The proposed deep learning classifiers have been adapted and evaluated for the multivariate time-series case and also against state-of-the-art time-series classification algorithms based on symbolic representations.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Time-series classification (TSC) is an important task across sciences. Symbolic representations (especially SFA) are very effective at combating noise. 
In this paper, we employ symbolic representations to create state-of-the-art time-series classifiers, with the aim to advance scalability without sacrificing accuracy. First, we create a graph representation of the time series based on SFA words. We use this representation together with graph kernels and an SVM classifier to create a scalable time-series classifier. Next, we use the graph representation together with a Graph Convolutional Neural Network to test how it fares against state-of-the-art time-series classifiers. Additionally, we devised deep neural networks exploiting the SFA representation, inspired by the text classification domain, to study how they fare against state-of-the-art classifiers. The proposed deep learning classifiers have been adapted and evaluated for the multivariate time-series case and also against state-of-the-art time-series classification algorithms based on symbolic representations. |
| 12. | Georgios Bouchouras Dimitrios Doumanas, Andreas Soularidis Konstantinos Kotis George Vouros A: Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting. In: arXiv, 2025. (Type: Journal Article | Abstract | Links | BibTeX) @article{Bouchouras2025, title = {Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting}, author = {Georgios Bouchouras, Dimitrios Doumanas, Andreas Soularidis, Konstantinos Kotis, George A Vouros}, url = {https://arxiv.org/pdf/2512.14288}, doi = {https://doi.org/10.48550/arXiv.2512.14288}, year = {2025}, date = {2025-12-16}, journal = {arXiv}, abstract = {This paper explores the integration of Large Language Models (LLMs) in the engineering of a Parkinson’s Disease (PD) monitoring and alerting ontology through four key methodologies: One Shot (OS) prompt techniques, Chain of Thought (CoT) prompts, X-HCOME, and SimX-HCOME+. The primary objective is to determine whether LLMs alone can create comprehensive ontologies and, if not, whether human-LLM collaboration can achieve this goal. Consequently, the paper assesses the effectiveness of LLMs in automated ontology development and the enhancement achieved through human-LLM collaboration. Initial ontology generation was performed using One Shot (OS) and Chain of Thought (CoT) prompts, demonstrating the capability of LLMs to autonomously construct ontologies for PD monitoring and alerting. However, these outputs were not comprehensive and required substantial human refinement to enhance their completeness and accuracy. X-HCOME, a hybrid ontology engineering approach that combines human expertise with LLM capabilities, showed significant improvements in ontology comprehensiveness. This methodology resulted in ontologies that are very similar to those constructed by experts. 
Further experimentation with SimX-HCOME+, another hybrid methodology emphasizing continuous human supervision and iterative refinement, highlighted the importance of ongoing human involvement. This approach led to the creation of more comprehensive and accurate ontologies. Overall, the paper underscores the potential of human-LLM collaboration in advancing ontology engineering, particularly in complex domains like PD. The results suggest promising directions for future research, including the development of specialized GPT models for ontology construction.}, keywords = {}, pubstate = {published}, tppubtype = {article} } This paper explores the integration of Large Language Models (LLMs) in the engineering of a Parkinson’s Disease (PD) monitoring and alerting ontology through four key methodologies: One Shot (OS) prompt techniques, Chain of Thought (CoT) prompts, X-HCOME, and SimX-HCOME+. The primary objective is to determine whether LLMs alone can create comprehensive ontologies and, if not, whether human-LLM collaboration can achieve this goal. Consequently, the paper assesses the effectiveness of LLMs in automated ontology development and the enhancement achieved through human-LLM collaboration. Initial ontology generation was performed using One Shot (OS) and Chain of Thought (CoT) prompts, demonstrating the capability of LLMs to autonomously construct ontologies for PD monitoring and alerting. However, these outputs were not comprehensive and required substantial human refinement to enhance their completeness and accuracy. X-HCOME, a hybrid ontology engineering approach that combines human expertise with LLM capabilities, showed significant improvements in ontology comprehensiveness. This methodology resulted in ontologies that are very similar to those constructed by experts. 
Further experimentation with SimX-HCOME+, another hybrid methodology emphasizing continuous human supervision and iterative refinement, highlighted the importance of ongoing human involvement. This approach led to the creation of more comprehensive and accurate ontologies. Overall, the paper underscores the potential of human-LLM collaboration in advancing ontology engineering, particularly in complex domains like PD. The results suggest promising directions for future research, including the development of specialized GPT models for ontology construction. |
| 13. | Asimina Dimara Konstantinos Kotis, Stamatis Chatzistamatis Nikolaos Evangeliou Chrysaphis Nathanailidis George Tsekouras E: Towards Effective Data Process Pipelines for Legal NLP in English and Non-English Languages: A Greek Case Study. Computing, Communications and IoT Applications (ComComAp), 2025, ISBN: 979-8-3315-9143-4. (Type: Conference | Abstract | Links | BibTeX) @conference{Dimara2025b, title = {Towards Effective Data Process Pipelines for Legal NLP in English and Non-English Languages: A Greek Case Study}, author = {Asimina Dimara, Konstantinos Kotis, Stamatis Chatzistamatis, Nikolaos Evangeliou, Chrysaphis Nathanailidis, George E Tsekouras}, url = {https://ieeexplore.ieee.org/abstract/document/11353184}, doi = {https://doi.org/10.1109/ComComAp68359.2025.11353184}, isbn = {979-8-3315-9143-4}, year = {2025}, date = {2025-12-14}, booktitle = {Computing, Communications and IoT Applications (ComComAp)}, abstract = {Natural Language Processing (NLP) pipelines form the backbone of legal artificial intelligence applications, yet most existing tools are designed for English corpora and perform poorly when transferred to morphologically rich, non-English languages. This paper investigates these limitations through a comparative study of English and Greek legal texts. It is shown that English-centric pipelines exhibit systematic errors in preprocessing (tokenization, lemmatization, stop-word removal) and fail to capture legal semantics in embeddings, resulting in degraded downstream performance. To address these issues, a generalized framework is proposed that introduces language-specific preprocessing, curated legal resources, and multilingual embeddings fine-tuned on legal corpora. A case study demonstrates how adapted tools substantially improve similarity scores and classification accuracy in Greek legal texts, while highlighting persistent challenges such as grammatical gender bias. 
The findings underscore the need for fairness-aware, language-specific NLP pipelines to support robust and inclusive legal AI across diverse jurisdictions.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Natural Language Processing (NLP) pipelines form the backbone of legal artificial intelligence applications, yet most existing tools are designed for English corpora and perform poorly when transferred to morphologically rich, non-English languages. This paper investigates these limitations through a comparative study of English and Greek legal texts. It is shown that English-centric pipelines exhibit systematic errors in preprocessing (tokenization, lemmatization, stop-word removal) and fail to capture legal semantics in embeddings, resulting in degraded downstream performance. To address these issues, a generalized framework is proposed that introduces language-specific preprocessing, curated legal resources, and multilingual embeddings fine-tuned on legal corpora. A case study demonstrates how adapted tools substantially improve similarity scores and classification accuracy in Greek legal texts, while highlighting persistent challenges such as grammatical gender bias. The findings underscore the need for fairness-aware, language-specific NLP pipelines to support robust and inclusive legal AI across diverse jurisdictions. |
| 14. | Alexandros Karakikes, Konstantinos Kotis : AI-Assisted OSINT/SOCMINT for Safeguarding Borders: A Systematic Review. In: Information, 16 (12), pp. 1095, 2025, ISSN: 2078-2489. (Type: Journal Article | Abstract | Links | BibTeX) @article{Karakikes2025, title = {AI-Assisted OSINT/SOCMINT for Safeguarding Borders: A Systematic Review}, author = {Alexandros Karakikes, Konstantinos Kotis}, url = {https://www.mdpi.com/2078-2489/16/12/1095}, doi = {https://doi.org/10.3390/info16121095}, issn = {2078-2489}, year = {2025}, date = {2025-12-10}, journal = {Information}, volume = {16}, number = {12}, pages = {1095}, abstract = {In the highly volatile realm of global security, the necessity for leading-edge and effectual border resilience tactics has never been more imperative. This PRISMA 2020 guided systematic literature review (SLR) examines the intersection of artificial intelligence (AI), open-source intelligence (OSINT), and social media intelligence (SOCMINT) for enhancing border protection. Our systematic investigation across major databases (IEEE Xplore, Scopus, SpringerLink, MDPI, ACM) and grey literature sources yielded 3932 initial records and, after screening and eligibility assessment, 73 studies and reports from acknowledged organizations, contributing to the evidence synthesis. Three research questions (RQ1–RQ3) were addressed concerning the following: (a) the effectiveness and application of AI in OSINT/SOCMINT for border protection, its (b) data, technical, and operational limitations, and its (c) ethical, legal, and societal implications (GELSI). Evidence matrices summarize the findings, while narrative syntheses underline and thematically group the extracted insights. 
Results indicate that AI techniques—fluctuating from machine learning (ML) and natural language processing (NLP) to computer vision and emerging large language models (LLMs)—produce quantifiable improvements in forecasting irregular migration, detecting human trafficking, and supporting multimodal intelligence fusion. However, limitations include misinformation, data bias, adversarial vulnerabilities, governance deficits, and sandbox-to-production gaps. Ethical and societal concerns highlight risks of surveillance overreach, discrimination, and insufficient oversight, among others. To our knowledge, this is the first SLR at this intersection. We conclude that, AI-assisted OSINT/SOCMINT presents transformative potential for border protection requiring, nonetheless, balanced governance, robust validation, and future research on LLM/agentic AI, human–AI teaming, and oversight mechanisms.}, keywords = {}, pubstate = {published}, tppubtype = {article} } In the highly volatile realm of global security, the necessity for leading-edge and effectual border resilience tactics has never been more imperative. This PRISMA 2020 guided systematic literature review (SLR) examines the intersection of artificial intelligence (AI), open-source intelligence (OSINT), and social media intelligence (SOCMINT) for enhancing border protection. Our systematic investigation across major databases (IEEE Xplore, Scopus, SpringerLink, MDPI, ACM) and grey literature sources yielded 3932 initial records and, after screening and eligibility assessment, 73 studies and reports from acknowledged organizations, contributing to the evidence synthesis. Three research questions (RQ1–RQ3) were addressed concerning the following: (a) the effectiveness and application of AI in OSINT/SOCMINT for border protection, its (b) data, technical, and operational limitations, and its (c) ethical, legal, and societal implications (GELSI). 
Evidence matrices summarize the findings, while narrative syntheses underline and thematically group the extracted insights. Results indicate that AI techniques—fluctuating from machine learning (ML) and natural language processing (NLP) to computer vision and emerging large language models (LLMs)—produce quantifiable improvements in forecasting irregular migration, detecting human trafficking, and supporting multimodal intelligence fusion. However, limitations include misinformation, data bias, adversarial vulnerabilities, governance deficits, and sandbox-to-production gaps. Ethical and societal concerns highlight risks of surveillance overreach, discrimination, and insufficient oversight, among others. To our knowledge, this is the first SLR at this intersection. We conclude that AI-assisted OSINT/SOCMINT presents transformative potential for border protection requiring, nonetheless, balanced governance, robust validation, and future research on LLM/agentic AI, human–AI teaming, and oversight mechanisms. |
| 15. | Dimitris Kostadimas Vlasios Kasapakis, Konstantinos Kotis : Exploiting VR, AIoT and Semantics Towards an Adaptive Virtual Museum. 20th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP), 2025, ISBN: 979-8-3315-8704-8. (Type: Conference | Abstract | Links | BibTeX) @conference{Kostadimas2025b, title = {Exploiting VR, AIoT and Semantics Towards an Adaptive Virtual Museum}, author = {Dimitris Kostadimas, Vlasios Kasapakis, Konstantinos Kotis}, url = {https://ieeexplore.ieee.org/abstract/document/11309793}, doi = {https://doi.org/10.1109/SMAP66932.2025.00034}, isbn = {979-8-3315-8704-8}, year = {2025}, date = {2025-11-27}, booktitle = {20th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)}, pages = {157-162}, abstract = {Museums have long been spaces of wonder and discovery, but as technology evolves, so do the ways we engage with these cultural treasures. The design of adaptive virtual environments becomes essential to maintaining user interest and relevance. In this paper, an adaptive virtual museum system is proposed that explores the use of virtual reality (VR), artificial intelligence (AI), Internet of Things (IoT) as well as semantics to personalize and optimize virtual exhibition experiences. Based on the results of our previous research conducted regarding the possible combination of VR, AI and IoT (AIoT) for the design of innovative intelligent systems in different domains, our current work proposes a novel way to integrate all these technologies within the domain of cultural heritage (CH), a combination that remains relatively underexplored. The proposed framework, which is currently a work in progress, introduces new ways to modeling museums’ visitor behavior and preferences (mainly by using head-mounted displays (HMDs)) in a VR environment to dynamically adapt exhibition layouts, as well as to provide personalized content through a digital twin (DT) of a real museum. 
A key focus lies in intelligent user profiling and route/layout optimization to enhance visitor engagement and provide rich content through integration of Large Language Models (LLM). Although implementation is ongoing, this paper describes the conceptual design, core objectives, and anticipated impact on the broader scope of adaptive multimedia applications and personalized cultural experiences.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Museums have long been spaces of wonder and discovery, but as technology evolves, so do the ways we engage with these cultural treasures. The design of adaptive virtual environments becomes essential to maintaining user interest and relevance. In this paper, an adaptive virtual museum system is proposed that explores the use of virtual reality (VR), artificial intelligence (AI), Internet of Things (IoT) as well as semantics to personalize and optimize virtual exhibition experiences. Based on the results of our previous research conducted regarding the possible combination of VR, AI and IoT (AIoT) for the design of innovative intelligent systems in different domains, our current work proposes a novel way to integrate all these technologies within the domain of cultural heritage (CH), a combination that remains relatively underexplored. The proposed framework, which is currently a work in progress, introduces new ways to modeling museums’ visitor behavior and preferences (mainly by using head-mounted displays (HMDs)) in a VR environment to dynamically adapt exhibition layouts, as well as to provide personalized content through a digital twin (DT) of a real museum. A key focus lies in intelligent user profiling and route/layout optimization to enhance visitor engagement and provide rich content through integration of Large Language Models (LLM). 
Although implementation is ongoing, this paper describes the conceptual design, core objectives, and anticipated impact on the broader scope of adaptive multimedia applications and personalized cultural experiences. |
| 16. | Andreas Sideras, Konstantinos Bougiatiotis, Elias Zavitsanos, Georgios Paliouras, George Vouros : A Multimodal Alignment-Based Anomaly Detection Method for Bankruptcy Prediction. Proceedings of the 6th ACM International Conference on AI in Finance, 2025, ISBN: 9798400722202. (Type: Conference | Abstract | Links | BibTeX) @conference{Sideras2025, title = {A Multimodal Alignment-Based Anomaly Detection Method for Bankruptcy Prediction}, author = {Sideras, Andreas and Bougiatiotis, Konstantinos and Zavitsanos, Elias and Paliouras, Georgios and Vouros, George}, url = {https://dl.acm.org/doi/full/10.1145/3768292.3770380}, doi = {10.1145/3768292.3770380}, isbn = {9798400722202}, year = {2025}, date = {2025-11-15}, booktitle = {Proceedings of the 6th ACM International Conference on AI in Finance}, pages = {53--61}, abstract = {We present a novel anomaly detection method for next-year bankruptcy prediction, utilizing a combination of financial figures and textual content from annual reports. Our approach, MABAD, learns a shared representation space where non-bankrupt firms share position and orientation. Samples that deviate from this pattern are assigned a higher anomaly score. The proposed method is tailored for highly imbalanced scenarios and is robust to heterogeneous, incomplete, and potentially contradictory inputs. We demonstrate that MABAD consistently outperforms a range of strong baselines, and we also curate and release a new publicly available multisource dataset to foster further research in the domain.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } We present a novel anomaly detection method for next-year bankruptcy prediction, utilizing a combination of financial figures and textual content from annual reports. Our approach, MABAD, learns a shared representation space where non-bankrupt firms share position and orientation. Samples that deviate from this pattern are assigned a higher anomaly score. 
The proposed method is tailored for highly imbalanced scenarios and is robust to heterogeneous, incomplete, and potentially contradictory inputs. We demonstrate that MABAD consistently outperforms a range of strong baselines, and we also curate and release a new publicly available multisource dataset to foster further research in the domain. |
| 17. | Elias Zavitsanos, Konstantinos Bougiatiotis, Andreas Sideras, Georgios Paliouras : Positive-Unlabeled Learning for Financial Misstatement Detection under Realistic Constraints. ICAIF ’25: Proceedings of the 6th ACM International Conference on AI in Finance, 2025, ISBN: 9798400722202. (Type: Conference | Abstract | Links | BibTeX) @conference{Zavitsanos2025, title = {Positive-Unlabeled Learning for Financial Misstatement Detection under Realistic Constraints}, author = {Zavitsanos, Elias and Bougiatiotis, Konstantinos and Sideras, Andreas and Paliouras, Georgios}, url = {https://dl.acm.org/doi/full/10.1145/3768292.3770366 https://dl.acm.org/doi/epdf/10.1145/3768292.3770366}, doi = {10.1145/3768292.3770366}, isbn = {9798400722202}, year = {2025}, date = {2025-11-15}, booktitle = {ICAIF ’25: Proceedings of the 6th ACM International Conference on AI in Finance}, pages = {864--872}, abstract = {Detecting financial misstatements is critical for market integrity but remains challenging due to class imbalance, delayed discovery, and limited labeled data. We propose a novel Positive-Unlabeled (PU) learning framework that models the detection task under realistic constraints, where only a small subset of misstatements is known at training time. Our approach integrates unlabeled data into training, preserves temporal structure, and accounts for extreme imbalance. We construct and release a benchmark dataset reflecting these characteristics and evaluate several PU learning methods against recent baselines. Results show that PU-based models consistently outperform supervised approaches, highlighting their suitability for real-world misstatement detection.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Detecting financial misstatements is critical for market integrity but remains challenging due to class imbalance, delayed discovery, and limited labeled data. 
We propose a novel Positive-Unlabeled (PU) learning framework that models the detection task under realistic constraints, where only a small subset of misstatements is known at training time. Our approach integrates unlabeled data into training, preserves temporal structure, and accounts for extreme imbalance. We construct and release a benchmark dataset reflecting these characteristics and evaluate several PU learning methods against recent baselines. Results show that PU-based models consistently outperform supervised approaches, highlighting their suitability for real-world misstatement detection. |
| 18. | Dimitrios Doumanas Andreas Soularidis, Konstantinos Kotis : Causal Reasoning and Large Language Models for Military Decision-Making: Rethinking the Command Structures in the Era of Generative AI. In: AI, 7 (1), pp. 14, 2025, ISSN: 2673-2688. (Type: Journal Article | Abstract | Links | BibTeX) @article{Doumanas2025e, title = {Causal Reasoning and Large Language Models for Military Decision-Making: Rethinking the Command Structures in the Era of Generative AI}, author = {Dimitrios Doumanas, Andreas Soularidis, Konstantinos Kotis}, url = {https://www.mdpi.com/2673-2688/7/1/14}, doi = {https://doi.org/10.3390/ai7010014}, issn = {2673-2688}, year = {2025}, date = {2025-10-24}, journal = {AI}, volume = {7}, number = {1}, pages = {14}, abstract = {Military decision-making is inherently complex and highly critical, requiring commanders to assess multiple variables in real-time, anticipate second-order effects, and adapt strategies based on continuously evolving battlefield conditions. Traditional approaches rely on domain expertise, experience, and intuition, often supported by decision-support systems designed by military experts. With the rapid advancement of Large Language Models (LLMs) such as ChatGPT, Claude, and DeepSeek, a new research question emerges: can LLMs perform causal reasoning at a level that could meaningfully replace human decision-makers, or should they remain human-led decision-support tools in high-stakes environments? This paper explores the causal reasoning capabilities of LLMs for operational and strategic military decisions. Unlike conventional AI models that rely primarily on correlation-based predictions, LLMs are now able to engage in multi-perspective reasoning, intervention analysis, and scenario-based assessments. We introduce a structured empirical evaluation framework to assess LLM performance through 10 de-identified real-world-inspired battle scenarios, ensuring models reason over provided inputs rather than memorized data. 
Critically, LLM outputs are systematically compared against a human expert baseline, composed of military officers across multiple ranks and years of operational experience. The evaluation focuses on precision, recall, causal reasoning depth, adaptability, and decision soundness. Our findings provide a rigorous comparative assessment of whether carefully prompted LLMs can assist, complement, or approach expert-level performance in military planning. While fully autonomous AI-led command remains premature, the results suggest that LLMs can offer valuable support in complex decision processes when integrated as part of hybrid human-AI decision-support frameworks. Since our evaluation directly tests this capability, this paradigm shift raises fundamental question: Is there a possibility to fully replace high-ranking officers/commanders in leading critical military operations, or should AI-driven tools remain as decision-support systems enhancing human-driven battlefield strategies?}, keywords = {}, pubstate = {published}, tppubtype = {article} } Military decision-making is inherently complex and highly critical, requiring commanders to assess multiple variables in real-time, anticipate second-order effects, and adapt strategies based on continuously evolving battlefield conditions. Traditional approaches rely on domain expertise, experience, and intuition, often supported by decision-support systems designed by military experts. With the rapid advancement of Large Language Models (LLMs) such as ChatGPT, Claude, and DeepSeek, a new research question emerges: can LLMs perform causal reasoning at a level that could meaningfully replace human decision-makers, or should they remain human-led decision-support tools in high-stakes environments? This paper explores the causal reasoning capabilities of LLMs for operational and strategic military decisions. 
Unlike conventional AI models that rely primarily on correlation-based predictions, LLMs are now able to engage in multi-perspective reasoning, intervention analysis, and scenario-based assessments. We introduce a structured empirical evaluation framework to assess LLM performance through 10 de-identified real-world-inspired battle scenarios, ensuring models reason over provided inputs rather than memorized data. Critically, LLM outputs are systematically compared against a human expert baseline, composed of military officers across multiple ranks and years of operational experience. The evaluation focuses on precision, recall, causal reasoning depth, adaptability, and decision soundness. Our findings provide a rigorous comparative assessment of whether carefully prompted LLMs can assist, complement, or approach expert-level performance in military planning. While fully autonomous AI-led command remains premature, the results suggest that LLMs can offer valuable support in complex decision processes when integrated as part of hybrid human-AI decision-support frameworks. Since our evaluation directly tests this capability, this paradigm shift raises a fundamental question: Is there a possibility to fully replace high-ranking officers/commanders in leading critical military operations, or should AI-driven tools remain as decision-support systems enhancing human-driven battlefield strategies? |
| 19. | Theodore Tranos Nikolaos Fesakis, Thomas Vasileiou Sotirios Christopoulos Georgio Loukos Maria Koutsoupidou : AI-Based Energy Forecasting at Different Distribution Grid Levels to Support Baseline Definition and DSO Participation in LFMs. 2025 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT Europe), IEEE, 2025, ISBN: 979-8-3315-2503-3. (Type: Conference | Abstract | Links | BibTeX) @conference{Tranos2025, title = {AI-Based Energy Forecasting at Different Distribution Grid Levels to Support Baseline Definition and DSO Participation in LFMs}, author = {Theodore Tranos, Nikolaos Fesakis, Thomas Vasileiou, Sotirios Christopoulos, Georgio Loukos, Maria Koutsoupidou}, url = {https://ieeexplore.ieee.org/abstract/document/11305676}, doi = {https://doi.org/10.1109/ISGTEurope64741.2025.11305676}, isbn = {979-8-3315-2503-3}, year = {2025}, date = {2025-10-20}, booktitle = {2025 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT Europe)}, pages = {1-5}, publisher = {IEEE}, abstract = {A crucial aspect of Local Flexibility Markets (LFMs) is the definition of a baseline for energy production and demand forecasting, which serves as a reference for validating and compensating flexibility services. In this study, we explore the application of machine learning techniques, specifically Long Short-Term Memory (LSTM) networks, to establish accurate baselines for consumers and producers connected to the LV grid. The LSTM models leverage real historical demand and generation data from DSO smart meters in Mesogeia, Greece, combined with weather variables such as temperature and cloud coverage, to enhance forecasting accuracy. 
Our goal is to evaluate forecasting accuracy at the individual participant level and compare it with the accuracy obtained from forecasting on aggregated consumption/production data within a specific grid segment or using data from the secondary substation to which the participants are connected.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } A crucial aspect of Local Flexibility Markets (LFMs) is the definition of a baseline for energy production and demand forecasting, which serves as a reference for validating and compensating flexibility services. In this study, we explore the application of machine learning techniques, specifically Long Short-Term Memory (LSTM) networks, to establish accurate baselines for consumers and producers connected to the LV grid. The LSTM models leverage real historical demand and generation data from DSO smart meters in Mesogeia, Greece, combined with weather variables such as temperature and cloud coverage, to enhance forecasting accuracy. Our goal is to evaluate forecasting accuracy at the individual participant level and compare it with the accuracy obtained from forecasting on aggregated consumption/production data within a specific grid segment or using data from the secondary substation to which the participants are connected. |
| 20. | Asimina Dimara Konstantinos Kotis, Alexios Papaioannou Stamatis Chatzistamatis Nikolaos Evangeliou Chrysaphis Nathanailidis George Tsekouras : Data Collection, Organization, and Privacy-Preserving Preparation for Edge-Based LLMs in Legal Text Analytics. 5th International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), 2025, ISBN: 979-8-3315-3556-8. (Type: Conference | Abstract | Links | BibTeX) @conference{Dimara2025, title = {Data Collection, Organization, and Privacy-Preserving Preparation for Edge-Based LLMs in Legal Text Analytics}, author = {Asimina Dimara, Konstantinos Kotis, Alexios Papaioannou, Stamatis Chatzistamatis, Nikolaos Evangeliou, Chrysaphis Nathanailidis, George Tsekouras}, url = {https://ieeexplore.ieee.org/abstract/document/11277858}, doi = {https://doi.org/10.1109/ICECCME64568.2025.11277858}, isbn = {979-8-3315-3556-8}, year = {2025}, date = {2025-10-16}, booktitle = {5th International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)}, abstract = {Providing fairness and privacy in automated legal text processing is an essential issue, especially with the increasing usage of Large Language Models (LLMs), in sensitive public sector applications. This paper presents a modular edge native domain-specific architecture for legal document processing that avoids cloud infrastructure and external APIs. The system combines local ingestion, semantic embedding, and retrievalaugmented generation to empower autonomous agents for applications such as bias detection and clause summarization. Inference is done exclusively on-device by a 4-bit quantized LLaMA model run by CPU-only runtimes. Tested on the CLEAR-Bias benchmark, the system gets 92% prompt relevance and 90% output coherence, inference latency below 6.5 s, and memory usage below 5.5 GB. 
These findings validate the effectiveness of privacy-preserving, regulation-conforming legal NLP in constrained environments.}, keywords = {}, pubstate = {published}, tppubtype = {conference} } Providing fairness and privacy in automated legal text processing is an essential issue, especially with the increasing usage of Large Language Models (LLMs), in sensitive public sector applications. This paper presents a modular edge native domain-specific architecture for legal document processing that avoids cloud infrastructure and external APIs. The system combines local ingestion, semantic embedding, and retrieval-augmented generation to empower autonomous agents for applications such as bias detection and clause summarization. Inference is done exclusively on-device by a 4-bit quantized LLaMA model run by CPU-only runtimes. Tested on the CLEAR-Bias benchmark, the system gets 92% prompt relevance and 90% output coherence, inference latency below 6.5 s, and memory usage below 5.5 GB. These findings validate the effectiveness of privacy-preserving, regulation-conforming legal NLP in constrained environments. |