14–18 Sept 2025
Piraeus, Greece
Europe/Athens timezone

From Unstructured Data to Knowledge Discovery: A Novel Deep Learning Framework for Text Mining

Not scheduled
20m
Piraeus, Greece

Piraeus, Greece

Machine Learning

Speaker

Grigorios Papageorgiou (University of Patras)

Description

Extracting meaningful insights from vast amounts of unstructured textual data presents significant challenges in text mining, particularly when attempting to separate valuable information from noise. This research introduces a novel deep learning framework for text mining that identifies latent structures within comprehensive text corpora. The proposed methodology incorporates an initial sentence classification phase to filter out irrelevant content while preserving essential information. Following this preprocessing step, we implement a deep learning-powered Named Entity Recognition (NER) system that uses predefined feature extraction to identify and extract critical entities, transforming them into structured data formats. We validate our approach using two datasets: BioCreative II Gene Mention (BC2GM) to compare it with other established approaches, and shipping industry datasets—a real-world dataset that contains emails for orders that have been executed. The findings demonstrate that deep learning significantly enhances text mining capabilities, proving its value for extracting essential information from large-scale textual repositories.

Classification Both methodology and application
Keywords Text Mining, Deep Learning, Transformers, Feature Extraction, Artificial Intelligence

Primary author

Grigorios Papageorgiou (University of Patras)

Co-authors

Mr Kyriakos Skalratos (University of Piraeus) Polychronis Economou (University of Patras)

Presentation materials

There are no materials yet.