A Novel Automatic Relational Database Normalization Method
A Novel Automatic Relational Database Normalization Method
Author(s): Emre Akadal, Mehmet Hakan SatmanSubject(s): ICT Information and Communications Technologies
Published by: Vysoká škola ekonomická v Praze
Keywords: Relational databases; Automatic normalization; Genetic algorithms; Optimization; Decision support
Summary/Abstract: The increase in data diversity and the fact that database design is a difficult process make it practically impossible to design a unique database schema for all datasets encountered. In this paper, we introduce a fully automatic genetic algorithm-based relational database normalization method for revealing the right database schema using a raw dataset and without the need for any prior knowledge. For measuring the performance of the algorithm, we perform a simulation study using 250 datasets produced using 50 well-known databases. A total of 2500 simulations are carried out, ten times for each of five denormalized variations of all database designs containing different synthetic contents. The results of the simulation study show that the proposed algorithm discovers exactly 72% of the unknown database schemas. The performance can be improved by fine-tuning the optimization parameters. The results of the simulation study also show that the devised algorithm can be used in many datasets to reveal structs of databases when only a raw dataset is available at hand.
Journal: Acta Informatica Pragensia
- Issue Year: 11/2022
- Issue No: 3
- Page Range: 293-308
- Page Count: 16
- Language: English