Big data is defined as data sets that are too large and/or complex to be processed by classical data processing methods. Big data analysis is essential because it enables more competent business movements, more efficient operations, and higher profits by using the data of institutions and organizations. However, large datasets are difficult to analyze because they are produced quickly, require large storage areas in computer systems, and the diversity of their data. In this study, a new approach using the denormalization method is proposed to accelerate the response time of the database in database systems where large volumes of data containing historical information are stored. Denormalization is defined as the process of adding rows or columns that are not needed to increase the reading performance of the database to the database system that has been normalized. In the proposed approach in this study, a large-volume dataset consisting of real spatial data belonging to Kayseri Metropolitan Municipality, containing temporal information and having approximately 96,000,000 row records, was used. In the proposed approach, the response time of the query is accelerated by recording the time information as numbers to increase the query performance of large volumes of data recorded in date format due to the temporal query process. The performance of the proposed method is compared with the performance of the normalization method using actual data on Microsoft SQL Server and Oracle database systems. The method proposed in the experimental evaluations shows that it works approximately eight times faster. In addition, the experimental results showed that the proposed method improves query performance more than the normalization-based method as the data size increases.
Large valume data Denormalization Temporal query Database performance
Kayseri Üniversitesi
FHD-2021-1043
We would like to thank Kayseri Metropolitan Municipality for sharing large volumes of spatial data containing real-time information of the vehicles operating in the public transportation system used within the scope of this study. In addition, this study was supported by the Scientific Research Projects Coordination Unit of Kayseri University within the scope of project #FHD-2021-1043.
FHD-2021-1043
Birincil Dil | İngilizce |
---|---|
Konular | Bilgisayar Yazılımı |
Bölüm | Araştırma Makalesi |
Yazarlar | |
Proje Numarası | FHD-2021-1043 |
Yayımlanma Tarihi | 19 Ekim 2022 |
Yayımlandığı Sayı | Yıl 2022 Cilt: 10 Sayı: 4 |
All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.