University of Oulu

Tuovinen, L. and Suutala, J. (2021). Ontology-based Framework for Integration of Time Series Data: Application in Predictive Analytics on Data Center Monitoring Metrics. In Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KEOD, ISBN 978-989-758-533-3; ISSN 2184-3228, pages 151-161. DOI: 10.5220/0010650300003064

Ontology-based framework for integration of time series data : application in predictive analytics on data center monitoring metrics

Saved in:
Author: Tuovinen, Lauri1; Suutala, Jaakko1
Organizations: 1Biomimetics and Intelligent Systems Group, University of Oulu, Finland
Format: article
Version: published version
Access: open
Online Access: PDF Full Text (PDF, 0.5 MB)
Persistent link: http://urn.fi/urn:nbn:fi-fe2021110353382
Language: English
Published: Scitepress Science and Technology Publications, 2021
Publish Date: 2021-11-03
Description:

Abstract

Monitoring a large and complex system such as a data center generates many time series of metric data, which are often stored using a database system specifically designed for managing time series data. Different, possibly distributed, databases may be used to collect data representing different aspects of the system, which complicates matters when, for example, developing data analytics applications that require integrating data from two or more of these. From the developer’s point of view, it would be highly convenient if all of the required data were available in a single database, but it may well be that the different databases do not even implement the same query language. To address this problem, we propose using an ontology to capture the semantic similarities among different time series database systems and to hide their syntactic differences. Alongside the ontology, we have developed a Python software framework that enables the developer to build and execute queries using classes and properties defined by the ontology. The ontology thus effectively specifies a semantic query language that can be used to retrieve data from any of the supported database systems, and the Python framework can be set up to treat the different databases as a single data store that can be queried using this semantic language. This is demonstrated by presenting an application involving predictive analytics on resource usage and electricity consumption metrics gathered from a Kubernetes cluster, stored in Prometheus and KairosDB databases, but the framework can be extended in various ways and adapted to different use cases, enabling machine learning research using distributed heterogeneous data sources.

see all

ISBN Print: 978-989-758-533-3
Pages: 151 - 161
DOI: 10.5220/0010650300003064
OADOI: https://oadoi.org/10.5220/0010650300003064
Host publication: Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
Conference: International Conference on Knowledge Engineering and Ontology Development
Type of Publication: A4 Article in conference proceedings
Field of Science: 113 Computer and information sciences
Subjects:
Copyright information: © 2021 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved. Published under the CC-BY-NC-ND license.
  https://creativecommons.org/licenses/by-nc-nd/4.0/