University of Oulu

N. Saarimäki, S. Moreschini, F. Lomio, R. Penaloza and V. Lenarduzzi, "Towards a Robust Approach to Analyze Time-Dependent Data in Software Engineering," 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Honolulu, HI, USA, 2022, pp. 36-40, doi: 10.1109/SANER53432.2022.00015

Towards a robust approach to analyze time-dependent data in software engineering

Saved in:
Author: Saarimäki, Nyyti1; Moreschini, Sergio1; Lomio, Francesco1;
Organizations: 1Tampere University, Finland
2University of Milano-Bicocca
3University of Oulu, Finland
Format: article
Version: accepted version
Access: open
Online Access: PDF Full Text (PDF, 0.5 MB)
Persistent link: http://urn.fi/urn:nbn:fi-fe2023032332998
Language: English
Published: Institute of Electrical and Electronic Engineers, 2022
Publish Date: 2023-03-23
Description:

Abstract

Background: Several recent software engineering studies use data mined from the version control systems adopted by the different software projects. However, inspecting the data and statistical methods used in those studies reveals several problems with the current approach, mainly related to the dependent nature of the data.

Objective: We analyzed time-dependent data in software engineering at commit level, and propose an alternative approach based on time series analysis.

Method: We identified statistical tests designed for time series analysis and propose a technique to model time dependent data, similarly to what is done in finance and weather forecasting. We applied our approach to a small set of projects of different sizes, investigating the behaviour of the SQALE Index, in order to highlight the time and interdependency of the different commits.

Results: Using these techniques, we analysed and model the data, showing that it is possible to investigate this type of commit data using methods from time series analysis.

Conclusion: Based on the promising results, we plan to validate the robustness of the approach by replicating previous works.

see all

Series: IEEE International Conference on Software Analysis, Evolution and Reengineering
ISSN: 1534-5351
ISSN-E: 2640-7574
ISSN-L: 1534-5351
ISBN: 978-1-6654-3786-8
ISBN Print: 978-1-6654-3787-5
Pages: 26 - 40
DOI: 10.1109/saner53432.2022.00015
OADOI: https://oadoi.org/10.1109/saner53432.2022.00015
Host publication: 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)
Conference: IEEE International Conference on Software Analysis, Evolution and Reengineering
Type of Publication: A4 Article in conference proceedings
Field of Science: 213 Electronic, automation and communications engineering, electronics
Subjects:
Copyright information: © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.