Classification of role stereotypes for classes in UML class diagrams using machine learning

Ahmed, Jobaer
Huang, Maoyi
Göteborgs universitet/Institutionen för data- och informationsteknikswe
University of Gothenburg/Department of Computer Science and Engineeringeng
2021-03-03T10:45:11Z
2021-03-03T10:45:11Z
2021-03-03
Software development process is becoming inherently complex in recent decades. To reduce the complexity in the development process developers, software practitioners are constantly looking for newer approach. One approach can be understanding the software design for instance, the UML models earlier in the software development process. For analyzing UML models, one could use knowledge about role-stereotypes. Knowledge about role stereotypes can help during software quality assessment, for summarizing software and thereby to ease the understanding of software designs. This study presents a machine learning-based approach for classifying the role-stereotype of classes in UML class diagrams. We have established a ground truth by manually labelling 391+ classes from 15 open source projects (using various programming languages). We analyze the performance of the machine learning approach with the manually established ground truth. Besides, we show a comparison between our approach and another machine learning approach from an earlier case study which is based on source code. Furthermore, we compare different machine learning (ML) algorithms to find out the best ML algorithm for classifying our dataset. Another noteworthy contribution of this study is an analysis of which features are most relevant for classifying classes into role stereotype and which features generate the best classification performance. According to our findings, the J48 classifier performs best when classifying the raw dataset and the Random Forest classifier performs best on a more balanced dataset which has been obtained by applying SMOTE oversampling. By using our classifier software developers can analyze patterns in their software design at the early stage of software development process.sv
http://hdl.handle.net/2077/67955
engsv
Technology
role-stereotypessv
machine learning algorithmsv
classificationsv
data analysissv
data miningsv
UML class diagramsv
software designsv
software engineeringsv
Classification of role stereotypes for classes in UML class diagrams using machine learningsv
text
Student essay
H2

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
gupea_2077_67955_1.pdf
Size:
19.2 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
876 B
Format:
Item-specific license agreed upon to submission
Description:

Collections