Machine learning-based model categorization using textual and structural features
Citation
Khalilipour, A., Bozyiğit, F., Utku, C. & Challenger, M. (2022). Machine learning-based model categorization using textual and structural features. Paper presented at the Communications in Computer and Information Science, 1652, 425-436. doi:10.1007/978-3-031-15743-1_39
Abstract
Model Driven Engineering (MDE), in which models are the core elements throughout the life cycle from specification to maintenance, is a promising technique for providing abstraction and automation. However, model management is challenging because of the increasing number of models, their size, and their structural complexity. Modelers therefore need to organize the available models so that they can be reused and so that new, more complex models can be developed with less cost and effort. To this end, many studies have been conducted to categorize models automatically. However, most of them focus on either the textual data or the structural information, which lowers precision in model management activities. We therefore classify models with baseline machine learning approaches on a dataset of 555 Ecore metamodels, using hybrid feature vectors that include both textual and structural information. In the proposed approach, the textual information in each model's elements is first summarized through text processing and an ontology of synonyms within a specific domain. The performance of the machine learning classifiers is then observed on two variants of the dataset. The first variant includes only textual features (in both TF-IDF and word2vec representations), whereas the second consists of the determined structural features together with the textual features. Every machine learning algorithm tested achieved better prediction performance on the variant containing structural features. The presented model yields promising results for the model classification task, with a classification accuracy of 89.16%.
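The hybrid feature vector described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy metamodels, the three structural counts (classes, attributes, references), and the helper names are all hypothetical. A nearest-centroid classifier stands in for the baseline classifiers, which the abstract does not name, and the structural counts are left unscaled for brevity.

```python
import math
from collections import Counter, defaultdict

def fit_idf(token_lists):
    """Learn a vocabulary and smoothed IDF weights from tokenized models."""
    n = len(token_lists)
    df = Counter(tok for toks in token_lists for tok in set(toks))
    vocab = sorted(df)
    idf = {tok: math.log((1 + n) / (1 + df[tok])) + 1.0 for tok in vocab}
    return vocab, idf

def tfidf(tokens, vocab, idf):
    """TF-IDF vector over the fitted vocabulary; unseen tokens are ignored."""
    counts = Counter(tokens)
    total = len(tokens) or 1
    return [counts[tok] / total * idf[tok] for tok in vocab]

def hybrid_vector(tokens, structural, vocab, idf):
    """Concatenate textual (TF-IDF) and structural features into one vector."""
    return tfidf(tokens, vocab, idf) + [float(x) for x in structural]

def fit_centroids(vectors, labels):
    """Average the vectors of each class (nearest-centroid stand-in)."""
    groups = defaultdict(list)
    for vec, label in zip(vectors, labels):
        groups[label].append(vec)
    return {label: [sum(col) / len(vecs) for col in zip(*vecs)]
            for label, vecs in groups.items()}

def predict(centroids, vec):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(centroids[label], vec))

# Hypothetical toy data: (element-name tokens, (classes, attributes, references), label)
train = [
    (["book", "author", "library"], (3, 5, 2), "library"),
    (["book", "loan", "member"], (4, 6, 3), "library"),
    (["state", "transition", "event"], (3, 2, 4), "statemachine"),
    (["state", "region", "transition"], (4, 3, 5), "statemachine"),
]

vocab, idf = fit_idf([tokens for tokens, _, _ in train])
X = [hybrid_vector(tokens, struct, vocab, idf) for tokens, struct, _ in train]
y = [label for _, _, label in train]
centroids = fit_centroids(X, y)

query = hybrid_vector(["book", "member", "author"], (3, 5, 2), vocab, idf)
print(predict(centroids, query))
```

Dropping the structural part of `hybrid_vector` gives the text-only variant, so the two dataset variants compared in the abstract differ only in this concatenation step. In practice the structural counts would likely need scaling so that they do not dominate the distance.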
Source
Communications in Computer and Information Science
Volume
1652