Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application

Mazniha Berahim, Mazniha Berahim and Samsudin, Noor Azah and Mustapha, Aida and Mohd Salleh, Rohayu and Mohd Nasi, Muhammad Jaffri (2024) Bag-Based Feature-Class Correlation Analysis for Multi-Instance Learning Application. COMPENDIUM by paperASIA. pp. 51-61.

[img] Text
J17533_bdc74aa47f8929110c75df69a7d6f211.pdf
Restricted to Registered users only

Download (2MB) | Request a copy

Abstract

Multi-instance Learning (MIL) is widely applied in image classification. In MIL, an image is presented as a bag. A bag consists of multi-instance which is known as patches. Irrelevant features of the image presented to the classifier affects the classification performance. Feature selection is one of the essential phases to select relevant. However, limited studies discuss the feature selection phase in MIL. Correlation between feature-class (FC) relationship is one important criterion to analyse features’ relevance. However, it cannot be performed directly in MIL. To address this gap, this study proposed the MultiBag-FCCorr feature selection technique. It consists of three steps: transformation, evaluation and fusion. The bags of feature information are acquired from summarization from different statistical central tendency measures of trimmed mean, mode and median. In feature evaluation step, extended point biserial correlation has been used to measure FC correlation and then the FC score has been analysed. The selected features are validated via two prominent classifiers (Support Vector Machine (SVM) and K-Nearest Neighbour (KNN)) on benchmark MI image datasets: UCSB Breast Cancer, Tiger, Elephant and Fox datasets. The selected features of UCSB Breast Cancer dataset were reduced to 92% number of features from the proposed technique giving the best result of average accuracy with 86.8.% using SVM and 84.5% using KNN. The average accuracy improved 6.3% using SVM and 16.4% using KNN compared without implementing the proposed feature selection. The results proved that the selected feature set improved the performance of MI image classification.

Item Type: Article
Uncontrolled Keywords: Bag-based feature selection, Feature correlation, Feature analysis, Fusion scheme, Multi-instance learning
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: Faculty of Computer Science and Information Technology > FSKTM
Depositing User: Mr. Mohamad Zulkhibri Rahmad
Date Deposited: 04 Jun 2024 02:45
Last Modified: 04 Jun 2024 02:45
URI: http://eprints.uthm.edu.my/id/eprint/11058

Actions (login required)

View Item View Item