Feature selection methods and sampling techniques to financial distress prediction for Vietnamese listed companies

Vu, Loan Thi and Vu, Lien Thi and Nguyen, Nga Thu and Do, Phuong Thi Thuy and Dao, Daniel (2019) Feature selection methods and sampling techniques to financial distress prediction for Vietnamese listed companies. Investment Management and Financial Innovations, 16 (1). pp. 276-290. (https://doi.org/10.21511/imfi.16(1).2019.22)

[thumbnail of Vu-etal-IMFI2019-Feature-selection-methods-sampling-techniques-financial-distress-prediction-Vietnamese-listed-companies]
Preview
Text. Filename: Vu_etal_IMFI2019_Feature_selection_methods_sampling_techniques_financial_distress_prediction_Vietnamese_listed_companies.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (489kB)| Preview

Abstract

The research is taken to integrate the effects of variable selection approaches, as well as sampling techniques, to the performance of a model to predict the financial distress for companies whose stocks are traded on securities exchanges of Vietnam. A firm is financially distressed when its stocks are delisted as requirement from Vietnam Stock Exchange because of making a loss in 3 consecutive years or having accumulated a loss greater than the company’s equity. There are 12 models, constructed differently in feature selection methods, sampling techniques, and classifiers. The feature selection methods are factor analysis and F-score selection, while 3 sets of data samples are chosen by choice-based method with different percentages of financially distressed firms. In terms of classifying technique, logistic regression together with SVM are used in these models. Data are collected from listed firms in Vietnam from 2009 to 2017 for 1, 2 and 3 years before the announcement of their delisting requirement. The experiment’s results highlight the outperformance of the SVM model with F-score selection method in a data sample containing the highest percentage of non-financially distressed firms.