IMBALANCED DATA: A COMPARATIVE ANALYSIS OF CLASSIFICATION ENHANCEMENTS USING AUGMENTED DATA

Authors

DOI:

https://doi.org/10.30890/2709-2313.2024-28-00-017

Keywords:

augmented data, different data modalities, various classifiers

Abstract

The paper discusses aspects of classification applied to different data modalities, including text, images, and audio. The classification challenges in the context of working with imbalanced data are highlighted, the consequences of which can be mitigated

Metrics

Metrics Loading ...

References

L. C. Ottoni, R. M. de Amorim, M. S. Novo, і D. B. Costa, «Tuning of data augmentation hyperparameters in deep learning to building construction image classification with small datasets», Intl. J. Mach. Learn. Cybern., 2023,

DOI: 10.1007/s13042-022-01555-1.

M. A. Kutlugun, Y. Sirin, і M. Karakaya, «The effects of augmented training dataset on performance of convolutional neural networks in face recognition system», в Proc. Fed. Conf. Comput. Sci. Inf. Syst., FedCSIS, Ganzha M., Maciaszek L., Maciaszek L., і Paprzycki M., Ред., Institute of Electrical and Electronics Engineers Inc., 2019, pp. 929–932. DOI: 10.15439/2019F181.

O. O. Abayomi-Alli, R. Damasevicius, R. Maskeliunas, і A. Abayomi-Alli, «BiLSTM with Data Augmentation using Interpolation Methods to Improve Early Detection of Parkinson Disease», в Proc. Fed. Conf. Comput. Sci. Inf. Syst., FedCSIS, Ganzha M., Maciaszek L., Maciaszek L., і Paprzycki M., Ред., Institute of Electrical and Electronics Engineers Inc., 2020, pp. 371–380.

DOI: 10.15439/2020F188.

L. Taylor і G. Nitschke, «Improving deep learning with generic data augmentation», в 2018 IEEE symposium series on computational intelligence (SSCI), IEEE, 2018, pp. 1542–1547.

W. Alosaimi і M. I. Uddin, «Efficient Data Augmentation Techniques for Improved Classification in Limited Data Set of Oral Squamous Cell Carcinoma», CMES Comput. Model. Eng. Sci., 2022. DOI: 10.32604/cmes.2022.018433.

K. Kim і J. Jeong, «Deep learning-based data augmentation for hydraulic condition monitoring system», в Procedia Comput. Sci., Shakshuki E., Yasar A-U-H., і Malik H., Elsevier B.V., 2020, pp. 20–27. DOI: 10.1016/j.procs.2020.07.007.

M. Bayer, M.-A. Kaufhold, B. Buchhold, M. Keller, J. Dallmeyer, і C. Reuter, «Data augmentation in natural language processing: a novel text generation approach for long and short text classifiers», Intl. J. Mach. Learn. Cybern., 2023.

DOI: 10.1007/s13042-022-01553-3.

Mikołajczyk і M. Grochowski, «Data augmentation for improving deep learning in image classification problem», в 2018 international interdisciplinary PhD workshop (IIPhDW), IEEE, 2018, pp. 117–122.

K. Dunphy, M. N. Fekri, K. Grolinger, і A. Sadhu, «Data Augmentation for Deep-Learning-Based Multiclass Structural Damage Detection Using Limited Information», Sensors, 2022. DOI: 10.3390/s22166193.

R. Pappagari, J. Villalba, P. Zelasko, L. Moro-Velazquez, і N. Dehak, «Copypaste: An augmentation method for speech emotion recognition», в ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, Institute of Electrical and Electronics Engineers Inc., 2021, pp. 6324–6328. DOI: 10.1109/ICASSP39728.2021.9415077.

N. F. Aminuddin, Z. Tukiran, A. Joret, R. Tomari, і M. Morsin, «An Improved Deep Learning Model of Chili Disease Recognition with Small Dataset», Intl. J. Adv. Comput. Sci. Appl., 2022. DOI: 10.14569/IJACSA.2022.0130750.

S. T. Aroyehun і A. Gelbukh, «Aggression detection in social media: Using deep neural networks, data augmentation, and pseudo labeling», в Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), 2018, pp. 90–97.

Shorten і T. M. Khoshgoftaar, «A survey on Image Data Augmentation for Deep Learning», J. Big Data, 2019. DOI: 10.1186/s40537-019-0197-0.

T. Dao, A. Gu, A. Ratner, V. Smith, C. De Sa, і C. Ré, «A kernel theory of modern data augmentation», в International Conference on Machine Learning, PMLR, 2019, pp. 1528–1537. DOI: 10.1109/ICPR48806.2021.9412492.

Shorten, T. M. Khoshgoftaar, і B. Furht, «Text Data Augmentation for Deep Learning», J Big Data, 2021. DOI: 10.1186/s40537-021-00492-0.

Published

2024-03-30

How to Cite

Paterega, I., & Melnykova, N. (2024). IMBALANCED DATA: A COMPARATIVE ANALYSIS OF CLASSIFICATION ENHANCEMENTS USING AUGMENTED DATA. European Science, 3(sge28-03), 54–72. https://doi.org/10.30890/2709-2313.2024-28-00-017