Open Journal Systems

Model Prediktif Indeks Kebahagiaan Berbasis Gradient Boosting Regressor dengan Optimalisasi Seleksi Fitur dan Implementasi Web

       Dani Ferdinan, Nisa Hanum Harani, Syafrial Fachri Pane

Abstract


Penelitian ini menghadapi tantangan dalam memodelkan Indeks Kebahagiaan 2021 dari Badan Pusat Statistik (BPS) yang memiliki dimensi fitur sangat tinggi dan potensi redundansi, yang dapat menurunkan akurasi dan interpretabilitas model. Tujuan utama penelitian ini adalah untuk mengidentifikasi fitur-fitur paling berpengaruh dalam data tersebut untuk meningkatkan akurasi, efisiensi komputasi, dan transparansi model prediksi berbasis pohon keputusan. Metodologi mencakup pra-pemrosesan data dengan imputasi modus, transformasi Yeo-Johnson, dan Robust Scaler. Tiga algoritma regresi diuji: Decision Tree, Random Forest, dan Gradient Boosting Regressor, yang dioptimalkan menggunakan Particle Swarm Optimization (PSO). Model terbaik dievaluasi menggunakan metrik R², MSE, RMSE, dan MAE serta dianalisis lebih lanjut menggunakan SHAP untuk interpretasi. Hasil menunjukkan bahwa Gradient Boosting Regressor adalah model paling unggul dengan nilai R² sebesar 0,696 saat menggunakan 20 fitur terseleksi. Selain itu, sebagai bentuk implementasi praktis, model diimplementasikan ke dalam sebuah aplikasi web interaktif berbasis Flask yang memungkinkan pengguna memasukkan data melalui antarmuka kuisioner dan menerima prediksi indeks kebahagiaan secara real-time. Integrasi ini menjembatani hasil riset dengan pemanfaatan nyata oleh pengguna akhir.


  http://dx.doi.org/10.31544/jtera.v10.i2.2025.59-68

Keywords


Indeks Kebahagiaan; Machine Learning; Feature Selection; Gradient Boosting; Aplikasi Web

Full Text:

  PDF

References


A. Jannani, N. Sael, and F. Benabbou, “Machine learning for the analysis of quality of life using the World Happiness Index and Human Development Indicators,” Mathematical Modeling and Computing, vol. 10, no. 2, pp. 534–546, 2023, doi: 10.23939/mmc2023.02.534.

N. Zhang et al., “Prediction of adolescent subjective well-being: A machine learning approach,” Gen Psychiatr, vol. 32, no. 5, Sep. 2019, doi: 10.1136/gpsych-2019-100096.

M. A. Rohmaniar, R. Habibi, and S. F. Pane, “Pengaruh Metode Seleksi Fitur terhadap Akurasi Model SVM dalam Klasifikasi Customer Churn pada Perusahaan Telekomunikasi,” (IJAI) Indonesian Journal of Applied Informatics, vol. 09, no. 01, pp. 94–103, 2024.

U. Suchaini, W. P. S. Nugraha, Dwipayana I Kadek Dede, and S. A. Lestari, Indeks Kebahagiaan 2021. Badan Pusat Statistik RI, 2021.

J. J. Palamar and A. Le, “Underreporting of drug use on a survey of electronic dance music party attendees,” Addiction Research and Theory, vol. 28, no. 4, pp. 321–327, Jul. 2020, doi: 10.1080/16066359.2019.1653860.

F. Abdullah, S. Fachri Pane, and R. Habibi, “Deteksi Emosi Pada Teks Berbahasa Indonesia Menggunakan Pendekatan Ensemble,” Jurnal Teknologi Terapan) |, vol. 10, no. 2, 2024.

N. H. Harani and C. Prianto, “Sentiment Analysis of Student Emotion During Online Learning Using Recurrent Neural Networks (RNN),” International Journal of Information System & Technology Akreditasi, vol. 5, no. 3, pp. 299–307, 2021.

N. Kalpourtzi, J. R. Carpenter, and G. Touloumi, “Handling Missing Values in Surveys With Complex Study Design: A Simulation Study,” J Surv Stat Methodol, vol. 12, no. 1, pp. 105–129, Feb. 2024, doi: 10.1093/jssam/smac039.

D. K. Lee, “Data transformation: A focus on the interpretation,” Korean J Anesthesiol, vol. 73, no. 6, pp. 503–508, Dec. 2020, doi: 10.4097/kja.20137.

J. Raymaekers and P. J. Rousseeuw, “Transforming variables to central normality,” Mach Learn, vol. 113, no. 8, pp. 4953–4975, Aug. 2024, doi: 10.1007/s10994-021-05960-5.

L. T. Quang, B. H. Baek, W. Yoon, S. K. Kim, and I. Park, “Comparison of Normalization Techniques for Radiomics Features From Magnetic Resonance Imaging in Predicting Histologic Grade of Meningiomas,” Investig Magn Reson Imaging, vol. 28, no. 2, pp. 61–67, Jun. 2024, doi: 10.13104/imri.2024.0010.

N. H. Harani and C. Prianto, “Penerapan algoritma Adaboost guna menentukan pola masuknya calon mahasiswa,” TRANSFORMTIKA, vol. 18, no. 1, pp. 123–132, 2020.

F. Özen, “Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey,” Heliyon, vol. 10, no. 4, Feb. 2024, doi: 10.1016/j.heliyon.2024.e25746.

U. Singh, M. Rizwan, M. Alaraj, and I. Alsaidan, “A machine learning-based gradient boosting regression approach for wind power production forecasting: A step towards smart grid environments,” Energies (Basel), vol. 14, no. 16, Aug. 2021, doi: 10.3390/en14165196.

B. Ramadhan and S. F. Pane, “Pengaruh Hyperparameter Tuning untuk Efektivitas pada Pendekatan Hybrid dalam Mendiagnosis Stres dan Depresi : Tinjauan Studi Literatur,” Jurnal Tekno Insentif, vol. 18, no. 2, pp. 104–118, Dec. 2024, doi: 10.36787/jti.v18i2.1516.

B. Zuhri and N. H. Harani, “Studi Literatur: Optimasi Algoritma Machine Learning Untuk Prediksi Penerimaan Mahasiswa Pascasarjana,” International Journal of Information System & Technology, vol. 03, no. 05, pp. 299–307, 2019, [Online]. Available: https://ejurnalunsam.id/index.php/jicom/

D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” PeerJ Comput Sci, vol. 7, pp. 1–24, 2021, doi: 10.7717/PEERJ-CS.623.

S. Rezaei Melal, M. Aminian, and S. M. Shekarian, “A machine learning method based on stacking heterogeneous ensemble learning for prediction of indoor humidity of greenhouse,” J Agric Food Res, vol. 16, Jun. 2024, doi: 10.1016/j.jafr.2024.101107.

A. Ahmad et al., “Prediction of compressive strength of fly ash based concrete using individual and ensemble algorithm,” Materials, vol. 14, no. 4, pp. 1–21, Feb. 2021, doi: 10.3390/ma14040794.

P. Panicheva, L. Mararitsa, S. Sorokin, O. Koltsova, and P. Rosso, “Predicting subjective well-being in a high-risk sample of Russian mental health app users,” EPJ Data Sci, vol. 11, no. 1, Dec. 2022, doi: 10.1140/epjds/s13688-022-00333-x.

A. Baba and K. Bunji, “Prediction of Mental Health Problem Using Annual Student Health Survey: Machine Learning Approach,” JMIR Ment Health, vol. 10, 2023, doi: 10.2196/42420.

L. Zhang, “Subjective Well-Being Prediction Using Data Mining Techniques: Evidence from Chinese General Social Survey,” Applied and Computational Mathematics, vol. 7, no. 4, p. 197, 2018, doi: 10.11648/j.acm.20180704.13.

E. Oparina et al., “Machine learning in the prediction of human wellbeing,” Sci Rep, vol. 15, no. 1, Dec. 2025, doi: 10.1038/s41598-024-84137-1.




DOI: http://dx.doi.org/10.31544/jtera.v10.i2.2025.59-68
Abstract 67 View    PDF viewed = 29 View

Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 JTERA (Jurnal Teknologi Rekayasa)

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Copyright @2016-2025 JTERA (Jurnal Teknologi Rekayasa) p-ISSN 2548-737X e-ISSN 2548-8678.

  Lisensi Creative Commons

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

JTERA Editorial Office:
Politeknik Sukabumi
Jl. Babakan Sirna 25, Sukabumi 43132, West Java, Indonesia
Phone/Fax: +62 266215417
Whatsapp: +62 81809214709
Website: https://jtera.polteksmi.ac.id
E-mail: [email protected]