The Effect of Data Types' on the Performance of Machine Learning Algorithms for Cryptocurrency Prediction

Tanrikulu, Hulusi Mehmet; Pabuccu, Hakan

The Effect of Data Types' on the Performance of Machine Learning Algorithms for Cryptocurrency Prediction

dc.authorid	0000-0003-2267-5175
dc.authorid	0000-0002-5994-2874
dc.contributor.author	Tanrikulu, Hulusi Mehmet
dc.contributor.author	Pabuccu, Hakan
dc.date.accessioned	2026-02-28T12:17:41Z
dc.date.available	2026-02-28T12:17:41Z
dc.date.issued	2025
dc.department	Bayburt Üniversitesi
dc.description.abstract	Forecasting cryptocurrencies as a financial issue is crucial as it provides investors with possible financial benefits. A slight improvement in forecasting performance can lead to increased profitability; Therefore, obtaining a realistic forecast is very important for investors. Bitcoin, frequently mentioned in recent due to its volatility and chaotic behavior, has become an investment tool, especially during and after the COVID-19 pandemic. In this study, selected ML techniques were investigated for predicting cryptocurrency movements by using technical indicator-based data sets and measuring the applicability of the techniques to cryptocurrencies that do not have sufficient historical data. In order to measure the effect of data size, Bitcoin's last 1 year and 7 years of data were used. Following the related literature, Google trends and the number of tweets were used as input features, in addition to the most commonly used twelve technical indicators. Random Forest, K-Nearest Neighbors, Extreme Gradient Boosting (XGBoost-XGB), Support Vector Machine (SVM), Naive Bayes (NB), Artificial Neural Networks (ANN), and Long-Short-Term Memory (LSTM) network were optimized for best results. Accuracy, F1, and area under the ROC curve values were used to compare the model performance. For continuous data, ANN and SVM performed the best with the highest accuracy and outperformed the other ML models for complete and reduced sets. LSTM reached the best accuracy for trend data, but SVM, NB, and XGB models showed similar performance. The research shows that some indicators significantly affect prediction performance, and the data discretization process also improved the model's accuracy. While the number of samples affects the results of many ML models, correctly optimized and fine-tuned models may also give excellent results even with less data.
dc.description.sponsorship	Scientific and Technological Research Council of Turkiye (TUBITAK)
dc.description.sponsorship	Open access funding provided by the Scientific and Technological Research Council of Turkiye (TUBITAK). We declare that we have no relevant or material financial interests that relate to the research described in this paper. The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.
dc.identifier.doi	10.1007/s10614-025-10919-y
dc.identifier.issn	0927-7099
dc.identifier.issn	1572-9974
dc.identifier.scopus	2-s2.0-105000615246
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.1007/s10614-025-10919-y
dc.identifier.uri	https://hdl.handle.net/20.500.12403/5925
dc.identifier.wos	WOS:001449431000001
dc.identifier.wosquality	Q2
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Springer
dc.relation.ispartof	Computational Economics
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/openAccess
dc.snmz	KA_WoS_20260218
dc.subject	Financial prediction
dc.subject	Machine learning
dc.subject	Bitcoin
dc.subject	Continuous data
dc.subject	Trend data
dc.title	The Effect of Data Types' on the Performance of Machine Learning Algorithms for Cryptocurrency Prediction
dc.type	Article

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

The Effect of Data Types' on the Performance of Machine Learning Algorithms for Cryptocurrency Prediction

Dosyalar

Koleksiyon