THE ECONOMIC COST OF FINANCIAL FAKE NEWS IN EUROPEAN CAPITAL MARKETS
DOI:
https://doi.org/10.35774/jee2026.01.004Keywords:
Big Data, economic cost, European capital markets, financial decision-making, financial misinformation, market manipulation, Natural Language Processing.Abstract
Financial news misinformation is a systemic risk that distorts price discovery and capital allocation by widening information asymmetry and weakening investor trust. This study develops a scalable NLP and machine-learning framework to detect deceptive financial narratives in large digital corpora through Big Data. After text normalization, lemmatization, and stopword elimination, the framework contrasts TF-IDF with Word2Vec embeddings and trains Logistic Regression, Random Forest, and Gradient Boosting classifiers. Performance is assessed with Accuracy, Precision, Recall, F1-score, and ROC-AUC. Across models, TF-IDF provides stronger discrimination than Word2Vec; the TF-IDF Random Forest reaches near-perfect results (ROC-AUC 0.9999; Precision 0.9977). The emphasis on transparent, feature-based models supports auditability (for example, via feature importance) and helps limit harmful false positives that could suppress legitimate signals. The results indicate that high-precision, interpretable pipelines can reduce the verification gap in fast-moving information environments, mitigate macroeconomic costs of deceptive narratives, and inform DSA- and ESMAaligned market surveillance workflows. The framework is designed for deployment on streaming news feeds and large-scale platform archives.
JEL: G14, D82, C55.
References
Ahmad, I., Yousaf, M., Yousaf, S., & Ahmad, M. O. (2020). Fake news detection using machine learning ensemble methods. Complexity, Article 8885861. https://doi.org/10.1155/2020/8885861
Alghamdi, J., Luo, S., & Lin, Y. (2024). A comprehensive survey on machine learning approaches for fake news detection. Multimedia Tools and Applications, 83, 51009–51067. https://doi.org/10.1007/s11042-023-17470-8
Alshuwaier, F. A., & Alsulaiman, F. A. (2025). Fake news detection using machine learning and deep learning algorithms: A comprehensive review and future perspectives. Computers, 14(9), Article 394. https://doi.org/10.3390/computers14090394
Cheng, L.-C., Lu, W.-T. & Yeo, B. (2023). Predicting abnormal trading behavior from internet rumor propagation: A machine learning approach. Financial Innovation, 9, Article 3. https://doi.org/10.1186/s40854-022-00423-9
Du, K., Xing, F., Mao, R., & Cambria, E. (2024). Financial sentiment analysis: Techniques and applications. ACM Computing Surveys, 56(9), 1–42. https://doi.org/10.1145/3649451
D’Ulizia, A., Caschera, M. C., Ferri, F., & Grifoni, P. (2021). Fake news detection: A survey of evaluation datasets. PeerJ Computer Science 7, Article e518. https://doi.org/10.7717/peerj-cs.518
European Parliament & Council of the European Union. (2014). Regulation (EU) No. 596/2014 of the European Parliament and of the Council of 16 April 2014 on market abuse (market abuse regulation) and repealing Directive 2003/6/EC of the European Parliament and of the Council and Commission Directives 2003/124/EC, 2003/125/EC and 2004/72/EC and Commission Decision 2004/693/EC. Official Journal of the European Union, L 173, 1–142. https://eur-lex.europa.eu/eli/reg/2014/596/oj
European Parliament & Council of the European Union. (2022). Regulation (EU) 2022/2065 of the European Parliament and of the Council of 19 October 2022 on a Single Market for Digital Services and amending Directive 2000/31/EC (Digital Services Act). Official Journal of the European Union,L 277, 1–102. https://eur-lex.europa.eu/eli/reg/2022/2065/oj
Hu, L., Wei, S., Zhao, Z., & Wu, B. (2022). Deep learning for fake news detection: A comprehensive survey. AI Open, 3, 133–155. https://doi.org/10.1016/j.aiopen.2022.09.001
kaggle.com. (n.d.). Fake and real news dataset [Data set]. Retrieved July 17, 2025, from https://www.kaggle.com/datasets/clmentbisaillon/fake-and-realnews-dataset
Larcker, D. F., & Zakolyukina, A. A. (2012). Detecting deceptive discussions in conference calls. Journal of Accounting Research, 50(2), 495–540. https://doi.org/10.1111/j.1475-679X.2012.00450.x
Loughran, T., & McDonald, B. (2011). When is a liability not a liability? Textual analysis, dictionaries, and 10‐Ks. The Journal of Finance, 66(1), 35–65. https://doi.org/10.1111/j.1540-6261.2010.01625.x
Lyzun, M., Desyatnyuk, O., Savelyev, Y., Kuryliak, V., Sachenko, S., Lishchynskyy, I. (2023). Architectonics of the European Currency Integration: Cluster and Gravity Modeling. 2023 IEEE 12th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS), Dortmund, Germany, 2023, pp. 661–664. https://doi.org/10.1109/IDAACS58523.2023.10348652
Lyzun, M., Lishchynskyy, I., Savelyev, Y., Kuryliak, V. and Kurylyak, Y. (2019). Modeling Evaluation of Dollarization Economic Efficiency. International Conference on Advanced Computer Information Technologies (ACIT). Ceske Budejovice, Czech Republic: 366–370.
Majumdar, A., & Bose, I. (2018). Detection of financial rumors using big data analytics: The case of the Bombay Stock Exchange. Journal of Organizational Computing and Electronic Commerce, 28(2), 79–97. https://doi.org/10.1080/10919392.2018.1444337
Mishra, S., Shukla, P., & Agarwal, R. (2022). Analyzing machine learning enabled fake news detection techniques for diversified datasets. Wireless Communications and Mobile Computing, Article 1575365. https://doi.org/10.1155/2022/1575365
Nghiem, H., Muric, G., Morstatter, F., & Ferrara, E. (2021, November). Detecting cryptocurrency pump-and-dump frauds using market and social signals. Expert Systems with Applications, 182, Article 115284. https://doi.org/10.1016/j.eswa.2021.115284
Rangapur, A., Wang, H., Jian, L., & Shu, K. (2025, April 28-May 2). Fin-Fact: A benchmark dataset for multimodal financial fact-checking and explanation generation. In WWW Companion’25: Companion Proceedings of the ACM Web Conference 2025 (pp. 785–788). ACM. https://doi.org/10.1145/3701716.3715292
Selyukh, A. (2013, April 24). Hackers send fake market-moving AP tweet on White House explosions. Reuters. https://www.reuters.com/article/technology/hackers-send-fake-market-moving-ap-tweet-on-white-house-explosionsidUSBRE93M12Y/
Theodorakopoulos, L., Theodoropoulou, A., Tsimakis, A., & Halkiopoulos, C. (2025). Big Data-driven distributed machine learning for scalable credit card fraud detection using PySpark, XGBoost, and CatBoost. Electronics, 14(9), Article 1754. https://doi.org/10.3390/electronics14091754
Received: November 7, 2025.
Reviewed: March 10, 2026.
Accepted: March 20, 2026.
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).



