Abstract
Technical analysis aims to predict stock returns based on price and volume patterns and has seen growing adoption of machine learning methods. However, most approaches rely on hand-crafted features. This paper investigates whether deep learning applied to stock chart images can predict future returns without manual feature engineering. Building on Jiang et al. (2023), this study applies Convolutional Neural Networks (CNNs), a computer vision architecture, to predict stock returns from price charts and extends their approach by implementing Vision Transformers, specifically the Class-Attention in Image Transformer (CaiT). Stock prices, volumes, and moving averages are encoded into images, which are used to train models that classify future stock returns as either positive or negative. Results show that both CNN and CaiT models outperform traditional technical indicators such as momentum and reversal strategies when applied to US stocks. Moreover, combining the two models yields incremental predictive power. An investment strategy based on their joint predictions achieves higher returns and Sharpe ratios than either model alone.
Keywords: technical analysis; stock return prediction; deep learning; convolutional neural networks; vision transformers

Dieses Werk steht unter der Lizenz Creative Commons Namensnennung 4.0 International.
Copyright (c) 2026 Ege Özkul
