Advanced Machine Learning Days (AMLD) 2025 at EPFL

AI in the financial sector: opportunities and challenges

Conference
Artificial Intelligence
LLM
Synthetic Data
podcast
Author

Charles-Albert Lehalle

Published

February 16, 2025

A Quick Overview Of ML in Finance

With Petter Kolm and Nino Antulov-Fantulin, we co-organized a track titled “Advances of ML Approaches for Finance: Alternative Data, Decision Making & Time Series Analysis” during the AMLD 2025.

Nino, Petter and me

We had different speakers, with the ambition to cover a lot of aspects of the current use of ML in Financial Markets:

1. On Risk Intermediation

  • RL for Stochastic LQR with Provable Convergence, Robustness and Stability: Petter has shown how giving some structure to a Reinforcement Learning algo could be so useful:
  • Model Calibration to Market-Implied Risk Measures - Gabriele VISENTIN

2. About LLM

  • LLMs and Biases in Finance - Nino insisted on the point-in-timeness in backtesting, and the way it could be complex for LLM to achieve this; I have myself a paper on this topic with Mengda LI: “Mathematics of Embeddings: Spillover of Polarities over Financial Texts” in Reviews In Modern Quantitative Finance (2024): 151-188.
  • Financial Workflows: Putting Large Language Models into Action - Peter HAFEZ from Ravenpack has explained how they built BigData.com
  • Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow - Tian GUO interestingly shows that very Large Language Models do not perform better than previous generations for financial prediction.

3. For timeseries modelling

We talked about

4. And then we focussed on Inflation

with

  • Nowcasting and Forecasting in Macroeconomics, Alexander DENEV from Turnleaf Analytics
  • Bringing ML into Production for Inflation Forecasting, by Franziska ECKERT from the Zurich National Bank

Personally, I gave an overview on Using Alternative Data for Financial Decisions and especially about the unique way alternative data are biased for financial applications.

We have been very lucky this track has been supported by Ravenpack and Turnleaf Analytics.

Main Take-Aways

It was the occasion to look at the advances in ML for Financial Markets since with published our book Machine Learning and Data Sciences for Financial Markets: A Guide to Contemporary Practices with Agostino Capponi.

So many attendees!

A Starting point

Remember that we split ML applications for finance in three main Parts of the book

  1. A.I. for users and products; you put recommendation engines, robot-advisors, chatbots, goal-based investment and the on-the-fly building of products (ETN) there.
  2. Improvements of risk intermediation. It is where Deep Hedging, Fast Pricing, better portfolio construction, Reinforcement Learning for trading, etc. takes place.
  3. Better connection with the real economy: this means using alternative data to improve financial decisions. Nowcasting and prediction are there.

In the book, we put Ethics of Algorithms and biases in the third part.

Apparently this structure (that indeed we did not invent, it came from a workshop the Institut Louis Bachelier organised a the Collège de France in January 2019 to launch the FaIR initiative) continues to be topical.

Progressed and directions

Nevertheless our knowledge is progressing in these directions. I would say that

  1. Large Language Models have given an impulsion in the realm of AI for users of products. The help to interact with users because they do not have to be familiar with the internal language of markets to use products, somehow they should allow a better access to the financial services, increasing financial inclusion. But it is only the beginning and a lot of caveats have to be fixed: biases, hallucinations, and real secondary innovations towards financial education.
  2. Reinforcement Learning works well either when you can implement it an offline way (in fornt of Monte-Carlo simulations), either on high frequency data if you mix it with expert knowledge to structure the state space, having in mind to get stationarity (or at least ergodicity, you can have a look at my paper with Othman Mounjid Improving reinforcement learning algorithms: Towards optimal learning rate policies).
  3. It is only the start of alternative data, and if you want to focus on two ways they can be useful, I would bet on multi-scale modelling of the supply chain and the monitoring of narratives.

At the crossing of these three directions, I will insist on trying to answer to this question: what is a company? via data and models. It is important since

  1. Investment diversification and the way retail investors approach investment is very naturally towards the companies and projects they are investing on;
  2. The risk that is intermediated is always a portfolio of claims on projects that are pursued by companies;
  3. Alternative data are describing companies, their clients, their suppliers and their competitors.

press to play or pause

The PodCast

I asked Google’s NotebookLM to produce a podcast from our slides, here is the result in two episodes (20 minutes + 5 minutes, click on the left or right part of the picture):

Back to top