Creating a Financial Data Lake for Academic Fintech Research

Broby, Daniel and Hopper, Huckleberry (2019) Creating a Financial Data Lake for Academic Fintech Research. Other. University of Strathclyde, Glasgow.

[thumbnail of Broby-Hopper-CFRI-2019-Creating-a-Financial-Data-Lake-for-Academic]
Preview
Text. Filename: Broby_Hopper_CFRI_2019_Creating_a_Financial_Data_Lake_for_Academic.pdf
Final Published Version

Download (309kB)| Preview

Abstract

This paper presents the case for a Financial Technology (Fintech) data lake. Fintech is impacting business models and its concepts require testing. The definition of Fintech is imprecise, but it is characterized by the use of technology as applied to digital financial transformation. The software and programming driving it is evolving and should be evaluated before being introduced into financial markets. Its development impacts "client money" and this can be risky unless supervised. Fortunately, such experimentation can be done in a controlled way using a regulatory sandbox. This allows Fintech concepts to be checked for reliability and robustness, using consenting live accounts (which receive a special regulatory exception). We propose a less risky supplementary approach, namely the testing of concepts on real but “blinded” financial big data files stored in a data lake. In this way, back-testing, out of sample experiments and forward performance checks can be done without the risk of losing money. We investigate how to implement such a data lake in order to do this.