MetaRisk: Semi-supervised few-shot operational risk classification in banking industry

doi:10.1016/j.ins.2020.11.027

Information Sciences

Volume 552, April 2021, Pages 1-16

https://doi.org/10.1016/j.ins.2020.11.027 Get rights and content

Abstract

We study the operational risk classification problem, a critical yet challenging problem in the banking industry. In practice, banks build supervised multi-label classification models to identify the pre-defined risks using financial news sources. However, the models are often suboptimal due to the lack of labeled data and diverse combinations of risk types. To address these practical issues, we re-frame multi-label supervised operational risk classification as a semi-supervised few-shot learning problem, named MetaRisk, which can then be effectively learned using the prototypical network. We also propose a weighted scheme to help obtain accurately prototype vectors of multi-risk classes. We evaluate the proposed approach MetaRisk using a real-world operational risk classification dataset, and the results demonstrate that it outperforms a set of standard baselines. Especially, MetaRisk is capable of predicting risk types that are new to the system. We expect our work provides a direct and relevant toolkit that may assist risk officers to predict and intervene risks in the banking industry.

Introduction

In the decade since the global financial crisis, banks and regulators have become increasingly alert to operational risks. However, the banks and regulators still struggle to deal with operational risk effectively [20], [15]. It is reported that major banks global wide have suffered nearly $210 billion in operational risk losses since 2011.¹ Operational risk, according to Basel Accord II,² is defined as the risks of loss due to errors, breaches, interruptions or damages caused by people, internal processes, systems or external events. In the banking industry, one of the daily jobs of risk officers is to screen potential risks with a large number of online news outlets and to assess any news events that may expose risks to the bank’s operations. Therefore, it is of keen interest of financial organizations to develop effective machine learning methods for operational risk classification.

While this task can be easily formulated as a standard document classification problem, there are at least two challenges in designing an effective operational risk prediction system. First, labeling financial news into different risk types requires substantial domain knowledge, and thus it is impossible to build a dataset using crowd-labeling services. Moreover, in practice, different banks are exposed to and are potentially vulnerable to different types of operational risks so no public dataset is available. As a result, there are only a few labeled news articles that are manually labeled by risk officers, while a large number of news articles remain unlabeled. Second, a small portion of financial news is related to multiple risk types which makes the problem essentially a multi-label classification task. For example, Internal Fraud (such as Bribery and Corruption) and Clients and Market Practice (such as Money Laundering) are two types of operational risks. Actually, bribery and corruption are intrinsically linked with money laundering. Not only do they tend to occur together, but also the presence of one tends to reinforce the other. Therefore, a news article that has an Internal Fraud label is likely to be labeled with Clients and Market Practice labels. However, those multi-label instances consist of a small portion of the entire dataset and thus some may not even appear in the training set. As a result, a standard multi-label discriminative classification model often performs suboptimally. It is desirable that the classifier can generalize well to those rare multi-label instances and alert risk officers such “black swan” events.

To tackle the aforementioned practice problems, we re-frame multi-label supervised operational risk classification as a semi-supervised few-shot learning problem. We do so for two reasons. First, semi-supervised learning [6] leverages unlabeled data to learn better data distribution which helps the discriminative model. Second, few-shot learning [36] is expected to learn generalizable classifier and thus may accommodate new multi-label classes that are not frequently seen in the training set. These two learning paradigms are largely independently studied in prior research, with most work addressing one or the other. Recently, a few studies [35], [28] propose semi-supervised few-shot learning framework for multi-class image recognition, while some researchers focus on few-shot text classifications [37], [14]. However, these methods are not applicable for our operational risk context as we face a multi-label classification task where instances are usually associated with more than one label [50].

In this work, we propose MetaRisk, a novel multi-label semi-supervised few-shot learning model for operational risk classification. Our method is built on the prototypical network [36] but improves the prototypes of risk combinations (multi-risk classes) by adjusting the weight of each risk type for each instance. Specifically, MetaRisk first utilizes a weighted scheme to learn the prior knowledge for risk class combinations from the relevant individual risk class. It then builds and refines a prototypical network to learn the single label and multi-label prototypes. We adopt attention mechanism [4] from neural network training to calculate the weighted prototype vector for multi-label risk type combinations. Furthermore, a soft-masking mechanism is introduced to refine the prototypes using unlabeled data, which allows our model to obtain decision boundaries for better fitting underlying risk distribution. We empirically evaluate MetaRisk on a proprietary dataset collected by an international banking organization. Experiment results show that MetaRisk outperforms a set of standard baselines. In particular, it is more effective than baselines on recognizing new risk type combinations task with a small number of known labeled and a large number of unlabeled instances.

Our main contributions can be summarized as two-folds.

•
First, to the best of our knowledge, we are the first to study the operational risk classification problem using semi-supervised meta-learning method. We identify two practical challenges associated with operational risk classification and frame the problem using semi-supervised few-shot learning framework with a weighted scheme. We further modify the framework so that it can be generalized to minority multi-label risk classes.
•
Second, we evaluate the framework on a real-world dataset and demonstrate its effectiveness. The system prototype has been used internally by the bank’s risk management team. We hope this work provides key insight into designing the practical semi-supervised meta-learning model for important financial applications.

The rest of the paper is arranged as follows. We first review related literature and position our work in that context in Section 2. The formal problem definition, as well as the necessary background with respect to operational risk classification and meta-learning techniques, are introduced in Section 3. The details of our MetaRisk model are presented in Section 4. Comprehensive experimental results demonstrating the superiority of our model are presented in Section 5. We conclude this work and point out the future directions in Section 6.

Section snippets

Related work

We now review the relevant literature from three basic perspectives and position our work in that context.

Preliminaries

In this section, we present the dataset and formally define the problem. We also describe the semi-supervised setting and meta-learning paradigm, as well as the necessary backgrounds of the operational risk classification problem. In Table 1, we summarize the frequently used notations in this paper.

Main methodologies

In this section, we present the few-shot risk prediction framework, MetaRisk. The overall architecture of our proposed MetaRisk is shown in Fig. 1. The high-level workflow is as follows.

We first construct the support sets and query sets using a modified episode (task) paradigm and turn the learning task into few-shot learning. All training financial articles are then encoded into an embedding space by using standard Bi-LSTM and self-attention mechanisms as our document encoding component. We

Experimental observations

In this section, we evaluate our proposed methods on a real-world dataset. We start by covering baselines, followed by results and discussions.

Conclusions and future work

Financial Technology (FinTech) is transforming the financial service industry by providing new services, controlling costs and supporting profitability. In the banking industry, using big data analytics and machine learning to identify potential operational risks has attracted executives and managers’ attention from a practical perspective. Due to the nature of the financial service industry, obtaining high-quality labeled data is usually costly. Moreover, it is desirable that the intelligent

CRediT authorship contribution statement

Fan Zhou: Conceptualization, Methodology, Data curation, Writing - original draft. Xiuxiu Qi: Software, Validation, Investigation. Chunjing Xiao: Conceptualization, Methodology, Resources. Jiahao Wang: Resources, Visualization, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by National Natural Science Foundation of China (Grant Nos. 62072077, 61602097 and 61402151).

References (50)

W.P. Amorim et al.
Multi-label semi-supervised classification through optimum-path forest
Information Sciences
(2018)
M.R. Boutell et al.
Learning multi-label scene classification
Pattern Recognition
(2004)
I. Gonzalez-Carrasco et al.
Automatic detection of relationships between banking operations using machine learning
Information Sciences
(2019)
T. Hosaka
Bankruptcy prediction using imaged financial ratios and convolutional neural networks
Expert Systems with Applications
(2019)
E.W. Ngai et al.
The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature
Decision Support Systems
(2011)
Y. Wang et al.
Leveraging deep learning with lda-based text analytics to detect automobile insurance fraud
Decision Support Systems
(2018)
M. Zhang et al.
ML-KNN: A lazy learning approach to multi-label learning
Pattern Recognition
(2007)
A. Adhikari, A. Ram, R. Tang, J. Lin, Rethinking complex neural network architectures for document classification, in:...
A. Ayyad, N. Navab, M. Elhoseiny, S. Albarqouni, Semi-supervised few-shot learning with local and global consistency,...
D. Bahdanau, K. Cho, Y. Bengio, Neural machine translation by jointly learning to align and translate, in:...

O. Chapelle et al.

Semi-supervised learning

IEEE Transactions on Neural Networks

(2009)

W.-Y. Chen, Y.-C. Liu, Z. Kira, Y.-C. F. Wang, J.-B. Huang, A closer look at few-shot classification, in: International...

J. Chung, C. Gulcehre, K. H. Cho, Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence...

N. F. F. [da Silva], L. F. Coletta, E. R. Hruschka, E. R. H. Jr.], Using unsupervised information to improve...

X. Ding, Y. Zhang, T. Liu, J. Duan, Deep learning for event-driven stock prediction, in: International Joint...

A. Elisseeff, J. Weston, A kernel method for multi-labelled classification, in: Advances in Neural Information...

L. Feng et al.

Collaboration based multi-label learning

C. Finn, K. Xu, S. Levine, Probabilistic model-agnostic meta-learning, in: Advances in Neural Information Processing...

R. Geng et al.

Induction networks for few-shot text classification

V. Gupta, R. Wadbude, N. Natarajan, H. Karnick, P. Jain, P. Rai, Distributional semantics meets multi label learning,...

S. Gururangan, T. Dang, D. Card, N.A. Smith, Variational pretraining for semi-supervised text classification, in:...

J. Han, U. Barman, J. Hayes, J. Du, E. Burgin, D. Wan, NextGen AML: Distributed deep learning based language...

S. Hochreiter et al.

Long short-term memory

Neural Computation

(1997)

D.G. Hoffman

Managing Operational Risk: 20 Firmwide Best Practice Strategies

(2002)

T. Hospedales, A. Antoniou, P. Micaelli, A. Storkey, Meta-learning in neural networks: a survey, arXiv preprint...

Cited by (11)

Semi-supervised imbalanced multi-label classification with label propagation
2024, Pattern Recognition
Multi-label learning tasks usually encounter the problem of the class-imbalance, where samples and their corresponding labels are non-uniformly distributed over multi-label data space. It has attracted increasing attention during the past decade, however, there is a lack of methods capable of handling the imbalanced problem in a semi-supervised setting. This study proposes a label propagation technique to settle the semi-supervised imbalanced multi-label issue. Specially, we first utilize a collaborative manner to exploit the correlations from labels and instances, and learn a label regularization matrix to overcome the imbalanced problem in the labeled instance. After that, we extend to semi-supervised learning and explore to represent the similarity of instances with weighted graphs on labeled and unlabeled data. Then, the data distribution information and label correlations are fully utilized to design the loss function under the consistency assumption manner. At last, we present an iterative scheme to settle the optimization issue, thereby achieving label propagation to address the imbalanced challenge. Experiments on a variety of multi-label data sets show the favorable performance of the proposed method against related comparing approaches. Notably, the proposed method is also validated to be robust with a limited number of training instances.
Ensembling Multi-View Discriminative Semantic Feature for Few-Shot Classification
2024, Engineering Applications of Artificial Intelligence
Few-Shot Classification (FSC) is an innovative application in machine learning. FSC consists of two main components: (1) Pre-training, where a feature extraction model (FEM) is trained using base data, and (2) Meta-testing, where the FEM is utilized to extract features from novel data (with categories different from the base data) for classification. Implementing FSC presents several challenges. For example, due to the cross-domain limitation (base data $\to$ novel data), the FEM may generate inappropriate features for new classes, leading to a Sample-Feature-Mismatch problem, and the corresponding feature is dubbed as Original-Shift-Feature (OSF). This paper proposes a generative method to construct Multi-View Discriminative Semantic Feature (MVDSF) to address the fundamental problem from the perspective of improving the discriminability of OSF. Typically, a linear projection is designed to transform OSF into a semantic space, generating Discriminative-Semantic-Features (DSF). By incorporating a reconstructive representation that is not solely reliant on the FEM, the influence of the Sample-Feature-Mismatch problem is reduced. Furthermore, considering that descriptions based on a single DSF tend to be one-sided, an attention mechanism is devised to fuse multi-view DSF, thereby improving the robustness of the proposed method. The effectiveness of MVDSF is evaluated on five benchmark few-shot learning datasets, where it achieves outstanding performance. This evaluation demonstrates the efficiency and performance of the proposed approach.
STID-Prompt: Prompt learning for sentiment-topic-importance detection in financial news
2024, Knowledge-Based Systems
With the development of the Internet and the financial industry, the analysis and judgement of financial news has become increasingly important. Common tasks in this area include sentiment analysis, topic classification, and importance judgement. Existing approaches use fine-tuning paradigms to address these tasks. However, the lack of labeled data in the field of financial news poses a small sample learning problem. The new paradigm represented by prompt learning provides a new way and means to improve the performance of small-sample classification. In addition, existing methods do not focus on research on sentiment, topic, and importance simultaneously, which is also an important factor in evaluating financial news. In practical applications, joint judgment of sentiment, topic, and importance is more effective. Finally, these methods require different tuning for each specific task. They do not consider relationships between individual tasks and cannot utilize information across tasks. This limits their application in complex situations. In this work, we propose a prompt-based financial news classification model (STID-Prompt) to address these issues. For the first two problems, we solve the sentiment analysis task, the topic classification task, the importance judgment task, and the 〈sentiment, topic, importance〉 classification task by designing complex prompt templates. For the third problem, we propose a unique prompt-based joint multi-task learning approach. It learns knowledge from multiple tasks and integrates it into the target task, and the multi-task learning approach further improves the performance of the model. Experimental results show the effectiveness of our approach even with less training data.
Coarse-to-fine few-shot classification with deep metric learning
2022, Information Sciences
Citation Excerpt :
This inspires researchers to develop various FSC models [16,29,12]. Essentially, they predict the labels for unseen samples using only a few labeled samples, and have found widespread applications, such as semantic segmentation [21], multi-label node classification [46], and operational risk classification [50]. In few-shot classification, the data set consists of training set, i.e., rich labeled samples in source domain, support set, i.e., very limited labeled samples in target domain, and query set, i.e., the unseen samples in target domain.
Few-shot classification predicts the labels of unseen samples using only a few labeled samples, and employs the samples of the classes disjoint with unseen classes for model training. It faces two primary challenges, i.e., handling sample pairs with different similarity degrees by single classifier, and learning discriminant patterns from very few labeled samples per class. To address them, this work presents a Coarse-to-Fine few-shot classification framework under the guidance of Metric-based Auxiliary learning, abbreviated as CFMA. In particular, it discriminates the image pairs with large difference by capturing global features, and models the similarity relation of the image pairs with small difference by mining the local region of interests. Moreover, CFMA adopts deep metric learning to improve the model adaptivity on the set of limited samples, and generates pseudo labels to dynamically guide the coarse learning in iteration. Empirical studies on several benchmark databases, including mini-ImageNet, tiered-ImageNet, and CUB, demonstrate that our method achieves more promising classification performance compared to many state-of-the-art alternatives.
A text analysis of operational risk loss descriptions
2023, Journal of Operational Risk
Twin prototype networks with noisy label self-correction for fault diagnosis of wind turbine gearboxes
2023, Measurement Science and Technology

View all citing articles on Scopus

View full text

MetaRisk: Semi-supervised few-shot operational risk classification in banking industry

Abstract

Introduction

Section snippets

Related work

Preliminaries

Main methodologies

Experimental observations

Conclusions and future work

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Information Sciences

Pattern Recognition

Information Sciences

Expert Systems with Applications

Decision Support Systems

Decision Support Systems

Pattern Recognition

Semi-supervised learning

IEEE Transactions on Neural Networks

Collaboration based multi-label learning

Induction networks for few-shot text classification

Long short-term memory

Neural Computation

Managing Operational Risk: 20 Firmwide Best Practice Strategies