Proposal for a method to predict growth of classification performance in response to increasing the amount of data for fine-tuning of Pre-Trained language models

Toshiki Kuramoto

3:50 PM - 4:10 PM

[3E5-GS-2-02] Proposal for a method to predict growth of classification performance in response to increasing the amount of data for fine-tuning of Pre-Trained language models

〇Toshiki Kuramoto¹, Jun Suzuki² (1. Bridgestone Corporation, 2. Tohoku University)

Keywords: Machine Learning, Natural Language Processing

Recently, pre-trained models built on large corpora have been developed and released, and there are growing opportunities to use them for business purposes, such as analyzing linguistic data like product reviews, by fine-tuning them with training data specific to the problem to be solved. In business settings, however, available datasets are not always plentiful because of various constraints, and it is not easy to determine how much data is enough to reach a target performance. This paper proposes a method for estimating the amount of data required to reach a target performance. The method predicts how classification performance grows as the amount of fine-tuning data increases, based on the classification performance of models fine-tuned on an initially obtained set of one hundred to one thousand examples. Specifically, we show that when a pre-trained model is fine-tuned, classification performance increases with a similar trend as the number of epochs is increased, regardless of the original dataset size. We then verify that an approximate formula based on this tendency can be used to estimate the classification performance obtained when the model is trained with 10 times or more training data, even when the initial fine-tuning data is limited.
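The abstract does not state the approximate formula itself, so the following is only a minimal sketch of the general idea of extrapolating a learning curve from a small initial dataset. It assumes, purely for illustration, that the classification error follows a power law in the number of training examples, error = a * n^(-b); this functional form, the function names, and the synthetic data points are all hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_power_law(sizes, accuracies):
    """Fit error = a * n**(-b) by least squares in log-log space.

    ASSUMPTION: the power-law form is an illustrative choice,
    not the approximate formula used in the paper.
    """
    log_n = np.log(np.asarray(sizes, dtype=float))
    log_err = np.log(1.0 - np.asarray(accuracies, dtype=float))
    # log(error) = log(a) - b * log(n), i.e. a straight line in log-log space
    slope, intercept = np.polyfit(log_n, log_err, 1)
    return float(np.exp(intercept)), float(-slope)  # (a, b)


def predict_accuracy(n, a, b):
    """Extrapolate accuracy at training-set size n from the fitted curve."""
    return 1.0 - a * np.asarray(n, dtype=float) ** (-b)


def required_size(target_accuracy, a, b):
    """Invert the fitted curve: smallest n whose predicted accuracy hits the target."""
    return (a / (1.0 - target_accuracy)) ** (1.0 / b)


# Synthetic, noise-free measurements at 100 to 1000 examples (hypothetical values,
# generated from the assumed curve itself so the fit is exact).
sizes = np.array([100.0, 200.0, 400.0, 800.0, 1000.0])
accuracies = 1.0 - 2.0 * sizes ** (-0.5)

a, b = fit_power_law(sizes, accuracies)
acc_at_10x = float(predict_accuracy(10_000, a, b))  # 10 times the largest initial size
n_needed = float(required_size(0.98, a, b))         # data needed for 98% accuracy
print(a, b, acc_at_10x, n_needed)
```

With real measurements the points would be noisy, so the extrapolation should be treated as an estimate whose quality depends on how well the assumed curve matches the observed trend.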


Presentation information

[3E5-GS-2] Machine learning

