Issue: Wuhan Univ. J. Nat. Sci., Volume 28, Number 6, December 2023
Page(s): 474-482
DOI: https://doi.org/10.1051/wujns/2023286474
Published online: 15 January 2024
Computer Science
CLC number: TP311
Improve Code Summarization via Prompt-Tuning CodeT5
LI Huanzhen
College of Information Engineering, Jiangxi University of Technology, Nanchang 330022, Jiangxi, China
Received: 15 January 2023
Abstract: Code comments are crucial in software engineering, aiding program maintenance and code reuse. The process of generating clear, descriptive comments that outline a piece of code's functionality is called code summarization. Existing code summarization methods are typically trained with transformer-based models; however, these models often have limited parameters and lack dedicated pre-training tasks, hindering their ability to capture code semantics effectively. This paper applies a high-capacity pre-trained model, CodeT5, to code summarization. CodeT5 adopts an encoder-decoder architecture that is well suited to code summarization tasks. Furthermore, we adopt a novel paradigm, "pre-train, prompt, predict", to unlock the knowledge embedded within CodeT5: we devise a prompt template that converts input code into code prompts and fine-tune CodeT5 with these prompts, a process we term prompt tuning. Our effectiveness experiments demonstrate that prompt-tuning CodeT5 with only 40% of the dataset achieves performance comparable to fine-tuning CodeT5 with 100% of the dataset, which makes our approach applicable in few-shot learning scenarios; moreover, our prompt learning method is not sensitive to the size of the tuning dataset. Our practicality experiments show that the performance of prompt-tuned CodeT5 far surpasses that of transformer-based models trained on code-comment datasets collected from Stack Overflow.
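To make the "pre-train, prompt, predict" paradigm concrete, the following sketch shows how input code can be wrapped in a prompt template and passed to a publicly released CodeT5 checkpoint through the Hugging Face transformers library. The template wording and the Salesforce/codet5-base checkpoint are illustrative assumptions; the paper's own template and tuning setup are not reproduced here.

```python
# Minimal sketch of prompt-based code summarization with CodeT5.
# Assumptions: the prompt template below and the "Salesforce/codet5-base"
# checkpoint are illustrative, not the exact setup used in the paper.
from transformers import RobertaTokenizer, T5ForConditionalGeneration

tokenizer = RobertaTokenizer.from_pretrained("Salesforce/codet5-base")
model = T5ForConditionalGeneration.from_pretrained("Salesforce/codet5-base")

def summarize(code: str) -> str:
    # Convert raw source code into a "code prompt" via a hand-written template.
    prompt = f"Summarize Python: {code}"
    inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=512)
    # Generate a natural-language comment (greedy decoding for simplicity).
    output_ids = model.generate(inputs.input_ids, max_length=32)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

print(summarize("def add(a, b):\n    return a + b"))
```

In prompt tuning proper, the model would then be fine-tuned on pairs of prompted code and reference comments, so that the template steers the pre-trained knowledge toward summarization; the sketch above shows only the prompting and inference path.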
Key words: code summarization / transformer-based model / prompt learning / CodeT5 / few-shot learning
Biography: LI Huanzhen, female, Lecturer, research direction: software engineering. E-mail: 313925356@qq.com
© Wuhan University 2023
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.