Skip to content

Commit 42c9581

Browse files
isHuangXinmallamanis
authored andcommitted
create wan2020NaturalCC
1 parent e115346 commit 42c9581

File tree

1 file changed

+14
-0
lines changed

1 file changed

+14
-0
lines changed

_publications/wan2020NaturalCC

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
---
2+
layout: publication
3+
title: NaturalCC: A Toolkit to Naturalize the Source Code Corpus
4+
authors: Yao Wan, Yang He, Jian-Guo Zhang, Yulei Sui, Hai Jin, Guandong Xu, Caiming Xiong, Philip S. Yu
5+
conference: arXiv - CS - Software Engineering
6+
year: 2020
7+
bibkey: wan2020NaturalCC
8+
additional_links:
9+
- {name: "ArXiV", url: "https://arxiv.org/abs/2012.03225"}
10+
- {name: "website", url: "https://xcodemind.github.io"}
11+
- {name: "code", url: ""}
12+
tags: ["generation", "summarization", "documentation"]
13+
---
14+
We present NATURALCC, an efficient and extensible toolkit to bridge the gap between natural language and programming language, and facilitate the research on big code analysis. Using NATURALCC, researchers both from natural language or programming language communities can quickly and easily reproduce the state-of-the-art baselines and implement their approach. NATURALCC is built upon Fairseq and PyTorch, providing (1) an efficient computation with multi-GPU and mixed- precision data processing for fast model training, (2) a modular and extensible framework that makes it easy to reproduce or implement an approach for big code analysis, and (3) a command line interface and a graphical user interface to demonstrate each model’s performance. Currently, we have included several state- of-the-art baselines across different tasks (e.g., code completion, code comment generation, and code retrieval) for demonstration. The video of this demo is available at https://www.youtube.com/watch?v=q4W5VSI-u3E&t=25s.

0 commit comments

Comments
 (0)