Commit 6637c71

Add MegaDiffg
1 parent b08c387 commit 6637c71

1 file changed: 13 additions, 0 deletions

@@ -0,0 +1,13 @@
+---
+layout: publication
+title: "Megadiff: A Dataset of 600k Java Source Code Changes Categorized by Diff Size"
+authors: Martin Monperrus, Matias Martinez, He Ye, Fernanda Madeiral, Thomas Durieux, Zhongxing Yu
+conference:
+year: 2021
+bibkey: monperrus2021megadiff
+additional_links:
+- {name: "ArXiV", url: "https://arxiv.org/abs/2108.04631"}
+- {name: "Dataset", url: "https://zenodo.org/record/5013515"}
+tags: ["dataset", "edit"]
+---
+This paper presents Megadiff, a dataset of source code diffs. It focuses on Java, with strict inclusion criteria based on commit message and diff size. Megadiff contains 663 029 Java diffs that can be used for research on commit comprehension, fault localization, automated program repair, and machine learning on code changes.
