Multi-Lingual Program Repair Benchmark Set Based on the Quixey Challenge (Java, Python)

410 files in total

java: 182

py: 144

class: 44

Uploaded 2023-04-28 13:53:00, 581 KB ZIP

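The Quixey Challenge was a programming contest in which participants had to fix a single-line bug in an implementation of a classic algorithm. This benchmark set, QuixBugs, provides the challenge programs in both Java and Python, giving researchers and developers a common body of repair tasks for testing and improving automatic program repair techniques. `QuixBugs-master` is the main folder in the archive; it holds the defective programs, the test cases, and the corrected reference programs.

Each repair task in the benchmark consists of the following parts:

1. **Defective code**: the program to be repaired, which contains a one-line defect. Analyzing this code is the starting point for locating the fault and designing a repair strategy.
2. **Test cases**: input/output pairs used to check whether a program behaves correctly. A repaired program must pass all of its test cases to count as a successful fix.
3. **Defect class**: each defect falls into one of 14 defect classes, which helps characterize the nature of the bug.
4. **Reference fix**: corrected Python programs are supplied for reference, so a candidate patch can be compared against a known-good version.
5. **Evaluation metrics**: repair tools are usually compared on measures such as the number of bugs fixed, the correctness of the generated patches, and the time needed to produce a repair.

Datasets like this are valuable for research on automatic program repair. By analyzing patterns in defective code, machine learning models can be trained to recognize and fix common bug types. The tasks may also help in studying the relationship between natural-language descriptions and code defects, which is relevant to semantic understanding and code generation.

To use the benchmark in research, you need a working knowledge of Java and Python and some familiarity with debugging and version control tools. A basic understanding of machine learning and natural language processing helps when designing learning-based repair approaches.

The benchmark gives academia and industry a shared platform for testing and comparing automatic program repair techniques. It is a useful resource both for researchers developing new repair algorithms and for engineers who want to improve their development workflow.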
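The task components listed above (a defective program, input/output test cases, and a reference fix) can be sketched in a few lines of Python. This is a minimal illustration, not code taken from the archive: the function names, the test data, and the helper `is_plausible_patch` are all hypothetical, and the one-line defect is only written in the style of the benchmark.

```python
from typing import Any, Callable, List, Sequence, Tuple

# Hypothetical test-case shape: (list of positional arguments, expected output).
TestCase = Tuple[Sequence[Any], Any]


def max_sublist_sum_buggy(arr: List[int]) -> int:
    """Illustrative defective program: largest sum of a contiguous sublist."""
    max_ending_here = 0
    max_so_far = 0
    for x in arr:
        max_ending_here = max_ending_here + x  # one-line defect: never resets to 0
        max_so_far = max(max_so_far, max_ending_here)
    return max_so_far


def max_sublist_sum_fixed(arr: List[int]) -> int:
    """Illustrative reference fix: only the marked line differs."""
    max_ending_here = 0
    max_so_far = 0
    for x in arr:
        max_ending_here = max(0, max_ending_here + x)  # fixed line
        max_so_far = max(max_so_far, max_ending_here)
    return max_so_far


# Illustrative input/output pairs (made up for this sketch).
TESTCASES: List[TestCase] = [
    ([[4, -5, 2, 1, -1, 3]], 5),  # exposes the defect
    ([[1, 2, 3]], 6),             # passes even on the defective version
    ([[-4, -4, -5]], 0),          # passes even on the defective version
]


def run_testcases(candidate: Callable[..., Any],
                  testcases: Sequence[TestCase]) -> List[bool]:
    """Return one pass/fail flag per test case."""
    return [candidate(*args) == expected for args, expected in testcases]


def is_plausible_patch(candidate: Callable[..., Any],
                       testcases: Sequence[TestCase]) -> bool:
    """A candidate counts as a successful repair only if every test passes."""
    return all(run_testcases(candidate, testcases))


if __name__ == "__main__":
    print("buggy:", run_testcases(max_sublist_sum_buggy, TESTCASES))
    print("fixed:", run_testcases(max_sublist_sum_fixed, TESTCASES))
```

Passing every test only shows that a patch is consistent with the test suite; comparing it against the reference fix is a separate check.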


基于QuixeyChallenge的多语言程序修复基准集_Java_Python_下载.zip (410 files)

JavaDeserialization.class 3KB

RPN_EVAL.class 2KB

SHORTEST_PATH_LENGTH.class 2KB

Node.class 2KB

BREADTH_FIRST_SEARCH$Node.class 2KB

LCS_LENGTH.class 2KB

SHORTEST_PATH_LENGTHS.class 2KB

SIEVE.class 2KB

SHORTEST_PATHS.class 1KB

MERGESORT.class 1KB

SHUNTING_YARD.class 1KB

NEXT_PERMUTATION.class 1KB

HANOI$Pair.class 1KB

BUCKETSORT.class 1KB

QUICKSORT.class 1KB

TOPOLOGICAL_ORDERING.class 1KB

DEPTH_FIRST_SEARCH$1Search.class 1KB

LIS.class 1KB

HANOI.class 1KB

KTH.class 1KB

KHEAPSORT.class 1KB

POWERSET.class 1010B

WRAP.class 982B

SUBSEQUENCES.class 967B

NEXT_PALINDROME.class 952B

LONGEST_COMMON_SUBSEQUENCE.class 878B

PASCAL.class 860B

WeightedEdge.class 772B

GET_FACTORS.class 767B

FLATTEN.class 750B

IS_VALID_PARENTHESIZATION.class 730B

TO_BASE.class 672B

LEVENSHTEIN.class 671B

KNAPSACK.class 602B

DEPTH_FIRST_SEARCH.class 571B

REVERSE_LINKED_LIST.class 539B

FIND_IN_SORTED.class 471B

DETECT_CYCLE.class 463B

MAX_SUBLIST_SUM.class 447B

FIND_FIRST_IN_SORTED.class 444B

POSSIBLE_CHANGE.class 438B

SQRT.class 389B

BITCOUNT.class 325B

GCD.class 303B

.gitignore 31B

build.gradle 1KB

TestsGenerator.java 15KB

WRAP_TEST.java 10KB

SUBSEQUENCES_TEST.java 7KB

JavaTest.java 6KB

QUICKSORT_TEST.java 6KB

MERGESORT_TEST.java 6KB

QUICKSORT_TEST.java 6KB

MINIMUM_SPANNING_TREE_TEST.java 5KB

TOPOLOGICAL_ORDERING_TEST.java 5KB

LONGEST_COMMON_SUBSEQUENCE_TEST.java 4KB

BREADTH_FIRST_SEARCH_TEST.java 4KB

GET_FACTORS_TEST.java 4KB

FLATTEN_TEST.java 4KB

DETECT_CYCLE_TEST.java 4KB

JavaDeserialization.java 4KB

FLATTEN_TEST.java 4KB

GET_FACTORS_TEST.java 4KB

SHORTEST_PATH_LENGTHS_TEST.java 3KB

NEXT_PERMUTATION_TEST.java 3KB

KNAPSACK_TEST.java 3KB

SHORTEST_PATHS_TEST.java 3KB

KNAPSACK_TEST.java 3KB

NEXT_PERMUTATION_TEST.java 3KB

DEPTH_FIRST_SEARCH_TEST.java 3KB

MINIMUM_SPANNING_TREE_TEST.java 3KB

SHORTEST_PATH_LENGTHS_TEST.java 3KB

BREADTH_FIRST_SEARCH_TEST.java 3KB

LCS_LENGTH_TEST.java 3KB

BUCKETSORT_TEST.java 3KB

HANOI_TEST.java 3KB

LCS_LENGTH_TEST.java 3KB

BUCKETSORT_TEST.java 3KB

HANOI_TEST.java 3KB

TO_BASE_TEST.java 3KB

SHORTEST_PATH_LENGTH_TEST.java 2KB

DEPTH_FIRST_SEARCH_TEST.java 2KB

SHORTEST_PATH_LENGTH_TEST.java 2KB

POSSIBLE_CHANGE_TEST.java 2KB

TO_BASE_TEST.java 2KB

SHORTEST_PATHS_TEST.java 2KB

POSSIBLE_CHANGE_TEST.java 2KB

SIEVE_TEST.java 2KB


# QuixBugs Benchmark

[![CI Status](https://github.com/jkoppel/QuixBugs/actions/workflows/ci.yml/badge.svg)](https://github.com/jkoppel/QuixBugs/actions/workflows/ci.yml)

The QuixBugs benchmark consists of 40 programs from the Quixey Challenge translated into both Python and Java. Each contains a one-line defect, along with passing (when possible) and failing testcases. Defects fall into one of 14 defect classes. Corrected Python programs are also supplied. QuixBugs is intended for investigating cross-language performance by _multi-lingual_ program repair tools.

For more details, see ["QuixBugs: A Multi-Lingual Program Repair Benchmark Set Based on the Quixey Challenge"](quixbugs.pdf).

Researchers at KTH have run 5 repair systems on the Java version of the QuixBugs programs; see ["A Comprehensive Study of Automatic Program Repair on the QuixBugs Benchmark"](http://arxiv.org/pdf/1805.03454).

# Background: Quixey Challenge

From 2011 to 2013, mobile app search startup Quixey ran a challenge in which programmers were given an implementation of a classic algorithm with a bug on a single line, and had one minute to supply a fix. Success entailed $100 and a possible interview. These programs were developed as challenges for humans by people unaware of program repair.

# Installation & Usage

Simply clone the repo.

    git clone https://github.com/jkoppel/QuixBugs

The Java programs are already compiled (see `*.class` files in `java_programs`). Note that all Java programs are in the same package, called `java_programs`. The utility class `JavaDeserialization.java` requires you to download the external library Gson.

All Python is written in Python3. To run both defective versions of a program against their tests, as well as the corrected Python version, use the test driver:

> python3 tester.py _program\_name_

Output is printed for visual comparison.

## Using JUnit tests

There are JUnit tests in the `java_testcases/junit` folder for the Java version.
Running `TestsGenerator.java` can regenerate them if needed.

To run these tests, you can use [Gradle](https://gradle.org/) tasks provided by the `build.gradle` file. First, install Gradle. Then,

- `gradle test` can be used to run tests on the buggy programs (Runs JUnit tests from the `java_testcases/junit` folder);
- `gradle crtTest` can be used to run tests on the correct programs (Runs JUnit tests from the `java_testcases/junit/crt_program` folder).

It is also possible to run tests for a single program with the `--tests` option:

```bash
$ gradle test --tests KNAPSACK_TEST

> Task :test

java_testcases.junit.KNAPSACK_TEST > test_1 FAILED
    java.lang.AssertionError at KNAPSACK_TEST.java:14

java_testcases.junit.KNAPSACK_TEST > test_3 FAILED
    java.lang.AssertionError at KNAPSACK_TEST.java:26

java_testcases.junit.KNAPSACK_TEST > test_4 FAILED
    java.lang.AssertionError at KNAPSACK_TEST.java:32

java_testcases.junit.KNAPSACK_TEST > test_5 FAILED
    java.lang.AssertionError at KNAPSACK_TEST.java:38

java_testcases.junit.KNAPSACK_TEST > test_6 FAILED
    java.lang.AssertionError at KNAPSACK_TEST.java:44

java_testcases.junit.KNAPSACK_TEST > test_7 FAILED
    java.lang.AssertionError at KNAPSACK_TEST.java:50

10 tests completed, 6 failed
```

```bash
$ gradle crtTest --tests KNAPSACK_TEST

BUILD SUCCESSFUL in 4s
```

## Using pytest tests

For the Python version, there are [pytest](https://pytest.org/) tests for each program in the `python_testcases` folder. To run them, install pytest using `pip` and then, from the root of the repository, call `pytest` to run tests for a single program or target the whole directory to run every test inside it.

```bash
pip install pytest
pytest python_testcases/test_quicksort.py
# Or
pytest python_testcases
```

Tests work for both buggy and correct versions of programs. The default test calls the buggy version, but there is a custom `--correct` flag that uses the correct version of a program.
```bash
pytest --correct python_testcases
```

Most of the tests run fast and finish in less than a second, but two tests are slow. The first one is the last test case of the `knapsack` program, and the second one is the fourth test case of the `levenshtein` program. The default behavior skips both of these tests. For the `knapsack` test case, using the `--runslow` pytest option will include it in the test run. However, the `levenshtein` test case is always skipped since it takes a long time to pass and is ignored by the JUnit tests as well.

```bash
$ pytest --correct --runslow python_testcases/test_knapsack.py

collected 10 items

python_testcases/test_knapsack.py ..........    [100%]

========== 10 passed in 240.97s (0:04:00) ==========
```

```bash
$ pytest --correct python_testcases/test_knapsack.py

collected 10 items

python_testcases/test_knapsack.py ..........    [100%]

========== 9 passed, 1 skipped in 0.08s ==========
```

Some tests, such as the `bitcount` ones, need a timeout. pytest itself doesn't have a timeout mechanism, but there is a [pytest-timeout](https://github.com/pytest-dev/pytest-timeout) plugin for it. Installing pytest-timeout adds additional options to the `pytest` CLI so, for example, to time out the `bitcount` tests after five seconds, you can run:

```bash
pip install pytest-timeout
pytest --timeout=5 python_testcases/test_bitcount.py
```

Make sure to check pytest-timeout's documentation to understand its caveats and how it handles timeouts on different systems.

There is also a [pytest-xdist](https://github.com/pytest-dev/pytest-xdist) plugin that runs tests in parallel and can be used similarly to the timeout plugin.

# Structure & Details

The root folder holds the test driver. It deserializes the JSON testcases for a selected program, then runs them against the defective versions located in java\_programs/ and python\_programs/.
The exception is graph-based programs, for which the testcases are located in the same folder as the corresponding program (they are still run with the test driver in the same manner).

For reference, corrected versions of the Python programs are in correct\_python\_programs/.

Programs include:

- bitcount
- breadth\_first\_search\*
- bucketsort
- depth\_first\_search\*
- detect\_cycle\*
- find\_first\_in\_sorted
- find\_in\_sorted
- flatten
- gcd
- get\_factors
- hanoi
- is\_valid\_parenthesization
- kheapsort
- knapsack
- kth
- lcs\_length
- levenshtein
- lis
- longest\_common\_subsequence
- max\_sublist\_sum
- mergesort
- minimum\_spanning\_tree\*
- next\_palindrome
- next\_permutation
- pascal
- possible\_change
- powerset
- quicksort
- reverse\_linked\_list\*
- rpn\_eval
- shortest\_path\_length\*
- shortest\_path\_lengths\*
- shortest\_paths\*
- shunting\_yard
- sieve
- sqrt
- subsequences
- to\_base
- topological\_ordering\*
- wrap

\* - graph-based algorithm

# Authors

Contact Derrick Lin @drrckln, Angela Chen @angchen, or James Koppel @jkoppel for questions.
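The test-driver flow described under Structure & Details (deserialize the JSON testcases for a program, then run them against an implementation) can be sketched roughly as follows. This is not the repository's `tester.py`: the one-array-per-line layout `[inputs, expected_output]`, the sample data, and every function name here are assumptions made for illustration.

```python
import copy
import io
import json
from typing import Any, Callable, Iterator, List, TextIO, Tuple

# Assumed layout (hypothetical): one JSON array per line, [inputs, expected_output].
SAMPLE_TESTCASES = "[[17, 0], 17]\n[[13, 13], 13]\n[[37, 600], 1]\n"


def load_testcases(stream: TextIO) -> Iterator[Tuple[List[Any], Any]]:
    """Deserialize one testcase per non-empty line."""
    for line in stream:
        line = line.strip()
        if not line:
            continue
        inputs, expected = json.loads(line)
        if not isinstance(inputs, list):
            inputs = [inputs]  # treat a bare value as a single argument
        yield inputs, expected


def run_program(program: Callable[..., Any], inputs: List[Any]) -> Any:
    """Call the program on a copy of the inputs; report exceptions by name."""
    try:
        return program(*copy.deepcopy(inputs))
    except Exception as exc:  # a defective program may raise, e.g. RecursionError
        return type(exc).__name__


def gcd_reference(a: int, b: int) -> int:
    """Stand-in implementation for the sketch (Euclid's algorithm)."""
    while b:
        a, b = b, a % b
    return a


def report(program: Callable[..., Any],
           stream: TextIO) -> List[Tuple[List[Any], Any, Any, bool]]:
    """Collect (inputs, expected, actual, passed) rows for comparison."""
    rows = []
    for inputs, expected in load_testcases(stream):
        actual = run_program(program, inputs)
        rows.append((inputs, expected, actual, actual == expected))
    return rows


if __name__ == "__main__":
    for row in report(gcd_reference, io.StringIO(SAMPLE_TESTCASES)):
        print(row)
```

Copying the inputs before each call keeps a program that mutates its arguments from corrupting the testcase used for comparison.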
