Regression Greybox Fuzzing

Zhu, X.; Böhme, M.

doi:10.1145/3460120.3484596

Regression Greybox Fuzzing

Date

2021

Authors

Zhu, X.

Böhme, M.

Type:

Conference paper

Citation

Proceedings of the ACM Conference on Computer and Communications Security, 2021, pp.2169-2182

Statement of Responsibility

Xiaogang Zhu, Marcel Böhme

Conference Name

ACM Conference on Computer and Communications Security (CCS) (15 Nov 2021 - 19 Nov 2021 : Republic of Korea Virtual Event)

DOI

10.1145/3460120.3484596

Abstract

What you change is what you fuzz! In an empirical study of all fuzzer-generated bug reports in OSSFuzz, we found that four in every five bugs have been introduced by recent code changes. That is, 77% of 23k bugs are regressions. For a newly added project, there is usually an initial burst of new reports at 2-3 bugs per day. However, after that initial burst, and after weeding out most of the existing bugs, we still get a constant rate of 3-4 bug reports per week. The constant rate can only be explained by an increasing regression rate. Indeed, the probability that a reported bug is a regression (i.e., we could identify the bug-introducing commit) increases from 20% for the first bug to 92% after a few hundred bug reports. In this paper, we introduce regression greybox fuzzing (RGF) a fuzzing approach that focuses on code that has changed more recently or more often. However, for any active software project, it is impractical to fuzz sufficiently each code commit individually. Instead, we propose to fuzz all commits simultaneously, but code present in more (recent) commits with higher priority. We observe that most code is never changed and relatively old. So, we identify means to strengthen the signal from executed code-of-interest. We also extend the concept of power schedules to the bytes of a seed and introduce Ant Colony Optimization to assign more energy to those bytes which promise to generate more interesting inputs. Our large-scale fuzzing experiment demonstrates the validity of our main hypothesis and the efficiency of regression greybox fuzzing. We conducted our experiments in a reproducible manner within Fuzzbench, an extensible fuzzer evaluation platform. Our experiments involved 3+ CPU-years worth of fuzzing campaigns and 20 bugs in 15 open-source C programs available on OSSFuzz.

Rights

Grant ID

http://purl.org/au-research/grants/arc/DE190100046

Published Version

https://doi.org/10.1145/3460120.3484596

Persistent link to this record

https://hdl.handle.net/2440/148976

Full item page

Regression Greybox Fuzzing

Date

Authors

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Citation

Statement of Responsibility

Conference Name

DOI

Abstract

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

License

Grant ID

Published Version

Call number

Persistent link to this record