Automatic and lightweight grammar generation for fuzz testing

Su Yong Kim, Sungdeok Cha, Doo Hwan Bae

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

Blackbox fuzz testing can only test a small portion of code when rigorously checking the well-formedness of input values. To overcome this problem, blackbox fuzz testing is performed using a grammar that delineates the format information of input values. However, it is almost impossible to manually construct a grammar if the input specifications are not known. We propose an alternative technique: the automatic generation of fuzzing grammars using API-level concolic testing. API-level concolic testing collects constraints at the library function level rather than the instruction level. While API-level concolic testing may be less accurate than instruction-level concolic testing, it is highly useful for speedily generating fuzzing grammars that enhance code coverage for real-world programs. To verify the feasibility of the proposed concept, we implemented the system for generating ActiveX control fuzzing grammars, named YMIR. The experiment results showed that the YMIR system was capable of generating fuzzing grammars that can raise branch coverage for ActiveX control using highly-structured input string by 15-50%. In addition, the YMIR system discovered two new vulnerabilities revealed only when input values are well-formed. Automatic fuzzing grammar generation through API-level concolic testing is not restricted to the testing of ActiveX controls; it should also be applicable to other string processing program whose source code is unavailable.

Original languageEnglish
Pages (from-to)1-11
Number of pages11
JournalComputers and Security
Volume36
DOIs
Publication statusPublished - 2013 Mar 25

    Fingerprint

Keywords

  • ActiveX control
  • Blackbox fuzz testing
  • Grammar-based fuzzer
  • Hybrid fuzz testing
  • Whitebox fuzz testing

ASJC Scopus subject areas

  • Computer Science(all)
  • Law

Cite this