Rendezvous: A search engine for binary code

Wei Ming Khoo, Alan Mycroft, Ross Anderson

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The problem of matching between binaries is important for software copyright enforcement as well as for identifying disclosed vulnerabilities in software. We present a search engine prototype called Rendezvous which enables indexing and searching for code in binary form. Rendezvous identifies binary code using a statistical model comprising instruction mnemonics, control flow sub-graphs and data constants which are simple to extract from a disassembly, yet normalising with respect to different compilers and optimisations. Experiments show that Rendezvous achieves F2 measures of 86.7% and 83.0% on the GNU C library compiled with different compiler optimisations and the GNU coreutils suite compiled with gcc and clang respectively. These two code bases together comprise more than one million lines of code. Rendezvous will bring significant changes to the way patch management and copyright enforcement is currently performed.
Original languageEnglish
Title of host publication2013 10th Working Conference on Mining Software Repositories (MSR)
EditorsThomas Zimmermann, Massimiliano Di Penta, Sunghun Kim
PublisherInstitute of Electrical and Electronics Engineers
Pages329-338
Number of pages10
ISBN (Electronic)978-1-4673-2936-1
ISBN (Print)978-1-4799-0345-0
DOIs
Publication statusPublished - 10 Oct 2013
Event10th Working Conference on Mining Software Repositories (MSR) - San Francisco, United States
Duration: 18 May 201319 May 2013
Conference number: 10
https://2013.msrconf.org/

Publication series

Name2013 10th Working Conference on Mining Software Repositories (MSR)
PublisherIEEE
ISSN (Print)2160-1852
ISSN (Electronic)2160-1860

Conference

Conference10th Working Conference on Mining Software Repositories (MSR)
Abbreviated titleMSR 2013
Country/TerritoryUnited States
CitySan Francisco
Period18/05/1319/05/13
Internet address

Fingerprint

Dive into the research topics of 'Rendezvous: A search engine for binary code'. Together they form a unique fingerprint.

Cite this