Guided Randomized Search over Programs for Synthesis and Program Optimization

January 2018

Author:
Stefan Heule
Stanford University
,
Advisors:
Aiken, Alex
Stanford University
,
Mitchell, John
Stanford University
,
Sridharan, Manu
Stanford University

Publisher:

Stanford University
408 Panama Mall, Suite 217
Stanford
CA
United States

ISBN:979-8-6625-5635-5

Order Number:AAI28114923

Purchase on ProQuest

Bibliometrics

Abstract

The ability to automatically reason about programs and extract useful information from them is very important and has received a lot of attention from both the academic community as well as practitioners in industry. Scaling such program analyses to real system is a significant challenge, as real systems tend to be very large, very complex, and often at least part of the system is not available for analysis. A common solution to this problem is to manually write models for the parts of the system that are not analyzable. However, writing these models is both challenging and time consuming. Instead, we propose the use of guided randomized search to find models automatically, and we show how this idea can be applied in three diverse contexts. First, we show how we can use guided randomized search to automatically find models for opaque code, a common problem in program analysis. Opaque code is code that is executable but whose source code is unavailable or difficult to process. We present a technique to first observe the opaque code by collecting partial program traces and then automatically synthesize a model. We demonstrate our method by learning models for a collection of array-manipulating routines. Second, we tackle automatically learning a formal specification for the x86-64 instruction set. Many software analysis and verification tools depend, either explicitly or implicitly, on correct modeling of the semantics of x86-64 instructions. However, formal semantics for the x86-64 ISA are difficult to obtain and often written manually with great effort. Instead, we show how to automatically synthesize formal semantics for 1,795 instruction variants of x86-64. Crucial to our success is a new technique, stratified synthesis, that allows us to scale to longer programs. We evaluate the specification we learned and find that it contains no errors, unlike all manually written specifications we compare against. Third, we consider the problem of program optimization on recent CPU architectures. These modern architectures are incredibly complex and make it difficult to statically determine the performance of a program. Using guided randomized search with a new cost function we are able to outperform the previous state-of-the-art on several metrics, sometimes by a wide margin.

Contributors

Stefan Heule
Stanford University
- Publication Years2011 - 2022
- Publication counts11
- Citation count421
- Available for Download7
- Downloads (cumulative)7,269
- Downloads (12 months)1,204
- Downloads (6 weeks)137
- Average Downloads per Article1,038
- Average Citation per Article38
View Full Profile
Aiken Alex
- Publication Years
- Publication counts0
- Citation count0
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article0
View Full Profile
Mitchell John
- Publication Years
- Publication counts0
- Citation count0
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article0
View Full Profile
Sridharan Manu
- Publication Years
- Publication counts0
- Citation count0
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article0
View Full Profile

Index Terms

Guided Randomized Search over Programs for Synthesis and Program Optimization

Index terms have been assigned to the content through auto-classification.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Probabilistic source-level optimisation of embedded programs
LCTES '05: Proceedings of the 2005 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems

Efficient implementation of DSP applications is critical for many embedded systems. Optimising C compilers for embedded processors largely focus on code generation and instruction scheduling which, with their growing maturity, are providing diminishing ...
Probabilistic source-level optimisation of embedded programs
Proceedings of the 2005 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems

Efficient implementation of DSP applications is critical for many embedded systems. Optimising C compilers for embedded processors largely focus on code generation and instruction scheduling which, with their growing maturity, are providing diminishing ...
Program synthesis by model finding

Program synthesis aims to automate the task of programming. In this paper, we present a clear and elegant formulation of program synthesis as an Alloy* specification by applying its model finder to search for a program that satisfies a contract in terms ...

Browse Theses

Sections

Index Terms

Probabilistic source-level optimisation of embedded programs

Probabilistic source-level optimisation of embedded programs

Program synthesis by model finding

Sections

Save to Binder

Index Terms

Recommendations

Probabilistic source-level optimisation of embedded programs

Probabilistic source-level optimisation of embedded programs

Program synthesis by model finding