
LambdaBeam: Neural Program Search with Higher-Order Functions and Lambdas

Published: 30 May 2024
Abstract

Search is an important technique in program synthesis that allows for adaptive strategies such as focusing on particular search directions based on execution results. Several prior works have demonstrated that neural models are effective at guiding program synthesis searches. However, a common drawback of those approaches is the inability to handle iterative loops, higher-order functions, or lambda functions, which prevents prior neural searches from synthesizing longer and more general programs. We address this gap by designing a search algorithm called LambdaBeam that can construct arbitrary lambda functions that compose operations within a given DSL. We create semantic vector representations of the execution behavior of the lambda functions and train a neural policy network to choose which lambdas to construct during search, passing them as arguments to higher-order functions to perform looping computations. Our experiments show that LambdaBeam outperforms neural, symbolic, and LLM-based techniques in an integer list manipulation domain.
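To make the idea concrete, the following is a minimal, illustrative Python sketch, not the paper's implementation or API: candidate lambdas built from DSL operations are executed on concrete inputs, their raw outputs stand in for the semantic vector representations of execution behavior, and a simple distance to the target outputs stands in for the learned policy network that decides which lambda to pass to a higher-order function such as Map. Every name below (map_op, execution_signature, score) is hypothetical.

    from typing import Callable, List

    # Hypothetical higher-order DSL operation: apply fn to every element.
    def map_op(fn: Callable[[int], int], xs: List[int]) -> List[int]:
        return [fn(x) for x in xs]

    # Summarize a lambda's execution behavior by its outputs on probe inputs.
    # A real system would embed these values with a neural network; here the
    # raw outputs stand in for that vector representation.
    def execution_signature(fn: Callable[[int], int], probes: List[int]) -> List[int]:
        return [fn(p) for p in probes]

    # Stand-in for the learned policy: prefer lambdas whose behavior on the
    # probes is closest to the target behavior (lower score is better).
    def score(signature: List[int], target: List[int]) -> int:
        return sum(abs(s - t) for s, t in zip(signature, target))

    # Candidate lambdas composed from DSL operations.
    candidates = {
        "lambda x: x + 1": lambda x: x + 1,
        "lambda x: x * 2": lambda x: x * 2,
        "lambda x: x * x": lambda x: x * x,
    }

    # I/O example: [1, 2, 3] -> [2, 4, 6], i.e., the intended lambda doubles.
    probes = [1, 2, 3]
    target = [2, 4, 6]

    ranked = sorted(candidates.items(),
                    key=lambda kv: score(execution_signature(kv[1], probes), target))
    best_name, best_fn = ranked[0]
    print(best_name, "->", map_op(best_fn, probes))  # lambda x: x * 2 -> [2, 4, 6]

In the actual LambdaBeam algorithm, a trained neural policy over execution embeddings replaces this hand-written score and a beam search replaces the exhaustive ranking, but the core loop of scoring candidate lambdas by their execution behavior is analogous.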



    Published In

    NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems
    December 2023
    80772 pages

    Publisher

    Curran Associates Inc.

    Red Hook, NY, United States


    Qualifiers

    • Research-article
    • Research
    • Refereed limited
