arXiv:2012.00377 (cs)

[Submitted on 1 Dec 2020 (v1), last revised 5 Aug 2021 (this version, v2)]

Title: Latent Programmer: Discrete Latent Codes for Program Synthesis

Authors: Joey Hong, David Dohan, Rishabh Singh, Charles Sutton, Manzil Zaheer

Abstract: In many sequence learning tasks, such as program synthesis and document summarization, a key problem is searching over a large space of possible output sequences. We propose to learn representations of the outputs that are specifically meant for search: rich enough to specify the desired output but compact enough to make search more efficient. Discrete latent codes are appealing for this purpose, as they naturally allow sophisticated combinatorial search strategies. The latent codes are learned using a self-supervised learning principle, in which first a discrete autoencoder is trained on the output sequences, and then the resulting latent codes are used as intermediate targets for the end-to-end sequence prediction task. Based on these insights, we introduce the Latent Programmer, a program synthesis method that first predicts a discrete latent code from input/output examples, and then generates the program in the target language. We evaluate the Latent Programmer on two domains: synthesis of string transformation programs, and generation of programs from natural language descriptions. We demonstrate that the discrete latent representation significantly improves synthesis accuracy.
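
The two-stage idea in the abstract (first pick a short discrete latent code, then generate a program conditioned on it) can be illustrated with a toy sketch. This is not the authors' implementation: the scorers, the decoder, the operation names in `OPS`, and the exhaustive enumeration used in place of a learned model with beam search are all hypothetical stand-ins chosen to keep the example self-contained.

```python
from itertools import product
from typing import Callable, List, Optional, Tuple

Latent = Tuple[int, ...]

def top_k_latents(score: Callable[[Latent], float],
                  vocab_size: int, length: int, k: int) -> List[Latent]:
    """Enumerate every latent code of the given length and keep the k best.

    A real system would use beam search over a learned model; exhaustive
    enumeration is only feasible here because the toy space is tiny.
    """
    candidates = product(range(vocab_size), repeat=length)
    return sorted(candidates, key=score, reverse=True)[:k]

def two_stage_search(latent_score: Callable[[Latent], float],
                     decode: Callable[[Latent], List[str]],
                     program_score: Callable[[List[str]], float],
                     vocab_size: int, length: int, k: int
                     ) -> Optional[Tuple[float, Latent, List[str]]]:
    """Stage 1: shortlist latent codes. Stage 2: decode each and keep the best program."""
    best = None
    for z in top_k_latents(latent_score, vocab_size, length, k):
        program = decode(z)
        s = program_score(program)
        if best is None or s > best[0]:
            best = (s, z, program)
    return best

# --- Hypothetical toy setup (none of these names come from the paper) ---
OPS = ["GetToken", "ToUpper", "Concat"]

def latent_score(z: Latent) -> float:
    # Stand-in for a learned latent predictor: prefers tokens close to 1.
    return -float(sum(abs(t - 1) for t in z))

def decode(z: Latent) -> List[str]:
    # Stand-in for a learned decoder: one operation per latent token.
    return [OPS[t] for t in z]

def program_score(program: List[str]) -> float:
    # Stand-in for checking a program against the input/output examples.
    return 1.0 if program == ["GetToken", "ToUpper"] else 0.0

result = two_stage_search(latent_score, decode, program_score,
                          vocab_size=3, length=2, k=3)
print(result)
```

In this toy setup the highest-scoring latent code alone does not decode to the target program, but keeping a shortlist of three codes recovers it, which is the kind of benefit a compact search space is meant to provide.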

Comments:	ICML 2021; 15 pages, 9 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.00377 [cs.LG]
	(or arXiv:2012.00377v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.00377

Submission history

From: Joey Hong
[v1] Tue, 1 Dec 2020 10:11:35 UTC (278 KB)
[v2] Thu, 5 Aug 2021 18:49:02 UTC (292 KB)
