Maybe Deep Neural Networks are the Best Choice for Modeling Source Code
[article]
2019
arXiv
pre-print
Statistical language modeling techniques have successfully been applied to source code, yielding a variety of new software development tools, such as tools for code suggestion and improving readability. A major issue with these techniques is that code introduces new vocabulary at a far higher rate than natural language, as new identifier names proliferate. But traditional language models limit the vocabulary to a fixed set of common words. For code, this strong assumption has been shown to have
arXiv:1903.05734v1
fatcat:lql53s3x4zdf5d7nsv5h2jnw54