Modeling Vocabulary for Big Code Machine Learning
arXiv: Computation and Language, 2019.
When building machine learning models that operate on source code, several decisions have to be made to model source-code vocabulary. These decisions can have a large impact: some can lead to not being able to train models at all, others significantly affect performance, particularly for Neural Language Models. Yet, these decisions are no...More
PPT (Upload PPT)