FACOS: Finding API Relevant Contents on Stack Overflow with Semantic and Syntactic Analysis [article]

Kien Luong, Mohammad Hadi, Ferdian Thung, Fatemeh Fard, David Lo
2021 arXiv   pre-print
Collecting API examples, usages, and mentions relevant to a specific API method over discussions on venues such as Stack Overflow is not a trivial problem. It requires efforts to correctly recognize whether the discussion refers to the API method that developers/tools are searching for. The content of the thread, which consists of both text paragraphs describing the involvement of the API method in the discussion and the code snippets containing the API invocation, may refer to the given API
more » ... hod. Leveraging this observation, we develop FACOS, a context-specific algorithm to capture the semantic and syntactic information of the paragraphs and code snippets in a discussion. FACOS combines a syntactic word-based score with a score from a predictive model fine-tuned from CodeBERT. FACOS beats the state-of-the-art approach by 13.9% in terms of F1-score.
arXiv:2111.07238v1 fatcat:atheh6gxfjdhznoynbwgz6it7u