Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow [article]

Kaibo Cao
2021 arXiv   pre-print
As query reformulation is tedious for developers, especially for novices, we propose an automated software-specific query reformulation approach based on deep learning.  ...  With query logs provided by Stack Overflow, we construct a large-scale query reformulation corpus, including the original queries and corresponding reformulated ones.  ...  ACKNOWLEDGEMENT The authors would like to thank Stack Exchange Inc. for sharing the dataset, and the anonymous reviewers for their insightful comments and suggestions.  ... 
arXiv:2102.00826v2 fatcat:5wurhi5usnaqbftgo26vj45svq

A Systematic Literature Review of Automated Query Reformulations in Source Code Search [article]

Mohammad Masudur Rahman, Chanchal K. Roy
2021 arXiv   pre-print
In this systematic literature review, we carefully select 70 primary studies on query reformulations from 2,970 candidate studies, perform an in-depth qualitative analysis using the Grounded Theory approach  ...  Finally, we discuss several open issues in search query reformulations and suggest multiple future research opportunities.  ...  Other studies analyze proprietary search logs from Bing search engine [127] and Stack Overflow [25] to generate their reformulated queries.  ... 
arXiv:2108.09646v1 fatcat:7eweftlz4bff5gitbru7et5644

Automatic query reformulation for code search using crowdsourced knowledge

Mohammad M. Rahman, Chanchal K. Roy, David Lo
2019 Empirical Software Engineering  
In this article, we propose a novel query reformulation technique, RACK, that suggests a list of relevant API classes for a natural language query intended for code search.  ...  We first motivate our idea using an exploratory study with 19 standard Java API packages and 344K Java-related posts from Stack Overflow.  ...  Besides, the titles from Stack Overflow questions are a major source of query keywords for code search.  ... 
doi:10.1007/s10664-018-9671-0 fatcat:o6o3ukhn6rh2tol5jvr73cd6we

Augmenting and structuring user queries to support efficient free-form code search

Raphael Sirres, Tegawendé F. Bissyandé, Dongsun Kim, David Lo, Jacques Klein, Kisub Kim, Yves Le Traon
2018 Empirical Software Engineering  
To instantiate this approach, we build GITSEARCH, a code search engine, on top of GitHub and Stack Overflow Q&A data.  ...  acceptable or interesting to the community as answers for Stack Overflow questions.  ...  Acknowledgments The authors would like to thank the anonymous reviewers for their helpful comments and suggestions.  ... 
doi:10.1007/s10664-017-9544-y fatcat:7rdrrdw5xrdujgrnfsbolvbtoa

Domain-specific cross-language relevant question retrieval

Bowen Xu, Zhenchang Xing, Xin Xia, David Lo, Shanping Li
2017 Empirical Software Engineering  
For the purpose of helping Chinese developers take advantage of the rich knowledge base of Stack Overflow and simplify the question retrieval process, we propose an automated cross-language relevant question  ...  Chinese developers often cannot effectively search questions in English, because they may have difficulties in translating technical words from Chinese to English and formulating proper English queries  ...  Based on the term frequency of Stack Overflow questions for a particular domain (e.g., Java.  ... 
doi:10.1007/s10664-017-9568-3 fatcat:v6blb7xpxrfo3erytbreuruwzy

Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language Intent [article]

Geert Heyman, Tom Van Cutsem
2020 arXiv   pre-print
In this work, we propose and study annotated code search: the retrieval of code snippets paired with brief descriptions of their intent using natural language queries.  ...  Building on recent progress in transfer learning and natural language processing, we create a domain-specific retrieval model for code annotated with a natural language description.  ...  Threats to validity Stack Overflow titles as queries: Similar to previous code search studies [2, 3, 5], ground truth queries are compiled from Stack Overflow post titles.  ... 
arXiv:2008.12193v1 fatcat:5iyktr2rpbb2bmqqw23yve7jwq

Opportunities and Challenges in Code Search Tools [article]

Chao Liu, Xin Xia, David Lo, Cuiyun Gao, Xiaohu Yang, John Grundy
2020 arXiv   pre-print
Based on our findings, we identified a set of outstanding challenges in existing studies and a research roadmap for future code search research.  ...  focusing on supporting seven different search tasks.  ...  ). • Most code search studies have tested their proposed tools with top-n frequently used text-based queries collected from Q&A forums (e.g., Stack Overflow). • Performance of 55% of code search tools  ... 
arXiv:2011.02297v1 fatcat:pklxhrm4g5buror5abukwlb5h4

What do developers search for on the web?

Xin Xia, Lingfeng Bao, David Lo, Pavneet Singh Kochhar, Ahmed E. Hassan, Zhenchang Xing
2017 Empirical Software Engineering  
To address this gap, we collected search queries from 60 developers, surveyed 235 software engineers from more than 21 countries across five continents.  ...  queries.  ...  for online using web search engines or on Q&A sites such as Stack Overflow?  ... 
doi:10.1007/s10664-017-9514-4 fatcat:taavsyjoyrbkfdigciydop7dam

In-IDE Code Generation from Natural Language: Promise and Challenges [article]

Frank F. Xu, Bogdan Vasilescu, Graham Neubig
2021 arXiv   pre-print
Recently, there has been a proliferation of machine learning methods for code generation and retrieval from natural language queries, but these have primarily been evaluated purely based on retrieval accuracy  ...  We release all data and software to pave the road for future empirical studies and development of better models.  ...  We also thank anonymous reviewers for their comments on revising this paper.  ... 
arXiv:2101.11149v3 fatcat:3wyizxvtvnarjehq5koxbesq64

Knowledge reuse in software projects: Retrieving software development Q&A posts based on project task similarity

Glaucia Melo, Toacy Oliveira, Paulo Alencar, Donald Cowan, Talib Al-Ameri
2020 PLoS ONE  
In this paper, we present an approach that (i) allows developers to associate project tasks with Stack Overflow posts, and (ii) recommends which Stack Overflow posts might be reused based on task similarity  ...  We analyze an industry dataset, which contains project tasks associated with Stack Overflow posts, looking for the similarity of project tasks that reuse a Stack Overflow post.  ...  Acknowledgments The authors thank the Computer Science Library Liaison Rebecca Hutchinson for the support during the Systematic Mapping Study.  ... 
doi:10.1371/journal.pone.0243852 pmid:33332398 fatcat:ujnqai5fpne23jkfdic3mvs7fa

Holistic recommender systems for software engineering

Luca Ponzanelli
2014 Companion Proceedings of the 36th International Conference on Software Engineering - ICSE Companion 2014  
Any improvement in the search process would result in time saved for the developers.  ...  We selected the four tasks from the issue tracker based on their type (two bug fixes and two enhancements) and difficulty (non-trivial, but doable in a limited amount of time).  ...  The query is sent to search engines (Google, Bing) to perform a Web search on the Stack Overflow website.  ... 
doi:10.1145/2591062.2591081 dblp:conf/icse/Ponzanelli14 fatcat:jsvbb3jz4fbx3opvqntxrwlune

Reusing debugging knowledge via trace-based bug search

Zhongxian Gu, Earl T. Barr, Drew Schleck, Zhendong Su
2012 Proceedings of the ACM international conference on Object oriented programming systems languages and applications - OOPSLA '12  
We evaluated OSCILLOSCOPE on bugs collected from popular open-source projects.  ...  One bug-fixing tactic is to search for similar bugs that have been reported and resolved in the past. A fix for a similar bug can help a developer understand a bug, or even directly fix it.  ...  OSCILLOSCOPE's query engine, which is based on Hadoop, performs the search.  ... 
doi:10.1145/2384616.2384684 dblp:conf/oopsla/GuBSS12 fatcat:24twha4f4jhtxec4j52lkpsx44

Reusing debugging knowledge via trace-based bug search

Zhongxian Gu, Earl T. Barr, Drew Schleck, Zhendong Su
2012 SIGPLAN notices  
We evaluated OSCILLOSCOPE on bugs collected from popular open-source projects.  ...  One bug-fixing tactic is to search for similar bugs that have been reported and resolved in the past. A fix for a similar bug can help a developer understand a bug, or even directly fix it.  ...  OSCILLOSCOPE's query engine, which is based on Hadoop, performs the search.  ... 
doi:10.1145/2398857.2384684 fatcat:gewao56n5vc7hgur2aoqkekema

Boosting API Recommendation with Implicit Feedback [article]

Yu Zhou, Xinying Yang, Taolue Chen, Zhiqiu Huang, Xiaoxing Ma, Harald Gall
2021 arXiv   pre-print
Developers often need to use appropriate APIs to program efficiently, but it is usually a difficult task to identify the exact one they need from a vast number of candidates.  ...  Existing query-based API recommendation approaches can be plugged into BRAID.  ...  For BIKER, the oracle is based on the 125,847 Stack Overflow posts provided by BIKER after preprocessing.  ... 
arXiv:2002.01264v2 fatcat:4jmzuulltrgjlleszqrmwix7wq

A system for scalable and reliable technical-skill testing in online labor markets

Maria Christoforaki, Panagiotis G. Ipeirotis
2015 Computer Networks  
Our platform leverages content that is already available on question answering sites such as Stack Overflow and re-purposes these questions to generate tests.  ...  existing test banks, and an order of magnitude lower than the cost of producing such questions from scratch using experts.  ...  More than 91% of the questions on Stack Overflow have at least one answer. Each question on Stack Overflow is associated with one or more tags (topics).  ... 
doi:10.1016/j.comnet.2015.05.020 fatcat:bjxflc3be5eyxprn6hmwbu363y