Filters








191 Hits in 6.7 sec

Usage and Attribution of Stack Overflow Code Snippets in GitHub Projects [article]

Sebastian Baltes, Stephan Diehl
2018 arXiv   pre-print
We present results of a large-scale empirical study analyzing the usage and attribution of non-trivial Java code snippets from SO answers in public GitHub (GH) projects.  ...  Stack Overflow (SO) is the most popular question-and-answer website for software developers, providing a large amount of copyable code snippets.  ...  Moreover, we thank Richard Kiefer for his help with the calibration of CPD and the extraction of the snippet sets and Florian Reitz for his help with database-related issues.  ... 
arXiv:1802.02938v4 fatcat:55nhx2mjd5hm5j65okgnmofvhy

Attribution Required: Stack Overflow Code Snippets in GitHub Projects

Sebastian Baltes, Richard Kiefer, Stephan Diehl
2017 2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C)  
In this paper, we present the research design and summarized results of an empirical study analyzing attributed and unattributed usages of SO code snippets in GitHub projects.  ...  Stack Overflow (SO) is the largest Q&A website for developers, providing a huge amount of copyable code snippets. Using these snippets raises various maintenance and legal issues.  ...  High-level research design to study the usage and attribution of Stack Overflow (SO) code snippets in GitHub (GH) projects.  ... 
doi:10.1109/icse-c.2017.99 dblp:conf/icse/BaltesK017 fatcat:5z56hkf4pvfrvpju3xbmtllzv4

Software Developers' Work Habits and Expertise: Empirical Studies on Sketching, Code Plagiarism, and Expertise Development [chapter]

Sebastian Baltes
2020 Ernst Denert Award for Software Engineering 2019  
Besides, we report on methodological implications of our research and present the open dataset SOTorrent, which supports researchers in analyzing the origin, evolution, and usage of content on Stack Overflow  ...  Then, we explore to what degree developers copy code from the popular online platform Stack Overflow without adhering to license requirements and motivate why this behavior may lead to legal issues for  ...  To fill this gap, we conducted a large-scale empirical study analyzing software developers' usage and attribution of non-trivial Java code snippets from Stack Overflow answers in public GitHub projects  ... 
doi:10.1007/978-3-030-58617-1_4 fatcat:7w4xnov5ffbgjmkojaf32vjmty

Are code examples on an online Q&A forum reliable?

Tianyi Zhang, Ganesha Upadhyaya, Anastasia Reinhardt, Hridesh Rajan, Miryung Kim
2018 Proceedings of the 40th International Conference on Software Engineering - ICSE '18  
violations in Stack Overflow posts.  ...  violations in Stack Overflow posts.  ...  This work is supported by AFRL grant FA8750-15-2-0075, and NSF grants CCF-1527923, CCF-1460325, CCF-1423370, CNS-1513263, and CCF-1518897.  ... 
doi:10.1145/3180155.3180260 dblp:conf/icse/0001URRK18 fatcat:rvxwdbol6nbw7daurjdvmg7zve

Toxic Code Snippets on Stack Overflow

Chaiyong Ragkhitwetsagul, Jens Krinke, Matheus Paixao, Giuseppe Bianco, Rocco Oliveto
2019 IEEE Transactions on Software Engineering  
Furthermore, we found 214 code snippets that could potentially violate the license of their original software and appear 7,112 times in 2,427 GitHub projects.  ...  Our clone detection found online clone pairs between 72,365 Java code snippets on Stack Overflow and 111 open source projects in the curated Qualitas corpus.  ...  Cristina Lopes and Di Yang from University of California, Irvine for their help in running SourcererCC clone detector and implementing a custom tokeniser for Stack Overflow snippets.  ... 
doi:10.1109/tse.2019.2900307 fatcat:wjnlucsfwvevpazvghafosiy6m

Understanding the Consistency of Stack Overflow Code: A Cautionary Suggestion

Mohammed Lawal Toro
2021 Zenodo  
Outcomes indicate variability in the consistency of Stack Overflow code snippets for the various dimensions; but generally, quality problems were not necessarily dangerous in Stack Overflow snippets.  ...  By assessing the consistency of code snippets on Stack Overload, this work fills the void.  ...  Campos et al. compiled and evaluated Stack Overflow JavaScript fragments in terms of code law be included in projects under GitHub.  ... 
doi:10.5281/zenodo.5149851 fatcat:hlgex5yev5ffdjfd2bk5ez2674

Crypto Experts Advise What They Adopt [article]

Mohammadreza Hazhirpasand, Oscar Nierstrasz, Mohammad Ghafari
2021 arXiv   pre-print
We collected the top 1% of responders who have participated in crypto discussions on Stack Overflow, and we manually analyzed their crypto contributions to open source projects on GitHub.  ...  Moreover, 90% of the analyzed users employed the same concept of cryptography in their projects as they advised about on Stack Overflow.  ...  In particular, platforms such as Stack Overflow contain insecure code snippets and inexperienced developers blindly use such snippets [7] .  ... 
arXiv:2109.15093v1 fatcat:nvenamcmtraifekmwwcbmdvyjq

Stack Overflow: A Code Laundering Platform? [article]

Le An, Ons Mlouki, Foutse Khomh, Giuliano Antoniol
2017 arXiv   pre-print
We found 232 code snippets in 62 Android apps from our dataset that were potentially reused from Stack Overflow, and 1,226 Stack Overflow posts containing code examples that are clones of code released  ...  We investigated the licenses of these pieces of code and observed 1,279 cases of potential license violations (related to code posting to Stack overflow or code reuse from Stack overflow).  ...  ACKNOWLEDGMENT This work is partially supported by Natural Sciences and Engineering Research Council of Canada (NSERC) and by Fonds de Recherche du Québec -Nature et Technologies (FRQNT).  ... 
arXiv:1703.03897v1 fatcat:jbps6jsj65hm7geq65ecigts44

GitHub Discussions: An Exploratory Study of Early Adoption [article]

Hideaki Hata, Nicole Novielli, Sebastian Baltes, Raula Gaikovina Kula, Christoph Treude
2021 arXiv   pre-print
and (5) positive sentiment in Discussions is more frequent than in Stack Overflow posts.  ...  3) developers consider GitHub Discussions useful but face the problem of topic duplication between Discussions and Issues; (4) Discussions play a crucial role in advancing the development of projects;  ...  We preprocessed all the posts to remove HTML tags and code snippets using the Beautiful Soup library. 18 To extract the sentiment conveyed by the posts in the GitHub Discussions and in Stack Overflow questions  ... 
arXiv:2102.05230v3 fatcat:ztgequaltbgynpybh4all7ejqm

Sourcerer's Apprentice and the study of code snippet migration [article]

Stephen Romansky, Cheng Chen, Baljeet Malhotra, Abram Hindle
2018 arXiv   pre-print
of the original Python modules and documentation: software snippets shared through StackOverflow are often being relicensed improperly to CC-BY-SA 3.0 without maintaining the appropriate attribution.  ...  In this paper we put the Apprentice to work on empirical studies that demonstrate there is much sharing between StackOverflow code and Python modules and Python documentation that violates the licensing  ...  However, the Stack-Overflow code may have cited its source in the surrounding text of the posts, which we did not analyze in our evaluation.  ... 
arXiv:1808.00106v1 fatcat:cpwo6vhq4fed5c6wjnlbqg2gpm

Stack Overflow Considered Harmful? The Impact of Copy&Paste on Android Application Security [article]

Felix Fischer, Konstantin Böttinger, Huang Xiao, Christian Stransky, Yasemin Acar, Michael Backes, Sascha Fahl
2017 arXiv   pre-print
Hence, integrating a security-related code snippet from Stack Overflow into production software requires caution and expertise.  ...  We answer this highly important question by quantifying the proliferation of security-related code snippets from Stack Overflow in Android applications available on Google Play.  ...  ACKNOWLEDGEMENTS The authors would like to thank Siddharth Subramanian for his strong support with JavaBaker and the anonymous reviewers for their helpful comments.  ... 
arXiv:1710.03135v1 fatcat:vmwooobi3rghpbxueykbadgfwm

Library Adoption Dynamics in Software Teams [article]

Pamela Bilo Thomas, Rachel Krohn, Tim Weninger
2020 arXiv   pre-print
We find that a variety of factors, including team size, library popularity, and prevalence on Stack Overflow are associated with how quickly teams learn and successfully adopt new software libraries.  ...  In these repositories, we observe code additions, which represent successfully implemented ideas, and code deletions, which represent ideas that have failed or been superseded.  ...  Fig. 5 : 5 Growth of library usage after adoption grouped by Stack Overflow usage.  ... 
arXiv:2003.00045v1 fatcat:gytozj253febbgwykdwykxmb74

A Dataset for API Usage

Anand Ashok Sawant, Alberto Bacchelli
2015 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories  
We try collecting as many usages of an API as possible, this is achieved by targeting projects hosted on GitHub.  ...  By making such a large and rich dataset public, we hope to stimulate some more research in the field of APIs with the aid of accurate API usage samples.  ...  [10] , which can parse incomplete code snippets and give accurate type information on the code snippet.  ... 
doi:10.1109/msr.2015.75 dblp:conf/msr/SawantB15 fatcat:thb7rntyevfbxokc3etb7lrnne

9.6 Million Links in Source Code Comments: Purpose, Evolution, and Decay [article]

Hideaki Hata, Christoph Treude, Raula Gaikovina Kula, Takashi Ishio
2019 arXiv   pre-print
Almost 10% of the links included in source code comments are dead.  ...  In this paper, we investigate the role of links contained in source code comments from these perspectives.  ...  ACKNOWLEDGMENT We thank the respondents to our pull requests for their availability and Sebastian Baltes for his support in querying the SOTorrent dataset.  ... 
arXiv:1901.07440v2 fatcat:tyevtehj4zf2vc37ma525tghg4

When Deep Learning Met Code Search [article]

Jose Cambronero, Hongyu Li, Seohyun Kim, Koushik Sen, Satish Chandra
2019 arXiv   pre-print
The goal of this supervision is to produce embeddings that are more similar for a query and the corresponding desired code snippet.  ...  Clearly, there are choices in whether to use supervised techniques at all, and if one does, what sort of network and training to use for supervision.  ...  ACKNOWLEDGEMENTS We would like to thank the authors of CODEnn for making their system and data public. Similarly, we thank the authors of SCS for making their blog post and code available.  ... 
arXiv:1905.03813v4 fatcat:yazurqj7azcw7jmebaymwjtilu
« Previous Showing results 1 — 15 out of 191 results