Usage and Attribution of Stack Overflow Code Snippets in GitHub Projects [article]

Sebastian Baltes, Stephan Diehl
2018 arXiv   pre-print
We present results of a large-scale empirical study analyzing the usage and attribution of non-trivial Java code snippets from SO answers in public GitHub (GH) projects.  ...  Stack Overflow (SO) is the most popular question-and-answer website for software developers, providing a large amount of copyable code snippets.  ...  Moreover, we thank Richard Kiefer for his help with the calibration of CPD and the extraction of the snippet sets and Florian Reitz for his help with database-related issues.  ... 
arXiv:1802.02938v4 fatcat:55nhx2mjd5hm5j65okgnmofvhy

Attribution Required: Stack Overflow Code Snippets in GitHub Projects

Sebastian Baltes, Richard Kiefer, Stephan Diehl
2017 2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C)  
In this paper, we present the research design and summarized results of an empirical study analyzing attributed and unattributed usages of SO code snippets in GitHub projects.  ...  Stack Overflow (SO) is the largest Q&A website for developers, providing a huge amount of copyable code snippets. Using these snippets raises various maintenance and legal issues.  ...  High-level research design to study the usage and attribution of Stack Overflow (SO) code snippets in GitHub (GH) projects.  ... 
doi:10.1109/icse-c.2017.99 dblp:conf/icse/BaltesK017 fatcat:5z56hkf4pvfrvpju3xbmtllzv4

Gathering GitHub OSS Requirements from Q&A Community: an Empirical Study [article]

Hao Huang, Yao Lu, Xinjun Mao
2020 arXiv   pre-print
Recently increasing users in open source software (OSS) community like GitHub attempt to gather software requirements from question and answer (Q&A) communities such as Stack Overflow (SO).  ...  In order to investigate this emerging crosscommunity collaboration phenomenon, the paper presents an exploratory study on cross-community requirements gathering of OSS projects in GitHub.  ...  For example, according to our analysis, 1892 issues of GitHub project TensorFlow contains Stack Overflows links, many of them are software requirements gathered from the discussions in Stack Overflow.  ... 
arXiv:2008.02485v1 fatcat:ih5h5bydqjcszdmmqsif2oke3q

Software Developers' Work Habits and Expertise: Empirical Studies on Sketching, Code Plagiarism, and Expertise Development [chapter]

Sebastian Baltes
2020 Ernst Denert Award for Software Engineering 2019  
Then, we explore to what degree developers copy code from the popular online platform Stack Overflow without adhering to license requirements and motivate why this behavior may lead to legal issues for  ...  Besides, we report on methodological implications of our research and present the open dataset SOTorrent, which supports researchers in analyzing the origin, evolution, and usage of content on Stack Overflow  ...  Our research revealed that at most one quarter of the code snippets copied from Stack Overflow into public GitHub Java projects are attributed as required by Stack Overflow's license (CC BY-SA).  ... 
doi:10.1007/978-3-030-58617-1_4 fatcat:7w4xnov5ffbgjmkojaf32vjmty

Authorship Attribution of Source Code: A Language-Agnostic Approach and Applicability in Software Engineering [article]

Egor Bogomolov
2021 arXiv   pre-print
practical use in software engineering.  ...  Authorship attribution (i.e., determining who is the author of a piece of source code) is an established research topic.  ...  ACKNOWLEDGMENTS Alberto Bacchelli gratefully acknowledges the support of the Swiss National Science Foundation through the SNSF Project 200021_197227.  ... 
arXiv:2001.11593v2 fatcat:d4rh4glis5ba5gabktz6wunc6m

Toxic Code Snippets on Stack Overflow

Chaiyong Ragkhitwetsagul, Jens Krinke, Matheus Paixao, Giuseppe Bianco, Rocco Oliveto
2019 IEEE Transactions on Software Engineering  
Furthermore, we found 214 code snippets that could potentially violate the license of their original software and appear 7,112 times in 2,427 GitHub projects.  ...  Our clone detection found online clone pairs between 72,365 Java code snippets on Stack Overflow and 111 open source projects in the curated Qualitas corpus.  ...  Cristina Lopes and Di Yang from University of California, Irvine for their help in running SourcererCC clone detector and implementing a custom tokeniser for Stack Overflow snippets.  ... 
doi:10.1109/tse.2019.2900307 fatcat:wjnlucsfwvevpazvghafosiy6m

Awareness and Experience of Developers to Outdated and License-Violating Code on Stack Overflow: An Online Survey [article]

Chaiyong Ragkhitwetsagul, Jens Krinke, Rocco Oliveto
2018 arXiv   pre-print
Although most of the code snippets in the answers are written from scratch, there are code snippets cloned from the corresponding questions, from personal or company projects, or from open source projects  ...  We performed two online surveys of Stack Overflow answerers and visitors to assess their awareness to outdated code and software licenses in Stack Overflow answerers.  ...  Yang et al (2017) analysed 909k non-fork Python projects on GitHub and 1.9 million python code snippets on Stack Overflow and found thousands of code blocks that are copied from Stack Overflow to GitHub  ... 
arXiv:1806.08149v1 fatcat:jusrjvpw2jcc5mnumtaxgtagra

Are code examples on an online Q&A forum reliable?

Tianyi Zhang, Ganesha Upadhyaya, Anastasia Reinhardt, Hridesh Rajan, Miryung Kim
2018 Proceedings of the 40th International Conference on Software Engineering - ICSE '18  
violations in Stack Overflow posts.  ...  From the Stack Overflow dump taken in October 2016, we scan and parse all Java code snippets and extract API method calls.  ... 
doi:10.1145/3180155.3180260 dblp:conf/icse/0001URRK18 fatcat:rvxwdbol6nbw7daurjdvmg7zve

Stack Overflow: A Code Laundering Platform? [article]

Le An, Ons Mlouki, Foutse Khomh, Giuliano Antoniol
2017 arXiv   pre-print
We found 232 code snippets in 62 Android apps from our dataset that were potentially reused from Stack Overflow, and 1,226 Stack Overflow posts containing code examples that are clones of code released  ...  Although all Stack Overflow posts are free to access, code examples on Stack Overflow are governed by the Creative Commons Attribute-ShareAlike 3.0 Unported license that developers should obey when reusing  ...  We gratefully thank the developers who participated in our survey.  ... 
arXiv:1703.03897v1 fatcat:jbps6jsj65hm7geq65ecigts44

Understanding the Consistency of Stack Overflow Code: A Cautionary Suggestion

Mohammed Lawal Toro
2021 Zenodo  
Outcomes indicate variability in the consistency of Stack Overflow code snippets for the various dimensions; but generally, quality problems were not necessarily dangerous in Stack Overflow snippets.  ...  By assessing the consistency of code snippets on Stack Overflow, this work fills the void.  ...  Campos et al. compiled and evaluated Stack Overflow JavaScript fragments in terms of code law be included in projects under GitHub.  ... 
doi:10.5281/zenodo.5149851 fatcat:hlgex5yev5ffdjfd2bk5ez2674

Crypto Experts Advise What They Adopt [article]

Mohammadreza Hazhirpasand, Oscar Nierstrasz, Mohammad Ghafari
2021 arXiv   pre-print
We collected the top 1% of responders who have participated in crypto discussions on Stack Overflow, and we manually analyzed their crypto contributions to open source projects on GitHub.  ...  Moreover, 90% of the analyzed users employed the same concept of cryptography in their projects as they advised about on Stack Overflow.  ...  In particular, platforms such as Stack Overflow contain insecure code snippets and inexperienced developers blindly use such snippets [7].  ... 
arXiv:2109.15093v1 fatcat:nvenamcmtraifekmwwcbmdvyjq

Stack Overflow in Github: Any Snippets There? [article]

Di Yang, Pedro Martins, Vaibhav Saini, Cristina Lopes
2017 arXiv   pre-print
We are interested in studying how programmers use these snippets of code in their projects. Can we find Stack Overflow snippets in real projects?  ...  The large-scale study presented on this paper analyzes 909k non-fork Python projects hosted on Github, which contain 290M function definitions, and 1.9M Python snippets captured in Stack Overflow.  ...  As datasets, we took the collection of 909k non-forked Python projects hosted in Github, as well as the SO dump provided by Stack Exchange.  ... 
arXiv:1705.01198v1 fatcat:3qpzr2cun5dbxcadnsvd5nicxy

An Empirical Analysis on Reducing Open Source Software Development Tasks using Stack Overflow

Tirath Prasad Sahu, Naresh Kumar Nagwani, Shrish Verma
2016 Indian Journal of Science and Technology  
OSS projects and Stack Overflow respectively.  ...  The statistical result shows that the bug fix time can be reduced by posting the bugs into Stack Overflow.  ...  The linking of Stack Overflow to GitHub (the largest coding repository) is presented by the authors 10 to match questions in Stack Overflow with codes which are forked in GitHub.  ... 
doi:10.17485/ijst/2016/v9i21/89198 fatcat:qycfcvuaafgf3fbuy4mxyshoaa

Sourcerer's Apprentice and the study of code snippet migration [article]

Stephen Romansky, Cheng Chen, Baljeet Malhotra, Abram Hindle
2018 arXiv   pre-print
In this paper we put the Apprentice to work on empirical studies that demonstrate there is much sharing between StackOverflow code and Python modules and Python documentation that violates the licensing  ...  of the original Python modules and documentation: software snippets shared through StackOverflow are often being relicensed improperly to CC-BY-SA 3.0 without maintaining the appropriate attribution.  ...  StackOverflow is currently distributing thousands of snippets in violation of both license and attribution requirements.  ... 
arXiv:1808.00106v1 fatcat:cpwo6vhq4fed5c6wjnlbqg2gpm

GitHub Discussions: An Exploratory Study of Early Adoption [article]

Hideaki Hata, Nicole Novielli, Sebastian Baltes, Raula Gaikovina Kula, Christoph Treude
2021 arXiv   pre-print
and (5) positive sentiment in Discussions is more frequent than in Stack Overflow posts.  ...  Before being available to all projects in December 2020, it had been tested on selected open source software projects.  ...  We preprocessed all the posts to remove HTML tags and code snippets using the Beautiful Soup library. To extract the sentiment conveyed by the posts in the GitHub Discussions and in Stack Overflow questions  ... 
arXiv:2102.05230v3 fatcat:ztgequaltbgynpybh4all7ejqm