295 Hits in 2.3 sec

Attribution Required: Stack Overflow Code Snippets in GitHub Projects

Sebastian Baltes, Richard Kiefer, Stephan Diehl
2017 2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C)  
On average, 3.22% of all analyzed repositories and 7.33% of the popular ones contained a reference to SO.  ...  In this paper, we present the research design and summarized results of an empirical study analyzing attributed and unattributed usages of SO code snippets in GitHub projects.  ...  With more than 33 million repositories, GitHub (GH) is one of the most popular code hosting platforms.  ... 
doi:10.1109/icse-c.2017.99 dblp:conf/icse/BaltesK017 fatcat:5z56hkf4pvfrvpju3xbmtllzv4

Does the First Mover Advantage Exist on GitHub? [article]

Aditya Mehta, Arun Paudyal, Atul Sharma, Zyanya Ambros, Ipek Baris, Jun Sun, Oul Han, Akram Sadat Hosseini
2020 arXiv   pre-print
In a social coding network such as GitHub, activities of a set of users can have influence on other users who can get interested in further actions, possibly contributing to a new project together with  ...  We find that early adopter dynamics exist in the GitHub community, where early adopters have more followers in the end as expected.  ...  Some of those studies are focused in understanding the dynamics of influential users and repositories on GitHub.  ... 
arXiv:2006.02193v1 fatcat:2hqm3noxbnf57om2ipcqejekyy

Empirical Study of Test Case and Test Framework Presence in Public Projects on GitHub

Matej Madeja, Jaroslav Porubän, Sergej Chodarev, Matúš Sulír, Filip Gurbáľ
2021 Applied Sciences  
We created an overview of tests occurrence in publicly available GitHub projects and the use of test frameworks in them. The results showed that 52% of the projects contain at least one test case.  ...  In this paper, we performed a large analysis of 6.3 M public GitHub projects using Java as the primary programming language.  ...  the last commit, number of commits, or number of watchers of the project on the X-axis.  ... 
doi:10.3390/app11167250 fatcat:rpgo4oqkwzhqlbwea2yrxrnlzy

Social coding in GitHub

Laura Dabbish, Colleen Stuart, Jason Tsay, Jim Herbsleb
2012 Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work - CSCW '12  
Based on a series of in-depth interviews with central and peripheral GitHub users, we examined the value of transparency for large-scale distributed collaborations and communities of practice.  ...  Social applications on the web let users track and follow the activities of a large number of others regardless of location or affiliation.  ...  Our goal in these interviews was to document and understand in more detail the different ways GitHub functionality was used by our participants.  ... 
doi:10.1145/2145204.2145396 dblp:conf/cscw/DabbishSTH12 fatcat:s3daik5tjfba5ihl24tmu5yuzi

SourceFinder: Finding Malware Source-Code from Publicly Available Repositories in GitHub

Md Omar Faruk Rokon, Risul Islam, Ahmad Darki, Evangelos E. Papalexakis, Michalis Faloutsos
2020 International Symposium on Recent Advances in Intrusion Detection  
Capitalizing on this opportunity, we propose, SourceFinder, a supervised-learning approach to identify repositories of malware source code efficiently.  ...  Our work is driven by the following insight: public archives, like GitHub, have a surprising number of malware repositories.  ...  Understanding malware authors Intrigued by the fact that authors create public malware repositories, we attempt to understand and profile their behavior.  ... 
dblp:conf/raid/RokonIDPF20 fatcat:udmaqwuxy5bybmvmhbftqmnxt4

"May the fork be with you": novel metrics to analyze collaboration on GitHub

Marco Biazzini, Benoit Baudry
2014 Proceedings of the 5th International Workshop on Emerging Trends in Software Metrics - WETSoM 2014  
Interesting outcomes, obtained by applying our metrics on a large sample of projects hosted on GitHub, show the usefulness of our contribution.  ...  Multi-repository software projects are becoming more and more popular, thanks to web-based facilities such as GitHub.  ...  Our outcomes are obtained on a sample of GitHub projects which is small, with respect to the amount of existing repositories on GitHub. Thus, reported results may not generalize.  ... 
doi:10.1145/2593868.2593875 dblp:conf/icse/BiazziniB14 fatcat:gmyghyc4y5eiling6iv5ha464a

Parameter-free probabilistic API mining across GitHub

Jaroslav Fowkes, Charles Sutton
2016 Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering - FSE 2016  
We show that PAM significantly outperforms both MAPO and UPMiner, achieving 69% test-set precision, at retrieving relevant API call sequences from GitHub.  ...  Existing API mining algorithms can be difficult to use as they require expensive parameter tuning and the returned set of API calls can be large, highly redundant and difficult to understand.  ...  Popularity in the GitHub corpus is calculated as the sum of the number of project forks and watchers, where each is separately normalized into a z-score.  ... 
doi:10.1145/2950290.2950319 dblp:conf/sigsoft/FowkesS16 fatcat:q5z3g57l3ne2jo5tlnkyreno2m

License usage and changes: a large-scale study on gitHub

Christopher Vendome, Gabriele Bavota, Massimiliano Di Penta, Mario Linares-Vásquez, Daniel German, Denys Poshyvanyk
2016 Empirical Software Engineering  
Specifically, we first identify licenses' changes in 39,563,885 commits, representing the entire history of 51,757 projects hosted on GitHub written in C, C++, C#, JavaScript, Python, and Ruby.  ...  Then, to understand the rationale of license changes, we perform a qualitative analysis-following an open coding approach inspired by grounded theory-on commit notes and issue tracker discussions concerning  ...  , and (ii) they had at least one star (i.e., at least one user expressed appreciation for the repository) or watcher (i.e., at least one user asked to receive notification about changes made in the repository  ... 
doi:10.1007/s10664-016-9438-4 fatcat:ncpk7r36lrgrfmnscac6ah4apm

A Large Scale Study of Long-Time Contributor Prediction for GitHub Projects

Lingfeng Bao, Xin Xia, David Lo, Gail C. Murphy
2019 IEEE Transactions on Software Engineering  
In this paper, we investigate whether we can effectively predict newcomers in OSS projects to be LTCs based on their activity data that is collected from GITHUB.  ...  To build a prediction model, we extract many features from the activities of developers on GITHUB, which group into five dimensions: developer profile, repository profile, developer monthly activity, repository  ...  CONCLUSION & FUTURE WORK In this paper, based on multiple kinds of data in software development from GITHUB, we apply data mining techniques to investigate whether newcomers will become LTCs of a project  ... 
doi:10.1109/tse.2019.2918536 fatcat:6y7453uggrcl3a3w2dr4lnrera

Predicting the Popularity of GitHub Repositories

Hudson Borges, Andre Hora, Marco Tulio Valente
2016 Proceedings of the The 12th International Conference on Predictive Models and Data Analytics in Software Engineering - PROMISE 2016  
In this paper, we use multiple linear regressions to predict the number of stars of GitHub repositories.  ...  Finally, we evaluate the ability to predict not the number of stars of a repository but its rank among the GitHub repositories.  ...  Yuan et al. investigate 28 factors along eight dimensions to understand how high-rated Android applications are different from low-rated ones [23] .  ... 
doi:10.1145/2972958.2972966 dblp:conf/promise/BorgesHV16 fatcat:jw4s44bp65dqtgme2xlyj3lani

An Empirical Analysis of the Docker Container Ecosystem on GitHub

Jurgen Cito, Gerald Schermann, John Erik Wittern, Philipp Leitner, Sali Zumberi, Harald C. Gall
2017 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR)  
The most popular projects change more often than the rest of the Docker population, with 5.81 revisions per year and 5 lines of code changed on average.  ...  Docker allows packaging an application with its dependencies into a standardized, self-contained unit (a so-called container), which can be used for software development and to run the application on any  ...  CONCLUSION We conduct the first large-scale empirical study to analyze the ecosystem, quality aspects and evolution behavior of Docker containers on Github.  ... 
doi:10.1109/msr.2017.67 dblp:conf/msr/CitoSWLZG17 fatcat:lsshdo7yrzf5nn3mwgq3k6ggwe

Identifying unmaintained projects in github

Jailton Coelho, Marco Tulio Valente, Luciana L. Silva, Emad Shihab
2018 Proceedings of the 12th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement - ESEM '18  
Aims: In this paper, we propose an approach to identify GitHub projects that are not actively maintained.  ...  We empirically validate the model with the best performance with the principal developers of 129 GitHub projects.  ...  We would also like to thank the 112 GitHub developers who kindly answered our survey.  ... 
doi:10.1145/3239235.3240501 dblp:conf/esem/CoelhoVSS18 fatcat:h4kb27mgpzejvbyicccvlqr4bm

TravisTorrent: Synthesizing Travis CI and GitHub for Full-Stack Research on Continuous Integration

Moritz Beller, Georgios Gousios, Andy Zaidman
2017 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR)  
In order to lay the groundwork for data-driven research on CI, we built TravisTorrent,, a freely available data set based on Travis CI and GitHub that provides easy access to  ...  Thanks in part to its tight integration with GitHub, Travis CI has emerged as arguably the most widely used CI platform for Open-Source Software (OSS) development.  ...  To be able to do this, we restricted our project space using established filtering criteria to all non-fork, non-toy, somewhat popular (> 10 watchers on GITHUB) projects with a history of TRAVIS CI use  ... 
doi:10.1109/msr.2017.24 dblp:conf/msr/BellerGZ17a fatcat:5mp2eqtqmfc6dolyhoapczyyti

Investigating and Projecting Population Structures in Open Source Software Projects: A Case Study of Projects in GitHub

Saya ONOUE, Hideaki HATA, Akito MONDEN, Kenichi MATSUMOTO
2016 IEICE transactions on information and systems  
This paper presents a case study, investigating the characteristics of the population structures of OSS projects on GitHub, and shows population projections generated with the well-known cohort component  ...  Understanding current and future structures of OSS projects can help practitioners to monitor a project, gain awareness of what is happening, manage risks, and evaluate past decisions. key words: OSS,  ...  Related Work Web services for software developers, such as GitHub † and Open Hub † † , are popular. Therefore, we can easily understand the activity of contributors in OSS. Dabbish et al.  ... 
doi:10.1587/transinf.2015edp7363 fatcat:gzmaskbhcjexlgdx5fmctl54zy

Oops, My Tests Broke the Build: An Explorative Analysis of Travis CI with GitHub

Moritz Beller, Georgios Gousios, Andy Zaidman
2017 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR)  
However, testing on TRAVIS CI does not seem an adequate surrogate for running tests locally in the IDE. To further research on TRAVIS CI with GITHUB, we introduce TRAVISTORRENT.  ...  if testing on the CI can serve as a surrogate to local testing in the IDE.  ...  RQ1: How common is TRAVIS CI use on GitHub?  ... 
doi:10.1109/msr.2017.62 dblp:conf/msr/BellerGZ17 fatcat:zzbopra7rzfznnsnl2htttvcv4
« Previous Showing results 1 — 15 out of 295 results