1,337 Hits in 6.7 sec

A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared Commits [article]

Audris Mockus, Diomidis Spinellis, Zoe Kotti, Gabriel John Dusing
2020 arXiv   pre-print
Git commits are based on Merkle Tree and two commits are highly unlikely to be produced independently.  ...  To address this, we apply Louvain community detection algorithm to this very large graph consisting of links between commits and projects.  ...  While there are many ways to identify related projects, here we focus on a single approach: linking projects sharing at least one commit.  ... 
arXiv:2002.02707v2 fatcat:lxbtu4rfyfchjitadyropyowly

Precise Data Identification Services for Long Tail Research Data

Stefan Proell, Kristof Meixner, Andreas Rauber
2016 Figshare  
Theyprovide implicit change tracking of all data modifications,while precise subsets are identified via the respective subset-ting process.  ...  These enhances reproducibility of experimentsand allows efficient sharing of specific subsets of data evenin highly dynamic data settings  ...  Acknowledgement Part of this work was co-funded by the research project DEXHELPP, supported by BMVIT, BMWFW and the state of Vienna, and COMET K1, FFG -Austrian Research Promotion Agency.  ... 
doi:10.6084/m9.figshare.3847632.v1 fatcat:idtfqwyvv5acpicldujduw2jo4

World of Code: Enabling a Research Workflow for Mining and Analyzing the Universe of Open Source VCS data [article]

Yuxing Ma, Tapajit Dey, Chris Bogart, Sadika Amreen, Marat Valiev, Adam Tutko, David Kennard, Russell Zaretzki, Audris Mockus
2020 arXiv   pre-print
Our current WoC implementation is capable of being updated on a monthly basis and contains over 18B Git objects.  ...  To answer such questions we: a) create a very large and frequently updated collection of version control data in the entire FLOSS ecosystems named World of Code (WoC), that can completely cross-reference  ...  The simplest approach is to create a local copy of the remote repositories via "git clone" command.  ... 
arXiv:2010.16196v1 fatcat:wonvbkuqtncttja2us5uiidvdy

Gitana: A SQL-Based Git Repository Inspector [chapter]

Valerio Cosentino, Javier Luis Cánovas Izquierdo, Jordi Cabot
2015 Lecture Notes in Computer Science  
In this paper, we propose a conceptual schema for Git and an approach that, given a Git repository, exports its data to a relational database in order to (1) promote data integration with other existing  ...  While such tools efficiently track the evolution of a given aspect of the project (e.g., bug reports), they provide just a partial view of the project and often lack of advanced querying mechanisms limiting  ...  the Git repository but not in the database, optionally together with a before date; while the commit analysis is launched for each set of new commits per reference.  ... 
doi:10.1007/978-3-319-25264-3_24 fatcat:lxxpu5t6c5cptjegkkeqllsjh4

"May the fork be with you": novel metrics to analyze collaboration on GitHub

Marco Biazzini, Benoit Baudry
2014 Proceedings of the 5th International Workshop on Emerging Trends in Software Metrics - WETSoM 2014  
We present a set of novel metrics, based on an original classification of commits, conceived to capture some interesting aspects of a multi-repository development process.  ...  We also describe an efficient way to build a data structure that allows to compute these metrics on a set of Git repositories.  ...  Finally, we define a set of metrics based on our commit classification.  ... 
doi:10.1145/2593868.2593875 dblp:conf/icse/BiazziniB14 fatcat:gmyghyc4y5eiling6iv5ha464a

D13.1 – Software Release Procedures and Tools JRA2

Assante Massimiliano, Coro Gianpaolo, Frosini Luca, Pagano Pasquale, Simi Manuele
2019 Zenodo  
The procedures are documented through a set of documentation pages for single facilities hosted by the gCube wiki.  ...  The second pattern is related to the provision of software methods elaborated by the ARIADNEplus community requiring execution within the ARIADNEplus infrastructure.  ...  a new commit in the Git repository; • on a Jenkins slave, as part of the release pipeline.  ... 
doi:10.5281/zenodo.4926461 fatcat:6u2wnob2u5gjtmqtwepla34pyi

Using Object Replication for Building a Dependable Version Control System [chapter]

Rüdiger Kapitza, Peter Baumann, Hans P. Reiser
2008 Lecture Notes in Computer Science  
This paper presents DiGit, a replicated version control system based on the architecture of Git. DiGit is implemented with the help of the FTflex infrastructure for object replication.  ...  We identify two advantages of the replication infrastructure: the ability to provide client-side code as a conceptually integral part of a remote service, and support for an optimized protocol for remote  ...  A Tree object thus is similar to a directory. Third, a Commit object uniquely identifies a certain version of a branch.  ... 
doi:10.1007/978-3-540-68642-2_7 fatcat:lgze3jnlmfbrna7kvw56hljdiq

An Analysis of Merge Conflicts and Resolutions in Git-Based Open Source Projects

Hoai Le Nguyen, Claudia-Lavinia Ignat
2018 Computer Supported Cooperative Work (CSCW)  
In this paper we present an analysis of concurrency and conflicts in official Git repository of four projects: Rails, IkiWiki, Samba and Linux Kernel.  ...  While Git offers some very interesting features, resolving conflicts that arise during synchronization of parallel changes is a time-consuming task.  ...  The server keeps a complete history of versions while clients keep only a local copy of the shared documents.  ... 
doi:10.1007/s10606-018-9323-3 fatcat:kmkydh7hlfdixo5o3djdz222yy

Forking Without Clicking

Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli
2020 Proceedings of the 17th International Conference on Mining Software Repositories  
In both cases the VCS repositories participating in a fork share parts of a common development history.  ...  These "forge forks" however can only identify as forks repositories that have been created on the platform, e.g., by clicking a "fork" button on the platform user interface.  ...  ACKNOWLEDGMENTS The authors would like to thank Théo Zimmermann for his careful review and comments on an early version of this paper.  ... 
doi:10.1145/3379597.3387450 dblp:conf/msr/PietriRZ20 fatcat:6cjkp54ahzdvloiyfbm55uafiq

Decentralized Collaborative Knowledge Management Using Git

Natanael Arndt, Patrick Naumann, Norman Radtke, Michael Martin, Edgard Marx
2018 Journal of Web Semantics  
The approach is based on a formal expression of evolution and consolidation of distributed datasets.  ...  The World Wide Web and the Semantic Web are designed as a network of distributed services and datasets.  ...  In contrast to other VCS such as Subversion or CVS 26 , Git is a DVCS. As such, in Git, users work on a local version of a remote Git repository that is a complete clone of the remote repository.  ... 
doi:10.1016/j.websem.2018.08.002 fatcat:mzl4i2pvljgjvedpgo7sxrwiqi

Decentralized Evolution and Consolidation of RDF Graphs [chapter]

Natanael Arndt, Michael Martin
2017 Lecture Notes in Computer Science  
The system is based on Git and provides distributed collaboration on RDF graphs.  ...  The World Wide Web and the Semantic Web are designed as a network of distributed services and datasets. In this network and its genesis, collaboration played and still plays a crucial role.  ...  This work was partly supported by a grant from the German Federal Ministry of Education and Research (BMBF) for the LEDS Project under grant agreement No 03WKCG11C.  ... 
doi:10.1007/978-3-319-60131-1_2 fatcat:tjsuzrqcjjfytj42qek2i3xwcu

Research Artifact: The Potential of Meta-Maintenance on GitHub

Hideaki Hata, Raula Gaikovina Kula, Takashi Ishio, Christoph Treude
2021 Zenodo  
This artifact is a data repository including a list of studied 32,007 repositories on GitHub, a list of targeted 401,610,677 files, the results of the qualitative analysis for RQ2, RQ3, and RQ4, the results  ...  This is a research artifact for the ICSE'21 paper "Same File, Different Changes: The Potential of Meta-Maintenance on GitHub".  ...  Based on visual inspection of the plot in Figure 1 , we set the thresholds at a family size of at least 331 for common, and at least 28 for sometimes.  ... 
doi:10.5281/zenodo.4456667 fatcat:snsjfygut5cdhi6o4agd2q2one

A Survey on Mining Software Repositories

Woosung JUNG, Eunjoo LEE, Chisu WU
2012 IEICE transactions on information and systems  
This paper presents fundamental concepts, overall process and recent research issues of Mining Software Repositories.  ...  Finally, evaluation approaches, opportunities and challenge issues are given.  ...  Archives of Communications A lot of data related to communications are generated via email, messenger or off-line meeting, which are very useful for identifying the structure or efficiency of teams.  ... 
doi:10.1587/transinf.e95.d.1384 fatcat:kfje3mzcufchzdj7qyt5smaaum

On Omitting Commits and Committing Omissions: Preventing Git Metadata Tampering That (Re)introduces Software Vulnerabilities

Santiago Torres-Arias, Anil Kumar Ammula, Reza Curtmola, Justin Cappos
2016 USENIX Security Symposium  
By documenting the state of the repository at a particular time when an action is taken, developers are given a shared history, so irregularities are easily detected.  ...  We performed responsible disclosure of the attacks and are working with the Git community to fix these issues in an upcoming version of Git.  ...  Likewise, we thank Lois A. DeLong, Vladimir Diaz, and the anonymous reviewers for their feedback on the writing on this paper.  ... 
dblp:conf/uss/Torres-AriasACC16 fatcat:trscfhvcybfxjkamhjbo7bu4cm

Library adoption in public software repositories

Rachel Krohn, Tim Weninger
2019 Journal of Big Data  
We also show that a simple linear model is capable of classifying future commits as being an adoption or not, based on the commit contents and the preceding history of the user and repository.  ...  For this task we introduce a large corpus containing all commits, diffs, messages, and source code from 259,690 Python repositories (about 13% of all Python projects on Github), including all Git activity  ...  Git is a source code management system allowing individual users to pull updated code from the shared software repository, merge them into a local codebase, and commit changes back to the shared repository  ... 
doi:10.1186/s40537-019-0201-8 fatcat:x4jtges3nfav5igfur3xq7ru5e
« Previous Showing results 1 — 15 out of 1,337 results