Designing a Knowledge Base for OSS Project Recommender System: a Big Data Analytics Approach

Shahabedin Bayati, Arvind K. Tripathi
2016 European Conference on Information Systems  
Online software engineering repositories like GitHub are rich sources of socio-technical data about the software development process. GitHub, as a large-scale social coding environment, contains various types of open source projects. Selecting a suitable project from a developer's perspective is a difficult and time-consuming task. In this paper, general Big Data approaches and machine learning techniques are used to analyse GitHub data. A variety of socio-technical metrics and factors are extracted from online repositories for data analysis. We find that data pre-processing plays an important role in the proposed approach for GitHub mining. The design science research method is applied to the pre-processed data on open source software (OSS) projects to design a recommendation system for project selection. Content-based recommendation techniques are proposed with an evaluation mechanism.
dblp:conf/ecis/BayatiT16 fatcat:hz626lcb2rcdveyafirhsfjxym