123,801 Hits in 8.0 sec

Modeling the evolution of topics in source code histories

Stephen W. Thomas, Bram Adams, Ahmed E. Hassan, Dorothea Blostein
2011 Proceeding of the 8th working conference on Mining software repositories - MSR '11  
case in source code histories.  ...  for analyzing source code histories.  ...  This work is supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) and Research in Motion (RIM).  ... 
doi:10.1145/1985441.1985467 dblp:conf/msr/ThomasAHB11 fatcat:ks4rpnryzbgqxc6g3mmqjbk3ii

Studying software evolution using topic models

Stephen W. Thomas, Bram Adams, Ahmed E. Hassan, Dorothea Blostein
2014 Science of Computer Programming  
In this paper, we take a first step towards evaluating topic models in the analysis of software evolution by performing a detailed manual analysis on the source code histories of two well-known and well-documented  ...  However, research has yet to determine whether these automatically discovered topic evolutions describe the evolution of source code in a way that is relevant or meaningful to project stakeholders, and  ...  Acknowledgments We would like to thank the anonymous reviewers for their careful and insightful feedback, which helped improve the quality of this paper.  ... 
doi:10.1016/j.scico.2012.08.003 fatcat:dke4f6bp4jadheoyf63z3einum

An Application of Latent Dirichlet Allocation to Analyzing Software Evolution

Erik Linstead, Cristina Lopes, Pierre Baldi
2008 2008 Seventh International Conference on Machine Learning and Applications  
We develop and apply unsupervised statistical topic models, in particular Latent Dirichlet Allocation, to identify functional components of source code and study their evolution over multiple project versions  ...  In addition to detecting the emergence of topics on the release timeline which represent integration points for key source code functionality, our techniques can also be used to pinpoint refactoring events  ...  Acknowledgments: Work in part supported by National Science Foundation MRI grant EIA-0321390 and a Microsoft Faculty Research Award to PB, as well as National Science Foundation grant CCF-0347902 to CL  ... 
doi:10.1109/icmla.2008.47 dblp:conf/icmla/LinsteadLB08 fatcat:le64bzhcxzcoxnnt63tk35kkae

Using Topic Models to Support Software Maintenance

Scott Grant, James R. Cordy, David B. Skillicorn
2012 2012 16th European Conference on Software Maintenance and Reengineering  
Our recent research has shown that the latent information found by commonly used topic models generally relates to the development history of a software system.  ...  While it is not always possible to associate these latent topics with human-oriented concepts, it is demonstrable that they identify historical maintenance relationships in source code.  ...  ACKNOWLEDGEMENTS This work is supported in part by the Natural Sciences and Engineering Research Council of Canada, by the Ontario Graduate Scholarship Program, and by an IBM CAS faculty award.  ... 
doi:10.1109/csmr.2012.51 dblp:conf/csmr/GrantCS12 fatcat:bgt66ojstzabxmhbesxoq4t5ei

Mining Unstructured Software Repositories [chapter]

Stephen W. Thomas, Ahmed E. Hassan, Dorothea Blostein
2013 Evolving Software Systems  
M INING SOFTWARE REPOSITORIES, which is the process of analyzing the data related to software development practices, is an emerging field which aims to aid development teams in their day to day tasks.  ...  She believed in me, helped me, and kept me on track. I am truly grateful for the one-two punch of Ahmed and Dorothea that was easily the most significant factor of my success.  ...  In addition, by using topic evolution models on the history of source code, we can uncover the evolution of source code concepts with an accuracy of 87-89%. i Finally, we show the risks of current research  ... 
doi:10.1007/978-3-642-45398-4_5 fatcat:n5ivswn6brfupjbsuomwddokd4

Software Evolution via Topic Modeling: An Analytic Stud

Shadi Banitaan, Mamdouh Alenezi
2015 International Journal of Software Engineering and Its Applications  
In this work, we apply Latent Dirichlet Allocation (LDA) to discover source code topics and study their evolution over multiple software versions.  ...  We study the software evolution of two large open-source projects, JEdit and JHotDraw, over twelve versions.  ...  Approach and Case Study In this section, we investigate the use of LDA to support software evolution. We describe the approach that is used to identify the topics from source code.  ... 
doi:10.14257/ijseia.2015.9.5.05 fatcat:gvsdl36izze2jckqay3k3n2kxu

Reverse Engineering Co-maintenance Relationships Using Conceptual Analysis of Source Code

Scott Grant, James R. Cordy, David B. Skillicorn
2011 2011 18th Working Conference on Reverse Engineering  
of source code methods.  ...  In this work, we explore the relationship between topic models and co-maintenance history by introducing a visualization that compares conceptual cohesion within changelists.  ...  Each of these source code fragments has some degree of membership in each of the topics found by the concept model.  ... 
doi:10.1109/wcre.2011.20 dblp:conf/wcre/GrantCS11 fatcat:4gf64fqybrcg3fnyykp6qahrka

Correlating Social Interactions to Release History during Software Evolution

Olga Baysal, Andrew J. Malton
2007 Fourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007)  
We employ an information retrieval approach to find correlation between source code change history and history of social interactions surrounding these changes.  ...  The results of these exploratory case studies demonstrate the evidence of similarity between the content of free-form text emails among developers and the actual modifications in the code.  ...  We used fine grained analysis of release repositories to recover the history of source code modifications indicated by lines that have been added, deleted and changed during the evolution of a source file  ... 
doi:10.1109/msr.2007.4 dblp:conf/msr/BaysalM07 fatcat:75khtjcjkjfi3f7hztkn2jk2hq

Explore the evolution of development topics via on-line LDA

Jiajun Hu, Xiaobing Sun, Bin Li
2015 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER)  
These previously LDA-based topic analysis models can capture either changes on the strength (popularity) of various development topics over time (i.e., strength evolution) or changes in the content (the  ...  the evolution of development topics.  ...  applied the Hall Model [9] to analyze the history of source code documents to discover and monitor the drift of topics in source code [4] , [10] .  ... 
doi:10.1109/saner.2015.7081876 dblp:conf/wcre/HuSL15 fatcat:hyqw5hdapzefvj435euqvpp7di

Modeling Changeset Topics

Christopher S. Corley, Kelly L. Kashuda, Daniel S. May, Nicholas A. Kraft
2014 2014 IEEE 4th Workshop on Mining Unstructured Data  
However, source code evolves, which causes models to become obsolete. In this paper, we explore the approach of topic modeling changesets over the traditional release approach.  ...  We conduct an exploratory study of four open source systems. We investigate the differences in corpora in each project, and evaluate the topic distinctness of the models.  ...  ACKNOWLEDGMENT We thank the anonymous reviewers for their insightful comments and helpful suggestions. This material is based upon work supported by the U.S. Department of Education under Grant No.  ... 
doi:10.1109/mud.2014.9 dblp:conf/icsm/CorleyKMK14 fatcat:tul7buo64nfthdet57n7rv4azi

Identifying Knowledge Divergence by Vocabulary Monitoring in Software Projects

Jan Nonnen, Paul Imhoff
2012 2012 16th European Conference on Software Maintenance and Reengineering  
During the development of a project, words used in source code add up to a big vocabulary, which may lead to a divergent word-understanding and word-knowledge between developers.  ...  In this work we propose a way to identify such situations by analysing the words contained in identifiers obtained through the commit history in a version control system.  ...  Topic evolution: Thomas et at. [10] presented an topic evolution model based on differences between versions. Topics are collections of words that co-occur frequently in the project history.  ... 
doi:10.1109/csmr.2012.56 dblp:conf/csmr/NonnenI12 fatcat:wisbw6lezvhcfaecoovbzu5jpa

Detailed author index

2009 2009 6th IEEE International Working Conference on Mining Software Repositories  
Bug Reports Using a Vocabulary-Based Expertise Model of Developers Mockus, Audris 11 Amassing and Indexing a Large Sample of Version Control Systems: Towards the Census of Public Source Code History  ...  of Developers 175 Automatic Labeling of Software Components and Their Evolution Using Log-Likelihood Ratio of Word Frequencies in Source Code [Search] A B C D E F G H I J K L M N O P Q R S T U  ... 
doi:10.1109/msr.2009.5069464 fatcat:hbptjwwpvng4hebf6c7ni72siu

Automatic labeling of software components and their evolution using log-likelihood ratio of word frequencies in source code

Adrian Kuhn
2009 2009 6th IEEE International Working Conference on Mining Software Repositories  
This understanding cannot be gained at the level of source code due to the semantic gap between source code and the domain model.  ...  In particular, we apply the approach to detect trends in the evolution of a software system.  ...  We gratefully acknowledge the financial support of the Swiss National Science Foundation for the project "Bringing Models Closer to Code" (SNF Project No. 200020-121594, Oct. 2008 -Sept. 2010).  ... 
doi:10.1109/msr.2009.5069499 dblp:conf/msr/Kuhn09 fatcat:jqhojunm5nhfpfeibura7gnu2q

Using topic models to understand the evolution of a software ecosystem

Nicolas Lopez
2013 Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering - ESEC/FSE 2013  
In my research I seek to explore how topic modeling techniques can aid in understanding the evolution of a software ecosystem.  ...  Topic modeling techniques show promise for mining the data stored in software repositories to understand the evolution of a system.  ...  To find out if topics characterize changes, we performed a study analyzing the change history of an open source system, Mylyn.  ... 
doi:10.1145/2491411.2492402 dblp:conf/sigsoft/Lopez13 fatcat:shmnpiucc5dcxkmt3t2frd6cvq

Analysing the Vocabulary to Identify Knowledge Divergence

Jan Nonnen, Paul Imhoff
2012 Softwaretechnik-Trends  
Topics are collections of words that co-occur frequently in the project history. They could identify global changes in their evolution visualisation.  ...  Our history model H is a set of tuples (c, a, w), where c is the commit time, a the author, and w the word.  ... 
doi:10.1007/bf03323462 fatcat:xx6ofxorcnbrrahjozrt74zj3e
« Previous Showing results 1 — 15 out of 123,801 results