Software is data too

doi:10.1145/1882362.1882410

Proceedings Article10.1145/1882362.1882410

Software is data too

Andrian Marcus, +1 more

- 07 Nov 2010

- pp 229-232

28

TL;DR: It is argued in this position paper that data mining, statistical analysis, machine learning, information retrieval, data integration, etc., are necessary solutions to deal with software data.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1109/ICPC.2013.6613829

Evaluating source code summarization techniques: Replication and expansion

Brian P. Eddy, +3 more

- 20 May 2013

TL;DR: A new topic modeling based approach to source code summarization is proposed, and via a study of 14 developers, source code summaries generated using the proposed technique are evaluated.

...read moreread less

174

•Journal Article•10.1002/ASI.23358

Data journals: A survey

Leonardo Candela, +3 more

- 01 Sep 2015

TL;DR: This study of more than 100 currently existing data journals describes the approaches they promote for data set description, availability, citation, quality, and open access and identifies ways to expand and strengthen the data journals approach as a means to promote data set access and exploitation.

...read moreread less

148

•Proceedings Article•10.1109/ICPC.2013.6613841

Structural information based term weighting in text retrieval for feature location

Blake Bassett, +1 more

- 20 May 2013

TL;DR: This paper studies over 400 bugs and features from five open source Java systems and finds that structural term weighting can cause a statistically significant improvement in the accuracy of the FLT.

...read moreread less

50

•Proceedings Article•10.5555/2337223.2337356

Goldfish bowl panel: software development analytics

Tim Menzies, +1 more

- 02 Jun 2012

TL;DR: This panel will address the open issues with analytics and address the potential and strengths and weaknesses of the current generation of analytics tools.

...read moreread less

15

•Proceedings Article•10.1109/ICPC.2012.6240485

Modeling the ownership of source code topics

Christopher S. Corley, +2 more

- 11 Jun 2012

TL;DR: This paper combines software repository mining and topic modeling to measure the ownership of linguistic topics in source code and finds that classes that belong to the same linguistic topic tend to have similar ownership characteristics, which suggests that conceptually related classes often share the same owner.

...read moreread less

13

...

Expand

References

Lecture Notes in Artificial Intelligence

P. Brezillon, +1 more

- 01 Jan 1999

TL;DR: The topics in LNAI include automated reasoning, automated programming, algorithms, knowledge representation, agent-based systems, intelligent systems, expert systems, machine learning, natural-language processing, machine vision, robotics, search systems, knowledge discovery, data mining, and related programming languages.

...read moreread less

7.5K

Proceedings Article•10.1145/543613.543644

Data integration: a theoretical perspective

Maurizio Lenzerini

- 03 Jun 2002

TL;DR: The tutorial is focused on some of the theoretical issues that are relevant for data integration: modeling a data integration application, processing queries in data integration, dealing with inconsistent data sources, and reasoning on queries.

...read moreread less

2.8K

Book Chapter•10.1007/3-540-44631-1_4

Developing Multiagent Systems with agentTool

Scott A. DeLoach, +1 more

- 07 Jul 2000

TL;DR: MaSE guides a designer from an initial system specification to implementation by guiding the designer through a set of inter-related graphically based system models as envisioned by MaSE.

...read moreread less

1.7K

•Journal Article•10.1109/TSE.2007.10

Data Mining Static Code Attributes to Learn Defect Predictors

Tim Menzies, +2 more

- 01 Jan 2007

- IEEE Transactions on Software Engineerin...

TL;DR: It is shown that static code attributes used to build defect predictors are much more important than which particular attributes are used, and contrary to prior pessimism, they are demonstrably useful and yield predictors with a mean probability of detection and mean false alarms rates.

...read moreread less

1.3K

•Journal Article•10.1109/TSE.2006.3

Advancing candidate link generation for requirements tracing: the study of methods

Jane Huffman Hayes, +2 more

- 01 Jan 2006

- IEEE Transactions on Software Engineerin...

TL;DR: This paper defines goals for a tracing tool based on analyst responsibilities in the tracing process, introduces several new measures for validating that the goals have been satisfied, and presents a prototype tool that is built, RETRO (REquirements TRacing On-target), to address these goals.

...read moreread less

552