Proceedings Article10.1145/1985429.1985436
Towards sharing source code facts using linked data
Iman Keivanloo,Christopher Forbes,Juergen Rilling,Philippe Charland +3 more
- 28 May 2011
- pp 25-28
31
TL;DR: The Source code ECOsystem Linked Data (SECOLD) framework provides not only source code and facts that are usable by both humans and machines for browsing or querying, but it will also assist the research community at large in sharing and utilizing a standardized source code representation.
read more
Abstract: Linked Data is designed to support interoperability and sharing of open datasets by allowing on the fly inter-linking of data using the basic layers of the Semantic Web and the HTTP protocol. In our research, we focus on providing a Uniform Resource Locator (URL) generation schema and a supporting ontological representation for the inter-linking of data extracted from source code ecosystems. As a result, we created the Source code ECOsystem Linked Data (SECOLD) framework that adheres to the Linked Data publication standard. The framework provides not only source code and facts that are usable by both humans and machines for browsing or querying, but it will also assist the research community at large in sharing and utilizing a standardized source code representation. The dataset has been submitted and registered to ckan.net, under the SECOLD project name, as the first source code Linked Data repository. In order to maintain its relevance to the research community, we plan to update the data set every four months.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
SeClone - A Hybrid Approach to Internet-Scale Real-Time Code Clone Search
Iman Keivanloo,Juergen Rilling,Philippe Charland +2 more
- 22 Jun 2011
TL;DR: This research presents a hybrid clone search approach using source code pattern indexing, information retrieval clustering, and Semantic Web reasoning to respectively achieve short response time, handle false positives, and support automated grouping/querying.
28
Evaluating Software Product Quality: A Systematic Mapping Study
Sofia Ouhbi,Ali Idri,José Luis Fernández Alemán,Ambrosio Toval +3 more
- 06 Oct 2014
TL;DR: A systematic mapping study was performed to summarize the existing SPQ evaluation (SPQE) approaches in literature and to classify the selected studies according to seven classification criteria: SPQE approaches, research types, empirical types, data sets used in the empirical evaluation of these studies, artifacts, SQ models, and SQ characteristics.
Similarity search plug-in: Clone detection meets internet-scale code search
Iman Keivanloo,Christopher Forbes,Juergen Rilling +2 more
- 05 Jun 2012
TL;DR: An Eclipse plug-in is presented that provides source code similarity search over source code available on the Internet that can provide the enabling technology for an open Internet-scale similarity search service.
16
Predicting Software Product Quality: A Systematic Mapping Study
TL;DR: A systematic mapping study was performed to summarize the existing SPQ prediction (SPQP) approaches in literature and to organize selected studies according to seven classification criteria: SPQP approaches, research types, empirical types, data sets used in the empirical evaluation of the studies, artifacts, SQ models, and SQ characteristics.
12
References
Enabling Tailored Therapeutics with Linked Data
Anja Jentzsch,Bo Andersson,Oktie Hassanzadeh,Susie Stephens,Christian Bizer +4 more
- 01 Jan 2009
TL;DR: The applicability and potential benefits of using Linked Data to connect drug and clinical trials related data sources are examined and an overview of ongoing work within the W3C's Semantic Web for Health Care and Life Sciences Interest Group on publishing drug related data sets on the Web and interlinking them with existing Linked data sources is given.
SourcererDB: An aggregated repository of statically analyzed and cross-linked open source Java projects
Joel Ossher,Sushil Bajracharya,Erik Linstead,Pierre Baldi,Cristina V. Lopes +4 more
- 16 May 2009
TL;DR: The goal in building SourcererDB is to provide a rich dataset of source code to facilitate the sharing of extracted data and to encourage reuse and repeatability of experiments.
55
•Posted Content
Publishing Math Lecture Notes as Linked Data
Catalin David,Michael Kohlhase,Christoph Lange,Florian Rabe,Nikita Zhiltsov,Vyacheslav Zholudev +5 more
TL;DR: This work marks up a corpus of lecture notes semantically and exposes them as Linked Data in XHTML+MathML+RDFa, and makes the resulting documents interactively browsable for students.
Publishing math lecture notes as linked data
Catalin David,Michael Kohlhase,Christoph Lange,Florian Rabe,Nikita Zhiltsov,Vyacheslav Zholudev +5 more
- 30 May 2010
TL;DR: The authors mark up a corpus of lecture notes semantically and expose them as Linked Data in XHTML+MathML+RDFa, making the resulting documents interactively browsable for students.
Semantic Web-based Source Code Search
Iman Keivanloo,Laleh Roostapour,Philipp Schugerl,Juergen Rilling +3 more
- 01 Jan 2010
TL;DR: A Semantic Web-based approach to source code search that uses ontologies to model and connect source code fragments extracted from repositories on the Internet and allows us to reason and search across project boundaries while dealing with incomplete knowledge and ambiguities is presented.
17