Embedding Metadata and Other Semantics in Word Processing Documents

Peter Sefton; Ian Barnes; Ron Ward; Jim Downing

doi:10.2218/ijdc.v4i2.96

Associating Natural Language Comment and Source Code Entities

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6382 ◽

2020 ◽

Vol 34 (05) ◽

pp. 8592-8599

Author(s):

Sheena Panthaplackel ◽

Milos Gligoric ◽

Raymond J. Mooney ◽

Junyi Jessy Li

Keyword(s):

Software Development ◽

Natural Language ◽

Open Source ◽

Source Code ◽

Initial Step ◽

Binary Classifier ◽

Sequence Labeling ◽

Evaluation Dataset ◽

Revision Histories

Comments are an integral part of software development; they are natural language descriptions associated with source code elements. Understanding explicit associations can be useful in improving code comprehensibility and maintaining the consistency between code and comments. As an initial step towards this larger goal, we address the task of associating entities in Javadoc comments with elements in Java source code. We propose an approach for automatically extracting supervised data using revision histories of open source projects and present a manually annotated evaluation dataset for this task. We develop a binary classifier and a sequence labeling model by crafting a rich feature set which encompasses various aspects of code, comments, and the relationships between them. Experiments show that our systems outperform several baselines learning from the proposed supervision.

Download Full-text

Key Concepts and Definitions of Open Source Communities

Encyclopedia of Networked and Virtual Organizations ◽

10.4018/978-1-59904-885-7.ch099 ◽

2010 ◽

pp. 753-760

Author(s):

Ruben van Wendel de Joode ◽

Sebastian Spaeth

Keyword(s):

Software Development ◽

Open Source ◽

Open Source Software ◽

Online Communities ◽

Source Code ◽

Professional Organizations ◽

Large Numbers ◽

Key Concepts ◽

Open Source Communities ◽

Do So

Most open source software is developed in online communities. These communities are typically referred to as “open source software communities” or “OSS communities.” In OSS communities, the source code, which is the human-readable part of software, is treated as something that is open and that should be downloadable and modifiable to anyone who wishes to do so. The availability of the source code has enabled a practice of decentralized software development in which large numbers of people contribute time and effort. Communities like Linux and Apache, for instance, have been able to connect thousands of individual programmers and professional organizations (although most project communities remain relatively small). These people and organizations are not confined to certain geographical places; on the contrary, they come from literally all continents and they interact and collaborate virtually.

Download Full-text

Logging Analysis and Prediction in Open Source Java Project

Research Anthology on Usage and Development of Open Source Software ◽

10.4018/978-1-7998-9158-1.ch038 ◽

2021 ◽

pp. 733-761

Author(s):

Sangeeta Lal ◽

Neetu Sardana ◽

Ashish Sureka

Keyword(s):

Machine Learning ◽

Content Analysis ◽

Software Development ◽

Anomaly Detection ◽

Open Source ◽

Large Scale ◽

Source Code ◽

Scale Analysis ◽

Large Scale Analysis ◽

Research Questions

Log statements present in source code provide important information to the software developers because they are useful in various software development activities such as debugging, anomaly detection, and remote issue resolution. Most of the previous studies on logging analysis and prediction provide insights and results after analyzing only a few code constructs. In this chapter, the authors perform an in-depth, focused, and large-scale analysis of logging code constructs at two levels: the file level and catch-blocks level. They answer several research questions related to statistical and content analysis. Statistical and content analysis reveals the presence of differentiating properties among logged and nonlogged code constructs. Based on these findings, the authors propose a machine-learning-based model for catch-blocks logging prediction. The machine-learning-based model is found to be effective in catch-blocks logging prediction.

Download Full-text

Integrating Projects from Multiple Open Source Code Forges

Database Technologies ◽

10.4018/978-1-60566-058-5.ch141 ◽

2009 ◽

pp. 2301-2312

Author(s):

Megan Squire

Keyword(s):

Software Development ◽

Open Source ◽

Relevant Literature ◽

Source Code ◽

Scoring Systems ◽

Open Source Code ◽

Multiple Code ◽

Future Work

Much of the data about free, libre, and open source (FLOSS) software development comes from studies of code forges or code repositories used for managing projects. This paper presents a method for integrating data about open source projects by way of matching projects (entities) across multiple code forges. After a review of the relevant literature, a few of the methods are chosen and applied to the FLOSS domain, including a comparison of some simple scoring systems for pairwise project matches. Finally, the paper describes limitations of this approach and recommendations for future work.

Download Full-text

Use of Free and Open-Source Software (FOSS) in the U.S. Department of Defense

Terry's Archive Online ◽

10.48034/20030102 ◽

2003 ◽

Vol 2003 (01) ◽

pp. 0102

Author(s):

Terry Bollinger

Keyword(s):

Software Development ◽

Open Source ◽

Open Source Software ◽

Department Of Defense ◽

Low Cost ◽

Source Code ◽

Leading Edge ◽

Cyber Attacks ◽

Software Analysis ◽

The U.S

This report documents the results of a study by The MITRE Corporation on the use of free and open-source software (FOSS) in the U.S. Department of Defense (DoD). FOSS gives users the right to run, copy, distribute, study, change, and improve it as they see fit, without asking permission or making fiscal payments to any external group or person. The study showed that FOSS provides substantial benefits to DoD security, infrastructure support, software development, and research. Given the openness of its source code, the finding that FOSS profoundly benefits security was both counterintuitive and instructive. Banning FOSS in DoD would remove access to exceptionally well-verified infrastructure components such as OpenBSD and robust network and software analysis tools needed to detect and respond to cyber-attacks. Finally, losing the hands-on source code accessibility of FOSS source code would reduce DoD’s ability to respond rapidly to cyberattacks. In short, banning FOSS would have immediate, broad, and strongly negative impacts on the DoD’s ability to defend the U.S. against cyberattacks. For infrastructure support, the deep historical ties between FOSS and the emergence of the Internet mean that removing FOSS applications would strongly negatively impact the DoD’s ability to support web and Internet-based applications. Software development would be hit especially hard due to many leading-edge and broadly used tools being FOSS. Finally, the loss of access to low-cost data processing tools and the inability to share results in the more potent form of executable FOSS software would seriously and negatively impact nearly all forms of scientific and data-driven research.

Download Full-text

Semantic Web Support for Open-Source Software Development

2008 IEEE International Conference on Signal Image Technology and Internet Based Systems ◽

10.1109/sitis.2008.114 ◽

2008 ◽

Cited By ~ 5

Author(s):

Tharam S. Dillon ◽

Gregory Simmons

Keyword(s):

Semantic Web ◽

Software Development ◽

Open Source ◽

Open Source Software ◽

Open Source Software Development

Download Full-text

Logging Analysis and Prediction in Open Source Java Project

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Optimizing Contemporary Application and Processes in Open Source Software ◽

10.4018/978-1-5225-5314-4.ch003 ◽

2018 ◽

pp. 57-85

Author(s):

Sangeeta Lal ◽

Neetu Sardana ◽

Ashish Sureka

Keyword(s):

Machine Learning ◽

Content Analysis ◽

Software Development ◽

Anomaly Detection ◽

Open Source ◽

Large Scale ◽

Source Code ◽

Scale Analysis ◽

Large Scale Analysis ◽

Research Questions

Log statements present in source code provide important information to the software developers because they are useful in various software development activities such as debugging, anomaly detection, and remote issue resolution. Most of the previous studies on logging analysis and prediction provide insights and results after analyzing only a few code constructs. In this chapter, the authors perform an in-depth, focused, and large-scale analysis of logging code constructs at two levels: the file level and catch-blocks level. They answer several research questions related to statistical and content analysis. Statistical and content analysis reveals the presence of differentiating properties among logged and nonlogged code constructs. Based on these findings, the authors propose a machine-learning-based model for catch-blocks logging prediction. The machine-learning-based model is found to be effective in catch-blocks logging prediction.

Download Full-text

The relevance of Open Source to hydroinformatics

Journal of Hydroinformatics ◽

10.2166/hydro.2002.0022 ◽

2002 ◽

Vol 4 (4) ◽

pp. 219-234 ◽

Cited By ~ 14

Author(s):

Hamish Harvey ◽

Dawei Han

Keyword(s):

Operating System ◽

Software Development ◽

Open Source ◽

Open Source Software ◽

Rapid Development ◽

Source Code ◽

Web Server ◽

High Profile ◽

History Of ◽

Closed Approach

Open Source, in which the source code to software is freely shared and improved upon, has recently risen to prominence as an alternative to the more usual closed approach to software development. A number of high profile projects, such as the Linux operating system kernel and the Apache web server, have demonstrated that Open Source can be technically effective, and companies such as Cygnus Solutions (now owned by Red Hat) and Zope Corporation have demonstrated that it is possible to build successful companies around open source software. Open Source could have significant benefits for hydroinformatics, encouraging widespread interoperability and rapid development. In this paper we present a brief history of Open Source, a summary of some reasons for its effectiveness, and we explore how and why Open Source is of particular interest in the field of hydroinformatics. We argue that for technical, scientific and business reasons, Open Source has a lot to offer.

Download Full-text

Integrating Projects from Multiple Open Source Code Forges

Multi-Disciplinary Advancement in Open Source Software and Processes ◽

10.4018/978-1-60960-513-1.ch003 ◽

2011 ◽

pp. 43-53

Author(s):

Megan Squire

Keyword(s):

Software Development ◽

Open Source ◽

Relevant Literature ◽

Source Code ◽

Scoring Systems ◽

Open Source Code ◽

Multiple Code ◽

Future Work

Much of the data about free, libre, and open source (FLOSS) software development comes from studies of code forges or code repositories used for managing projects. This paper presents a method for integrating data about open source projects by way of matching projects (entities) across multiple code forges. After a review of the relevant literature, a few of the methods are chosen and applied to the FLOSS domain, including a comparison of some simple scoring systems for pairwise project matches. Finally, the paper describes limitations of this approach and recommendations for future work.

Download Full-text

FEATURES OF EVOLUTION OF LOWLEVEL SOFTWARE DEVELOPMENT TOOLS WITH OPEN SOURCE CODE

PROCESSING, TRANSMISSION AND PROTECTION OF INFORMATION IN COMPUTER SYSTEMS ◽

10.31799/978-5-8088-1452-3-2020-1-108-112 ◽

2020 ◽

Author(s):

S. V. Schyokin ◽

Keyword(s):

Software Development ◽

Open Source ◽

Source Code ◽

Open Source Code ◽

Development Tools ◽

Software Development Tools

Download Full-text