The use of domain knowledge in program understanding

Rugaber, Spencer

doi:10.1023/A:1018976708691

The use of domain knowledge in program understanding

Published: May 2000

Volume 9, pages 143–192, (2000)
Cite this article

Annals of Software Engineering

Spencer Rugaber¹

142 Accesses
37 Citations
Explore all metrics

Abstract

Program understanding is an essential part of all software maintenance and enhancement activities. As currently practiced, program understanding consists mainly of code reading. The few automated understanding tools that are actually used in industry provide helpful but relatively shallow information, such as the line numbers on which variable names occur or the calling structure possible among system components. These tools rely on analyses driven by the nature of the programming language used. As such, they are adequate to answer questions concerning implementation details, so called what questions. They are severely limited, however, when trying to relate a system to its purpose or requirements, the why questions. Application programs solve real‐world problems. The part of the world with which a particular application is concerned is that application's domain. A model of an application's domain can serve as a supplement to programming‐language‐based analysis methods and tools. A domain model carries knowledge of domain boundaries, terminology, and possible architectures. This knowledge can help an analyst set expectations for program content. Moreover, a domain model can provide information on how domain concepts are related. This article discusses the role of domain knowledge in program understanding. It presents a method by which domain models, together with the results of programming‐language‐based analyses, can be used to answers both what and why questions. Representing the results of domain‐based program understanding is also important, and a variety of representation techniques are discussed. Although domain‐based understanding can be performed manually, automated tool support can guide discovery, reduce effort, improve consistency, and provide a repository of knowledge useful for downstream activities such as documentation, reengineering, and reuse. A tools framework for domain‐based program understanding, a dowser, is presented in which a variety of tools work together to make use of domain information to facilitate understanding. Experience with domain‐based program understanding methods and tools is presented in the form of a collection of case studies. After the case studies are described, our work on domain‐based program understanding is compared with that of other researchers working in this area. The paper concludes with a discussion of the issues raised by domain‐based understanding and directions for future work.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

ACM (1995), Proceedings ACM SIGPLAN Workshop on Intermediate Representations (IR'95), ACM.
Arango, G. and R. Prieto-Díaz (1991), "Domain Analysis Concepts and Research Directions," In Domain Analysis and Software Systems Modeling, R. Prieto-Díaz and G. Arango, Eds., IEEE Computer Society Press, pp. 9–32.
Arango, G., E. Schoen, and R. Pettengill (1993), "A Process for Consolidating and Reusing Design Knowledge," 15th International Conference on Software Engineering, IEEE Computer Society Press, Baltimore, MD, pp. 231–242.
Chapter Google Scholar
Arthur, L.J. (1988), Software Evolution, Wiley, New York.
Google Scholar
Batory, D. and S. O'Malley (1992), "The Design and Implementation of Hierarchical Software Systems with Reusable Components," ACM Transactions on Software Engineering and Methodology 1, 4, 355–398.
Article Google Scholar
Biggerstaff, T.J. (1989), "Design Recovery for Maintenance and Reuse," IEEE Computer 7, 22, 36–49.
Google Scholar
Biggerstaff, T.J., B.G. Mitbander, and D. Webster (1994), "Program Understanding and the Concept Assignment Problem," Communications of the ACM 37, 5, 72–83.
Article Google Scholar
Boehm, B. (1981), Software Engineering Economics, Prentice-Hall, Englewood Cliffs, NJ.
MATH Google Scholar
Borgida, A., R.J. Brachman, D.L. McGuinness and L.A. Resnick (1989), "CLASSIC: A Structural Data Model for Objects," In Proceedings ACM SIGMOD International Conference on Management of Data.
Brachman, R., D. McGuinness, P. Patel-Schneider, L. Resnick, and A. Borgida (1990), "Living with CLASSIC: When and How to Use a KL-ONE-Like Language," In Principles of Semantic Networks, J. Sowa, Ed., Morgan Kaufmann, San Mateo, CA.
Brooks, R. (1983), "Towards a Theory of the Comprehension of Computer Programs," International Journal of Man-Machine Studies 18, 543–554.
Google Scholar
Caere Corporation (1994), OmniPage Professional Reference Manual, Los Gatos, CA.
Campbell, R.H. (1974), "The Specification of Process Synchronization by Path-Expressions," In Lecture Notes in Computer Science, Vol. 16, Springer-Verlag, pp. 89–102.
Article MATH Google Scholar
Chen, P.P. (1976), "The Entity-Relationship Model-Toward a Unified View of Data," ACM Transactions on Database Systems 1, 1, pp. 9–36.
Article Google Scholar
Chen, Y.F. and C.V. Ramamoorthy (1986), "The C Information Abstractor," In Proceedings COMPASC 86, IEEE, pp. 291–298.
Chikofsky, E.J. and J.H. Cross II (1990), "Reverse Engineering and Design Recovery: A Taxonomy," IEEE Software 7, 1, 13–17.
Article Google Scholar
Clayton, R. and S. Rugaber (1993), "The Representation Problem in Reverse Engineering," In Proceedings of the First Working Conference on Reverse Engineering, pp. 8–16.
Clayton, R., S. Rugaber, L. Taylor, and L. Wills (1997a), "A Case Study of Domain-based Program Understanding," In 5th International Workshop on Program Comprehension, pp. 102–110.
Clayton, R., S. Rugaber, and L. Wills (1997b), "Domain Based Design Documentation and Component Reuse and their Application to a System Evolution Record; Final Report," College of Computing, Georgia Institute of Technology, http://www.cc.gatech.edu/reverse/dare/finalreport/index.html.
Clayton, R., S. Rugaber, and L. Wills (1998a), "Dowsing: A Tools Framework for Domain-Oriented Browsing of Software Artifacts," In Proceedings ASE 99, pp. 204–208.
Clayton, R., S. Rugaber, and L. Wills (1998b), "On the Knowledge Required to Understand a Program," In The Fifth IEEE Working Conference on Reverse Engineering, pp. 69–78.
Cleaveland, J.C. (1988), "Building Application Generators," IEEE Software 5, 4, 25–33.
Article Google Scholar
DeBaud, J.-M. (1994), "From Domain Analysis to Object-Oriented Frameworks, A Reuse Oriented Software Engineering Methodology," Technical Report CIMR TR# 94-04, Center for Information Management Research, Georgia Institute of Technology.
Debaud, J.-M. (1996), "Lessons From a Domain-based Reengineering Effort," In Proceedings of the Third Working Conference on Reverse Engineering, pp. 217–226.
DeBaud, J.-M., B. Moopen, and S. Rugaber (1994), "Domain Analysis and Reverse Engineering," In Proceedings of the Conference on Software Maintenance, pp. 326–335.
DeBaud, J.-M. and S. Rugaber (1995), "A Software Re-engineering Method Using Domain Models," In International Conference on Software Maintenance, pp. 204–213.
Defense Modeling and Simulation Office (1999), "High Level Architecture (HLA)," http:// hla.dmso.mil/
Devambu, P.T. (1992), "GENOA/GENII-A customizable, language-and front-end-independent code analyzer," In Fourteenth International Conference on Software Engineering, pp. 307–319.
Devanbu, P., R.J. Brachman, P.G. Selfridge, and B.W. Ballard (1991), "LaSSIE: A Knowledge-Based Software Information System," Communications of the ACM 34, 5, 35–49.
Google Scholar
Eidbo, M., M. Ammar, R. Clark, R. Clayton, S. Doddapaneni, R. Dodge, M. McCracken, B. Nguyen, W. Roberts, S. Rogers, and S. Rugaber (1993), "Transitioning to the Open Systems Environment (TRANSOPEN) Final Report," Technical Report CIMR-93-01, Center for Information Management Research, Georgia Institute of Technology.
Fjeldstad, R.K. and W.T. Hamlen (1983), "Application Program Maintenance Study: Report to Our Respondents," In Proceedings GUIDE 48, Philadelphia, PA, Tutorial on Software Maintenance, G. Parikh and N. Zvegintozov, Eds., IEEE Computer Society.
Forsythe, G., M. Malcolm, and M. Moler (1977), Computer Methods for Mathematical Computations, Prentice-Hall, Englewood Cliffs, NJ, pp. 161–166.
MATH Google Scholar
Garlan, D. and M. Shaw (1995), Software Architecture: Perspectives on an Emerging Discipline, Prentice-Hall, Englewood Cliffs, NJ.
Google Scholar
Grass, J.E. and Y.-F. Chen (1990), "The C++ Information Abstractor," In 1990 USENIX Conference, pp. 265–277.
Harris, D., H.B. Reubenstein, and A.S. Yeh (1995), "Recognizers for Extracting Architectural Features from Source Code," In Second Working Conference on Reverse Engineering, L. Wills, P. Newcomb, and E. Chikofsky, Eds., IEEE Computer Society Press, pp. 252–261.
Hildreth, H. (1994), "Reverse Engineering Requirements for Process-Control Software," In Proceedings of the Conference on Software Maintenance, pp. 316–325.
Johnson, R.E. and B. Foote (1988), "Designing Reusable Classes," Journal of Object-Oriented Programming1, 2, 22–35.
Google Scholar
Johnson, W.L. and A. Erdem (1997), "Interactive Explanation of Software Systems," In Automated Software Engineering 2, 1, 53–75.
Article MATH Google Scholar
Jones, C.B. (1990), Systematic Software Development Using VDM, Prentice-Hall, Englewood Cliffs, NJ.
MATH Google Scholar
Jullig, R., Y.V. Srinivas, L. Blaine, L.-M. Gilham, A. Goldberg, C. Green, J. McDonald, and R. Waldinger (1995), Specware Languages Manual, Version 1.1, Kestrel Institute.
Loral Federal Systems-Owego (1999), "DSSA-Domain-Specific Software Architectures (DSSA)," Owego, New York, http://www.owego.com/dssa/foils/dssafoils.ps.
Lowry, M., A. Philpot, T. Pressburger, and I. Underwood (1994), "Amphion: Specification-based Programming for Scientific Subroutine Libraries," In SAIRAS'94.
MacDougall, M.H. (1987), Simulating Computer Systems: Techniques and Tools, The MIT Press, Cambridge, MA.
Google Scholar
Moore, M. (1996), "Rule-Based Detection for Reverse Engineering User Interfaces," In Proceedings of the Third Working Conference on Reverse Engineering, IEEE Computer Society Press, pp. 42–48.
Moore, M. and S. Rugaber (1997a), "Using a Knowledge Representation for Understanding Interactive Systems," In Proceedings of the International Workshop on Program Comprehension, pp. 60–67.
Moore, M., and S. Rugaber (1997b), "Domain Analysis for Transformational Reuse," In Proceedings of the Fourth Working Conference on Reverse Engineering, IEEE Computer Society Press, pp. 156–163.
Moore, M., S. Rugaber, and H. Astudillo (1993), "Knowledge Worker Platform Analysis Final Report," Technical Report CIMR-93-02, Center for Information Management Research, College of Computing, Georgia Institute of Technology.
Moore, M., S. Rugaber, and P. Seaver (1994), "Knowledge-based User Interface Migration," In Proceedings of the 1994 International Conference on Software Maintenance, pp. 72–79.
Murphy, G.C., D. Notkin, and K. Sullivan (1995), "Software Reflexion Models: Bridging the Gap Between Source and High-Level Models," In Proceedings of the Third ACM SIGSOFT Symposium on the Foundations of Software Engineering, ACM, pp. 18–28.
Neighbors, J. (1980), Software Construction from Components, PhD Dissertation, ICS Department, University of California at Irvine.
Google Scholar
Neighbors, J.M. (1989), "Draco: A Method for Engineering Reusable Software Components," In Software Reusability/Concepts and Models, Vol. 1,T.J. Biggerstaff and A.J. Perlis, Eds., Addison-Wesley, Reading, MA.
Google Scholar
Ousterhout, J.K. (1994), Tcl and Tk Toolkit, Addison-Wesley, Reading, MA.
MATH Google Scholar
Overton, R.K. et al. (1971), "A Study of the Fundamental Factors Underlying Software Maintenance Problems: Final Report," Corporation for Information Systems Research and Development.
Prieto-Díaz, R. (1989), "Classification of Reusable Modules," In Software Reusability/Concepts and Models, Vol. 1, T.J. Biggerstaff and A.J. Perlis, Eds., Addison-Wesley, Reading, MA, pp. 99-123.
Google Scholar
Prieto-Díaz, R. (1991), "Domain Analysis for Reusability," In Domain Analysis and Software Systems Modeling, R. Prieto-Díaz and G. Arango, Eds., IEEE Computer Society Press, pp. 63-69.
Prieto-Díaz, R. and G. Arango (1991), Domain Analysis and Software Systems Modeling, IEEE Computer Society Press, Los Alamitos, CA.
Google Scholar
Quilici, A. and D.N. Chin (1995), "DECODE: A Cooperative Environment for Reverse-Engineering Legacy Software," In Second Working Conference on Reverse Engineering, L. Wills, P. Newcomb, and E. Chikofsky, Eds., IEEE Computer Society Press, pp. 156-165.
Reasoning Systems Incorporated (1990), Software Refinery Toolkit, Palo Alto, CA.
Resnick, L.A. et al. (1993), CLASSIC Description and Reference Manual for the Common LISP Implementation Version 2.1, AT&T Bell Labs, Murray Hill, NJ.
Google Scholar
Rugaber, S. (1996), "Program Understanding," In Encyclopedia of Computer Science and Technology, Supplement 20, 35, A. Kent and J.G. Williams, Eds., Marcel Dekker, pp. 341-368.
Rugaber, S. (1997), "An Example of Program Understanding," Technical Report GIT-CC-98-14, College of Computing, Georgia Institute of Technology.
Rugaber, S., S.B. Ornburn, and R.J. LeBlanc, Jr. (1990), "Recognizing Design Decisions in Programs," IEEE Software 7, 1, 46-54.
Article Google Scholar
Rugaber, S., K. Stirewalt, and L. Wills (1995a), "Detecting Interleaving," In International Conference on Software Maintenance, pp. 265-274.
Rugaber, S., K. Stirewalt, and L. Wills (1995b), "The Detection and Extraction of Interleaving Code Segments," Technical Report GIT-CC-95-49, College of Computing, Georgia Institute of Technology.
Rugaber, S., K. Stirewalt and L. Wills (1996), "Understanding Interleaved Code," Automated Software Engineering 1-2, 3, 47-76.
Article MathSciNet Google Scholar
Rumbaugh, J., M. Blaha, W. Premerlani, F. Eddy, and W. Lorensen (1991), Object-Oriented Modeling and Design, Prentice-Hall, Englewood Cliffs, NJ.
Google Scholar
Soloway, E., J. Pinto, S. Letovsky, D. Littman, and R. Lampert (1988), "Designing Documentation to Compensate for Delocalized Plans," Communications of the ACM 31, 11, 1259-1267.
Article Google Scholar
Spivey, J.M. (1987), Understanding Z: A Specification Language and Its Formal Semantics, Cambridge University Press.
Srinivas, Y.V. (1991a), "Algebraic Specification of Domains," In Domain Analysis and Software Systems Modeling, R. Prieto-Díaz and G. Arango, Eds., IEEE Computer Society Press, pp. 90-124.
Srinivas, Y.V. (1991b), "Pattern Matching: A Sheaf-Theoretic Approach," PhD Dissertation, Department of Information and Computer Science, University of California at Irvine.
Google Scholar
SUN Microsystems (1994), Browsing Source Code.
Yeh, A., D. Harris, and H. Reubenstein (1995), "Recovering Abstract Data Types and Object Instances from a Conventional Procedural Language," In Proceedings of the Second Working Conference on Reverse Engineering, pp. 227-236.
Zeigler, B.P. (1976), Theory of Modeling and Simulation, Wiley, New York.
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computing, Georgia Institute of Technology, Atlanta, GA, 30332‐0280, USA E-mail:
Spencer Rugaber

Authors

Spencer Rugaber
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rugaber, S. The use of domain knowledge in program understanding. Annals of Software Engineering 9, 143–192 (2000). https://doi.org/10.1023/A:1018976708691

Download citation

Issue Date: May 2000
DOI: https://doi.org/10.1023/A:1018976708691

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The use of domain knowledge in program understanding

Abstract

Access this article

Similar content being viewed by others

Domain Modelling: A Foundation for Software Development

Meaningful Models

Domain-Specific Languages: A Systematic Mapping Study

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The use of domain knowledge in program understanding

Abstract

Access this article

Similar content being viewed by others

Domain Modelling: A Foundation for Software Development

Meaningful Models

Domain-Specific Languages: A Systematic Mapping Study

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation