Identifying Objects Using Cluster And Concept Analysis

Program-Transformation.Org: The Program Transformation Wiki
Identifying Objects using Cluster and Concept Analysis

ArieVanDeursen and TobiasKuipers.

In 21st International Conference on Software Engineering, ICSE-99, 1999, ACM, pp 246-255.


Many approaches to support (semi-automatic) identification of objects in LegacySystems take the data structures as starting point for candidate classes. Unfortunately, legacy data structures tend to grow over time, and may contain many unrelated fields at the time of migration. We propose a method for ObjectIdentification by semi-automatically restructuring the legacy data structures. Issues involved include the selection of record fields of interest, the identification of procedures actually dealing with such fields, and the construction of coherent groups of fields and procedures into candidate classes. We explore the use of ClusterAnalysis and ConceptAnalysis for the purpose of object identification, and we illustrate their effect on a 100,000 lines of code COBOL system. Furthermore, we use these results to contrast clustering with concept analysis techniques.