ACM Home Page
Please provide us with feedback. Feedback
Investigating data quality problems in the PSP
Full text PdfPdf (1.20 MB)
Source Foundations of Software Engineering archive
Proceedings of the 6th ACM SIGSOFT international symposium on Foundations of software engineering table of contents
Lake Buena Vista, Florida, United States
Pages: 143 - 152  
Year of Publication: 1998
ISBN:1-58113-108-9
Also published in ...
Authors
Anne M. Disney  Dept. of Information and Computer Sciences, University of Hawaii, Honolulu, HI
Philip M. Johnson  Dept. of Information and Computer Sciences, University of Hawaii, Honolulu, HI
Sponsors
SIGSOFT: ACM Special Interest Group on Software Engineering
SIGPLAN: ACM Special Interest Group on Programming Languages
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 84,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/288195.288292
What is a DOI?

ABSTRACT

The Personal Software Process (PSP) is used by software engineers to gather and analyze data about their work. Published studies typically use data collected using the PSP to draw quantitative conclusions about its impact upon programmer behavior and product quality. However, our experience using PSP in both industrial and academic settings revealed problems both in collection of data and its later analysis. We hypothesized that these two kinds of data quality problems could make a significant impact upon the value of PSP measures. To test this hypothesis, we built a tool to automate the PSP and then examined 89 projects completed by ten subjects using the PSP manually in an educational setting. We discovered 1539 primary errors and categorized them by type, subtype, severity, and age. To examine the collection problem we looked at the 90 errors that represented impossible combinations of data and at other less concrete anomalies in Time Recording Logs and Defect Recording Logs. To examine the analysis problem we developed a rule set, corrected the errors as far as possible, and compared the original and corrected data. This resulted in significant differences for measures such as yield and the cost-performance ratio, confirming our hypothesis. Our results raise questions about the accuracy of manually collected and analyzed PSP data, indicate that integrated tool support may be required for high quality PSP data analysis, and suggest that external measures should be used when attempting to evaluate the impact of the PSP upon programmer behavior and product quality.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Progress Software (Data Language Corporation). Information is available at: www.progress.comlcore/develop.htm.
 
3
George A. Ferguson and Yoshio Takane. Statistical Analysis In Psychology And Education. McGraw-Hill Book Company, 6th edition, 1989.
 
4
 
5
 
6
Will Hayes and James W. Over. The Personal Software Process (PSP): An empirical study of the impact of PSP on individual engineers. Technical Report CMU/SEI-97-TR-001, Software Eng. Inst., Pittsburgh, 1997.
 
7
Joel Henry. Personal software process studio. http://wwwcs.etsu.edu/softeng/psp/, 1997.
 
8
 
9
 
10
 
11
 
12

CITED BY  15

Collaborative Colleagues:
Anne M. Disney: colleagues
Philip M. Johnson: colleagues