Enabling Analysis and Measurement of Conventional Software Development Documents Using Project-Specific Formalism

Software Measurement, 2011 Joint Conference of the 21st Int'l Workshop and 6th Int'l Conference Software Process and Product Measurement(2011)

引用 2|浏览0
暂无评分
摘要
We describe a new approach to modeling and analyzing software development documents that are typically written using conventional office applications. Our approach brings automation to content extraction, quality checking and measurement of massive document artifacts that tend to be handled by labor-intensive manual work in industry today. Rather than seeking an approach based on creation or rewriting of contents using more rigid, machine-friendly representations such as standardized formal models and restricted languages, we provide a method to deal with the diversity of document artifacts by making use of project-specific formalism that exists in target documents. We demonstrate that such project-specific formalism often tends to "naturally" exist at syntactic levels, and it is possible to define a "document model", a logical data representation gained by transformation rule from the physical, syntactic structure to the logical, semantic structure. With this transformation, various quality checking rules for completeness, consistency, traceability, etc., are realized by evaluating constraints for data items in the logical structure, and measurement of these quality aspects is automated. We developed a tool to allow a user to easily define document models and checking rules, and provide the insights on transformations when defining document models for various industry specification documents written in word processor files, spreadsheets and presentations. We also demonstrate the use of natural language processing can improve document modeling and quality checking by compensating for a weakness of formalism and applying analysis to specific parts of the target documents.
更多
查看译文
关键词
massive document artifact,document artifact,document model,target document,project-specific formalism,enabling analysis,defining document model,conventional software development documents,various industry specification document,quality checking,software development document,document modeling,programming,text analysis,natural language processing,data representation,software development,data models,formal verification,business,data model,software quality,estimation,text analytics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要