Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals: an approach and evaluation at Roche Diagnostics GmbH

Mature software systems comprise a vast number of heterogeneous system capabilities which are usually requested by different groups of stakeholders and which evolve over time. Software features describe and bundle low level capabilities logically on an abstract level and thus provide a structured an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Quirchmayr, Thomas (VerfasserIn) , Paech, Barbara (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: 19 February 2018
In: Empirical software engineering
Year: 2018, Jahrgang: 23, Heft: 6, Pages: 3630-3683
ISSN:1573-7616
DOI:10.1007/s10664-018-9597-6
Online-Zugang:Verlag, Volltext: http://dx.doi.org/10.1007/s10664-018-9597-6
Verlag, Volltext: https://link.springer.com/article/10.1007/s10664-018-9597-6
Volltext
Verfasserangaben:Thomas Quirchmayr, Barbara Paech, Roland Kohl, Hannes Karey, Gunar Kasdepke

MARC

LEADER 00000caa a2200000 c 4500
001 1577716418
003 DE-627
005 20220814193815.0
007 cr uuu---uuuuu
008 180718s2018 xx |||||o 00| ||eng c
024 7 |a 10.1007/s10664-018-9597-6  |2 doi 
035 |a (DE-627)1577716418 
035 |a (DE-576)507716418 
035 |a (DE-599)BSZ507716418 
035 |a (OCoLC)1341013911 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 28  |2 sdnb 
100 1 |a Quirchmayr, Thomas  |d 1984-  |e VerfasserIn  |0 (DE-588)1162970774  |0 (DE-627)1027063829  |0 (DE-576)507716647  |4 aut 
245 1 0 |a Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals  |b an approach and evaluation at Roche Diagnostics GmbH  |c Thomas Quirchmayr, Barbara Paech, Roland Kohl, Hannes Karey, Gunar Kasdepke 
264 1 |c 19 February 2018 
300 |a 54 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Published online: 19 February 2018 
500 |a Gesehen am 15.04.2019 
520 |a Mature software systems comprise a vast number of heterogeneous system capabilities which are usually requested by different groups of stakeholders and which evolve over time. Software features describe and bundle low level capabilities logically on an abstract level and thus provide a structured and comprehensive overview of the entire capabilities of a software system. Software features are often not explicitly managed. Quite the contrary, feature-relevant information is often spread across several software engineering artifacts (e.g., user manual, issue tracking systems). It requires huge manual effort to identify and extract feature-relevant information from these artifacts in order to make feature knowledge explicit. In this paper we present a two-step-approach to extract feature-relevant information from a user manual: First we semi-automatically extract a domain terminology from a natural language user manual based on linguistic patterns. Then, we apply natural language processing techniques based on the extracted domain terminology and structural sentence information. Our approach is able to extract atomic feature-relevant information with an F1-score of at least 92.00%. We describe the implementation of the approach as well as evaluations based on example sections of a user manual taken from industry. 
700 1 |a Paech, Barbara  |d 1959-  |e VerfasserIn  |0 (DE-588)172299799  |0 (DE-627)697208648  |0 (DE-576)133166821  |4 aut 
773 0 8 |i Enthalten in  |t Empirical software engineering  |d Dordrecht [u.a.] : Springer Science + Business Media B.V, 1996  |g 23(2018), 6, Seite 3630-3683  |h Online-Ressource  |w (DE-627)271350032  |w (DE-600)1479898-0  |w (DE-576)110350596  |x 1573-7616  |7 nnas  |a Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals an approach and evaluation at Roche Diagnostics GmbH 
773 1 8 |g volume:23  |g year:2018  |g number:6  |g pages:3630-3683  |g extent:54  |a Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals an approach and evaluation at Roche Diagnostics GmbH 
856 4 0 |u http://dx.doi.org/10.1007/s10664-018-9597-6  |x Verlag  |x Resolving-System  |3 Volltext 
856 4 0 |u https://link.springer.com/article/10.1007/s10664-018-9597-6  |x Verlag  |3 Volltext 
951 |a AR 
992 |a 20180718 
993 |a Article 
994 |a 2018 
998 |g 172299799  |a Paech, Barbara  |m 172299799:Paech, Barbara  |d 110000  |d 110300  |e 110000PP172299799  |e 110300PP172299799  |k 0/110000/  |k 1/110000/110300/  |p 2 
998 |g 1162970774  |a Quirchmayr, Thomas  |m 1162970774:Quirchmayr, Thomas  |d 110000  |d 110300  |e 110000PQ1162970774  |e 110300PQ1162970774  |k 0/110000/  |k 1/110000/110300/  |p 1  |x j 
999 |a KXP-PPN1577716418  |e 3018250990 
BIB |a Y 
SER |a journal 
JSO |a {"relHost":[{"origin":[{"publisherPlace":"Dordrecht [u.a.] ; Dordrecht [u.a.]","publisher":"Springer Science + Business Media B.V ; Kluwer","dateIssuedKey":"1996","dateIssuedDisp":"1996-"}],"part":{"pages":"3630-3683","volume":"23","issue":"6","text":"23(2018), 6, Seite 3630-3683","extent":"54","year":"2018"},"id":{"zdb":["1479898-0"],"issn":["1573-7616"],"eki":["271350032"]},"language":["eng"],"recId":"271350032","physDesc":[{"extent":"Online-Ressource"}],"note":["Gesehen am 01.11.05"],"disp":"Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals an approach and evaluation at Roche Diagnostics GmbHEmpirical software engineering","type":{"bibl":"periodical","media":"Online-Ressource"},"pubHistory":["1.1996 -"],"title":[{"subtitle":"an international journal","title_sort":"Empirical software engineering","title":"Empirical software engineering"}]}],"origin":[{"dateIssuedKey":"2018","dateIssuedDisp":"19 February 2018"}],"id":{"eki":["1577716418"],"doi":["10.1007/s10664-018-9597-6"]},"person":[{"given":"Thomas","family":"Quirchmayr","roleDisplay":"VerfasserIn","role":"aut","display":"Quirchmayr, Thomas"},{"role":"aut","roleDisplay":"VerfasserIn","display":"Paech, Barbara","given":"Barbara","family":"Paech"}],"note":["Published online: 19 February 2018","Gesehen am 15.04.2019"],"recId":"1577716418","language":["eng"],"physDesc":[{"extent":"54 S."}],"type":{"bibl":"article-journal","media":"Online-Ressource"},"name":{"displayForm":["Thomas Quirchmayr, Barbara Paech, Roland Kohl, Hannes Karey, Gunar Kasdepke"]},"title":[{"title_sort":"Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals","title":"Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals","subtitle":"an approach and evaluation at Roche Diagnostics GmbH"}]} 
SRT |a QUIRCHMAYRSEMIAUTOMA1920