Exploring adjectival modification in biomedical discourse across two genres

Olivier Bodenreider, Serguei V. Pakhomov
2003 Proceedings of the ACL 2003 workshop on Natural language processing in biomedicine -  
Objectives: To explore the phenomenon of adjectival modification in biomedical discourse across two genres: the biomedical literature and patient records. Methods: Adjectival modifiers are removed from phrases extracted from two corpora (three million noun phrases extracted from MEDLINE, on the one hand, and clinical notes from the Mayo Clinic, on the other). The original phrases, the adjectives extracted, and the resulting demodified phrases are compared across the two corpora after
more » ... on. Quantitative comparisons (frequency of occurrence) are performed on the whole domain. Qualitative comparisons are performed on the two subdomains (disorders and procedures). Results: Although the average number of adjectives per phrase is equivalent in the two corpora (1.4), there are more adjective types in MAYO than in MEDLINE for disorders and procedures. For disorder phrases, the 38% of adjective types common to the two corpora account for 85% of the occurrences. The predominance of adjectives in one corpus is analyzed. Discussion: Potential applications of this approach are discussed, namely terminology acquisition, information retrieval, and genre characterization.
doi:10.3115/1118958.1118972 dblp:conf/bionlp/BodenreiderP03 fatcat:3loti25i5ndzfmuwqaseo6w5zy