PRIM versus CART in subgroup discovery: when patience is harmful.

Ameen Abu-Hanna,Barry Nannings,Dave Dongelmans,Arie Hasman

Journal of Biomedical Informatics（2010）

引用 21|浏览0

暂无评分

摘要

We systematically compare the established algorithms CART (Classification and Regression Trees) and PRIM (Patient Rule Induction Method) in a subgroup discovery task on a large real-world high-dimensional clinical database. Contrary to current conjectures, PRIM's performance was generally inferior to CART's. PRIM often considered "peeling of" a large chunk of data at a value of a relevant discrete ordinal variable unattractive, ultimately missing an important subgroup. This finding has considerable significance in clinical medicine where ordinal scores are ubiquitous. PRIM's utility in clinical databases would increase when global information about (ordinal) variables is better put to use and when the search algorithm keeps track of alternative solutions.

查看译文

关键词

real-world high-dimensional clinical database,ordinal scores,clinical databases,established algorithms cart,prim (patient rule induction method),important subgroup,patience,subgroup discovery task,coverage,cart (classification and regression trees),subgroup discovery,high-dimensionality,large chunk,patient rule induction method,regression trees,clinical medicine,ordinal score,bootstrap,search algorithm

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要