Medical Prescription Recognition Using Heuristic Clustering and Similarity Search.

Ngoc-Thao Nguyen, Hieu Vo,Khanh Tran, Duy Ha,Duc Nguyen,Thanh Le

International Conference on Computational Collective Intelligence (ICCCI)(2022)

引用 0|浏览0
暂无评分
摘要
The necessity to convert printed documents to facilitate the storage and retrieval of information is growing, particularly in the medical and healthcare industries. In our last work, we presented a method to extract prescriptions from images using CRAFT and TESSERACT so that patients could quickly save and check up on their pharmaceutical use information. However, the slow processing speed and the limited number of medication names lead to it being impractical. Based on this model structure, a new system is introduced, using bounding box clustering heuristics to detect the featured text areas, before employing VietOCR tool to identify the texts in prescription images. Simultaneously, a fast and accurate technique for extracting prescriptions is developed, utilizing word embedding and the vector search algorithm. The experiment results reveal that the proposed model significantly reduces the error of the retrieved data on the two standard measures, WER and CER, prominently with CER lowered to 26.95. Furthermore, the execution time decreases from 17.81 s to an average of 3.64 s, demonstrating the great effectiveness of our effort to improve the prior system.
更多
查看译文
关键词
Document layout analysis,Optical character recognition,Word embedding,Approximate nearest neighbor
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要