IMPROVING DOCUMENT UNDERSTANDING OF MEDICAL INVOICES USING MULTIMODAL APPROACH: A COMPARATIVE STUDY

dc.contributor.authorMahendratama, Abyakta Nadhif
dc.contributor.authorIpung, Heru Purnomo
dc.contributor.authorTaqwim, Andi Darma
dc.date.accessioned2026-05-21T08:44:24Z
dc.date.issued2025-08-14
dc.description.abstractThis thesis presents a comprehensive comparative study on document understanding of medical invoices using both OCR and advanced multimodal models. A stratified dataset of 186 real-world Indonesian medical invoices, encompassing diverse forms of visual degradation, was manually annotated at the field level to ensure robust and representative evaluation. The research investigates the effectiveness of OCR systems, advanced multimodal models, and the combination of both systems in extracting structured information from preprocessed invoice images. Standard image preprocessing techniques were applied to all samples prior to evaluation. Performance was quantitatively assessed using established metrics, including precision, recall, and F1-score. The results demonstrate that multimodal models consistently outperform OCR systems on visually degraded invoices. This work offers practical insights for deploying robust automated document understanding solutions in claims processing, highlighting the advantages of integrating preprocessing with modern multimodal models for real-world, domain-specific applications.
dc.identifier.urihttps://dspace-repository.sgu.ac.id/handle/123456789/201
dc.language.isoen
dc.publisherSwiss German University
dc.subjectDocument understanding
dc.subjectOptical Character Recognition
dc.subjectmultimodal models
dc.subjectmedical invoices
dc.subjectGemini
dc.subjectstructured data extraction
dc.subjectpreprocessing
dc.subjectMistral OCR
dc.subjectPixtral
dc.subjectQwen 2.5-VL
dc.subjectPaddleOCR
dc.titleIMPROVING DOCUMENT UNDERSTANDING OF MEDICAL INVOICES USING MULTIMODAL APPROACH: A COMPARATIVE STUDY
dc.typeThesis

Files

Original bundle

Now showing 1 - 5 of 6
Loading...
Thumbnail Image
Name:
COVER.pdf
Size:
351.91 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
CHAPTER 1.pdf
Size:
93.67 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
CHAPTER 2.pdf
Size:
189.65 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
CHAPTER 3.pdf
Size:
193.74 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
CHAPTER 4.pdf
Size:
144.92 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections