UTAR Institutional Repository

Text recognition (OCR) for patient records digitization using CNN

Ong, Zi Leong (2022) Text recognition (OCR) for patient records digitization using CNN. Final Year Project, UTAR.

Download (2670Kb) | Preview


    Optical character recognition (OCR) is widely used to transcribe texts from images in computer vision. Although current OCR methods can accurately transcribe printed text (structured), they often fall short on unstructured or handwritten text recognition. This project proposed a text recognition method to recognize handwritten text on patients' clinical data using a convolutional neural network (CNN). We compiled custom handwriting datasets from MNIST 0-9 and Kaggle A-Z datasets to add more handwriting diversity in training a more robust OCR model. The CNN has 3-convolutional layers to learn high-level features and a dropout layer to prevent overfitting. The preliminary results showed that the proposed model achieved 93.75% classification accuracy while Tesseract (the state-of-the-art OCR) scored 69.79%. The data will be transformed from handwritten text to computer-readable text and then stored in files in xml form for further development.

    Item Type: Final Year Project / Dissertation / Thesis (Final Year Project)
    Subjects: Q Science > Q Science (General)
    T Technology > T Technology (General)
    Divisions: Faculty of Information and Communication Technology > Bachelor of Computer Science (Honours)
    Depositing User: ML Main Library
    Date Deposited: 15 Jan 2023 21:28
    Last Modified: 15 Jan 2023 21:28
    URI: http://eprints.utar.edu.my/id/eprint/4662

    Actions (login required)

    View Item