Sabaragamuwa University of Sri Lanka

Annotate Ease: PDF Metadata Extraction Application Specializing in Research Publications

Show simple item record

dc.contributor.author Nivetha, K. K.
dc.contributor.author Priyasadini, Samanthi
dc.date.accessioned 2025-02-25T09:37:08Z
dc.date.available 2025-02-25T09:37:08Z
dc.date.issued 2025-02-19
dc.identifier.issn 3084-8911
dc.identifier.uri http://repo.lib.sab.ac.lk:8080/xmlui/handle/susl/4868
dc.description.abstract Annotate Ease is an innovative software application designed to address the challenges of extracting metadata from research publications, predominantly in Portable Document Format (PDF). Utilizing advanced artificial intelligence (AI) models, Annotate Ease automates the extraction of critical information such as titles, authors, abstracts, affiliations, and bibliographic data, organizing it into structured JavaScript Object Notation (JSON) files for seamless integration with other research tools and databases. By incorporating Open Researcher and Contributor Identifiers (ORCIDs), the tool enhances the accuracy of author identification and affiliation validation, achieving a tested precision rate of over 95%. Annotate Ease specializes in processing structured PDFs, providing reliable outputs with consistent formatting and significantly reducing the effort required for manual extraction. Although bulk processing is not yet supported, the tool can process up to 10 structured PDFs in approximately 20 minutes, making it well-suited for smaller-scale academic workflows. Future versions aim to enhance throughput, enabling the handling of larger document batches. However, Annotate Ease faces challenges with corrupted or poorly formatted PDFs, which may require preprocessing to ensure optimal performance. Addressing these issues is a priority for ongoing development, along with cost optimization strategies to reduce API-related expenses. Comparative analysis demonstrates that Annotate Ease excels in accuracy, user-friendliness, and efficiency when compared to other metadata extraction tools. Many competing solutions struggle with academic workflows, lack intuitive interfaces, or deliver inconsistent outputs. Annotate Ease’s structured JSON outputs seamlessly integrate with downstream tools such as citation managers and analytics platforms, positioning it as a transformative solution for academic research. Beyond its technical advantages, Annotate Ease saves researchers time by automating repetitive tasks and enabling them to focus on critical analysis and discovery. Future enhancements will aim to expand the tool’s applicability to a broader range of academic documents, such as white papers, theses, and conference proceedings while introducing preprocessing steps to handle variability in document formatting. These improvements will ensure Annotate Ease remains an indispensable resource for researchers, publishers, and database administrators, driving efficiency and innovation in academic workflows. en_US
dc.language.iso en en_US
dc.publisher Faculty of Computing, Sabaragamuwa University of Sri Lanka, P.O. Box 02, Belihuloya, 70140, Sri Lanka. en_US
dc.subject Artificial Intelligence en_US
dc.subject Academic Tools en_US
dc.subject Metadata Extraction en_US
dc.subject ORCID Validation en_US
dc.subject PDF Processing en_US
dc.subject Research Publications. en_US
dc.title Annotate Ease: PDF Metadata Extraction Application Specializing in Research Publications en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account