Are Abstracts Really the Abstracts: Alignment in Academic Research Papers and Summarizing Medical Research Paper With Deep Learning

dc.contributor.advisorMohammad Ashrafuzzaman Khan
dc.contributor.authorTanvir Khan
dc.contributor.id1911481642
dc.coverage.departmentElectrical and Computer Engineering
dc.date.accessioned2024-05-20
dc.date.accessioned2024-05-20T05:01:14Z
dc.date.available2024-05-20T05:01:14Z
dc.date.issued2023
dc.description.abstractThis research project investigates the degree of alignment between academic research and its summarizing abstracts in the medical domain, employing deep learning models for abstractive summarization. Abstractive summarization is a technique that generates summaries of texts using natural language generation rather than extracting sentences from the original texts. However, most existing studies on abstractive summarization focus on news articles or short stories, and few have explored its application to scientific literature, especially medical research papers. We used two state-of-the-art deep learning models, bert2bert and roberta2roberta, which are based on the encoder-decoder architecture and use pre-trained language models as encoders and decoders. We trained our models on a dataset of full-text papers and their abstracts from CORD-19 and other sources, which provided access to a large and diverse collection of medical research papers. We evaluated the performance of our models using ROUGE and METEOR scores, which are commonly used metrics for measuring the quality of summaries. We found that roberta2roberta outperformed bert2bert on both metrics, achieving a ROUGE score of 0.50 and a METEOR score of 0.48. We also compared the generated summaries with the original abstracts and performed a qualitative analysis of their differences and similarities. We found that the abstracts in the papers most often suffer from some common errors and limitations, such as repetition, inconsistency, omission, or exaggeration. We identified some potential benefits and challenges of applying abstractive summarization to scientific literature, such as improving accessibility and readability of research papers, reducing information overload, enhancing scientific communication, promoting interdisciplinary research, ensuring ethical and responsible use of AI, protecting intellectual property rights, preserving data quality and integrity, ensuring data privacy and security, reducing environmental footprint, etc. We concluded that abstracts don’t always reflect the whole paper through abstractive summarization of medical research papers, but further research and development are still required to overcome its challenges and limitations.
dc.description.degreeUndergraduate
dc.identifier.cd600000346
dc.identifier.print-thesisTo be assigned
dc.identifier.urihttps://repository.northsouth.edu/handle/123456789/769
dc.language.isoen
dc.publisherNorth South University
dc.rights©NSUlibrary
dc.titleAre Abstracts Really the Abstracts: Alignment in Academic Research Papers and Summarizing Medical Research Paper With Deep Learning
dc.typeProject
oaire.citation.endPage38
oaire.citation.startPage1
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
600000346-Abstract.pdf
Size:
97.89 KB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
600000346.pdf
Size:
555.91 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.93 KB
Format:
Item-specific license agreed to upon submission
Description: