Generate A Textual Description Based on An Image
dc.contributor.advisor | Dr. Mohammad Ashrafuzzaman Khan | |
dc.contributor.author | Shah Alvi Hossain | |
dc.contributor.author | Md. Tahmidur Rahman | |
dc.contributor.id | 1721427042 | |
dc.contributor.id | 1721370042 | |
dc.coverage.department | Electrical and Computer Engineering | |
dc.date.accessioned | 2024-05-09 | |
dc.date.accessioned | 2024-05-09T04:39:23Z | |
dc.date.available | 2024-05-09T04:39:23Z | |
dc.date.issued | 2022 | |
dc.description.abstract | The goal of this project is to have a computer detect what is happening in an image and provide a general description of it. Our approach is to build a dataset of images with in-depth descriptions and use it to train a model that identifies the objects in a picture and describes the relations between them. The dataset is built from web images, each paired with a near-accurate description. We collected the images and descriptions from authentic news websites and stored them in Google Sheets and on GitHub. Grammar-checking tools were used to check the grammar of the descriptions and suggest better wording. We used an encoder-decoder system: a pre-trained Convolutional Neural Network (VGG16) encodes the image into a hidden state, and an LSTM then decodes this hidden state to generate a caption. | |
dc.description.degree | Undergraduate | |
dc.identifier.cd | 600000055 | |
dc.identifier.print-thesis | To be assigned | |
dc.identifier.uri | https://repository.northsouth.edu/handle/123456789/631 | |
dc.language.iso | en_US | |
dc.publisher | North South University | |
dc.rights | © NSU Library | |
dc.subject | TECHNOLOGY::Electrical engineering, electronics and photonics::Electrical engineering | |
dc.title | Generate A Textual Description Based on An Image | |
dc.type | Project | |
oaire.citation.endPage | 30 | |
oaire.citation.startPage | 1 | |
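
The decoding step described in the abstract (an LSTM turning the encoded image into a caption) can be illustrated with a minimal sketch. The record does not say how words are selected at each step, so greedy decoding is an assumption here. The `startseq`/`endseq` boundary tokens, the function names, and the `toy_decoder` lookup table (a stand-in for a trained LSTM conditioned on VGG16 features) are all hypothetical and are not taken from the project.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical boundary tokens; the project's actual vocabulary is not given.
START, END = "startseq", "endseq"


def greedy_caption(
    features: Sequence[float],
    next_word_probs: Callable[[Sequence[float], List[str]], Dict[str, float]],
    max_len: int = 20,
) -> str:
    """Build a caption by repeatedly taking the most probable next word."""
    words = [START]
    for _ in range(max_len):
        probs = next_word_probs(features, words)
        best = max(probs, key=probs.get)  # greedy choice (an assumption)
        if best == END:
            break
        words.append(best)
    return " ".join(words[1:])  # drop the start token


def toy_decoder(features: Sequence[float], words: List[str]) -> Dict[str, float]:
    """Stand-in for a trained decoder: a fixed table keyed on the previous word.

    A real decoder would also condition on `features`; this toy ignores them.
    """
    table = {
        START: {"a": 0.9, "dog": 0.1},
        "a": {"dog": 0.7, "a": 0.3},
        "dog": {"runs": 0.6, END: 0.4},
        "runs": {END: 0.8, "a": 0.2},
    }
    return table[words[-1]]


# The feature vector is a placeholder for the encoder's output.
caption = greedy_caption([0.1, 0.5], toy_decoder)
print(caption)
```

In a real system, `toy_decoder` would be replaced by a call to the trained model, and `max_len` caps the caption length in case the end token is never produced.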
Files
License bundle
- Name: license.txt
- Size: 1.93 KB
- Format: Item-specific license agreed to upon submission