Browsing by Author "1621535042"

Now showing 1 - 1 of 1
    Open Access
    Bangla Text to Speech With Emotion
    (North South University, 2022) Shaif Hossain Emon; Maruf Mustar Moon; Md Farhad Gazi; Md. Riazat Kabir Shuvo; Md Shahriar Karim; 1811603642; 1811009642; 1621760042; 1621535042
    Spoken language technology has improved considerably. Many text-to-speech models, such as Tacotron, Tacotron 2, Deep Voice, FastSpeech, and wavelet, are used for synthesizing speech. Tacotron 2 has a mean opinion score of 4.58, producing mostly human-like computer-generated speech. No work has yet been done on emotional Bangla text-to-speech. This paper proposes a transfer learning approach for generating emotional speech with the respective Tacotron 2 models. We have created a Bangla emotional text-to-speech web app that generates Bangla speech with a specific emotion from text or audio input. Initially, it produces speech for three emotions and for poems. Users choose the emotion they want in the speech, and their text is then sent to one of our Tacotron 2 models. We have three models for sad, neutral, and happy speech, and a fourth model for reading poems. To train the sad and happy models, we built our own dataset.

NSU IR. All rights reserved. © 2025 Powered by NSU Library