Skip to main content

Week 2 - MBA 6601 - AI Voice Generator Tool

There are many reasons someone might use AI to read aloud text. Whether it be for convenience, to assist with comprehension, or for use in educational and professional environments, this tool can be incredibly helpful in a wide range of settings. With that being said, I thought it would be interesting to investigate play.ht. This is an AI powered text to voice generator tool that creates realistic audio. The program uses an online voice generator along with synthetic voices. It has a wide range of AI voices, speech styles, pronunciations, and features that users can choose from.




The above image depicts the homepage I was brought to after creating my free account on play.ht. This page contains many resources including access to current projects, audio tools, and voiceover samples. From here, I decided to go straight into creating my audio project. I had the option of choosing between the "standard and premium" or "ulta realistic" voices. I tried the realistic voices first where I simply inputted my desired text and chose a voice. The below image displays a few of many voices that I had the option of selecting. It varies by gender, accent, age, and style, among others. I was shocked by the wide range of options and learned that the program can create speech in 142 languages and accents. 




After inputting my text and selecting the voice, I very easily generated audio which I have inserted below. Instead of simply attaching an audio file, I was able to import and connect it with a video. I think this is an especially important feature as there are many situations where audio needs to be connected with videos. Having this tool directly in the platform is very useful as users will not have to use a third-party application to link the files.




After reviewing the final audio and video, I noticed that the AI mispronounced "AI." It pronounced it correctly the first time but as "A" the second time. I was surprised by this as it said it correctly the first time, so I was confused why it later mispronounced the word. This made me think of a notice I received when initially creating the video which states, "Each sample is unique. You can ‘Re-Generate Previews’ to generate multiple samples and select the one you prefer." I believe this helps explain why the program mispronounced AI the second time. Clearly, each sample is unique, and every time it says a word it may not always sound the same.

After reviewing the audio and further investigating the tools available, I do have to say I am pleasantly surprised with the program. I have only really used text to voice in Word and Google Translate, so I was not aware of how complex this AI can get. I learned that play.ht is being used by well-known sources including Harvard University and Product Hunt. It is pretty neat that I can use this tool alongside popular entity's like those mentioned. Overall, I was happy to investiage and learn more about this AI. I am looking forward to seeing how it continues to grow and evolve over time.

Comments

Popular posts from this blog

Week 5 - MBA 6601 - Midjourney for Creating AI Images

I recently discovered an AI-based image generator by Midjourney. As a relatively artistic person, I quickly became interested in exploring this tool's capabilities. What initally sparked my interest was the art I saw posted by Generative AI on LinkedIn. The page shares images that users had created in Midjourney including a chonky Chipmunk and "what-if" scenario if Robert Pattinson and Kristen Stewart had a family. It's funny that such different photos could have one thing in common, that is, artificial intelligence! AI can clearly make a wide range (if not limitless) style and type of art. Not only that, but the tool can create something factitious that appears to be real. As it continues improving, I am sure the AI-based images will fool many into thinking it produced an authentic image. According to its website,  Midjourney is "an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species." You can...

Week 8 - MBA 6601 - TikTok and AI

One thing I gave up over this past Lent was watching TikTok videos. I had the app for a while but found myself watching its content more and more often. It became an addiction. It is such a powerful tool, and I know the program displayed personalized content for me in hopes it would keep me watching. That is why I decided to take some time away from it - so I could focus on other things in my life instead of just watching videos. Surprisingly, after giving up TikTok over Lent I never went back to it. And I haven't thought about it much since. However, that doesn't mean it hasn't popped up again in my life. Recently, I came across a couple interesting articles about TikTok and AI. As with many other companies, TikTok is testing AI tools and technologies. Check out this article by The Verge  on the new TikTok "Tako" that is still in the testing phase. The idea is users can ask it to provide recommendations on what to watch, for example. As mentioned in the article, ...

Week 3 - MBA 6601 - Hootsuite's OwlyWriter AI

I currently work in the marketing department for a small company. One of my responsibilities is creating content for our social media pages. This includes writing posts for projects, upcoming events and webinars, reposts, holidays, and anything else relevant to our industry. I have familiarized myself with the company's writing style, such as how we format posts for sharing projects and our professionalism when creating holiday posts. Recently, I discovered that I can use AI to assist in the creation of social media posts. I no longer have to come up with the content by myself each time. Rather, I can use AI to generate ideas and provide me with inspiration. I utilize Hootsuite  for creating and scheduling social media content. The company recently devloped a new tool, OwlyWriter AI, that assists in the process of creating social media content. There are options for repurposing posts, getting inspiration, and generating captions, among others. It is easy to select a tool from OwlyW...